Linear mixed model selection for false discovery rate control in microarray data analysis

Biometrics. 2010 Jun;66(2):621-9. doi: 10.1111/j.1541-0420.2009.01286.x. Epub 2009 Jun 12.


In a microarray experiment, one experimental design is used to obtain expression measures for all genes. One popular analysis method involves fitting the same linear mixed model for each gene, obtaining gene-specific p-values for tests of interest involving fixed effects, and then choosing a threshold for significance that is intended to control false discovery rate (FDR) at a desired level. When one or more random factors have zero variance components for some genes, the standard practice of fitting the same full linear mixed model for all genes can result in failure to control FDR. We propose a new method that combines results from the fit of full and selected linear mixed models to identify differentially expressed genes and provide FDR control at target levels when the true underlying random effects structure varies across genes.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Artifacts*
  • Gene Expression Profiling / statistics & numerical data
  • Genes
  • Linear Models*
  • Oligonucleotide Array Sequence Analysis / methods
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*