A flexible two-stage procedure for identifying gene sets that are differentially expressed

Bioinformatics. 2009 Apr 15;25(8):1019-25. doi: 10.1093/bioinformatics/btp076. Epub 2009 Feb 11.


Motivation: Microarray data analysis has expanded from testing individual genes for differential expression to testing gene sets for differential expression. Tests at the gene set level may focus on multivariate expression changes or on the differential expression of at least one gene in the gene set. These tests may be powerful at detecting subtle changes in expression, but findings at the gene set level need to be examined further to understand whether they are informative and, if so, how.

Results: We propose a two-stage procedure: first test for differential expression at the gene set level, then test for differential expression of individual genes within the discovered gene sets. We introduce the overall false discovery rate (OFDR) as an appropriate error rate to control when testing multiple gene sets and genes. We illustrate the advantage of this procedure over procedures that test only gene sets or only individual genes.
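
The two-stage idea can be illustrated with a minimal sketch. The abstract does not spell out the algorithm, so everything below is an assumption: the Benjamini-Hochberg (BH) step-up procedure is used at both stages, and genes within each discovered set are tested at the reduced level R*q/m, where R is the number of discovered gene sets and m is the number of gene sets tested. The function names (benjamini_hochberg, two_stage) and the p-values are hypothetical and chosen only for illustration; this is not presented as the authors' implementation.

```python
def benjamini_hochberg(pvalues, level):
    """Return the (sorted) indices rejected by the BH step-up procedure."""
    m = len(pvalues)
    if m == 0:
        return []
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff = 0
    # Find the largest rank k with p_(k) <= k * level / m.
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank * level / m:
            cutoff = rank
    return sorted(order[:cutoff])


def two_stage(set_pvalues, gene_pvalues, q=0.05):
    """Stage 1: screen gene sets; stage 2: test genes within discovered sets.

    set_pvalues:  dict mapping gene set name -> set-level p-value
    gene_pvalues: dict mapping gene set name -> {gene name -> p-value}
    ASSUMPTION: BH at both stages, within-set level R * q / m.
    """
    names = list(set_pvalues)
    rejected = benjamini_hochberg([set_pvalues[n] for n in names], q)
    discovered = [names[i] for i in rejected]
    # Reduced level for stage 2 (assumed form, not stated in the abstract).
    within_level = len(discovered) * q / len(names) if names else 0.0
    results = {}
    for name in discovered:
        genes = list(gene_pvalues[name])
        hits = benjamini_hochberg(
            [gene_pvalues[name][g] for g in genes], within_level
        )
        results[name] = [genes[i] for i in hits]
    return results


# Invented p-values for four gene sets and the genes in two of them.
example_sets = {"A": 0.001, "B": 0.5, "C": 0.01, "D": 0.9}
example_genes = {
    "A": {"g1": 0.0005, "g2": 0.2, "g3": 0.01},
    "B": {},
    "C": {"g4": 0.03, "g5": 0.4},
    "D": {},
}
example_result = two_stage(example_sets, example_genes, q=0.05)
print(example_result)  # {'A': ['g1', 'g3'], 'C': []}
```

In this toy example, sets A and C pass the first stage, so genes are tested at level 2 * 0.05 / 4 = 0.025 within each. Set C is discovered at the set level but yields no individual genes, which is the kind of finding the Motivation paragraph says needs further examination.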

MeSH terms

  • Computer Simulation*
  • Gene Expression Profiling / methods*
  • Gene Expression*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / methods