A simple procedure for estimating the false discovery rate

Bioinformatics. 2005 Mar 1;21(5):660-8. doi: 10.1093/bioinformatics/bti063. Epub 2004 Oct 12.

Abstract

Motivation: The most used criterion in microarray data analysis is nowadays the false discovery rate (FDR). In the framework of estimating procedures based on the marginal distribution of the P-values without any assumption on gene expression changes, estimators of the FDR are necessarily conservatively biased. Indeed, only an upper bound estimate can be obtained for the key quantity pi0, which is the probability for a gene to be unmodified. In this paper, we propose a novel family of estimators for pi0 that allows the calculation of FDR.

Results: The very simple method for estimating pi0 called LBE (Location Based Estimator) is presented together with results on its variability. Simulation results indicate that the proposed estimator performs well in finite sample and has the best mean square error in most of the cases as compared with the procedures QVALUE, BUM and SPLOSH. The different procedures are then applied to real datasets.

Availability: The R function LBE is available at http://ifr69.vjf.inserm.fr/lbe

Contact: broet@vjf.inserm.fr.

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Algorithms*
  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / genetics
  • Breast Neoplasms / metabolism
  • Computer Simulation
  • Data Interpretation, Statistical
  • Diagnosis, Computer-Assisted / methods
  • False Positive Reactions*
  • Gene Expression Profiling / methods*
  • Humans
  • Leukemia / diagnosis
  • Leukemia / genetics
  • Leukemia / metabolism
  • Models, Genetic*
  • Models, Statistical
  • Neoplasm Proteins / metabolism
  • Oligonucleotide Array Sequence Analysis / methods*

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins