Optimal screening for promising genes in 2-stage designs

Biostatistics. 2008 Oct;9(4):700-14. doi: 10.1093/biostatistics/kxn002. Epub 2008 Mar 18.

Abstract

Detecting genetic markers with biologically relevant effects remains a challenge due to multiple testing. Standard analysis methods focus on evidence against the null and protect primarily the type I error. On the other hand, the worthwhile alternative is specified for power calculations at the design stage. The balanced test as proposed by Moerkerke and others (2006) and Moerkerke and Goetghebeur (2006) incorporates this alternative directly in the decision criterion to achieve better power. Genetic markers are selected and ranked in order of the balance of evidence they contain against the null and the target alternative. In this paper, we build on this guiding principle to develop 2-stage designs for screening genetic markers when the cost of measurements is high. For a given marker, a first sample may already provide sufficient evidence for or against the alternative. If not, more data are gathered at the second stage which is then followed by a binary decision based on all available data. By optimizing parameters which determine the decision process over the 2 stages (such as the area of the "gray" zone which leads to the gathering of extra data), the expected cost per marker can be reduced substantially. We also demonstrate that, compared to 1-stage designs, 2-stage designs achieve a better balance between true negatives and positives for the same cost.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Gene Expression Profiling / methods
  • Gene Expression Profiling / statistics & numerical data*
  • Genome-Wide Association Study / methods
  • Genome-Wide Association Study / statistics & numerical data*
  • Genotype
  • Humans
  • Models, Genetic*
  • Models, Statistical*
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Phenotype
  • Polymorphism, Single Nucleotide