Spoiling the whole bunch: quality control aimed at preserving the integrity of high-throughput genotyping

Am J Hum Genet. 2010 Jul 9;87(1):123-8. doi: 10.1016/j.ajhg.2010.06.005.


False-positive or false-negative results attributable to undetected genotyping errors and confounding factors present a constant challenge for genome-wide association studies (GWAS) given the low signals associated with complex phenotypes and the noise associated with high-throughput genotyping. In the context of the genetics of kidneys in diabetes (GoKinD) study, we identify a source of error in genotype calling and demonstrate that a standard battery of quality-control (QC) measures is not sufficient to detect and/or correct it. We show that, if genotyping and calling are done by plate (batch), even a few DNA samples of marginally acceptable quality can profoundly alter the allele calls for other samples on the plate. In turn, this leads to significant differential bias in estimates of allele frequency between plates and, potentially, to false-positive associations, particularly when case and control samples are not sufficiently randomized to plates. This problem may become widespread as investigators tap into existing public databases for GWAS control samples. We describe how to detect and correct this bias by utilizing additional sources of information, including raw signal-intensity data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Diabetes Complications / genetics*
  • Diabetes Mellitus, Type 1 / genetics*
  • Diabetic Nephropathies / genetics
  • Genome-Wide Association Study / standards*
  • Genotype
  • Humans
  • Polymorphism, Single Nucleotide
  • Quality Control