Meta-analysis of genome-wide association studies with overlapping subjects

Am J Hum Genet. 2009 Dec;85(6):862-72. doi: 10.1016/j.ajhg.2009.11.001.


Data from multiple genome-wide association studies are often analyzed together for the purposes of combining information from several studies of the same disease or comparing results across different disorders. We provide a valid and efficient approach to such meta-analysis, allowing for overlapping study subjects. The available data may contain individual participant records or only meta-analytic summary results. Simulation studies demonstrate that failure to account for overlapping subjects can greatly inflate type I error when combining results from multiple studies of the same disease and can drastically reduce power when comparing results across different disorders. In addition, the proposed approach can be substantially more powerful than the simple approach of splitting the overlapping subjects among studies, especially for comparing results across different disorders. The advantages of the new approach are illustrated with empirical data from two sets of genome-wide association studies.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Alleles
  • Arthritis, Rheumatoid / genetics
  • Case-Control Studies
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Diabetes Mellitus, Type 1 / genetics
  • Gene Frequency
  • Genome-Wide Association Study*
  • Genotype
  • Humans
  • Meta-Analysis as Topic*
  • Models, Statistical
  • Odds Ratio
  • Reproducibility of Results
  • Schizophrenia / genetics