An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations

Nat Genet. 2012 Jun 17;44(7):825-30. doi: 10.1038/ng.2314.


Population structure causes genome-wide linkage disequilibrium between unlinked loci, leading to statistical confounding in genome-wide association studies. Mixed models have been shown to handle the confounding effects of a diffuse background of large numbers of loci of small effect well, but they do not always account for loci of larger effect. Here we propose a multi-locus mixed model as a general method for mapping complex traits in structured populations. Simulations suggest that our method outperforms existing methods in terms of power as well as false discovery rate. We apply our method to human and Arabidopsis thaliana data, identifying new associations and evidence for allelic heterogeneity. We also show how a priori knowledge from an A. thaliana linkage mapping study can be integrated into our method using a Bayesian approach. Our implementation is computationally efficient, making the analysis of large data sets (n > 10,000) practicable.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics
  • Bayes Theorem
  • Chromosome Mapping / methods
  • Genetic Loci*
  • Genome, Human*
  • Genome, Plant*
  • Genome-Wide Association Study / methods
  • Genotype
  • Humans
  • Linkage Disequilibrium
  • Models, Genetic*
  • Molecular Dynamics Simulation
  • Polymorphism, Single Nucleotide
  • Population Groups / genetics*
  • Quantitative Trait Loci