The joint allele frequency spectrum of multiple populations: a coalescent theory approach

Theor Popul Biol. 2012 Mar;81(2):179-95. doi: 10.1016/j.tpb.2011.11.004. Epub 2011 Dec 2.


The allele frequency spectrum is a series of statistics that describe genetic polymorphism, and is commonly used for inferring population genetic parameters and detecting natural selection. Population genetic theory on the allele frequency spectrum for a single population has been well studied using both coalescent theory and diffusion equations. Recently, the theory was extended to the joint allele frequency spectrum (JAFS) for three populations using diffusion equations and was shown to be very useful in inferring human demographic history. In this paper, I show that the JAFS can be analytically derived with coalescent theory for a basic model of two isolated populations and then extended to multiple populations and various complex scenarios, such as those involving population growth and bottleneck, migration, and positive selection. Simulation study is used to demonstrate the accuracy and applicability of the theoretical model. The coalescent theory-based approach for the JAFS can characterize the demographic history with comprehensive statistical models as the diffusion approach does, and in addition gains several novel advantages: the computational complexity of calculating the JAFS with coalescent theory is reduced, and thus it is feasible to analytically obtain the JAFS for multiple populations; the hitchhiking effect can be efficiently modeled in coalescent theory, enabling the development of methodologies for detecting selection via multi-population polymorphism data. As an alternative to the diffusion approximation approach, the coalescent theory for the JAFS also provides a foundation for population genetic inference with the advent of large-scale genomic polymorphism data.

Publication types

  • Comparative Study

MeSH terms

  • Alleles
  • Computational Biology
  • Gene Frequency / genetics*
  • Genetics, Population*
  • Humans
  • Likelihood Functions
  • Models, Genetic*
  • Models, Theoretical
  • Poisson Distribution
  • Polymorphism, Genetic / genetics*