Inference on population histories by approximating infinite alleles diffusion

Mol Biol Evol. 2013 Feb;30(2):457-68. doi: 10.1093/molbev/mss227. Epub 2012 Sep 19.


Reconstruction of the past is an important task of evolutionary biology. It takes place at different points in a hierarchy of molecular variation, including genes, individuals, populations, and species. Statistical inference about population histories has recently received considerable attention, following the development of computational tools to provide tractable approaches to this very challenging problem. Here, we introduce a likelihood-based approach which generalizes a recently developed model for random fluctuations in allele frequencies based on an approximation to the neutral Wright-Fisher diffusion. Our new framework approximates the infinite alleles Wright-Fisher model and uses an implementation with an adaptive Markov chain Monte Carlo algorithm. The method is especially well suited to data sets harboring large population samples and relatively few loci for which other likelihood-based models are currently computationally intractable. Using our model, we reconstruct the global population history of a major human pathogen, Streptococcus pneumoniae. The results illustrate the potential to reach important biological insights to an evolutionary process by a population genetics approach, which can appropriately accommodate very large population samples.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alleles*
  • Biological Evolution
  • Computer Simulation
  • Gene Frequency*
  • Genetics, Population
  • Likelihood Functions
  • Models, Genetic*
  • Streptococcus pneumoniae / genetics