Demographic inference from multiple whole genomes using a particle filter for continuous Markov jump processes

PLoS One. 2021 Mar 2;16(3):e0247647. doi: 10.1371/journal.pone.0247647. eCollection 2021.

Abstract

Demographic events shape a population's genetic diversity, a process described by the coalescent-with-recombination model that relates demography and genetics by an unobserved sequence of genealogies along the genome. As the space of genealogies over genomes is large and complex, inference under this model is challenging. Formulating the coalescent-with-recombination model as a continuous-time and -space Markov jump process, we develop a particle filter for such processes, and use waypoints that under appropriate conditions allow the problem to be reduced to the discrete-time case. To improve inference, we generalise the Auxiliary Particle Filter for discrete-time models, and use Variational Bayes to model the uncertainty in parameter estimates for rare events, avoiding biases seen with Expectation Maximization. Using real and simulated genomes, we show that past population sizes can be accurately inferred over a larger range of epochs than was previously possible, opening the possibility of jointly analyzing multiple genomes under complex demographic models. Code is available at https://github.com/luntergroup/smcsmc.

Publication types

  • Historical Article
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Asian People
  • Bayes Theorem
  • Computer Simulation
  • Demography / history*
  • Genetic Variation
  • Genetics, Population*
  • Genome, Human*
  • History, 21st Century
  • History, Ancient
  • History, Medieval
  • Humans
  • Markov Chains*
  • Models, Genetic*
  • Pedigree
  • Population Density
  • White People