The distribution of waiting distances in ancestral recombination graphs

Theor Popul Biol. 2021 Oct:141:34-43. doi: 10.1016/j.tpb.2021.06.003. Epub 2021 Jun 26.

Abstract

The ancestral recombination graph (ARG) contains the full genealogical information of the sample, and many population genetic inference problems can be solved using inferred or sampled ARGs. In particular, the waiting distance between tree changes along the genome can be used to make inference about the distribution and evolution of recombination rates. To this end, we here derive an analytic expression for the distribution of waiting distances between tree changes under the sequentially Markovian coalescent model and obtain an accurate approximation to the distribution of waiting distances for topology changes. We use these results to show that some of the recently proposed methods for inferring sequences of trees along the genome provide strongly biased distributions of waiting distances. In addition, we provide a correction to an undercounting problem facing all available ARG inference methods, thereby facilitating the use of ARG inference methods to estimate temporal changes in the recombination rate.

Keywords: Ancestral recombination graph; Recombination evolution; Sequentially Markovian coalescent; Tree inference methods.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Genome
  • Markov Chains
  • Models, Genetic*
  • Phylogeny
  • Recombination, Genetic*