Indel-based evolutionary distance and mouse-human divergence

Genome Res. 2004 Aug;14(8):1610-6. doi: 10.1101/gr.2450504.

Abstract

We propose a method for estimating the evolutionary distance between DNA sequences in terms of insertions and deletions (indels), defined as the per site number of indels accumulated in the course of divergence of the two sequences. We derive a maximal likelihood estimate of this distance from differences between lengths of orthologous introns or other segments of sequences delimited by conservative markers. When indels accumulate, lengths of orthologous introns diverge only slightly slower than linearly, because long indels occur with substantial frequencies. Thus, saturation is not a major obstacle for estimating indel-based evolutionary distance. For introns of medium lengths, our method recovers the known evolutionary distance between rat and mouse, 0.014 indels per site, with good precision. We estimate that mouse-human divergence exceeds rat-mouse divergence by a factor of 4, so that mouse-human evolutionary distance in terms of selectively neutral indels is 0.056. Because in mammals, indels are approximately 14 times less frequent than nucleotide substitutions, mouse-human evolutionary distance in terms of selectively neutral substitutions is approximately 0.8.

Publication types

  • Comparative Study

MeSH terms

  • Animals
  • Base Sequence
  • Evolution, Molecular*
  • Genetic Variation*
  • Humans
  • Introns
  • Likelihood Functions
  • Mice
  • Models, Genetic
  • Probability
  • Rats
  • Sequence Deletion*