On the dispersion index of a Markovian molecular clock

Math Biosci. 2001 Aug;172(2):115-28. doi: 10.1016/s0025-5564(01)00067-0.

Abstract

The number of nucleotide substitutions accumulated in a gene or in a lineage is an important random variable in the study of molecular evolution. Of particular interest is the ratio of the variance to the mean of that random variable, often known as the dispersion index. Because nucleotide substitution is most commonly modeled by a continuous-time four-state Markov chain, this paper provides a systematic method of computing the dispersion indices exhibited by a continuous-time four-state Markov chain. Using this method along with computer algebra and Monte Carlo simulation, this paper offers partially proven conjectures that were supported by thorough computer experiments. It is believed that the Tamura model, the equal-input model and the Takahata-Kimura model always exhibit dispersion indices less than 2. It is also believed that a general four-state model can be chosen to exhibit a dispersion index of any desired magnitude, although the chance of a randomly chosen such model exhibiting a dispersion index greater than 2 is as small as about 2%. Relevance of these findings to the neutral theory is discussed.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Computer Simulation
  • Evolution, Molecular*
  • Markov Chains*
  • Models, Genetic*
  • Monte Carlo Method