Mathematical elegance with biochemical realism: the covarion model of molecular evolution

J Mol Evol. 2001 Dec;53(6):711-23. doi: 10.1007/s002390010258.

Abstract

There is an apparent paradox in our understanding of molecular evolution. Current biochemically based models predict that evolutionary trees should not be recoverable for divergences beyond a few hundred million years. In practice, however, trees often appear to be recovered from much older times. Mathematical models, such as those assuming that sites evolve at different rates [including a Gamma distribution of rates across sites (RAS)] may in theory allow the recovery of some ancient divergences. However, such models require that each site maintain its characteristic rate over the whole evolutionary period. This assumption, however, contradicts the knowledge that tertiary structures diverge with time, invalidating the rate-constancy assumption of purely mathematical models. We report here that a hidden Markov version of the covarion model can meet both biochemical and statistical requirements for the analysis of sequence data. The model was proposed on biochemical grounds and can be implemented with only two additional parameters. The two hidden parts of this model are the proportion of sites free to vary (covarions) and the rate of interchange between fixed sites and these variable sites. Simulation results are consistent with this approach, providing a better framework for understanding anciently diverged sequences than the standard RAS models. However, a Gamma distribution of rates may approximate a covarion model and may possibly be justified on these grounds. The accurate reconstruction of older divergences from sequence data is still a major problem, and molecular evolution still requires mathematical models that also have a sound biochemical basis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Evolution, Molecular*
  • Models, Biological*
  • Models, Chemical