Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Dec;28(12):3355-65.
doi: 10.1093/molbev/msr170. Epub 2011 Jun 24.

Purifying selection can obscure the ancient age of viral lineages

Affiliations

Purifying selection can obscure the ancient age of viral lineages

Joel O Wertheim et al. Mol Biol Evol. 2011 Dec.

Abstract

Statistical methods for molecular dating of viral origins have been used extensively to infer the time of most common recent ancestor for many rapidly evolving pathogens. However, there are a number of cases, in which epidemiological, historical, or genomic evidence suggests much older viral origins than those obtained via molecular dating. We demonstrate how pervasive purifying selection can mask the ancient origins of recently sampled pathogens, in part due to the inability of nucleotide-based substitution models to properly account for complex patterns of spatial and temporal variability in selective pressures. We use codon-based substitution models to infer the length of branches in viral phylogenies; these models produce estimates that are often considerably longer than those obtained with traditional nucleotide-based substitution models. Correcting the apparent underestimation of branch lengths suggests substantially older origins for measles, Ebola, and avian influenza viruses. This work helps to reconcile some of the inconsistencies between molecular dating and other types of evidence concerning the age of viral lineages.

PubMed Disclaimer

Figures

F<sc>IG</sc>. 1.
FIG. 1.
Lengths of single branches simulated under a codon substitution model with variable selection pressures are underestimated when inferred under a GTR + Γ4 nucleotide substitution model. All quantities are means from 10,000 replicates. (A) Ancestral sequence (1,000 codons) was random (i.e., all codons equiprobable) with varied proportion of sites under strong purifying selection (βs=0) and sites evolving neutrally (αs=βs). (BD) Branch lengths were inferred using GTR + Γ4 on single branches simulated under neutral selection regimes (αs=βs) and empirical site-by-site substitution rate (IFEL) profiles for MeV/RPV/PPRV, EBOV, and AIV. Black crosses represent the degree of bias, defined as the ratio between the true branch length and the one inferred under GTR + Γ4 on IFEL sequences. Horizontal gray lines show saturation asymptotes and diagonal dashed line depicts the behavior of an unbiased estimator.
F<sc>IG</sc>. 2.
FIG. 2.
Branch lengths inferred under a Dual model provide reliable estimates of branches simulated under variable selection pressures (i.e., empirical IFEL profiles) for MeV/RPV/PPRV, EBOV, and AIV. The diagonal dashed lines depict the behavior of an unbiased estimator.
F<sc>IG</sc>. 3.
FIG. 3.
Long branches are disproportionately affected by evolutionary models that differ in their treatment of rate variation. Each datapoint represents the length of a single branch of the MeV/RPV/PPRV, EBOV, or AIV phylogeny inferred under GTR + Γ4 and an alternate evolutionary model. The extreme of the y axis represents infinite branch lengths under the Lineage+Dual model for EBOV and AIV. Dashed lines are an x=y reference.
F<sc>IG</sc>. 4.
FIG. 4.
MCC phylogeny for MeV, RPV, and PPRV. Branch lengths were optimized under (A) GTR + Γ4 and (B) Lineage+Dual models. The Lineage+Dual model branch lengths were estimated assuming two different sets of synonymous and nonsynonymous substitution rates: one for short branches and another for long internal branches. Both trees are shown on the same scale.

Similar articles

Cited by

References

    1. Baron MD, Kamata Y, Barras V, Goatley L, Barrett T. The genome sequence of the virulent Kabete `O' strain of rinderpest virus: comparison with the derived vaccine. J Gen Virol. 1996;77(Pt 12):3041–3046. - PubMed
    1. Beja-Pereira A, Caramelli D, Lalueza-Fox C, et al. (27 co-authors. The origin of European cattle: evidence from modern and ancient DNA. Proc Natl Acad Sci U S A. 2006;103:8113–8118. - PMC - PubMed
    1. Belshaw R, Gardner A, Rambaut A, Pybus OG. Pacing a small cage: mutation and RNA viruses. Trends Ecol Evol. 2008;23:188–193. - PMC - PubMed
    1. Belyi VA, Levine AJ, Skalka AM. multiple integrations of ancient bornavirus and ebolavirus/ marburgvirus sequences in vertebrate genomes. PLoS Pathog. 2010 Unexpected inheritance. 6:e1001030. - PMC - PubMed
    1. Black FL. Measles endemicity in insular populations: critical community size and its evolutionary implication. J Theor Biol. 1966;11:207–211. - PubMed

Publication types

LinkOut - more resources