Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov 30;2(11):e000094.
doi: 10.1099/mgen.0.000094. eCollection 2016 Nov.

Genome-scale rates of evolutionary change in bacteria

Affiliations

Genome-scale rates of evolutionary change in bacteria

Sebastian Duchêne et al. Microb Genom. .

Abstract

Estimating the rates at which bacterial genomes evolve is critical to understanding major evolutionary and ecological processes such as disease emergence, long-term host-pathogen associations and short-term transmission patterns. The surge in bacterial genomic data sets provides a new opportunity to estimate these rates and reveal the factors that shape bacterial evolutionary dynamics. For many organisms estimates of evolutionary rate display an inverse association with the time-scale over which the data are sampled. However, this relationship remains unexplored in bacteria due to the difficulty in estimating genome-wide evolutionary rates, which are impacted by the extent of temporal structure in the data and the prevalence of recombination. We collected 36 whole genome sequence data sets from 16 species of bacterial pathogens to systematically estimate and compare their evolutionary rates and assess the extent of temporal structure in the absence of recombination. The majority (28/36) of data sets possessed sufficient clock-like structure to robustly estimate evolutionary rates. However, in some species reliable estimates were not possible even with 'ancient DNA' data sampled over many centuries, suggesting that they evolve very slowly or that they display extensive rate variation among lineages. The robustly estimated evolutionary rates spanned several orders of magnitude, from approximately 10-5 to 10-8 nucleotide substitutions per site year-1. This variation was negatively associated with sampling time, with this relationship best described by an exponential decay curve. To avoid potential estimation biases, such time-dependency should be considered when inferring evolutionary time-scales in bacteria.

Keywords: bacteria; evolution; molecular clock; phylogeny; substitution rates; time-dependency.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Regressions of the root-to-tip genetic distance (expected nucleotide substitutions per site) as a function of sampling time (year) for 35 bacterial data sets. Each point corresponds to an individual sampled genome (SNP sequence in the alignment), and the red dashed line is the linear regression using least squares; the R2 coefficients are also shown. Shading corresponds to the degree of temporal structure according to the date-randomization test in beast; blue indicates strong temporal structure, while orange and red indicate moderate and low temporal structure, respectively.
Fig. 2.
Fig. 2.
Bayesian estimates of genome-scale nucleotide substitution rates for all bacterial data sets. The axis for the nucleotide substitution rate is shown on log10 scale. Circular points represent the mean rate estimate, and error bars correspond to the 95 % HPD values. Colours indicate the degree of temporal structure according to the date-randomization test as indicated in Fig. 1. For comparison, the x symbol represents the point estimates using regression, while the asterisk (*) corresponds to estimates that were negative, and are thus not shown.
Fig. 3.
Fig. 3.
Estimates of genome-scale nucleotide substitution rates using a Bayesian method (beast) compared to those estimated via root-to-tip regression. The line represents y=x. Points that fall on the line correspond to data sets for which the mean Bayesian estimates closely match those from the regression. Points above and below the line are data sets for which the Bayesian estimate is higher or lower than that from the regression, respectively. The colour corresponds to the degree of temporal structure according to the date-randomization test as indicated in Fig. 1.
Fig. 4.
Fig. 4.
Estimates of genome-wide nucleotide substitution rates in human-associated bacterial pathogens as a function of sampling time in years. The axes are shown on a log10 scale. The colour corresponds to the degree of temporal structure according to the date-randomization test; blue indicates strong temporal structure and orange indicates moderate temporal structure. The dashed line corresponds to the linear regression, while the solid line corresponds to the decay curve (both fitted using only the points with strong and moderate temporal structure). The grey lines represent 100 bootstrap replicates of the decay curve, and thus represent the uncertainty in the decay function.

Similar articles

Cited by

References

    1. Baines S. L., Holt K. E., Schultz M. B., Seemann T., Howden B. O., Jensen S. O., van Hal S. J., Coombs G. W., Firth N., et al. (2015). Convergent adaptation in the dominant global hospital clone ST239 of methicillin-resistant Staphylococcus aureus. MBio 6e00080-15.10.1128/mBio.00080-15 - DOI - PMC - PubMed
    1. Bart M. J., Harris S. R., Advani A., Arakawa Y., Bottero D., Bouchez V., Cassiday P. K., Chiang C. S., Dalby T., et al. (2014). Global population structure and evolution of Bordetella pertussis and their relationship with vaccination. MBio 5e01074-14.10.1128/mBio.01074-14 - DOI - PMC - PubMed
    1. Biek R., Pybus O. G., Lloyd-Smith J. O., Didelot X.(2015). Measurably evolving pathogens in the genomic era. Trends Ecol Evol 30306–313.10.1016/j.tree.2015.03.009 - DOI - PMC - PubMed
    1. Boni M. F., Posada D., Feldman M. W.(2007). An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 1761035–1047.10.1534/genetics.106.068874 - DOI - PMC - PubMed
    1. Bos K. I., Harkins K. M., Herbig A., Coscolla M., Weber N., Comas I., Forrest S. A., Bryant J. M., Harris S. R., et al. (2014). Pre-Columbian mycobacterial genomes reveal seals as a source of new world human tuberculosis. Nature 514494–497.10.1038/nature13591 - DOI - PMC - PubMed

Data Bibliography

    1. All the data sets used here are available online at: http://zenodo.org/record/45951#.Vr1M-JN95E4.

Publication types

LinkOut - more resources