Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Sep;207(1):281-295.
doi: 10.1534/genetics.117.300061. Epub 2017 Jul 27.

Recombination-Driven Genome Evolution and Stability of Bacterial Species

Affiliations

Recombination-Driven Genome Evolution and Stability of Bacterial Species

Purushottam D Dixit et al. Genetics. 2017 Sep.

Abstract

While bacteria divide clonally, horizontal gene transfer followed by homologous recombination is now recognized as an important contributor to their evolution. However, the details of how the competition between clonality and recombination shapes genome diversity remains poorly understood. Using a computational model, we find two principal regimes in bacterial evolution and identify two composite parameters that dictate the evolutionary fate of bacterial species. In the divergent regime, characterized by either a low recombination frequency or strict barriers to recombination, cohesion due to recombination is not sufficient to overcome the mutational drift. As a consequence, the divergence between pairs of genomes in the population steadily increases in the course of their evolution. The species lacks genetic coherence with sexually isolated clonal subpopulations continuously formed and dissolved. In contrast, in the metastable regime, characterized by a high recombination frequency combined with low barriers to recombination, genomes continuously recombine with the rest of the population. The population remains genetically cohesive and temporally stable. Notably, the transition between these two regimes can be affected by relatively small changes in evolutionary parameters. Using the Multi Locus Sequence Typing (MLST) data, we classify a number of bacterial species to be either the divergent or the metastable type. Generalizations of our framework to include selection, ecologically structured populations, and horizontal gene transfer of nonhomologous regions are discussed as well.

Keywords: bacterial evolution; population genetics; recombination.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic of the computation models. (a) Illustration of the numerical model. Ne bacterial organisms evolve together, we show only one pair of strains. Point mutations (red circles) occur at a fixed rate μ per base pair generation and genetic fragments of length ltr are transferred between organisms at a rate ρ per base pair per generation. (b) The schematics of the three models of recombination. In model (1), recombining stretches have fixed end points. As a result, different recombination tracks do not overlap. In models (2) and (3), the recombining stretches have variable end points and as a result different recombination tracks can potentially overlap with each other.
Figure 2
Figure 2
Three possible outcomes of gene transfer that change the divergence δ. XD, YD, XY, and XYD are the most recent common ancestors of the strains. The divergence δb before transfer and δa after transfer are shown in red and blue, respectively.
Figure 3
Figure 3
Stochastic evolution of local divergence. A typical evolutionary trajectory of the local divergence δ(t) within a single gene between a pair of strains. We have used μ=105, ρ=5×106 per base pair per generation, θ=1.5%, and δTE=1%. Red tracks indicate the divergence increasing linearly, at a rate 2μ per base pair generation, with time due to mutational drift. Green tracks indicate recombination events that suddenly increase the divergence and blue tracks indicate recombination events that suddenly decrease the divergence. Eventually, the divergence increases sufficiently and the local genomic region escapes the pull of recombination (red stretch at the right).
Figure 4
Figure 4
Stochastic evolution of genome-wide divergence. Genome-wide divergence Δ(t) as a function of time at θ/δTE=0.25. We have used δTE=1%, θ=0.25%, μ=2×106 per base pair per generation and ρ=2×108,2×107,105, and 2×105 per base pair per generation corresponding to δmut/δTE=2,0.2,0.04, and 2×103, respectively. The dashed black lines represent the ensemble average Δ(t). See Figure A1 in the Appendix for the evolution of Δ(t) over a longer timescale.
Figure 5
Figure 5
Quantifying metastability in genome evolution. The number of generations tdiv (in units of the population size Ne) required for a pair of genomes to diverge well beyond the average intrapopulation diversity (see main text). We calculate the time it takes for the ensemble average Δ(t) of the genome-wide average divergence to reach 2θ as a function of θ/δTE and δmut/δTE. We used δTE=1%, μ=2×106 per base pair generation. In our simulations, we varied ρ and θ to scan the (θ/δTE,δmut/δTE) space. The green diamonds represent four populations shown in Figure 4 and Figure 6 (see below).
Figure 6
Figure 6
Distribution of genome-wide divergences in a population. Distribution of all pairwise genome-wide divergences δij in a coevolving population for decreasing values of δmut/δTE: 2 in (a), 0.2 in (b), 0.04 in (c), and 0.002 in (d) In all four panels, dashed black lines represent time-averaged distributions π(Δ)¯, while solid lines represent typical snapshot distributions π(Δ) in a single population. Colors of solid lines match those in Figure 4 for the same values of parameters. Time-averaged and snapshot distributions were estimated by sampling 5×105 pairwise coalescent times from the time-averaged coalescent distribution pet/Ne and the instantaneous coalescent distribution pc(t) correspondingly (see text for details).
Figure 7
Figure 7
Comparison of different models of recombination. (a) The ensemble average Δ(t) of pairwise genome-wide divergence Δ(t) as a function of the pairwise coalescent time t in explicit simulations. Model (1) simulations have nonoverlapping transfers of segments of length is 5000 bp. Model (2) simulations have transfers of overlapping 5000-bp segments. Model (3) simulations have overlapping transfer of segments of average length 5000 bp. The value of δmut/δTE are on the right side. (b) The ensemble average distribution of genome-wide divergence between pairs of strains π¯(Δ) for the three models of recombination shown in Figure 1a when δmut/δTE=0.002.
Figure 8
Figure 8
Classifying real bacteria as metastable or divergent. Approximate position of several real-life bacterial spaces on the metastable-divergent phase diagram (see text for details). Abbreviations of species names are as follows: BC, Bacillus cereus; EC, Escherichia coli; EF, Enterococcus faecium; FP, Flavobacterium psychrophilum; HI, Haemophilus influenzae; HP1, Helicobacter pylori; HP2, Haemophilus parasuis; SE, Salmonella enterica; SP1, Streptococcus pneumoniae; SP2, Streptococcus pyogenes; VP, Vibrio parahaemolyticus; VV, Vibrio vulnificus.
Figure A1
Figure A1
Genome-wide divergence Δ(t) as a function of time at θ/δTE=0.25. We have used δTE=1%, θ=0.25%, μ=2×106 per base pair per generation and ρ=2×108,2×107,105, and 2×105 per base pair per generation corresponding to δmut/δTE=2,0.2,0.04, and 2×103 respectively. The dashed black lines represent the ensemble average Δ(t). The cyan lines show the time it takes for the ensemble-averaged genomic divergence Δ(t) to reach 2θ when δmut/δTE=0.04 (pink line).
Figure A2
Figure A2
The number of generations tdiv (in units of the population size Ne) required for a pair of genomes to diverge well beyond the average intrapopulation diversity. We calculate the time it takes for the ensemble average of the genome-wide average divergence to reach 2θ as a function of θ/δTE and δmut/δTE. We randomly sample θ (between 0.5 and 3%), δTE (between 0.5 and 5%), and ρ (between 2×107 and 2×105 per base pair per generation) while keeping the mutation rate constant at μ=2×105 per base pair generation.

Similar articles

Cited by

References

    1. Cooper T. F., 2007. Recombination speeds adaptation by reducing competition between beneficial mutations in populations of Escherichia coli. PLoS Biol. 5: e225. - PMC - PubMed
    1. Didelot X., Méric G., Falush D., Darling A. E., 2012. Impact of homologous and non-homologous recombination in the genomic evolution of Escherichia coli. BMC Genomics 13: 1. - PMC - PubMed
    1. Dixit P. D., Pang T. Y., Studier F. W., Maslov S., 2015. Recombinant transfer in the basic genome of Escherichia coli. Proc. Natl. Acad. Sci. USA 112: 9070–9075. - PMC - PubMed
    1. Doolittle W. F., 2012. Population genomics: how bacterial species form and why they don’t exist. Curr. Biol. 22: R451–R453. - PubMed
    1. Doroghazi J. R., Buckley D. H., 2011. A model for the effect of homologous recombination on microbial diversification. Genome Biol. Evol. 3: 1349. - PMC - PubMed