Gene frequency distributions reject a neutral model of genome evolution
- PMID: 23315380
- PMCID: PMC3595032
- DOI: 10.1093/gbe/evt002
Gene frequency distributions reject a neutral model of genome evolution
Abstract
Evolution of prokaryotes involves extensive loss and gain of genes, which lead to substantial differences in the gene repertoires even among closely related organisms. Through a wide range of phylogenetic depths, gene frequency distributions in prokaryotic pangenomes bear a characteristic, asymmetrical U-shape, with a core of (nearly) universal genes, a "shell" of moderately common genes, and a "cloud" of rare genes. We employ mathematical modeling to investigate evolutionary processes that might underlie this universal pattern. Gene frequency distributions for almost 400 groups of 10 bacterial or archaeal species each over a broad range of evolutionary distances were fit to steady-state, infinite allele models based on the distribution of gene replacement rates and the phylogenetic tree relating the species in each group. The fits of the theoretical frequency distributions to the empirical ones yield model parameters and estimates of the goodness of fit. Using the Akaike Information Criterion, we show that the neutral model of genome evolution, with the same replacement rate for all genes, can be confidently rejected. Of the three tested models with purifying selection, the one in which the distribution of replacement rates is derived from a stochastic population model with additive per-gene fitness yields the best fits to the data. The selection strength estimated from the fits declines with evolutionary divergence while staying well outside the neutral regime. These findings indicate that, unlike some other universal distributions of genomic variables, for example, the distribution of paralogous gene family membership, the gene frequency distribution is substantially affected by selection.
Figures
Similar articles
-
A neutral theory of genome evolution and the frequency distribution of genes.BMC Genomics. 2012 May 21;13:196. doi: 10.1186/1471-2164-13-196. BMC Genomics. 2012. PMID: 22613814 Free PMC article.
-
Stability along with extreme variability in core genome evolution.Genome Biol Evol. 2013;5(7):1393-402. doi: 10.1093/gbe/evt098. Genome Biol Evol. 2013. PMID: 23821522 Free PMC article.
-
Assessment of assumptions underlying models of prokaryotic pangenome evolution.BMC Biol. 2021 Feb 10;19(1):27. doi: 10.1186/s12915-021-00960-2. BMC Biol. 2021. PMID: 33563283 Free PMC article.
-
The Turbulent Network Dynamics of Microbial Evolution and the Statistical Tree of Life.J Mol Evol. 2015 Jun;80(5-6):244-50. doi: 10.1007/s00239-015-9679-7. Epub 2015 Apr 18. J Mol Evol. 2015. PMID: 25894542 Free PMC article. Review.
-
Mechanisms That Shape Microbial Pangenomes.Trends Microbiol. 2021 Jun;29(6):493-503. doi: 10.1016/j.tim.2020.12.004. Epub 2021 Jan 8. Trends Microbiol. 2021. PMID: 33423895 Review.
Cited by
-
Nutrition or nature: using elementary flux modes to disentangle the complex forces shaping prokaryote pan-genomes.BMC Ecol Evol. 2022 Aug 16;22(1):101. doi: 10.1186/s12862-022-02052-3. BMC Ecol Evol. 2022. PMID: 35974327 Free PMC article.
-
Selection on horizontally transferred and duplicated genes in sinorhizobium (ensifer), the root-nodule symbionts of medicago.Genome Biol Evol. 2014 May 6;6(5):1199-209. doi: 10.1093/gbe/evu090. Genome Biol Evol. 2014. PMID: 24803571 Free PMC article.
-
Endosymbiotic origin and differential loss of eukaryotic genes.Nature. 2015 Aug 27;524(7566):427-32. doi: 10.1038/nature14963. Epub 2015 Aug 19. Nature. 2015. PMID: 26287458
-
Recombination produces coherent bacterial species clusters in both core and accessory genomes.Microb Genom. 2015 Nov 5;1(5):e000038. doi: 10.1099/mgen.0.000038. eCollection 2015 Nov. Microb Genom. 2015. PMID: 28348822 Free PMC article.
-
MicroScope-an integrated resource for community expertise of gene functions and comparative analysis of microbial genomic and metabolic data.Brief Bioinform. 2019 Jul 19;20(4):1071-1084. doi: 10.1093/bib/bbx113. Brief Bioinform. 2019. PMID: 28968784 Free PMC article.
References
-
- Akaike H. New look at statistical-model identification. IEEE Trans Automat Control. AC. 1974;19(6):716–723.
-
- Baumdicker F, Hess WR, Pfaffelhuber P. The diversity of a distributed genome in bacterial populations. Ann Appl Probab. 2010;20(5):1567–1606.
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
