Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Feb 10;18(1):151.
doi: 10.1186/s12864-017-3543-7.

The nucleotide composition of microbial genomes indicates differential patterns of selection on core and accessory genomes

Affiliations

The nucleotide composition of microbial genomes indicates differential patterns of selection on core and accessory genomes

Jon Bohlin et al. BMC Genomics. .

Abstract

Background: The core genome consists of genes shared by the vast majority of a species and is therefore assumed to have been subjected to substantially stronger purifying selection than the more mobile elements of the genome, also known as the accessory genome. Here we examine intragenic base composition differences in core genomes and corresponding accessory genomes in 36 species, represented by the genomes of 731 bacterial strains, to assess the impact of selective forces on base composition in microbes. We also explore, in turn, how these results compare with findings for whole genome intragenic regions.

Results: We found that GC content in coding regions is significantly higher in core genomes than accessory genomes and whole genomes. Likewise, GC content variation within coding regions was significantly lower in core genomes than in accessory genomes and whole genomes. Relative entropy in coding regions, measured as the difference between observed and expected trinucleotide frequencies estimated from mononucleotide frequencies, was significantly higher in the core genomes than in accessory and whole genomes. Relative entropy was positively associated with coding region GC content within the accessory genomes, but not within the corresponding coding regions of core or whole genomes.

Conclusion: The higher intragenic GC content and relative entropy, as well as the lower GC content variation, observed in the core genomes is most likely associated with selective constraints. It is unclear whether the positive association between GC content and relative entropy in the more mobile accessory genomes constitutes signatures of selection or selective neutral processes.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
%GC and phyla. The 16S based phylogenetic tree demonstrates how % GC varies at the phylum, genus and species levels for all microbes included in the study
Fig. 2
Fig. 2
Genomic GC content. The box plot indicates how GC content (vertical axis) is distributed in the core, accessory and whole genome regions (horizontal axis) of all strains analyzed in the present study, colored according to species as indicated by the legend to the right
Fig. 3
Fig. 3
Differences between core genome and accessory genome %GC. The figure shows log2-transformed GC content differences (vertical axis) between core and accessory genomes for all species considered in the present study (legends to the right), with respect to their corresponding genome-wide GC content (vertical axis)
Fig. 4
Fig. 4
Relative entropy. The box-plot demonstrates how relative entropy (horizontal axis) is distributed genome-wide and among accessory and core genomes (horizontal axis). All strains are colored according to the species they belong to, as designated by the legend to the right
Fig. 5
Fig. 5
GCVAR. The figure shows a box-plot of how GCVAR (vertical axis) varies genome-wide and in the core and accessory genomes (horizontal axis) considered in the present study. All strains are colored according to the species specified by the legend to the right

Similar articles

Cited by

References

    1. Hildebrand F, Meyer A, Eyre-Walker A. Evidence of selection upon genomic GC-content in bacteria. PLoS Genet. 2010;6(9):e1001107. doi: 10.1371/journal.pgen.1001107. - DOI - PMC - PubMed
    1. Bohlin J, Snipen L, Hardy SP, Kristoffersen AB, Lagesen K, Donsvik T, Skjerve E, Ussery DW. Analysis of intra-genomic GC content homogeneity within prokaryotes. BMC Genomics. 2010;11(1):464. doi: 10.1186/1471-2164-11-464. - DOI - PMC - PubMed
    1. Mann S, Chen YP. Bacterial genomic G + C composition-eliciting environmental adaptation. Genomics. 2010;95(1):7–15. doi: 10.1016/j.ygeno.2009.09.002. - DOI - PubMed
    1. Mitchell D. GC content and genome length in Chargaff compliant genomes. Biochem Biophys Res Commun. 2007;353(0006–291; 1):207–210. doi: 10.1016/j.bbrc.2006.12.008. - DOI - PubMed
    1. Bohlin J, Brynildsrud OB, Sekse C, Snipen L. An evolutionary analysis of genome expansion and pathogenicity in Escherichia coli. BMC Genomics. 2014;15:882. doi: 10.1186/1471-2164-15-882. - DOI - PMC - PubMed

Publication types

LinkOut - more resources