Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar;6(3):644-54.
doi: 10.1093/gbe/evu039.

Serial gene losses and foreign DNA underlie size and sequence variation in the plastid genomes of diatoms

Affiliations

Serial gene losses and foreign DNA underlie size and sequence variation in the plastid genomes of diatoms

Elizabeth C Ruck et al. Genome Biol Evol. 2014 Mar.

Abstract

Photosynthesis by diatoms accounts for roughly one-fifth of global primary production, but despite this, relatively little is known about their plastid genomes. We report the completely sequenced plastid genomes for eight phylogenetically diverse diatoms and show them to be variable in size, gene and foreign sequence content, and gene order. The genomes contain a core set of 122 protein-coding genes, with 15 additional genes exhibiting complex patterns of 1) gene losses at varying phylogenetic scales, 2) functional transfers to the nucleus, 3) gene duplication, divergence, and differential retention of paralogs, and 4) acquisitions of putatively functional recombinase genes from resident plasmids. The newly sequenced genomes also contain several previously unreported genes, highlighting how poorly characterized diatom plastid genomes are overall. Genome size variation reflects major expansions of the inverted repeat region in some cases but, more commonly, large-scale expansions of intergenic regions, many of which contain unique open reading frames of likely foreign origin. Although many gene clusters are conserved across species, rearrangements appear to be frequent in most lineages.

Keywords: chloroplast; diatoms; genomes; horizontal gene transfer; plastid.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—
Fig. 1.—
Sequence coverage by protein-coding, intergenic, and expanded (≥1 kb in length) intergenic regions in diatom plastid genomes. Bars are drawn proportional to the genome size and show the fraction of the genome occupied by these three sequence categories. Taxa in boldface identify genomes sequenced for this study. Phylogenetic relationships were redrawn from Theriot et al. (2010) and unpublished data. Taxa marked with a superscript “a” are dinoflagellates with diatom-derived plastids (Imanian et al. 2010).
F<sc>ig</sc>. 2.—
Fig. 2.—
Size variation of the inverted repeated region (IRa) across 16 fully sequenced plastid genomes in diatoms. Colored boxes circumscribe genes in various functional categories, with those above the line transcribed on the forward strand and vice versa for genes below the line. Maps are drawn to scale, and the gray box demarcates the core rns-trnI-trnA-rnl-rrn5 gene cluster conserved across the 16 genomes. Newly sequenced taxa from this study are in boldface and the nucleotide length of IRa are in parentheses beneath each taxon name. Double arrows delimit large putatively foreign sequence insertions.
F<sc>ig</sc>. 3.—
Fig. 3.—
Evolutionary patterns of pseudogenization, loss, and gain of genes in diatom plastid genomes. The matrix shows 15 genes variably present among the sequenced genomes. The presence of nuclear gene copies is almost certainly underreported for lack of nuclear genome data in most species. Taxa in boldface identify genomes sequenced for this study. Phylogenetic relationships were redrawn from Theriot et al. (2010) and unpublished data. Taxa marked with a superscript “a” are dinoflagellates with diatom-derived plastids (Imanian et al. 2010), and genes marked with a superscript “b” are of plasmid (serC) or unknown (tyrC) origin.
F<sc>ig</sc>. 4.—
Fig. 4.—
Gene phylogenies for acpP and tsf, both of which exhibit complex phylogenetic distributions across diatoms. Numbers are Bayesian posterior probability values. Genes located in the diatom plastid genome are marked with a circle, whereas genes in the nuclear genome are marked with a diamond. Taxa whose plastid genomes have multiple copies of the gene are boldfaced, with the −1 and −2 suffix denoting the different ortholog groups. Taxon marked with a superscript “a” is not the same Cylindrotheca strain sequenced as part of this study.
F<sc>ig</sc>. 5.—
Fig. 5.—
Plastid-genome alignments for three representative diatoms. One copy of the inverted repeat region was removed from each genome prior to aligning the genomes. Colored blocks indicate gene clusters with conserved gene order across the three taxa. Blocks below black center line are inverted relative to the Leptocylindrus reference genome. Two blocks (a and b) contain the largest conserved gene clusters, consisting of the RNA polymerase genes (a) or the large ribosomal operon in the small single copy region (b).

Similar articles

Cited by

References

    1. Adams KL, Palmer JD. Evolution of mitochondrial gene content: gene loss and transfer to the nucleus. Mol Phylogenet Evol. 2003;29:380–395. - PubMed
    1. Adams KL, et al. Intracellular gene transfer in action: dual transcription and multiple silencings of nuclear and mitochondrial cox2 genes in legumes. Proc Natl Acad Sci U S A. 1999;96:13863–13868. - PMC - PubMed
    1. Amin SA, Parker MS, Armbrust EV. Interactions between diatoms and bacteria. Microbiol Mol Biol Rev. 2012;76:667–684. - PMC - PubMed
    1. Archibald JM. The puzzle of plastid evolution. Curr Biol. 2009;19:R81–R88. - PubMed
    1. Baier M, Dietz KJ. The plant 2-Cys peroxiredoxin BAS1 is a nuclear-encoded chloroplast protein: its expressional regulation, phylogenetic origin, and implications for its specific physiological function in plants. Plant J. 1997;12:179–190. - PubMed

LinkOut - more resources