Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 May 5;44(8):3513-33.
doi: 10.1093/nar/gkw221. Epub 2016 Apr 8.

Transposons to toxins: the provenance, architecture and diversification of a widespread class of eukaryotic effectors

Affiliations

Transposons to toxins: the provenance, architecture and diversification of a widespread class of eukaryotic effectors

Dapeng Zhang et al. Nucleic Acids Res. .

Abstract

Enzymatic effectors targeting nucleic acids, proteins and other cellular components are the mainstay of conflicts across life forms. Using comparative genomics we identify a large class of eukaryotic proteins, which include effectors from oomycetes, fungi and other parasites. The majority of these proteins have a characteristic domain architecture with one of several N-terminal 'Header' domains, which are predicted to play a role in trafficking of these effectors, including a novel version of the Ubiquitin fold. The Headers are followed by one or more diverse C-terminal domains, such as restriction endonuclease (REase), protein kinase, HNH endonuclease, LK-nuclease (a RNase) and multiple distinct peptidase domains, which are predicted to carry their toxicity determinants. The most common types of these proteins appear to have originated from prokaryotic transposases (e.g. TN7 and Mu) and combine a CDC6/ORC1-STAND clade NTPase domain with a C-terminal REase domain. Other than the so-called Crinkler effectors of oomycetes and fungi, these effectors are encoded by other eukaryotic parasites such as trypanosomatids (the RHS proteins) and the rhizarian Plasmodiophora, and symbionts like Capsaspora Remarkably, we also find these proteins in free-living eukaryotes, including several viridiplantae, fungi, amoebozoans and animals. These versions might either still be transposons or function in other poorly understood eukaryote-specific inter-organismal and inter-genomic conflicts. These include the Medea1 selfish element of Tribolium that spreads via post-zygotic killing. We present a unified mechanism for the recombination-dependent diversification and action of this widespread class of molecular weaponry deployed across diverse conflicts ranging from parasitic to free-living forms.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) Domain architecture network of eukaryotic CR proteins. Domains linked in the same polypeptide are connected by arrows, with the arrow head pointing to the C-terminal domain. The arrow thickness reflects the number of associations found in the total dataset. (B–M) Representative domain architectures of CR proteins in different eukaryotic species. Proteins are labelled by the species abbreviation of organisms in which they are found followed by the NCBI Genbank id (gi). Species abbreviations are provided in the figure and also available in the ‘Materials and Methods’.
Figure 2.
Figure 2.
(A) Multiple sequence alignment of various eukaryotic CR-NTPase families and related AAA+ ATPases. Protein sequences are labelled by their species abbreviation followed by the NCBI gis. For species abbreviations refer to the Supplementary data. (B) Topology diagram depicting the conserved core of the STAND-CDC6/ORC1-like AAA+ ATPases highlighting family-specific and overall features. (C) 3-D cartoon of representative CDC6 structure (pdb: 2QBY) illustrating its DNA-contacting interface. (D) Representative structure of HEH domains depicting their DNA-binding modes. (E) Phylogenetic tree depicting the inter-relationships between various members of the STAND-CDC6/ORC1-like AAA+ATPases.
Figure 3.
Figure 3.
Multiple sequence alignment and predicted topologies for widely present CR-REase families. The conserved catalytic site residues are highlighted in both the alignment and the topology figure. The equivalence of core secondary elements (α1–β1–β2–β3–α2) between these families is also illustrated. Protein sequences are labelled by their species abbreviation followed by the NCBI gis. For species abbreviations refer to the ‘Materials and Methods’.
Figure 4.
Figure 4.
(A–B) Multiple sequence alignment of CR-HNH nuclease families. (C) Multiple sequence alignment and topology of CR-ubiquitin-like (CR-Ubl) Header domains. (D–F) Multiple sequence alignment of other lineage-specific Header domains including VP-NTD, Mnag-NTD and Caps-NTD domains. Protein sequences are labelled by their species abbreviation followed by the NCBI gis. For species abbreviations refer to the Supplementary data.
Figure 5.
Figure 5.
(A–F) Phylogenetic trees illustrating LSEs of eukaryotic CR protein domains and specific gene transfers from bacterial homologs (indicated by the curved arrows). LSEs are shown as coloured triangles/sectors in the tree. Bootstrap values are shown for the major branches only. The bacterial branches are coloured black. The complete trees from which these were derived can be retrieved from the Supplementary data. For species abbreviations refer to ‘Materials and Methods’. (G) Positional entropy comparison between CR-NTD and CR toxin domains. (H) Entropy plot for CR-Ubl 1+ CR-REase 2 type proteins in P. infestans.

Similar articles

Cited by

References

    1. Aravind L., Anantharaman V., Zhang D., de Souza R.F., Iyer L.M. Gene flow and biological conflict systems in the origin and evolution of eukaryotes. Front Cell Infect. Microbiol. 2012;2:89. - PMC - PubMed
    1. Burt A., Trivers R. Genes in Conflict: The Biology of Selfish Genetic Elements. Cambridge: The Belknap Press of Harvard University Press; 2006.
    1. Dawkins R., Krebs J.R. Arms races between and within species. Proc. R. Soc. Lond. B. Biol. Sci. 1979;205:489–511. - PubMed
    1. Hurst L.D., Atlan A., Bengtsson B.O. Genetic conflicts. Q. Rev. Biol. 1996;71:317–364. - PubMed
    1. Smith J.M., Price G.R. The logic of animal conflict. Nature. 1973;246:15–18.

Publication types