The Evolutionary Traceability of a Protein
- PMID: 30649284
- PMCID: PMC6394115
- DOI: 10.1093/gbe/evz008
Abstract
Orthologs document the evolution of genes and metabolic capacities encoded in extant and ancient genomes. However, the similarity between orthologs decays with time and ultimately becomes insufficient to infer common ancestry. This leaves ancient gene set reconstructions incomplete and distorted to an unknown extent. Here we introduce "evolutionary traceability," a measure that quantifies, for each protein, the evolutionary distance beyond which the sensitivity of the ortholog search becomes limiting. Using yeast, we show that genes thought to date back to the last universal common ancestor (LUCA) have high traceability. Their functions mostly involve catalysis, ion transport, and ribonucleoprotein complex assembly. In turn, the fraction of yeast genes whose traceability is insufficient to infer their presence in LUCA is enriched for regulatory functions. Computing the traceabilities of genes that have been experimentally characterized as essential for a self-replicating cell reveals that many of the genes lacking orthologs outside bacteria have low traceability. This leaves open whether their orthologs in the eukaryotic and archaeal domains have been overlooked. Using REC8, a protein essential for chromosome cohesion, as an example, we demonstrate how a traceability-informed adjustment of the search sensitivity identifies hitherto missed orthologs in the fast-evolving microsporidia. Taken together, evolutionary traceability helps to differentiate between true absence and nondetection of orthologs, and thus improves our understanding of the evolutionary conservation of functional protein networks. "protTrace," a software tool for computing evolutionary traceability, is freely available at https://github.com/BIONF/protTrace.git; last accessed February 10, 2019.
Keywords: LUCA; metabolic pathway; ortholog search; phylogenetic profile; sequence evolution; twilight zone.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
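The distinction the abstract draws between true absence and nondetection can be illustrated with a minimal Python sketch. It is not protTrace's implementation or interface: the class, the function, the species names, and all numbers are hypothetical, and traceability is simplified here to a single cutoff distance per protein, beyond which a failed ortholog search is treated as uninformative.

```python
from dataclasses import dataclass


@dataclass
class SearchResult:
    """Outcome of an ortholog search in one target species (hypothetical type)."""
    species: str
    distance: float       # evolutionary distance from the seed species (assumed units)
    ortholog_found: bool


def interpret(result: SearchResult, traceable_up_to: float) -> str:
    """Classify a search outcome given a protein's assumed traceability cutoff.

    traceable_up_to is a hypothetical per-protein distance beyond which the
    search is assumed to be too insensitive to detect an existing ortholog.
    """
    if result.ortholog_found:
        return "present"
    if result.distance <= traceable_up_to:
        # The search should have found an ortholog if one existed.
        return "likely true absence"
    # Beyond the cutoff, a miss says little about presence or absence.
    return "absence uninformative"


# Illustrative values only; not taken from the paper.
results = [
    SearchResult("species_A", 0.6, True),
    SearchResult("species_B", 0.9, False),
    SearchResult("species_C", 2.4, False),
]
labels = [interpret(r, traceable_up_to=1.5) for r in results]
```

In this toy setting, only the miss in `species_B` counts as evidence of absence; the miss in `species_C` lies beyond the assumed cutoff and would instead motivate a more sensitive search.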
Similar articles
- A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol. 2004;5(2):R7. doi: 10.1186/gb-2004-5-2-r7. Epub 2004 Jan 15. PMID: 14759257. Free PMC article.
- Protein superfamily evolution and the last universal common ancestor (LUCA). J Mol Evol. 2006 Oct;63(4):513-25. doi: 10.1007/s00239-005-0289-7. Epub 2006 Oct 4. PMID: 17021929.
- Assignment of orthologous genes via genome rearrangement. IEEE/ACM Trans Comput Biol Bioinform. 2005 Oct-Dec;2(4):302-15. doi: 10.1109/TCBB.2005.48. PMID: 17044168.
- The Unfinished Reconstructed Nature of the Last Universal Common Ancestor. J Mol Evol. 2024 Oct;92(5):584-592. doi: 10.1007/s00239-024-10187-8. Epub 2024 Jul 18. PMID: 39026043. Free PMC article. Review.
- The many faces of the helix-turn-helix domain: transcription regulation and beyond. FEMS Microbiol Rev. 2005 Apr;29(2):231-62. doi: 10.1016/j.femsre.2004.12.008. PMID: 15808743. Review.
Cited by
- The genetic factors of bilaterian evolution. Elife. 2020 Jul 16;9:e45530. doi: 10.7554/eLife.45530. PMID: 32672535. Free PMC article.
- Tracing Eukaryotic Ribosome Biogenesis Factors Into the Archaeal Domain Sheds Light on the Evolution of Functional Complexity. Front Microbiol. 2021 Sep 16;12:739000. doi: 10.3389/fmicb.2021.739000. eCollection 2021. PMID: 34603269. Free PMC article.
- Evolutionary Trajectories of New Duplicated and Putative De Novo Genes. Mol Biol Evol. 2023 May 2;40(5):msad098. doi: 10.1093/molbev/msad098. PMID: 37139943. Free PMC article.
- Advances and Applications in the Quest for Orthologs. Mol Biol Evol. 2019 Oct 1;36(10):2157-2164. doi: 10.1093/molbev/msz150. PMID: 31241141. Free PMC article.
- Universal and taxon-specific trends in protein sequences as a function of age. Elife. 2021 Jan 8;10:e57347. doi: 10.7554/eLife.57347. PMID: 33416492. Free PMC article.
