Graph Splitting: A Graph-Based Approach for Superfamily-Scale Phylogenetic Tree Reconstruction
- PMID: 31364707
- DOI: 10.1093/sysbio/syz049
Graph Splitting: A Graph-Based Approach for Superfamily-Scale Phylogenetic Tree Reconstruction
Abstract
A protein superfamily contains distantly related proteins that have acquired diverse biological functions through a long evolutionary history. Phylogenetic analysis of the early evolution of protein superfamilies is a key challenge because existing phylogenetic methods show poor performance when protein sequences are too diverged to construct an informative multiple sequence alignment (MSA). Here, we propose the Graph Splitting (GS) method, which rapidly reconstructs a protein superfamily-scale phylogenetic tree using a graph-based approach. Evolutionary simulation showed that the GS method can accurately reconstruct phylogenetic trees and be robust to major problems in phylogenetic estimation, such as biased taxon sampling, heterogeneous evolutionary rates, and long-branch attraction when sequences are substantially diverge. Its application to an empirical data set of the triosephosphate isomerase (TIM)-barrel superfamily suggests rapid evolution of protein-mediated pyrimidine biosynthesis, likely taking place after the RNA world. Furthermore, the GS method can also substantially improve performance of widely used MSA methods by providing accurate guide trees.
Keywords: Bioinformatics; TIM-barrel superfamily; early evolution; network analysis; phylogenetic method.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society of Systematic Biologists.
Similar articles
-
Evolutionary dynamics of triosephosphate isomerase gene intron location pattern in Metazoa: A new perspective on intron evolution in animals.Gene. 2017 Feb 20;602:24-32. doi: 10.1016/j.gene.2016.11.027. Epub 2016 Nov 15. Gene. 2017. PMID: 27864009
-
Phylogenetic tree reconstruction via graph cut presented using a quantum-inspired computer.Mol Phylogenet Evol. 2023 Jan;178:107636. doi: 10.1016/j.ympev.2022.107636. Epub 2022 Oct 5. Mol Phylogenet Evol. 2023. PMID: 36208695
-
Bayesian coestimation of phylogeny and sequence alignment.BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83. BMC Bioinformatics. 2005. PMID: 15804354 Free PMC article.
-
Molecular evolution and phylogenetic implications in clinical research.Ann Agric Environ Med. 2013;20(3):455-9. Ann Agric Environ Med. 2013. PMID: 24069849 Review.
-
Molecular Phylogenetics: Concepts for a Newcomer.Adv Biochem Eng Biotechnol. 2017;160:185-196. doi: 10.1007/10_2016_49. Adv Biochem Eng Biotechnol. 2017. PMID: 27783136 Review.
Cited by
-
Rapid evolution of mammalian APLP1 as a synaptic adhesion molecule.Sci Rep. 2021 May 28;11(1):11305. doi: 10.1038/s41598-021-90737-y. Sci Rep. 2021. PMID: 34050225 Free PMC article.
-
Four additional natural 7-deazaguanine derivatives in phages and how to make them.Nucleic Acids Res. 2023 Sep 22;51(17):9214-9226. doi: 10.1093/nar/gkad657. Nucleic Acids Res. 2023. PMID: 37572349 Free PMC article.
-
Genome analysis of Parmales, the sister group of diatoms, reveals the evolutionary specialization of diatoms from phago-mixotrophs to photoautotrophs.Commun Biol. 2023 Jul 7;6(1):697. doi: 10.1038/s42003-023-05002-x. Commun Biol. 2023. PMID: 37420035 Free PMC article.
-
A sequence-based evolutionary distance method for Phylogenetic analysis of highly divergent proteins.Sci Rep. 2023 Nov 20;13(1):20304. doi: 10.1038/s41598-023-47496-9. Sci Rep. 2023. PMID: 37985846 Free PMC article.
-
Phylogenetics Identifies Two Eumetazoan TRPM Clades and an Eighth TRP Family, TRP Soromelastatin (TRPS).Mol Biol Evol. 2020 Jul 1;37(7):2034-2044. doi: 10.1093/molbev/msaa065. Mol Biol Evol. 2020. PMID: 32159767 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
