Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle
- PMID: 12487758
- DOI: 10.1089/106652702761034136
Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle
Abstract
The Minimum Evolution (ME) approach to phylogeny estimation has been shown to be statistically consistent when it is used in conjunction with ordinary least-squares (OLS) fitting of a metric to a tree structure. The traditional approach to using ME has been to start with the Neighbor Joining (NJ) topology for a given matrix and then do a topological search from that starting point. The first stage requires O(n(3)) time, where n is the number of taxa, while the current implementations of the second are in O(p n(3)) or more, where p is the number of swaps performed by the program. In this paper, we examine a greedy approach to minimum evolution which produces a starting topology in O(n(2)) time. Moreover, we provide an algorithm that searches for the best topology using nearest neighbor interchanges (NNIs), where the cost of doing p NNIs is O(n(2) + p n), i.e., O(n(2)) in practice because p is always much smaller than n. The Greedy Minimum Evolution (GME) algorithm, when used in combination with NNIs, produces trees which are fairly close to NJ trees in terms of topological accuracy. We also examine ME under a balanced weighting scheme, where sibling subtrees have equal weight, as opposed to the standard "unweighted" OLS, where all taxa have the same weight so that the weight of a subtree is equal to the number of its taxa. The balanced minimum evolution scheme (BME) runs slower than the OLS version, requiring O(n(2) x diam(T)) operations to build the starting tree and O(p n x diam(T)) to perform the NNIs, where diam(T) is the topological diameter of the output tree. In the usual Yule-Harding distribution on phylogenetic trees, the diameter expectation is in log(n), so our algorithms are in practice faster that NJ. Moreover, this BME scheme yields a very significant improvement over NJ and other distance-based algorithms, especially with large trees, in terms of topological accuracy.
Similar articles
-
Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting.Mol Biol Evol. 2004 Mar;21(3):587-98. doi: 10.1093/molbev/msh049. Epub 2003 Dec 23. Mol Biol Evol. 2004. PMID: 14694080
-
Robustness of phylogenetic inference based on minimum evolution.Bull Math Biol. 2010 Oct;72(7):1820-39. doi: 10.1007/s11538-010-9510-y. Epub 2010 May 7. Bull Math Biol. 2010. PMID: 20449671
-
Consistency of topological moves based on the balanced minimum evolution principle of phylogenetic inference.IEEE/ACM Trans Comput Biol Bioinform. 2009 Jan-Mar;6(1):110-7. doi: 10.1109/TCBB.2008.37. IEEE/ACM Trans Comput Biol Bioinform. 2009. PMID: 19179704
-
Neighbor-joining revealed.Mol Biol Evol. 2006 Nov;23(11):1997-2000. doi: 10.1093/molbev/msl072. Epub 2006 Jul 28. Mol Biol Evol. 2006. PMID: 16877499 Review.
-
Distance-based phylogenetic inference from typing data: a unifying view.Brief Bioinform. 2021 May 20;22(3):bbaa147. doi: 10.1093/bib/bbaa147. Brief Bioinform. 2021. PMID: 32734294 Review.
Cited by
-
Whole-exome sequencing identifies mutated c12orf57 in recessive corpus callosum hypoplasia.Am J Hum Genet. 2013 Mar 7;92(3):392-400. doi: 10.1016/j.ajhg.2013.02.004. Epub 2013 Feb 28. Am J Hum Genet. 2013. PMID: 23453666 Free PMC article.
-
FastRFS: fast and accurate Robinson-Foulds Supertrees using constrained exact optimization.Bioinformatics. 2017 Mar 1;33(5):631-639. doi: 10.1093/bioinformatics/btw600. Bioinformatics. 2017. PMID: 27663499 Free PMC article.
-
DAMBE5: a comprehensive software package for data analysis in molecular biology and evolution.Mol Biol Evol. 2013 Jul;30(7):1720-8. doi: 10.1093/molbev/mst064. Epub 2013 Apr 5. Mol Biol Evol. 2013. PMID: 23564938 Free PMC article.
-
Coevolution between Stop Codon Usage and Release Factors in Bacterial Species.Mol Biol Evol. 2016 Sep;33(9):2357-67. doi: 10.1093/molbev/msw107. Epub 2016 Jun 13. Mol Biol Evol. 2016. PMID: 27297468 Free PMC article.
-
Resolving tumor evolution: a phylogenetic approach.J Natl Cancer Cent. 2024 Mar 21;4(2):97-106. doi: 10.1016/j.jncc.2024.03.001. eCollection 2024 Jun. J Natl Cancer Cent. 2024. PMID: 39282584 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous
