FastTree 2--approximately maximum-likelihood trees for large alignments
- PMID: 20224823
- PMCID: PMC2835736
- DOI: 10.1371/journal.pone.0009490
Abstract
Background: We recently described FastTree, a tool for inferring phylogenies for alignments with up to hundreds of thousands of sequences. Here, we describe changes to FastTree that improve its accuracy without sacrificing scalability.
Methodology/principal findings: Where FastTree 1 used nearest-neighbor interchanges (NNIs) and the minimum-evolution criterion to improve the tree, FastTree 2 adds minimum-evolution subtree-pruning-regrafting moves (SPRs) and maximum-likelihood NNIs. FastTree 2 uses heuristics to restrict the search for better trees and estimates a rate of evolution for each site (the "CAT" approximation). Nevertheless, for both simulated and genuine alignments, FastTree 2 is slightly more accurate than a standard implementation of maximum-likelihood NNIs (PhyML 3 with default settings). Although FastTree 2 is not quite as accurate as methods that use maximum-likelihood SPRs, most of the splits that disagree are poorly supported, and for large alignments, FastTree 2 is 100-1,000 times faster. FastTree 2 inferred a topology and likelihood-based local support values for 237,882 distinct 16S ribosomal RNAs on a desktop computer in 22 hours, using 5.8 gigabytes of memory.
Conclusions/significance: FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments. FastTree 2 is freely available at http://www.microbesonline.org/fasttree.
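The nearest-neighbor interchange mentioned in the abstract can be illustrated with a minimal sketch. Around one internal edge joining four subtrees there are exactly three possible arrangements, and an NNI step keeps whichever scores best. The function names, the toy distance table, and the pair-sum score below are hypothetical stand-ins chosen for illustration; they are not FastTree's implementation, which works on profiles and, in version 2, also scores NNIs by likelihood.

```python
from typing import Callable, Dict, FrozenSet, List, Tuple

Pair = Tuple[str, str]
Quartet = Tuple[Pair, Pair]


def nni_neighbors(a: str, b: str, c: str, d: str) -> List[Quartet]:
    """Return the two alternatives to the arrangement ((a,b),(c,d))."""
    return [((a, c), (b, d)), ((a, d), (b, c))]


def best_nni(a: str, b: str, c: str, d: str,
             score: Callable[[Quartet], float]) -> Quartet:
    """Keep the lowest-scoring of the current arrangement and its two NNIs."""
    candidates = [((a, b), (c, d))] + nni_neighbors(a, b, c, d)
    return min(candidates, key=score)


# Toy pairwise distances (assumed values): A,B are close and C,D are close.
dist: Dict[FrozenSet[str], float] = {
    frozenset(("A", "B")): 0.1, frozenset(("C", "D")): 0.2,
    frozenset(("A", "C")): 0.9, frozenset(("B", "D")): 0.8,
    frozenset(("A", "D")): 0.9, frozenset(("B", "C")): 0.8,
}


def pair_sum(q: Quartet) -> float:
    """Toy score: sum of the distances within each side of the edge."""
    return sum(dist[frozenset(pair)] for pair in q)


# Start from a poor arrangement ((A,C),(B,D)) and apply one NNI step.
improved = best_nni("A", "C", "B", "D", pair_sum)
print(improved)
```

Under the toy distances, one step moves the poorly grouped starting arrangement to the grouping that pairs the close sequences. A full search would repeat this over every internal edge until no interchange improves the score.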
Figures
+ SPRs on simulated alignments with 250 protein sequences. We classified PhyML's splits as correct and found by both PhyML and FastTree, correct but missed by FastTree, or incorrect. We show the distribution of support values for each class. The right-most bin includes the strongly supported splits (0.95 to 1.0), and the gray dashed line shows the uniform distribution. The support values are PhyML's minimum of the approximate likelihood ratio test and SH-like local supports.
axis). For RAxML with FastTree's (minimum-evolution) starting tree, we show the starting topology and RAxML's first two rounds of SPR moves.
