Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Jan-Jun:11:100436.
doi: 10.1016/j.softx.2020.100436. Epub 2020 Mar 4.

TreeSwift: A massively scalable Python tree package

Affiliations

TreeSwift: A massively scalable Python tree package

N Moshiri. SoftwareX. 2020 Jan-Jun.

Abstract

Phylogenetic trees are essential to evolutionary biology, and numerous methods exist that attempt to extract phylogenetic information applicable to a wide range of disciplines, such as epidemiology and metagenomics. Currently, the three main Python packages for trees are Bio.Phylo, DendroPy, and the ETE Toolkit, but as dataset sizes grow, parsing and manipulating ultra-large trees becomes impractical for these tools. To address this issue, we present TreeSwift, a user-friendly and massively scalable Python package for traversing and manipulating trees that is ideal for algorithms performed on ultra-large trees.

Keywords: Phylogenetics; Python; Scalable; Tree traversal.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Runtimes of DendroPy, Bio.Phylo, the ETE Toolkit, and TreeSwift for a wide range of typical tree operations using trees of various sizes, as well as memory consumption after loading a tree (see Section 3 for details).
Fig. 2.
Fig. 2.
Example Lineage-Through-Time (LTT) plot generated using TreeSwift.

Similar articles

Cited by

References

    1. Ragonnet-Cronin M, Hodcroft E, Hué S, Fearnhill E, Delpech V, Brown AJ, Lycett S. Automated analysis of phylogenetic clusters. BMC Bioinformatics 2013;14(1):317. 10.1186/1471-2105-14-317, URL 10.1186/1471-2105-14-317. - DOI - DOI - PMC - PubMed
    1. Rose R, Lamers SL, Dollar JJ, Grabowski MK, Hodcroft EB, Ragonnet-Cronin M, Wertheim JO, Redd AD, German D, Laeyendecker O. Identifying transmission clusters with cluster picker and HIV-TRACE. AIDS Res Human Retrovir 2017;33(3):211–8. 10.1089/aid.2016.0205, URL 10.1089/aid.2016.0205. - DOI - DOI - PMC - PubMed
    1. Kembel SW, Eisen JA, Pollard KS, Green JL. The phylogenetic diversity of metagenomes. PLoS One 2011;6(8). e23214. 10.1371/journal.pone.0023214, URL 10.1371/journal.pone.0023214, arXiv:arXiv:1208.5792v1. - DOI - DOI - PMC - PubMed
    1. Darling AE, Jospin G, Lowe E, Matsen FA, Bik HM, Eisen JA. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2014;2. e243. 10.7717/peerj.243, URL https://peerj.com/articles/243/. - DOI - PMC - PubMed
    1. Filipski A, Tamura K, Billing-Ross P, Murillo O, Kumar S. Phylogenetic placement of metagenomic reads using the minimum evolution principle. BMC Genom 2015;16(Supplement 1):S13. 10.1186/1471-2164-16-S1-S13, URL 10.1186/1471-2164-16-S1-S13. - DOI - DOI - PMC - PubMed