Phyloseq: a bioconductor package for handling and analysis of high-throughput phylogenetic sequence data

Pac Symp Biocomput. 2012;235-46.


We present a detailed description of a new Bioconductor package, phyloseq, for integrated data and analysis of taxonomically-clustered phylogenetic sequencing data in conjunction with related data types. The phyloseq package integrates abundance data, phylogenetic information and covariates so that exploratory transformations, plots, and confirmatory testing and diagnostic plots can be carried out seamlessly. The package is built following the S4 object-oriented framework of the R language so that once the data have been input the user can easily transform, plot and analyze the data. We present some examples that highlight the methods and the ease with which we can leverage existing packages.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology
  • Data Interpretation, Statistical
  • Databases, Genetic / statistics & numerical data
  • High-Throughput Nucleotide Sequencing / statistics & numerical data*
  • Microbiota / genetics*
  • Phylogeny*
  • Software*