Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 May 29:26:27663.
doi: 10.3402/mehd.v26.27663. eCollection 2015.

Analysis of composition of microbiomes: a novel method for studying microbial composition

Affiliations

Analysis of composition of microbiomes: a novel method for studying microbial composition

Siddhartha Mandal et al. Microb Ecol Health Dis. .

Abstract

Background: Understanding the factors regulating our microbiota is important but requires appropriate statistical methodology. When comparing two or more populations most existing approaches either discount the underlying compositional structure in the microbiome data or use probability models such as the multinomial and Dirichlet-multinomial distributions, which may impose a correlation structure not suitable for microbiome data.

Objective: To develop a methodology that accounts for compositional constraints to reduce false discoveries in detecting differentially abundant taxa at an ecosystem level, while maintaining high statistical power.

Methods: We introduced a novel statistical framework called analysis of composition of microbiomes (ANCOM). ANCOM accounts for the underlying structure in the data and can be used for comparing the composition of microbiomes in two or more populations. ANCOM makes no distributional assumptions and can be implemented in a linear model framework to adjust for covariates as well as model longitudinal data. ANCOM also scales well to compare samples involving thousands of taxa.

Results: We compared the performance of ANCOM to the standard t-test and a recently published methodology called Zero Inflated Gaussian (ZIG) methodology (1) for drawing inferences on the mean taxa abundance in two or more populations. ANCOM controlled the false discovery rate (FDR) at the desired nominal level while also improving power, whereas the t-test and ZIG had inflated FDRs, in some instances as high as 68% for the t-test and 60% for ZIG. We illustrate the performance of ANCOM using two publicly available microbial datasets in the human gut, demonstrating its general applicability to testing hypotheses about compositional differences in microbial communities.

Conclusion: Accounting for compositionality using log-ratio analysis results in significantly improved inference in microbiota survey data.

Keywords: constrained; log-ratio; relative abundance.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Histogram of pairwise Pearson correlation between operational taxonomic units in the global gut data set.
Fig. 2
Fig. 2
Comparison of (a) false discovery rate and (b) statistical power to detect differentially abundant microbial taxa by t-test, ZIG, and analysis of composition of microbiomes, based on 100 simulated data sets consisting of 500 (top panels) and 1,000 (bottom panels) taxa. Value of π ranges from 0.05 to 0.25. Power for the t-test is unity over the entire range of π and is not shown on the plots.
Fig. 3
Fig. 3
Unadjusted raw average OTU relative abundance and standard errors of Bacilli, Clostridia, and Gammaproteobacteria against the variables detected as having significant effects by application of ANCOM on the microbial dataset provided in LaRosa et al. (16). The mean OTU relative abundances for the two modes of birth at different gestational age categories are provided in the first row. The second row provides the mean OTU relative abundances at different ‘Day of life’ categories. The third row provides the mean OTU relative abundance for Bacilli against categories of breast milk variable and for Clostridia against categories of ‘Days on antibiotics’. Although, as in LaRosa et al. (16), ‘Day of life’ and ‘Days on antibiotics’ were analyzed as continuous variables, for simplicity of plotting in this figure they were discretized.

Similar articles

Cited by

References

    1. Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10:1200–2. - PMC - PubMed
    1. Clemente JC, Ursell LK, Parfrey LW, Knight R. The impact of the gut microbiota on human health: an integrative view. Cell. 2012;148:1258–70. - PMC - PubMed
    1. Cho I, Yamanishi S, Cox L, Methé BA, Zavadil J, Li K, et al. Antibiotics in early life alter the murine colonic microbiome and adiposity. Nature. 2012;488:621–6. - PMC - PubMed
    1. Dimmitt RA, Staley EM, Chuang G, Tanner SM, Soltau TD, Lorenz RG. The role of postnatal acquisition of the intestinal microbiome in the early development of immune function. J Pediatr Gastroenterol Nutr. 2010;51:262–73. - PMC - PubMed
    1. Johnson CL, Versalovic J. The human microbiome and its potential importance to pediatrics. Pediatrics. 2012;129:950–60. - PMC - PubMed