Environmental Atlas of Prokaryotes Enables Powerful and Intuitive Habitat-Based Analysis of Community Structures

iScience. 2020 Sep 29;23(10):101624. doi: 10.1016/j.isci.2020.101624. eCollection 2020 Oct 23.

Abstract

The recent prevalence of high-throughput sequencing has been producing numerous prokaryotic community structure datasets. Although the trait-based approach is useful to interpret those datasets from ecological perspectives, available trait information is biased toward culturable prokaryotes, especially those of clinical and public health relevance, and thus may not represent the breadth of microbiota found across many of Earth's environments. To facilitate habitat-based analysis free of such bias, here we report a ready-to-use prokaryotic habitat database, ProkAtlas. ProkAtlas comprehensively links 16S rRNA gene sequences to prokaryotic habitats, using public shotgun metagenome datasets. We also developed a computational pipeline for habitat-based analysis of given prokaryotic community structures. After confirmation of the method effectiveness using 16S rRNA gene sequence datasets from individual genomes and the Earth Microbiome Project, we showed its validness and effectiveness in drawing ecological insights by applying it to six empirical prokaryotic community datasets from soil, aquatic, and human gut samples.

Keywords: Bioinformatics; Environmental Biotechnology; Microbial Genomics; Microbiology.