A review of methods and databases for metagenomic classification and assembly

Brief Bioinform. 2019 Jul 19;20(4):1125-1136. doi: 10.1093/bib/bbx120.

Abstract

Microbiome research has grown rapidly over the past decade, with a proliferation of new methods that seek to make sense of large, complex data sets. Here, we survey two of the primary types of methods for analyzing microbiome data: read classification and metagenomic assembly, and we review some of the challenges facing these methods. All of the methods rely on public genome databases, and we also discuss the content of these databases and how their quality has a direct impact on our ability to interpret a microbiome sample.

Keywords: bacteria; databases; microbial genomics; microbiome; next-generation sequencing.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Algorithms
  • Computational Biology / methods
  • Databases, Genetic* / statistics & numerical data
  • Gene Expression Profiling / statistics & numerical data
  • Genetic Markers
  • High-Throughput Nucleotide Sequencing / statistics & numerical data
  • Metagenome
  • Metagenomics / methods*
  • Metagenomics / statistics & numerical data
  • Microbiota / genetics
  • Phylogeny
  • Sequence Alignment / statistics & numerical data

Substances

  • Genetic Markers