MIDAS2: Metagenomic Intra-species Diversity Analysis System

Bioinformatics. 2023 Jan 1;39(1):btac713. doi: 10.1093/bioinformatics/btac713.

Abstract

Summary: The Metagenomic Intra-Species Diversity Analysis System (MIDAS) is a scalable metagenomic pipeline that identifies single nucleotide variants (SNVs) and gene copy number variants in microbial populations. Here, we present MIDAS2, which addresses the computational challenges presented by increasingly large reference genome databases, while adding functionality for building custom databases and leveraging paired-end reads to improve SNV accuracy. This fast and scalable reengineering of the MIDAS pipeline enables thousands of metagenomic samples to be efficiently genotyped.

Availability and implementation: The source code is available at https://github.com/czbiohub/MIDAS2.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Databases, Factual
  • Genotype
  • Metagenome*
  • Metagenomics
  • Software*