Bioinformatics tools and databases for whole genome sequence analysis of Mycobacterium tuberculosis

Infect Genet Evol. 2016 Nov;45:359-368. doi: 10.1016/j.meegid.2016.09.013. Epub 2016 Sep 13.


Tuberculosis (TB) is an infectious disease of global public health importance caused by Mycobacterium tuberculosis complex (MTC) in which M. tuberculosis (Mtb) is the major causative agent. Recent advancements in genomic technologies such as next generation sequencing have enabled high throughput cost-effective generation of whole genome sequence information from Mtb clinical isolates, providing new insights into the evolution, genomic diversity and transmission of the Mtb bacteria, including molecular mechanisms of antibiotic resistance. The large volume of sequencing data generated however necessitated effective and efficient management, storage, analysis and visualization of the data and results through development of novel and customized bioinformatics software tools and databases. In this review, we aim to provide a comprehensive survey of the current freely available bioinformatics software tools and publicly accessible databases for genomic analysis of Mtb for identifying disease transmission in molecular epidemiology and in rapid determination of the antibiotic profiles of clinical isolates for prompt and optimal patient treatment.

Keywords: Analysis tools; Bioinformatics; Databases; Next generation sequencing; Tuberculosis; Whole genome sequencing.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Databases, Genetic
  • Genome, Bacterial / genetics*
  • Humans
  • Molecular Epidemiology
  • Mycobacterium tuberculosis / genetics*
  • Phylogeny
  • Sequence Analysis, DNA / methods*
  • Tuberculosis / microbiology*