Unified and isomer-specific NMR metabolomics database for the accurate analysis of (13)C-(1)H HSQC spectra

ACS Chem Biol. 2015 Feb 20;10(2):452-9. doi: 10.1021/cb5006382. Epub 2014 Nov 5.

Abstract

A new metabolomics database and query algorithm for the analysis of (13)C-(1)H HSQC spectra is introduced, which unifies NMR spectroscopic information on 555 metabolites from both the Biological Magnetic Resonance Data Bank (BMRB) and Human Metabolome Database (HMDB). The new database, termed Complex Mixture Analysis by NMR (COLMAR) (13)C-(1)H HSQC database, can be queried via an interactive, easy to use web interface at http://spin.ccic.ohio-state.edu/index.php/hsqc/index . Our new HSQC database separately treats slowly exchanging isomers that belong to the same metabolite, which permits improved query in cases where lowly populated isomers are below the HSQC detection limit. The performance of our new database and query web server compares favorably with the one of existing web servers, especially for spectra of samples of high complexity, including metabolite mixtures from the model organisms Drosophila melanogaster and Escherichia coli. For such samples, our web server has on average a 37% higher accuracy (true positive rate) and a 82% lower false positive rate, which makes it a useful tool for the rapid and accurate identification of metabolites from (13)C-(1)H HSQC spectra at natural abundance. This information can be combined and validated with NMR data from 2D TOCSY-type spectra that provide connectivity information not present in HSQC spectra.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Carbon Isotopes
  • Databases, Factual
  • Drosophila Proteins / chemistry*
  • Escherichia coli Proteins / chemistry*
  • Magnetic Resonance Spectroscopy*
  • Metabolomics*
  • User-Computer Interface

Substances

  • Carbon Isotopes
  • Drosophila Proteins
  • Escherichia coli Proteins