Metabolomics Data Preprocessing Using ADAP and MZmine 2

Methods Mol Biol. 2020:2104:25-48. doi: 10.1007/978-1-0716-0239-3_3.

Abstract

The informatics pipeline for making sense of untargeted LC-MS or GC-MS data starts with preprocessing the raw data. The preprocessing results undergo statistical analysis and are subsequently mapped to metabolic pathways to place the untargeted metabolomics data in biological context. ADAP is a suite of computational algorithms developed specifically for preprocessing LC-MS and GC-MS data. It consists of two separate computational workflows that extract compound-relevant information from raw LC-MS and GC-MS data, respectively. The computational steps include construction of extracted ion chromatograms, detection of chromatographic peaks, spectral deconvolution, and alignment. The two workflows have been incorporated into the cross-platform, graphical MZmine 2 framework, and ADAP-specific graphical user interfaces have been developed to make ADAP easy to use. This chapter summarizes the algorithmic principles underlying key steps in the two workflows and illustrates how to apply ADAP to preprocess LC-MS and GC-MS data.
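
As a rough illustration of the first step named above, construction of extracted ion chromatograms (EICs), the following minimal sketch groups centroided data points from successive scans by m/z. It is not ADAP's algorithm and does not use the MZmine 2 API. The input layout, the function names `build_eics` and `apex`, and the `mz_tol` parameter are all assumptions made for this example.

```python
# Illustrative sketch only: a naive EIC builder, NOT the ADAP implementation.
# Assumed input: a list of scans, each a tuple
#   (retention_time, [(mz, intensity), ...]) of centroided data points.

def build_eics(scans, mz_tol=0.01):
    """Group data points into EICs by m/z proximity (hypothetical helper).

    A point joins the first existing EIC whose running mean m/z lies
    within mz_tol (an assumed absolute tolerance); otherwise it starts
    a new EIC.
    """
    eics = []  # each EIC: {"mz": running mean m/z, "points": [(rt, intensity)]}
    for rt, peaks in scans:
        for mz, intensity in peaks:
            for eic in eics:
                if abs(eic["mz"] - mz) <= mz_tol:
                    eic["points"].append((rt, intensity))
                    n = len(eic["points"])
                    # Incrementally update the running mean m/z of this EIC.
                    eic["mz"] += (mz - eic["mz"]) / n
                    break
            else:
                # No existing EIC matched, so start a new one.
                eics.append({"mz": mz, "points": [(rt, intensity)]})
    return eics


def apex(eic):
    """Return the (rt, intensity) point of highest intensity in an EIC."""
    return max(eic["points"], key=lambda p: p[1])


# Tiny made-up example: two ions observed across three scans.
scans = [
    (1.0, [(100.001, 10.0), (200.000, 5.0)]),
    (2.0, [(100.002, 50.0), (200.001, 7.0)]),
    (3.0, [(100.000, 20.0)]),
]
eics = build_eics(scans)
```

Taking the apex of each EIC is only a crude stand-in for chromatographic peak detection. Spectral deconvolution and alignment, the later steps listed in the abstract, are not covered by this sketch.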

Keywords: ADAP; Alignment; Data preprocessing; GC–MS; LC–MS; MZmine 2; Metabolomics; Peak picking; Spectral deconvolution; Visualization.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Chromatography, Liquid
  • Computational Biology / methods*
  • Data Interpretation, Statistical*
  • Gas Chromatography-Mass Spectrometry
  • Humans
  • Mass Spectrometry
  • Metabolomics* / methods
  • Software*
  • User-Computer Interface
  • Workflow