High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses

Anal Chem. 2005 Sep 1;77(17):5635-42. doi: 10.1021/ac050601e.


In metabolomics, the objective is to identify differences in metabolite profiles between samples. A widely used tool in metabolomics investigations is gas chromatography-mass spectrometry (GC/MS). More than 400 compounds can be detected in a single analysis, if overlapping GC/MS peaks are deconvoluted. However, the deconvolution process is time-consuming and difficult to automate, and additional processing is needed in order to compare samples. Therefore, there is a need to improve and automate the data processing strategy for data generated in GC/MS-based metabolomics; if not, the processing step will be a major bottleneck for high-throughput analyses. Here we describe a new semiautomated strategy using a hierarchical multivariate curve resolution approach that processes all samples simultaneously. The presented strategy generates (after appropriate treatment, e.g., multivariate analysis) tables of all the detected metabolites that differ in relative concentrations between samples. The processing of 70 samples took similar time to that of the GC/TOFMS analyses of the samples. The strategy has been validated using two different sets of samples: a complex mixture of standard compounds and Arabidopsis samples.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / chemistry
  • Arabidopsis / genetics
  • Arabidopsis / metabolism*
  • Gas Chromatography-Mass Spectrometry / instrumentation*
  • Gas Chromatography-Mass Spectrometry / methods*
  • Gibberellins / chemistry
  • Metabolic Networks and Pathways*
  • Mutation / genetics


  • Gibberellins