Predictive Metabolite Profiling Applying Hierarchical Multivariate Curve Resolution to GC-MS Data--A Potential Tool for Multi-Parametric Diagnosis

J Proteome Res. 2006 Jun;5(6):1407-14. doi: 10.1021/pr0600071.


A method for predictive metabolite profiling based on resolution of GC-MS data followed by multivariate data analysis is presented and applied to three different biofluid data sets (rat urine, aspen leaf extracts, and human blood plasma). Hierarchical multivariate curve resolution (H-MCR) was used to simultaneously resolve the GC-MS data into pure profiles, describing the relative metabolite concentrations between samples, for multivariate analysis. Here, we present an extension of the H-MCR method allowing treatment of independent samples according to processing parameters estimated from a set of training samples. Predictions or inclusion of the new samples, based on their metabolite profiles, into an existing model could then be carried out, which is a requirement for a working application within, e.g., clinical diagnosis. Apart from allowing treatment and prediction of independent samples the proposed method also reduces the time for the curve resolution process since only a subset of representative samples have to be processed while the remaining samples can be treated according to the obtained processing parameters. The time required for resolving the 30 training samples in the rat urine example was approximately 13 h, while the treatment of the 30 test samples according to the training parameters required only approximately 30 s per sample (approximately 15 min in total). In addition, the presented results show that the suggested approach works for describing metabolic changes in different biofluids, indicating that this is a general approach for high-throughput predictive metabolite profiling, which could have important applications in areas such as plant functional genomics, drug toxicity, treatment efficacy and early disease diagnosis.

MeSH terms

  • Animals
  • Blood Proteins / analysis*
  • Clinical Laboratory Techniques
  • Data Interpretation, Statistical
  • Gas Chromatography-Mass Spectrometry
  • Humans
  • Male
  • Multivariate Analysis
  • Plant Leaves / chemistry*
  • Proteome / analysis*
  • Rats
  • Urine / chemistry


  • Blood Proteins
  • Proteome