Topologically inferring pathway activity toward precise cancer classification via integrating genomic and metabolomic data: prostate cancer as a case

Sci Rep. 2015 Aug 19;5:13192. doi: 10.1038/srep13192.

Abstract

Precise cancer classification is a central challenge in clinical cancer research such as diagnosis, prognosis and metastasis prediction. Most existing cancer classification methods based on gene or metabolite biomarkers were limited to single genomics or metabolomics, and lacked integration and utilization of multiple 'omics' data. The accuracy and robustness of these methods when applied to independent cohorts of patients must be improved. In this study, we propose a directed random walk-based method to evaluate the topological importance of each gene in a reconstructed gene-metabolite graph by integrating information from matched gene expression profiles and metabolomic profiles. The joint use of gene and metabolite information contributes to accurate evaluation of the topological importance of genes and reproducible pathway activities. We constructed classifiers using reproducible pathway activities for precise cancer classification and risk metabolic pathway identification. We applied the proposed method to the classification of prostate cancer. Within-dataset experiments and cross-dataset experiments on three independent datasets demonstrated that the proposed method achieved a more accurate and robust overall performance compared to several existing classification methods. The resulting risk pathways and topologically important differential genes and metabolites provide biologically informative models for prostate cancer prognosis and therapeutic strategies development.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / metabolism
  • Biosynthetic Pathways / genetics
  • Databases as Topic*
  • Genomics*
  • Humans
  • Male
  • Metabolic Networks and Pathways / genetics
  • Metabolomics*
  • Prognosis
  • Prostatic Neoplasms / classification
  • Prostatic Neoplasms / genetics*
  • Prostatic Neoplasms / metabolism*
  • Reproducibility of Results
  • Software

Substances

  • Biomarkers, Tumor