Cancer Subtype Discovery Based on Integrative Model of Multigenomic Data

IEEE/ACM Trans Comput Biol Bioinform. 2017 Sep-Oct;14(5):1115-1121. doi: 10.1109/TCBB.2016.2621769. Epub 2016 Oct 26.

Abstract

One major goal of large-scale cancer omics study is to understand molecular mechanisms of cancer and find new biomedical targets. To deal with the high-dimensional multidimensional cancer omics data (DNA methylation, mRNA expression, etc.), which can be used to discover new insight on identifying cancer subtypes, clustering methods are usually used to find an effective low-dimensional subspace of the original data and then cluster cancer samples in the reduced subspace. However, due to data-type diversity and big data volume, few methods can integrate these data and map them into an effective low-dimensional subspace. In this paper, we develop a dimension-reduction and data-integration method for indentifying cancer subtypes, named Scluster. First, Scluster, respectively, projects the different original data into the principal subspaces by an adaptive sparse reduced-rank regression method. Then, a fused patient-by-patient network is obtained for these subgroups through a scaled exponential similarity kernel method. Finally, candidate cancer subtypes are identified using spectral clustering method. We demonstrate the efficiency of our Scluster method using three cancers by jointly analyzing mRNA expression, miRNA expression, and DNA methylation data. The evaluation results and analyses show that Scluster is effective for predicting survival and identifies novel cancer subtypes of large-scale multi-omics data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Genes, Neoplasm / genetics*
  • Genomics / methods*
  • Humans
  • Models, Genetic*
  • Neoplasm Proteins / genetics*
  • Neoplasms / classification
  • Neoplasms / genetics*
  • Neoplasms / mortality*
  • Survival Analysis
  • Systems Integration

Substances

  • Biomarkers, Tumor
  • Neoplasm Proteins