Enhancing identification of cancer types via lowly-expressed microRNAs

Nucleic Acids Res. 2017 May 19;45(9):5048-5060. doi: 10.1093/nar/gkx210.

Abstract

The primary function of microRNAs (miRNAs) is to maintain cell homeostasis. In cancerous tissues miRNAs' expression undergo drastic alterations. In this study, we use miRNA expression profiles from The Cancer Genome Atlas of 24 cancer types and 3 healthy tissues, collected from >8500 samples. We seek to classify the cancer's origin and tissue identification using the expression from 1046 reported miRNAs. Despite an apparent uniform appearance of miRNAs among cancerous samples, we recover indispensable information from lowly expressed miRNAs regarding the cancer/tissue types. Multiclass support vector machine classification yields an average recall of 58% in identifying the correct tissue and tumor types. Data discretization had led to substantial improvement, reaching an average recall of 91% (95% median). We propose a straightforward protocol as a crucial step in classifying tumors of unknown primary origin. Our counter-intuitive conclusion is that in almost all cancer types, highly expressing miRNAs mask the significant signal that lower expressed miRNAs provide.

Publication types

  • Evaluation Study

MeSH terms

  • Biomarkers, Tumor / analysis*
  • Biomarkers, Tumor / genetics
  • Gene Expression Profiling
  • Humans
  • MicroRNAs / analysis*
  • MicroRNAs / genetics
  • Neoplasms / classification
  • Neoplasms / diagnosis*
  • Neoplasms / genetics

Substances

  • Biomarkers, Tumor
  • MicroRNAs