The Pan-Cancer analysis of pseudogene expression reveals biologically and clinically relevant tumour subtypes

Nat Commun. 2014 Jul 7;5:3963. doi: 10.1038/ncomms4963.


Although individual pseudogenes have been implicated in tumour biology, the biomedical significance and clinical relevance of pseudogene expression have not been assessed in a systematic way. Here we generate pseudogene expression profiles in 2,808 patient samples of seven cancer types from The Cancer Genome Atlas RNA-seq data using a newly developed computational pipeline. Supervised analysis reveals a significant number of pseudogenes differentially expressed among established tumour subtypes and pseudogene expression alone can accurately classify the major histological subtypes of endometrial cancer. Across cancer types, the tumour subtypes revealed by pseudogene expression show extensive and strong concordance with the subtypes defined by other molecular data. Strikingly, in kidney cancer, the pseudogene expression subtypes not only significantly correlate with patient survival, but also help stratify patients in combination with clinical variables. Our study highlights the potential of pseudogene expression analysis as a new paradigm for investigating cancer mechanisms and discovering prognostic biomarkers.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism*
  • Gene Expression
  • Genes, Neoplasm
  • Humans
  • Neoplasms / classification*
  • Neoplasms / genetics
  • Neoplasms / metabolism*
  • Pseudogenes*


  • Biomarkers, Tumor