CCG: an integrative resource of cancer protein-coding genes and long noncoding RNAs

Discov Med. 2016 Dec;22(123):351-359.


The identification of cancer genes remains a main aim of cancer research. With the advances of high-throughput sequencing technologies, thousands of novel cancer genes were identified through recurrent mutation analyses and differential expression analyses between normal tissues and tumors in large populations. Many databases were developed to document the cancer genes. However, no public database providing both cancer protein-coding genes and cancer lncRNAs is available presently. Here, we present the Catalogue of Cancer Genes (CCG) database (, a catalogue of cancer genes. It includes both well-supported and candidate cancer protein-coding genes and cancer lncRNAs collected from literature search and public databases. In addition, uniform genomic aberration information (such as somatic mutation and copy number variation) and drug-gene interactions were assigned to cancer genes in the database. CCG represents an effort on integrative assembly of well-supported and candidate cancer protein-coding and long noncoding RNA genes and takes advantages of high-throughput sequencing results on large populations. With the help of CCG, users can easily access a comprehensive list of cancer genes as well as genomic aberration related with these genes. The availability of integrative information will facilitate the understanding of cancer mechanisms. In addition, drug-gene information in CCG provides a useful guide to the development of new anti-cancer drugs and selection of rational combination therapies.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA Copy Number Variations
  • DNA Mutational Analysis
  • Databases, Genetic* / trends
  • Drug Discovery / methods
  • Exome / genetics
  • Gene Expression Profiling
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Molecular Targeted Therapy
  • Mutation
  • Neoplasms / drug therapy
  • Neoplasms / genetics*
  • Oncogenes / genetics*
  • RNA, Long Noncoding / genetics*


  • RNA, Long Noncoding