OncoBase: a platform for decoding regulatory somatic mutations in human cancers

Nucleic Acids Res. 2019 Jan 8;47(D1):D1044-D1055. doi: 10.1093/nar/gky1139.

Abstract

Whole-exome and whole-genome sequencing have revealed millions of somatic mutations associated with different human cancers, and the vast majority of them are located outside of coding sequences, making it challenging to directly interpret their functional effects. With the rapid advances in high-throughput sequencing technologies, genome-scale long-range chromatin interactions were detected, and distal target genes of regulatory elements were determined using three-dimensional (3D) chromatin looping. Herein, we present OncoBase (http://www.oncobase.biols.ac.cn/), an integrated database for annotating 81 385 242 somatic mutations in 68 cancer types from more than 120 cancer projects by exploring their roles in distal interactions between target genes and regulatory elements. OncoBase integrates local chromatin signatures, 3D chromatin interactions in different cell types and reconstruction of enhancer-target networks using state-of-the-art algorithms. It employs informative visualization tools to display the integrated local and 3D chromatin signatures and effects of somatic mutations on regulatory elements. Enhancer-promoter interactions estimated from chromatin interactions are integrated into a network diffusion system that quantitatively prioritizes somatic mutations and target genes from a large pool. Thus, OncoBase is a useful resource for the functional annotation of regulatory noncoding regions and systematically benchmarking the regulatory effects of embedded noncoding somatic mutations in human carcinogenesis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Chromatin / genetics
  • Computational Biology / methods*
  • Databases, Genetic*
  • Gene Expression Regulation, Neoplastic
  • Genomics / methods
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Internet
  • Mutation*
  • Neoplasms / genetics*
  • Quantitative Trait Loci / genetics
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Reproducibility of Results

Substances

  • Chromatin