International Cancer Genome Consortium Data Portal--a one-stop shop for cancer genomics data

Database (Oxford). 2011 Sep 19;2011:bar026. doi: 10.1093/database/bar026. Print 2011.

Abstract

The International Cancer Genome Consortium (ICGC) is a collaborative effort to characterize genomic abnormalities in 50 different cancer types. To make these data available, the ICGC has created the ICGC Data Portal. Powered by the BioMart software, the Data Portal allows each ICGC member institution to manage and maintain its own databases locally, while seamlessly presenting all the data in a single access point for users. The Data Portal currently contains data from 24 cancer projects, including ICGC, The Cancer Genome Atlas (TCGA), Johns Hopkins University, and the Tumor Sequencing Project. It covers 3478 genomes across 13 cancer types and subtypes. Available open access data types include simple somatic mutations, copy number alterations, structural rearrangements, gene expression, microRNAs, DNA methylation and exon junctions. Additionally, simple germline variations are available as controlled access data. The Data Portal uses a web-based graphical user interface (GUI) to offer researchers multiple ways to quickly and easily search and analyze the available data. The web interface can assist in constructing complicated queries across multiple data sets. Several application programming interfaces are also available for programmatic access. Here we describe the organization, functionality, and capabilities of the ICGC Data Portal.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Database Management Systems*
  • Databases, Factual*
  • Gene Expression Profiling
  • Genetic Variation
  • Genomics*
  • Humans
  • International Cooperation
  • Internet
  • Neoplasms / genetics*
  • Societies
  • User-Computer Interface