The BIG Data Center: from deposition to integration to translation

Nucleic Acids Res. 2017 Jan 4;45(D1):D18-D24. doi: 10.1093/nar/gkw1060. Epub 2016 Nov 28.


Biological data are generated at unprecedentedly exponential rates, posing considerable challenges in big data deposition, integration and translation. The BIG Data Center, established at Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, provides a suite of database resources, including (i) Genome Sequence Archive, a data repository specialized for archiving raw sequence reads, (ii) Gene Expression Nebulas, a data portal of gene expression profiles based entirely on RNA-Seq data, (iii) Genome Variation Map, a comprehensive collection of genome variations for featured species, (iv) Genome Warehouse, a centralized resource housing genome-scale data with particular focus on economically important animals and plants, (v) Methylation Bank, an integrated database of whole-genome single-base resolution methylomes and (vi) Science Wikis, a central access point for biological wikis developed for community annotations. The BIG Data Center is dedicated to constructing and maintaining biological databases through big data integration and value-added curation, conducting basic research to translate big data into big knowledge and providing freely open access to a variety of data resources in support of worldwide research activities in both academia and industry. All of these resources are publicly available and can be found at

MeSH terms

  • Animals
  • Asians / genetics
  • DNA Methylation
  • Databases, Genetic*
  • Dogs
  • Gene Expression Profiling
  • Genetic Variation
  • Genomics*
  • Humans
  • Mice
  • Oryza / genetics
  • Oryza / metabolism
  • Rats
  • Sorghum / genetics
  • Systems Integration