Reuse of public genome-wide gene expression data

Nat Rev Genet. 2013 Feb;14(2):89-99. doi: 10.1038/nrg3394. Epub 2012 Dec 27.


Our understanding of gene expression has changed dramatically over the past decade, largely catalysed by technological developments. High-throughput experiments - microarrays and next-generation sequencing - have generated large amounts of genome-wide gene expression data that are collected in public archives. Added-value databases process, analyse and annotate these data further to make them accessible to every biologist. In this Review, we discuss the utility of the gene expression data that are in the public domain and how researchers are making use of these data. Reuse of public data can be very powerful, but there are many obstacles in data preparation and analysis and in the interpretation of the results. We will discuss these challenges and provide recommendations that we believe can improve the utility of such data.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Computational Biology
  • Databases, Genetic* / standards
  • Databases, Genetic* / statistics & numerical data
  • Gene Expression Profiling / statistics & numerical data*
  • High-Throughput Nucleotide Sequencing / statistics & numerical data
  • High-Throughput Screening Assays / statistics & numerical data
  • Humans
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Public Sector*