Gene expression data analysis

FEBS Lett. 2000 Aug 25;480(1):17-24. doi: 10.1016/s0014-5793(00)01772-5.


Microarrays, one of the latest breakthroughs in experimental molecular biology, allow monitoring of gene expression for tens of thousands of genes in parallel and are already producing huge amounts of valuable data. Analysis and handling of such data are becoming one of the major bottlenecks in the utilization of the technology. The raw microarray data are images, which have to be transformed into gene expression matrices: tables where rows represent genes, columns represent various samples such as tissues or experimental conditions, and the number in each cell characterizes the expression level of the particular gene in the particular sample. These matrices have to be analyzed further if any knowledge about the underlying biological processes is to be extracted. In this paper we concentrate on the bioinformatics methods used for such analysis. We briefly discuss supervised and unsupervised data analysis and their applications, such as predicting gene function classes and cancer classification. Then we discuss how the gene expression matrix can be used to predict putative regulatory signals in the genome sequences. In conclusion we discuss some possible future directions.
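
As an illustration of the gene expression matrix described above, the following is a minimal sketch in Python, not a method taken from the paper. The gene names, sample names, and expression values are invented toy data, and the helper functions `pearson` and `most_similar_pair` are hypothetical names introduced here. Pearson correlation between expression profiles is assumed as the similarity measure; it is only one of several measures that an unsupervised analysis might use.

```python
import math

# Hypothetical toy data (not from the paper):
# rows are genes, columns are samples (tissues or experimental conditions).
samples = ["tissue_A", "tissue_B", "tissue_C", "tissue_D"]
genes = ["gene1", "gene2", "gene3"]
matrix = [
    [1.0, 2.0, 3.0, 4.0],  # gene1
    [2.0, 4.0, 6.0, 8.0],  # gene2: same trend as gene1, at twice the level
    [4.0, 3.0, 2.0, 1.0],  # gene3: opposite trend
]


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (norm_x * norm_y)


def most_similar_pair(gene_names, rows):
    """Return (gene_a, gene_b, r) for the pair of rows with the highest correlation."""
    best = None
    for i in range(len(gene_names)):
        for j in range(i + 1, len(gene_names)):
            r = pearson(rows[i], rows[j])
            if best is None or r > best[2]:
                best = (gene_names[i], gene_names[j], r)
    return best


if __name__ == "__main__":
    gene_a, gene_b, r = most_similar_pair(genes, matrix)
    print(f"Most similar profiles: {gene_a} and {gene_b} (r = {r:.3f})")
```

In this toy matrix, gene1 and gene2 rise together across the samples while gene3 falls, so the first two are reported as the most similar pair. Grouping genes by profile similarity in this way is the basic step behind clustering-style unsupervised analyses; real data would also need normalization and handling of missing values, which this sketch omits.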

Publication types

  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Gene Expression Profiling / methods*
  • Genes / genetics
  • Genes / physiology
  • Humans
  • Neoplasms / classification
  • Neoplasms / genetics
  • Oligonucleotide Array Sequence Analysis / methods*
  • Phylogeny
  • Regulatory Sequences, Nucleic Acid / genetics
  • Statistics as Topic / methods