Dynamic covariation between gene expression and proteome characteristics

BMC Bioinformatics. 2005 Aug 30;6:215. doi: 10.1186/1471-2105-6-215.


Background: Cells react to changing intra- and extracellular signals by dynamically modulating complex biochemical networks. Cellular responses to extracellular signals lead to changes in gene and protein expression. Since the majority of genes encode proteins, we investigated possible correlations between protein parameters and gene expression patterns to identify proteome-wide characteristics indicative of trends common to expressed proteins.

Results: Numerous bioinformatics methods were used to filter and merge information regarding gene and protein annotations. A new statistical time point-oriented analysis was developed for the study of dynamic correlations in large time series data. The method was applied to investigate microarray datasets for different cell types, organisms and processes, including human B and T cell stimulation, Drosophila melanogaster life span, and Saccharomyces cerevisiae cell cycle.

Conclusion: We show that the properties of proteins synthesized correlate dynamically with the gene expression profile, indicating that not only is the actual identity and function of expressed proteins important for cellular responses but that several physicochemical and other protein properties correlate with gene expression as well. Gene expression correlates strongly with amino acid composition, composition- and sequence-derived variables, functional, structural, localization and gene ontology parameters. Thus, our results suggest that a dynamic relationship exists between proteome properties and gene expression in many biological systems, and therefore this relationship is fundamental to understanding cellular mechanisms in health and disease.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • B-Lymphocytes / physiology
  • Cell Cycle / genetics
  • Computational Biology / methods
  • Data Display
  • Drosophila melanogaster / genetics
  • Electronic Data Processing
  • Gene Expression Profiling / methods*
  • Gene Frequency
  • Humans
  • Information Storage and Retrieval / methods
  • Lymphocyte Activation / genetics
  • Markov Chains
  • Models, Biological
  • Oligonucleotide Array Sequence Analysis / methods*
  • Proteome / classification*
  • Saccharomyces cerevisiae / genetics
  • Sequence Analysis, Protein / methods
  • Signal Transduction / genetics
  • Software
  • T-Lymphocytes / physiology


  • Proteome