Systems-level analyses identify extensive coupling among gene expression machines

Mol Syst Biol. 2006;2:2006.0003. doi: 10.1038/msb4100045. Epub 2006 Jan 17.

Abstract

Here, we develop computational methods to assess and consolidate large, diverse protein interaction data sets, with the objective of identifying proteins involved in the coupling of multicomponent complexes within the yeast gene expression pathway. From among approximately 43 000 total interactions and 2100 proteins, our methods identify known structural complexes, such as the spliceosome and SAGA, and functional modules, such as the DEAD-box helicases, within the interaction network of proteins involved in gene expression. Our process identifies and ranks instances of three distinct, biologically motivated motifs, or patterns of coupling among distinct machineries involved in different subprocesses of gene expression. Our results confirm known coupling among transcription, RNA processing, and export, and predict further coupling with translation and nonsense-mediated decay. We systematically corroborate our analysis with two independent, comprehensive experimental data sets. The methods presented here may be generalized to other biological processes and organisms to generate principled, systems-level network models that provide experimentally testable hypotheses for coupling among biological machines.

MeSH terms

  • Cluster Analysis*
  • Computational Biology / methods*
  • Fungal Proteins / genetics
  • Fungal Proteins / physiology
  • Gene Expression Regulation, Fungal
  • Gene Expression*
  • Multiprotein Complexes
  • Yeasts / genetics*

Substances

  • Fungal Proteins
  • Multiprotein Complexes