Mass spectrometry and EST-database searching allows characterization of the multi-protein spliceosome complex

Nat Genet. 1998 Sep;20(1):46-50. doi: 10.1038/1700.


Many important cell mechanisms are carried out and regulated by multi-protein complexes, for example, transcription and RNA processing machinery, receptor complexes and cytoskeletal structures. Most of these complexes remain only partially characterized due to the difficulty of conventional protein analysis methods. The rapid expansion of DNA sequence databases now provides whole or partial gene sequences of model organisms, and recent advances in protein microcharacterization via mass spectrometry make it possible to link these DNA sequences to the proteins in functional complexes. This approach has been demonstrated in organisms whose genomes have been sequenced, such as budding yeast. Here we report the first characterization of an entire mammalian multi-protein complex using these methods. The machinery that removes introns from mRNA precursors, the spliceosome, is a large multi-protein complex. Approximately half of the components excised from a two-dimensional gel separation of the spliceosome were found in protein sequence databases. The remainder were identified and cloned by combining nanoelectrospray mass spectrometry with searches of public expressed sequence tag (EST) databases. Existing EST databases are thus already sufficiently complete to allow rapid characterization of large mammalian protein complexes via mass spectrometry.
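The core idea of linking a peptide sequence obtained by mass spectrometry to an EST can be illustrated with a minimal sketch. It is not the authors' software: the function names, the exact-substring matching, and the toy input format (a dictionary of EST identifiers to nucleotide strings) are all assumptions made for illustration. Real searches typically work with partial sequence tags and fragment masses and tolerate sequencing errors. The sketch translates each EST in all six reading frames using the standard genetic code and treats isoleucine and leucine as equivalent, since the two residues have the same mass.

```python
# Illustrative sketch only: exact peptide matching against six-frame
# translations of EST sequences. All names here are hypothetical.

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Yield (frame_label, protein) for frames +1..+3 and -1..-3."""
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            yield f"{strand}{offset + 1}", translate(s[offset:])


def normalize(peptide):
    """Treat I and L as the same residue (they are isobaric)."""
    return peptide.upper().replace("I", "L")


def find_peptide(peptide, ests):
    """Return [(est_id, frame_label)] where the peptide occurs in a translation."""
    query = normalize(peptide)
    hits = []
    for est_id, seq in ests.items():
        for frame, protein in six_frame_translations(seq.upper()):
            if query in normalize(protein):
                hits.append((est_id, frame))
    return hits
```

Calling `find_peptide(peptide, ests)` with a peptide string and a dictionary of EST sequences returns the identifiers and reading frames of matching ESTs, which in a real workflow would be the starting point for assembling and cloning the full-length cDNA.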

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • DNA, Complementary
  • Databases, Factual
  • Electrophoresis, Gel, Two-Dimensional
  • Gene Expression
  • Green Fluorescent Proteins
  • Humans
  • Luminescent Proteins / genetics
  • Luminescent Proteins / metabolism
  • Macromolecular Substances
  • Mass Spectrometry / methods*
  • Molecular Sequence Data
  • Proteins / genetics*
  • Proteins / isolation & purification
  • Proteins / metabolism
  • Recombinant Proteins / genetics
  • Recombinant Proteins / metabolism
  • Sequence Homology
  • Spliceosomes / genetics
  • Spliceosomes / metabolism*


Substances

  • DNA, Complementary
  • Luminescent Proteins
  • Macromolecular Substances
  • Proteins
  • Recombinant Proteins
  • Green Fluorescent Proteins

Associated data

  • GENBANK/AF081788
  • GENBANK/AF083383
  • GENBANK/AF083384
  • GENBANK/AF083385