Molecular scanner experiment with human plasma: improving protein identification by using intensity distributions of matching peptide masses

Proteomics. 2002 Oct;2(10):1413-25. doi: 10.1002/1615-9861(200210)2:10<1413::AID-PROT1413>3.0.CO;2-P.

Abstract

The development of high throughput utilities to identify proteins is a major challenge in present research in the field of proteomics. One such utility, the molecular scanner, uses proteins separated by two-dimensional polyacrylamide gel electrophoresis that are digested in the gel and during transfer onto a collecting membrane. After adding a matrix, the membrane is inserted into a matrix-assisted laser desorption/ionization-time of flight mass spectrometer and a peptide mass fingerprint (PMF) is measured for every scanned site. Since the spacing between scanned sites is much smaller than the size of the most abundant protein spots, there is a certain redundancy in the data that was used in an earlier experiment with Escherichia coli [1] to improve mass calibration and PMF identification results. It was observed that the signal intensity of a peptide mass as a function of the position on the membrane showed similar patterns if peptides stemmed from the same protein. Taking account of these similarities a clustering algorithm was used to find lists of experimental masses with similar intensity distributions, which provided clearer identification of the corresponding proteins. Here, these methods are applied to a human plasma scan, where proteins were highly modified and less separated. The presence of very abundant proteins like albumin and immunoglobulins added another difficulty. The calibration of the initial PMFs was not satisfactory and masses had to be recalibrated. After discarding chemical noise, the membrane was partitioned into regions and for each region protein identification was carried out separately. A new scoring method was used, where the PMF score was multiplied by a factor that measures the similarity of matching peptides. This method proved to be more robust than the method developed in [1] if the region where a protein was found had an extended, nonspherical shape and strong overlap with regions of other proteins. Many proteins annotated on the SWISS-2D PAGE human plasma master gel could be clearly identified and many interesting properties were observed.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biotechnology / trends
  • Calibration
  • Databases as Topic
  • Electrophoresis, Gel, Two-Dimensional
  • Escherichia coli / metabolism
  • Humans
  • Hydrogen-Ion Concentration
  • Models, Statistical
  • Peptides / chemistry*
  • Plasma / metabolism*
  • Proteome*
  • Software
  • Statistics as Topic
  • Trypsin / pharmacology

Substances

  • Peptides
  • Proteome
  • Trypsin