Use of mass spectrometric molecular weight information to identify proteins in sequence databases

M Mann; P Højrup; P Roepstorff

doi:10.1002/bms.1200220605

Biol Mass Spectrom. 1993 Jun;22(6):338-45. doi: 10.1002/bms.1200220605.

Authors

M Mann¹, P Højrup, P Roepstorff

Affiliation

¹ Department of Molecular Biology, Odense University, Denmark.

PMID: 8329463

Abstract

During the last decade, new ionization techniques have made it possible to measure the molecular weight of many intact proteins by mass spectrometry, and they have made it much easier to obtain a mass spectrometric peptide map of a protein. At the same time, advances in protein and DNA sequencing technology are resulting in an exponential increase in the number of sequences deposited in databases. Here we investigate the possibility of using mass spectrometric data to identify proteins in databases. Searching a database by total molecular weight is found to be an easy and sometimes sufficient approach. For more specificity, and for error tolerance in both the mass spectrometric data and the database information, we search by a partial mass spectrometric peptide map of the protein. In general, just four to six proteolytic peptides measured with a mass accuracy between 0.1% and 0.01% allow a useful search of databases such as the Protein Identification Resource (PIR). As the size of DNA and protein sequence databases grows, protein identification by partial mass spectrometric peptide maps should become increasingly powerful and may become a general method to identify and characterize proteins.
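
The two search strategies described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' software. The function names, the toy database, the simple trypsin rule (cleave after K or R unless followed by P), the use of average residue masses, and the ranking by number of matched peptides are all assumptions made for illustration. The abstract specifies only that proteolytic peptides are matched at a mass accuracy between 0.1% and 0.01%.

```python
# Average residue masses in Da (standard table values); one water is added per chain.
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153


def chain_mass(sequence):
    """Average mass of an unmodified linear peptide or protein."""
    return sum(AVG_RESIDUE_MASS[aa] for aa in sequence) + WATER


def tryptic_digest(sequence):
    """Assumed rule: cleave after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])
    return peptides


def within(measured, theoretical, rel_tol):
    """True if the measured mass agrees with the theoretical one within rel_tol."""
    return abs(measured - theoretical) <= rel_tol * theoretical


def search_by_total_mass(database, measured_mass, rel_tol=0.001):
    """Names of entries whose intact mass matches (rel_tol=0.001 means 0.1%)."""
    return [name for name, seq in database.items()
            if within(measured_mass, chain_mass(seq), rel_tol)]


def search_by_peptide_map(database, measured_peptides, rel_tol=0.001):
    """Rank entries by how many measured peptide masses they can explain."""
    hits = []
    for name, seq in database.items():
        theoretical = [chain_mass(p) for p in tryptic_digest(seq)]
        matched = sum(
            any(within(m, t, rel_tol) for t in theoretical)
            for m in measured_peptides
        )
        hits.append((name, matched))
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical sequences, not taken from PIR or any real database.
    toy_db = {
        "toy_A": "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSR",
        "toy_B": "GGGGGGKGGGGGGGGR",
    }
    # Simulate five measured peptides with a 0.02% systematic mass error.
    measured = [chain_mass(p) * 1.0002 for p in tryptic_digest(toy_db["toy_A"])[1:6]]
    print(search_by_peptide_map(toy_db, measured))
```

Because the ranking counts matches rather than requiring every peptide to match, a partial map can still retrieve an entry when some measured masses are missing or when the database sequence contains errors, which corresponds to the error tolerance the abstract mentions.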

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acid Sequence*
Databases, Factual*
Mass Spectrometry
Molecular Sequence Data
Molecular Weight
Peptide Mapping
Proteins / analysis*
Software

Substances

Proteins