Computational prediction of proteotypic peptides for quantitative proteomics

Nat Biotechnol. 2007 Jan;25(1):125-31. doi: 10.1038/nbt1275. Epub 2006 Dec 31.


Mass spectrometry-based quantitative proteomics has become an important component of biological and clinical research. Although such analyses typically assume that a protein's peptide fragments are observed with equal likelihood, only a few so-called 'proteotypic' peptides are repeatedly and consistently identified for any given protein present in a mixture. Using >600,000 peptide identifications generated by four proteomic platforms, we empirically identified >16,000 proteotypic peptides for 4,030 distinct yeast proteins. Characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Possible applications of proteotypic peptides include validation of protein identifications, absolute quantification of proteins, annotation of coding sequences in genomes, and characterization of the physical principles governing key elements of mass spectrometric workflows (e.g., digestion, chromatography, ionization and fragmentation).

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Gene Expression Profiling / methods*
  • Mass Spectrometry / methods*
  • Peptide Mapping / methods*
  • Peptides / analysis
  • Peptides / chemistry*
  • Proteome / analysis
  • Proteome / chemistry*
  • Sequence Analysis, Protein / methods*


  • Peptides
  • Proteome