Definition and characterization of a "trypsinosome" from specific peptide characteristics by nano-HPLC-MS/MS and in silico analysis of complex protein mixtures

J Proteome Res. 2004 Nov-Dec;3(6):1138-48. doi: 10.1021/pr049909x.


Although HPLC-ESI-MS/MS is rapidly becoming an indispensable tool for the analysis of peptides in complex mixtures, the sequence coverage it affords is often quite poor. Low protein expression resulting in peptide signal intensities that fall below the limit of detection of the MS system in combination with differences in peptide ionization efficiency plays a significant role in this. A second important factor stems from differences in physicochemical properties of each peptide and how these properties relate to chromatographic retention and ultimate detection. To identify and understand those properties, we compared data from experimentally identified peptides with data from peptides predicted by in silico digest of all corresponding proteins in the experimental set. Three different complex protein mixtures extracted were used to define a training set to evaluate the amino acid retention coefficients based on linear regression analysis. The retention coefficients were also compared with other previous hydrophobic and retention scale. From this, we have constructed an empirical model that can be readily used to predict peptides that are likely to be observed on our HPLC-ESI-MS/MS system based on their physicochemical properties. Finally, we demonstrated that in silico prediction of peptides and their retention coefficients can be used to generate an inclusion list for a targeted mass spectrometric identification of low abundance proteins in complex protein samples. This approach is based on experimentally derived data to calibrate the method and therefore may theoretically be applied to any HPLC-MS/MS system on which data are being generated.

MeSH terms

  • Animals
  • Breast Neoplasms / pathology
  • Cell Extracts
  • Cell Line, Tumor
  • Chromatography, High Pressure Liquid / methods
  • Computational Biology / methods*
  • Expert Systems
  • Humans
  • K562 Cells
  • Mass Spectrometry / methods*
  • Nanotechnology / methods
  • Peptide Fragments / analysis*
  • Proteins / analysis
  • Proteomics / methods*
  • Trypsin*


  • Cell Extracts
  • Peptide Fragments
  • Proteins
  • Trypsin