Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry

Anal Chem. 2007 Feb 15;79(4):1393-400. doi: 10.1021/ac0617013. Epub 2007 Jan 23.

Abstract

A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks--"lookup peaks"--that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search tools--SEQUEST, Mascot, and X!Tandem--on both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.

MeSH terms

  • Animals
  • Blood Proteins / analysis*
  • Databases, Protein
  • Humans
  • Mice
  • Peptides / analysis
  • Recombinant Proteins / analysis
  • Sensitivity and Specificity
  • Software Validation
  • Tandem Mass Spectrometry / methods*

Substances

  • Blood Proteins
  • Peptides
  • Recombinant Proteins