Qscore: an algorithm for evaluating SEQUEST database search results

Roger E Moore; Mary K Young; Terry D Lee

doi:10.1016/S1044-0305(02)00352-5

J Am Soc Mass Spectrom. 2002 Apr;13(4):378-86. doi: 10.1016/S1044-0305(02)00352-5.

Authors

Roger E Moore¹, Mary K Young, Terry D Lee

Affiliation

¹ Division of Immunology, Beckman Research Institute of the City of Hope, Duarte, California 91010, USA.

PMID: 11951976
DOI: 10.1016/S1044-0305(02)00352-5

Abstract

A scoring procedure is described for measuring the quality of the results for protein identifications obtained from spectral matching of MS/MS data using the Sequest database search program. The scoring system is essentially probabilistic and operates by estimating the probability that a protein identification has come about by chance. The probability is based on the number of identified peptides from the protein, the total number of identified peptides, and the fraction of distinct tryptic peptides from the database that are present in the identified protein. The score is not strictly a probability, as it also incorporates information about the quality of the individual peptide matches. The result of using Qscore on a large test set of data was similar to that achieved using approaches that validate individual spectral matches, with only a narrow overlap in scores between identified proteins and false positive matches. In direct comparison with a published method of evaluating Sequest results, Qscore was able to identify an equivalent number of proteins without any identifiable false positive assignments. Qscore greatly reduces the number of Sequest protein identifications that have to be validated manually.
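
The abstract names the three inputs to the chance probability but does not give the formula. As a rough illustration only, the sketch below assumes a binomial model: each of the n identified peptides is treated as an independent draw that lands in the protein with probability f, where f is the fraction of distinct tryptic peptides in the database that belong to that protein. The function name chance_probability, its parameter names, and the binomial form itself are assumptions made for this example, not the published Qscore definition, and the peptide-match quality term mentioned in the abstract is not modelled.

```python
from math import comb


def chance_probability(k: int, n: int, f: float) -> float:
    """Upper-tail binomial probability P(X >= k) for X ~ Binomial(n, f).

    Hypothetical illustration, not the published Qscore formula.
    k: number of identified peptides assigned to the protein
    n: total number of identified peptides in the search
    f: fraction of distinct tryptic peptides in the database
       that belong to the protein
    """
    if not 0 <= k <= n:
        raise ValueError("k must satisfy 0 <= k <= n")
    if not 0.0 <= f <= 1.0:
        raise ValueError("f must lie in [0, 1]")
    # Sum the binomial mass for k, k+1, ..., n peptides landing in the protein.
    return sum(comb(n, i) * f**i * (1.0 - f) ** (n - i) for i in range(k, n + 1))


if __name__ == "__main__":
    # Made-up numbers: 3 of 100 identified peptides map to a protein that
    # accounts for 0.1% of the distinct tryptic peptides in the database.
    print(chance_probability(3, 100, 0.001))
```

Under this assumed model, a protein supported by several peptides while contributing only a small fraction of the database yields a small chance probability, which matches the abstract's description of the score in qualitative terms only.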

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms*
Amino Acid Sequence
Chromatography, High Pressure Liquid
Databases, Factual*
Mass Spectrometry
Molecular Sequence Data
Peptides / chemistry*
Proteins / chemistry*
Software
Spectrometry, Mass, Electrospray Ionization

Substances

Peptides
Proteins
