An adaptive classification model for peptide identification

Xijun Liang; Zhonghang Xia; Ling Jian; Xinnan Niu; Andrew Link

doi:10.1186/1471-2164-16-S11-S1

An adaptive classification model for peptide identification

BMC Genomics. 2015;16 Suppl 11(Suppl 11):S1. doi: 10.1186/1471-2164-16-S11-S1. Epub 2015 Nov 10.

Authors

Xijun Liang, Zhonghang Xia, Ling Jian, Xinnan Niu, Andrew Link

Abstract

Background: Peptide sequence assignment is the central task in protein identification with MS/MS-based strategies. Although a number of post-database search algorithms for filtering target peptide spectrum matches (PSMs) have been developed, the discrepancy among the output PSMs is usually significant, remaining a few disputable PSMs. Current studies show that a number of target PSMs which are close to decoy PSMs can hardly be separated from those decoys by only using the discrimination function.

Results: In this paper, we assign each target PSM a weight showing its possibility of being correct. We employ a SVM-based learning model to search the optimal weight for each target PSM and develop a new score system, CRanker, to rank all target PSMs. Due to the large PSM datasets generated in routine database searches, we use the Cholesky factorization technique for storing a kernel matrix to reduce the memory requirement.

Conclusions: Compared with PeptideProphet and Percolator, CRanker has identified more PSMs under similar false discover rates over different datasets. CRanker has shown consistent performance on different test sets, validated the reasonability the proposed model.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Computational Biology / methods*
Humans
Peptides / analysis*
Peptides / chemistry
Support Vector Machine*

Substances

Peptides

Grants and funding

GM064779/GM/NIGMS NIH HHS/United States