ProLuCID: An improved SEQUEST-like algorithm with enhanced sensitivity and specificity
- PMID: 26171723
- PMCID: PMC4630125
- DOI: 10.1016/j.jprot.2015.07.001
ProLuCID: An improved SEQUEST-like algorithm with enhanced sensitivity and specificity
Abstract
ProLuCID, a new algorithm for peptide identification using tandem mass spectrometry and protein sequence databases has been developed. This algorithm uses a three tier scoring scheme. First, a binomial probability is used as a preliminary scoring scheme to select candidate peptides. The binomial probability scores generated by ProLuCID minimize molecular weight bias and are independent of database size. A modified cross-correlation score is calculated for each candidate peptide identified by the binomial probability. This cross-correlation scoring function models the isotopic distributions of fragment ions of candidate peptides which ultimately results in higher sensitivity and specificity than that obtained with the SEQUEST XCorr. Finally, ProLuCID uses the distribution of XCorr values for all of the selected candidate peptides to compute a Z score for the peptide hit with the highest XCorr. The ProLuCID Z score combines the discriminative power of XCorr and DeltaCN, the standard parameters for assessing the quality of the peptide identification using SEQUEST, and displays significant improvement in specificity over ProLuCID XCorr alone. ProLuCID is also able to take advantage of high resolution MS/MS spectra leading to further improvements in specificity when compared to low resolution tandem MS data. A comparison of filtered data searched with SEQUEST and ProLuCID using the same false discovery rate as estimated by a target-decoy database strategy, shows that ProLuCID was able to identify as many as 25% more proteins than SEQUEST. ProLuCID is implemented in Java and can be easily installed on a single computer or a computer cluster. This article is part of a Special Issue entitled: Computational Proteomics.
Keywords: Bioinformatics; Identification; Mass spectrometry; ProLuCID; Proteomics; Sequest.
Copyright © 2015. Published by Elsevier B.V.
Figures
Similar articles
-
A fast SEQUEST cross correlation algorithm.J Proteome Res. 2008 Oct;7(10):4598-602. doi: 10.1021/pr800420s. Epub 2008 Sep 6. J Proteome Res. 2008. PMID: 18774840
-
Probability-based validation of protein identifications using a modified SEQUEST algorithm.Anal Chem. 2002 Nov 1;74(21):5593-9. doi: 10.1021/ac025826t. Anal Chem. 2002. PMID: 12433093
-
Optimization of filtering criterion for SEQUEST database searching to improve proteome coverage in shotgun proteomics.BMC Bioinformatics. 2007 Aug 31;8:323. doi: 10.1186/1471-2105-8-323. BMC Bioinformatics. 2007. PMID: 17761002 Free PMC article.
-
Improving protein identification from tandem mass spectrometry data by one-step methods and integrating data from other platforms.Brief Bioinform. 2016 Mar;17(2):262-9. doi: 10.1093/bib/bbv043. Epub 2015 Jul 3. Brief Bioinform. 2016. PMID: 26141827 Free PMC article. Review.
-
Protein identification using Sorcerer 2 and SEQUEST.Curr Protoc Bioinformatics. 2009 Dec;Chapter 13:Unit 13.3. doi: 10.1002/0471250953.bi1303s28. Curr Protoc Bioinformatics. 2009. PMID: 19957274 Review.
Cited by
-
Parallel Murine and Human Plaque Proteomics Reveals Pathways of Plaque Rupture.Circ Res. 2020 Sep 25;127(8):997-1022. doi: 10.1161/CIRCRESAHA.120.317295. Epub 2020 Jul 30. Circ Res. 2020. PMID: 32762496 Free PMC article.
-
TurboID Identification of Evolutionarily Divergent Components of the Nuclear Pore Complex in the Malaria Model Plasmodium berghei.mBio. 2022 Oct 26;13(5):e0181522. doi: 10.1128/mbio.01815-22. Epub 2022 Aug 30. mBio. 2022. PMID: 36040030 Free PMC article.
-
Aberrant astrocyte protein secretion contributes to altered neuronal development in multiple models of neurodevelopmental disorders.Nat Neurosci. 2022 Sep;25(9):1163-1178. doi: 10.1038/s41593-022-01150-1. Epub 2022 Aug 30. Nat Neurosci. 2022. PMID: 36042312 Free PMC article.
-
Implicating the red body of Nannochloropsis in forming the recalcitrant cell wall polymer algaenan.Nat Commun. 2024 Jun 27;15(1):5456. doi: 10.1038/s41467-024-49277-y. Nat Commun. 2024. PMID: 38937455 Free PMC article.
-
Across intra-mammalian stages of the liver f luke Fasciola hepatica: a proteomic study.Sci Rep. 2016 Sep 7;6:32796. doi: 10.1038/srep32796. Sci Rep. 2016. PMID: 27600774 Free PMC article.
References
-
- Link AJ, et al. Direct analysis of protein complexes using mass spectrometry. Nat Biotechnol. 1999;17:676–682. doi:10.1038/10890. - PubMed
-
- Washburn MP, Wolters D, Yates JR., 3rd Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat Biotechnol. 2001;19:242–247. - PubMed
-
- Nesvizhskii AI. Protein identification by tandem mass spectrometry and sequence database searching. Methods Mol Biol. 2006;367:87–120. - PubMed
-
- Olsen JV, et al. Parts per million mass accuracy on an Orbitrap mass spectrometer via lock mass injection into a C-trap. Mol Cell Proteomics. 2005;4:2010–2021. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
