ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects

J Proteomics. 2015 Nov 3;129:25-32. doi: 10.1016/j.jprot.2015.07.006. Epub 2015 Jul 18.

Abstract

Shotgun proteomics generates valuable information from large-scale and target protein characterizations, including protein expression, protein quantification, protein post-translational modifications (PTMs), protein localization, and protein-protein interactions. Typically, peptides derived from proteolytic digestion, rather than intact proteins, are analyzed by mass spectrometers because peptides are more readily separated, ionized, and fragmented. The amino acid sequences of peptides can be interpreted by matching the observed tandem mass spectra to theoretical spectra derived from a protein sequence database. Identified peptides serve as surrogates for their proteins and are often used to establish which proteins were present in the original mixture and to quantify protein abundance. Two major issues arise when assigning peptides to their originating proteins. The first is maintaining a desired false discovery rate (FDR) when comparing or combining multiple large datasets generated by shotgun analysis; the second is properly assigning peptides to proteins when homologous proteins are present in the database. Herein we demonstrate a new computational tool, ProteinInferencer, which can be used for protein inference with both small- and large-scale data sets to produce a well-controlled protein FDR. In addition, ProteinInferencer introduces confidence scoring for individual proteins, which allows each protein identification to be evaluated. This article is part of a Special Issue entitled: Computational Proteomics.
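The first issue, FDR inflation when datasets are combined, can be illustrated with a minimal sketch. It assumes a target-decoy estimate of protein FDR (decoy hits divided by target hits), a common convention that the abstract does not itself name. The `decoy_fdr` function, the `DECOY_` prefix, and the protein identifiers are hypothetical and are not taken from ProteinInferencer.

```python
def decoy_fdr(protein_ids, decoy_prefix="DECOY_"):
    """Estimate protein FDR as (# decoy hits) / (# target hits).

    Assumption: decoy entries are marked by a name prefix. This is an
    illustrative convention, not ProteinInferencer's actual method.
    """
    decoys = sum(1 for p in protein_ids if p.startswith(decoy_prefix))
    targets = len(protein_ids) - decoys
    return decoys / targets if targets else 0.0


# Two hypothetical experiments, each filtered separately.
# True proteins tend to recur across experiments; false hits do not.
exp_a = {"P1", "P2", "P3", "P4", "DECOY_X1"}
exp_b = {"P1", "P2", "P3", "P5", "DECOY_X2"}

fdr_a = decoy_fdr(exp_a)                # 1 decoy / 4 targets
fdr_b = decoy_fdr(exp_b)                # 1 decoy / 4 targets
fdr_combined = decoy_fdr(exp_a | exp_b)  # 2 decoys / 5 targets

print(fdr_a, fdr_b, fdr_combined)
```

In this toy case the union of the two lists has a higher estimated FDR than either list alone, because the shared target proteins are counted once while the distinct decoy hits accumulate. This is why the FDR needs to be re-controlled on the combined data rather than assumed from the per-experiment filters.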

Keywords: Database search; False discovery rate (FDR); Mass spectrometry; Peptide-spectrum match (PSM); Protein inference; Proteomics.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Mass Spectrometry / methods
  • Molecular Sequence Data
  • Peptide Mapping / methods*
  • Proteome / chemistry*
  • Proteomics / methods*
  • Sequence Analysis, Protein / methods*
  • Software*

Substances

  • Proteome