Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2014 Jun 6;13(6):2715-23.
doi: 10.1021/pr500194t. Epub 2014 May 12.

Integrating Genomic, Transcriptomic, and Interactome Data to Improve Peptide and Protein Identification in Shotgun Proteomics

Affiliations
Free PMC article
Review

Integrating Genomic, Transcriptomic, and Interactome Data to Improve Peptide and Protein Identification in Shotgun Proteomics

Xiaojing Wang et al. J Proteome Res. .
Free PMC article

Abstract

Mass spectrometry (MS)-based shotgun proteomics is an effective technology for global proteome profiling. The ultimate goal is to assign tandem MS spectra to peptides and subsequently infer proteins and their abundance. In addition to database searching and protein assembly algorithms, computational approaches have been developed to integrate genomic, transcriptomic, and interactome information to improve peptide and protein identification. Earlier efforts focus primarily on making databases more comprehensive using publicly available genomic and transcriptomic data. More recently, with the increasing affordability of the Next Generation Sequencing (NGS) technologies, personalized protein databases derived from sample-specific genomic and transcriptomic data have emerged as an attractive strategy. In addition, incorporating interactome data not only improves protein identification but also puts identified proteins into their functional context and thus facilitates data interpretation. In this paper, we survey the major integrative bioinformatics approaches that have been developed during the past decade and discuss their merits and demerits.

Figures

Figure 1
Figure 1
A typical workflow of shotgun proteomics.
Figure 2
Figure 2
Orthogonal data assisted proteomics studies.
Figure 3
Figure 3
Methods for increasing database completeness using publicly available genomic and transcriptomic data.

Similar articles

See all similar articles

Cited by 8 articles

See all "Cited by" articles

References

    1. Nagaraj N.; Wisniewski J. R.; Geiger T.; Cox J.; Kircher M.; Kelso J.; Paabo S.; Mann M. Deep proteome and transcriptome mapping of a human cancer cell line. Mol. Syst. Biol. 2011, 7, 548. - PMC - PubMed
    1. Perkins D. N.; Pappin D. J.; Creasy D. M.; Cottrell J. S. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 1999, 20183551–35567. - PubMed
    1. Craig R.; Beavis R. C. TANDEM: matching proteins with tandem mass spectra. Bioinformatics 2004, 2091466–1467. - PubMed
    1. Tabb D. L.; Fernando C. G.; Chambers M. C. MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J. Proteome Res. 2007, 62654–661. - PMC - PubMed
    1. Ma Z. Q.; Dasari S.; Chambers M. C.; Litton M. D.; Sobecki S. M.; Zimmerman L. J.; Halvey P. J.; Schilling B.; Drake P. M.; Gibson B. W.; Tabb D. L. IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. J. Proteome Res. 2009, 883872–3881. - PMC - PubMed

Publication types

Feedback