Integrating genomic, transcriptomic, and interactome data to improve Peptide and protein identification in shotgun proteomics

J Proteome Res. 2014 Jun 6;13(6):2715-23. doi: 10.1021/pr500194t. Epub 2014 May 12.


Mass spectrometry (MS)-based shotgun proteomics is an effective technology for global proteome profiling. The ultimate goal is to assign tandem MS spectra to peptides and subsequently infer proteins and their abundance. In addition to database searching and protein assembly algorithms, computational approaches have been developed to integrate genomic, transcriptomic, and interactome information to improve peptide and protein identification. Earlier efforts focus primarily on making databases more comprehensive using publicly available genomic and transcriptomic data. More recently, with the increasing affordability of the Next Generation Sequencing (NGS) technologies, personalized protein databases derived from sample-specific genomic and transcriptomic data have emerged as an attractive strategy. In addition, incorporating interactome data not only improves protein identification but also puts identified proteins into their functional context and thus facilitates data interpretation. In this paper, we survey the major integrative bioinformatics approaches that have been developed during the past decade and discuss their merits and demerits.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Animals
  • Gene Expression Profiling
  • Humans
  • Peptide Mapping
  • Protein Interaction Mapping
  • Proteome / genetics
  • Proteome / metabolism*
  • Proteomics
  • Tandem Mass Spectrometry


  • Proteome