Proteogenomics: concepts, applications and computational strategies
- PMID: 25357241
- PMCID: PMC4392723
- DOI: 10.1038/nmeth.3144
Abstract
Proteogenomics is an area of research at the interface of proteomics and genomics. In this approach, customized protein sequence databases generated using genomic and transcriptomic information are used to help identify novel peptides (not present in reference protein sequence databases) from mass spectrometry-based proteomic data; in turn, the proteomic data can be used to provide protein-level evidence of gene expression and to help refine gene models. In recent years, owing to the emergence of new sequencing technologies such as RNA-seq and dramatic improvements in the depth and throughput of mass spectrometry-based proteomics, the pace of proteogenomic research has greatly accelerated. Here I review the current state of proteogenomic methods and applications, including computational strategies for building and using customized protein sequence databases. I also draw attention to the challenge of false positive identifications in proteogenomics and provide guidelines for analyzing the data and reporting the results of proteogenomic studies.
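To make the idea of a customized database concrete, the following is a minimal sketch, not the review's own method. It assumes a single nucleotide sequence as input, performs a six-frame translation with the standard genetic code, digests each open stretch in silico with a simplified trypsin rule (cleave after K or R, no missed cleavages, proline rule ignored), and keeps peptides that do not occur in a reference protein list. All function names and the minimum peptide length are hypothetical choices made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate codon by codon; unknown codons (e.g. containing N) become X."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frame_translations(dna):
    """Translate three forward and three reverse-complement reading frames."""
    dna = dna.upper()
    frames = []
    for strand in (dna, reverse_complement(dna)):
        for offset in range(3):
            frames.append(translate(strand[offset:]))
    return frames


def tryptic_peptides(protein, min_len=6):
    """Simplified tryptic digest: cleave after K or R, no missed cleavages."""
    peptides, start = [], 0
    for i, aa in enumerate(protein):
        if aa in "KR":
            peptides.append(protein[start:i + 1])
            start = i + 1
    peptides.append(protein[start:])  # trailing fragment, may be empty
    return [p for p in peptides if len(p) >= min_len]


def candidate_novel_peptides(dna, reference_proteins, min_len=6):
    """Peptides from the six-frame translation absent from every reference protein."""
    novel = set()
    for frame in six_frame_translations(dna):
        for stretch in frame.split("*"):  # split at stop codons
            for pep in tryptic_peptides(stretch, min_len):
                if "X" in pep:
                    continue  # skip peptides spanning ambiguous codons
                if not any(pep in ref for ref in reference_proteins):
                    novel.add(pep)
    return novel
```

In practice such candidate peptides are only search-space entries; whether any of them is reported as novel depends on matching them to spectra and on stringent false positive control, which this sketch does not attempt.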
Conflict of interest statement
The author declares no competing financial interests.
