Spectral Clustering Improves Label-Free Quantification of Low-Abundant Proteins

Johannes Griss; Florian Stanek; Otto Hudecz; Gerhard Dürnberger; Yasset Perez-Riverol; Juan Antonio Vizcaíno; Karl Mechtler

doi:10.1021/acs.jproteome.8b00377

J Proteome Res. 2019 Apr 5;18(4):1477-1485. doi: 10.1021/acs.jproteome.8b00377. Epub 2019 Mar 22.

Authors

Johannes Griss¹ ², Florian Stanek³ ⁴, Otto Hudecz³ ⁴, Gerhard Dürnberger³ ⁴ ⁵, Yasset Perez-Riverol², Juan Antonio Vizcaíno², Karl Mechtler³ ⁴

Affiliations

¹ Department of Dermatology, Medical University of Vienna, Währinger Gürtel 18-20, 1090 Vienna, Austria.
² European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, CB10 1SD Hinxton, Cambridge, United Kingdom.
³ Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Campus-Vienna-Biocenter 1, 1030 Vienna, Austria.
⁴ Institute of Molecular Biotechnology of the Austrian Academy of Sciences (IMBA), Vienna Biocenter (VBC), Dr. Bohr-Gasse 3, 1030 Vienna, Austria.
⁵ Gregor Mendel Institute of Molecular Plant Biology (GMI), Vienna Biocenter (VBC), Dr. Bohr-Gasse 3, 1030 Vienna, Austria.

Abstract

Label-free quantification has become common practice in many mass spectrometry-based proteomics experiments. In recent years, we and others have shown that spectral clustering can considerably improve the analysis of (primarily large-scale) proteomics data sets. Here we show that spectral clustering can be used to infer additional peptide-spectrum matches and improve the quality of label-free quantitative proteomics data even in data sets containing only tens of MS runs. We analyzed four well-known public benchmark data sets that represent different experimental settings using spectral counting and peak-intensity-based label-free quantification. In both approaches, the peptide-spectrum matches additionally inferred by our spectra-cluster algorithm improved the detectability of low-abundant proteins while increasing the accuracy of the derived quantitative data, without increasing the data sets' noise. Additionally, we developed a Proteome Discoverer node for our spectra-cluster algorithm, which allows anyone to rebuild our proposed pipeline using the free version of Proteome Discoverer.
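The idea of inferring additional peptide-spectrum matches from clusters and feeding them into spectral counting can be illustrated with a minimal sketch. This is not the spectra-cluster algorithm or the Proteome Discoverer node: the clustering is assumed to be already done, and the function names, the `min_ratio` agreement threshold, and the toy spectra, peptides, and proteins are all hypothetical.

```python
from collections import Counter, defaultdict


def infer_psms(cluster_of, identified, min_ratio=0.7):
    """Assign the dominant peptide of each cluster to its unidentified spectra.

    cluster_of: dict mapping spectrum id -> cluster id (clustering done elsewhere)
    identified: dict mapping spectrum id -> peptide from a database search
    min_ratio:  hypothetical threshold; fraction of the identified spectra in a
                cluster that must agree on one peptide before it is transferred
    """
    members = defaultdict(list)
    for spectrum, cluster in cluster_of.items():
        members[cluster].append(spectrum)

    inferred = {}
    for spectra in members.values():
        peptides = Counter(identified[s] for s in spectra if s in identified)
        if not peptides:
            continue  # cluster has no identified spectrum, nothing to transfer
        peptide, count = peptides.most_common(1)[0]
        if count / sum(peptides.values()) < min_ratio:
            continue  # identifications disagree too much, skip this cluster
        for s in spectra:
            if s not in identified:
                inferred[s] = peptide
    return inferred


def spectral_counts(psms, protein_of):
    """Count peptide-spectrum matches per protein (simple spectral counting)."""
    counts = Counter()
    for peptide in psms.values():
        counts[protein_of[peptide]] += 1
    return counts


# Toy data (hypothetical): six spectra in three clusters, three identified.
cluster_of = {"s1": 1, "s2": 1, "s3": 1, "s4": 2, "s5": 2, "s6": 3}
identified = {"s1": "ELVISLIVESK", "s2": "ELVISLIVESK", "s4": "SAMPLER"}
protein_of = {"ELVISLIVESK": "P1", "SAMPLER": "P2"}

inferred = infer_psms(cluster_of, identified)
before = spectral_counts(identified, protein_of)
after = spectral_counts({**identified, **inferred}, protein_of)
print(dict(before), dict(after))
```

In this toy case the protein with a single identified spectrum gains a second count from its cluster, while the cluster without any identified spectrum contributes nothing.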

Keywords: IMP free nodes; Proteome Discoverer; Proteome Discoverer node; benchmarking study; bioinformatics; label-free quantification; mass spectrometry; proteomics; spectral clustering; spectral counting.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Cluster Analysis*
Databases, Protein
Humans
Mass Spectrometry / methods*
Proteome / analysis*
Proteomics / methods*

Substances

Proteome
