DeepSCP: utilizing deep learning to boost single-cell proteome coverage

Brief Bioinform. 2022 Jul 18;23(4):bbac214. doi: 10.1093/bib/bbac214.

Abstract

Multiplexed single-cell proteomes (SCPs) quantification by mass spectrometry greatly improves the SCP coverage. However, it still suffers from a low number of protein identifications and there is much room to boost proteins identification by computational methods. In this study, we present a novel framework DeepSCP, utilizing deep learning to boost SCP coverage. DeepSCP constructs a series of features of peptide-spectrum matches (PSMs) by predicting the retention time based on the multiple SCP sample sets and fragment ion intensities based on deep learning, and predicts PSM labels with an optimized-ensemble learning model. Evaluation of DeepSCP on public and in-house SCP datasets showed superior performances compared with other state-of-the-art methods. DeepSCP identified more confident peptides and proteins by controlling q-value at 0.01 using target-decoy competition method. As a convenient and low-cost computing framework, DeepSCP will help boost single-cell proteome identification and facilitate the future development and application of single-cell proteomics.

Keywords: LightGBM; deep learning; fragment ion intensity; peptide-spectrum matches; retention time; single-cell proteomics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Peptides / chemistry
  • Proteome* / metabolism
  • Proteomics / methods
  • Tandem Mass Spectrometry / methods

Substances

  • Peptides
  • Proteome