Relative Retention Time Estimation Improves N-Glycopeptide Identifications by LC-MS/MS

J Proteome Res. 2020 May 1;19(5):2113-2121. doi: 10.1021/acs.jproteome.0c00051. Epub 2020 Apr 10.

Abstract

Glycopeptides identified by tandem mass spectrometry rely on the identification of the peptide backbone sequence and the attached glycan(s) by the incomplete fragmentation of both moieties. This may lead to ambiguous identifications where multiple structures could explain the same spectrum equally well due to missing information in the mass spectrum or incorrect precursor mass determination. To date, approaches to solving these problems have been limited, and few inroads have been made to address these issues. We present a technique to address some of these challenges and demonstrate it on previously published data sets. We use a linear modeling approach to learn the influence of the glycan composition on the retention time of a glycopeptide and use these models to validate glycopeptides within the same experiment, detecting over 400 incorrect cases during the MS/MS search and correcting 75 cases that could not be identified based on mass alone. We make this technique available as a command line executable program, written in Python and C, freely available at https://github.com/mobiusklein/glycresoft in source form, with precompiled binaries for Windows.

Keywords: C18 chromatography; ambiguity; bioinformatics; chromatogram extraction; chromatogram scoring; glycoproteomics; metallic cation adduction; retention time; software.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Amino Acid Sequence
  • Chromatography, Liquid
  • Glycopeptides*
  • Polysaccharides
  • Tandem Mass Spectrometry*

Substances

  • Glycopeptides
  • Polysaccharides