The opportunity cost of automated glycopeptide analysis: case study profiling the SARS-CoV-2 S glycoprotein

Anal Bioanal Chem. 2021 Dec;413(29):7215-7227. doi: 10.1007/s00216-021-03621-z. Epub 2021 Aug 27.

Abstract

Glycosylation analysis of viral glycoproteins contributes significantly to vaccine design and development. Among other benefits, glycosylation analysis allows vaccine developers to assess the impact of construct design or producer cell line choices for vaccine production, and it is a key measure by which glycoproteins that are produced for use in vaccination can be compared to their native viral forms. Because many viral glycoproteins are multiply glycosylated, glycopeptide analysis is a preferable approach for mapping the glycans, yet the analysis of glycopeptide data can be cumbersome and requires the expertise of an experienced analyst. In recent years, a commercial software product, Byonic, has been implemented in several instances to facilitate glycopeptide analysis on viral glycoproteins and other glycoproteomics data sets, and the purpose of the study herein is to determine the strengths and limitations of using this software, particularly in cases relevant to vaccine development. The glycopeptides from a recombinantly expressed trimeric S glycoprotein of the SARS-CoV-2 virus were first analyzed using an expert-based analysis strategy; subsequently, analysis of the same data set was completed using Byonic. Careful assessment of instances where the two methods produced different results revealed that the glycopeptide assignments from Byonic contained more false positives than true positives, even when the data were assessed using a 1% false discovery rate. The work herein provides a roadmap for removing the spurious assignments that Byonic generates, and it provides an assessment of the opportunity cost of relying on automated assignments for glycopeptide data sets from viral glycoproteins.
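
The comparison described in the abstract, automated assignments checked against an expert-curated reference, can be illustrated with a minimal sketch. This is not the authors' workflow and does not use Byonic's output format: the function name `compare_assignments`, the dictionary layout (spectrum identifier mapped to an assigned glycopeptide label), and the toy labels are all hypothetical. The sketch also assumes the expert assignments can be treated as ground truth, so any automated assignment the expert set does not confirm is counted as disputed.

```python
def compare_assignments(expert, automated):
    """Split automated assignments by agreement with an expert reference.

    Both arguments map a spectrum identifier to a glycopeptide label.
    Assumption: the expert set is treated as ground truth, so an automated
    assignment that is absent from it, or differs from it, is "disputed".
    """
    confirmed = {s for s, label in automated.items() if expert.get(s) == label}
    disputed = set(automated) - confirmed
    # Fraction of automated assignments not confirmed by the expert reference.
    observed_fraction = len(disputed) / len(automated) if automated else 0.0
    return confirmed, disputed, observed_fraction


# Hypothetical toy data; placeholder labels, not results from the study.
expert = {"scan_1": "PEP_A+glycan_1", "scan_2": "PEP_B+glycan_2"}
automated = {
    "scan_1": "PEP_A+glycan_1",  # agrees with the expert assignment
    "scan_2": "PEP_B+glycan_3",  # same spectrum, different glycan assigned
    "scan_3": "PEP_C+glycan_4",  # spectrum the expert did not assign
}

confirmed, disputed, observed_fraction = compare_assignments(expert, automated)
```

The observed fraction computed this way is an empirical quantity measured against the reference, and it need not match the nominal false discovery rate reported by a search engine; that gap is the kind of discrepancy the abstract reports.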

Keywords: Glycopeptide; Glycoprotein; Mass spectrometry; SARS-CoV-2.

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Chromatography, Liquid / methods
  • Glycopeptides / metabolism*
  • Spike Glycoprotein, Coronavirus / chemistry
  • Spike Glycoprotein, Coronavirus / metabolism*
  • Tandem Mass Spectrometry / methods

Substances

  • Glycopeptides
  • Spike Glycoprotein, Coronavirus
  • spike protein, SARS-CoV-2