The genome of coronaviruses, including SARS-CoV-2, encodes for two proteases, a papain like (PLpro ) protease and the so-called main protease (Mpro ), a chymotrypsin-like cysteine protease, also named 3CLpro or non-structural protein 5 (nsp5). Mpro is activated by autoproteolysis and is the main protease responsible for cutting the viral polyprotein into functional units. Aside from this, it is described that Mpro proteases are also capable of processing host proteins, including those involved in the host innate immune response. To identify substrates of the three main proteases from SARS-CoV, SARS-CoV-2, and hCoV-NL63 coronviruses, an LC-MS based N-terminomics in vitro analysis is performed using recombinantly expressed proteases and lung epithelial and endothelial cell lysates as substrate pools. For SARS-CoV-2 Mpro , 445 cleavage events from more than 300 proteins are identified, while 151 and 331 Mpro derived cleavage events are identified for SARS-CoV and hCoV-NL63, respectively. These data enable to better understand the cleavage site specificity of the viral proteases and will help to identify novel substrates in vivo. All data are available via ProteomeXchange with identifier PXD021406.
Keywords: Covid19; LC-MS; isobaric labeling; protease substrates; terminomics.
© 2020 The Authors. Proteomics published by Wiley-VCH GmbH.