Characterization of SARS-CoV-2 Mutational Signatures from 1.5+ Million Raw Sequencing Samples

Viruses. 2022 Dec 20;15(1):7. doi: 10.3390/v15010007.

Abstract

We present a large-scale analysis of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) substitutions, considering 1,585,456 high-quality raw sequencing samples, aimed at investigating the existence and quantifying the effect of mutational processes causing mutations in SARS-CoV-2 genomes when interacting with the human host. As a result, we confirmed the presence of three well-differentiated mutational processes likely ruled by reactive oxygen species (ROS), apolipoprotein B editing complex (APOBEC), and adenosine deaminase acting on RNA (ADAR). We then evaluated the activity of these mutational processes in different continental groups, showing that some samples from Africa present a significantly higher number of substitutions, most likely due to higher APOBEC activity. We finally analyzed the activity of mutational processes across different SARS-CoV-2 variants, and we found a significantly lower number of mutations attributable to APOBEC activity in samples assigned to the Omicron variant.

Keywords: APOBEC; SARS-CoV-2; mutational signatures; variants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Africa
  • COVID-19*
  • Humans
  • Mutation
  • SARS-CoV-2* / genetics

Supplementary concepts

  • SARS-CoV-2 variants

Grants and funding

This research was funded by the Italian Ministry of University and Research (MUR) under the Department of Excellence project PREMIA (PREcision MedIcine Approach: bringing biomarker research to the clinic) grant and by the University of Milano-Bicocca with a Bicocca 2020 Starting Grant.