SARS-CoV-2 mutation 614G creates an elastase cleavage site enhancing its spread in high AAT-deficient regions

Infect Genet Evol. 2021 Jun:90:104760. doi: 10.1016/j.meegid.2021.104760. Epub 2021 Feb 5.

Abstract

SARS-CoV-2 was first reported from China. Within three months, it evolved to 10 additional subtypes. Two evolved subtypes (A2 and A2a) carry a non-synonymous Spike protein mutation (D614G). We conducted phylodynamic analysis of over 70,000 SARS-CoV-2 coronaviruses worldwide, sequenced until July2020, and found that the mutant subtype (614G) outcompeted the pre-existing type (614D), significantly faster in Europe and North-America than in East Asia. Bioinformatically and computationally, we identified a novel neutrophil elastase (ELANE) cleavage site introduced in the G-mutant, near the S1-S2 junction of the Spike protein. We hypothesised that elevation of neutrophil elastase level at the site of infection will enhance the activation of Spike protein thus facilitating host cell entry for 614G, but not the 614D, subtype. The level of neutrophil elastase in the lung is modulated by its inhibitor α1-antitrypsin (AAT). AAT prevents lung tissue damage by elastase. However, many individuals exhibit genotype-dependent deficiency of AAT. AAT deficiency eases host-cell entry of the 614G virus, by retarding inhibition of neutrophil elastase and consequently enhancing activation of the Spike protein. AAT deficiency is highly prevalent in European and North-American populations, but much less so in East Asia. Therefore, the 614G subtype is able to infect and spread more easily in populations of the former regions than in the latter region. Our analyses provide a molecular biological and evolutionary model for the higher observed virulence of the 614G subtype, in terms of causing higher morbidity in the host (higher infectivity and higher viral load), than the non-mutant 614D subtype.

Keywords: 614G subtype; Neutrophil Elastase; SARS-CoV-2; SERPINA1; α1-antitrypsin deficiency.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Binding Sites
  • COVID-19 / epidemiology
  • COVID-19 / etiology*
  • COVID-19 / metabolism*
  • Computational Biology
  • Disease Susceptibility
  • Genome, Viral*
  • Genotype
  • Global Health
  • Host-Pathogen Interactions
  • Humans
  • Leukocyte Elastase / chemistry
  • Leukocyte Elastase / metabolism*
  • Models, Biological
  • Models, Molecular
  • Models, Theoretical
  • Mutation*
  • Phylogeny
  • Protein Binding
  • Proteolysis
  • Public Health Surveillance
  • RNA, Viral
  • SARS-CoV-2 / classification*
  • SARS-CoV-2 / genetics*
  • SARS-CoV-2 / pathogenicity
  • Spike Glycoprotein, Coronavirus / chemistry
  • Spike Glycoprotein, Coronavirus / metabolism
  • Structure-Activity Relationship
  • alpha 1-Antitrypsin / genetics*

Substances

  • RNA, Viral
  • SERPINA1 protein, human
  • Spike Glycoprotein, Coronavirus
  • alpha 1-Antitrypsin
  • Leukocyte Elastase