Introduction and effect of natural selection analysis at common mutations of SARS-CoV-2 spike gene in Iran

Virus Res. 2023 Oct 15:336:199202. doi: 10.1016/j.virusres.2023.199202. Epub 2023 Sep 1.

Abstract

The epidemic of coronavirus disease 2019 (COVID-19) was caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The spike (S) protein of SARS-Cov-2 is composed of two subunits, S1 and S2. This study aimed to describe SARS-CoV-2 haplotypes in Iranians based on the S gene, which plays a key role in the receptor recognition and cell membrane fusion proses. 95 positive saliva samples for SARS-CoV-2 were amplified and sequenced for the S gene. The sequences were classified into 35 haplotypes, which 11 haplotypes were new (H1, H2, H3, H4, H6, H7, H11, H13, H15, H16, H25) and have not been reported so far. Amino acid substitutions were found at 40 positions that 23 were located at S1 subunit and 16 were at S2 subunit and one was at cleavage loop (P681H/R), thus polymorphisms at S1 subunit were found to be higher than S2. The neutrality index (NI) analyses showed a negative departure from the neutral substitution patterns (NI > 1) for S1 and S2 subunit in the studied sequences. The co-occurrence of B-cell epitopes and mutation sites were found in seven positions with more probably to be exposed the immune system pressure. In conclusion, the results provide the significant data to design an effective vaccine based on this protein.

Keywords: COVID-19; Haplotype; SARS-CoV-2; Spike protein.

MeSH terms

  • Base Sequence
  • COVID-19* / genetics
  • Humans
  • Iran / epidemiology
  • Mutation
  • SARS-CoV-2* / genetics
  • Spike Glycoprotein, Coronavirus / chemistry

Substances

  • Spike Glycoprotein, Coronavirus
  • spike protein, SARS-CoV-2