Full-genome sequences of the first two SARS-CoV-2 viruses from India

Indian J Med Res. 2020 Feb & Mar;151(2 & 3):200-209. doi: 10.4103/ijmr.IJMR_663_20.

Abstract

Background & objectives: Since December 2019, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has globally affected 195 countries. In India, suspected cases were screened for SARS-CoV-2 as per the advisory of the Ministry of Health and Family Welfare. The objective of this study was to characterize SARS-CoV-2 sequences from three identified positive cases as on February 29, 2020.

Methods: Throat swab/nasal swab specimens for a total of 881 suspected cases were screened by E gene and confirmed by RdRp (1), RdRp (2) and N gene real-time reverse transcription-polymerase chain reactions and next-generation sequencing. Phylogenetic analysis, molecular characterization and prediction of B- and T-cell epitopes for Indian SARS-CoV-2 sequences were undertaken.

Results: Three cases with a travel history from Wuhan, China, were confirmed positive for SARS-CoV-2. Almost complete (29,851 nucleotides) genomes of case 1, case 3 and a fragmented genome for case 2 were obtained. The sequences of Indian SARS-CoV-2 though not identical showed high (~99.98%) identity with Wuhan seafood market pneumonia virus (accession number: NC 045512). Phylogenetic analysis showed that the Indian sequences belonged to different clusters. Predicted linear B-cell epitopes were found to be concentrated in the S1 domain of spike protein, and a conformational epitope was identified in the receptor-binding domain. The predicted T-cell epitopes showed broad human leucocyte antigen allele coverage of A and B supertypes predominant in the Indian population.

Interpretation & conclusions: The two SARS-CoV-2 sequences obtained from India represent two different introductions into the country. The genetic heterogeneity is as noted globally. The identified B- and T-cell epitopes may be considered suitable for future experiments towards the design of vaccines and diagnostics. Continuous monitoring and analysis of the sequences of new cases from India and the other affected countries would be vital to understand the genetic evolution and rates of substitution of the SARS-CoV-2.

Keywords: India; Kerala; genomes; next-generation sequencing; phylogeny; real-time reverse transcription-polymerase chain reaction; severe acute respiratory syndrome coronavirus 2; Epitope.

MeSH terms

  • Betacoronavirus / genetics*
  • COVID-19
  • Coronavirus Infections
  • Epitopes, B-Lymphocyte / genetics
  • Epitopes, T-Lymphocyte / genetics
  • Genome, Viral*
  • Humans
  • India
  • Models, Molecular
  • Pandemics
  • Phylogeny
  • Pneumonia, Viral
  • Protein Structure, Tertiary
  • RNA, Viral / genetics
  • Reverse Transcriptase Polymerase Chain Reaction
  • SARS-CoV-2
  • Spike Glycoprotein, Coronavirus / genetics

Substances

  • Epitopes, B-Lymphocyte
  • Epitopes, T-Lymphocyte
  • RNA, Viral
  • Spike Glycoprotein, Coronavirus
  • spike protein, SARS-CoV-2