High frequency of transition to transversion ratio in the stem region of RNA secondary structure of untranslated region of SARS-CoV-2

PeerJ. 2024 Apr 22:12:e16962. doi: 10.7717/peerj.16962. eCollection 2024.

Abstract

Introduction: The propensity of nucleotide bases to form pairs, causes folding and the formation of secondary structure in the RNA. Therefore, purine (R): pyrimidine (Y) base-pairing is vital to maintain uniform lateral dimension in RNA secondary structure. Transversions or base substitutions between R and Y bases, are more detrimental to the stability of RNA secondary structure, than transitions derived from substitutions between A and G or C and T. The study of transversion and transition base substitutions is important to understand evolutionary mechanisms of RNA secondary structure in the 5' and 3' untranslated (UTR) regions of SARS-CoV-2. In this work, we carried out comparative analysis of transition and transversion base substitutions in the stem and loop regions of RNA secondary structure of SARS-CoV-2.

Methods: We have considered the experimentally determined and well documented stem and loop regions of 5' and 3' UTR regions of SARS-CoV-2 for base substitution analysis. The secondary structure comprising of stem and loop regions were visualized using the RNAfold web server. The GISAID repository was used to extract base sequence alignment of the UTR regions. Python scripts were developed for comparative analysis of transversion and transition frequencies in the stem and the loop regions.

Results: The results of base substitution analysis revealed a higher transition (ti) to transversion (tv) ratio (ti/tv) in the stem region of UTR of RNA secondary structure of SARS-CoV-2 reported during the early stage of the pandemic. The higher ti/tv ratio in the stem region suggested the influence of secondary structure in selecting the pattern of base substitutions. This differential pattern of ti/tv values between stem and loop regions was not observed among the Delta and Omicron variants that dominated the later stage of the pandemic. It is noteworthy that the ti/tv values in the stem and loop regions were similar among the later dominant Delta and Omicron variant strains which is to be investigated to understand the rapid evolution and global adaptation of SARS-CoV-2.

Conclusion: Our findings implicate the lower frequency of transversions than the transitions in the stem regions of UTRs of SARS-CoV-2. The RNA secondary structures are associated with replication, translation, and packaging, further investigations are needed to understand these base substitutions across different variants of SARS-CoV-2.

Keywords: SARS-CoV-2 genome; Single nucleotide polymorphism; Stem and loop motifs; Transition; Transversion.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions / genetics
  • 5' Untranslated Regions / genetics
  • Base Pairing
  • Base Sequence
  • COVID-19 / epidemiology
  • COVID-19 / virology
  • Humans
  • Nucleic Acid Conformation*
  • RNA, Viral* / chemistry
  • RNA, Viral* / genetics
  • SARS-CoV-2* / chemistry
  • SARS-CoV-2* / genetics

Substances

  • RNA, Viral
  • 3' Untranslated Regions
  • 5' Untranslated Regions

Grants and funding

The authors received no funding for this work. The National Network Project for ACTREC-TMC, Navi Mumbai, Department of Biotechnology, Govt. of India (No. BT/PR40231/BTIS/137/63/2023) and the Bioinformatics and Computational Biology Centre for Microbial Biodiversity in Assam and Arunachal Pradesh, Department of Biotechnology, Govt. of India (No. BT/PR40253/BTIS/137/52/2022) paid the APC for this article. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.