A database of flavivirus RNA structures with a search algorithm for pseudoknots and triple base interactions

Bioinformatics. 2021 May 17;37(7):956-962. doi: 10.1093/bioinformatics/btaa759.

Abstract

Motivation: The Flavivirus genus includes several important pathogens, such as Zika, dengue and yellow fever virus. Flavivirus RNA genomes contain a number of functionally important structures in their 3' untranslated regions (3'UTRs). Due to the diversity of sequences and topologies of these structures, their identification is often difficult. However, predictions of such structures are important for understanding flavivirus replication cycles and for the development of antiviral strategies.

Results: We have developed an algorithm for structured pattern search in RNA sequences, including secondary structures, pseudoknots and triple base interactions. Using data on known conserved flavivirus 3'UTR structures, we constructed structural descriptors that cover the diversity of patterns in these motifs. The descriptors and the search algorithm were used to construct a database of flavivirus 3'UTR structures. Validating this approach, we identified a number of domains matching a general pattern of exoribonuclease Xrn1-resistant RNAs in the growing group of insect-specific flaviviruses.
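
To make the idea of a structural descriptor concrete, the following is a minimal sketch of a pattern search for the simplest case, a single hairpin (one helix closed by a loop). It is not the SRHS algorithm and does not use its descriptor syntax; the function name find_hairpins, its parameters and the example sequence are hypothetical and chosen only for illustration. The sketch assumes that a descriptor reduces to a fixed stem length plus a permitted range of loop lengths, and that Watson-Crick and G-U wobble pairs are allowed.

```python
# Toy structured-pattern search: find hairpins (stem + loop) in an RNA sequence.
# Illustration only; this is NOT the SRHS algorithm described in the abstract.

# Allowed base pairs: Watson-Crick plus G-U wobble (an assumption of this sketch).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def find_hairpins(seq, stem_len=4, loop_min=3, loop_max=8):
    """Return (start, loop_len) for every window whose 5' and 3' arms can pair."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    hits = []
    for start in range(len(seq)):
        for loop_len in range(loop_min, loop_max + 1):
            end = start + 2 * stem_len + loop_len  # exclusive end of the motif
            if end > len(seq):
                break  # longer loops cannot fit either
            # Base start+i on the 5' arm must pair with base end-1-i on the 3' arm.
            if all((seq[start + i], seq[end - 1 - i]) in PAIRS
                   for i in range(stem_len)):
                hits.append((start, loop_len))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: stem GGGC, loop UUCG, stem GCCC, flanked by As.
    for start, loop_len in find_hairpins("AAGGGCUUCGGCCCAA"):
        print(start, loop_len)
```

A pseudoknot or a triple base interaction would add further constraints of the same kind, for example a second helix whose arms interleave with the first, which is where a dedicated search algorithm becomes necessary.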

Availability and implementation: The Leiden Flavivirus RNA Structure Database is available at https://rna.liacs.nl. The search algorithm is available at https://github.com/LeidenRNA/SRHS.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions
  • Algorithms
  • Databases, Nucleic Acid*
  • Flavivirus* / genetics
  • Nucleic Acid Conformation
  • RNA, Viral / chemistry*

Substances

  • 3' Untranslated Regions
  • RNA, Viral