The Evolution of G-quadruplex Structure in mRNA Untranslated Region

Evol Bioinform Online. 2021 Jul 21:17:11769343211035140. doi: 10.1177/11769343211035140. eCollection 2021.

Abstract

The RNA G-quadruplex (rG4) is a kind of non-canonical high-order secondary structure with important biological functions and is enriched in untranslated regions (UTRs) of protein-coding genes. However, how rG4 structures evolve is largely unknown. Here, we systematically investigated the evolution of RNA sequences around UTR rG4 structures in 5 eukaryotic organisms. We found universal selection on UTR sequences, which facilitated rG4 formation in all the organisms that we analyzed. While G-rich sequences were preferred in the rG4 structural region, C-rich sequences were selectively not preferred. The selective pressure acting on rG4 structures in the UTRs of genes with higher G content was significantly smaller. Furthermore, we found that rG4 structures experienced smaller evolutionary selection near the translation initiation region in the 5' UTR, near the polyadenylation signals in the 3' UTR, and in regions flanking the miRNA targets in the 3' UTR. These results suggest universal selection for rG4 formation in the UTRs of eukaryotic genomes and the selection may be related to the biological functions of rG4s.

Keywords: RNA G-quadruplex; evolutionary selection; untranslated region.