How Many Messenger RNAs Can Be Translated by the START Mechanism?

Int J Mol Sci. 2020 Nov 8;21(21):8373. doi: 10.3390/ijms21218373.

Abstract

Translation initiation is a key step in the protein synthesis stage of the gene expression pathway of all living cells. In this important process, ribosomes have to accurately find the AUG start codon in order to ensure the integrity of the proteome. "Structure Assisted RNA Translation", or "START", has been proposed to use stable secondary structures located in the coding sequence to augment start site selection by steric hindrance of the progression of the pre-initiation complex on messenger RNA. This implies that such structures have to be located downstream and at an optimal distance from the AUG start codon (i.e., downstream nucleotide +16). In order to assess the importance of the START mechanism in the overall mRNA translation process, we developed a bioinformatic tool to screen coding sequences for such stable structures in a 50 nucleotide-long window spanning the nucleotides from +16 to +65. We screened eight bacterial genomes and six eukaryotic genomes. We found stable structures in 0.6-2.5% of eukaryotic coding sequences. Approximately half of these were predicted to form G-quadruplex structures. In humans, we selected 747 structures. In bacteria, the coding sequences from Gram-positive bacteria contained 2.6-4.2% stable structures, whereas the structures were less abundant in Gram-negative bacteria (0.2-2.7%). In contrast to eukaryotes, putative G-quadruplex structures are very rare in the coding sequences of bacteria. Altogether, our study reveals that the START mechanism seems to be an ancient strategy to facilitate start codon recognition that is used in different kingdoms of life.
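As an illustration of the screening window described above, the following minimal Python sketch extracts nucleotides +16 to +65 from a coding sequence and checks the window for a putative G-quadruplex motif. It is not the authors' tool. It assumes that the A of the start codon is position +1 and that the input begins at the start codon. The function names (start_window, has_putative_g4) are hypothetical, and the motif (four runs of at least three G separated by loops of 1 to 7 nucleotides) is a commonly used heuristic rather than the criterion stated in the abstract. The thermodynamic stability step of the screen is not reproduced here.

```python
import re

# Window boundaries from the abstract, counting the A of AUG as +1 (assumption).
WINDOW_START = 16  # first nucleotide of the window
WINDOW_END = 65    # last nucleotide of the window, inclusive

# Widely used putative G-quadruplex heuristic: four G-runs (>= 3 G each)
# separated by loops of 1 to 7 nucleotides. This is an assumption and not
# necessarily the criterion used by the authors.
PG4_PATTERN = re.compile(
    r"G{3,}[ACGU]{1,7}G{3,}[ACGU]{1,7}G{3,}[ACGU]{1,7}G{3,}"
)


def start_window(cds):
    """Return the 50-nt window +16..+65 as RNA, or None if the CDS is too short.

    The input is assumed to start at the first nucleotide of the start codon.
    """
    rna = cds.upper().replace("T", "U")  # accept DNA or RNA input
    if len(rna) < WINDOW_END:
        return None
    # Convert 1-based inclusive coordinates to a 0-based Python slice.
    return rna[WINDOW_START - 1:WINDOW_END]


def has_putative_g4(window):
    """Report whether the window contains the putative G-quadruplex motif."""
    return PG4_PATTERN.search(window) is not None


if __name__ == "__main__":
    # Synthetic example sequence, not taken from any genome.
    example = "ATG" + "A" * 12 + "GGGAGGGAGGGAGGG" + "A" * 40
    window = start_window(example)
    print(len(window), has_putative_g4(window))
```

A complete screen would additionally fold each window with an RNA secondary-structure predictor and retain only those below a chosen free-energy threshold; that threshold is not given in the abstract and is therefore left out of the sketch.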

Keywords: START; mRNA; ribosome; secondary structures; translation initiation.

MeSH terms

  • 5' Untranslated Regions / genetics
  • Animals
  • Bacteria / genetics
  • Bacteria / metabolism
  • Codon, Initiator* / chemistry
  • Codon, Initiator* / genetics
  • Computational Biology
  • Eukaryota / genetics
  • Eukaryota / metabolism
  • G-Quadruplexes
  • Genome, Bacterial
  • Humans
  • Models, Biological
  • Nucleic Acid Conformation
  • Peptide Chain Initiation, Translational / genetics*
  • RNA, Messenger / chemistry
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism

Substances

  • 5' Untranslated Regions
  • Codon, Initiator
  • RNA, Messenger