Using RNA inverse folding to identify IRES-like structural subdomains

RNA Biol. 2013 Dec;10(12):1842-52. doi: 10.4161/rna.26994. Epub 2013 Nov 4.

Abstract

Internal ribosome entry site (IRES) elements govern protein synthesis of mRNAs that bypass cap-dependent translation inhibition under stress conditions. Picornavirus IRES are cis-acting elements, organized in modular domains that recruit the ribosome to internal mRNA sites. The aim of this study was to retrieve short RNA sequences with the capacity to adopt RNA folding patterns conserved with IRES structural subdomains, likely corresponding to RNA modules. We have applied a new program, RNAiFold, an inverse folding algorithm that determines all sequences whose minimum free energy structure is identical to that of the structural domains of interest. Sequences differing by more than 1 nt were clustered. Then, BLASTing one randomly chosen sequence from each cluster of the RNAiFold output, we retrieved viral and cellular sequences among output hits. As a proof of principle, we present the data corresponding to a coding region of Drosophila melanogaster TAF6, a transcription factor-associated protein that contains a structural motif within its coding region potentially folding into an IRES-like subdomain. This RNA region shows a biased codon usage, as predicted from structural constraints at the RNA level, it harbors conserved IRES structural motifs in loops, and interestingly, it has the capacity to confer internal initiation of translation in tissue culture cells.

Keywords: IRES elements; RNA structural domains; SHAPE analysis; inverse folding; translation control.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Drosophila Proteins / genetics
  • Drosophila melanogaster / genetics
  • Drosophila melanogaster / metabolism
  • Gene Expression Regulation
  • Models, Molecular
  • Nucleic Acid Conformation
  • Picornaviridae / genetics
  • RNA Folding
  • RNA, Messenger / metabolism*
  • Regulatory Sequences, Ribonucleic Acid*
  • Reproducibility of Results
  • Ribosomes / metabolism*
  • TATA-Binding Protein Associated Factors / genetics
  • Transcription Factor TFIID / genetics

Substances

  • Drosophila Proteins
  • RNA, Messenger
  • Regulatory Sequences, Ribonucleic Acid
  • TATA-Binding Protein Associated Factors
  • Taf6 protein, Drosophila
  • Transcription Factor TFIID