Architecture and secondary structure of an entire HIV-1 RNA genome

Nature. 2009 Aug 6;460(7256):711-6. doi: 10.1038/nature08237.


Single-stranded RNA viruses encompass broad classes of infectious agents and cause the common cold, cancer, AIDS and other serious health threats. Viral replication is regulated at many levels, including the use of conserved genomic RNA structures. Most potential regulatory elements in viral RNA genomes are uncharacterized. Here we report the structure of an entire HIV-1 genome at single-nucleotide resolution using SHAPE (selective 2'-hydroxyl acylation analysed by primer extension), a high-throughput RNA analysis technology. The genome encodes protein structure at two levels. In addition to the correspondence between RNA and protein primary sequences, a correlation exists between high levels of RNA structure and sequences that encode inter-domain loops in HIV proteins. This correlation suggests that RNA structure modulates ribosome elongation to promote native protein folding. Some simple genome elements previously shown to be important, including the ribosomal gag-pol frameshift stem-loop, are components of larger RNA motifs. We also identify organizational principles for unstructured RNA regions, including splice site acceptors and hypervariable regions. These results emphasize that the HIV-1 genome and, potentially, many coding RNAs are punctuated by previously unrecognized regulatory motifs and that extensive RNA structure constitutes an important component of the genetic code.
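
As an illustration of how per-nucleotide reactivities might be turned into a map of highly structured regions, the following is a minimal sketch, not the authors' pipeline. It assumes that low reactivity indicates constrained (likely base-paired) nucleotides, smooths the profile with a sliding-window median, and reports stretches that fall below a cutoff. The function names, the 75-nucleotide default window and the 0.3 threshold are all hypothetical choices made for this example.

```python
from statistics import median


def windowed_median(reactivities, window=75):
    """Median reactivity in a window centred on each position.

    The window is truncated at both ends of the sequence.
    The default window size is an assumption for illustration.
    """
    half = window // 2
    n = len(reactivities)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + half + 1)
        smoothed.append(median(reactivities[lo:hi]))
    return smoothed


def structured_regions(reactivities, window=75, threshold=0.3):
    """Return half-open (start, end) index ranges whose smoothed
    reactivity is below `threshold` (hypothetical cutoff).

    Low reactivity is taken here as a proxy for high RNA structure.
    """
    smoothed = windowed_median(reactivities, window)
    regions = []
    start = None
    for i, value in enumerate(smoothed):
        if value < threshold and start is None:
            start = i  # a low-reactivity stretch begins
        elif value >= threshold and start is not None:
            regions.append((start, i))  # the stretch ended at i
            start = None
    if start is not None:
        regions.append((start, len(smoothed)))
    return regions


if __name__ == "__main__":
    # Synthetic profile: flexible, then constrained, then flexible.
    profile = [1.0] * 10 + [0.1] * 10 + [1.0] * 10
    print(structured_regions(profile, window=3, threshold=0.3))
```

Regions flagged this way could then be compared with the positions of protein domain boundaries in the encoded polyproteins, which is the kind of correlation the abstract describes; the comparison itself is outside this sketch.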

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • Genome, Viral / genetics*
  • HIV Envelope Protein gp120 / genetics
  • HIV-1 / genetics*
  • HIV-1 / metabolism
  • Human Immunodeficiency Virus Proteins / chemistry
  • Human Immunodeficiency Virus Proteins / genetics
  • Nucleic Acid Conformation*
  • Protein Conformation
  • Protein Folding
  • Protein Sorting Signals / genetics
  • RNA, Viral / chemistry*
  • RNA, Viral / genetics*

Substances

  • HIV Envelope Protein gp120
  • Human Immunodeficiency Virus Proteins
  • Protein Sorting Signals
  • RNA, Viral