Non-LTR retrotransposons encode noncanonical RRM domains in their first open reading frame

Proc Natl Acad Sci U S A. 2009 Jan 20;106(3):731-6. doi: 10.1073/pnas.0809964106. Epub 2009 Jan 12.

Abstract

Non-LTR retrotransposons (NLRs) are a unique class of mobile genetic elements that have significant impact on the evolution of eukaryotic genomes. However, the molecular details and functions of their encoded proteins, in particular of the accessory ORF1p proteins, are poorly understood. Here, we identify noncanonical RNA-recognition-motifs (RRMs) in several phylogenetically unrelated NLR ORF1p proteins. This provides an explanation for their RNA-binding properties and clearly shows that they are not related to the retroviral nucleocapsid protein Gag, despite the frequent presence of CCHC zinc knuckles. In particular, we characterize the ORF1p protein of the human long interspersed nuclear element 1 (LINE-1 or L1). We show that L1ORF1p is a multidomain protein, consisting of a coiled coil (cc), RRM, and C-terminal domain (CTD). Most importantly, we solved the crystal structure of the RRM domain, which is characterized by extended loops stabilized by unique salt bridges. Furthermore, we demonstrate that L1ORF1p trimerizes via its N-terminal cc domain, and we suggest that this property is functionally important for all homologues. The formation of distinct complexes with single-stranded nucleic acids requires the presence of the RRM and CTD domains on the same polypeptide chain as well as their close cooperation. Finally, the phylogenetic analysis of mammalian L1ORF1p shows an ancient origin of the RRM domain and supports a modular evolution of NLRs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Long Interspersed Nucleotide Elements / genetics*
  • Open Reading Frames*
  • Protein Structure, Secondary
  • Protein Structure, Tertiary
  • RNA / metabolism*
  • RNA-Binding Proteins / chemistry*
  • Terminal Repeat Sequences*

Substances

  • RNA-Binding Proteins
  • RNA

Associated data

  • PDB/2W7A