A new group of tyrosine recombinase-encoding retrotransposons

Mol Biol Evol. 2004 Apr;21(4):746-59. doi: 10.1093/molbev/msh072. Epub 2004 Feb 12.

Abstract

A wide variety of novel tyrosine recombinase (YR)-encoding retrotransposons were identified using data emerging from the various eukaryotic genome sequencing projects. Although many of these elements are clearly members of the previously described DIRS group of YR retrotransposons, a substantial number, including elements from a variety of fungi and animals, belong to a distinct and previously unrecognized group. We refer to these latter elements as the Ngaro group after a representative from zebrafish. Like the members of the DIRS group, Ngaro elements encode proteins bearing reverse transcriptase (RT) and ribonuclease H (RH) domains similar to those of long terminal repeat (LTR) retrotransposons. Phylogenetic analyses based on alignments of RT/RH and YR domains, however, indicate that Ngaro and DIRS are anciently diverged groups. Differences in coding capacity also support the distinction between the two groups. For instance, we found that DIRS elements all encode a protein domain that is similar in sequence to the DNA methyltransferases of certain bacteriophages, whereas this domain is absent from all Ngaro elements. Together, the Ngaro and DIRS groups of YR retrotransposons contain elements with an astonishing diversity in structures, with variations in the nature of the associated repeat sequences and in the arrangement and complement of coding regions. In addition, they contain elements with some surprising features, such as spliceosomal introns and long overlapping open reading frames.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • DNA Replication / genetics
  • Evolution, Molecular
  • Fungi / genetics
  • Methyltransferases / genetics
  • Models, Biological
  • Molecular Sequence Data
  • Phylogeny*
  • Protein Structure, Tertiary / genetics
  • Recombinases / chemistry
  • Recombinases / classification*
  • Recombinases / genetics*
  • Retroelements / genetics*
  • Sequence Alignment
  • Terminal Repeat Sequences / genetics
  • Tyrosine / chemistry
  • Zebrafish / genetics

Substances

  • Recombinases
  • Retroelements
  • Tyrosine
  • Methyltransferases

Associated data

  • GENBANK/AY152729