Functional noncoding sequences derived from SINEs in the mammalian genome

Genome Res. 2006 Jul;16(7):864-74. doi: 10.1101/gr.5255506. Epub 2006 May 22.

Abstract

Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Base Pairing
  • Base Sequence
  • Consensus Sequence
  • DNA Transposable Elements / genetics
  • Genome*
  • Genome, Human
  • Humans
  • Mammals / genetics*
  • Molecular Sequence Data
  • Nucleic Acid Conformation
  • Phylogeny
  • Promoter Regions, Genetic
  • Protein Structure, Tertiary
  • RNA / chemistry
  • RNA, Ribosomal, 5S / genetics
  • RNA, Transfer / genetics
  • Selection, Genetic
  • Sequence Homology, Nucleic Acid
  • Short Interspersed Nucleotide Elements / genetics*
  • Time Factors

Substances

  • DNA Transposable Elements
  • RNA, Ribosomal, 5S
  • RNA
  • RNA, Transfer

Associated data

  • GENBANK/AC150283
  • GENBANK/AC150284
  • GENBANK/AC150308
  • GENBANK/AC150309
  • GENBANK/AC150310
  • GENBANK/AC151571