Molecular evolution of the second ancient human mariner transposon, Hsmar2, illustrates patterns of neutral evolution in the human genome lineage

Gene. 1997 Dec 31;205(1-2):219-28. doi: 10.1016/s0378-1119(97)00471-x.

Abstract

A consensus sequence for the second ancient mariner identified in the human genome, Hsmar2, was constructed by majority rule from full-length and partial sequences of 44 of the approximately 1000 copies in the genome. This 1300 base pair (bp) consensus has 31 bp inverted terminal repeats (ITRs) and encodes a 351 amino acid (aa) mariner transposase. The sequence of this transposase has allowed classification of Hsmar2 as a basal lineage of the irritans subfamily of mariners, sharing at most 38% aa identity with other members of the subfamily. The individual copies in the human genome are all highly mutated from the consensus, having suffered numerous small and some large insertions and deletions (indels), including many insertions of S and J subfamily Alu elements. On average, the copies differ from the consensus by 11.6% and have suffered 11.8 indels per kilobase (kb), and only 3.7% of the 30 hypermutable CpG dinucleotide pairs in the consensus remain intact. This level of divergence indicates that the ancestrally active Hsmar2 element represented by the consensus was present in the human genome lineage about 80 million years (Myr) ago. Each copy has apparently evolved since then largely independently of the others, and with little constraint on its transposase coding capacity. This pattern of molecular evolution fits the current model for mariner transposon evolution. These copies provide multiple independent datasets for evaluating the pattern of neutral evolution in the human genome; for example, they confirm that most indels are very short and that deletions are twice as common as insertions.
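
The abstract does not describe the alignment or consensus procedure in detail, so the following Python sketch is only an illustration of the general idea of a majority-rule consensus and a per-copy divergence estimate. It assumes the copies are already aligned to equal length, that `-` marks an indel gap, and that `N` marks positions not covered by a partial copy; the function names `majority_consensus` and `percent_divergence` and the toy sequences are hypothetical and are not taken from the paper.

```python
from collections import Counter

GAP = "-"      # assumed symbol for an indel gap in the alignment
MISSING = "N"  # assumed symbol for positions a partial copy does not cover


def majority_consensus(aligned, keep_gaps=False):
    """Return the majority-rule consensus of equal-length aligned sequences.

    Missing positions are ignored. A gap counts as a state, so a column
    where the gap is the most common state is left out of the consensus
    (or kept as '-' when keep_gaps is True). Ties go to the state seen
    first in the column, which is an arbitrary choice made for this sketch.
    """
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")

    consensus = []
    for column in zip(*aligned):
        counts = Counter(state for state in column if state != MISSING)
        if not counts:
            consensus.append(MISSING)  # no copy covers this column
            continue
        state = counts.most_common(1)[0][0]
        if state != GAP or keep_gaps:
            consensus.append(state)
    return "".join(consensus)


def percent_divergence(copy, consensus_aligned):
    """Percent substitution difference between one copy and the consensus.

    Only columns where both sequences carry a base are compared, so
    indels and uncovered regions do not count as substitutions.
    """
    compared = mismatches = 0
    for base, ref in zip(copy, consensus_aligned):
        if base in (GAP, MISSING) or ref in (GAP, MISSING):
            continue
        compared += 1
        mismatches += base != ref
    return 100.0 * mismatches / compared if compared else 0.0


if __name__ == "__main__":
    # Toy alignment, not real Hsmar2 data.
    copies = ["ACGTACGT", "ACGTTCGT", "ACG-ACGA", "NNGTACGT"]
    reference = majority_consensus(copies, keep_gaps=True)
    print(reference)
    for copy in copies:
        print(copy, round(percent_divergence(copy, reference), 1))
```

Averaging `percent_divergence` over all copies would give a figure comparable in kind to the 11.6% reported above, although the published value may rest on a different treatment of gaps, CpG sites, and partial sequences.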

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • DNA Transposable Elements*
  • DNA, Complementary
  • Evolution, Molecular*
  • Genome, Human*
  • Humans
  • Molecular Sequence Data
  • Mutagenesis, Insertional
  • Phylogeny
  • Sequence Deletion

Substances

  • DNA Transposable Elements
  • DNA, Complementary

Associated data

  • GENBANK/U49974
  • GENBANK/U92012
  • GENBANK/U92013
  • GENBANK/U92014
  • GENBANK/U92015
  • GENBANK/U92016
  • GENBANK/U92017
  • GENBANK/U92018
  • GENBANK/U92019
  • GENBANK/U92020
  • GENBANK/U92021
  • GENBANK/U92022
  • GENBANK/U92023
  • GENBANK/U92024
  • GENBANK/U92025
  • GENBANK/U92026
  • GENBANK/U92027
  • GENBANK/U92028