Unique sequence organization and erythroid cell-specific nuclear factor-binding of mammalian theta 1 globin promoters

Nucleic Acids Res. 1989 Jul 25;17(14):5687-700. doi: 10.1093/nar/17.14.5687.

Abstract

The theta 1 globin gene is an alpha globin-like gene, and started to diverge from the other members of the alpha globin family 260 million years ago. DNA sequencing and transcriptional analysis indicated that it is functional in erythroid cells of the higher primates, but not in prosimians and rabbit. The theta 1 promoter region of higher primates including man consists of GC-rich sequences characteristic of housekeeping gene promoters, and CCAAT and TATA boxes located further upstream. It is shown here that the housekeeping gene promoter-like region of human theta 1 contains two tandemly arranged, GC-rich motifs (GC-I and GC-II). Of these, GC-II interacts with nuclear factor(s) present in the globin-expressing, erythroleukemia cell line K562, before and after hemin induction. GC-I, however, interacts with nuclear factor(s) only present in hemin-induced K562 cells. These factors are different from previously reported erythroid cell-specific factors, and are not detectable in non-erythroid Hela cells. Furthermore, the sequence of the motif GC-I and its location relative to ATG codon have been conserved among all known mammalian theta 1 globin genes. Finally, and most interestingly, the CCAAT box of theta 1 is contained within a 38 bp internal segment of Alu repeat sequence. Immediately upstream from this CCAAT box-containing Alu repeat segment is a 241 bp Alu repeat pointing in the opposite direction. The conservation of this novel arrangement among the higher primates suggests that an inserted Alu family repeat and its flanking genomic sequence have co-evolved, for at least 30 million years, to provide the canonical CCAAT and TATA promoter elements of the theta 1 globin genes in higher primates.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Cell Line
  • Cell Nucleus / metabolism*
  • Codon / genetics
  • Exons
  • Genes*
  • Genes, Homeobox
  • Globins / genetics*
  • HeLa Cells / metabolism
  • Humans
  • Leukemia, Myelogenous, Chronic, BCR-ABL Positive
  • Molecular Sequence Data
  • Mosaicism
  • Primates
  • Promoter Regions, Genetic*
  • Sequence Homology, Nucleic Acid
  • Species Specificity

Substances

  • Codon
  • Globins