Highly conserved syntenic blocks at the vertebrate Hox loci and conserved regulatory elements within and outside Hox gene clusters

Proc Natl Acad Sci U S A. 2006 May 2;103(18):6994-9. doi: 10.1073/pnas.0601492103. Epub 2006 Apr 24.

Abstract

Hox genes in vertebrates are clustered, and the organization of the clusters has been highly conserved during evolution. The conservation of Hox clusters has been attributed to enhancers located within and outside the Hox clusters that are essential for the coordinated "temporal" and "spatial" expression patterns of Hox genes in developing embryos. To identify evolutionarily conserved regulatory elements within and outside the Hox clusters, we obtained contiguous sequences for the conserved syntenic blocks from the seven Hox loci in fugu and carried out a systematic search for conserved noncoding sequences (CNS) in the human, mouse, and fugu Hox loci. Our analysis has uncovered unusually large conserved syntenic blocks at the HoxA and HoxD loci. The conserved syntenic blocks at the human and mouse HoxA and HoxD loci span 5.4 Mb and 4 Mb and contain 21 and 19 genes, respectively. The corresponding regions in fugu are 16- and 12-fold smaller. A large number of CNS was identified within the Hox clusters and outside the Hox clusters spread over large regions. The CNS include previously characterized enhancers and overlap with the 5' global control regions of HoxA and HoxD clusters. Most of the CNS are likely to be control regions involved in the regulation of Hox and other genes in these loci. We propose that the regulatory elements spread across large regions on either side of Hox clusters are a major evolutionary constraint that has maintained the exceptionally long syntenic blocks at the HoxA and HoxD loci.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Binding Sites
  • Biological Evolution
  • Chromosomes, Human
  • Gene Expression Regulation, Developmental
  • Genes, Homeobox*
  • Homeodomain Proteins / genetics*
  • Humans
  • Mice
  • Molecular Sequence Data
  • Multigene Family*
  • Regulatory Sequences, Nucleic Acid*
  • Synteny*
  • Transcription Factors / metabolism

Substances

  • Homeodomain Proteins
  • Transcription Factors

Associated data

  • GENBANK/DQ481663
  • GENBANK/DQ481664
  • GENBANK/DQ481665
  • GENBANK/DQ481666
  • GENBANK/DQ481667
  • GENBANK/DQ481668
  • GENBANK/DQ481669