A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts

Genome Res. 2005 Aug;15(8):1061-72. doi: 10.1101/gr.4004805. Epub 2005 Jul 15.

Abstract

Recent studies of the genome architecture of vertebrates have uncovered two unforeseen aspects of its organization. First, large regions of the genome, called gene deserts, are devoid of protein-coding sequences and have no obvious biological role. Second, comparative genomics has highlighted the existence of an array of highly conserved non-coding regions (HCNRs) in all vertebrates. Most surprisingly, these structural features are strongly associated with genes that have essential functions during development. Among these, the vertebrate Iroquois (Irx) genes stand out on both fronts. Mammalian Irx genes are organized in two clusters (IrxA and IrxB) that span >1 Mb each with no other genes interspersed. Additionally, a large number of HCNRs exist within Irx clusters. We have systematically examined the enhancer activity of HCNRs from the IrxB cluster using transgenic Xenopus and zebrafish embryos. Most of these HCNRs are active in subdomains of endogenous Irx expression, and some are candidates to contain shared enhancers of neighboring genes, which could explain the evolutionary conservation of Irx clusters. Furthermore, HCNRs present in tetrapod IrxB but not in fish may be responsible for novel Irx expression domains that appeared after their divergence. Finally, we have performed a more detailed analysis on two IrxB ultraconserved non-coding regions (UCRs) duplicated in IrxA clusters in similar relative positions. These four regions share a core region highly conserved among all of them and drive expression in similar domains. However, inter-species conserved sequences surrounding the core, specific for each of these UCRs, are able to modulate their expression.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Animals, Genetically Modified
  • Base Sequence
  • Conserved Sequence
  • DNA, Intergenic*
  • Embryo, Nonmammalian
  • Enhancer Elements, Genetic*
  • Gene Expression Regulation, Developmental
  • Homeodomain Proteins / genetics*
  • Molecular Sequence Data
  • Multigene Family*
  • Transcription Factors / genetics*
  • Vertebrates / genetics*
  • Xenopus / embryology
  • Xenopus / genetics
  • Zebrafish / embryology
  • Zebrafish / genetics
  • Zebrafish Proteins

Substances

  • DNA, Intergenic
  • Homeodomain Proteins
  • Transcription Factors
  • Zebrafish Proteins
  • irx1a protein, zebrafish