Extreme HOT regions are CpG-dense promoters in C. elegans and humans

Genome Res. 2014 Jul;24(7):1138-46. doi: 10.1101/gr.161992.113. Epub 2014 Mar 20.

Abstract

Most vertebrate promoters lie in unmethylated CpG-dense islands, whereas methylation of the more sparsely distributed CpGs in the remainder of the genome is thought to contribute to transcriptional repression. Nonmethylated CG dinucleotides are recognized by CXXC finger protein 1 (CXXC1, also known as CFP1), which recruits SETD1A (also known as Set1) methyltransferase for trimethylation of histone H3 lysine 4, an active promoter mark. Genomic regions enriched for CpGs are thought to be either absent or irrelevant in invertebrates that lack DNA methylation, such as C. elegans; however, a CXXC1 ortholog (CFP-1) is present. Here we demonstrate that C. elegans CFP-1 targets promoters with high CpG density, and these promoters are marked by high levels of H3K4me3. Furthermore, as for mammalian promoters, high CpG content is associated with nucleosome depletion irrespective of transcriptional activity. We further show that highly occupied target (HOT) regions identified by the binding of a large number of transcription factors are CpG-rich promoters in C. elegans and human genomes, suggesting that the unusually high factor association at HOT regions may be a consequence of CpG-linked chromatin accessibility. Our results indicate that nonmethylated CpG-dense sequence is a conserved genomic signal that promotes an open chromatin state, targeting by a CXXC1 ortholog, and H3K4me3 modification in both C. elegans and human genomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans / metabolism
  • CpG Islands*
  • DNA Methylation*
  • Epigenesis, Genetic
  • Epigenomics
  • Gene Expression
  • Gene Expression Regulation
  • Gene Order
  • Genes, Reporter
  • Genetic Vectors / genetics
  • Histones / metabolism
  • Humans
  • Nucleosomes / genetics
  • Nucleosomes / metabolism
  • Promoter Regions, Genetic*
  • Protein Binding
  • Transcription Factors / metabolism

Substances

  • Histones
  • Nucleosomes
  • Transcription Factors