Integrative analysis of C. elegans modENCODE ChIP-seq data sets to infer gene regulatory interactions

Genome Res. 2013 Jun;23(6):941-53. doi: 10.1101/gr.152876.112. Epub 2013 Mar 26.

Abstract

The C. elegans modENCODE Consortium has defined in vivo binding sites for a large array of transcription factors by ChIP-seq. In this article, we present examples that illustrate how this compendium of ChIP-seq data can drive biological insights not possible with analysis of individual factors. First, we analyze the number of independent factors bound to the same locus, termed transcription factor complexity, and find that low-complexity sites are more likely to respond to altered expression of a single bound transcription factor. Next, we show that comparison of binding sites for the same factor across developmental stages can reveal insight into the regulatory network of that factor, as we find that the transcription factor UNC-62 has distinct binding profiles at different stages due to distinct cofactor co-association as well as tissue-specific alternative splicing. Finally, we describe an approach to infer potential regulators of gene expression changes found in profiling experiments (such as DNA microarrays) by screening these altered genes to identify significant enrichment for targets of a transcription factor identified in ChIP-seq data sets. After confirming that this approach can correctly identify the upstream regulator on expression data sets for which the regulator was previously known, we applied this approach to identify novel candidate regulators of transcriptional changes with age. The analysis revealed nine candidate aging regulators, of which three were previously known to have a role in longevity. We experimentally showed that two of the new candidate aging regulators can extend lifespan when overexpressed, indicating that this approach can identify novel functional regulators of complex processes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aging / genetics
  • Animals
  • Binding Sites*
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans / metabolism*
  • Caenorhabditis elegans Proteins / metabolism
  • Chromatin Immunoprecipitation
  • Cluster Analysis
  • Computational Biology* / methods
  • Gene Expression Profiling
  • Gene Expression Regulation*
  • Gene Regulatory Networks*
  • High-Throughput Nucleotide Sequencing
  • Homeodomain Proteins / metabolism
  • Nucleotide Motifs
  • Organ Specificity / genetics
  • Position-Specific Scoring Matrices
  • Protein Binding
  • Transcription Factors / metabolism*

Substances

  • Caenorhabditis elegans Proteins
  • Homeodomain Proteins
  • Transcription Factors
  • Unc-62 protein, C elegans