Synthetic STARR-seq reveals how DNA shape and sequence modulate transcriptional output and noise

PLoS Genet. 2018 Nov 14;14(11):e1007793. doi: 10.1371/journal.pgen.1007793. eCollection 2018 Nov.

Abstract

The binding of transcription factors to short recognition sequences plays a pivotal role in controlling the expression of genes. The sequence and shape characteristics of binding sites influence DNA binding specificity and have also been implicated in modulating the activity of transcription factors downstream of binding. To quantitatively assess the transcriptional activity of tens of thousands of designed synthetic sites in parallel, we developed a synthetic version of STARR-seq (synSTARR-seq). We used the approach to systematically analyze how variations in the recognition sequence of the glucocorticoid receptor (GR) affect transcriptional regulation. Our approach resulted in the identification of a novel highly active functional GR binding sequence and revealed that sequence variation both within and flanking GR's core binding site can modulate GR activity without apparent changes in DNA binding affinity. Notably, we found that the sequence composition of variants with similar activity profiles was highly diverse. In contrast, groups of variants with similar activity profiles showed specific DNA shape characteristics indicating that DNA shape may be a better predictor of activity than DNA sequence. Finally, using single cell experiments with individual enhancer variants, we obtained clues indicating that the architecture of the response element can independently tune expression mean and cell-to cell variability in gene expression (noise). Together, our studies establish synSTARR as a powerful method to systematically study how DNA sequence and shape modulate transcriptional output and noise.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites / genetics
  • DNA / chemistry
  • DNA / genetics*
  • DNA / metabolism
  • Enhancer Elements, Genetic
  • Gene Expression Regulation
  • Genes, Reporter
  • Genes, Synthetic
  • Genetic Variation
  • Humans
  • Nucleic Acid Conformation
  • Protein Conformation
  • Receptors, Glucocorticoid / chemistry
  • Receptors, Glucocorticoid / genetics
  • Receptors, Glucocorticoid / metabolism
  • Response Elements
  • Sequence Analysis, DNA / methods*
  • Sequence Analysis, DNA / statistics & numerical data
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Transcription, Genetic*

Substances

  • Receptors, Glucocorticoid
  • Transcription Factors
  • DNA

Grants and funding

This work was supported by the Deutsche Forschungsgemeinschaft [ME4154/1-1 to SS]. And by the Max-Planck-Gesellschaft (to SS, MB, EE, MB, PB, MV and SHM. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.