Deciphering eukaryotic gene-regulatory logic with 100 million random promoters
- PMID: 31792407
- PMCID: PMC6954276
- DOI: 10.1038/s41587-019-0315-8
Deciphering eukaryotic gene-regulatory logic with 100 million random promoters
Erratum in
-
Author Correction: Deciphering eukaryotic gene-regulatory logic with 100 million random promoters.Nat Biotechnol. 2020 Oct;38(10):1211. doi: 10.1038/s41587-020-0665-2. Nat Biotechnol. 2020. PMID: 32792646
Abstract
How transcription factors (TFs) interpret cis-regulatory DNA sequence to control gene expression remains unclear, largely because past studies using native and engineered sequences had insufficient scale. Here, we measure the expression output of >100 million synthetic yeast promoter sequences that are fully random. These sequences yield diverse, reproducible expression levels that can be explained by their chance inclusion of functional TF binding sites. We use machine learning to build interpretable models of transcriptional regulation that predict ~94% of the expression driven from independent test promoters and ~89% of the expression driven from native yeast promoter fragments. These models allow us to characterize each TF's specificity, activity and interactions with chromatin. TF activity depends on binding-site strand, position, DNA helical face and chromatin context. Notably, expression level is influenced by weak regulatory interactions, which confound designed-sequence studies. Our analyses show that massive-throughput assays of fully random DNA can provide the big data necessary to develop complex, predictive models of gene regulation.
Conflict of interest statement
Declaration of Interests
AR is an SAB member of ThermoFisher Scientific, Neogene Therapeutics, and Syros Pharmaceuticals and a founder of and equity holder in Celsius Therapeutics. All other authors declare no competing interests.
Figures
Similar articles
-
Mapping functional transcription factor networks from gene expression data.Genome Res. 2013 Aug;23(8):1319-28. doi: 10.1101/gr.150904.112. Epub 2013 May 1. Genome Res. 2013. PMID: 23636944 Free PMC article.
-
Large-scale mapping of gene regulatory logic reveals context-dependent repression by transcriptional activators.Genome Res. 2017 Jan;27(1):87-94. doi: 10.1101/gr.212316.116. Epub 2016 Dec 13. Genome Res. 2017. PMID: 27965290 Free PMC article.
-
A New Mechanism for Mendelian Dominance in Regulatory Genetic Pathways: Competitive Binding by Transcription Factors.Genetics. 2017 Jan;205(1):101-112. doi: 10.1534/genetics.116.195255. Epub 2016 Nov 18. Genetics. 2017. PMID: 27866169 Free PMC article.
-
Disentangling the many layers of eukaryotic transcriptional regulation.Annu Rev Genet. 2012;46:43-68. doi: 10.1146/annurev-genet-110711-155437. Epub 2012 Aug 28. Annu Rev Genet. 2012. PMID: 22934649 Free PMC article. Review.
-
A conserved role for transcription factor sumoylation in binding-site selection.Curr Genet. 2019 Dec;65(6):1307-1312. doi: 10.1007/s00294-019-00992-w. Epub 2019 May 15. Curr Genet. 2019. PMID: 31093693 Review.
Cited by
-
Performance of abiotic stress-inducible synthetic promoters in genetically engineered hybrid poplar (Populus tremula × Populus alba).Front Plant Sci. 2022 Oct 18;13:1011939. doi: 10.3389/fpls.2022.1011939. eCollection 2022. Front Plant Sci. 2022. PMID: 36330242 Free PMC article.
-
A community effort to optimize sequence-based deep learning models of gene regulation.Nat Biotechnol. 2024 Oct 11. doi: 10.1038/s41587-024-02414-w. Online ahead of print. Nat Biotechnol. 2024. PMID: 39394483
-
Designing for durability: new tools to build stable, non-repetitive DNA.Synth Biol (Oxf). 2020 Aug 19;5(1):ysaa016. doi: 10.1093/synbio/ysaa016. eCollection 2020. Synth Biol (Oxf). 2020. PMID: 33094170 Free PMC article. No abstract available.
-
Focus on your locus with a massively parallel reporter assay.J Neurodev Disord. 2022 Sep 9;14(1):50. doi: 10.1186/s11689-022-09461-x. J Neurodev Disord. 2022. PMID: 36085003 Free PMC article. Review.
-
Sequence determinants of human gene regulatory elements.Nat Genet. 2022 Mar;54(3):283-294. doi: 10.1038/s41588-021-01009-4. Epub 2022 Feb 21. Nat Genet. 2022. PMID: 35190730 Free PMC article.
References
-
- Beer MA & Tavazoie S Predicting gene expression from sequence. Cell 117, 185–198 (2004). - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials
Miscellaneous
