Epigenetic priors for identifying active transcription factor binding sites
- PMID: 22072382
- PMCID: PMC3244768
- DOI: 10.1093/bioinformatics/btr614
Abstract
Motivation: Accurate knowledge of the genome-wide binding of transcription factors in a particular cell type or under a particular condition is necessary for understanding transcriptional regulation. Using epigenetic data, such as histone modification and DNase I accessibility data, has been shown to improve motif-based in silico methods for predicting such binding, but this approach has not yet been fully explored.
Results: We describe a probabilistic method for combining one or more tracks of epigenetic data with a standard DNA sequence motif model to improve our ability to identify active transcription factor binding sites (TFBSs). We convert each data type into a position-specific probabilistic prior and combine these priors with a traditional probabilistic motif model to compute a log-posterior odds score. Our experiments, using histone modifications H3K4me1, H3K4me3, H3K9ac and H3K27ac, as well as DNase I sensitivity, show conclusively that the log-posterior odds score consistently outperforms a simple binary filter based on the same data. We also show that our approach performs competitively with a more complex method, CENTIPEDE, and suggest that the relative simplicity of the log-posterior odds scoring method makes it an appealing and very general method for identifying functional TFBSs on the basis of DNA and epigenetic evidence.
Availability and implementation: FIMO, part of the MEME Suite software toolkit, now supports log-posterior odds scoring using position-specific priors for motif search. A web server and source code are available at http://meme.nbcr.net. Utilities for creating priors are at http://research.imb.uq.edu.au/t.bailey/SD/Cuellar2011.
Contact: t.bailey@uq.edu.au
Supplementary information: Supplementary data are available at Bioinformatics online.
