RSAT peak-motifs: motif analysis in full-size ChIP-seq datasets
- PMID: 22156162
- PMCID: PMC3287167
- DOI: 10.1093/nar/gkr1104
Abstract
ChIP-seq is increasingly used to characterize transcription factor binding and chromatin marks at a genomic scale. Various tools are now available to extract binding motifs from peak data sets. However, most approaches are only available as command-line programs, or via a website but with size restrictions. We present peak-motifs, a computational pipeline that discovers motifs in peak sequences, compares them with databases, exports putative binding sites for visualization in the UCSC genome browser and generates an extensive report suited for both naive and expert users. It relies on time- and memory-efficient algorithms enabling the treatment of several thousand peaks within minutes. Regarding time efficiency, peak-motifs outperforms all comparable tools by several orders of magnitude. We demonstrate its accuracy by analyzing data sets ranging from 4,000 to 128,000 peaks for 12 embryonic stem cell-specific transcription factors. In all cases, the program finds the expected motifs and returns additional motifs potentially bound by cofactors. We further apply peak-motifs to discover tissue-specific motifs in peak collections for the p300 transcriptional co-activator. To our knowledge, peak-motifs is the only tool that performs a complete motif analysis and offers a user-friendly web interface without any restriction on sequence size or number of peaks.
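The export step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each putative site is known by its offset inside a peak whose genomic start is given, and writes it as a six-column BED line that the UCSC genome browser can load as a custom track. The `Site` record, the function names and the score clamping are hypothetical choices for illustration only; they are not the peak-motifs implementation or its output format.

```python
from typing import Iterable, NamedTuple


class Site(NamedTuple):
    """Hypothetical record for one putative binding site (illustrative only)."""
    peak_chrom: str   # chromosome of the peak, e.g. "chr1"
    peak_start: int   # 0-based genomic start of the peak
    offset: int       # 0-based offset of the site within the peak sequence
    length: int       # site length in base pairs
    strand: str       # "+" or "-"
    motif: str        # name of the matched motif
    score: float      # match score (arbitrary scale, assumed here)


def site_to_bed(site: Site) -> str:
    """Convert a peak-relative site to one BED6 line (0-based, half-open)."""
    start = site.peak_start + site.offset
    end = start + site.length
    # BED scores are integers in the range 0..1000, so round and clamp.
    bed_score = max(0, min(1000, round(site.score)))
    fields = [site.peak_chrom, str(start), str(end),
              site.motif, str(bed_score), site.strand]
    return "\t".join(fields)


def sites_to_bed_track(sites: Iterable[Site],
                       track_name: str = "putative_sites") -> str:
    """Build a custom-track text: one track line followed by BED lines."""
    header = f'track name="{track_name}" description="Putative binding sites"'
    return "\n".join([header] + [site_to_bed(s) for s in sites]) + "\n"


if __name__ == "__main__":
    demo = [Site("chr1", 1000, 25, 8, "+", "motif_1", 7.4)]
    print(sites_to_bed_track(demo), end="")
```

The resulting text could be pasted into the custom-track form of the genome browser; the coordinate arithmetic is the only step that depends on how peaks and sites are indexed, so that convention should be checked against the actual input files.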
Similar articles
- A complete workflow for the analysis of full-size ChIP-seq (and similar) data sets using peak-motifs. Nat Protoc. 2012 Jul 26;7(8):1551-68. doi: 10.1038/nprot.2012.088. PMID: 22836136
- MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12. PMID: 21486936
- RSAT::Plants: Motif Discovery in ChIP-Seq Peaks of Plant Genomes. Methods Mol Biol. 2016;1482:297-322. doi: 10.1007/978-1-4939-6396-6_19. PMID: 27557775
- A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data. Biol Direct. 2014 Feb 20;9:4. doi: 10.1186/1745-6150-9-4. PMID: 24555784. Review.
- Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond. Cell Cycle. 2014;13(18):2847-52. doi: 10.4161/15384101.2014.949201. PMID: 25486472. Review.
Cited by
- DNA methylation shapes the Polycomb landscape during the exit from naive pluripotency. Nat Struct Mol Biol. 2024 Oct 24. doi: 10.1038/s41594-024-01405-4. Online ahead of print. PMID: 39448850
- Rebalancing gene haploinsufficiency in vivo by targeting chromatin. Nat Commun. 2016 Jun 3;7:11688. doi: 10.1038/ncomms11688. PMID: 27256596
- Overlapping ETS and CRE Motifs ((G/C)CGGAAGTGACGTCA) preferentially bound by GABPα and CREB proteins. G3 (Bethesda). 2012 Oct;2(10):1243-56. doi: 10.1534/g3.112.004002. Epub 2012 Oct 1. PMID: 23050235
- cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data. Nat Methods. 2019 May;16(5):397-400. doi: 10.1038/s41592-019-0367-1. Epub 2019 Apr 8. PMID: 30962623
- TrawlerWeb: an online de novo motif discovery tool for next-generation sequencing datasets. BMC Genomics. 2018 Apr 5;19(1):238. doi: 10.1186/s12864-018-4630-0. PMID: 29621972
