Exploring massive, genome scale datasets with the GenometriCorr package
- PMID: 22693437
- PMCID: PMC3364938
- DOI: 10.1371/journal.pcbi.1002529
Exploring massive, genome scale datasets with the GenometriCorr package
Abstract
We have created a statistically grounded tool for determining the correlation of genomewide data with other datasets or known biological features, intended to guide biological exploration of high-dimensional datasets, rather than providing immediate answers. The software enables several biologically motivated approaches to these data and here we describe the rationale and implementation for each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data and instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association; this is intended to suggest potentially relevant relationships between the datasets.
Availability and implementation: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the sourceforge repository. The package is pending submission to Bioconductor.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
Similar articles
-
Girafe--an R/Bioconductor package for functional exploration of aligned next-generation sequencing reads.Bioinformatics. 2010 Nov 15;26(22):2902-3. doi: 10.1093/bioinformatics/btq531. Epub 2010 Sep 21. Bioinformatics. 2010. PMID: 20861030 Free PMC article.
-
The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets.Bioinformatics. 2009 Oct 15;25(20):2730-1. doi: 10.1093/bioinformatics/btp472. Epub 2009 Aug 4. Bioinformatics. 2009. PMID: 19654113 Free PMC article.
-
rtracklayer: an R package for interfacing with genome browsers.Bioinformatics. 2009 Jul 15;25(14):1841-2. doi: 10.1093/bioinformatics/btp328. Epub 2009 May 25. Bioinformatics. 2009. PMID: 19468054 Free PMC article.
-
Browsing (Epi)genomes: a guide to data resources and epigenome browsers for stem cell researchers.Cell Stem Cell. 2013 Jul 3;13(1):14-21. doi: 10.1016/j.stem.2013.06.006. Cell Stem Cell. 2013. PMID: 23827707 Free PMC article. Review.
-
Sparse models for correlative and integrative analysis of imaging and genetic data.J Neurosci Methods. 2014 Nov 30;237:69-78. doi: 10.1016/j.jneumeth.2014.09.001. Epub 2014 Sep 9. J Neurosci Methods. 2014. PMID: 25218561 Free PMC article. Review.
Cited by 68 articles
-
Tissue-specific usage of transposable element-derived promoters in mouse development.Genome Biol. 2020 Sep 28;21(1):255. doi: 10.1186/s13059-020-02164-3. Genome Biol. 2020. PMID: 32988383 Free PMC article.
-
Cell type-specific genome scans of DNA methylation divergence indicate an important role for transposable elements.Genome Biol. 2020 Jul 13;21(1):172. doi: 10.1186/s13059-020-02068-2. Genome Biol. 2020. PMID: 32660534 Free PMC article.
-
Conserved Small Nucleotidic Elements at the Origin of Concerted piRNA Biogenesis from Genes and lncRNAs.Cells. 2020 Jun 18;9(6):1491. doi: 10.3390/cells9061491. Cells. 2020. PMID: 32570966 Free PMC article.
-
CNN-Peaks: ChIP-Seq peak detection pipeline using convolutional neural networks that imitate human visual inspection.Sci Rep. 2020 May 13;10(1):7933. doi: 10.1038/s41598-020-64655-4. Sci Rep. 2020. PMID: 32404971 Free PMC article.
-
Global DNA Hypomethylation in Epithelial Ovarian Cancer: Passive Demethylation and Association with Genomic Instability.Cancers (Basel). 2020 Mar 24;12(3):764. doi: 10.3390/cancers12030764. Cancers (Basel). 2020. PMID: 32213861 Free PMC article.
References
-
- Bird AP. CpG-rich islands and the function of DNA methylation. Nature. 1986;321:209–213. - PubMed
-
- Giles KE, Gowher H, Ghirlando R, Jin C, Felsenfeld G. Chromatin boundaries, insulators, and long-range interactions in the nucleus. Cold Spring Harb Symp Quant Biol. 2010;75:79–85. - PubMed
-
- Bickel PJ, Brown JB, Huang H, Li Q. An overview of recent developments in genomics and associated statistical methods. Philos Transact A Math Phys Eng Sci. 2009;367:4313–4337. - PubMed
-
- Bickel PJ, Boley N, Brown JB, Huang H, Zhang NR. Subsampling methods for genomic inference. Ann Appl Stat. 2010;4:1660–1660–1697.
Publication types
MeSH terms
Substances
Grant support
LinkOut - more resources
Full Text Sources
Other Literature Sources
