Exploring massive, genome scale datasets with the GenometriCorr package

PLoS Comput Biol. 2012 May;8(5):e1002529. doi: 10.1371/journal.pcbi.1002529. Epub 2012 May 31.


We have created a statistically grounded tool for determining the correlation of genome-wide data with other datasets or known biological features. It is intended to guide biological exploration of high-dimensional datasets rather than to provide immediate answers. The software enables several biologically motivated approaches to these data, and here we describe the rationale and implementation for each approach. Our models and statistics are implemented in an R package that efficiently calculates the spatial correlation between two sets of genomic intervals (data and/or annotated features), for use as a metric of functional interaction. The software handles any type of pointwise or interval data. Instead of running analyses with predefined metrics, it computes the significance and direction of several types of spatial association, which is intended to suggest potentially relevant relationships between the datasets.
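As a rough illustration of what a spatial association measure between two sets of genomic positions can look like, the sketch below computes a relative-distance statistic in Python. It is not the GenometriCorr API and is not claimed to be the package's implementation. The function names `relative_distances` and `ks_statistic_uniform` are hypothetical, and the sketch assumes point features on a single chromosome (an interval could be reduced to its midpoint first). For each query point that falls between two neighbouring reference points, the distance to the nearer neighbour is divided by the gap between the two. If the two sets are positioned independently, these values should be roughly uniform on [0, 0.5], so a large deviation from uniformity hints at attraction or avoidance.

```python
from bisect import bisect_right


def relative_distances(query, reference):
    """Relative distance of each query point to its flanking reference points.

    Query points that do not lie between two reference points are skipped.
    """
    ref = sorted(reference)
    out = []
    for q in query:
        i = bisect_right(ref, q)  # ref[i - 1] <= q < ref[i]
        if i == 0 or i == len(ref):
            continue  # no flanking pair of reference points
        left, right = ref[i - 1], ref[i]
        out.append(min(q - left, right - q) / (right - left))
    return out


def ks_statistic_uniform(values, upper=0.5):
    """Kolmogorov-Smirnov distance between values and Uniform(0, upper)."""
    xs = sorted(values)
    n = len(xs)
    d = 0.0
    for k, x in enumerate(xs):
        cdf = x / upper  # uniform CDF at x
        # Compare against the empirical CDF just after and just before x.
        d = max(d, abs((k + 1) / n - cdf), abs(cdf - k / n))
    return d


# Hypothetical positions (base pairs) on one chromosome.
reference_points = [0, 100, 200]
query_points = [0, 50, 125, 190, 250]
rel = relative_distances(query_points, reference_points)
stat = ks_statistic_uniform(rel)
```

The statistic alone does not give a significance level. In practice it would be compared against a null distribution, for example by permuting or randomly shifting the query positions, and the sign of the deviation would indicate whether query points cluster near reference points or avoid them.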

Availability and implementation: The package, GenometriCorr, can be freely downloaded at http://genometricorr.sourceforge.net/. Installation guidelines and examples are available from the SourceForge repository. The package is pending submission to Bioconductor.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromosomes
  • Databases, Genetic*
  • Epigenomics
  • Genetic Loci
  • Genome
  • Genomics / methods*
  • Humans
  • Information Storage and Retrieval*
  • Internet
  • Models, Genetic*
  • Models, Statistical*
  • RNA, Transfer / genetics
  • Software*
  • Statistics, Nonparametric
  • User-Computer Interface


Substances

  • RNA, Transfer