Gene function analysis in complex data sets using ErmineJ

Nat Protoc. 2010 Jun;5(6):1148-59. doi: 10.1038/nprot.2010.78. Epub 2010 Jun 3.


ErmineJ is software for the analysis of functionally interesting patterns in large gene lists drawn from gene expression profiling data or other high-throughput genomics studies. It can be used by biologists with no bioinformatics background to conduct sophisticated analyses of gene sets with multiple methods. It allows users to assess whether microarray data or other gene lists are enriched for a particular pathway or gene class. This protocol provides steps on how to format data files, determine analysis type, create custom gene sets and perform specific analyses-including overrepresentation analysis, genes score resampling and correlation resampling. ErmineJ differs from other methods in providing a rapid, simple and customizable analysis, including high-level visualization through its graphical user interface and scripting tools through its command-line interface, as well as custom gene sets and a variety of statistical methods. The protocol should take approximately 1 h, including (one-time) installation and setup.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology / methods
  • Databases, Genetic
  • Electronic Data Processing
  • Gene Expression Profiling / statistics & numerical data*
  • Genomics / statistics & numerical data
  • Humans
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Software*
  • User-Computer Interface