GOstat: find statistically overrepresented Gene Ontologies within a group of genes

Bioinformatics. 2004 Jun 12;20(9):1464-5. doi: 10.1093/bioinformatics/bth088. Epub 2004 Feb 12.


Summary: Modern experimental techniques, as for example DNA microarrays, as a result usually produce a long list of genes, which are potentially interesting in the analyzed process. In order to gain biological understanding from this type of data, it is necessary to analyze the functional annotations of all genes in this list. The Gene-Ontology (GO) database provides a useful tool to annotate and analyze the functions of a large number of genes. Here, we introduce a tool that utilizes this information to obtain an understanding of which annotations are typical for the analyzed list of genes. This program automatically obtains the GO annotations from a database and generates statistics of which annotations are overrepresented in the analyzed list of genes. This results in a list of GO terms sorted by their specificity.

Availability: Our program GOstat is accessible via the Internet at http://gostat.wehi.edu.au

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Abstracting and Indexing / methods
  • Algorithms*
  • DNA / analysis
  • DNA / chemistry
  • DNA / classification*
  • DNA / genetics
  • Databases, Genetic*
  • Documentation / methods*
  • Gene Expression Profiling / methods*
  • Internet
  • Models, Genetic
  • Models, Statistical
  • Natural Language Processing*
  • Software*


  • DNA