BioThesaurus: a web-based thesaurus of protein and gene names

Bioinformatics. 2006 Jan 1;22(1):103-5. doi: 10.1093/bioinformatics/bti749. Epub 2005 Nov 2.


BioThesaurus is a web-based system designed to map a comprehensive collection of protein and gene names to protein entries in the UniProt Knowledgebase. Currently covering more than two million proteins, BioThesaurus consists of over 2.8 million names extracted from multiple molecular biological databases according to the database cross-references in iProClass. The BioThesaurus web site allows the retrieval of synonymous names of given protein entries and the identification of protein entries sharing the same names.

Availability: BioThesaurus is accessible for online searching at

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Databases, Factual
  • Databases, Genetic
  • Databases, Protein
  • Genome
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Models, Genetic
  • Names
  • Proteins
  • Terminology as Topic
  • Vocabulary, Controlled*


  • Proteins