InterPro--an integrated documentation resource for protein families, domains and functional sites

Bioinformatics. 2000 Dec;16(12):1145-50. doi: 10.1093/bioinformatics/16.12.1145.

Abstract

Motivation: InterPro is a new integrated documentation resource for protein families, domains and functional sites, developed initially as a means of rationalising the complementary efforts of the PROSITE, PRINTS, Pfam and ProDom database projects.

Results: Merged annotations from PRINTS, PROSITE and Pfam form the InterPro core. Each combined InterPro entry includes functional descriptions and literature references, and links are made back to the relevant parent database(s), allowing users to see at a glance whether a particular family or domain has associated patterns, profiles, fingerprints, etc. Merged and individual entries (i.e. those that have no counterpart in the companion resources) are assigned unique accession numbers. Release 1.2 of InterPro (June 2000) contains over 3,000 entries, representing families, domains, repeats and sites of post-translational modification (PTMs) encoded by 6,581 different regular expressions, profiles, fingerprints and Hidden Markov Models (HMMs). Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (more than 1,000,000 hits from 264,333 different proteins out of 384,572 in SWISS-PROT and TrEMBL).
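The release figures quoted above imply a few ratios that the abstract does not state directly. The following is a minimal sketch of that arithmetic, not part of the original publication. The variable names are illustrative, and because the abstract gives "more than" and "over" for two of the counts, the corresponding ratios are only bounds, not exact values.

```python
# Figures quoted in the abstract for InterPro release 1.2 (June 2000).
MATCHED_PROTEINS = 264_333   # proteins with at least one InterPro hit
TOTAL_PROTEINS = 384_572     # proteins in SWISS-PROT and TrEMBL combined
MIN_HITS = 1_000_000         # "more than 1000000 hits": a lower bound
SIGNATURES = 6_581           # regular expressions, profiles, fingerprints, HMMs
MIN_ENTRIES = 3_000          # "over 3000 entries": a lower bound

# Fraction of SWISS-PROT + TrEMBL proteins matched by at least one entry.
coverage = MATCHED_PROTEINS / TOTAL_PROTEINS

# Lower bound on the average number of hits per matched protein
# (the true hit count exceeds MIN_HITS).
min_hits_per_matched_protein = MIN_HITS / MATCHED_PROTEINS

# Upper bound on the average number of signatures per entry
# (the true entry count exceeds MIN_ENTRIES).
max_signatures_per_entry = SIGNATURES / MIN_ENTRIES

print(f"Protein coverage: {coverage:.1%}")
print(f"Hits per matched protein: at least {min_hits_per_matched_protein:.2f}")
print(f"Signatures per entry: at most {max_signatures_per_entry:.2f}")
```

Under these figures, roughly two thirds of the proteins in the two databases carry at least one InterPro match, and each entry is backed on average by about two member-database signatures.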

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • Computer Graphics
  • Databases, Factual*
  • Internet
  • Proteins / chemistry*
  • Proteins / genetics
  • Software

Substances

  • Proteins