IsoBase: a database of functionally related proteins across PPI networks

Nucleic Acids Res. 2011 Jan;39(Database issue):D295-300. doi: 10.1093/nar/gkq1234.

Abstract

We describe IsoBase, a database identifying functionally related proteins, across five major eukaryotic model organisms: Saccharomyces cerevisiae, Drosophila melanogaster, Caenorhabditis elegans, Mus musculus and Homo Sapiens. Nearly all existing algorithms for orthology detection are based on sequence comparison. Although these have been successful in orthology prediction to some extent, we seek to go beyond these methods by the integration of sequence data and protein-protein interaction (PPI) networks to help in identifying true functionally related proteins. With that motivation, we introduce IsoBase, the first publicly available ortholog database that focuses on functionally related proteins. The groupings were computed using the IsoRankN algorithm that uses spectral methods to combine sequence and PPI data and produce clusters of functionally related proteins. These clusters compare favorably with those from existing approaches: proteins within an IsoBase cluster are more likely to share similar Gene Ontology (GO) annotation. A total of 48,120 proteins were clustered into 12,693 functionally related groups. The IsoBase database may be browsed for functionally related proteins across two or more species and may also be queried by accession numbers, species-specific identifiers, gene name or keyword. The database is freely available for download at http://isobase.csail.mit.edu/.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Caenorhabditis elegans Proteins / chemistry
  • Caenorhabditis elegans Proteins / genetics
  • Caenorhabditis elegans Proteins / metabolism
  • Cluster Analysis
  • Databases, Protein*
  • Drosophila Proteins / chemistry
  • Drosophila Proteins / genetics
  • Drosophila Proteins / metabolism
  • Drosophila melanogaster
  • Humans
  • Mice
  • Protein Interaction Mapping
  • Saccharomyces cerevisiae Proteins / chemistry
  • Saccharomyces cerevisiae Proteins / genetics
  • Saccharomyces cerevisiae Proteins / metabolism
  • Sequence Homology, Amino Acid*
  • User-Computer Interface

Substances

  • Caenorhabditis elegans Proteins
  • Drosophila Proteins
  • Saccharomyces cerevisiae Proteins