Framework for a protein ontology

BMC Bioinformatics. 2007 Nov 27;8 Suppl 9(Suppl 9):S1. doi: 10.1186/1471-2105-8-S9-S1.


Biomedical ontologies are emerging as critical tools in genomic and proteomic research, where complex data in disparate resources need to be integrated. A number of ontologies describe properties that can be attributed to proteins. For example, protein functions are described by the Gene Ontology (GO) and human diseases by SNOMED CT or ICD10. There is, however, a gap in the current set of ontologies - one that describes the protein entities themselves and their relationships. We have designed the PRotein Ontology (PRO) to facilitate protein annotation and to guide new experiments. The components of PRO extend from the classification of proteins on the basis of evolutionary relationships to the representation of the multiple protein forms of a gene (products generated by genetic variation, alternative splicing, proteolytic cleavage, and other post-translational modifications). PRO will allow the specification of relationships between PRO, GO and other ontologies in the OBO Foundry. Here we describe the initial development of PRO, illustrated using human and mouse proteins involved in the transforming growth factor-beta and bone morphogenetic protein signaling pathways.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Database Management Systems*
  • Databases, Protein*
  • Evolution, Molecular*
  • Information Storage and Retrieval / methods*
  • Proteins* / chemistry
  • Proteins* / classification
  • Proteins* / genetics
  • Proteins* / metabolism
  • Sequence Alignment / methods
  • Sequence Analysis / methods*
  • User-Computer Interface*


  • Proteins