Development of human protein reference database as an initial platform for approaching systems biology in humans

Genome Res. 2003 Oct;13(10):2363-71. doi: 10.1101/gr.1680803.


Human Protein Reference Database (HPRD) is an object database that integrates a wealth of information relevant to the function of human proteins in health and disease. Data pertaining to thousands of protein-protein interactions, posttranslational modifications, enzyme/substrate relationships, disease associations, tissue expression, and subcellular localization were extracted from the literature for a nonredundant set of 2750 human proteins. Almost all the information was obtained manually by biologists who read and interpreted >300,000 published articles during the annotation process. This database, which has an intuitive query interface allowing easy access to all the features of proteins, was built by using open source technologies and will be freely available to the academic community. This unified bioinformatics platform will be useful in cataloging and mining the large number of proteomic interactions and alterations that will be discovered in the postgenomic era.

MeSH terms

  • BRCA1 Protein / physiology
  • Computational Biology / methods
  • Databases, Protein / trends*
  • Genetics, Medical / methods
  • Humans
  • Macromolecular Substances
  • Protein Interaction Mapping / trends
  • Protein Processing, Post-Translational / physiology
  • Protein Structure, Quaternary / physiology
  • Protein Structure, Tertiary / physiology
  • Substrate Specificity / physiology


Substances

  • BRCA1 Protein
  • Macromolecular Substances