SCOPEC: a database of protein catalytic domains
- PMID: 15262791
- DOI: 10.1093/bioinformatics/bth948
SCOPEC: a database of protein catalytic domains
Abstract
Motivation: Domains are the units of protein structure, function and evolution. It is therefore essential to utilize knowledge of domains when studying the evolution of function, or when assigning function to genome sequence data. For this purpose, we have developed a database of catalytic domains, SCOPEC, by combining structural domain information from SCOP, full-length sequence information from Swiss-Prot, and verified functional information from the Enzyme Classification (EC) database. Two major problems need to be overcome to create a database of domain-function relationships; (1) for sequences, EC numbers are typically assigned to whole sequences rather than the functional unit, and (2) The Protein Data Bank (PDB) structures elucidated from a larger multi-domain protein will often have EC annotation although the relevant catalytic domain may lie elsewhere.
Results: SCOPEC entries have high quality enzyme assignments; having passed both computational and manual checks. SCOPEC currently contains entries for 75% of all EC annotations in the PDB. Overall, EC number is fairly well conserved within a superfamily, even when the proteins are distantly related. Initial analysis is encouraging; suggesting that there is a 50:50 chance of conserved function in distant homologues first detected by a third iteration PSI-BLAST search. Therefore, we envisage that a knowledge-based approach to function assignment using the domain-EC relationships in SCOPEC will gain a marked improvement over this base line.
Availability: The SCOPEC database is a valuable resource in the analysis and prediction of protein structure and function. It can be obtained or queried at our website http://www.enzome.com
Similar articles
-
PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics.BMC Bioinformatics. 2006 Feb 6;7:53. doi: 10.1186/1471-2105-7-53. BMC Bioinformatics. 2006. PMID: 16460560 Free PMC article.
-
ProtBuD: a database of biological unit structures of protein families and superfamilies.Bioinformatics. 2006 Dec 1;22(23):2876-82. doi: 10.1093/bioinformatics/btl490. Epub 2006 Oct 2. Bioinformatics. 2006. PMID: 17018535
-
PIBASE: a comprehensive database of structurally defined protein interfaces.Bioinformatics. 2005 May 1;21(9):1901-7. doi: 10.1093/bioinformatics/bti277. Epub 2005 Jan 18. Bioinformatics. 2005. PMID: 15657096
-
Automatic annotation of protein function.Curr Opin Struct Biol. 2005 Jun;15(3):267-74. doi: 10.1016/j.sbi.2005.05.010. Curr Opin Struct Biol. 2005. PMID: 15922590 Review.
-
Large-scale database searching using tandem mass spectra: looking up the answer in the back of the book.Nat Methods. 2004 Dec;1(3):195-202. doi: 10.1038/nmeth725. Nat Methods. 2004. PMID: 15789030 Review.
Cited by
-
Functional Analogy in Human Metabolism: Enzymes with Different Biological Roles or Functional Redundancy?Genome Biol Evol. 2017 Jun 1;9(6):1624-1636. doi: 10.1093/gbe/evx119. Genome Biol Evol. 2017. PMID: 28854631 Free PMC article.
-
Quantitative comparison of catalytic mechanisms and overall reactions in convergently evolved enzymes: implications for classification of enzyme function.PLoS Comput Biol. 2010 Mar 12;6(3):e1000700. doi: 10.1371/journal.pcbi.1000700. PLoS Comput Biol. 2010. PMID: 20300652 Free PMC article.
-
Evolution of the α-Subunit of Na/K-ATPase from Paramecium to Homo sapiens: Invariance of Transmembrane Helix Topology.J Mol Evol. 2016 May;82(4-5):183-98. doi: 10.1007/s00239-016-9732-1. Epub 2016 Mar 10. J Mol Evol. 2016. PMID: 26961431 Free PMC article.
-
PDB-UF: database of predicted enzymatic functions for unannotated protein structures from structural genomics.BMC Bioinformatics. 2006 Feb 6;7:53. doi: 10.1186/1471-2105-7-53. BMC Bioinformatics. 2006. PMID: 16460560 Free PMC article.
-
NICEdrug.ch, a workflow for rational drug design and systems-level analysis of drug metabolism.Elife. 2021 Aug 3;10:e65543. doi: 10.7554/eLife.65543. Elife. 2021. PMID: 34340747 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
