Enzyme annotation for orphan and novel reactions using knowledge of substrate reactive sites
- PMID: 30910961
- PMCID: PMC6462048
- DOI: 10.1073/pnas.1818877116
Abstract
Thousands of biochemical reactions with characterized activities are "orphan," meaning they cannot be assigned to a specific enzyme, leaving gaps in metabolic pathways. Novel reactions predicted by pathway-generation tools also lack associated sequences, limiting protein engineering applications. Associating orphan and novel reactions with known biochemistry and suggesting enzymes to catalyze them is a daunting problem. We propose the method BridgIT to identify candidate genes and catalyzing proteins for these reactions. This method introduces information about the enzyme binding pocket into reaction-similarity comparisons. BridgIT assesses the similarity of two reactions, one orphan and one well-characterized nonorphan reaction, using their substrate reactive sites, their surrounding structures, and the structures of the generated products to suggest enzymes that catalyze the most-similar nonorphan reactions as candidates for also catalyzing the orphan ones. We performed two large-scale validation studies to test BridgIT predictions against experimental biochemical evidence. For the 234 orphan reactions from the Kyoto Encyclopedia of Genes and Genomes (KEGG) 2011 (a comprehensive enzymatic-reaction database) that became nonorphan in KEGG 2018, BridgIT predicted the exact or a highly related enzyme for 211 of them. Moreover, for 334 of 379 novel reactions in 2014 that were later cataloged in KEGG 2018, BridgIT predicted the exact or highly similar enzymes. BridgIT requires knowledge about only four connecting bonds around the atoms of the reactive sites to correctly annotate proteins for 93% of analyzed enzymatic reactions. Increasing to seven connecting bonds allowed for the accurate identification of a sequence for nearly all known enzymatic reactions.
Keywords: novel (de novo) reactions; orphan reactions; reaction similarity; reactive site recognition; sequence similarity.
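The comparison described in the abstract, scoring an orphan reaction against well-characterized nonorphan reactions by the structure found within a given number of connecting bonds of the reactive site, can be illustrated with a minimal sketch. This is not BridgIT's implementation: the abstract does not specify the fingerprint format or the scoring function, so the layered fragment sets, the Tanimoto (Jaccard) overlap, the per-layer averaging, and all names below (`Fingerprint`, `reaction_similarity`, `rank_candidates`, `ref_A`, `ref_B`) are assumptions chosen for illustration, and the fragment strings are toy data.

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical representation (an assumption, not BridgIT's format):
# a fingerprint maps a bond distance from the reactive site
# (0 = the reactive-site atoms themselves) to the set of fragment
# descriptors found at that distance.
Fingerprint = Dict[int, FrozenSet[str]]


def tanimoto(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Tanimoto (Jaccard) overlap of two fragment sets."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def reaction_similarity(orphan: Fingerprint, reference: Fingerprint,
                        max_bonds: int = 7) -> float:
    """Average per-layer Tanimoto over distances 0..max_bonds.

    Layers that are empty in both fingerprints are skipped so they
    do not inflate the score.
    """
    scores = []
    for distance in range(max_bonds + 1):
        a = orphan.get(distance, frozenset())
        b = reference.get(distance, frozenset())
        if a or b:
            scores.append(tanimoto(a, b))
    return sum(scores) / len(scores) if scores else 0.0


def rank_candidates(orphan: Fingerprint,
                    references: Dict[str, Fingerprint],
                    max_bonds: int = 7) -> List[Tuple[str, float]]:
    """Rank nonorphan reactions by similarity to the orphan reaction,
    highest score first (ties broken by name)."""
    scored = [(name, reaction_similarity(orphan, fp, max_bonds))
              for name, fp in references.items()]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Toy data: fragment strings and reaction names are made up.
ORPHAN: Fingerprint = {
    0: frozenset({"C-O"}),
    1: frozenset({"C-C", "C=O"}),
}
REFERENCES: Dict[str, Fingerprint] = {
    "ref_A": {0: frozenset({"C-O"}), 1: frozenset({"C-C", "C=O"})},
    "ref_B": {0: frozenset({"C-O"}), 1: frozenset({"C-N"})},
}

if __name__ == "__main__":
    for name, score in rank_candidates(ORPHAN, REFERENCES):
        print(f"{name}: {score:.2f}")
```

In this sketch, the enzymes annotated for the top-ranked nonorphan reaction would be the candidates suggested for the orphan reaction. Lowering `max_bonds` restricts the comparison to structure closer to the reactive site, which loosely mirrors the abstract's contrast between four and seven connecting bonds.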
Copyright © 2019 the Author(s). Published by PNAS.
Conflict of interest statement
The authors declare no conflict of interest.
