Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information
- PMID: 32413515
- PMCID: PMC7393547
- DOI: 10.1016/j.gpb.2019.08.002
Abstract
Proteases are enzymes that cleave and hydrolyse the peptide bonds between two specific amino acid residues of target substrate proteins. Protease-controlled proteolysis plays a key role in the degradation and recycling of proteins, which is essential for various physiological processes. Thus, solving the substrate identification problem will have important implications for the precise understanding of functions and physiological roles of proteases, as well as for therapeutic target identification and pharmaceutical applicability. Consequently, there is a great demand for bioinformatics methods that can predict novel substrate cleavage events with high accuracy by utilizing both sequence and structural information. In this study, we present Procleave, a novel bioinformatics approach for predicting protease-specific substrates and specific cleavage sites by taking into account both their sequence and 3D structural information. Structural features of known cleavage sites were represented by discrete values using a LOWESS data-smoothing optimization method, which turned out to be critical for the performance of Procleave. The optimal approximations of all structural parameter values were encoded in a conditional random field (CRF) computational framework, alongside sequence and chemical group-based features. Here, we demonstrate the outstanding performance of Procleave through extensive benchmarking and independent tests. Procleave is capable of correctly identifying most cleavage sites in the case study. Importantly, when applied to the human structural proteome encompassing 17,628 protein structures, Procleave suggests a number of potential novel target substrates and their corresponding cleavage sites of different proteases. Procleave is implemented as a webserver and is freely accessible at http://procleave.erc.monash.edu/.
Keywords: Cleavage site prediction; Conditional random field; Machine learning; Protease; Structural determinants.
Copyright © 2020 The Authors. Published by Elsevier B.V. All rights reserved.
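The abstract states that structural features were turned into discrete values with a LOWESS data-smoothing step, but it gives no inputs, bandwidth, or binning scheme. The sketch below is one plausible reading only, not the authors' implementation: it smooths a noisy relationship between a structural parameter and an observed cleavage frequency with a single-pass LOWESS (tricube-weighted local linear fits), then maps the smoothed values onto equal-width bins. The function names, the `frac` and `n_bins` parameters, and the toy numbers are all assumptions made for illustration.

```python
def tricube(u):
    """Tricube kernel: (1 - |u|^3)^3 inside the window, 0 outside."""
    u = abs(u)
    return (1.0 - u ** 3) ** 3 if u < 1.0 else 0.0


def lowess(xs, ys, frac=0.5):
    """Single-pass LOWESS: one weighted linear fit per point.

    Simplified sketch with no robustness iterations; `frac` is the
    share of points that defines each local window (an assumption).
    """
    n = len(xs)
    k = min(n, max(2, int(round(frac * n))))
    fitted = []
    for x0 in xs:
        # Bandwidth is the distance to the k-th nearest point.
        dists = sorted(abs(x - x0) for x in xs)
        h = dists[k - 1] or 1e-12
        w = [tricube((x - x0) / h) for x in xs]
        sw = sum(w)
        mx = sum(wi * x for wi, x in zip(w, xs)) / sw
        my = sum(wi * y for wi, y in zip(w, ys)) / sw
        sxx = sum(wi * (x - mx) ** 2 for wi, x in zip(w, xs))
        sxy = sum(wi * (x - mx) * (y - my) for wi, x, y in zip(w, xs, ys))
        slope = sxy / sxx if sxx > 0 else 0.0
        fitted.append(my + slope * (x0 - mx))
    return fitted


def discretize(values, n_bins=5):
    """Map values onto equal-width bin indices 0 .. n_bins - 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


if __name__ == "__main__":
    # Toy data, invented for illustration: a structural parameter
    # (for example, relative solvent accessibility) against a noisy
    # cleavage frequency.
    accessibility = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
    frequency = [0.02, 0.05, 0.03, 0.10, 0.08, 0.15, 0.22, 0.18, 0.30, 0.35]
    smoothed = lowess(accessibility, frequency, frac=0.5)
    print(discretize(smoothed, n_bins=4))
```

Smoothing before binning means the discrete levels follow the overall trend rather than point-to-point noise. A production analysis would more likely rely on an established LOWESS implementation with robustness iterations than on this hand-rolled version.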
Similar articles
- PROSPER: an integrated feature-based tool for predicting protease substrate cleavage sites. PLoS One. 2012;7(11):e50300. doi: 10.1371/journal.pone.0050300. Epub 2012 Nov 29. PMID: 23209700. Free PMC article.
- iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites. Brief Bioinform. 2019 Mar 25;20(2):638-658. doi: 10.1093/bib/bby028. PMID: 29897410. Free PMC article. Review.
- PROSPERous: high-throughput prediction of substrate cleavage sites for 90 proteases with improved accuracy. Bioinformatics. 2018 Feb 15;34(4):684-687. doi: 10.1093/bioinformatics/btx670. PMID: 29069280. Free PMC article.
- Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods. Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077. PMID: 30184176. Free PMC article. Review.
- Bioinformatic approaches for predicting substrates of proteases. J Bioinform Comput Biol. 2011 Feb;9(1):149-78. doi: 10.1142/s0219720011005288. PMID: 21328711. Review.
Cited by
- Predicting Structural Susceptibility of Proteins to Proteolytic Processing. Int J Mol Sci. 2023 Jun 28;24(13):10761. doi: 10.3390/ijms241310761. PMID: 37445939. Free PMC article.
- As in Real Estate, Location Matters: Cellular Expression of Complement Varies Between Macular and Peripheral Regions of the Retina and Supporting Tissues. Front Immunol. 2022 Jun 15;13:895519. doi: 10.3389/fimmu.2022.895519. eCollection 2022. PMID: 35784369. Free PMC article.
- Integrating knowledge of protein sequence with protein function for the prediction and validation of new MALT1 substrates. Comput Struct Biotechnol J. 2022 Aug 19;20:4717-4732. doi: 10.1016/j.csbj.2022.08.021. eCollection 2022. PMID: 36147669. Free PMC article.
- Structure of the human heparan-α-glucosaminide N-acetyltransferase (HGSNAT). Elife. 2024 Aug 28;13:RP93510. doi: 10.7554/eLife.93510. PMID: 39196614. Free PMC article.
- Protease Activity Analysis: A Toolkit for Analyzing Enzyme Activity Data. ACS Omega. 2022 Jul 6;7(28):24292-24301. doi: 10.1021/acsomega.2c01559. eCollection 2022 Jul 19. PMID: 35874224. Free PMC article.
