Metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure
- PMID: 34480923
- PMCID: PMC8553642
- DOI: 10.1016/j.bpj.2021.08.039
Metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure
Abstract
Intrinsically disordered proteins and protein regions make up a substantial fraction of many proteomes in which they play a wide variety of essential roles. A critical first step in understanding the role of disordered protein regions in biological function is to identify those disordered regions correctly. Computational methods for disorder prediction have emerged as a core set of tools to guide experiments, interpret results, and develop hypotheses. Given the multiple different predictors available, consensus scores have emerged as a popular approach to mitigate biases or limitations of any single method. Consensus scores integrate the outcome of multiple independent disorder predictors and provide a per-residue value that reflects the number of tools that predict a residue to be disordered. Although consensus scores help mitigate the inherent problems of using any single disorder predictor, they are computationally expensive to generate. They also necessitate the installation of multiple different software tools, which can be prohibitively difficult. To address this challenge, we developed a deep-learning-based predictor of consensus disorder scores. Our predictor, metapredict, utilizes a bidirectional recurrent neural network trained on the consensus disorder scores from 12 proteomes. By benchmarking metapredict using two orthogonal approaches, we found that metapredict is among the most accurate disorder predictors currently available. Metapredict is also remarkably fast, enabling proteome-scale disorder prediction in minutes. Importantly, metapredict is a fully open source and is distributed as a Python package, a collection of command-line tools, and a web server, maximizing the potential practical utility of the predictor. We believe metapredict offers a convenient, accessible, accurate, and high-performance predictor for single-proteins and proteomes alike.
Copyright © 2021 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Figures
Similar articles
-
DEPICTER: Intrinsic Disorder and Disorder Function Prediction Server.J Mol Biol. 2020 May 15;432(11):3379-3387. doi: 10.1016/j.jmb.2019.12.030. Epub 2019 Dec 21. J Mol Biol. 2020. PMID: 31870849
-
DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach.PLoS One. 2015 Jun 19;10(6):e0128334. doi: 10.1371/journal.pone.0128334. eCollection 2015. PLoS One. 2015. PMID: 26090958 Free PMC article.
-
Rapid prediction and analysis of protein intrinsic disorder.Protein Sci. 2022 Dec;31(12):e4496. doi: 10.1002/pro.4496. Protein Sci. 2022. PMID: 36334049 Free PMC article.
-
Tutorial: a guide for the selection of fast and accurate computational tools for the prediction of intrinsic disorder in proteins.Nat Protoc. 2023 Nov;18(11):3157-3172. doi: 10.1038/s41596-023-00876-x. Epub 2023 Sep 22. Nat Protoc. 2023. PMID: 37740110 Review.
-
Computational prediction of functions of intrinsically disordered regions.Prog Mol Biol Transl Sci. 2019;166:341-369. doi: 10.1016/bs.pmbts.2019.04.006. Epub 2019 May 20. Prog Mol Biol Transl Sci. 2019. PMID: 31521235 Review.
Cited by
-
Loss of non-motor kinesin KIF26A causes congenital brain malformations via dysregulated neuronal migration and axonal growth as well as apoptosis.Dev Cell. 2022 Oct 24;57(20):2381-2396.e13. doi: 10.1016/j.devcel.2022.09.011. Epub 2022 Oct 12. Dev Cell. 2022. PMID: 36228617 Free PMC article.
-
Structure of the N-RNA/P interface indicates mode of L/P recruitment to the nucleocapsid of human metapneumovirus.Nat Commun. 2023 Nov 22;14(1):7627. doi: 10.1038/s41467-023-43434-5. Nat Commun. 2023. PMID: 37993464 Free PMC article.
-
ParSe 2.0: A web tool to identify drivers of protein phase separation at the proteome level.Protein Sci. 2023 Sep;32(9):e4756. doi: 10.1002/pro.4756. Protein Sci. 2023. PMID: 37574757 Free PMC article.
-
PARROT is a flexible recurrent neural network framework for analysis of large protein datasets.Elife. 2021 Sep 17;10:e70576. doi: 10.7554/eLife.70576. Elife. 2021. PMID: 34533455 Free PMC article.
-
Self-association and multimer formation in AtLEA4-5, a desiccation-induced intrinsically disordered protein from plants.Protein Sci. 2024 Nov;33(11):e5192. doi: 10.1002/pro.5192. Protein Sci. 2024. PMID: 39467203
References
-
- Sormanni P., Piovesan D., Vendruscolo M. Simultaneous quantification of protein order and disorder. Nat. Chem. Biol. 2017;13:339–342. - PubMed
-
- Bottaro S., Lindorff-Larsen K. Biophysical experiments and biomolecular simulations: a perfect match? Science. 2018;361:355–360. - PubMed
-
- Henzler-Wildman K., Kern D. Dynamic personalities of proteins. Nature. 2007;450:964–972. - PubMed
-
- Wright P.E., Dyson H.J. Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J. Mol. Biol. 1999;293:321–331. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
