TopModel: Template-Based Protein Structure Prediction at Low Sequence Identity Using Top-Down Consensus and Deep Neural Networks
- PMID: 31967823
- DOI: 10.1021/acs.jctc.9b00825
TopModel: Template-Based Protein Structure Prediction at Low Sequence Identity Using Top-Down Consensus and Deep Neural Networks
Abstract
Knowledge of protein structures is essential to understand proteins' functions, evolution, dynamics, stabilities, and interactions and for data-driven protein- or drug design. Yet, experimental structure determination rates are far exceeded by that of next-generation sequencing, resulting in less than 1/1000th of proteins having an experimentally known 3D structure. Computational structure prediction seeks to alleviate this problem, and the Critical Assessment of Protein Structure Prediction (CASP) has shown the value of consensus and meta-methods that utilize complementary algorithms. However, traditionally, such methods employ majority voting during template selection and model averaging during refinement, which can drive the model away from the native fold if it is underrepresented in the ensemble. Here, we present TopModel, a fully automated meta-method for protein structure prediction. In contrast to traditional consensus and meta-methods, TopModel uses top-down consensus and deep neural networks to select templates and identify and correct wrongly modeled regions. TopModel combines a broad range of state-of-the-art methods for threading, alignment, and model quality estimation and provides a versatile workflow and toolbox for template-based structure prediction. TopModel shows a superior template selection, alignment accuracy, and model quality for template-based structure prediction on the CASP10-12 datasets compared to 12 state-of-the-art stand-alone primary predictors. TopModel was validated by prospective predictions of the nisin resistance protein (NSR) protein from Streptococcus agalactiae and LipoP from Clostridium difficile, showing far better agreement with experimental data than any of its constituent primary predictors. These results, in general, demonstrate the utility of TopModel for protein structure prediction and, in particular, show how combining computational structure prediction with sparse or low-resolution experimental data can improve the final model.
Similar articles
-
TopSuite Web Server: A Meta-Suite for Deep-Learning-Based Protein Structure and Quality Prediction.J Chem Inf Model. 2021 Feb 22;61(2):548-553. doi: 10.1021/acs.jcim.0c01202. Epub 2021 Jan 19. J Chem Inf Model. 2021. PMID: 33464891
-
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5. BMC Struct Biol. 2009. PMID: 19183478 Free PMC article.
-
Protein structure prediction of CASP5 comparative modeling and fold recognition targets using consensus alignment approach and 3D assessment.Proteins. 2003;53 Suppl 6:410-7. doi: 10.1002/prot.10548. Proteins. 2003. PMID: 14579329
-
A guide to template based structure prediction.Curr Protein Pept Sci. 2009 Jun;10(3):270-85. doi: 10.2174/138920309788452182. Curr Protein Pept Sci. 2009. PMID: 19519455 Review.
-
Mass spectrometry coupled experiments and protein structure modeling methods.Int J Mol Sci. 2013 Oct 15;14(10):20635-57. doi: 10.3390/ijms141020635. Int J Mol Sci. 2013. PMID: 24132151 Free PMC article. Review.
Cited by
-
Enzyme Databases in the Era of Omics and Artificial Intelligence.Int J Mol Sci. 2023 Nov 29;24(23):16918. doi: 10.3390/ijms242316918. Int J Mol Sci. 2023. PMID: 38069254 Free PMC article. Review.
-
The cyclophilin A-binding loop of the capsid regulates the human TRIM5α sensitivity of nonpandemic HIV-1.Proc Natl Acad Sci U S A. 2023 Nov 28;120(48):e2306374120. doi: 10.1073/pnas.2306374120. Epub 2023 Nov 20. Proc Natl Acad Sci U S A. 2023. PMID: 37983491
-
A step forward to the optimized HlyA type 1 secretion system through directed evolution.Appl Microbiol Biotechnol. 2023 Aug;107(16):5131-5143. doi: 10.1007/s00253-023-12653-7. Epub 2023 Jul 5. Appl Microbiol Biotechnol. 2023. PMID: 37405436 Free PMC article.
-
TopEnzyme: a framework and database for structural coverage of the functional enzyme space.Bioinformatics. 2023 Mar 1;39(3):btad116. doi: 10.1093/bioinformatics/btad116. Bioinformatics. 2023. PMID: 36883717 Free PMC article.
-
Pathogen Resistance Depending on Jacalin-Dirigent Chimeric Proteins Is Common among Poaceae but Absent in the Dicot Arabidopsis as Evidenced by Analysis of Homologous Single-Domain Proteins.Plants (Basel). 2022 Dec 23;12(1):67. doi: 10.3390/plants12010067. Plants (Basel). 2022. PMID: 36616196 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Miscellaneous