PATH - Prediction of Amyloidogenicity by Threading and Machine Learning
- PMID: 32382058
- PMCID: PMC7206081
- DOI: 10.1038/s41598-020-64270-3
Abstract
Amyloids are protein aggregates observed in several diseases, for example Alzheimer's and Parkinson's diseases. An aggregate has a very regular beta structure with a tightly packed core, which spontaneously assumes a steric zipper form. Experimental methods make it possible to study such peptides, but they are tedious and costly, and therefore unsuitable for genome-wide studies. Several bioinformatic methods have been proposed to evaluate the propensity of a protein to form an amyloid. However, they usually do not take the knowledge of aggregate structures into account. We propose PATH (Prediction of Amyloidogenicity by THreading), a novel structure-based method for predicting amyloidogenicity, and show that including the available structures of amyloidogenic fragments enhances classification performance. Experimental aggregate structures were used in template-based modeling to recognize the most stable representative structural class of a query peptide. Several machine learning methods were then applied to the structural models, using their energy terms as features. Finally, we identified the most important terms in the classification of amyloidogenic peptides. The proposed method outperforms most of the currently available methods for predicting amyloidogenicity, with an area under the ROC curve of 0.876. Furthermore, the method gave insight into the significance of selected structural features and into the potentially most stable structural class of a peptide fragment if subjected to crystallization.
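The reported performance figure is an area under the ROC curve. As a minimal sketch of how such a value can be computed from classifier scores, the Python below uses the rank (Mann-Whitney) formulation. The function name `roc_auc` and the toy labels and scores are hypothetical illustrations and are not taken from the paper or its data.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank (Mann-Whitney U) formulation.

    Equals the probability that a randomly chosen positive example scores
    higher than a randomly chosen negative one, counting ties as one half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data (assumption, not from the paper):
# 1 = amyloidogenic peptide, 0 = non-amyloidogenic peptide.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.1]
toy_auc = roc_auc(toy_labels, toy_scores)
```

The pairwise loop is quadratic in the number of examples, which is fine for illustration; a sort-based implementation would be preferable for large peptide sets.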
Conflict of interest statement
The authors declare no competing interests.
