Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May 7;10(1):7721.
doi: 10.1038/s41598-020-64270-3.

PATH - Prediction of Amyloidogenicity by Threading and Machine Learning

Affiliations

PATH - Prediction of Amyloidogenicity by Threading and Machine Learning

Jakub W Wojciechowski et al. Sci Rep. .

Abstract

Amyloids are protein aggregates observed in several diseases, for example in Alzheimer's and Parkinson's diseases. An aggregate has a very regular beta structure with a tightly packed core, which spontaneously assumes a steric zipper form. Experimental methods enable studying such peptides, however they are tedious and costly, therefore inappropriate for genomewide studies. Several bioinformatic methods have been proposed to evaluate protein propensity to form an amyloid. However, the knowledge of aggregate structures is usually not taken into account. We propose PATH (Prediction of Amyloidogenicity by THreading) - a novel structure-based method for predicting amyloidogenicity and show that involving available structures of amyloidogenic fragments enhances classification performance. Experimental aggregate structures were used in templatebased modeling to recognize the most stable representative structural class of a query peptide. Several machine learning methods were then applied on the structural models, using their energy terms. Finally, we identified the most important terms in classification of amyloidogenic peptides. The proposed method outperforms most of the currently available methods for predicting amyloidogenicity, with its area under ROC curve equal to 0.876. Furthermore, the method gave insight into significance of selected structural features and the potentially most stable structural class of a peptide fragment if subjected to crystallization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Prediction procedure. Using comparative modeling, query sequence was threaded into seven templates representing different structural classes (class numbering in accordance with). For each of them ten models were proposed and the model with the lowest DOPE score was selected. For this model REF15 and selected PyRosetta energy terms were calculated and used as an input for machine learning classifiers.
Figure 2
Figure 2
DOPE score of amyloidogenic and non-amyloidogenic sequences threaded onto steric zipper structures for the benchmark set.
Figure 3
Figure 3
ROC curves for compared machine learning methods, for two test sets.
Figure 4
Figure 4
Coefficients of logistic regression model for energy terms.
Figure 5
Figure 5
Mean z-scores for energy terms obtained from Boruta feature selection method.

Similar articles

Cited by

References

    1. Eisenberg D, Jucker M. The amyloid state of proteins in human diseases. Cell. 2012;148:1188–1203. doi: 10.1016/j.cell.2012.02.022. - DOI - PMC - PubMed
    1. McGlinchey RP, Lee JC. Why study functional amyloids? Lessons from the repeat domain of pmel17. J. molecular biology. 2018;430:3696–3706. doi: 10.1016/j.jmb.2018.06.011. - DOI - PMC - PubMed
    1. de la Paz ML, Serrano L. Sequence determinants of amyloid fibril formation. Proc. Natl. Acad. Sci. 2004;101:87–92. doi: 10.1073/pnas.0308594100. - DOI - PMC - PubMed
    1. Eisenberg DS, Sawaya MR. Structural studies of amyloid proteins at the molecular level. Annu. review biochemistry. 2017;86:69–95. doi: 10.1146/annurev-biochem-061516-045104. - DOI - PubMed
    1. Howie AJ, Brewer DB. Optical properties of amyloid stained by congo red: history and mechanisms. Micron. 2009;40:285–301. doi: 10.1016/j.micron.2008.10.002. - DOI - PubMed

MeSH terms