PredAmyl-MLP: Prediction of Amyloid Proteins Using Multilayer Perceptron

Comput Math Methods Med. 2020 Nov 20:2020:8845133. doi: 10.1155/2020/8845133. eCollection 2020.

Abstract

Amyloid is generally an aggregate of insoluble fibrin; its abnormal deposition is the pathogenic mechanism of various diseases, such as Alzheimer's disease and type II diabetes. Therefore, accurately identifying amyloid is necessary to understand its role in pathology. We proposed a machine learning-based prediction model called PredAmyl-MLP, which consists of the following three steps: feature extraction, feature selection, and classification. In the step of feature extraction, seven feature extraction algorithms and different combinations of them are investigated, and the combination of SVMProt-188D and tripeptide composition (TPC) is selected according to the experimental results. In the step of feature selection, maximum relevant maximum distance (MRMD) and binomial distribution (BD) are, respectively, used to remove the redundant or noise features, and the appropriate features are selected according to the experimental results. In the step of classification, we employed multilayer perceptron (MLP) to train the prediction model. The 10-fold cross-validation results show that the overall accuracy of PredAmyl-MLP reached 91.59%, and the performance was better than the existing methods.

MeSH terms

  • Algorithms
  • Alzheimer Disease / etiology
  • Alzheimer Disease / metabolism
  • Amino Acid Sequence
  • Amino Acids / chemistry
  • Amyloidogenic Proteins / chemistry*
  • Amyloidogenic Proteins / genetics
  • Computational Biology
  • Diabetes Mellitus, Type 2 / etiology
  • Diabetes Mellitus, Type 2 / metabolism
  • Humans
  • Machine Learning
  • Mathematical Concepts
  • Neural Networks, Computer*
  • Support Vector Machine

Substances

  • Amino Acids
  • Amyloidogenic Proteins