Identify Bitter Peptides by Using Deep Representation Learning Features

Int J Mol Sci. 2022 Jul 17;23(14):7877. doi: 10.3390/ijms23147877.

Abstract

A bitter taste often identifies hazardous compounds and it is generally avoided by most animals and humans. Bitterness of hydrolyzed proteins is caused by the presence of bitter peptides. To improve palatability, bitter peptides need to be identified experimentally in a time-consuming and expensive process, before they can be removed or degraded. Here, we report the development of a machine learning prediction method, iBitter-DRLF, which is based on a deep learning pre-trained neural network feature extraction method. It uses three sequence embedding techniques, soft symmetric alignment (SSA), unified representation (UniRep), and bidirectional long short-term memory (BiLSTM). These were initially combined into various machine learning algorithms to build several models. After optimization, the combined features of UniRep and BiLSTM were finally selected, and the model was built in combination with a light gradient boosting machine (LGBM). The results showed that the use of deep representation learning greatly improves the ability of the model to identify bitter peptides, achieving accurate prediction based on peptide sequence data alone. By helping to identify bitter peptides, iBitter-DRLF can help research into improving the palatability of peptide therapeutics and dietary supplements in the future. A webserver is available, too.

Keywords: bitter peptide; deep representation learning; feature selection; light gradient boosting.

MeSH terms

  • Algorithms
  • Animals
  • Humans
  • Machine Learning
  • Neural Networks, Computer
  • Peptides* / chemistry
  • Taste*

Substances

  • Peptides