MEnTaT: A machine-learning approach for the identification of mutations to increase protein stability

Proc Natl Acad Sci U S A. 2023 Dec 5;120(49):e2309884120. doi: 10.1073/pnas.2309884120. Epub 2023 Dec 1.

Abstract

Enhancing protein thermal stability is important for biomedical and industrial applications as well as in the research laboratory. Here, we describe a simple machine-learning method which identifies amino acid substitutions that contribute to thermal stability based on comparison of the amino acid sequences of homologous proteins derived from bacteria that grow at different temperatures. A key feature of the method is that it compares the sequences based not simply on the amino acid identity, but rather on the structural and physicochemical properties of the side chain. The method accurately identified stabilizing substitutions in three well-studied systems and was validated prospectively by experimentally testing predicted stabilizing substitutions in a polyamine oxidase. In each case, the method outperformed the widely used bioinformatic consensus approach. The method can also provide insight into fundamental aspects of protein structure, for example, by identifying how many sequence positions in a given protein are relevant to temperature adaptation.

Keywords: PLS-DA; esterase; kinase; oxidase; thermostability.

MeSH terms

  • Amino Acid Sequence
  • Enzyme Stability
  • Machine Learning*
  • Mutation
  • Protein Stability
  • Proteins* / genetics

Substances

  • Mentat
  • Proteins