Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models

Nat Commun. 2018 Dec 7;9(1):5252. doi: 10.1038/s41467-018-07652-6.

Abstract

Knowing the catalytic turnover numbers of enzymes is essential for understanding the growth rate, proteome composition, and physiology of organisms, but experimental data on enzyme turnover numbers is sparse and noisy. Here, we demonstrate that machine learning can successfully predict catalytic turnover numbers in Escherichia coli based on integrated data on enzyme biochemistry, protein structure, and network context. We identify a diverse set of features that are consistently predictive for both in vivo and in vitro enzyme turnover rates, revealing novel protein structural correlates of catalytic turnover. We use our predictions to parameterize two mechanistic genome-scale modelling frameworks for proteome-limited metabolism, leading to significantly higher accuracy in the prediction of quantitative proteome data than previous approaches. The presented machine learning models thus provide a valuable tool for understanding metabolism and the proteome at the genome scale, and elucidate structural, biochemical, and network properties that underlie enzyme kinetics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biocatalysis
  • Escherichia coli / enzymology*
  • Escherichia coli / genetics
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism*
  • Kinetics
  • Machine Learning*
  • Metabolic Networks and Pathways*
  • Models, Biological
  • Proteome / genetics
  • Proteome / metabolism

Substances

  • Escherichia coli Proteins
  • Proteome