Knowledge extraction from literature and enzyme sequences complements FBA analysis in metabolic engineering

Biotechnol J. 2021 Dec;16(12):e2000443. doi: 10.1002/biot.202000443. Epub 2021 Sep 29.

Abstract

Flux balance analysis (FBA) using genome-scale metabolic model (GSM) is a useful method for improving the bio-production of useful compounds. However, FBA often does not impose important constraints such as nutrients uptakes, by-products excretions and gases (oxygen and carbon dioxide) transfers. Furthermore, important information on metabolic engineering such as enzyme amounts, activities, and characteristics caused by gene expression and enzyme sequences is basically not included in GSM. Therefore, simple FBA is often not sufficient to search for metabolic manipulation strategies that are useful for improving the production of target compounds. In this study, we proposed a method using literature and enzyme search to complement the FBA-based metabolic manipulation strategies. As a case study, this method was applied to shikimic acid production by Corynebacterium glutamicum to verify its usefulness. As unique strategies in literature-mining, overexpression of the transcriptional regulator SugR and gene disruption related to by-products productions were complemented. In the search for alternative enzyme sequences, it was suggested that those candidates are searched for from various species based on features captured by deep learning, which are not simply homologous to amino acid sequences of the base enzymes.

Keywords: bio-production; flux balance analysis; genome-scale metabolic model; literature mining.

MeSH terms

  • Corynebacterium glutamicum* / genetics
  • Metabolic Engineering*