Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning

Gregory L Medlock; Jason A Papin

doi:10.1016/j.cels.2019.11.006

Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning

Cell Syst. 2020 Jan 22;10(1):109-119.e3. doi: 10.1016/j.cels.2019.11.006. Epub 2020 Jan 8.

Authors

Gregory L Medlock¹, Jason A Papin²

Affiliations

¹ Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA.
² Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA; Department of Medicine, Division of Infectious Diseases & International Health, University of Virginia, Charlottesville, VA, USA; Department of Biochemistry & Molecular Genetics, University of Virginia, Charlottesville, VA, USA. Electronic address: papin@virginia.edu.

Abstract

Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for genome-scale metabolic models. We generate an ensemble of models, each equally consistent with experimental data, then perform simulations with them. We apply machine learning to the simulation output to identify model structure variation that maximally influences simulations. These variants are high-priority candidates for curation through removal, addition, or reannotation in the model. We apply this approach, automated metabolic model ensemble-driven elimination of uncertainty with statistical learning (AMMEDEUS), to 29 bacterial species to improve gene essentiality predictions. We explore targets for individual species and compile pan-species targets to improve the database used during model construction. AMMEDEUS is an automated and performance-driven recommendation system that complements intuition during curation of biochemical knowledgebases.

Keywords: ensemble modeling; machine learning; mechanistic models; metabolic modeling; metabolism; model curation; systems biology.

Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding