Bio-inspired chemical space exploration of terpenoids

Brief Bioinform. 2022 Sep 20;23(5):bbac197. doi: 10.1093/bib/bbac197.

Abstract

Many computational methods are devoted to rapidly generating pseudo-natural products to expand the open-ended border of chemical spaces for natural products. However, the accessibility and chemical interpretation were often ignored or underestimated in conventional library/fragment-based or rule-based strategies, thus hampering experimental synthesis. Herein, a bio-inspired strategy (named TeroGen) is developed to mimic the two key biosynthetic stages (cyclization and decoration) of terpenoid natural products, by utilizing physically based simulations and deep learning models, respectively. The precision and efficiency are validated for different categories of terpenoids, and in practice, more than 30 000 sesterterpenoids (10 times as many as the known sesterterpenoids) are predicted to be linked in a reaction network, and their synthetic accessibility and chemical interpretation are estimated by thermodynamics and kinetics. Since it could not only greatly expand the chemical space of terpenoids but also numerate plausible biosynthetic routes, TeroGen is promising for accelerating heterologous biosynthesis, bio-mimic and chemical synthesis of complicated terpenoids and derivatives.

Keywords: biosynthesis; chemical space; deep learning; molecular generation; terpenoids.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biological Products*
  • Space Flight*
  • Terpenes

Substances

  • Biological Products
  • Terpenes