Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction

Chemistry. 2017 May 2;23(25):5966-5971. doi: 10.1002/chem.201605499. Epub 2017 Feb 22.


Reaction prediction and retrosynthesis are the cornerstones of organic chemistry. Rule-based expert systems have been the most widespread approach to computationally solve these two related challenges to date. However, reaction rules often fail because they ignore the molecular context, which leads to reactivity conflicts. Herein, we report that deep neural networks can learn to resolve reactivity conflicts and to prioritize the most suitable transformation rules. We show that by training our model on 3.5 million reactions taken from the collective published knowledge of the entire discipline of chemistry, our model exhibits a top10-accuracy of 95 % in retrosynthesis and 97 % for reaction prediction on a validation set of almost 1 million reactions.

Keywords: artificial intelligence; machine learning; retrosynthesis; synthesis design; total synthesis.