Prediction of Organic Reaction Outcomes Using Machine Learning
- PMID: 28573205
- PMCID: PMC5445544
- DOI: 10.1021/acscentsci.7b00064
Abstract
Computer assistance in synthesis design has existed for over 40 years, yet retrosynthesis planning software has struggled to achieve widespread adoption. One critical challenge in developing high-quality pathway suggestions is that proposed reaction steps often fail when attempted in the laboratory, despite initially seeming viable. The true measure of success for any synthesis program is whether the predicted outcome matches what is observed experimentally. We report a model framework for anticipating reaction outcomes that combines the traditional use of reaction templates with the flexibility in pattern recognition afforded by neural networks. Using 15 000 experimental reaction records from granted United States patents, a model is trained to select the major (recorded) product by ranking a self-generated list of candidates where one candidate is known to be the major product. Candidate reactions are represented using a unique edit-based representation that emphasizes the fundamental transformation from reactants to products, rather than the constituent molecules' overall structures. In a 5-fold cross-validation, the trained model assigns the major product rank 1 in 71.8% of cases, rank ≤3 in 86.7% of cases, and rank ≤5 in 90.8% of cases.
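The rank-based metrics quoted above (rank 1, rank ≤3, rank ≤5) can be illustrated with a minimal sketch. It assumes each reaction comes with a list of candidate scores and the index of the recorded product. The function names `rank_of_true` and `top_k_accuracy`, the tie-breaking rule, and the toy scores are hypothetical and are not taken from the paper's code or data.

```python
from typing import Sequence


def rank_of_true(scores: Sequence[float], true_index: int) -> int:
    """Return the 1-based rank of the recorded product among the candidates.

    Candidates are ordered by descending score. Ties are resolved in favor
    of the recorded product (an assumption made for this illustration).
    """
    true_score = scores[true_index]
    # Rank = 1 + number of candidates that score strictly higher.
    return 1 + sum(1 for s in scores if s > true_score)


def top_k_accuracy(ranks: Sequence[int], k: int) -> float:
    """Fraction of reactions whose recorded product has rank <= k."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    return sum(1 for r in ranks if r <= k) / len(ranks)


# Toy data (invented): one (candidate scores, index of recorded product)
# pair per reaction.
toy_examples = [
    ([0.1, 0.7, 0.2], 1),       # recorded product scores highest
    ([0.5, 0.3, 0.9], 1),       # recorded product scores lowest
    ([0.4, 0.6, 0.1, 0.2], 0),  # recorded product is second
]

ranks = [rank_of_true(scores, idx) for scores, idx in toy_examples]

for k in (1, 3, 5):
    print(f"rank <= {k}: {top_k_accuracy(ranks, k):.1%}")
```

In a cross-validation setting, the same calculation would be applied to the held-out reactions of each fold, and the per-fold results then combined.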
Conflict of interest statement
The authors declare no competing financial interest.
