Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery

Lun K Tsou; Shiu-Hwa Yeh; Shau-Hua Ueng; Chun-Ping Chang; Jen-Shin Song; Mine-Hsine Wu; Hsiao-Fu Chang; Sheng-Ren Chen; Chuan Shih; Chiung-Tong Chen; Yi-Yu Ke

doi:10.1038/s41598-020-73681-1

Comparative study between deep learning and QSAR classifications for TNBC inhibitors and novel GPCR agonist discovery

Sci Rep. 2020 Oct 8;10(1):16771. doi: 10.1038/s41598-020-73681-1.

Authors

Affiliations

¹ Institute of Biotechnology and Pharmaceutical Research, National Health Research Institutes, Zhunan, 35053, Miaoli County, Taiwan, ROC.
² Institute of Biotechnology and Pharmaceutical Research, National Health Research Institutes, Zhunan, 35053, Miaoli County, Taiwan, ROC. yiyuke@nhri.edu.tw.

^# Contributed equally.

Abstract

Machine learning is a well-known approach for virtual screening. Recently, deep learning, a machine learning algorithm in artificial neural networks, has been applied to the advancement of precision medicine and drug discovery. In this study, we performed comparative studies between deep neural networks (DNN) and other ligand-based virtual screening (LBVS) methods to demonstrate that DNN and random forest (RF) were superior in hit prediction efficiency. By using DNN, several triple-negative breast cancer (TNBC) inhibitors were identified as potent hits from a screening of an in-house database of 165,000 compounds. In broadening the application of this method, we harnessed the predictive properties of trained model in the discovery of G protein-coupled receptor (GPCR) agonist, by which computational structure-based design of molecules could be greatly hindered by lack of structural information. Notably, a potent (~ 500 nM) mu-opioid receptor (MOR) agonist was identified as a hit from a small-size training set of 63 compounds. Our results show that DNN could be an efficient module in hit prediction and provide experimental evidence that machine learning could identify potent hits in silico from a limited training set.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Antineoplastic Agents / therapeutic use*
Deep Learning*
Drug Discovery / methods
Humans
Neural Networks, Computer
Receptors, G-Protein-Coupled / agonists*
Triple Negative Breast Neoplasms / drug therapy*

Substances

Antineoplastic Agents
Receptors, G-Protein-Coupled