Developing an artificial intelligence method for screening hepatotoxic compounds in traditional Chinese medicine and Western medicine combination

Chin Med. 2022 May 17;17(1):58. doi: 10.1186/s13020-022-00617-4.

Abstract

Backgrounds: Traditional Chinese medicine and Western medicine combination (TCM-WMC) increased the complexity of compounds ingested.

Objective: To develop a method for screening hepatotoxic compounds in TCM-WMC based on chemical structures using artificial intelligence (AI) methods.

Methods: Drug-induced liver injury (DILI) data was collected from the public databases and published literatures. The total dataset formed by DILI data was randomly divided into training set and test set at a ratio of 3:1 approximately. Machine learning models of SGD (Stochastic Gradient Descent), kNN (k-Nearest Neighbor), SVM (Support Vector Machine), NB (Naive Bayes), DT (Decision Tree), RF (Random Forest), ANN (Artificial Neural Network), AdaBoost, LR (Logistic Regression) and one deep learning model (deep belief network, DBN) were adopted to construct models for screening hepatotoxic compounds.

Result: Dataset of 2035 hepatotoxic compounds was collected in this research, in which 1505 compounds were as training set and 530 compounds were as test set. Results showed that RF obtained 0.838 of classification accuracy (CA), 0.827 of F1-score, 0.832 of Precision, 0.838 of Recall, 0.814 of area under the curve (AUC) on the training set and 0.767 of CA, 0.731 of F1, 0.739 of Precision, 0.767 of Recall, 0.739 of AUC on the test set, which was better than other eight machine learning methods. The DBN obtained 82.2% accuracy on the test set, which was higher than any other machine learning models on the test set.

Conclusion: The DILI AI models were expected to effectively screen hepatotoxic compounds in TCM-WMC.

Keywords: AI; DILI; Deep learning; Machine learning; Safety assessment; TCM-WMC.