Interpretation of machine learning models using Shapley values: application to compound potency and multi-target activity predictions
- PMID: 32361862
- PMCID: PMC7449951
- DOI: 10.1007/s10822-020-00314-0
Abstract
Difficulties in interpreting machine learning (ML) models and their predictions limit both the practical applicability of ML in pharmaceutical research and confidence in it. There is a need for model-agnostic approaches that aid in the interpretation of ML models regardless of their complexity and that are also applicable to deep neural network (DNN) architectures and model ensembles. To these ends, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced. The SHAP approach enables the identification and prioritization of features that determine compound classification and activity prediction using any ML model. Herein, we further extend the evaluation of the SHAP methodology by investigating a variant for the exact calculation of Shapley values for decision tree methods and by systematically comparing this variant with the model-independent SHAP method in compound activity and potency value predictions. Moreover, new applications of the SHAP analysis approach are presented, including the interpretation of DNN models for the generation of multi-target activity profiles and of ensemble regression models for potency prediction.
Keywords: Black box character; Compound activity; Compound potency prediction; Feature importance; Machine learning; Model interpretation; Multi-target modeling; Shapley values; Structure–activity relationships.
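The abstract does not spell out how a Shapley value is computed, so the following is only a minimal sketch of the standard game-theoretic definition, not the authors' implementation and not the algorithm used by the SHAP library. It enumerates every feature subset for a single instance, which is feasible only for a handful of features. The function names, the toy model, and the choice to replace "absent" features with a baseline value are all assumptions made for illustration.

```python
from itertools import combinations
from math import factorial


def exact_shapley(predict, x, baseline):
    """Exact Shapley values for one instance by brute-force subset enumeration.

    Assumption: a feature that is outside a coalition is set to its
    baseline value. Other conventions for "absent" features exist.
    """
    n = len(x)

    def value(coalition):
        # Model output when only the features in `coalition` take their
        # actual values; all others are held at the baseline.
        z = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(z)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude feature i.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = set(subset)
                # Marginal contribution of feature i to this coalition.
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_model(z):
    # Hypothetical model on three binary features: features 0 and 1
    # interact, feature 2 contributes additively.
    return 2.0 * z[0] * z[1] + 1.0 * z[2]


x = [1, 1, 1]          # instance to explain
baseline = [0, 0, 0]   # reference input (assumed all features absent)
phi = exact_shapley(toy_model, x, baseline)
print(phi)
```

In this toy case the contributions sum to the difference between the prediction for `x` and the prediction for the baseline, which is the additivity property that gives SHAP its name. The cost grows exponentially with the number of features, which is why approximations and model-specific exact variants, such as the one for decision trees mentioned in the abstract, are used in practice.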
