Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct;34(10):1013-1026.
doi: 10.1007/s10822-020-00314-0. Epub 2020 May 2.

Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions

Affiliations

Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions

Raquel Rodríguez-Pérez et al. J Comput Aided Mol Des. 2020 Oct.

Abstract

Difficulties in interpreting machine learning (ML) models and their predictions limit the practical applicability of and confidence in ML in pharmaceutical research. There is a need for agnostic approaches aiding in the interpretation of ML models regardless of their complexity that is also applicable to deep neural network (DNN) architectures and model ensembles. To these ends, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced. The SHAP approach enables the identification and prioritization of features that determine compound classification and activity prediction using any ML model. Herein, we further extend the evaluation of the SHAP methodology by investigating a variant for exact calculation of Shapley values for decision tree methods and systematically compare this variant in compound activity and potency value predictions with the model-independent SHAP method. Moreover, new applications of the SHAP analysis approach are presented including interpretation of DNN models for the generation of multi-target activity profiles and ensemble regression models for potency prediction.

Keywords: Black box character; Compound activity; Compound potency prediction; Feature importance; Machine learning; Model interpretation; Multi-target modeling; Shapley values; Structure–activity relationships.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Comparison of kernel and tree SHAP. For 10 activity classes, distributions of correlation coefficient (r) values for kernel and tree SHAP calculations, corresponding to approximated and exact SHAP values, respectively, are reported in boxplots. Black horizontal lines indicate median values. Results are shown for classification (activity prediction, top) and regression (potency value prediction, bottom) models using RF (blue) and ExtraTrees (red)
Fig. 2
Fig. 2
Interpretation of GB-based compound potency prediction. a For an exemplary prediction, a feature importance ranking is shown including features with positive (red) and negative (blue) contributions to the prediction of the high potency value. Sequential arrows on the left are proportional to the feature contributions or SHAP values (shown on the pKi scale). The summation of the expected value (7.7, gray) and all feature contributions yield the predicted pKi value (9.6). Numbers in white preceded by # indicate top-ranked features. b From the top to the bottom, top-ranked features with positive contributions are iteratively mapped onto the test compound
Fig. 3
Fig. 3
Interpretation of RF-based compound potency prediction. a Feature contributions to an exemplary prediction are depicted according to Fig. 2. The expected value (8.4, gray) and all feature contributions yield the predicted pKi value (10.3). b Top-ranked features with positive contributions are mapped onto the test compound. c Top-5 and -10 ranked features are mapped onto three analog from the same series
Fig. 4
Fig. 4
Comparative interpretation of RF- and GBM-based potency prediction. In a, positive (red) and negative (blue) feature contributions are compared for RF- and GBM-based regression models. The predicted pKi values are shown in bold and different colors for RF (yellow) and GBM (orange). White numbers give indices of top-ranked features. b The top-1 and top-5 ranked features are mapped onto the compound. These features are common to both models
Fig. 5
Fig. 5
SHAP-based interpretation of MT-DNN predictions. Each output neuron facilitates activity prediction of a different target (Tx). A SHAP-based explanation model is generated for each node/target. For a given test compound, each output prediction is rationalized
Fig. 6
Fig. 6
Interpretation of MT-DNN activity predictions. In a and b SHAP analysis is shown for two inhibitors that were highly potent against two kinases. The top-1 and top-3 positive and negative features are mapped onto the compound and colored according to their contributions
Fig. 7
Fig. 7
Rationalizing an error of a GB regression model. For a compound with high potency against muscarinic acetylcholine receptor M3, the potency value was ~ 1000-fold under-predicted by the GB model. SHAP identifies features making strong positive contributions to the prediction that are mapped onto the compound (left). By contrast, features with strongest negative contributions to potency prediction are absent in the compound. The corresponding atom environments are shown on the right
Fig. 8
Fig. 8
Rationalizing an MT-DNN classification error. A highly potent inhibitor of ribosomal protein S6 kinase 1 was incorrectly predicted to be weakly potent against this target. SHAP analysis identifies a variety of features with strong negative contributions to the prediction. The top-6 ranked negative features are mapped onto the inhibitor. Three of these features are present in the compound, but the three others are absent. The corresponding atom environments are displayed

Similar articles

Cited by

References

    1. Varnek A, Baskin I. Machine learning methods for property prediction in cheminformatics: quo vadis? J Chem Inf Model. 2012;52:1413–1437. doi: 10.1021/ci200409x. - DOI - PubMed
    1. Cherkasov A, Muratov E, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuzmin VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A. QSAR modeling: where have you been? Where are you going to? J Med Chem. 2014;57:4977–5010. doi: 10.1021/jm4004285. - DOI - PMC - PubMed
    1. Lavecchia A. Machine-learning approaches in drug discovery: methods and applications. Drug Discov Today. 2015;20:318–331. doi: 10.1016/j.drudis.2014.10.012. - DOI - PubMed
    1. Lo Y, Rensi SE, Torng W, Altman RB. Machine learning in chemoinformatics and drug discovery. Drug Discov Today. 2018;23:1538–1546. doi: 10.1016/j.drudis.2018.05.010. - DOI - PMC - PubMed
    1. Hansen K, Baehrens D, Schroeter T, Rupp M, Müller K-R. Visual interpretation of kernel-based prediction models. Mol Inform. 2011;30:817–826. doi: 10.1002/minf.201100059. - DOI - PubMed

Publication types

Substances