Explainable AI identifies diagnostic cells of genetic AML subtypes

Matthias Hehr; Ario Sadafi; Christian Matek; Peter Lienemann; Christian Pohlkamp; Torsten Haferlach; Karsten Spiekermann; Carsten Marr

doi:10.1371/journal.pdig.0000187

PLOS Digit Health. 2023 Mar 15;2(3):e0000187. doi: 10.1371/journal.pdig.0000187. eCollection 2023 Mar.

Authors

Matthias Hehr¹ ² ³, Ario Sadafi¹ ² ⁴, Christian Matek¹ ² ³, Peter Lienemann¹ ³, Christian Pohlkamp⁵, Torsten Haferlach⁵, Karsten Spiekermann³ ⁶ ⁷, Carsten Marr¹ ²

Affiliations

¹ Institute of AI for Health, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany.
² Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany.
³ Laboratory of Leukemia Diagnostics, Department of Medicine III, University Hospital, LMU Munich, Munich, Germany.
⁴ Computer Aided Medical Procedures, Technical University of Munich, Munich, Germany.
⁵ Munich Leukemia Laboratory, Munich, Germany.
⁶ German Cancer Consortium (DKTK), Heidelberg, Germany.
⁷ German Cancer Research Center (DKFZ), Heidelberg, Germany.

Abstract

Explainable AI is deemed essential for clinical applications because it allows model predictions to be rationalized, helping to build trust between clinicians and automated decision support tools. We developed an inherently explainable AI model for the classification of acute myeloid leukemia (AML) subtypes from blood smears and found that high-attention cells identified by the model coincide with those labeled as diagnostically relevant by human experts. Based on over 80,000 single white blood cell images from digitized blood smears of 129 patients diagnosed with one of four WHO-defined genetic AML subtypes and 60 healthy controls, we trained SCEMILA, a single-cell-based explainable multiple instance learning algorithm. SCEMILA could perfectly discriminate between AML patients and healthy controls and detected the acute promyelocytic leukemia (APL) subtype with an F1 score of 0.86±0.05 (mean±s.d., 5-fold cross-validation). Analyzing a novel multi-attention module, we confirmed that our algorithm focused with high concordance on the same AML-specific cells as human experts do. Applied to single-cell classification, it highlights subtype-specific cells and deconvolves the composition of a patient's blood smear without requiring single-cell annotation of the training data. Our large AML genetic subtype dataset is publicly available, and an interactive online tool facilitates the exploration of data and predictions. SCEMILA enables a comparison of algorithmic and expert decision criteria and can present a detailed analysis of individual patient data, paving the way to deploy AI in routine diagnostics for identifying hematopoietic neoplasms.
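To make the idea of attention over single cells concrete, the following is a minimal sketch of attention-based multiple instance learning pooling with one attention head per class. It is not the SCEMILA implementation: the function name `multi_attention_pool`, the weight matrices `V` and `W`, the feature dimensions, and the random inputs are all hypothetical and chosen for illustration only. The only detail taken from the abstract is the number of classes (four AML subtypes plus healthy controls).

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_attention_pool(features, V, W):
    """Pool per-cell features into one embedding per class (illustrative only).

    features: (n_cells, d) single-cell feature vectors of one patient (the "bag")
    V:        (d, h) hypothetical hidden projection
    W:        (h, n_classes) hypothetical per-class attention weights
    Returns attention of shape (n_classes, n_cells) and
    bag embeddings of shape (n_classes, d).
    """
    scores = np.tanh(features @ V) @ W        # (n_cells, n_classes)
    attention = softmax(scores.T, axis=1)     # each class row sums to 1 over cells
    bag = attention @ features                # attention-weighted mean of cell features
    return attention, bag

# Toy example with random numbers standing in for real cell features.
rng = np.random.default_rng(0)
n_cells, d, h, n_classes = 6, 8, 4, 5         # 5 = four AML subtypes + healthy controls
features = rng.normal(size=(n_cells, d))
V = rng.normal(size=(d, h))
W = rng.normal(size=(h, n_classes))

attention, bag = multi_attention_pool(features, V, W)

# Indices of the three highest-attention cells per class; these are the cells
# one would show to an expert for comparison.
top_cells = np.argsort(-attention, axis=1)[:, :3]
```

In a sketch like this, each class's attention row can be read as a ranking of cells, and that kind of ranking is what allows model focus to be compared with expert annotations. Training happens only on patient-level labels, so no single-cell annotation enters the computation.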

Copyright: © 2023 Hehr et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Grants and funding

M.H. acknowledges support from Deutsche José Carreras-Leukämie Stiftung. C.M. has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant agreement No. 866411). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.