"What is relevant in a text document?": An interpretable machine learning approach
- PMID: 28800619
- PMCID: PMC5553725
- DOI: 10.1371/journal.pone.0181142
"What is relevant in a text document?": An interpretable machine learning approach
Abstract
Text documents can be described by a number of abstract concepts such as semantic category, writing style, or sentiment. Machine learning (ML) models have been trained to automatically map documents to these abstract concepts, allowing to annotate very large text collections, more than could be processed by a human in a lifetime. Besides predicting the text's category very accurately, it is also highly desirable to understand how and why the categorization process takes place. In this paper, we demonstrate that such understanding can be achieved by tracing the classification decision back to individual words using layer-wise relevance propagation (LRP), a recently developed technique for explaining predictions of complex non-linear classifiers. We train two word-based ML models, a convolutional neural network (CNN) and a bag-of-words SVM classifier, on a topic categorization task and adapt the LRP method to decompose the predictions of these models onto words. Resulting scores indicate how much individual words contribute to the overall classification decision. This enables one to distill relevant information from text documents without an explicit semantic information extraction step. We further use the word-wise relevance scores for generating novel vector-based document representations which capture semantic information. Based on these document vectors, we introduce a measure of model explanatory power and show that, although the SVM and CNN models perform similarly in terms of classification accuracy, the latter exhibits a higher level of explainability which makes it more comprehensible for humans and potentially more useful for other applications.
Conflict of interest statement
Figures
Similar articles
-
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153. J Am Med Inform Assoc. 2020. PMID: 31710668 Free PMC article.
-
Transferability of artificial neural networks for clinical document classification across hospitals: A case study on abnormality detection from radiology reports.J Biomed Inform. 2018 Sep;85:68-79. doi: 10.1016/j.jbi.2018.07.017. Epub 2018 Jul 17. J Biomed Inform. 2018. PMID: 30026067
-
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8. BMC Med Inform Decis Mak. 2017. PMID: 29191207 Free PMC article.
-
A Machine Learning Approach with Human-AI Collaboration for Automated Classification of Patient Safety Event Reports: Algorithm Development and Validation Study.JMIR Hum Factors. 2024 Jan 25;11:e53378. doi: 10.2196/53378. JMIR Hum Factors. 2024. PMID: 38271086 Free PMC article.
-
State-of-the-art methods in healthcare text classification system: AI paradigm.Front Biosci (Landmark Ed). 2020 Jan 1;25(4):646-672. doi: 10.2741/4826. Front Biosci (Landmark Ed). 2020. PMID: 31585909 Review.
Cited by
-
Explainable deep learning in plant phenotyping.Front Artif Intell. 2023 Sep 19;6:1203546. doi: 10.3389/frai.2023.1203546. eCollection 2023. Front Artif Intell. 2023. PMID: 37795496 Free PMC article. Review.
-
What I Cannot Predict, I Do Not Understand: A Human-Centered Evaluation Framework for Explainability Methods.Adv Neural Inf Process Syst. 2022;35:2832-2845. Adv Neural Inf Process Syst. 2022. PMID: 37786623 Free PMC article.
-
Exploring the application of machine learning to expert evaluation of research impact.PLoS One. 2023 Aug 3;18(8):e0288469. doi: 10.1371/journal.pone.0288469. eCollection 2023. PLoS One. 2023. PMID: 37535633 Free PMC article.
-
Combining 3D skeleton data and deep convolutional neural network for balance assessment during walking.Front Bioeng Biotechnol. 2023 Jun 20;11:1191868. doi: 10.3389/fbioe.2023.1191868. eCollection 2023. Front Bioeng Biotechnol. 2023. PMID: 37409167 Free PMC article.
-
An interpretable method for automated classification of spoken transcripts and written text.Evol Intell. 2023 May 4:1-13. doi: 10.1007/s12065-023-00851-1. Online ahead of print. Evol Intell. 2023. PMID: 37360587 Free PMC article.
References
-
- Jones KS. A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation. 1972;28:11–21. 10.1108/eb026526 - DOI
-
- Salton G, Wong A, Yang CS. A Vector Space Model for Automatic Indexing. Communications of the ACM. 1975;18(11):613–620. 10.1145/361219.361220 - DOI
-
- Hasan KS, Ng V. Conundrums in Unsupervised Keyphrase Extraction: Making Sense of the State-of-the-Art. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING); 2010. p. 365–373.
-
- Aggarwal CC, Zhai C. A Survey of Text Classification Algorithms In: Aggarwal CC, Zhai C, editors. Mining Text Data. Springer; 2012. p. 163–222.
-
- Mikolov T, Sutskever I, Chen K, Corrado G, Dean J. Distributed Representations of Words and Phrases and their Compositionality. In: Advances in Neural Information Processing Systems 26 (NIPS); 2013. p. 3111–3119.
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous
