Unmasking Clever Hans predictors and assessing what machines really learn
- PMID: 30858366
- PMCID: PMC6411769
- DOI: 10.1038/s41467-019-08987-4
Abstract
Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem-solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.
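The abstract names Spectral Relevance Analysis but does not say how it works. The sketch below is only an illustration, under the assumption that the method amounts to spectral clustering of per-sample relevance heatmaps, with an eigengap heuristic for counting distinct prediction strategies. The heatmaps are synthetic, and the names `make_heatmaps` and `eigengap_cluster_count` as well as the parameters `sigma` and `max_k` are hypothetical; none of this is taken from the authors' implementation.

```python
import numpy as np


def make_heatmaps(n_per_group=20, size=8, seed=0):
    """Synthetic stand-in for relevance maps (assumption: two strategies).

    Group A puts relevance on a central patch (the "object"), group B on a
    corner patch (a hypothetical spurious artifact).
    """
    rng = np.random.default_rng(seed)
    maps = rng.normal(0.0, 0.05, size=(2 * n_per_group, size, size))
    maps[:n_per_group, 3:5, 3:5] += 1.0   # strategy A: central region
    maps[n_per_group:, 6:8, 0:2] += 1.0   # strategy B: corner region
    return maps


def eigengap_cluster_count(heatmaps, sigma=1.0, max_k=10):
    """Estimate the number of heatmap clusters from the Laplacian spectrum."""
    # Flatten each heatmap into one feature vector.
    X = heatmaps.reshape(len(heatmaps), -1).astype(float)

    # Pairwise squared Euclidean distances between heatmaps.
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)

    # Gaussian affinity matrix without self-loops.
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Symmetric normalized graph Laplacian: I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(X)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]

    # Smallest eigenvalues in ascending order; the largest gap between
    # consecutive eigenvalues suggests the number of well-separated clusters.
    eigvals = np.linalg.eigvalsh(L)[:max_k]
    k = int(np.argmax(np.diff(eigvals))) + 1
    return k, eigvals


if __name__ == "__main__":
    k, eigvals = eigengap_cluster_count(make_heatmaps())
    print("estimated number of strategies:", k)
    print("smallest eigenvalues:", np.round(eigvals, 3))
```

In this toy setting both groups could reach the same accuracy, yet the spectrum separates them, which mirrors the abstract's point that standard performance metrics can miss differences in problem-solving behavior.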
Conflict of interest statement
The authors declare no competing interests.
