Contextual influence on confidence judgments in human reinforcement learning
- PMID: 30958826
- PMCID: PMC6472836
- DOI: 10.1371/journal.pcbi.1006973
Contextual influence on confidence judgments in human reinforcement learning
Abstract
The ability to correctly estimate the probability of one's choices being correct is fundamental to optimally re-evaluate previous choices or to arbitrate between different decision strategies. Experimental evidence nonetheless suggests that this metacognitive process-confidence judgment- is susceptible to numerous biases. Here, we investigate the effect of outcome valence (gains or losses) on confidence while participants learned stimulus-outcome associations by trial-and-error. In two experiments, participants were more confident in their choices when learning to seek gains compared to avoiding losses, despite equal difficulty and performance between those two contexts. Computational modelling revealed that this bias is driven by the context-value, a dynamically updated estimate of the average expected-value of choice options, necessary to explain equal performance in the gain and loss domain. The biasing effect of context-value on confidence, revealed here for the first time in a reinforcement-learning context, is therefore domain-general, with likely important functional consequences. We show that one such consequence emerges in volatile environments, where the (in)flexibility of individuals' learning strategies differs when outcomes are framed as gains or losses. Despite apparent similar behavior- profound asymmetries might therefore exist between learning to avoid losses and learning to seek gains.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
Similar articles
-
Linking confidence biases to reinforcement-learning processes.Psychol Rev. 2023 Jul;130(4):1017-1043. doi: 10.1037/rev0000424. Epub 2023 May 8. Psychol Rev. 2023. PMID: 37155268
-
Adaptive History Biases Result from Confidence-Weighted Accumulation of past Choices.J Neurosci. 2018 Mar 7;38(10):2418-2429. doi: 10.1523/JNEUROSCI.2189-17.2017. Epub 2018 Jan 25. J Neurosci. 2018. PMID: 29371318 Free PMC article.
-
Decomposing the effects of context valence and feedback information on speed and accuracy during reinforcement learning: a meta-analytical approach using diffusion decision modeling.Cogn Affect Behav Neurosci. 2019 Jun;19(3):490-502. doi: 10.3758/s13415-019-00723-1. Cogn Affect Behav Neurosci. 2019. PMID: 31175616 Free PMC article.
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
-
[Mathematical models of decision making and learning].Brain Nerve. 2008 Jul;60(7):791-8. Brain Nerve. 2008. PMID: 18646619 Review. Japanese.
Cited by
-
Robust valence-induced biases on motor response and confidence in human reinforcement learning.Cogn Affect Behav Neurosci. 2020 Dec;20(6):1184-1199. doi: 10.3758/s13415-020-00826-0. Cogn Affect Behav Neurosci. 2020. PMID: 32875531 Free PMC article.
-
Impact of number of critical care procedural skill repetitions on supervision level and teaching style.PLoS One. 2023 Jan 23;18(1):e0280207. doi: 10.1371/journal.pone.0280207. eCollection 2023. PLoS One. 2023. PMID: 36689411 Free PMC article.
-
Intertemporal choice reflects value comparison rather than self-control: insights from confidence judgements.Philos Trans R Soc Lond B Biol Sci. 2022 Dec 19;377(1866):20210338. doi: 10.1098/rstb.2021.0338. Epub 2022 Oct 31. Philos Trans R Soc Lond B Biol Sci. 2022. PMID: 36314145 Free PMC article.
-
Direct stimulation of anterior insula and ventromedial prefrontal cortex disrupts economic choices.Nat Commun. 2024 Aug 29;15(1):7508. doi: 10.1038/s41467-024-51822-8. Nat Commun. 2024. PMID: 39209840 Free PMC article.
-
Model-based prioritization for acquiring protection.PLoS Comput Biol. 2022 Dec 19;18(12):e1010805. doi: 10.1371/journal.pcbi.1010805. eCollection 2022 Dec. PLoS Comput Biol. 2022. PMID: 36534704 Free PMC article.
References
-
- Sutton RS, Barto AG. Reinforcement learning: An introduction. MIT press Cambridge; 1998.
-
- Erev I, Roth AE. Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria. Am Econ Rev. 1998;88: 848–881. 10.2307/117009 - DOI
-
- Rescorla RA, Wagner AR. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. Class Cond II Curr Res Theory. 1972;2: 64–99.
