Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain
- PMID: 22238090
- PMCID: PMC6621075
- DOI: 10.1523/JNEUROSCI.5498-10.2012
Neural prediction errors reveal a risk-sensitive reinforcement-learning process in the human brain
Abstract
Humans and animals are exquisitely, though idiosyncratically, sensitive to risk or variance in the outcomes of their actions. Economic, psychological, and neural aspects of this are well studied when information about risk is provided explicitly. However, we must normally learn about outcomes from experience, through trial and error. Traditional models of such reinforcement learning focus on learning about the mean reward value of cues and ignore higher order moments such as variance. We used fMRI to test whether the neural correlates of human reinforcement learning are sensitive to experienced risk. Our analysis focused on anatomically delineated regions of a priori interest in the nucleus accumbens, where blood oxygenation level-dependent (BOLD) signals have been suggested as correlating with quantities derived from reinforcement learning. We first provide unbiased evidence that the raw BOLD signal in these regions corresponds closely to a reward prediction error. We then derive from this signal the learned values of cues that predict rewards of equal mean but different variance and show that these values are indeed modulated by experienced risk. Moreover, a close neurometric-psychometric coupling exists between the fluctuations of the experience-based evaluations of risky options that we measured neurally and the fluctuations in behavioral risk aversion. This suggests that risk sensitivity is integral to human learning, illuminating economic models of choice, neuroscientific models of affective learning, and the workings of the underlying neural mechanisms.
Figures
Similar articles
-
Dynamic shaping of dopamine signals during probabilistic Pavlovian conditioning.Neurobiol Learn Mem. 2015 Jan;117:84-92. doi: 10.1016/j.nlm.2014.07.010. Epub 2014 Aug 27. Neurobiol Learn Mem. 2015. PMID: 25172480 Free PMC article.
-
Neural correlates of risk prediction error during reinforcement learning in humans.Neuroimage. 2009 Oct 1;47(4):1929-39. doi: 10.1016/j.neuroimage.2009.04.096. Epub 2009 May 13. Neuroimage. 2009. PMID: 19442744
-
Human reinforcement learning subdivides structured action spaces by learning effector-specific values.J Neurosci. 2009 Oct 28;29(43):13524-31. doi: 10.1523/JNEUROSCI.2469-09.2009. J Neurosci. 2009. PMID: 19864565 Free PMC article.
-
The ubiquity of model-based reinforcement learning.Curr Opin Neurobiol. 2012 Dec;22(6):1075-81. doi: 10.1016/j.conb.2012.08.003. Epub 2012 Sep 6. Curr Opin Neurobiol. 2012. PMID: 22959354 Free PMC article. Review.
-
The nucleus accumbens and Pavlovian reward learning.Neuroscientist. 2007 Apr;13(2):148-59. doi: 10.1177/1073858406295854. Neuroscientist. 2007. PMID: 17404375 Free PMC article. Review.
Cited by
-
Neurocomputational mechanisms of affected beliefs.Commun Biol. 2022 Nov 14;5(1):1241. doi: 10.1038/s42003-022-04165-3. Commun Biol. 2022. PMID: 36376497 Free PMC article.
-
Interactions between attributions and beliefs at trial-by-trial level: Evidence from a novel computer game task.PLoS Comput Biol. 2022 Sep 26;18(9):e1009920. doi: 10.1371/journal.pcbi.1009920. eCollection 2022 Sep. PLoS Comput Biol. 2022. PMID: 36155635 Free PMC article.
-
Threat Prediction from Schemas as a Source of Bias in Pain Perception.J Neurosci. 2020 Feb 12;40(7):1538-1548. doi: 10.1523/JNEUROSCI.2104-19.2019. Epub 2020 Jan 2. J Neurosci. 2020. PMID: 31896672 Free PMC article.
-
Intact Reinforcement Learning But Impaired Attentional Control During Multidimensional Probabilistic Learning in Older Adults.J Neurosci. 2020 Jan 29;40(5):1084-1096. doi: 10.1523/JNEUROSCI.0254-19.2019. Epub 2019 Dec 11. J Neurosci. 2020. PMID: 31826943 Free PMC article.
-
Nicotinic receptors in the ventral tegmental area promote uncertainty-seeking.Nat Neurosci. 2016 Mar;19(3):471-8. doi: 10.1038/nn.4223. Epub 2016 Jan 18. Nat Neurosci. 2016. PMID: 26780509
References
-
- Abler B, Walter H, Erk S, Kammerer H, Spitzer M. Prediction error as a linear function of reward probability is coded in human nucleus accumbens. Neuroimage. 2006;31:790–795. - PubMed
-
- Barto A. G. Adaptive critic and the basal ganglia. In: Houk JC, Davis JL, Beiser DG, editors. Models of information processing in the basal ganglia. Cambridge, MA: MIT; 1995. pp. 215–232.
-
- Bernoulli D. Exposition of a new theory on the measurement of risk. Econometrica. 1954;22:23–36.
-
- Breiter HC, Aharon I, Kahneman D, Dale A, Shizgal P. Functional imaging of neural responses to expectancy and experience of monetary gains and losses. Neuron. 2001;30:619–639. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources