Hedging your bets by learning reward correlations in the human brain
- PMID: 21943609
- PMCID: PMC3183226
- DOI: 10.1016/j.neuron.2011.07.025
Hedging your bets by learning reward correlations in the human brain
Abstract
Human subjects are proficient at tracking the mean and variance of rewards and updating these via prediction errors. Here, we addressed whether humans can also learn about higher-order relationships between distinct environmental outcomes, a defining ecological feature of contexts where multiple sources of rewards are available. By manipulating the degree to which distinct outcomes are correlated, we show that subjects implemented an explicit model-based strategy to learn the associated outcome correlations and were adept in using that information to dynamically adjust their choices in a task that required a minimization of outcome variance. Importantly, the experimentally generated outcome correlations were explicitly represented neuronally in right midinsula with a learning prediction error signal expressed in rostral anterior cingulate cortex. Thus, our data show that the human brain represents higher-order correlation structures between rewards, a core adaptive ability whose immediate benefit is optimized sampling.
Copyright © 2011 Elsevier Inc. All rights reserved.
Figures
Similar articles
-
The Good, the Bad, and the Irrelevant: Neural Mechanisms of Learning Real and Hypothetical Rewards and Effort.J Neurosci. 2015 Aug 12;35(32):11233-51. doi: 10.1523/JNEUROSCI.0396-15.2015. J Neurosci. 2015. PMID: 26269633 Free PMC article.
-
How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29. J Cogn Neurosci. 2014. PMID: 24168216
-
Learning to minimize efforts versus maximizing rewards: computational principles and neural correlates.J Neurosci. 2014 Nov 19;34(47):15621-30. doi: 10.1523/JNEUROSCI.1350-14.2014. J Neurosci. 2014. PMID: 25411490 Free PMC article.
-
Predictive reward signal of dopamine neurons.J Neurophysiol. 1998 Jul;80(1):1-27. doi: 10.1152/jn.1998.80.1.1. J Neurophysiol. 1998. PMID: 9658025 Review.
-
Splitting the difference: how does the brain code reward episodes?Ann N Y Acad Sci. 2007 May;1104:54-69. doi: 10.1196/annals.1390.020. Epub 2007 Apr 7. Ann N Y Acad Sci. 2007. PMID: 17416922 Review.
Cited by
-
An electrophysiological index of changes in risk decision-making strategies.Neuropsychologia. 2013 Jul;51(8):1397-407. doi: 10.1016/j.neuropsychologia.2013.04.014. Epub 2013 May 2. Neuropsychologia. 2013. PMID: 23643796 Free PMC article.
-
Positively biased processing of self-relevant social feedback.J Neurosci. 2012 Nov 21;32(47):16832-44. doi: 10.1523/JNEUROSCI.3016-12.2012. J Neurosci. 2012. PMID: 23175836 Free PMC article.
-
Using Neural Data to Test A Theory of Investor Behavior: An Application to Realization Utility.J Finance. 2014 Apr 1;69(2):907-946. doi: 10.1111/jofi.12126. J Finance. 2014. PMID: 25774065 Free PMC article.
-
Cultural influences on social feedback processing of character traits.Front Hum Neurosci. 2014 Apr 4;8:192. doi: 10.3389/fnhum.2014.00192. eCollection 2014. Front Hum Neurosci. 2014. PMID: 24772075 Free PMC article.
-
Separate neural representations of prediction error valence and surprise: Evidence from an fMRI meta-analysis.Hum Brain Mapp. 2018 Jul;39(7):2887-2906. doi: 10.1002/hbm.24047. Epub 2018 Mar 25. Hum Brain Mapp. 2018. PMID: 29575249 Free PMC article.
References
-
- Andersson J.L., Hutton C., Ashburner J., Turner R., Friston K. Modeling geometric deformations in EPI time series. Neuroimage. 2001;13:903–919. - PubMed
-
- Andrade A., Paradis A.L., Rouquette S., Poline J.B. Ambiguous results in functional neuroimaging data analysis due to covariate correlation. Neuroimage. 1999;10:483–486. - PubMed
-
- Bechara A., Damasio H., Tranel D., Damasio A.R. Deciding advantageously before knowing the advantageous strategy. Science. 1997;275:1293–1295. - PubMed
-
- Becker G.M., DeGroot M.H., Marschak J. Measuring utility by a single-response sequential method. Behav. Sci. 1964;9:226–232. - PubMed
-
- Behrens T.E., Woolrich M.W., Walton M.E., Rushworth M.F. Learning the value of information in an uncertain world. Nat. Neurosci. 2007;10:1214–1221. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
