The Good, the Bad, and the Irrelevant: Neural Mechanisms of Learning Real and Hypothetical Rewards and Effort
- PMID: 26269633
- PMCID: PMC4532756
- DOI: 10.1523/JNEUROSCI.0396-15.2015
The Good, the Bad, and the Irrelevant: Neural Mechanisms of Learning Real and Hypothetical Rewards and Effort
Abstract
Natural environments are complex, and a single choice can lead to multiple outcomes. Agents should learn which outcomes are due to their choices and therefore relevant for future decisions and which are stochastic in ways common to all choices and therefore irrelevant for future decisions between options. We designed an experiment in which human participants learned the varying reward and effort magnitudes of two options and repeatedly chose between them. The reward associated with a choice was randomly real or hypothetical (i.e., participants only sometimes received the reward magnitude associated with the chosen option). The real/hypothetical nature of the reward on any one trial was, however, irrelevant for learning the longer-term values of the choices, and participants ought to have only focused on the informational content of the outcome and disregarded whether it was a real or hypothetical reward. However, we found that participants showed an irrational choice bias, preferring choices that had previously led, by chance, to a real reward in the last trial. Amygdala and ventromedial prefrontal activity was related to the way in which participants' choices were biased by real reward receipt. By contrast, activity in dorsal anterior cingulate cortex, frontal operculum/anterior insula, and especially lateral anterior prefrontal cortex was related to the degree to which participants resisted this bias and chose effectively in a manner guided by aspects of outcomes that had real and more sustained relationships with particular choices, suppressing irrelevant reward information for more optimal learning and decision making.
Significance statement: In complex natural environments, a single choice can lead to multiple outcomes. Human agents should only learn from outcomes that are due to their choices, not from outcomes without such a relationship. We designed an experiment to measure learning about reward and effort magnitudes in an environment in which other features of the outcome were random and had no relationship with choice. We found that, although people could learn about reward magnitudes, they nevertheless were irrationally biased toward repeating certain choices as a function of the presence or absence of random reward features. Activity in different brain regions in the prefrontal cortex either reflected the bias or reflected resistance to the bias.
Keywords: effort; frontal pole; hypothetical; learning; reward; vmPFC.
Copyright © 2015 Scholl, Kolling et al.
Figures
Similar articles
-
Neural Signatures of Value Comparison in Human Cingulate Cortex during Decisions Requiring an Effort-Reward Trade-off.J Neurosci. 2016 Sep 28;36(39):10002-15. doi: 10.1523/JNEUROSCI.0292-16.2016. Epub 2016 Sep 28. J Neurosci. 2016. PMID: 27683898 Free PMC article.
-
Necessary Contributions of Human Frontal Lobe Subregions to Reward Learning in a Dynamic, Multidimensional Environment.J Neurosci. 2016 Sep 21;36(38):9843-58. doi: 10.1523/JNEUROSCI.1337-16.2016. J Neurosci. 2016. PMID: 27656023 Free PMC article.
-
Social structure learning in human anterior insula.Elife. 2020 Feb 18;9:e53162. doi: 10.7554/eLife.53162. Elife. 2020. PMID: 32067635 Free PMC article.
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
-
Neuronal Reward and Decision Signals: From Theories to Data.Physiol Rev. 2015 Jul;95(3):853-951. doi: 10.1152/physrev.00023.2014. Physiol Rev. 2015. PMID: 26109341 Free PMC article. Review.
Cited by
-
The relationship between outcome prediction and cognitive fatigue: A convergence of paradigms.Cogn Affect Behav Neurosci. 2017 Aug;17(4):838-849. doi: 10.3758/s13415-017-0515-y. Cogn Affect Behav Neurosci. 2017. PMID: 28547127
-
Understanding psychiatric disorder by capturing ecologically relevant features of learning and decision-making.Behav Brain Res. 2018 Dec 14;355:56-75. doi: 10.1016/j.bbr.2017.09.050. Epub 2017 Sep 28. Behav Brain Res. 2018. PMID: 28966147 Free PMC article. Review.
-
Using functional connectivity changes associated with cognitive fatigue to delineate a fatigue network.Sci Rep. 2020 Dec 14;10(1):21927. doi: 10.1038/s41598-020-78768-3. Sci Rep. 2020. PMID: 33318529 Free PMC article. Clinical Trial.
-
When piloting health services interventions, what predicts real world behaviours? A systematic concept mapping review.BMC Med Res Methodol. 2020 Apr 6;20(1):76. doi: 10.1186/s12874-020-00955-7. BMC Med Res Methodol. 2020. PMID: 32252648 Free PMC article.
-
Distinct effects of apathy and dopamine on effort-based decision-making in Parkinson's disease.Brain. 2018 May 1;141(5):1455-1469. doi: 10.1093/brain/awy110. Brain. 2018. PMID: 29672668 Free PMC article.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials