Validation of decision-making models and analysis of decision variables in the rat basal ganglia
- PMID: 19657038
- PMCID: PMC6666589
- DOI: 10.1523/JNEUROSCI.6157-08.2009
Validation of decision-making models and analysis of decision variables in the rat basal ganglia
Abstract
Reinforcement learning theory plays a key role in understanding the behavioral and neural mechanisms of choice behavior in animals and humans. Especially, intermediate variables of learning models estimated from behavioral data, such as the expectation of reward for each candidate choice (action value), have been used in searches for the neural correlates of computational elements in learning and decision making. The aims of the present study are as follows: (1) to test which computational model best captures the choice learning process in animals and (2) to elucidate how action values are represented in different parts of the corticobasal ganglia circuit. We compared different behavioral learning algorithms to predict the choice sequences generated by rats during a free-choice task and analyzed associated neural activity in the nucleus accumbens (NAc) and ventral pallidum (VP). The major findings of this study were as follows: (1) modified versions of an action-value learning model captured a variety of choice strategies of rats, including win-stay-lose-switch and persevering behavior, and predicted rats' choice sequences better than the best multistep Markov model; and (2) information about action values and future actions was coded in both the NAc and VP, but was less dominant than information about trial types, selected actions, and reward outcome. The results of our model-based analysis suggest that the primary role of the NAc and VP is to monitor information important for updating choice behaviors. Information represented in the NAc and VP might contribute to a choice mechanism that is situated elsewhere.
Figures
Similar articles
-
The ventral striato-pallidal pathway mediates the effect of predictive learning on choice between goal-directed actions.J Neurosci. 2013 Aug 21;33(34):13848-60. doi: 10.1523/JNEUROSCI.1697-13.2013. J Neurosci. 2013. PMID: 23966704 Free PMC article.
-
Optogenetic Dissection of Temporal Dynamics of Amygdala-Striatal Interplay during Risk/Reward Decision Making.eNeuro. 2018 Dec 10;5(6):ENEURO.0422-18.2018. doi: 10.1523/ENEURO.0422-18.2018. eCollection 2018 Nov-Dec. eNeuro. 2018. PMID: 30627636 Free PMC article.
-
Ventral pallidum encodes relative reward value earlier and more robustly than nucleus accumbens.Nat Commun. 2018 Oct 19;9(1):4350. doi: 10.1038/s41467-018-06849-z. Nat Commun. 2018. PMID: 30341305 Free PMC article.
-
[Mathematical models of decision making and learning].Brain Nerve. 2008 Jul;60(7):791-8. Brain Nerve. 2008. PMID: 18646619 Review. Japanese.
-
Navigating complex decision spaces: Problems and paradigms in sequential choice.Psychol Bull. 2014 Mar;140(2):466-86. doi: 10.1037/a0033455. Epub 2013 Jul 8. Psychol Bull. 2014. PMID: 23834192 Free PMC article. Review.
Cited by
-
Separate populations of neurons in ventral striatum encode value and motivation.PLoS One. 2013 May 28;8(5):e64673. doi: 10.1371/journal.pone.0064673. Print 2013. PLoS One. 2013. PMID: 23724077 Free PMC article.
-
Differential recruitment of ventral pallidal e-types by behaviorally salient stimuli during Pavlovian conditioning.iScience. 2021 Mar 31;24(4):102377. doi: 10.1016/j.isci.2021.102377. eCollection 2021 Apr 23. iScience. 2021. PMID: 33912818 Free PMC article.
-
Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation.PLoS Comput Biol. 2016 Oct 13;12(10):e1005145. doi: 10.1371/journal.pcbi.1005145. eCollection 2016 Oct. PLoS Comput Biol. 2016. PMID: 27736881 Free PMC article.
-
Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats.Eur J Neurosci. 2012 Apr;35(7):1180-9. doi: 10.1111/j.1460-9568.2012.08025.x. Eur J Neurosci. 2012. PMID: 22487046 Free PMC article.
-
Reinforcement Learning during Adolescence in Rats.J Neurosci. 2020 Jul 22;40(30):5857-5870. doi: 10.1523/JNEUROSCI.0910-20.2020. Epub 2020 Jun 29. J Neurosci. 2020. PMID: 32601244 Free PMC article.
References
-
- Barraclough DJ, Conroy ML, Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nat Neurosci. 2004;7:404–410. - PubMed
-
- Cardinal RN. Neural systems implicated in delayed and probabilistic reinforcement. Neural Netw. 2006;19:1277–1301. - PubMed
-
- Chang JY, Chen L, Luo F, Shi LH, Woodward DJ. Neuronal responses in the frontal cortico-basal ganglia system during delayed matching-to-sample task: ensemble recording in freely moving rats. Exp Brain Res. 2002;142:67–80. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Miscellaneous