Entropy-based metrics for predicting choice behavior based on local response to reward
- PMID: 34772943
- PMCID: PMC8590026
- DOI: 10.1038/s41467-021-26784-w
Abstract
For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.
© 2021. The Author(s).
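The abstract does not define its metrics, so the following is only a minimal sketch of the two kinds of quantity it mentions, under stated assumptions. It assumes binary rewards (0 or 1) and two options, measures deviation from matching as the choice fraction minus the reward fraction for one option, and uses the conditional entropy of the stay/switch strategy given the previous reward outcome as one plausible entropy-based measure of local response to reward. The function names and both definitions are hypothetical illustrations, not the authors' code.

```python
import math
from collections import Counter


def matching_deviation(choices, rewards, option=1):
    """Choice fraction minus reward fraction for `option`.

    Assumed definition for illustration: zero means perfect matching,
    i.e. the share of choices equals the share of obtained reward.
    """
    n_trials = len(choices)
    total_reward = sum(rewards)
    if n_trials == 0 or total_reward == 0:
        raise ValueError("need at least one trial and one rewarded trial")
    choice_fraction = sum(1 for c in choices if c == option) / n_trials
    reward_fraction = (
        sum(r for c, r in zip(choices, rewards) if c == option) / total_reward
    )
    return choice_fraction - reward_fraction


def strategy_entropy_given_reward(choices, rewards):
    """Conditional entropy H(strategy | previous reward) in bits.

    Strategy on trial t is "stay" if the choice repeats trial t-1,
    otherwise "switch". Low values mean the previous outcome largely
    determines whether the subject stays or switches.
    """
    joint = Counter()
    for t in range(1, len(choices)):
        strategy = "stay" if choices[t] == choices[t - 1] else "switch"
        joint[(strategy, rewards[t - 1])] += 1

    n_pairs = sum(joint.values())
    if n_pairs == 0:
        raise ValueError("need at least two trials")

    # Marginal counts of the previous reward outcome.
    reward_counts = Counter()
    for (_, prev_reward), count in joint.items():
        reward_counts[prev_reward] += count

    # H(S|R) = -sum_{s,r} p(s,r) * log2( p(s,r) / p(r) )
    entropy = 0.0
    for (_, prev_reward), count in joint.items():
        entropy -= (count / n_pairs) * math.log2(count / reward_counts[prev_reward])
    return entropy
```

With these definitions, a pure win-stay/lose-switch sequence gives an entropy of 0 bits, while a subject whose stay/switch decisions are independent of the previous outcome and equally likely gives 1 bit.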
Conflict of interest statement
The authors declare no competing interests.
Similar articles
- Nutrient-Sensitive Reinforcement Learning in Monkeys. J Neurosci. 2023 Mar 8;43(10):1714-1730. doi: 10.1523/JNEUROSCI.0752-22.2022. Epub 2023 Jan 20. PMID: 36669886. Free PMC article.
- The ubiquity of model-based reinforcement learning. Curr Opin Neurobiol. 2012 Dec;22(6):1075-81. doi: 10.1016/j.conb.2012.08.003. Epub 2012 Sep 6. PMID: 22959354. Free PMC article. Review.
- Mechanisms of adjustments to different types of uncertainty in the reward environment across mice and monkeys. Cogn Affect Behav Neurosci. 2023 Jun;23(3):600-619. doi: 10.3758/s13415-022-01059-z. Epub 2023 Feb 23. PMID: 36823249. Free PMC article.
- Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making. PLoS Comput Biol. 2021 Jun 3;17(6):e1009070. doi: 10.1371/journal.pcbi.1009070. eCollection 2021 Jun. PMID: 34081705. Free PMC article.
- [Neural mechanisms of decision making]. Brain Nerve. 2008 Sep;60(9):1017-27. PMID: 18807936. Review. Japanese.
Cited by
- Ventrolateral prefrontal cortex in macaques guides decisions in different learning contexts. bioRxiv [Preprint]. 2024 Sep 19:2024.09.18.613767. doi: 10.1101/2024.09.18.613767. PMID: 39345480. Free PMC article. Preprint.
- Chronic Ethanol Exposure Produces Sex-Dependent Impairments in Value Computations in the Striatum. bioRxiv [Preprint]. 2024 Dec 19:2024.03.10.584332. doi: 10.1101/2024.03.10.584332. PMID: 38585868. Free PMC article. Preprint.
- The role of rat prelimbic cortex in decision making. bioRxiv [Preprint]. 2024 Mar 19:2024.03.18.585593. doi: 10.1101/2024.03.18.585593. PMID: 38562679. Free PMC article. Preprint.
- Tracking subjects' strategies in behavioural choice experiments at trial resolution. Elife. 2024 Mar 1;13:e86491. doi: 10.7554/eLife.86491. PMID: 38426402. Free PMC article.
- An Information Theoretic Approach to Symbolic Learning in Synthetic Languages. Entropy (Basel). 2022 Feb 10;24(2):259. doi: 10.3390/e24020259. PMID: 35205553. Free PMC article.
