Learning the structure of the world: The adaptive nature of state-space and action representations in multi-stage decision-making
- PMID: 31490932
- PMCID: PMC6750884
- DOI: 10.1371/journal.pcbi.1007334
Abstract
State-space and action representations form the building blocks of decision-making processes in the brain; states map external cues to the current situation of the agent whereas actions provide the set of motor commands from which the agent can choose to achieve specific goals. Although these factors differ across environments, it is currently unknown whether or how accurately state and action representations are acquired by the agent because previous experiments have typically provided this information a priori through instruction or pre-training. Here we studied how state and action representations adapt to reflect the structure of the world when such a priori knowledge is not available. We used a sequential decision-making task in rats in which they were required to pass through multiple states before reaching the goal, and for which the number of states and how they map onto external cues were unknown a priori. We found that, early in training, animals selected actions as if the task was not sequential and outcomes were the immediate consequence of the most proximal action. During the course of training, however, rats recovered the true structure of the environment and made decisions based on the expanded state-space, reflecting the multiple stages of the task. Similarly, we found that the set of actions expanded with training, although the emergence of new action sequences was sensitive to the experimental parameters and specifics of the training procedure. We conclude that the profile of choices shows a gradual shift from simple representations to more complex structures compatible with the structure of the world.
Conflict of interest statement
The authors have declared that no competing interests exist.
Similar articles
- Uncovering the 'state': Tracing the hidden state representations that structure learning and decision-making. Behav Processes. 2019 Oct;167:103891. doi: 10.1016/j.beproc.2019.103891. Epub 2019 Aug 2. PMID: 31381985. Free PMC article. Review.
- Cost-benefit trade-offs in decision-making and learning. PLoS Comput Biol. 2019 Sep 6;15(9):e1007326. doi: 10.1371/journal.pcbi.1007326. eCollection 2019 Sep. PMID: 31490934. Free PMC article.
- Dorsomedial striatal contributions to different forms of risk/reward decision making. Neurobiol Learn Mem. 2021 Feb;178:107369. doi: 10.1016/j.nlm.2020.107369. Epub 2020 Dec 28. PMID: 33383183.
- Orbitofrontal State Representations Are Related to Choice Adaptations and Reward Predictions. J Neurosci. 2021 Mar 3;41(9):1941-1951. doi: 10.1523/JNEUROSCI.0753-20.2020. Epub 2021 Jan 14. PMID: 33446521. Free PMC article.
- Internally generated sequences in learning and executing goal-directed behavior. Trends Cogn Sci. 2014 Dec;18(12):647-57. doi: 10.1016/j.tics.2014.06.011. Epub 2014 Aug 23. PMID: 25156191. Review.
Cited by
- Detailed mapping of behavior reveals the formation of prelimbic neural ensembles across operant learning. Neuron. 2022 Feb 16;110(4):674-685.e6. doi: 10.1016/j.neuron.2021.11.022. Epub 2021 Dec 17. PMID: 34921779. Free PMC article.
- Value representations in the rodent orbitofrontal cortex drive learning, not choice. Elife. 2022 Aug 17;11:e64575. doi: 10.7554/eLife.64575. PMID: 35975792. Free PMC article.
- Hierarchical Action Control: Adaptive Collaboration Between Actions and Habits. Front Psychol. 2019 Dec 11;10:2735. doi: 10.3389/fpsyg.2019.02735. eCollection 2019. PMID: 31920796. Free PMC article. Review.
- The role of the lateral orbitofrontal cortex in creating cognitive maps. Nat Neurosci. 2023 Jan;26(1):107-115. doi: 10.1038/s41593-022-01216-0. Epub 2022 Dec 22. PMID: 36550290. Free PMC article.
- The Anterior Cingulate Cortex Predicts Future States to Mediate Model-Based Action Selection. Neuron. 2021 Jan 6;109(1):149-163.e7. doi: 10.1016/j.neuron.2020.10.013. Epub 2020 Nov 4. PMID: 33152266. Free PMC article.
