Modeling awake hippocampal reactivations with model-based bidirectional search
- PMID: 32065253
- DOI: 10.1007/s00422-020-00817-x
Modeling awake hippocampal reactivations with model-based bidirectional search
Abstract
Hippocampal offline reactivations during reward-based learning, usually categorized as replay events, have been found to be important for performance improvement over time and for memory consolidation. Recent computational work has linked these phenomena to the need to transform reward information into state-action values for decision making and to propagate it to all relevant states of the environment. Nevertheless, it is still unclear whether an integrated reinforcement learning mechanism could account for the variety of awake hippocampal reactivations, including variety in order (forward and reverse reactivated trajectories) and variety in the location where they occur (reward site or decision-point). Here, we present a model-based bidirectional search model which accounts for a variety of hippocampal reactivations. The model combines forward trajectory sampling from current position and backward sampling through prioritized sweeping from states associated with large reward prediction errors until the two trajectories connect. This is repeated until stabilization of state-action values (convergence), which could explain why hippocampal reactivations drastically diminish when the animal's performance stabilizes. Simulations in a multiple T-maze task show that forward reactivations are prominently found at decision-points while backward reactivations are exclusively generated at reward sites. Finally, the model can generate imaginary trajectories that are not allowed to the agent during task performance. We raise some experimental predictions and implications for future studies of the role of the hippocampo-prefronto-striatal network in learning.
Keywords: Computational neuroscience; Hippocampal replay; Navigation; Reinforcement learning.
Similar articles
-
Real-time sensory-motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization.Biol Cybern. 2020 Apr;114(2):249-268. doi: 10.1007/s00422-020-00820-2. Epub 2020 Feb 24. Biol Cybern. 2020. PMID: 32095878
-
Task Demands Predict a Dynamic Switch in the Content of Awake Hippocampal Replay.Neuron. 2017 Nov 15;96(4):925-935.e6. doi: 10.1016/j.neuron.2017.09.035. Epub 2017 Oct 19. Neuron. 2017. PMID: 29056296 Free PMC article.
-
Distinct effects of reward and navigation history on hippocampal forward and reverse replays.Proc Natl Acad Sci U S A. 2020 Jan 7;117(1):689-697. doi: 10.1073/pnas.1912533117. Epub 2019 Dec 23. Proc Natl Acad Sci U S A. 2020. PMID: 31871185 Free PMC article.
-
The Role of Hippocampal Replay in Memory and Planning.Curr Biol. 2018 Jan 8;28(1):R37-R50. doi: 10.1016/j.cub.2017.10.073. Curr Biol. 2018. PMID: 29316421 Free PMC article. Review.
-
Hippocampal replays under the scrutiny of reinforcement learning models.J Neurophysiol. 2018 Dec 1;120(6):2877-2896. doi: 10.1152/jn.00145.2018. Epub 2018 Oct 10. J Neurophysiol. 2018. PMID: 30303758 Review.
Cited by
-
From spatial navigation via visual construction to episodic memory and imagination.Biol Cybern. 2020 Apr;114(2):139-167. doi: 10.1007/s00422-020-00829-7. Epub 2020 Apr 13. Biol Cybern. 2020. PMID: 32285205 Free PMC article. Review.
-
Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics.Front Neurorobot. 2022 Jun 24;16:864380. doi: 10.3389/fnbot.2022.864380. eCollection 2022. Front Neurorobot. 2022. PMID: 35812782 Free PMC article.
-
Reward prediction errors drive declarative learning irrespective of agency.Psychon Bull Rev. 2021 Dec;28(6):2045-2056. doi: 10.3758/s13423-021-01952-7. Epub 2021 Jun 15. Psychon Bull Rev. 2021. PMID: 34131890
-
A model of hippocampal replay driven by experience and environmental structure facilitates spatial learning.Elife. 2023 Mar 14;12:e82301. doi: 10.7554/eLife.82301. Elife. 2023. PMID: 36916899 Free PMC article.
-
An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning.Biomimetics (Basel). 2024 May 23;9(6):315. doi: 10.3390/biomimetics9060315. Biomimetics (Basel). 2024. PMID: 38921195 Free PMC article.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
