A robotic model of hippocampal reverse replay for reinforcement learning

Matthew T Whelan; Alejandro Jimenez-Rodriguez; Tony J Prescott; Eleni Vasilaki

doi:10.1088/1748-3190/ac9ffc

A robotic model of hippocampal reverse replay for reinforcement learning

Bioinspir Biomim. 2022 Dec 2;18(1). doi: 10.1088/1748-3190/ac9ffc.

Authors

Matthew T Whelan^{1

2}, Alejandro Jimenez-Rodriguez^{1

2}, Tony J Prescott^{1

2}, Eleni Vasilaki^{1

2}

Affiliations

¹ Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.
² Sheffield Robotics, Sheffield, United Kingdom.

PMID: 36327454
DOI: 10.1088/1748-3190/ac9ffc

Abstract

Hippocampal reverse replay, a phenomenon in which recently active hippocampal cells reactivate in the reverse order, is thought to contribute to learning, particularly reinforcement learning (RL), in animals. Here, we present a novel computational model which exploits reverse replay to improve stability and performance on a homing task. The model takes inspiration from the hippocampal-striatal network, and learning occurs via a three-factor RL rule. To augment this model with hippocampal reverse replay, we derived a policy gradient learning rule that associates place-cell activity with responses in cells representing actions and a supervised learning rule of the same form, interpreting the replay activity as a 'target' frequency. We evaluated the model using a simulated robot spatial navigation task inspired by the Morris water maze. Results suggest that reverse replay can improve performance stability over multiple trials. Our model exploits reverse reply as an additional source for propagating information about desirable synaptic changes, reducing the requirements for long-time scales in eligibility traces combined with low learning rates. We conclude that reverse replay can positively contribute to RL, although less stable learning is possible in its absence. Analogously, we postulate that reverse replay may enhance RL in the mammalian hippocampal-striatal system rather than provide its core mechanism.

Keywords: computational neuroscience; hippocampal reply; reinforcement learning; robotics.

Creative Commons Attribution license.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Animals
Hippocampus / physiology
Mammals
Reinforcement, Psychology
Robotic Surgical Procedures*
Robotics*
Spatial Navigation* / physiology