A reward-modulated Hebbian learning rule can explain experimentally observed network reorganization in a brain control task
- PMID: 20573887
- PMCID: PMC2917246
- DOI: 10.1523/JNEUROSCI.4284-09.2010
Abstract
It has recently been shown in a brain-computer interface experiment that motor cortical neurons change their tuning properties selectively to compensate for errors induced by displaced decoding parameters. In particular, it was shown that the three-dimensional tuning curves of neurons whose decoding parameters were reassigned changed more than those of neurons whose decoding parameters had not been reassigned. In this article, we propose a simple learning rule that can reproduce this effect. Our learning rule uses Hebbian weight updates driven by a global reward signal and neuronal noise. In contrast to most previously proposed learning rules, this approach does not require extrinsic information to separate noise from signal. The learning rule is able to optimize the performance of a model system within biologically realistic periods of time under high noise levels. Furthermore, when the model parameters are matched to data recorded during the brain-computer interface learning experiments described above, the model produces learning effects strikingly similar to those found in the experiments.
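The kind of rule the abstract describes can be illustrated with a minimal sketch. The abstract only states that the rule combines Hebbian weight updates, a global reward signal, and neuronal noise, so everything else below is an assumption made for illustration: the toy linear network, the squared-error reward, the per-pattern running averages (a stand-in for temporal low-pass filtering), and all names and parameter values are hypothetical and are not taken from the article. The update multiplies the presynaptic input by the deviation of the noisy activation from its recent average and by the deviation of the reward from its recent average, so no separate access to the noise term is needed.

```python
import numpy as np


def train_reward_modulated_hebbian(n_in=20, n_out=3, n_patterns=8,
                                   n_trials=20000, eta=0.2,
                                   noise_std=0.2, decay=0.8, seed=0):
    """Toy reward-modulated Hebbian learning on a linear network.

    Hypothetical setup (not the article's model): the network must learn
    a fixed linear map for a small set of input patterns, and only a
    scalar reward (negative squared error) is fed back.
    """
    rng = np.random.default_rng(seed)

    # Fixed unit-norm input patterns and a random target mapping.
    patterns = rng.normal(size=(n_patterns, n_in))
    patterns /= np.linalg.norm(patterns, axis=1, keepdims=True)
    w_target = rng.normal(size=(n_out, n_in))

    w = np.zeros((n_out, n_in))           # learned weights
    a_bar = np.zeros((n_patterns, n_out))  # running mean activation per pattern
    r_bar = np.zeros(n_patterns)           # running mean reward per pattern

    def mean_error(weights):
        # Noise-free squared error, averaged over all patterns.
        diff = patterns @ (weights - w_target).T
        return float(np.mean(np.sum(diff ** 2, axis=1)))

    initial_error = mean_error(w)

    for _ in range(n_trials):
        p = rng.integers(n_patterns)
        x = patterns[p]

        # Noisy activation: the noise is the only source of exploration.
        a = w @ x + noise_std * rng.normal(size=n_out)

        # Global scalar reward shared by all synapses.
        reward = -np.sum((a - w_target @ x) ** 2)

        # Hebbian term (activation deviation times input), gated by the
        # deviation of the reward from its recent average.
        w += eta * (reward - r_bar[p]) * np.outer(a - a_bar[p], x)

        # Update the running averages after the weight change.
        a_bar[p] = decay * a_bar[p] + (1 - decay) * a
        r_bar[p] = decay * r_bar[p] + (1 - decay) * reward

    return initial_error, mean_error(w)


if __name__ == "__main__":
    before, after = train_reward_modulated_hebbian()
    print(f"mean squared error before: {before:.3f}, after: {after:.3f}")
```

In this sketch the learning signal comes entirely from the correlation between noise-induced activation fluctuations and reward fluctuations: when a random deviation happens to raise the reward above its recent average, the weights move so as to make that deviation more likely.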