Functional requirements for reward-modulated spike-timing-dependent plasticity
- PMID: 20926659
- PMCID: PMC6634722
- DOI: 10.1523/JNEUROSCI.6249-09.2010
Abstract
Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulation. We derive theoretical conditions for successful learning of reward-related behavior for a large class of learning rules where Hebbian synaptic plasticity is conditioned on a global modulatory factor signaling reward. We show that all learning rules in this class can be separated into a term that captures the covariance of neuronal firing and reward and a second term that presents the influence of unsupervised learning. The unsupervised term, which is, in general, detrimental for reward-based learning, can be suppressed if the neuromodulatory signal encodes the difference between the reward and the expected reward, but only if the expected reward is calculated for each task and stimulus separately. If several tasks are to be learned simultaneously, the nervous system needs an internal critic that is able to predict the expected reward for arbitrary stimuli. We show that, with a critic, reward-modulated spike-timing-dependent plasticity is capable of learning motor trajectories with a temporal resolution of tens of milliseconds. The relation to temporal difference learning, the relevance of block-based learning paradigms, and the limitations of learning with a critic are discussed.
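The separation described in the abstract can be illustrated with a small numerical sketch. It is not the paper's model. It assumes a simplified per-trial weight change of the form (R - b) * H, where R is the reward, b a baseline (the expected-reward estimate), and H a scalar standing in for the Hebbian term. Under that assumption the average change equals Cov(R, H) + (mean(R) - b) * mean(H), where the second summand plays the role of the unsupervised term. All names, the toy data, and the product form are hypothetical choices made for illustration.

```python
import random

def mean(xs):
    return sum(xs) / len(xs)

def covariance(xs, ys):
    # Population covariance (divides by n), so the identity below is exact.
    mx, my = mean(xs), mean(ys)
    return mean([(x - mx) * (y - my) for x, y in zip(xs, ys)])

def decompose(rewards, hebbian, baseline):
    """Return (total, covariance term, unsupervised term) of mean((R - b) * H)."""
    total = mean([(r - baseline) * h for r, h in zip(rewards, hebbian)])
    cov_term = covariance(rewards, hebbian)
    unsup_term = (mean(rewards) - baseline) * mean(hebbian)
    return total, cov_term, unsup_term

def make_trials(rng, mean_reward, n=1000):
    # Toy data (assumption): Hebbian term with positive mean,
    # reward weakly tied to it plus independent noise.
    hebbian = [0.5 + rng.gauss(0.0, 0.1) for _ in range(n)]
    rewards = [mean_reward + 0.3 * (h - 0.5) + rng.gauss(0.0, 0.05) for h in hebbian]
    return rewards, hebbian

rng = random.Random(0)
# Two hypothetical stimuli with different average reward.
trials = {"stimulus_A": make_trials(rng, 1.0), "stimulus_B": make_trials(rng, 0.2)}

# One baseline shared by all stimuli: the mean reward over every trial.
global_baseline = mean([r for rewards, _ in trials.values() for r in rewards])

results = {}
for name, (rewards, hebbian) in trials.items():
    shared = decompose(rewards, hebbian, global_baseline)
    specific = decompose(rewards, hebbian, mean(rewards))  # per-stimulus baseline
    results[name] = {"shared": shared, "specific": specific}
    print(name, "shared-baseline unsupervised term:", round(shared[2], 4))
    print(name, "stimulus-specific unsupervised term:", round(specific[2], 4))
```

In this toy setting the shared baseline leaves a non-zero unsupervised term for each stimulus, because each stimulus's mean reward differs from the pooled mean. The stimulus-specific baseline removes that term and leaves only the covariance term, which mirrors the abstract's condition that the expected reward be calculated for each task and stimulus separately.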