Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
- PMID: 14687542
- DOI: 10.1016/s0896-6273(03)00761-x
Abstract
It is well known that chemical synaptic transmission is an unreliable process, but the function of such unreliability remains unclear. Here I consider the hypothesis that the randomness of synaptic transmission is harnessed by the brain for learning, in analogy to the way that genetic mutation is utilized by Darwinian evolution. This is possible if synapses are "hedonistic," responding to a global reward signal by increasing their probabilities of vesicle release or failure, depending on which action immediately preceded reward. Hedonistic synapses learn by computing a stochastic approximation to the gradient of the average reward. They are compatible with synaptic dynamics such as short-term facilitation and depression and with the intricacies of dendritic integration and action potential generation. A network of hedonistic synapses can be trained to perform a desired computation by administering reward appropriately, as illustrated here through numerical simulations of integrate-and-fire model neurons.
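The learning rule described in the abstract can be illustrated with a toy example. The following is a minimal sketch for a single synapse, not the paper's simulation: the logistic parameterization of the release probability, the decaying eligibility trace, and all names and parameter values (`release_probability`, `train_synapse`, `eta`, `decay`) are assumptions made for illustration. Reward is delivered after whichever action (release or failure) the experimenter chooses to reinforce, and the synapse shifts its release probability toward that action.

```python
import math
import random


def release_probability(q):
    """Logistic map from an internal parameter q to a release probability (assumed form)."""
    return 1.0 / (1.0 + math.exp(-q))


def train_synapse(reward_on_release=True, steps=2000, eta=0.1, decay=0.5, seed=0):
    """Train one stochastic synapse with a global reward signal.

    Hypothetical setup: reward is 1 when the synapse takes the reinforced
    action (release or failure) and 0 otherwise.
    """
    rng = random.Random(seed)
    q = 0.0      # internal parameter; release probability starts at 0.5
    trace = 0.0  # eligibility trace remembering recent release/failure events
    for _ in range(steps):
        p = release_probability(q)
        released = rng.random() < p
        # Score-function term: +(1 - p) after a release, -p after a failure.
        trace = decay * trace + ((1.0 - p) if released else -p)
        reward = 1.0 if released == reward_on_release else 0.0
        # Reward times eligibility is a stochastic estimate of the
        # gradient of the average reward with respect to q.
        q += eta * reward * trace
    return release_probability(q)


if __name__ == "__main__":
    print("reward follows release:", train_synapse(reward_on_release=True))
    print("reward follows failure:", train_synapse(reward_on_release=False))
```

Under these assumptions, the release probability should drift toward 1 when release is rewarded and toward 0 when failure is rewarded. The update is a REINFORCE-style estimator; a network version would apply the same global reward to every synapse's own trace.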
Similar articles
- Learning in neural networks by reinforcement of irregular spiking. Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Apr;69(4 Pt 1):041909. doi: 10.1103/PhysRevE.69.041909. Epub 2004 Apr 30. PMID: 15169045
- Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting. Chin J Physiol. 2010 Dec 31;53(6):447-53. PMID: 21793357
- Reinforcement Learning in Spiking Neural Networks with Stochastic and Deterministic Synapses. Neural Comput. 2019 Dec;31(12):2368-2389. doi: 10.1162/neco_a_01238. Epub 2019 Oct 15. PMID: 31614099
- A review of the integrate-and-fire neuron model: I. Homogeneous synaptic input. Biol Cybern. 2006 Jul;95(1):1-19. doi: 10.1007/s00422-006-0068-6. Epub 2006 Apr 19. PMID: 16622699 Review.
- Propagation of synchronous spiking activity in feedforward neural networks. J Physiol Paris. 1996;90(3-4):243-7. doi: 10.1016/s0928-4257(97)81432-5. PMID: 9116676 Review.
Cited by
- Matched pre- and post-synaptic changes underlie synaptic plasticity over long time scales. J Neurosci. 2013 Apr 10;33(15):6257-66. doi: 10.1523/JNEUROSCI.3740-12.2013. PMID: 23575825 Free PMC article.
- Gradient estimation in dendritic reinforcement learning. J Math Neurosci. 2012 Feb 15;2(1):2. doi: 10.1186/2190-8567-2-2. PMID: 22657827 Free PMC article.
- Temporal structure in associative retrieval. Elife. 2015 Jan 23;4:e04919. doi: 10.7554/eLife.04919. PMID: 25615722 Free PMC article.
- Cerebellar learning using perturbations. Elife. 2018 Nov 12;7:e31599. doi: 10.7554/eLife.31599. PMID: 30418871 Free PMC article.
- Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLoS Biol. 2005 May;3(5):e153. doi: 10.1371/journal.pbio.0030153. Epub 2005 Mar 29. PMID: 15826219 Free PMC article.
