Synaptic theory of replicator-like melioration
- PMID: 20617184
- PMCID: PMC2896075
- DOI: 10.3389/fncom.2010.00017
Synaptic theory of replicator-like melioration
Abstract
According to the theory of Melioration, organisms in repeated choice settings shift their choice preference in favor of the alternative that provides the highest return. The goal of this paper is to explain how this learning behavior can emerge from microscopic changes in the efficacies of synapses, in the context of a two-alternative repeated-choice experiment. I consider a large family of synaptic plasticity rules in which changes in synaptic efficacies are driven by the covariance between reward and neural activity. I construct a general framework that predicts the learning dynamics of any decision-making neural network that implements this synaptic plasticity rule and show that melioration naturally emerges in such networks. Moreover, the resultant learning dynamics follows the Replicator equation which is commonly used to phenomenologically describe changes in behavior in operant conditioning experiments. Several examples demonstrate how the learning rate of the network is affected by its properties and by the specifics of the plasticity rule. These results help bridge the gap between cellular physiology and learning behavior.
Keywords: operant conditioning; reinforcement learning; synaptic plasticity.
Figures
Similar articles
-
Robustness of learning that is based on covariance-driven synaptic plasticity.PLoS Comput Biol. 2008 Mar 7;4(3):e1000007. doi: 10.1371/journal.pcbi.1000007. PLoS Comput Biol. 2008. PMID: 18369414 Free PMC article.
-
A biophysically based neural model of matching law behavior: melioration by stochastic synapses.J Neurosci. 2006 Apr 5;26(14):3731-44. doi: 10.1523/JNEUROSCI.5159-05.2006. J Neurosci. 2006. PMID: 16597727 Free PMC article.
-
Statistical mechanics of reward-modulated learning in decision-making networks.Neural Comput. 2012 May;24(5):1230-70. doi: 10.1162/NECO_a_00264. Epub 2012 Feb 1. Neural Comput. 2012. PMID: 22295982
-
Neural plasticity and behavior - sixty years of conceptual advances.J Neurochem. 2016 Oct;139 Suppl 2:179-199. doi: 10.1111/jnc.13580. Epub 2016 Mar 10. J Neurochem. 2016. PMID: 26875778 Review.
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
Cited by
-
Selectionist and evolutionary approaches to brain function: a critical appraisal.Front Comput Neurosci. 2012 Apr 26;6:24. doi: 10.3389/fncom.2012.00024. eCollection 2012. Front Comput Neurosci. 2012. PMID: 22557963 Free PMC article.
-
Dynamical regimes in neural network models of matching behavior.Neural Comput. 2013 Dec;25(12):3093-112. doi: 10.1162/NECO_a_00522. Epub 2013 Sep 18. Neural Comput. 2013. PMID: 24047324 Free PMC article.
-
Melioration Learning in Two-Person Games.PLoS One. 2016 Nov 16;11(11):e0166708. doi: 10.1371/journal.pone.0166708. eCollection 2016. PLoS One. 2016. PMID: 27851815 Free PMC article.
-
Spike-based decision learning of Nash equilibria in two-player games.PLoS Comput Biol. 2012;8(9):e1002691. doi: 10.1371/journal.pcbi.1002691. Epub 2012 Sep 27. PLoS Comput Biol. 2012. PMID: 23028289 Free PMC article.
-
Striatal action-value neurons reconsidered.Elife. 2018 May 31;7:e34248. doi: 10.7554/eLife.34248. Elife. 2018. PMID: 29848442 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Miscellaneous
