Supporting generalization in non-human primate behavior by tapping into structural knowledge: Examples from sensorimotor mappings, inference, and decision-making
- PMID: 33454361
- PMCID: PMC8096669
- DOI: 10.1016/j.pneurobio.2021.101996
Supporting generalization in non-human primate behavior by tapping into structural knowledge: Examples from sensorimotor mappings, inference, and decision-making
Abstract
The complex behaviors we ultimately wish to understand are far from those currently used in systems neuroscience laboratories. A salient difference are the closed loops between action and perception prominently present in natural but not laboratory behaviors. The framework of reinforcement learning and control naturally wades across action and perception, and thus is poised to inform the neurosciences of tomorrow, not only from a data analyses and modeling framework, but also in guiding experimental design. We argue that this theoretical framework emphasizes active sensing, dynamical planning, and the leveraging of structural regularities as key operations for intelligent behavior within uncertain, time-varying environments. Similarly, we argue that we may study natural task strategies and their neural circuits without over-training animals when the tasks we use tap into our animal's structural knowledge. As proof-of-principle, we teach animals to navigate through a virtual environment - i.e., explore a well-defined and repetitive structure governed by the laws of physics - using a joystick. Once these animals have learned to 'drive', without further training they naturally (i) show zero- or one-shot learning of novel sensorimotor contingencies, (ii) infer the evolving path of dynamically changing latent variables, and (iii) make decisions consistent with maximizing reward rate. Such task designs allow for the study of flexible and generalizable, yet controlled, behaviors. In turn, they allow for the exploitation of pillars of intelligence - flexibility, prediction, and generalization -, properties whose neural underpinning have remained elusive.
Keywords: Cognitive map; Flexibility; Generalization; Learning set; Natural behavior; Reinforcement learning.
Copyright © 2021 Elsevier Ltd. All rights reserved.
Figures
Similar articles
-
Sensorimotor learning biases choice behavior: a learning neural field model for decision making.PLoS Comput Biol. 2012;8(11):e1002774. doi: 10.1371/journal.pcbi.1002774. Epub 2012 Nov 15. PLoS Comput Biol. 2012. PMID: 23166483 Free PMC article.
-
Emphasizing the "positive" in positive reinforcement: using nonbinary rewarding for training monkeys on cognitive tasks.J Neurophysiol. 2018 Jul 1;120(1):115-128. doi: 10.1152/jn.00572.2017. Epub 2018 Apr 4. J Neurophysiol. 2018. PMID: 29617217
-
Multiple memory systems as substrates for multiple decision systems.Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15. Neurobiol Learn Mem. 2015. PMID: 24846190 Free PMC article.
-
Evidence Brief: The Effectiveness Of Mandatory Computer-Based Trainings On Government Ethics, Workplace Harassment, Or Privacy And Information Security-Related Topics [Internet].Washington (DC): Department of Veterans Affairs (US); 2014 May. Washington (DC): Department of Veterans Affairs (US); 2014 May. PMID: 27606391 Free Books & Documents. Review.
-
Mechanisms of reinforcement learning and decision making in the primate dorsolateral prefrontal cortex.Ann N Y Acad Sci. 2007 May;1104:108-22. doi: 10.1196/annals.1390.007. Epub 2007 Mar 8. Ann N Y Acad Sci. 2007. PMID: 17347332 Review.
Cited by
-
Causal inference during closed-loop navigation: parsing of self- and object-motion.bioRxiv [Preprint]. 2023 Jan 30:2023.01.27.525974. doi: 10.1101/2023.01.27.525974. bioRxiv. 2023. Update in: Philos Trans R Soc Lond B Biol Sci. 2023 Sep 25;378(1886):20220344. doi: 10.1098/rstb.2022.0344 PMID: 36778376 Free PMC article. Updated. Preprint.
-
Cognitive, Systems, and Computational Neurosciences of the Self in Motion.Annu Rev Psychol. 2022 Jan 4;73:103-129. doi: 10.1146/annurev-psych-021021-103038. Epub 2021 Sep 21. Annu Rev Psychol. 2022. PMID: 34546803 Free PMC article. Review.
-
Alternative female and male developmental trajectories in the dynamic balance of human visual perception.Sci Rep. 2022 Jan 31;12(1):1674. doi: 10.1038/s41598-022-05620-1. Sci Rep. 2022. PMID: 35102227 Free PMC article.
-
Sensory Evidence Accumulation Using Optic Flow in a Naturalistic Navigation Task.J Neurosci. 2022 Jul 6;42(27):5451-5462. doi: 10.1523/JNEUROSCI.2203-21.2022. Epub 2022 May 31. J Neurosci. 2022. PMID: 35641186 Free PMC article.
-
Coding of latent variables in sensory, parietal, and frontal cortices during closed-loop virtual navigation.Elife. 2022 Oct 25;11:e80280. doi: 10.7554/eLife.80280. Elife. 2022. PMID: 36282071 Free PMC article.
References
-
- Balzani E, Lakshminarasimhan K, Angelaki D, Savin C (2020). Efficient estimation of neural tuning during naturalistic behavior. NeurIPS
-
- Banino A, Barry C, Uria B, Blundell C, Lillicrap T, Mirowski P, Pritzel A, Chadwick MJ, Degris T, Modayil J, et al. (2018). Vector-based navigation using grid-like representations in artificial agents. Nature 557, 429–433. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous
