Unique features of stimulus-based probabilistic reversal learning
- PMID: 34460275
- PMCID: PMC9205533
- DOI: 10.1037/bne0000474
Abstract
Reversal learning paradigms are widely used assays of behavioral flexibility, and their probabilistic versions are more amenable to studying how reward outcomes are integrated over time. Prior research suggests differences between initial and reversal learning, including higher learning rates, a greater need for inhibitory control, and more perseveration after reversals. However, it is not well understood which aspects of stimulus-based reversal learning are unique to reversals, and whether and how the observed differences depend on reward probability. Here, we used a visual probabilistic discrimination and reversal learning paradigm in which male and female rats selected between a pair of stimuli associated with different reward probabilities. We compared accuracy, rewards collected, omissions, latencies, win-stay/lose-shift strategies, and indices of perseveration across two different reward probability schedules. We found that discrimination and reversal learning are behaviorally more unique than similar: fitting reinforcement learning models to choice behavior revealed a lower sensitivity to the difference in subjective reward values (greater exploration) and higher learning rates for the reversal phase. We also found that latencies to choose the better option were greater in females than in males, but only for the reversal phase. Further, animals employed more win-stay strategies during early discrimination and showed increased perseveration during early reversal learning. Interestingly, a consistent reward probability group difference emerged, with a richer environment associated with longer reward collection latencies than a leaner environment. Future studies should systematically compare the neural correlates of fine-grained behavioral measures to reveal possible dissociations in how the circuitry is recruited in each phase. (PsycInfo Database Record (c) 2021 APA, all rights reserved).
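The two model quantities named in the abstract, a learning rate and a sensitivity to the difference in subjective reward values, can be illustrated with a generic delta-rule learner and a softmax choice rule. The sketch below is only an illustration under stated assumptions: the abstract does not specify the model's exact form, so the update rule, the function names, the parameter names (`alpha`, `beta`), and the reward probabilities in the usage note are hypothetical placeholders rather than the authors' implementation. It also shows one common way to score win-stay and lose-shift from trial sequences.

```python
import math
import random


def softmax_choice_prob(q_better, q_worse, beta):
    """Probability of choosing the better option under a two-option softmax.

    beta (assumed name) is the inverse temperature: a lower beta means lower
    sensitivity to the value difference, i.e. more exploratory choices.
    """
    return 1.0 / (1.0 + math.exp(-beta * (q_better - q_worse)))


def simulate_phase(n_trials, p_better, p_worse, alpha, beta,
                   q_init=(0.0, 0.0), seed=0):
    """Simulate one phase (discrimination or reversal) of a two-choice task.

    Assumption: values are updated with a simple delta rule,
    Q <- Q + alpha * (reward - Q), where alpha is the learning rate.
    Choice 0 is the option with the higher reward probability.
    """
    rng = random.Random(seed)
    q = list(q_init)
    choices, rewards = [], []
    for _ in range(n_trials):
        p_choose_better = softmax_choice_prob(q[0], q[1], beta)
        choice = 0 if rng.random() < p_choose_better else 1
        p_reward = p_better if choice == 0 else p_worse
        reward = 1 if rng.random() < p_reward else 0
        q[choice] += alpha * (reward - q[choice])  # delta-rule update
        choices.append(choice)
        rewards.append(reward)
    return choices, rewards, q


def win_stay_lose_shift(choices, rewards):
    """Return (win-stay, lose-shift) proportions over consecutive trial pairs."""
    wins = stays = losses = shifts = 0
    for t in range(1, len(choices)):
        if rewards[t - 1] == 1:
            wins += 1
            stays += choices[t] == choices[t - 1]
        else:
            losses += 1
            shifts += choices[t] != choices[t - 1]
    win_stay = stays / wins if wins else float("nan")
    lose_shift = shifts / losses if losses else float("nan")
    return win_stay, lose_shift
```

As a usage example, `simulate_phase(200, 0.8, 0.2, alpha=0.3, beta=3.0)` returns the choice and reward sequences plus the final values; the probabilities 0.8 and 0.2 are placeholders, not the schedules used in the study. A reversal phase could be approximated by swapping the two entries of the returned values and passing them as `q_init`, so that the learner starts with a preference for the now-worse option.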
Conflict of interest statement
There is no conflict of interest or need for disclosure. One of the senior authors (Alicia Izquierdo) is an Associate Editor of Behavioral Neuroscience.
