Habit formation coincides with shifts in reinforcement representations in the sensorimotor striatum

Kyle S Smith; Ann M Graybiel

doi:10.1152/jn.00925.2015

Habit formation coincides with shifts in reinforcement representations in the sensorimotor striatum

J Neurophysiol. 2016 Mar;115(3):1487-98. doi: 10.1152/jn.00925.2015. Epub 2016 Jan 6.

Authors

Kyle S Smith¹, Ann M Graybiel²

Affiliations

¹ Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hampshire; and kyle.s.smith@dartmouth.edu.
² McGovern Institute for Brain Research and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts.

Abstract

Evaluating outcomes of behavior is a central function of the striatum. In circuits engaging the dorsomedial striatum, sensitivity to goal value is accentuated during learning, whereas outcome sensitivity is thought to be minimal in the dorsolateral striatum and its habit-related corticostriatal circuits. However, a distinct population of projection neurons in the dorsolateral striatum exhibits selective sensitivity to rewards. Here, we evaluated the outcome-related signaling in such neurons as rats performed an instructional T-maze task for two rewards. As the rats formed maze-running habits and then changed behavior after reward devaluation, we detected outcome-related spike activity in 116 units out of 1,479 recorded units. During initial training, nearly equal numbers of these units fired preferentially either after rewarded runs or after unrewarded runs, and the majority were responsive at only one of two reward locations. With overtraining, as habits formed, firing in nonrewarded trials almost disappeared, and reward-specific firing declined. Thus error-related signaling was lost, and reward signaling became generalized. Following reward devaluation, in an extinction test, postgoal activity was nearly undetectable, despite accurate running. Strikingly, when rewards were then returned, postgoal activity reappeared and recapitulated the original early response pattern, with nearly equal numbers responding to rewarded and unrewarded runs and to single rewards. These findings demonstrate that outcome evaluation in the dorsolateral striatum is highly plastic and tracks stages of behavioral exploration and exploitation. These signals could be a new target for understanding compulsive behaviors that involve changes to dorsal striatum function.

Keywords: basal ganglia; electrophysiology; learning; reinforcement.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Animals
Corpus Striatum / cytology
Corpus Striatum / physiology*
Evoked Potentials, Somatosensory
Exploratory Behavior
Goals
Habituation, Psychophysiologic*
Male
Neurons / physiology
Rats
Rats, Sprague-Dawley
Reward*

Abstract

Publication types

MeSH terms

Grants and funding