Dissociable reward and timing signals in human midbrain and ventral striatum

Miriam C Klein-Flügge; Laurence T Hunt; Dominik R Bach; Raymond J Dolan; Timothy E J Behrens

doi:10.1016/j.neuron.2011.08.024

Neuron. 2011 Nov 17;72(4):654-64. doi: 10.1016/j.neuron.2011.08.024.

Authors

Miriam C Klein-Flügge¹, Laurence T Hunt, Dominik R Bach, Raymond J Dolan, Timothy E J Behrens

Affiliation

¹ Sobell Department of Motor Neuroscience and Movement Disorders, Institute of Neurology, UCL, London WC1N 3BG, UK. m.klein@ucl.ac.uk

Abstract

Reward prediction error (RPE) signals are central to current models of reward-learning. Temporal difference (TD) learning models posit that these signals should be modulated by predictions, not only of magnitude but also timing of reward. Here we show that BOLD activity in the VTA conforms to such TD predictions: responses to unexpected rewards are modulated by a temporal hazard function and activity between a predictive stimulus and reward is depressed in proportion to predicted reward. By contrast, BOLD activity in ventral striatum (VS) does not reflect a TD RPE, but instead encodes a signal on the variable relevant for behavior, here timing but not magnitude of reward. The results have important implications for dopaminergic models of cortico-striatal learning and suggest a modification of the conventional view that VS BOLD necessarily reflects inputs from dopaminergic VTA neurons signaling an RPE.
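
To make the term "temporal hazard function" concrete, the following minimal sketch computes a discrete hazard function and a simplified one-step prediction error. It is an illustration only, not the authors' model or analysis: the function names (`hazard`, `delivery_error`, `omission_error`), the four equally likely delays, and the assumption that the momentary reward expectation equals hazard times magnitude are all hypothetical choices made for this example. A full TD model would additionally depend on the state representation and discounting.

```python
from typing import List


def hazard(delay_probs: List[float]) -> List[float]:
    """Discrete hazard: P(reward at step t | no reward before step t).

    delay_probs[t] is the (assumed) probability that the reward
    arrives at time step t after the predictive stimulus.
    """
    hazards = []
    survival = 1.0  # probability that the reward has not arrived yet
    for p in delay_probs:
        hazards.append(p / survival if survival > 1e-12 else 0.0)
        survival -= p
    return hazards


def delivery_error(magnitude: float, h: float) -> float:
    """Simplified error when reward arrives at a step with hazard h.

    Assumption: the momentary expectation is h * magnitude, so the
    error is magnitude - h * magnitude.
    """
    return magnitude * (1.0 - h)


def omission_error(magnitude: float, h: float) -> float:
    """Simplified error when no reward arrives at a step with hazard h."""
    return -magnitude * h


# Hypothetical example: four equally likely reward delays.
delay_probs = [0.25, 0.25, 0.25, 0.25]
h = hazard(delay_probs)

for t, h_t in enumerate(h):
    print(t, round(h_t, 3),
          round(delivery_error(1.0, h_t), 3),
          round(omission_error(1.0, h_t), 3))
```

Under these assumptions the hazard rises as time passes without reward, so a reward delivered late is less surprising than one delivered early, and an omission becomes increasingly negative.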

Publication types

Comparative Study
Randomized Controlled Trial
Research Support, Non-U.S. Gov't

MeSH terms

Adult
Basal Ganglia / physiology*
Female
Humans
Magnetic Resonance Imaging / methods
Male
Mesencephalon / physiology*
Photic Stimulation / methods
Psychomotor Performance / physiology*
Reward*
Time Factors
Young Adult
