Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jun 8;36(23):6242-57.
doi: 10.1523/JNEUROSCI.0157-16.2016.

Cholinergic Interneurons Use Orbitofrontal Input to Track Beliefs about Current State

Affiliations

Cholinergic Interneurons Use Orbitofrontal Input to Track Beliefs about Current State

Thomas A Stalnaker et al. J Neurosci. .

Erratum in

Abstract

When conditions change, organisms need to learn about the changed conditions without interfering with what they already know. To do so, they can assign the new learning to a new "state" and the old learning to a previous state. This state assignment is fundamental to behavioral flexibility. Cholinergic interneurons (CINs) in the dorsomedial striatum (DMS) are necessary for associative information to be compartmentalized in this way, but the mechanism by which they do so is unknown. Here we addressed this question by recording putative CINs from the DMS in rats performing a task consisting of a series of trial blocks, or states, that required the recall and application of contradictory associative information. We found that individual CINs in the DMS represented the current state throughout each trial. These state correlates were not observed in dorsolateral striatal CINs recorded in the same rats. Notably, DMS CIN ensembles tracked rats' beliefs about the current state such that, when states were miscoded, rats tended to make suboptimal choices reflecting the miscoding. State information held by the DMS CINs also depended completely on the orbitofrontal cortex, an area that has been proposed to signal environmental states. These results suggest that CINs set the stage for recalling associative information relevant to the current environment by maintaining a real-time representation of the current state. Such a role has novel implications for understanding the neural basis of a variety of psychiatric diseases, such as addiction or anxiety disorders, in which patients generalize inappropriately (or fail to generalize) between different environments.

Significance statement: Striatal cholinergic interneurons (CINs) are thought to be identical to tonically active neurons. These neurons have long been thought to have an important influence on striatal processing during reward-related learning. Recently, a more specific function for striatal CINs has been suggested, which is that they are necessary for striatal learning to be compartmentalized into different states as the state of the environment changes. Here we report that putative CINs appear to track rats' beliefs about which environmental state is current. We further show that this property of CINs depends on orbitofrontal cortex input and is correlated with choices made by rats. These findings could provide new insight into neuropsychiatric diseases that involve improper generalization between different contexts.

Keywords: cholinergic; orbitofrontal; rat; single unit; striatum.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The task, lesions, behavior, and recording locations. The task (a) had four different blocks, or states, defined by the set of available response–reward contingencies. Trials began with an instructional odor, indicating a free-choice or forced-choice, after which rats responded at one of the two fluid wells for 1 or 3 drops of chocolate or vanilla milk. Reward contingencies were stable across blocks of ∼60 trials, but at unsignaled transitions the number of drops or flavor changed on both sides (only 1 of the 4 possible block sequences is shown). Unilateral neurotoxic lesions of orbitofrontal cortex (OFC) (b) were made in one group of rats (numbers are millimeters from bregma). Groups with sham lesions or unilateral lesions of OFC were similarly sensitive to changes in the number of drops (c), and similarly insensitive to changes in the flavor (d). Line figures show average trial-by-trial choice rates across transitions; bar graphs summarize these data by showing average choice rates in the last 20 trials of the previous block versus the first 20 of the new block; scatter plots show rat-by-rat difference scores (choice rate after minus choice rate before) with length of lines showing SEs. e, The approximate locations of recordings and proportions of putative cell types in each of the four groups. The width of each box represents 1 mm. FSI, Fast-spiking interneuron. *p < 0.001.
Figure 2.
Figure 2.
Cell-type separation. a, MSNs were first separated from other neurons [including fast-spiking interneurons (FSIs)] using a three-dimensional cluster analysis on all recorded units (left). Subsequently, putative CINs were selected by taking the average spike-to-spike CV2 of every unit’s entire spike-train. The criteria for CINs was CV2 <0.85 and average firing rate <8 spikes/s, which were based on measurements from histochemically confirmed CINs reported in published juxtacellular recordings. These criteria effectively discriminated CINs from separately defined MSNs (right; CV2 distribution, low-firing rate MSNs versus putative CINs). The upper limit of the CIN criterion is outside of the 1-a confidence interval for low-firing rate MSNs with a = 0.028. b, Resulting average waveforms for each cell type in each recording group (the dotted line represents the 0 voltage level. Scale bar, 100 ms. Shading represents SE). Note that CINs were not selected based on waveform, whereas MSNs and FSIs were. FSIs were not analyzed for this report.
Figure 3.
Figure 3.
DMS CINs track the current state (i.e., block) across trials. Plots in a and b show a bin-by-bin average index of block-selectivity for the entire DMS (n = 26) and DLS (n = 16) CIN populations, respectively, averaged by each neuron’s preferred direction (defined as direction with highest average firing rate during the choice movement; left) or anti-preferred direction (right). Only correct forced-choice trials after the first 20 correct trials in blocks were included in these averages. Bins were aligned to multiple trial events separated by the average time between them. Block-selectivity scores of individual DMS CINs were strongly correlated between different epochs across the trial (c), but this was not true of DLS CINs (b). c, d, Left, Scores for pre-odor epoch versus post-reward epoch; other pairs of epochs are summarized on the right. All R2 values for DMS CINs were significant at p < 0.001, average R2 = 0.61. For DLS CINs, none of thx R2 values were significant at p < 0.05 (p values > 0.13); average R2 = 0.07.
Figure 4.
Figure 4.
DMS CIN pseudo-ensembles decode the current state (i.e., block) across the trials. The top plot shows block decoding accuracy of a pseudo-ensemble of 25 DMS CINs versus MSNs across the trial. All rewarded trials after the first 20 correct trials in blocks were used in the decoder, regardless of direction. Bottom plots show decoding accuracy as a function of ensemble size for three 1 s epochs across the trial. **p < 0.01.
Figure 5.
Figure 5.
On suboptimal (i.e., inappropriate) choices, DMS CINs miscoded the block as the one that would have been appropriate for that choice. Top plots show percentage accurate block decoding (a) and percentage miscoding as the opposite drop-number block (b), by a pseudo-ensemble of 12 CINs or MSNs on suboptimal choices. Twelve CINs were recorded in sessions with enough such free-choices. Bottom plots show accurate decoding and miscoding as a function of ensemble size for three 1 s epochs across the trial. c, Bars show average (±SE) reaction time by trial type. Reaction times were faster for big reward than small reward, showing that faster reaction times occurred when more valuable rewards were predicted. However, free-choices of the small reward, suboptimal choices, were still significantly faster than forced-choices of the small reward, suggesting an inaccurate (and perhaps optimistic) prediction as to the number of drops to be received on those trials. **p < 0.01.
Figure 6.
Figure 6.
Decoder performance on free-choices of the big reward (i.e., optimal choices) compared with that on free-choices of the small reward (i.e., suboptimal choices). a, c, Top plots show block decoding accuracy of a pseudo-ensemble of 12 DMS CINs (a) or 12 DMS MSNs (c), during sliding 500 ms epochs across the trial. The same procedures were used as for the analysis shown in Figure 5, except that only free-choices of the big reward were tested. For comparison, data from small free-choices (suboptimal choices), shown in Figure 5, is also plotted here in dotted lines. Bottom plots show decoding accuracy as a function of ensemble size for the same three epochs shown in Figure 4. b, d, Top plots show percentage of test trials in which the block decoder from the corresponding plot (in a and c, respectively) misclassified the block as the one with the opposite drop-number rewards but the same flavor. Bottom plots show the misclassification percentage as a function of ensemble size. *p < 0.05 **p < 0.01.
Figure 7.
Figure 7.
MSNs block decoding is “yoked” to CIN decoding during execution of suboptimal choices. a–d, Top plots show block decoding accuracy of pseudo-ensembles of MSNs recorded in the same sessions as CINs. a, c, MSN decoding yoked to CIN decoding on suboptimal choices, meaning the percentage of trials on which MSN ensembles identified the same block as the CIN ensembles. b, d, Block decoding accuracy of the same MSN ensembles as a and c (i.e., not yoked to CINs). Bottom plots show decoding accuracy as a function of ensemble size for three 500 ms epochs across the trial. (starting 1000 ms before choice initiation, 100 ms before choice initiation, and at first reward delivery, respectively). **p < 0.01 compared with 0.
Figure 8.
Figure 8.
DMS CINs also coded the direction of free-choice before and after it occurred better than MSNs. Top plot shows choice-direction decoding accuracy of a pseudo-ensemble of 25 CINs or MSNs. Only free-choice trials across entire sessions were used; therefore significant decoding of direction before the choice could only reflect an internal intention or state. Bottom plots show decoding accuracy as a function of ensemble size for three 1 s epochs across the trial. **p < 0.001.
Figure 9.
Figure 9.
OFC lesions eliminate state encoding in DMS CINs. As in Figure 3, plots in a and b show a bin-by-bin average index of block-selectivity for the entire DMS (a) and DLS (b) CIN population in OFC-lesioned rats compared with those in sham rats. Block-selectivity scores of individual CINs in c and d show that OFC lesions eliminated the consistency of block-selectivity across the trial in DMS CINs (c) without affecting that in DLS CINs (d). Left, Scores in a pre-odor epoch versus post-reward epoch; other pairs of epochs are summarized on the right.
Figure 10.
Figure 10.
OFC lesions eliminate state encoding in DMS CINs. As in Figure 4, top plot shows block decoding accuracy of a pseudo-ensemble of 19 DMS CINs in OFC-lesioned rats (only 19 neurons were recorded in blocks with a sufficient number of trials) versus those in sham rats, also with a pseudo-ensemble of 19 neurons for comparison. Bottom plots show decoding accuracy as a function of ensemble size for three 1 s epochs across the trial. **p < 0.01.
Figure 11.
Figure 11.
DMS CINs recorded in rats with OFC lesions decode trial stimuli better than those in sham rats. Top plots show odor identity (a) and reward number (b) decoding accuracy of a pseudo-ensemble of 24 DMS CINs in shams versus OFC lesions, during sliding 200 ms epochs across the trial. All correct trials were included and therefore for odor identity, three odors had to be decoded (the forced-choice left, forced-choice right, and free-choice odors). Note that because of correction trials and the pseudorandom sequence of trials, some limited information about the upcoming trial-type could be derived before odor delivery began. Bottom plots show decoding accuracy as a function of ensemble size for three epochs each: for odor decoding, 1 s ending with the beginning of odor delivery, the 500 ms of odor delivery, and the 1 s immediately following odor delivery; for reward number decoding, 1 s ending with the beginning of reward delivery, the period of reward delivery and consumption (1500 ms beginning with delivery of the first drop of reward), and 1 s immediately after the end of the reward delivery epoch. **p < 0.01.
Figure 12.
Figure 12.
Ipsilateral OFC lesions eliminate the relationship between free-choices of the small reward and miscoding of the block in DMS CINs. Top plots show percentage accurate block decoding (a) and percentage miscoding as the opposite drop-number block (b), by a pseudo-ensemble of 12 CINs in sham rats or ipsilateral OFC-lesioned rats on suboptimal choices. Bottom plots show accurate decoding and miscoding as a function of ensemble size for three 1 s epochs across the trial. c, Bars show average (±SE) reaction time by trial type in lesioned rats. The pattern of reaction times was the same as in sham rats (Fig. 5); that is, they were faster for big reward than small reward, but free-choices of the small reward, suboptimal choices, were similarly fast to those when a big reward was expected, suggesting an inaccurate prediction as to the number of drops to be received on those trials. d, Top plot shows choice-direction decoding accuracy of a pseudo-ensemble of 24 CINs in sham rats or ipsilateral OFC-lesioned rats. Only free-choice trials were used; therefore significant decoding of direction before the choice could only reflect an internal intention or state. Bottom plots show decoding accuracy as a function of ensemble size for three 1 s epochs across the trial. **p < 0.01.

Similar articles

Cited by

References

    1. Aoki S, Liu AW, Zucca A, Zucca S, Wickens JR. Role of striatal cholinergic interneurons in set-shifting in the rat. J Neurosci. 2015;35:9424–9431. doi: 10.1523/JNEUROSCI.0490-15.2015. - DOI - PMC - PubMed
    1. Aosaki T, Tsubokawa H, Ishida A, Watanabe K, Graybiel AM, Kimura M. Responses of tonically active neurons in the primate's striatum undergo systematic changes during behavioral sensorimotor conditioning. J Neurosci. 1994;14:3969–3984. - PMC - PubMed
    1. Apicella P. Tonically active neurons in the primate striatum and their role in the processing of information about motivationally relevant events. Eur J Neurosci. 2002;16:2017–2026. doi: 10.1046/j.1460-9568.2002.02262.x. - DOI - PubMed
    1. Apicella P. Leading tonically active neurons of the striatum from reward detection to context recognition. Trends Neurosci. 2007;30:299–306. doi: 10.1016/j.tins.2007.03.011. - DOI - PubMed
    1. Ashby FG, Crossley MJ. A computational model of how cholinergic interneurons protect striatal-dependent learning. J Cogn Neurosci. 2011;23:1549–1566. doi: 10.1162/jocn.2010.21523. - DOI - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources