Dynamic excitatory and inhibitory gain modulation can produce flexible, robust and optimal decision-making
- PMID: 23825935
- PMCID: PMC3694816
- DOI: 10.1371/journal.pcbi.1003099
Dynamic excitatory and inhibitory gain modulation can produce flexible, robust and optimal decision-making
Abstract
Behavioural and neurophysiological studies in primates have increasingly shown the involvement of urgency signals during the temporal integration of sensory evidence in perceptual decision-making. Neuronal correlates of such signals have been found in the parietal cortex, and in separate studies, demonstrated attention-induced gain modulation of both excitatory and inhibitory neurons. Although previous computational models of decision-making have incorporated gain modulation, their abstract forms do not permit an understanding of the contribution of inhibitory gain modulation. Thus, the effects of co-modulating both excitatory and inhibitory neuronal gains on decision-making dynamics and behavioural performance remain unclear. In this work, we incorporate time-dependent co-modulation of the gains of both excitatory and inhibitory neurons into our previous biologically based decision circuit model. We base our computational study in the context of two classic motion-discrimination tasks performed in animals. Our model shows that by simultaneously increasing the gains of both excitatory and inhibitory neurons, a variety of the observed dynamic neuronal firing activities can be replicated. In particular, the model can exhibit winner-take-all decision-making behaviour with higher firing rates and within a significantly more robust model parameter range. It also exhibits short-tailed reaction time distributions even when operating near a dynamical bifurcation point. The model further shows that neuronal gain modulation can compensate for weaker recurrent excitation in a decision neural circuit, and support decision formation and storage. Higher neuronal gain is also suggested in the more cognitively demanding reaction time than in the fixed delay version of the task. Using the exact temporal delays from the animal experiments, fast recruitment of gain co-modulation is shown to maximize reward rate, with a timescale that is surprisingly near the experimentally fitted value. Our work provides insights into the simultaneous and rapid modulation of excitatory and inhibitory neuronal gains, which enables flexible, robust, and optimal decision-making.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
or
) and the choice targets (
). (B) Sample gain modulation on a single-cell input-output relation. Dotted to solid curves show effect of increasing gain. Inset: Temporal evolution of gain, light to dark colours show decreasing time constant of gain modulation. (C) Timecourse of input currents. (D) Firing rate of an upstream MT neuron encoding motion stimulus, when motion is into or out of its response field (RF).
Hz, compare with Figure 1. (B,C) Accuracy (B) and mean RT (C) generated by model and in the experiment of . Psychometric functions fitted with a Weibull function,
:, where
is the discrimination threshold at which performance is at
correct, and
yields the slope of the psychometric function.
and
of the model (experiment):
and
(
and
), respectively. Error bars denote standard errors. (D,E) RT distributions generated by model and in the experiment for correct (D) and error (E) trials. Upper, middle, lower panels:
,
,
coherence, respectively.
,
,
. Orange and green curves represent nullclines: where
and
, respectively. (B) Fixation with targets:
,
,
. Grey regions show the basin of attraction for the symmetric (on-diagonal) attractor (see text for definition). (C) Fixation, with targets and gain onset:
. Solid (dashed) nullclines:
,
(
,
). Solid nullclines: with inhibitory gain onset prior to excitatory gain onset; dashed nullclines: with both inhibitory and early stage of excitatory gain increase. (D) Fixation, with targets, gain modulation, and motion stimulus (zero coherence):
,
,
. Closed and open circles represent stable (attractors) and unstable steady states, respectively. Diagonal line (stable manifold) and the curve to which off-diagonal trajectories near an unstable steady state are repelled to (unstable manifold), are shown in black and denoted by arrows moving towards or away from the unstable steady state, respectively. Blue: a sample trial with the corresponding epochs in a trial in bold and labeled in inset, grey lines denote the unobserved part of the simulation trial after saccade initiation. Red: denotes the time when the phase plane was viewed. Note the different scales between the top and bottom panels.
with zero motion coherence. Black: without gain modulation,
. Grey: gains increase to
. Solid and dashed lines are the stable and unstable steady-states, respectively. Double horizontal arrows show the range where a symmetric unstable steady state (dashed symmetric curves) co-exists with asymmetric stable steady states (upper and lower stable branches). These are the dynamic ranges of decision-making under these two conditions. Circle, triangle and square represent the fitted firing rate for the net stimulus input during fixation, target and motion periods, respectively. Vertical dashed double arrows show the winner-take-all effect (from the square) during motion stimulus and gain increase, either transiting to the upper winning branch or lower losing branch. Note that with gain modulation, the upper branch is mostly higher than the
Hz response threshold, enabling saccade initiation to the winning direction.
(A) and inhibitory gain
(B). Arrows in (A) and (B) show direction of change as
or
varies, respectively. Vertical dashed lines partition regimes of
in (A) and
in (B), respectively. A regime can have a single symmetric stable steady state, which is either low (LSS) or high (HSS), or multiple stable steady states: one symmetric and two asymmetric, with two asymmetric unstable steady states. The symmetric steady state can be low (LMS) or high (HMS). Or it may have a symmetric unstable steady state with asymmetric stable and unstable steady states. This constitutes the decision-making (DM) regime. (C,D) Sample activity timecourses showing either no winner-take-all behaviour (C) or divergence at low firing rates, when the excitatory (inhibitory) gain is increased (decreased) in isolation (C), or when the excitatory (inhibitory) gain is decreased (increased) in isolation (D), respectively.
and
, respectively. A regime can have (i) a symmetric low single stable steady state (LSS), (ii) a symmetric high single stable state (HSS), (iii) multiple stable steady states: one low symmetric and two asymmetric (LMS) with two asymmetric unstable steady states, (iv) multiple stable steady states: one high symmetric and two asymmetric (HMS) with two asymmetric unstable steady states and (v) a symmetric unstable steady state with asymmetric stable and unstable steady states, which constitutes the decision-making (DM) regime. Compare with Figure 6 A,B. The regimes of
are analysed for different net-stimulus inputs (
), i.e. during (A) fixation, (B) target, and (C) target and motion. Black dots show our fitted
parameters during these epochs, with black and open dots showing the fitted parameters during the RT and FD tasks, respectively.
and
). motion stimulus duration: 1 second. Delay period: 1 second. (B) Activity timecourse of model averaged across trials. Inset: Accuracy from model simulations, compared with that in RT task. Coherence threshold
and slope
of model (experiment):
and
(
and
), respectively. The ratio
is similar to that in the experiment.
, with weak recurrent excitation (
) in the absence of gain modulation (
), with low gains (
,
) and large gains (
,
). Solid and dashed lines show stable and unstable steady states, respectively. (B) Without any gain modulation, the network cannot perform decision-making nor store decisions. (C) With sufficient gain modulation, network can form a decision and store it below a motor threshold (dotted horizontal line at 70 Hz). Upon cue to respond, higher set of gains allow threshold to be crossed and saccade to be made.
. (A) mean reward rate (RR); (B) accuracy; (C) mean trial duration (TD). Dashed horizontal lines show our model's fit to the data, with
ms. Inset: zoomed in around the optimal timescale, showing the optimal timescale is
.
ms (dashed), compared with experimental data (open circles), and our previous model's fit to it with
ms (black bold). Dash-dotted: model with an optimal gain time constant
for each motion coherence. Lighter colours in the lower panel are for error RTs. Error bars denote standard errors of the mean. (B) Comparing optimal
for each individual motion coherence (triangles, dash-dotted curve) with model fit (
ms, (black bold line)), and single optimal (
ms, (dashed line)). (C) Comparing RR for optimal
for each individual coherence with that of model fit (
ms), and single optimal (
ms). Dotted: average of the RR with
over all motion coherences.Similar articles
-
Flexible control of representational dynamics in a disinhibition-based model of decision-making.Elife. 2023 Jun 1;12:e82426. doi: 10.7554/eLife.82426. Elife. 2023. PMID: 37261426 Free PMC article.
-
Optimization of decision making in multilayer networks: the role of locus coeruleus.Neural Comput. 2008 Dec;20(12):2863-94. doi: 10.1162/neco.2008.03-07-487. Neural Comput. 2008. PMID: 18624653
-
Predicting Perceptual Decisions Using Visual Cortical Population Responses and Choice History.J Neurosci. 2019 Aug 21;39(34):6714-6727. doi: 10.1523/JNEUROSCI.0035-19.2019. Epub 2019 Jun 24. J Neurosci. 2019. PMID: 31235648 Free PMC article.
-
Reward-dependent learning in neuronal networks for planning and decision making.Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0. Prog Brain Res. 2000. PMID: 11105649 Review.
-
[Neural mechanisms of decision making].Brain Nerve. 2008 Sep;60(9):1017-27. Brain Nerve. 2008. PMID: 18807936 Review. Japanese.
Cited by
-
Choice selective inhibition drives stability and competition in decision circuits.Nat Commun. 2023 Jan 10;14(1):147. doi: 10.1038/s41467-023-35822-8. Nat Commun. 2023. PMID: 36627310 Free PMC article.
-
Decisions are expedited through multiple neural adjustments spanning the sensorimotor hierarchy.Nat Commun. 2018 Sep 7;9(1):3627. doi: 10.1038/s41467-018-06117-0. Nat Commun. 2018. PMID: 30194305 Free PMC article.
-
A circuit mechanism for decision-making biases and NMDA receptor hypofunction.Elife. 2020 Sep 29;9:e53664. doi: 10.7554/eLife.53664. Elife. 2020. PMID: 32988455 Free PMC article.
-
Of monkeys and men: Impatience in perceptual decision-making.Psychon Bull Rev. 2016 Jun;23(3):738-49. doi: 10.3758/s13423-015-0958-5. Psychon Bull Rev. 2016. PMID: 26518307 Free PMC article. Review.
-
Global gain modulation generates time-dependent urgency during perceptual choice in humans.Nat Commun. 2016 Nov 24;7:13526. doi: 10.1038/ncomms13526. Nat Commun. 2016. PMID: 27882927 Free PMC article.
References
-
- Gold JI, Shadlen MN (2007) The neural basis of decision making. Annu Rev Neurosci 30: 535–574. - PubMed
-
- Luce RD (1986) Response Times: Their Role in Inferring Elementary Mental Organization. New York (New York): Oxford Univ. Press. 577 p.
-
- Andersen RA, Buneo CA (2002) Intentional maps in posterior parietal cortex. Annu Rev Neurosci 25: 189–220. - PubMed
-
- Shadlen MN, Gold JI (2004) The neurophysiology of decision-making as a window on cognition. In: Gazzaniga MS, editor, The Cognitive Neurosciences, 3rd edition. MIT Press.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
