Understanding auditory spectro-temporal receptive fields and their changes with input statistics by efficient coding principles
- PMID: 21887121
- PMCID: PMC3158037
- DOI: 10.1371/journal.pcbi.1002123
Understanding auditory spectro-temporal receptive fields and their changes with input statistics by efficient coding principles
Abstract
Spectro-temporal receptive fields (STRFs) have been widely used as linear approximations to the signal transform from sound spectrograms to neural responses along the auditory pathway. Their dependence on statistical attributes of the stimuli, such as sound intensity, is usually explained by nonlinear mechanisms and models. Here, we apply an efficient coding principle which has been successfully used to understand receptive fields in early stages of visual processing, in order to provide a computational understanding of the STRFs. According to this principle, STRFs result from an optimal tradeoff between maximizing the sensory information the brain receives, and minimizing the cost of the neural activities required to represent and transmit this information. Both terms depend on the statistical properties of the sensory inputs and the noise that corrupts them. The STRFs should therefore depend on the input power spectrum and the signal-to-noise ratio, which is assumed to increase with input intensity. We analytically derive the optimal STRFs when signal and noise are approximated as Gaussians. Under the constraint that they should be spectro-temporally local, the STRFs are predicted to adapt from being band-pass to low-pass filters as the input intensity reduces, or the input correlation becomes longer range in sound frequency or time. These predictions qualitatively match physiological observations. Our prediction as to how the STRFs should be determined by the input power spectrum could readily be tested, since this spectrum depends on the stimulus ensemble. The potentials and limitations of the efficient coding principle are discussed.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
, and evokes response at a typical latency
. Since the response at time
is
, an input stimulus
exactly as depicted in this plot is most likely to elicit a large response
at time
, or indeed a spike.
into a time-frequency representation
, as the population activities of the auditory nerves, which is the input to the efficient encoding system. Signal and noise pass through a series of brain nuclei such as cochlear nucleus, superior olive, inferior colliculus, etc. The current work proposes that the effective transform STRF of the spectrogram that is collectively realized by these nuclei is, in its linear form, the optimal filter
implied by the efficient coding principle. The output
is the activity of neurons in a higher nucleus. (C) Three steps of signal flow within the linear encoding step
or STRF in (A) and (B). Note that these three steps are merely abstract algorithmic steps, rather than neural implementation processes for the effective transform
or STRF.
, each of which is smoothed Gaussian white noise in the frequency domain (equations (11–13),
). (B) Correlation between different frequency channels
. Left: Correlation
; Right: an zoomed-in view, as
vs
. (C) Ten examples of eigenvectors
of the correlation matrix
in B; each is an independent component in
. Smaller indices
are associated with larger eigenvalues. (D) Gain profile (peaking at
), and signal and noise power in decorrelated channels. (E) Four examples (
,
,
, and
) of spectral receptive fields
; each prefers input frequencies around
.
. (A) Gain control (red), signal (blue), and noise power (black) under high, medium and low SNR. (B) The corresponding SRFs of one output neuron (channel #120) in the three SNR cases.
and
, are set for short and long range correlations, respectively. Analogous figure format as in Figure 4, with added illustrations of the adaptation to input correlations. The thick and thin curves correspond to quantities for inputs with large and small correlations respectively, blue/red curves plot signal power
and gain
respectively.
in equation (12) to ensure translation invariance of correlation. (A;B) Demonstration of transforming an acausal temporal filter (A) to its causal minimum-phase counterpart (B) at a relatively high input SNR. (C) TRF for a relatively low input SNR.
(equation (15),
,
) in decorrelated channels. (B, C) MTF profile
and the corresponding STRFs with two SNRs (scaled by
's). (D)
and STRF as in B;C (when
) except with larger input correlations (
,
in equation (15)). (E;F) Modulation transfer functions (MTFs) and their properties at low and high input sound intensities averaged over 40 IC neurons from Lesica and Grothe . Here,
is the spectral-temporal modulation frequency where the MTF peaks. Modulation frequencies in E and F are normalized by the same value across cells and intensities. Error bars in E indicate standard errors. The magnitude patterns of the MTFs for all neurons are normalized to peak value
. Their average across neurons at each input intensity is then normalized to the same peak value and displayed in F.Similar articles
-
Plasticity of Multidimensional Receptive Fields in Core Rat Auditory Cortex Directed by Sound Statistics.Neuroscience. 2021 Jul 15;467:150-170. doi: 10.1016/j.neuroscience.2021.04.028. Epub 2021 May 2. Neuroscience. 2021. PMID: 33951506 Free PMC article.
-
Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds.J Neurosci. 2000 Mar 15;20(6):2315-31. doi: 10.1523/JNEUROSCI.20-06-02315.2000. J Neurosci. 2000. PMID: 10704507 Free PMC article.
-
Context dependence of spectro-temporal receptive fields with implications for neural coding.Hear Res. 2011 Jan;271(1-2):123-32. doi: 10.1016/j.heares.2010.01.014. Epub 2010 Feb 1. Hear Res. 2011. PMID: 20123121 Review.
-
Sustained firing of model central auditory neurons yields a discriminative spectro-temporal representation for natural sounds.PLoS Comput Biol. 2013;9(3):e1002982. doi: 10.1371/journal.pcbi.1002982. Epub 2013 Mar 28. PLoS Comput Biol. 2013. PMID: 23555217 Free PMC article.
-
Incorporating behavioral and sensory context into spectro-temporal models of auditory encoding.Hear Res. 2018 Mar;360:107-123. doi: 10.1016/j.heares.2017.12.021. Epub 2017 Dec 31. Hear Res. 2018. PMID: 29331232 Free PMC article. Review.
Cited by
-
Frequency-specific adaptation and its underlying circuit model in the auditory midbrain.Front Neural Circuits. 2015 Oct 1;9:55. doi: 10.3389/fncir.2015.00055. eCollection 2015. Front Neural Circuits. 2015. PMID: 26483641 Free PMC article.
-
Sparse codes for speech predict spectrotemporal receptive fields in the inferior colliculus.PLoS Comput Biol. 2012;8(7):e1002594. doi: 10.1371/journal.pcbi.1002594. Epub 2012 Jul 12. PLoS Comput Biol. 2012. PMID: 22807665 Free PMC article.
-
Computational principles underlying recognition of acoustic signals in grasshoppers and crickets.J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2015 Jan;201(1):61-71. doi: 10.1007/s00359-014-0946-7. Epub 2014 Sep 26. J Comp Physiol A Neuroethol Sens Neural Behav Physiol. 2015. PMID: 25258206 Review.
-
A Survey on Probabilistic Models in Human Perception and Machines.Front Robot AI. 2020 Jul 7;7:85. doi: 10.3389/frobt.2020.00085. eCollection 2020. Front Robot AI. 2020. PMID: 33501252 Free PMC article. Review.
-
Sensory cortex is optimized for prediction of future input.Elife. 2018 Jun 18;7:e31557. doi: 10.7554/eLife.31557. Elife. 2018. PMID: 29911971 Free PMC article.
References
-
- Aertsen AM, Johannesma PI. The spectro-temporal receptive field. A functional characteristic of auditory neurons. Biol Cybern. 1981;42:133–43. - PubMed
-
- Klein DJ, Depireux DA, Simon JZ, Shamma SA. Robust spectrotemporal reverse correlation for the auditory system: optimizing stimulus design. J Comput Neurosci. 2000;9:85–111. - PubMed
-
- Eggermont JJ, Johannesma PIM, Aertsen AMHJ. Reverse-correlation methods in auditory research. Q Rev Biophys. 1983;16:341–414. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
