Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Aug;7(8):e1002123.
doi: 10.1371/journal.pcbi.1002123. Epub 2011 Aug 18.

Understanding auditory spectro-temporal receptive fields and their changes with input statistics by efficient coding principles

Affiliations

Understanding auditory spectro-temporal receptive fields and their changes with input statistics by efficient coding principles

Lingyun Zhao et al. PLoS Comput Biol. 2011 Aug.

Abstract

Spectro-temporal receptive fields (STRFs) have been widely used as linear approximations to the signal transform from sound spectrograms to neural responses along the auditory pathway. Their dependence on statistical attributes of the stimuli, such as sound intensity, is usually explained by nonlinear mechanisms and models. Here, we apply an efficient coding principle which has been successfully used to understand receptive fields in early stages of visual processing, in order to provide a computational understanding of the STRFs. According to this principle, STRFs result from an optimal tradeoff between maximizing the sensory information the brain receives, and minimizing the cost of the neural activities required to represent and transmit this information. Both terms depend on the statistical properties of the sensory inputs and the noise that corrupts them. The STRFs should therefore depend on the input power spectrum and the signal-to-noise ratio, which is assumed to increase with input intensity. We analytically derive the optimal STRFs when signal and noise are approximated as Gaussians. Under the constraint that they should be spectro-temporally local, the STRFs are predicted to adapt from being band-pass to low-pass filters as the input intensity reduces, or the input correlation becomes longer range in sound frequency or time. These predictions qualitatively match physiological observations. Our prediction as to how the STRFs should be determined by the input power spectrum could readily be tested, since this spectrum depends on the stimulus ensemble. The potentials and limitations of the efficient coding principle are discussed.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A schematic example of a typical spectro-temporal receptive field, plotted with a reversed abscissa.
This STRF has one excitatory and three inhibitory regions, prefers frequency formula image, and evokes response at a typical latency formula image. Since the response at time formula image is formula image, an input stimulus formula image exactly as depicted in this plot is most likely to elicit a large response formula image at time formula image, or indeed a spike.
Figure 2
Figure 2. Formulation and components of efficient coding.
(A) A schematic plot of the efficient encoding transform. (B) Signal transformation in the auditory system. The cochlea turns the time-varying waveform formula image into a time-frequency representation formula image, as the population activities of the auditory nerves, which is the input to the efficient encoding system. Signal and noise pass through a series of brain nuclei such as cochlear nucleus, superior olive, inferior colliculus, etc. The current work proposes that the effective transform STRF of the spectrogram that is collectively realized by these nuclei is, in its linear form, the optimal filter formula image implied by the efficient coding principle. The output formula image is the activity of neurons in a higher nucleus. (C) Three steps of signal flow within the linear encoding step formula image or STRF in (A) and (B). Note that these three steps are merely abstract algorithmic steps, rather than neural implementation processes for the effective transform formula image or STRF.
Figure 3
Figure 3. Simulation of the efficient spectral kernel SRF, when the temporal dimension is omitted.
(A) 250 samples of input spectra formula image, each of which is smoothed Gaussian white noise in the frequency domain (equations (11–13), formula image). (B) Correlation between different frequency channels formula image. Left: Correlation formula image; Right: an zoomed-in view, as formula image vs formula image. (C) Ten examples of eigenvectors formula image of the correlation matrix formula image in B; each is an independent component in formula image. Smaller indices formula image are associated with larger eigenvalues. (D) Gain profile (peaking at formula image), and signal and noise power in decorrelated channels. (E) Four examples (formula image, formula image, formula image, and formula image) of spectral receptive fields formula image; each prefers input frequencies around formula image.
Figure 4
Figure 4. The effect of signal-to-noise ratio (SNR) on gain and the spectral receptive field (SRF).
Same stimulus ensemble as in Figure 3A except the overall SNR has been scaled by formula image. (A) Gain control (red), signal (blue), and noise power (black) under high, medium and low SNR. (B) The corresponding SRFs of one output neuron (channel #120) in the three SNR cases.
Figure 5
Figure 5. Adaptation of gain and spectral filter kernel SRF to input correlations under high/low SNR.
Same input ensemble as that in Figure 3A, except that the smoothing parameter, formula image and formula image, are set for short and long range correlations, respectively. Analogous figure format as in Figure 4, with added illustrations of the adaptation to input correlations. The thick and thin curves correspond to quantities for inputs with large and small correlations respectively, blue/red curves plot signal power formula image and gain formula image respectively.
Figure 6
Figure 6. Simulation of temporal receptive field TRF, when the spectral dimension is omitted.
The same stimulus ensemble is used as in Figure 3A, except the factor formula image in equation (12) to ensure translation invariance of correlation. (A;B) Demonstration of transforming an acausal temporal filter (A) to its causal minimum-phase counterpart (B) at a relatively high input SNR. (C) TRF for a relatively low input SNR.
Figure 7
Figure 7. The 2D STRFs/MTFs implied by efficient coding and found physiologically.
(A) input power formula image (equation (15), formula image, formula image) in decorrelated channels. (B, C) MTF profile formula image and the corresponding STRFs with two SNRs (scaled by formula image's). (D) formula image and STRF as in B;C (when formula image) except with larger input correlations (formula image, formula image in equation (15)). (E;F) Modulation transfer functions (MTFs) and their properties at low and high input sound intensities averaged over 40 IC neurons from Lesica and Grothe . Here, formula image is the spectral-temporal modulation frequency where the MTF peaks. Modulation frequencies in E and F are normalized by the same value across cells and intensities. Error bars in E indicate standard errors. The magnitude patterns of the MTFs for all neurons are normalized to peak value formula image. Their average across neurons at each input intensity is then normalized to the same peak value and displayed in F.

Similar articles

Cited by

References

    1. Aertsen AM, Johannesma PI. The spectro-temporal receptive field. A functional characteristic of auditory neurons. Biol Cybern. 1981;42:133–43. - PubMed
    1. Escabi MA, Schreiner CE. Nonlinear spectrotemporal sound analysis by neurons in the auditory midbrain. J Neurosci. 2002;22:4114–4131. - PMC - PubMed
    1. Klein DJ, Depireux DA, Simon JZ, Shamma SA. Robust spectrotemporal reverse correlation for the auditory system: optimizing stimulus design. J Comput Neurosci. 2000;9:85–111. - PubMed
    1. Theunissen FE, Sen K, Doupe AJ. Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds. J Neurosci. 2000;20:2315–31. - PMC - PubMed
    1. Eggermont JJ, Johannesma PIM, Aertsen AMHJ. Reverse-correlation methods in auditory research. Q Rev Biophys. 1983;16:341–414. - PubMed

Publication types