Transformation of a temporal speech cue to a spatial neural code in human auditory cortex

Neal P Fox; Matthew Leonard; Matthias J Sjerps; Edward F Chang

doi:10.7554/eLife.53051

Transformation of a temporal speech cue to a spatial neural code in human auditory cortex

Elife. 2020 Aug 25:9:e53051. doi: 10.7554/eLife.53051.

Authors

Neal P Fox¹, Matthew Leonard¹, Matthias J Sjerps^{2

3}, Edward F Chang^{1

4}

Affiliations

¹ Department of Neurological Surgery, University of California, San Francisco, San Francisco, United States.
² Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, Netherlands.
³ Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands.
⁴ Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, United States.

Abstract

In speech, listeners extract continuously-varying spectrotemporal cues from the acoustic signal to perceive discrete phonetic categories. Spectral cues are spatially encoded in the amplitude of responses in phonetically-tuned neural populations in auditory cortex. It remains unknown whether similar neurophysiological mechanisms encode temporal cues like voice-onset time (VOT), which distinguishes sounds like /b/ and/p/. We used direct brain recordings in humans to investigate the neural encoding of temporal speech cues with a VOT continuum from /ba/ to /pa/. We found that distinct neural populations respond preferentially to VOTs from one phonetic category, and are also sensitive to sub-phonetic VOT differences within a population's preferred category. In a simple neural network model, simulated populations tuned to detect either temporal gaps or coincidences between spectral cues captured encoding patterns observed in real neural data. These results demonstrate that a spatial/amplitude neural code underlies the cortical representation of both spectral and temporal speech cues.

Keywords: auditory cortex; categorical perception; electrocorticography; human; neuroscience; speech; temporal processing; voice-onset time (VOT).

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Auditory Cortex / physiology*
Cues
Humans
Speech Perception / physiology*
Speech*
Voice

Abstract

Publication types

MeSH terms

Grants and funding