Spectral and temporal cues for speech recognition: implications for auditory prostheses

Hear Res. 2008 Aug;242(1-2):132-40. doi: 10.1016/j.heares.2007.12.010. Epub 2007 Dec 28.

Abstract

Features of stimulation important for speech recognition in people with normal hearing and in people using implanted auditory prostheses include spectral information represented by place of stimulation along the tonotopic axis and temporal information represented in low-frequency envelopes of the signal. The relative contributions of these features to speech recognition and their interactions have been studied using vocoder-like simulations of cochlear implant speech processors presented to listeners with normal hearing. In these studies, spectral/place information was manipulated by varying the number of channels and the temporal-envelope information was manipulated by varying the lowpass cutoffs of the envelope extractors. Consonant and vowel recognition in quiet reached plateau at 8 and 12 channels and lowpass cutoff frequencies of 16 Hz and 4 Hz, respectively. Phoneme (especially vowel) recognition in noise required larger numbers of channels. Lexical tone recognition required larger numbers of channels and higher lowpass cutoff frequencies. There was a tradeoff between spectral/place and temporal-envelope requirements. Most current auditory prostheses seem to deliver adequate temporal-envelope information, but the number of effective channels is suboptimal, particularly for speech recognition in noise, lexical tone recognition, and music perception.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustics
  • Auditory Pathways / physiology
  • Auditory Perception / physiology*
  • Cochlear Implants*
  • Cues*
  • Electric Stimulation
  • Humans
  • Language
  • Pattern Recognition, Physiological / physiology*
  • Pitch Perception / physiology
  • Speech Perception / physiology*