Speech recognition with primarily temporal cues

R V Shannon; F G Zeng; V Kamath; J Wygonski; M Ekelid

doi:10.1126/science.270.5234.303

Science. 1995 Oct 13;270(5234):303-4. doi: 10.1126/science.270.5234.303.

Authors

R V Shannon¹, F G Zeng, V Kamath, J Wygonski, M Ekelid

Affiliation

¹ House Ear Institute, Los Angeles, CA 90057, USA.

PMID: 7569981

Abstract

Nearly perfect speech recognition was observed under conditions of greatly reduced spectral information. Temporal envelopes of speech were extracted from broad frequency bands and were used to modulate noises of the same bandwidths. This manipulation preserved temporal envelope cues in each band but restricted the listener to severely degraded information on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance was obtained with only three bands of modulated noise. Thus, the presentation of a dynamic temporal pattern in only a few broad spectral regions is sufficient for the recognition of speech.
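
The manipulation described in the abstract (band envelopes used to modulate band-limited noise) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no filter designs, band edges, or envelope cutoff, so the brick-wall FFT filters, the half-wave rectification, the 50 Hz envelope cutoff, and the names fft_filter and noise_vocode are all assumptions chosen for illustration.

```python
import numpy as np

def fft_filter(x, fs, lo, hi):
    """Keep only FFT bins in [lo, hi) Hz (idealized brick-wall filter; an assumption)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    spectrum[(freqs < lo) | (freqs >= hi)] = 0.0
    return np.fft.irfft(spectrum, n=len(x))

def noise_vocode(signal, fs, band_edges, env_cutoff=50.0, seed=0):
    """Replace each band of `signal` with noise carrying that band's temporal envelope.

    band_edges: ascending frequencies in Hz; N+1 edges define N bands.
    env_cutoff: envelope low-pass cutoff in Hz (hypothetical default).
    """
    signal = np.asarray(signal, dtype=float)
    rng = np.random.default_rng(seed)
    out = np.zeros(len(signal))
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        # 1. Isolate one broad frequency band of the input.
        band = fft_filter(signal, fs, lo, hi)
        # 2. Temporal envelope: half-wave rectify, low-pass, clip filter ringing at zero.
        env = fft_filter(np.maximum(band, 0.0), fs, 0.0, env_cutoff)
        env = np.maximum(env, 0.0)
        # 3. Noise carrier limited to the same bandwidth.
        noise = fft_filter(rng.standard_normal(len(signal)), fs, lo, hi)
        # 4. Modulate the noise, then re-filter so the product stays inside the band.
        out += fft_filter(env * noise, fs, lo, hi)
    return out

if __name__ == "__main__":
    fs = 16000
    t = np.arange(fs) / fs
    # Stand-in for speech: a 1 kHz tone with a slow 4 Hz amplitude modulation.
    test_signal = np.sin(2 * np.pi * 1000 * t) * (1 + np.sin(2 * np.pi * 4 * t))
    # Three illustrative bands; these edges are not taken from the abstract.
    vocoded = noise_vocode(test_signal, fs, [100.0, 800.0, 1500.0, 4000.0])
    print(vocoded.shape)
```

Within each band the output keeps only the slow amplitude fluctuations of the input and discards its spectral fine structure, which is the trade-off the abstract describes. Increasing the number of entries in band_edges corresponds to increasing the number of bands.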

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Auditory Threshold
Cues
Hearing
Humans
Noise
Speech Perception*
Temporal Lobe / physiology*

Grants and funding

R01 DC01526/DC/NIDCD NIH HHS/United States