Neural Oscillations Carry Speech Rhythm through to Comprehension
- PMID: 22973251
- PMCID: PMC3434440
- DOI: 10.3389/fpsyg.2012.00320
Abstract
A key feature of speech is the quasi-regular rhythmic information contained in its slow amplitude modulations. In this article we review the information conveyed by speech rhythm, and the role of ongoing brain oscillations in listeners' processing of this content. Our starting point is the fact that speech is inherently temporal, and that rhythmic information conveyed by the amplitude envelope contains important markers for place and manner of articulation, segmental information, and speech rate. Behavioral studies demonstrate that amplitude envelope information is relied upon by listeners and plays a key role in speech intelligibility. Extending behavioral findings, data from neuroimaging - particularly electroencephalography (EEG) and magnetoencephalography (MEG) - point to phase locking by ongoing cortical oscillations to low-frequency information (~4-8 Hz) in the speech envelope. This phase modulation effectively encodes a prediction of when important events (such as stressed syllables) are likely to occur, and acts to increase sensitivity to these relevant acoustic cues. We suggest a framework through which such neural entrainment to speech rhythm can explain effects of speech rate on word and segment perception (i.e., that the perception of phonemes and words in connected speech is influenced by preceding speech rate). Neuroanatomically, acoustic amplitude modulations are processed largely bilaterally in auditory cortex, with intelligible speech resulting in differential recruitment of left-hemisphere regions. Notable among these is lateral anterior temporal cortex, which we propose functions in a domain-general fashion to support ongoing memory and integration of meaningful input. Together, the reviewed evidence suggests that low-frequency oscillations in the acoustic speech signal form the foundation of a rhythmic hierarchy supporting spoken language, mirrored by phase-locked oscillations in the human brain.
Keywords: intelligibility; language; oscillations; phase locking; speech comprehension; speech rate; theta.
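The phase locking described in the abstract can be illustrated with a toy computation. The sketch below is not the analysis used in the reviewed EEG/MEG studies; it is a minimal, standard-library-only illustration under stated assumptions. The sampling rate, the 5 Hz modulation rate, the 40 Hz carrier, the simulated "neural" signals, and all function names are hypothetical choices made for the demonstration. Phase is estimated by complex demodulation, a simple stand-in for the filtering and Hilbert-transform steps more commonly used, and phase locking is summarized as the length of the mean phase-difference vector (a phase-locking value).

```python
import cmath
import math

FS = 200        # sampling rate in Hz (assumed for this demo)
F_THETA = 5.0   # syllable-rate modulation inside the ~4-8 Hz band (assumed)


def moving_average(x, width):
    """Boxcar low-pass filter; returns len(x) - width + 1 samples."""
    acc = sum(x[:width])
    out = [acc / width]
    for i in range(width, len(x)):
        acc += x[i] - x[i - width]
        out.append(acc / width)
    return out


def band_phase(x, freq=F_THETA, fs=FS):
    """Complex demodulation: shift `freq` down to 0 Hz, low-pass, keep the angle."""
    shifted = [v * cmath.exp(-2j * math.pi * freq * n / fs) for n, v in enumerate(x)]
    one_cycle = round(fs / freq)  # averaging over one cycle removes DC and the 2*freq image
    return [cmath.phase(z) for z in moving_average(shifted, one_cycle)]


def phase_locking_value(phase_a, phase_b):
    """Length of the mean unit vector of the phase difference (0 = none, 1 = perfect)."""
    vectors = [cmath.exp(1j * (a - b)) for a, b in zip(phase_a, phase_b)]
    return abs(sum(vectors) / len(vectors))


def tone(freq, n_samples, lag=0.0, fs=FS):
    """Cosine at `freq` Hz, delayed by `lag` radians."""
    return [math.cos(2 * math.pi * freq * n / fs - lag) for n in range(n_samples)]


# Toy "speech": a 40 Hz carrier whose amplitude is modulated at 5 Hz.
n = 10 * FS
modulator = [1 + 0.8 * m for m in tone(F_THETA, n)]
carrier = [math.sin(2 * math.pi * 40 * k / FS) for k in range(n)]
sound = [m * c for m, c in zip(modulator, carrier)]

# Amplitude envelope: rectify, then smooth over one carrier period.
envelope = moving_average([abs(s) for s in sound], FS // 40)

# Two simulated oscillations: one follows the envelope rhythm with a fixed
# quarter-cycle lag, the other runs at an unrelated 7 Hz rate.
locked = tone(F_THETA, len(envelope), lag=math.pi / 2)
unlocked = tone(7.0, len(envelope))

env_phase = band_phase(envelope)
plv_locked = phase_locking_value(env_phase, band_phase(locked))
plv_unlocked = phase_locking_value(env_phase, band_phase(unlocked))

print(f"PLV, oscillation at envelope rate: {plv_locked:.3f}")
print(f"PLV, oscillation at 7 Hz:          {plv_unlocked:.3f}")
```

In this toy setting the oscillation that tracks the envelope rhythm should yield a value near 1 even though it lags the envelope, because the measure depends on the consistency of the phase relation and not on its size. The 7 Hz oscillation drifts through all phase relations and should yield a value near 0.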
Figures
[Figure caption, partially recovered; the figure image and its inline markers were lost in extraction.] ... the aspiration for a clear /ba/ in a region of high excitability. However, for the ambiguous token, the aspiration occurs at different levels of excitability for the faster and slower speech rates, making it less likely to be perceived as /pa/ (and more likely to be perceived as /ba/) at slower speech rates. (C) Schematic categorical perception curves demonstrating a shift of perceptual boundaries as a function of speech rate based on this framework.
