Causal inference of asynchronous audiovisual speech
- PMID: 24294207
- PMCID: PMC3826594
- DOI: 10.3389/fpsyg.2013.00798
Causal inference of asynchronous audiovisual speech
Abstract
During speech perception, humans integrate auditory information from the voice with visual information from the face. This multisensory integration increases perceptual precision, but only if the two cues come from the same talker; this requirement has been largely ignored by current models of speech perception. We describe a generative model of multisensory speech perception that includes this critical step of determining the likelihood that the voice and face information have a common cause. A key feature of the model is that it is based on a principled analysis of how an observer should solve this causal inference problem using the asynchrony between two cues and the reliability of the cues. This allows the model to make predictions about the behavior of subjects performing a synchrony judgment task, predictive power that does not exist in other approaches, such as post-hoc fitting of Gaussian curves to behavioral data. We tested the model predictions against the performance of 37 subjects performing a synchrony judgment task viewing audiovisual speech under a variety of manipulations, including varying asynchronies, intelligibility, and visual cue reliability. The causal inference model outperformed the Gaussian model across two experiments, providing a better fit to the behavioral data with fewer parameters. Because the causal inference model is derived from a principled understanding of the task, model parameters are directly interpretable in terms of stimulus and subject properties.
Keywords: Bayesian observer; causal inference; multisensory integration; speech perception; synchrony judgments.
Figures
Similar articles
-
A Causal Inference Model Explains Perception of the McGurk Effect and Other Incongruent Audiovisual Speech.PLoS Comput Biol. 2017 Feb 16;13(2):e1005229. doi: 10.1371/journal.pcbi.1005229. eCollection 2017 Feb. PLoS Comput Biol. 2017. PMID: 28207734 Free PMC article.
-
Weak observer-level correlation and strong stimulus-level correlation between the McGurk effect and audiovisual speech-in-noise: A causal inference explanation.Cortex. 2020 Dec;133:371-383. doi: 10.1016/j.cortex.2020.10.002. Epub 2020 Oct 17. Cortex. 2020. PMID: 33221701 Free PMC article.
-
Visual and Auditory Components in the Perception of Asynchronous Audiovisual Speech.Iperception. 2015 Nov 30;6(6):2041669515615735. doi: 10.1177/2041669515615735. eCollection 2015 Dec. Iperception. 2015. PMID: 27551361 Free PMC article.
-
Causal inference and temporal predictions in audiovisual perception of speech and music.Ann N Y Acad Sci. 2018 Mar 31. doi: 10.1111/nyas.13615. Online ahead of print. Ann N Y Acad Sci. 2018. PMID: 29604082 Review.
-
Multisensory Integration in Cochlear Implant Recipients.Ear Hear. 2017 Sep/Oct;38(5):521-538. doi: 10.1097/AUD.0000000000000435. Ear Hear. 2017. PMID: 28399064 Free PMC article. Review.
Cited by
-
Audio-Visual Causality and Stimulus Reliability Affect Audio-Visual Synchrony Perception.Front Psychol. 2021 Feb 18;12:629996. doi: 10.3389/fpsyg.2021.629996. eCollection 2021. Front Psychol. 2021. PMID: 33679553 Free PMC article.
-
Brief period of monocular deprivation drives changes in audiovisual temporal perception.J Vis. 2020 Aug 3;20(8):8. doi: 10.1167/jov.20.8.8. J Vis. 2020. PMID: 32761108 Free PMC article.
-
The noisy encoding of disparity model predicts perception of the McGurk effect in native Japanese speakers.Front Neurosci. 2024 Jun 26;18:1421713. doi: 10.3389/fnins.2024.1421713. eCollection 2024. Front Neurosci. 2024. PMID: 38988770 Free PMC article.
-
Synthetic faces generated with the facial action coding system or deep neural networks improve speech-in-noise perception, but not as much as real faces.Front Neurosci. 2024 May 9;18:1379988. doi: 10.3389/fnins.2024.1379988. eCollection 2024. Front Neurosci. 2024. PMID: 38784097 Free PMC article.
-
A theory of autism bridging across levels of description.Trends Cogn Sci. 2023 Jul;27(7):631-641. doi: 10.1016/j.tics.2023.04.010. Epub 2023 May 13. Trends Cogn Sci. 2023. PMID: 37183143 Free PMC article. Review.
References
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
