Schema learning for the cocktail party problem
- PMID: 29563229
- PMCID: PMC5889675
- DOI: 10.1073/pnas.1801614115
Abstract
The cocktail party problem requires listeners to infer individual sound sources from mixtures of sound. The problem can be solved only by leveraging regularities in natural sound sources, but little is known about how such regularities are internalized. We explored whether listeners learn source "schemas" (the abstract structure shared by different occurrences of the same type of sound source) and use them to infer sources from mixtures. We measured the ability of listeners to segregate mixtures of time-varying sources. In each experiment, a subset of trials contained schema-based sources generated from a common template by transformations (transposition and time dilation) that introduced acoustic variation but preserved abstract structure. Across several tasks and classes of sound sources, schema-based sources consistently aided source separation, in some cases producing rapid improvements in performance over the first few exposures to a schema. Learning persisted across blocks that did not contain the learned schema, and listeners were able to learn and use multiple schemas simultaneously. No learning was evident when schemas were presented in the task-irrelevant (i.e., distractor) source. However, learning from task-relevant stimuli showed signs of being implicit, in that listeners were no more likely to report that sources recurred in experiments containing schema-based sources than in control experiments containing no schema-based sources. The results implicate a mechanism for rapidly internalizing abstract sound structure, facilitating accurate perceptual organization of sound sources that recur in the environment.
Keywords: auditory scene analysis; implicit learning; perceptual learning; statistical learning.
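The abstract describes schema-based sources as variants of a common template produced by transposition and time dilation. The sketch below illustrates that idea only. The abstract does not specify the stimulus format or parameter ranges, so the contour representation (a list of duration and frequency pairs), the function names, and the example values are all assumptions rather than the authors' method. It shows that a uniform shift in log-frequency combined with a uniform scaling of durations changes the acoustics while leaving the pitch intervals and relative timing intact.

```python
import math
import random


def transform_schema(template, semitones, dilation):
    """Return a transposed, time-dilated copy of a template contour.

    template  : list of (duration_s, frequency_hz) pairs (hypothetical format)
    semitones : transposition, applied as a uniform shift in log-frequency
    dilation  : positive factor that multiplies every duration
    """
    ratio = 2.0 ** (semitones / 12.0)
    return [(d * dilation, f * ratio) for d, f in template]


def intervals(contour):
    """Successive pitch intervals in semitones (unchanged by transposition)."""
    return [
        12.0 * math.log2(f2 / f1)
        for (_, f1), (_, f2) in zip(contour, contour[1:])
    ]


def relative_durations(contour):
    """Each duration as a fraction of the total (unchanged by dilation)."""
    total = sum(d for d, _ in contour)
    return [d / total for d, _ in contour]


# Hypothetical template; the values are illustrative, not taken from the paper.
template = [(0.10, 440.0), (0.20, 523.25), (0.10, 392.0), (0.15, 466.16)]

# Draw a few variants with assumed parameter ranges.
rng = random.Random(0)
variants = [
    transform_schema(template, rng.uniform(-6.0, 6.0), rng.uniform(0.8, 1.25))
    for _ in range(3)
]
```

Each variant differs from the template in absolute frequency and duration, but `intervals` and `relative_durations` return the same values for all of them (up to floating-point error), which is one way to make "abstract structure" concrete.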
Conflict of interest statement
The authors declare no conflict of interest.
