Phrase-level speech simulation with an airway modulation model of speech production
- PMID: 23503742
- PMCID: PMC3596841
- DOI: 10.1016/j.csl.2012.10.005
Phrase-level speech simulation with an airway modulation model of speech production
Abstract
Artificial talkers and speech synthesis systems have long been used as a means of understanding both speech production and speech perception. The development of an airway modulation model is described that simulates the time-varying changes of the glottis and vocal tract, as well as acoustic wave propagation, during speech production. The result is a type of artificial talker that can be used to study various aspects of how sound is generated by humans and how that sound is perceived by a listener. The primary components of the model are introduced and simulation of words and phrases are demonstrated.
Keywords: modulation; speech simulation; speech synthesis; vocal folds; vocal tract.
Figures
Similar articles
-
Structure, Movement, Sound, and Perception.Perspect Speech Sci Orofac Disord. 2014 Aug;24:7-20. doi: 10.1044/ssod24.1.7. Perspect Speech Sci Orofac Disord. 2014. PMID: 25383138 Free PMC article.
-
Modeling Speech Level as a Function of Background Noise Level and Talker-to-Listener Distance for Talkers Wearing Hearing Protection Devices.J Speech Lang Hear Res. 2017 Dec 20;60(12):3393-3403. doi: 10.1044/2017_JSLHR-S-17-0052. J Speech Lang Hear Res. 2017. PMID: 29204606
-
Talker-to-listener distance effects on speech production and perception.J Acoust Soc Am. 2009 Oct;126(4):2052-60. doi: 10.1121/1.3205400. J Acoust Soc Am. 2009. PMID: 19813814
-
On the perception of similarity among talkers.J Acoust Soc Am. 2007 Dec;122(6):3688-96. doi: 10.1121/1.2799903. J Acoust Soc Am. 2007. PMID: 18247776
-
Vocal tract acoustics.J Voice. 1993 Jun;7(2):97-117. doi: 10.1016/s0892-1997(05)80339-x. J Voice. 1993. PMID: 8353635 Review.
Cited by
-
The Effects of Remote Signal Transmission and Recording on Acoustical Measures of Simulated Essential Vocal Tremor: Considerations for Remote Treatment Research and Telepractice.J Voice. 2024 Mar;38(2):325-336. doi: 10.1016/j.jvoice.2021.09.012. Epub 2021 Oct 24. J Voice. 2024. PMID: 34702610
-
Formant measurement in children's speech based on spectral filtering.Speech Commun. 2015;76:93-111. doi: 10.1016/j.specom.2015.11.001. Speech Commun. 2015. PMID: 26855461 Free PMC article.
-
The effects of physiological adjustments on the perceptual and acoustical characteristics of simulated laryngeal vocal tremor.J Acoust Soc Am. 2015 Aug;138(2):953-63. doi: 10.1121/1.4927561. J Acoust Soc Am. 2015. PMID: 26328711 Free PMC article.
-
Motor representations underlie the reading of unfamiliar letter combinations.Sci Rep. 2020 Mar 2;10(1):3828. doi: 10.1038/s41598-020-59199-6. Sci Rep. 2020. PMID: 32123186 Free PMC article.
-
A model of speech production based on the acoustic relativity of the vocal tract.J Acoust Soc Am. 2019 Oct;146(4):2522. doi: 10.1121/1.5127756. J Acoust Soc Am. 2019. PMID: 31671993 Free PMC article.
References
-
- Atal BS, Chang JJ, Mathews MV, Tukey JW. Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer sorting technique. J Acoust Soc Am. 1978;63:1535–1555. - PubMed
-
- Baer T, Gore JC, Gracco LC, Nye PW. Analysis of vocal tract shape and dimensions using magnetic resonance imaging: Vowels. J Acoust Soc Am. 1991;90:799–828. - PubMed
-
- Bauer D, Birkholz P, Kannampuzha J, Kröger BJ. Evaluation of articulatory speech synthesis: a perception study. 36th Deutsche Jahrestagung fr Akustik (DAGA 2010); Berlin, Germany. 2010. pp. 1003–1004.
-
- Båvegård M. Proceedings Eurospeech. Vol. 95. Madrid, Spain: 1995. Introducing a parametric consonantal model to the articulatory speech synthesizer; pp. 1857–1860.
-
- Birkholz P, Jackel D, Kröger BJ. Construction and control of a three-dimensional vocal tract model. Proc. Intl. Conf. Acoust., Spch, and Sig. Proc. (ICASSP 2006); Toulouse, France. 2006. pp. 873–876.
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources