Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Nov 28:3:524.
doi: 10.3389/fpsyg.2012.00524. eCollection 2012.

Auditory time-interval perception as causal inference on sound sources

Affiliations

Auditory time-interval perception as causal inference on sound sources

Ken-Ichi Sawai et al. Front Psychol. .

Abstract

Perception of a temporal pattern in a sub-second time scale is fundamental to conversation, music perception, and other kinds of sound communication. However, its mechanism is not fully understood. A simple example is hearing three successive sounds with short time intervals. The following misperception of the latter interval is known: underestimation of the latter interval when the former is a little shorter or much longer than the latter, and overestimation of the latter when the former is a little longer or much shorter than the latter. Although this misperception of auditory time intervals for simple stimuli might be a cue to understanding the mechanism of time-interval perception, there exists no model that comprehensively explains it. Considering a previous experiment demonstrating that illusory perception does not occur for stimulus sounds with different frequencies, it might be plausible to think that the underlying mechanism of time-interval perception involves a causal inference on sound sources: herein, different frequencies provide cues for different causes. We construct a Bayesian observer model of this time-interval perception. We introduce a probabilistic variable representing the causality of sounds in the model. As prior knowledge, the observer assumes that a single sound source produces periodic and short time intervals, which is consistent with several previous works. We conducted numerical simulations and confirmed that our model can reproduce the misperception of auditory time intervals. A similar phenomenon has also been reported in visual and tactile modalities, though the time ranges for these are wider. This suggests the existence of a common mechanism for temporal pattern perception over modalities. This is because these different properties can be interpreted as a difference in time resolutions, given that the time resolutions for vision and touch are lower than those for audition.

Keywords: Bayesian inference; causal inference; source identification; time-interval perception.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Perceptual and simulated overestimation of T2 as a function of T1 − T2. Each marker represents a different total duration T1 + T2. (A) Perceptual overestimation of T2. Perceptual overestimation is measured by using the method of adjustment. (From Figure 3B, Miyauchi and Nakajima, , with changes in notations. ©University of California Press Journals. Adapted with permission.) (B) Simulated overestimation of T2, calculated by subtracting T2 from the expectation value of P(T2|s1,s2,s3). The area under the horizontal dashed line indicates the underestimation of T2, and the area on the left side of the vertical dashed line indicates T1 < T2.
Figure 2
Figure 2
(A) A temporal pattern and its observation. The horizontal axis is time. Each solid vertical line indicates an actual sound timing ti, and each dashed vertical line indicates its observed timing si. Each observation is made on the basis of an independent identical distribution. (B) Likelihood function of (T1,T2) given observed values of S1 = 100 ms and S2 = 140 ms (indicated as “+”). Intensity indicates the degree of likelihood. (C) Prior distribution of (T1,T2) with intensity on a logarithmic scale of the probability, illustrating that the prior takes a high value near both axes as well as along the 45° line from the T1-axis. (D) Posterior distribution of (T1,T2) given observed values of S1 = 100 ms and S2 = 140 ms (indicated as “+”). Intensity indicates probability. The cross sign (×) corresponds to the peak of the distribution. (E–G) Prior distributions of (T1,T2) given that (E) the first and second sounds come from the same source, (F) the second and third sounds come from the same source, and (G) all three sounds come from the same source. Intensity indicates probability.
Figure 3
Figure 3
Schematic figure of the model mechanism. Dashed lines indicate the shape of the prior distribution. Each solid-lined ellipse represents the likelihood function given an observed interval pair marked as a plus sign on the center of the ellipse. Each arrow describes the direction of the perceptual shift.

Similar articles

Cited by

References

    1. Akerboom S., ten Hoopen G., Olierook P., van der Schaaf T. (1983). Auditory spatial alternation transforms auditory time. J. Exp. Psychol. Hum. Percept. Perform. 9, 882–89710.1037/0096-1523.9.6.882 - DOI - PubMed
    1. Arao H., Suetomi D., Nakajima Y. (2000). Does time-shrinking take place in visual temporal patterns? Perception 29, 819–83010.1068/p2907rvw - DOI - PubMed
    1. Bregman A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. Boston: The MIT Press
    1. Burr D., Silva O., Cicchini G. M., Banks M. S., Morrone M. C. (2009). Temporal mechanisms of multimodal binding. Proc. R. Soc. Lond. B Biol. Sci. 276, 1761–176910.1098/rspb.2008.1899 - DOI - PMC - PubMed
    1. Deutsch D. (1975). Two-channel listening to musical scales. J. Acoust. Soc. Am. 57, 1156–116010.1121/1.380573 - DOI - PubMed