Multistable Auditory Perception
   HOME

TheInfoList



OR:

Multistable auditory perception is a cognitive phenomenon in which certain auditory stimuli can be perceived in multiple ways. While
multistable perception Multistable perception (or bistable perception) is a perceptual phenomenon in which an observer experiences an unpredictable sequence of spontaneous subjective changes. While usually associated with visual perception (a form of optical illusion), ...
has been most commonly studied in the visual domain, it also has been observed in the auditory and olfactory modalities. In the olfactory domain, different scents are piped to the two nostrils, while in the auditory domain, researchers often examine the effects of binaural sequences of pure tones. Generally speaking, multistable perception has three main characteristics: exclusivity, implying that the multiple perceptions cannot simultaneously occur; randomness, indicating that the duration of perceptual phases follows a random law, and inevitability, meaning that subjects are unable to completely block out one percept indefinitely.


History

While binocular rivalry has been studied since the 16th century, the study of multistable auditory perception is relatively new. Diana Deutsch was the first to discover multistability in human auditory perception, in the form of auditory illusions involving periodically oscillating tones.


Experimental Findings

Different experimental paradigms have since been used to study
multistable perception Multistable perception (or bistable perception) is a perceptual phenomenon in which an observer experiences an unpredictable sequence of spontaneous subjective changes. While usually associated with visual perception (a form of optical illusion), ...
in the auditory modality. One is auditory stream segregation, in which two different frequencies are presented in a temporal pattern. Listeners experience alternating percepts: one percept is of a single stream fluctuating between frequencies, and the alternative percept is of two separate streams repeating single frequencies each. Other experimental findings demonstrate the verbal transformation effect. In this paradigm, the input is a speech form repeated rapidly and continuously. The alternating percepts here are words—for example, continuous repetition of the word “life” results in the bistability of “life” and “fly.” Prefrontal activation is implicated with such fluctuations in percept, and not with changes in the physical stimulus, and there is also a possible inverse relationship between left inferior frontal and
cingulate Cingulata, part of the superorder Xenarthra, is an order of armored New World placental mammals. Dasypodids and chlamyphorids, the armadillos, are the only surviving families in the order. Two groups of cingulates much larger than extant arm ...
activation involved in this percept alternation.


Principles of Perceptual Bistability

The temporal dynamics observed in auditory stream segregation are similar to those of bistable visual perception, suggesting that the mechanisms mediating
multistable perception Multistable perception (or bistable perception) is a perceptual phenomenon in which an observer experiences an unpredictable sequence of spontaneous subjective changes. While usually associated with visual perception (a form of optical illusion), ...
, the alternating dominance and suppression of multiple competing interpretations of ambiguous sensory input, might be shared across modalities. Pressnitzer and Hupe analyzed results of an auditory streaming experiment and demonstrated that the perceptual experience that occurred exhibited all three properties of multistable perception found in the visual modality—exclusivity, randomness, and inevitability.Pressnitzer, D. & Hupe, J. (2006). Temporal Dynamics of Auditory and Visual Bistability Reveal Common Principles of Organization. Current Biology, 16, 1351–1357 Exclusivity was satisfied, as there was “spontaneous alternation between mutually exclusive percepts,” and very little time was spent in an “indeterminate” experience. Randomness also characterized the phenomenon, as the first phase of perception is longer in duration than subsequent phases, and then the “steady-state of the temporal dynamics of auditory streaming is purely
stochastic Stochastic (, ) refers to the property of being well described by a random probability distribution. Although stochasticity and randomness are distinct in that the former refers to a modeling approach and the latter refers to phenomena themselv ...
with no long-term trend.” Lastly, the percept alternation was inevitable; even though volitional control did reduce suppression of the specified percept, it did not exclude perception of the alternative percept altogether. These similarities between perceptual bistability in the visual and auditory modalities raise the possibility of a common mechanism governing the phenomenon. In Pressnitzer and Hupe's subjects, the distributions of phase durations in the two modalities were not significantly different, and it has been speculated that the
intraparietal sulcus The intraparietal sulcus (IPS) is located on the lateral surface of the parietal lobe, and consists of an oblique and a horizontal portion. The IPS contains a series of functionally distinct subregions that have been intensively investigated usin ...
, likely involved in crossmodal integration, could be responsible for bistability in both domains. However, the absence of subject-specific biases across the modalities contradicts the notion that a “single top-down selection mechanism were the sole determinant of the auditory and visual bistability.” This observation, along with evidence of neural correlates at different stages of processing, instead suggests that competition is distributed and “based on adaptation and mutual inhibition, at multiple neural processing stages.”


Neural Correlates


Place model

When using a two stream tone test, specific populations of
neurons A neuron, neurone, or nerve cell is an electrically excitable cell that communicates with other cells via specialized connections called synapses. The neuron is the main component of nervous tissue in all animals except sponges and placozoa. N ...
activate, known as the place model.
Event related potential An event-related potential (ERP) is the measured brain response that is the direct result of a specific sensory, cognitive, or motor event. More formally, it is any stereotyped electrophysiological response to a stimulus. The study of the brai ...
(ERP) amplitude increases when the difference of the frequency of the two tones increase. This model hypothesizes that when this is happening, the distance between the two populations of neurons increase, so that the two populations will interact less with each other, allowing for easier tone segregation.


fMRI results

FMRI Functional magnetic resonance imaging or functional MRI (fMRI) measures brain activity by detecting changes associated with blood flow. This technique relies on the fact that cerebral blood flow and neuronal activation are coupled. When an area o ...
has been used to measure the correlation between listening to alternating tones compared to single stream of tones. The posterior regions of the left
auditory cortex The auditory cortex is the part of the temporal lobe that processes auditory information in humans and many other vertebrates. It is a part of the auditory system, performing basic and higher functions in hearing, such as possible relations to ...
were modulated by the alternating tones, indicating that there may be areas of the brains responsible for stream segregation.


Theoretical View


Sequential grouping

A problem of large behavioral importance is the question of how to group auditory stimuli. When a continuous stream of auditory information is received, numerous alternative interpretations are possible, but individuals are only consciously aware of one percept at a time. For this to occur, the auditory system must segregate and group incoming sounds, the goal being to “construct, modify, and maintain dynamic representations of putative objects within its environment”.Winkler, I. Denham, S. Mill. R, Bohm, T. & Bendixen, A. (2012). Multistability in auditory stream segregation: a predictive coding view. Philosophical Transactions of the Royal Society Biological Sciences, 367, 1001–1012 It has been suggested that this process of binding sound events into groups is driven by different levels of similarities. One principle for binding is based on the perceptual similarity between individual events. Sounds that share many or all of their acoustic features are more likely to have been emitted by the same source, and thus are more likely to be linked to form a “proto-object”. The other principle for binding is based on the sequential predictability of sound events. If events reliably follow each other, it is also more likely that they have a common underlying cause.


Competition

A theory explaining the alternation of auditory percepts is that different interpretations are neurally represented simultaneously, but all but the dominant one at the time are suppressed. This idea of competition among parallel hypotheses might provide an explanation for the temporal dynamics observed in auditory stream segregation. The initial perceptual phase is held longer than the subsequent ones, “with the duration of the first phase being stimulus-parameter dependent and an order of magnitude longer in duration than parameter-independent subsequent phases”.Denham, S. Gyimesi, K. Stefanics, G. & Winkler, I. (2010). The Neurophysiological Bases of Auditory Perception, 477-487 At stimulus onset, the first percept might be that which is easiest to discover, based on featural proximity (and thus stimulus-parameter dependent), and it is held for relatively longer because time is required for other hypotheses to form. As more sensory information is received and processed, the “neural associations underlying the alternative sound organizations become strong and start to vie for dominance” and “the probabilities of perceiving different organizations tend to become more balanced with time”.


References

{{reflist Psychoacoustics