Trends in Cognitive Sciences
ReviewModeling the auditory scene: predictive regularity representations and perceptual objects
Section snippets
Prediction underlies adaptive behavior
Achieving one's goals in constantly changing environments requires actions directed at future states of the world. For example, when crossing a street, one has to anticipate the location of cars at the moment when one is likely to intersect their trajectories. Predicting future events is essential for everything we do, from taking into account the immediate sensory consequences of our own actions to signing up to a pension plan. The realization that we constantly interact with the future led to
Predictive representations in analyzing the auditory scene
Orderly perception of complex auditory scenes requires them to be broken down into internally coherent constituents. According to Bregman's theory [6] (see Box 1), auditory scene analysis (ASA) consists of two phases; the first phase is concerned with the formation of alternative sound organizations, while the second is concerned with selecting one of the alternatives to be perceived. Although perceptually it is difficult to separate these processes, the existence of the two phases was
Maintaining the representation of the auditory scene
Once possible object representations are formed, inconsistencies between them need to be resolved while preferably maintaining the continuity of perception. Figure 1 shows a conceptualization of ASA. First-phase grouping processes are represented on the left with simultaneous and sequential grouping processes separately marked (bottom left box). Sequential grouping is based on predictions produced by representations encoding the previously detected acoustic regularities (upper left box).
Neural bases for detecting change and deviance
Possible neural correlates of the processes that are reviewed in the previous sections may be found in various stations of the auditory system. The ‘core’ auditory pathway (Figure 2) seems to keep a high-fidelity representation of sounds at least up to the level of the primary auditory cortex, although contributions to the buildup of streaming could occur as early as the cochlear nucleus [21]. In the primary auditory cortex itself, a number of response features may already encode information
Predictive regularity representations as perceptual objects
We have argued that auditory regularity representations supported by the SSA mechanism observable in many parts of the auditory system play an essential role in parsing complex auditory scenes. Here we examine whether regularity representations may form the core of auditory object representations. Recent theories of auditory object representation 34, 35 emphasize the requirement of common characteristics for object representations across different modalities. So, what do we expect of perceptual
Auditory object representations and attention
The hypothesis that auditory object representations are representations of the regularities linking together sounds forming a coherent sequence allows us to reexamine the long-standing debate in psychology regarding whether object formation requires focused attention 61, 62. Within the present framework, we should ask whether forming regularity representations requires attention. Several studies suggest that deviations from auditory regularities are detected even when attention is not focused
Conclusions
We have argued that predictive representations of temporal regularities constitute the core of auditory objects in the brain. This notion of auditory object formation is compatible with recent accounts of perception in other modalities 3, 70, with theories of motor control [74], and the interaction between motor control and perception [75]. Although there are several outstanding questions regarding the mechanisms underlying the proposed model (Box 3), it appears that predictive processing
Acknowledgements
Supported by the European Community's Seventh Framework Programme (grant no 231168 – SCANDLE; I.W. and S.D.) and by a grant of the Israeli Science Foundation (ISF) to I.N.
Glossary
- Auditory Scene Analysis (ASA)
- The process of analyzing a complex mixture of sounds to isolate the information relating to different sound sources.
- Auditory streaming
- A perceptual phenomenon in which a sequence of sounds is perceived as consisting of two or more auditory streams. When streaming occurs, perceivers experience difficulty in extracting inter-sound relationships across streams, such as the order between two sounds belonging to different streams.
- Build-up of auditory streams
- The perception
References (83)
The proactive brain: using analogies and associations to generate predictions
Trends Cogn. Sci.
(2007)- et al.
The reverse hierarchy theory of visual perceptual learning
Trends Cogn. Sci.
(2004) Event-related brain potentials reveal multiple stages in the perceptual organization of sound
Brain Res. Cogn. Brain Res.
(2005)Auditory organization of sound sequences by a temporal or numerical regularity: a mismatch negativity study comparing musicians and non-musicians
Cogn. Brain Res.
(2005)- et al.
The role of predictive models in the formation of auditory streams
J. Physiol. Paris
(2006) Perceptual organization of tone sequences in the auditory cortex of awake macaques
Neuron
(2005)Perceptual organization of sound begins in the auditory periphery
Curr. Biol.
(2008)- et al.
Temporal dynamics of auditory and visual bistability reveal common principles of perceptual organization
Curr. Biol.
(2006) The mismatch negativity: a review of underlying mechanisms
Clin. Neurophysiol.
(2009)- et al.
Auditory and visual objects
Cognition
(2001)
The mismatch negativity (MMN) in basic research of central auditory processing: A review
Clin. Neurophysiol.
Pre–attentive representation of feature conjunctions for simultaneous, spatially distributed auditory objects
Brain. Res. Cogn. Brain. Res.
Processing abstract auditory features in the human auditory cortex
NeuroImage
Primitive intelligence” in the auditory cortex
Trends. Neurosci.
Measurement of extensive auditory discrimination profiles using mismatch negativity (MMN) of the auditory event-related potential
Clin. Neurophysiol.
Repetition effects to sounds: Evidence for predictive coding in the auditory system
Trends. Cogn. Sci.
Rapid extraction of auditory feature contingencies
NeuroImage.
Objective examination for two-point stimulation using a somatosensory oddball paradigm: an MEG study
Clin. Neurophysiol.
Expectation (and attention) in visual cognition
Trends. Cogn. Sci.
Internal models for motor control and trajectory planning
Curr. Op. Neurobiol.
How the brain separates sounds
Trends. Cogn. Sci.
The mismatch negativity in cognitive and clinical neuroscience: theoretical and methodological considerations
Biol. Psychol.
Cortical circuits for perceptual inference
Neural Networks
Perceptions as hypotheses
Philos. Trans. R Soc. Lond. B Biol. Sci.
Visual objects in context
Nat. Rev. Neurosci.
A theory of cortical responses
Philos. Trans R Soc. Lond. B Biol. Sci.
Auditory Scene Analysis
Effects of attention on neuroelectric correlates of auditory stream segregation
J. Cogn. Neurosci.
Neural activity associated with distinguishing concurrent auditory objects
J. Acoust. Soc. Am.
Gestalt Psychology
Interpreting the mismatch negativity (MMN)
J. Psychophysiol.
I heard that coming: ERP evidence for stimulus driven prediction in the auditory system
J. Neurosci.
Neural representations of auditory input accommodate to the context in a dynamically changing acoustic environment
Eur. J. Neurosci.
Development of a memory trace for a complex sound in the human brain
NeuroReport
Brain responses reveal the learning of foreign language phonemes
Psychophysiol.
Factors influencing sequential stream segregation
Acta Acust - Acust.
Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration
J. Acoust. Soc. Am.
Toward a neurophysiological theory of auditory stream segregation
Psychol. Bull.
Effects of location, frequency region, and time course of selective attention on auditory scene analysis
J. Exp. Psychol. Hum. Percept. Perform.
The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component structure
Psychophysiol.
Auditory edge detection: a neural model for physiological and psychoacoustical responses to amplitude transients
J. Neurophysiol.
Cited by (418)
Neural encoding of musical expectations in a non-human primate
2024, Current BiologyContributions of the subcortical auditory system to predictive coding and the neural encoding of speech
2023, Current Opinion in Behavioral Sciences