Processing of complex stimuli and natural scenes in the auditory cortex
Introduction
Research into signal coding in primary auditory cortex (A1) has enjoyed renewed popularity in recent years. All modern methodologies, including new slice preparations for studying thalamo–cortical interactions [1], intracellular and extracellular single neuron recordings 2., 3.••, 4.••, evoked electrical and magnetic fields (EEG and MEG) 5., 6.•, and functional magnetic resonance imaging (fMRI) 7., 8., are being applied to this system in a variety of model animals, including rodents, bats, cats, primates, and humans.
Despite this accumulation of information, the nature of the representation of complex sounds in A1 remains the subject of heated debate. This is not due to a lack of data, but rather because of the fact that the data are often contradictory. Whereas some studies emphasize a relatively simple cortical representation, other studies show a large degree of complexity in the neuronal responses.
Here, I review evidence that indicates that simplicity and complexity co-exist in A1. Evidence with converging implications suggests that the co-existence of simplicity and complexity in A1 is due to its participation in processes that are often implicitly assigned to higher brain areas. In particular, I review evidence that suggests the involvement of auditory cortex in processes such as the on-line extraction of statistical regularities from the auditory scene and the organization of the auditory scene in terms of auditory objects.
Section snippets
Precise and imprecise temporal coding
One of the complexities in auditory cortex is the interplay among multiple time scales that determine the neural responses. For example, cortical neurons respond to some auditory events with stereotypical response bursts at a fixed latency (‘locking’). The variance of the latency of such bursts might be similar to that of peripheral neurons. However, the same neurons may show sluggish responses to other features of the sounds.
Temporal coding is usually tested using repetitive stimuli, such as
Feature detection or something else?
It seems that depending upon the circumstances, a cortical neuron can choose to be sluggish or precise, linear or non-linear. Thus, the feature sensitivity of a neuron, as determined, for example, by its STRF, cannot be used as an invariant essential characterization of its responses. The multiple time scales at which cortical neurons process sounds provide another argument against a pure role in feature-detection for auditory cortex neurons [25]. Feature detectors are expected to be sensitive
Adaptation and plasticity
The plastic capabilities of auditory cortex have been studied in several preparations on many time scales. Significant changes in electrical and magnetic brain potentials (EEG and MEG) occur during training for the performance of tasks such as the perception of virtual pitch [5] and fine pitch discrimination [37]. Even simple exposure to different auditory environments can substantially change auditory cortical organization and responses: thus, raising rats in an enriched environment increases
Auditory scene analysis in auditory cortex
Several recent studies, using a variety of techniques, suggest a role for auditory cortex in segregation and grouping of sound components. For example, at the brain potential level, Dyson and Alain [50] reported that the amplitude of the mid-latency potentials increased when a harmonic was mistuned, potentially creating two auditory objects instead of one. Furthermore, the enhanced amplitude was correlated with an increased likelihood of reporting two concurrent auditory objects. Krumbholz et al
Speculative synthesis and conclusions
Most of the interesting auditory features might already be extracted from the incoming sounds by the level of the IC, which should therefore be considered as the auditory analog of the primary visual cortex (V1). The role of auditory cortex is to organize these features into auditory objects (Figure 1). To do that, auditory cortex has to use temporal and spectral context at several time scales. The large adaptive and plastic capacity of auditory cortex is used to tune the neural circuits to the
References and recommended reading
Papers of particular interest, published within the annual period of review, have been highlighted as:
• of special interest
•• of outstanding interest
Acknowledgements
Supported by grants from the Israeli Science Foundation (ISF), the German-Israeli Foundation (GIF) and the Volkswagenstiftung.
References (63)
- et al.
Auditory edge detection: a neural model for physiological and psychoacoustical responses to amplitude transients
J Neurophysiol
(2001) - et al.
Linearity of cortical receptive fields measured with natural sounds
J Neurosci
(2004) - et al.
Responses of neurons in cat primary auditory cortex to bird chirps: effects of temporal and spectral context
J Neurosci
(2002) The nucleus basalis and memory codes: auditory cortical plasticity and the induction of specific, associative behavioral memory
Neurobiol Learn Mem
(2003)- et al.
Transformation of binaural response properties in the ascending auditory pathway: influence of time-varying interaural phase disparity
J Neurophysiol
(1998) - et al.
Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey
Hear Res
(2001) Auditory Scene Analysis: The Perceptual Organization of Sound
(1990)- et al.
The lower limit of melodic pitch
J Acoust Soc Am
(2001) - et al.
Mechanisms and streams for processing of “what” and “where” in auditory cortex
Proc Natl Acad Sci USA
(2000) - et al.
Auditory thalamocortical synaptic transmission in vitro
J Neurophysiol
(2002)
Intracortical pathways determine breadth of subthreshold frequency receptive fields in primary auditory cortex
J Neurophysiol
Balanced inhibition underlies tuning and sharpens spike timing in auditory cortex
Nature
Topography and synaptic shaping of direction selectivity in primary auditory cortex
Nature
Music and learning-induced cortical plasticity
Ann N Y Acad Sci
Morphology of Heschl’s gyrus reflects enhanced activation in the auditory cortex of musicians
Nat Neurosci
Amplitude and frequency-modulated stimuli activate common regions of human auditory cortex
Cereb Cortex
Neural correlates of sensory and decision processes in auditory object identification
Nat Neurosci
Neural processing of amplitude-modulated sounds
Physiol Rev
Temporal and rate representations of time-varying signals in the auditory cortex of awake primates
Nat Neurosci
Information content of auditory cortical responses to time-varying acoustic stimuli
J Neurophysiol
The spectro-temporal receptive field. A functional characteristic of auditory neurons
Biol Cybern
Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex
J Neurophysiol
Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex
J Neurophysiol
Optimizing sound features for cortical neurons
Science
Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex
J Neurophysiol
Neural model for physiological responses to frequency and amplitude transitions uncovers topographical order in the auditory cortex
J Neurophysiol
Response timing constraints on the cortical representation of sound time structure
J Acoust Soc Am
Responses to linear and logarithmic frequency-modulated sweeps in ferret primary auditory cortex
Eur J Neurosci
Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. I: Effects of variation of stimulus parameters
Hear Res
Dynamics of precise spike timing in primary auditory cortex
J Neurosci
The representation of peripheral neural activity in the middle-latency evoked field of primary auditory cortex in humans(1)
Hear Res
Cited by (178)
Inference of network connectivity from temporally binned spike trains
2024, Journal of Neuroscience MethodsGeneralizable dimensions of human cortical auditory processing of speech in natural soundscapes: A data-driven ultra high field fMRI approach
2021, NeuroImageCitation Excerpt :However, little is currently known about the brain mechanisms that support the processing of speech in realistic acoustic environments. One reason for this is that cortical sound representations are complex, context dependent, and not well understood (Hausfeld et al., 2018; Nelken, 2004), but they appear to be of sufficient complexity to allow for the segregation of different sound sources (Elhilali and Shamma, 2008; Han et al., 2019; Khalighinejad et al., 2019; O’Sullivan et al., 2019). Here we set out to reveal and characterize brain processes in the cortical temporal lobes that support processing of speech in natural soundscapes using unsupervised machine learning methods that exploit the statistical structure of realistic sounds and distributed brain activity.
Early cortical processing of pitch height and the role of adaptation and musicality
2021, NeuroImageCitation Excerpt :The results of the processing provide the basis for the prediction of how a sound pattern will continue (Winkler et al., 2009). Predictions may refer to different stages of pitch processing, from the note-to-note level (as in the present study) to complex musical structures (Salimpoor et al., 2015); it is argued that neural networks in the brain adapt to the statistical properties of sound sequences to maintain efficient coding (Nelken, 2004; Wark et al., 2007; Yaron et al., 2012; Pérez-González and Malmierca, 2014). For example, an early MEG study by Rupp and Uppenkamp (2005) showed that the amplitude of the N1 wave (Näätänen and Picton, 1987) decreases much more in response to fixed pitch sequences than random sequences.