Processing of complex stimuli and natural scenes in the auditory cortex

https://doi.org/10.1016/j.conb.2004.06.005Get rights and content

Neuronal responses in auditory cortex show a fascinating mixture of characteristics that span the range from almost perfect copies of physical aspects of the stimuli to extremely complex context-dependent responses. Fast, highly stimulus-specific adaptation and slower plastic mechanisms work together to constantly adjust neuronal response properties to the statistics of the auditory scene. Evidence with converging implications suggests that the neuronal activity in primary auditory cortex represents sounds in terms of auditory objects rather than in terms of invariant acoustic features.

Introduction

Research into signal coding in primary auditory cortex (A1) has enjoyed renewed popularity in recent years. All modern methodologies, including new slice preparations for studying thalamo–cortical interactions [1], intracellular and extracellular single neuron recordings 2., 3.••, 4.••, evoked electrical and magnetic fields (EEG and MEG) 5., 6.•, and functional magnetic resonance imaging (fMRI) 7., 8., are being applied to this system in a variety of model animals, including rodents, bats, cats, primates, and humans.

Despite this accumulation of information, the nature of the representation of complex sounds in A1 remains the subject of heated debate. This is not due to a lack of data, but rather because of the fact that the data are often contradictory. Whereas some studies emphasize a relatively simple cortical representation, other studies show a large degree of complexity in the neuronal responses.

Here, I review evidence that indicates that simplicity and complexity co-exist in A1. Evidence with converging implications suggests that the co-existence of simplicity and complexity in A1 is due to its participation in processes that are often implicitly assigned to higher brain areas. In particular, I review evidence that suggests the involvement of auditory cortex in processes such as the on-line extraction of statistical regularities from the auditory scene and the organization of the auditory scene in terms of auditory objects.

Section snippets

Precise and imprecise temporal coding

One of the complexities in auditory cortex is the interplay among multiple time scales that determine the neural responses. For example, cortical neurons respond to some auditory events with stereotypical response bursts at a fixed latency (‘locking’). The variance of the latency of such bursts might be similar to that of peripheral neurons. However, the same neurons may show sluggish responses to other features of the sounds.

Temporal coding is usually tested using repetitive stimuli, such as

Feature detection or something else?

It seems that depending upon the circumstances, a cortical neuron can choose to be sluggish or precise, linear or non-linear. Thus, the feature sensitivity of a neuron, as determined, for example, by its STRF, cannot be used as an invariant essential characterization of its responses. The multiple time scales at which cortical neurons process sounds provide another argument against a pure role in feature-detection for auditory cortex neurons [25]. Feature detectors are expected to be sensitive

Adaptation and plasticity

The plastic capabilities of auditory cortex have been studied in several preparations on many time scales. Significant changes in electrical and magnetic brain potentials (EEG and MEG) occur during training for the performance of tasks such as the perception of virtual pitch [5] and fine pitch discrimination [37]. Even simple exposure to different auditory environments can substantially change auditory cortical organization and responses: thus, raising rats in an enriched environment increases

Auditory scene analysis in auditory cortex

Several recent studies, using a variety of techniques, suggest a role for auditory cortex in segregation and grouping of sound components. For example, at the brain potential level, Dyson and Alain [50] reported that the amplitude of the mid-latency potentials increased when a harmonic was mistuned, potentially creating two auditory objects instead of one. Furthermore, the enhanced amplitude was correlated with an increased likelihood of reporting two concurrent auditory objects. Krumbholz et al

Speculative synthesis and conclusions

Most of the interesting auditory features might already be extracted from the incoming sounds by the level of the IC, which should therefore be considered as the auditory analog of the primary visual cortex (V1). The role of auditory cortex is to organize these features into auditory objects (Figure 1). To do that, auditory cortex has to use temporal and spectral context at several time scales. The large adaptive and plastic capacity of auditory cortex is used to tune the neural circuits to the

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

Supported by grants from the Israeli Science Foundation (ISF), the German-Israeli Foundation (GIF) and the Volkswagenstiftung.

References (63)

  • S. Kaur et al.

    Intracortical pathways determine breadth of subthreshold frequency receptive fields in primary auditory cortex

    J Neurophysiol

    (2004)
  • M. Wehr et al.

    Balanced inhibition underlies tuning and sharpens spike timing in auditory cortex

    Nature

    (2003)
  • L.I. Zhang et al.

    Topography and synaptic shaping of direction selectivity in primary auditory cortex

    Nature

    (2003)
  • C. Pantev et al.

    Music and learning-induced cortical plasticity

    Ann N Y Acad Sci

    (2003)
  • P. Schneider et al.

    Morphology of Heschl’s gyrus reflects enhanced activation in the auditory cortex of musicians

    Nat Neurosci

    (2002)
  • H.C. Hart et al.

    Amplitude and frequency-modulated stimuli activate common regions of human auditory cortex

    Cereb Cortex

    (2003)
  • J.R. Binder et al.

    Neural correlates of sensory and decision processes in auditory object identification

    Nat Neurosci

    (2004)
  • P.X. Joris et al.

    Neural processing of amplitude-modulated sounds

    Physiol Rev

    (2004)
  • T. Lu et al.

    Temporal and rate representations of time-varying signals in the auditory cortex of awake primates

    Nat Neurosci

    (2001)
  • T. Lu et al.

    Information content of auditory cortical responses to time-varying acoustic stimuli

    J Neurophysiol

    (2004)
  • A.M. Aertsen et al.

    The spectro-temporal receptive field. A functional characteristic of auditory neurons

    Biol Cybern

    (1981)
  • D.A. Depireux et al.

    Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex

    J Neurophysiol

    (2001)
  • J.F. Linden et al.

    Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex

    J Neurophysiol

    (2003)
  • R.C. deCharms et al.

    Optimizing sound features for cortical neurons

    Science

    (1998)
  • L.M. Miller et al.

    Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex

    J Neurophysiol

    (2002)
  • A. Fishbach et al.

    Neural model for physiological responses to frequency and amplitude transitions uncovers topographical order in the auditory cortex

    J Neurophysiol

    (2003)
  • D.P. Phillips et al.

    Response timing constraints on the cortical representation of sound time structure

    J Acoust Soc Am

    (1990)
  • I. Nelken et al.

    Responses to linear and logarithmic frequency-modulated sweeps in ferret primary auditory cortex

    Eur J Neurosci

    (2000)
  • P. Heil et al.

    Sensitivity of neurons in cat primary auditory cortex to tones and frequency-modulated stimuli. I: Effects of variation of stimulus parameters

    Hear Res

    (1992)
  • M. Elhilali et al.

    Dynamics of precise spike timing in primary auditory cortex

    J Neurosci

    (2004)
  • A. Rupp et al.

    The representation of peripheral neural activity in the middle-latency evoked field of primary auditory cortex in humans(1)

    Hear Res

    (2002)
  • Cited by (178)

    • Generalizable dimensions of human cortical auditory processing of speech in natural soundscapes: A data-driven ultra high field fMRI approach

      2021, NeuroImage
      Citation Excerpt :

      However, little is currently known about the brain mechanisms that support the processing of speech in realistic acoustic environments. One reason for this is that cortical sound representations are complex, context dependent, and not well understood (Hausfeld et al., 2018; Nelken, 2004), but they appear to be of sufficient complexity to allow for the segregation of different sound sources (Elhilali and Shamma, 2008; Han et al., 2019; Khalighinejad et al., 2019; O’Sullivan et al., 2019). Here we set out to reveal and characterize brain processes in the cortical temporal lobes that support processing of speech in natural soundscapes using unsupervised machine learning methods that exploit the statistical structure of realistic sounds and distributed brain activity.

    • Early cortical processing of pitch height and the role of adaptation and musicality

      2021, NeuroImage
      Citation Excerpt :

      The results of the processing provide the basis for the prediction of how a sound pattern will continue (Winkler et al., 2009). Predictions may refer to different stages of pitch processing, from the note-to-note level (as in the present study) to complex musical structures (Salimpoor et al., 2015); it is argued that neural networks in the brain adapt to the statistical properties of sound sequences to maintain efficient coding (Nelken, 2004; Wark et al., 2007; Yaron et al., 2012; Pérez-González and Malmierca, 2014). For example, an early MEG study by Rupp and Uppenkamp (2005) showed that the amplitude of the N1 wave (Näätänen and Picton, 1987) decreases much more in response to fixed pitch sequences than random sequences.

    View all citing articles on Scopus
    View full text