Selective cortical representation of attended speaker in multi-talker speech perception

Nima Mesgarani; Edward F Chang

doi:10.1038/nature11020

Nature. 2012 May 10;485(7397):233-6. doi: 10.1038/nature11020.

Authors

Nima Mesgarani¹, Edward F Chang

Affiliation

¹ Departments of Neurological Surgery and Physiology, UCSF Center for Integrative Neuroscience, University of California, San Francisco, California 94143, USA.

Abstract

Humans possess a remarkable ability to attend to a single speaker's voice in a multi-talker background. How the auditory system manages to extract intelligible speech under such acoustically complex and adverse listening conditions is not known, and, indeed, it is not clear how attended speech is internally represented. Here, using multi-electrode surface recordings from the cortex of subjects engaged in a listening task with two simultaneous speakers, we demonstrate that population responses in non-primary human auditory cortex encode critical features of attended speech: speech spectrograms reconstructed based on cortical responses to the mixture of speakers reveal the salient spectral and temporal features of the attended speaker, as if subjects were listening to that speaker alone. A simple classifier trained solely on examples of single speakers can decode both attended words and speaker identity. We find that task performance is well predicted by a rapid increase in attention-modulated neural selectivity across both single-electrode and population-level cortical responses. These findings demonstrate that the cortical representation of speech does not merely reflect the external acoustic environment, but instead gives rise to the perceptual aspects relevant for the listener's intended goal.

Publication types

Clinical Trial
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Acoustic Stimulation*
Acoustics
Attention / physiology*
Auditory Cortex / physiology*
Electrodes
Female
Humans
Language
Male
Models, Neurological
Noise
Sound Spectrography
Speech Perception / physiology*
Speech*
