Evidence for distinct human auditory cortex regions for sound location versus identity processing

Ahveninen, Jyrki; Huang, Samantha; Nummenmaa, Aapo; Belliveau, John W.; Hung, An-Yi; Jääskeläinen, Iiro P.; Rauschecker, Josef P.; Rossi, Stephanie; Tiitinen, Hannu; Raij, Tommi

doi:10.1038/ncomms3585

Article
Published: 14 October 2013

Evidence for distinct human auditory cortex regions for sound location versus identity processing

Jyrki Ahveninen¹,
Samantha Huang¹,
Aapo Nummenmaa¹,
John W. Belliveau^1,2,
An-Yi Hung¹,
Iiro P. Jääskeläinen³,
Josef P. Rauschecker⁴,
Stephanie Rossi¹,
Hannu Tiitinen³ &
…
Tommi Raij¹

Nature Communications volume 4, Article number: 2585 (2013) Cite this article

2638 Accesses
54 Citations
2 Altmetric
Metrics details

Subjects

Auditory system

Abstract

Neurophysiological animal models suggest that anterior auditory cortex (AC) areas process sound identity information, whereas posterior ACs specialize in sound location processing. In humans, inconsistent neuroimaging results and insufficient causal evidence have challenged the existence of such parallel AC organization. Here we transiently inhibit bilateral anterior or posterior AC areas using MRI-guided paired-pulse transcranial magnetic stimulation (TMS) while subjects listen to Reference/Probe sound pairs and perform either sound location or identity discrimination tasks. The targeting of TMS pulses, delivered 55–145 ms after Probes, is confirmed with individual-level cortical electric-field estimates. Our data show that TMS to posterior AC regions delays reaction times (RT) significantly more during sound location than identity discrimination, whereas TMS to anterior AC regions delays RTs significantly more during sound identity than location discrimination. This double dissociation provides direct causal support for parallel processing of sound identity features in anterior AC and sound location in posterior AC.

You have full access to this article via your institution.

Download PDF

Subcortical responses to music and speech are alike while cortical responses diverge

Article Open access 08 January 2024

Tong Shan, Madeline S. Cappelloni & Ross K. Maddox

Evolving perspectives on the sources of the frequency-following response

Article Open access 06 November 2019

Emily B. J. Coffey, Trent Nicol, … Nina Kraus

Cortical encoding of speech enhances task-relevant acoustic information

Article 08 July 2019

Sanne Rutten, Roberta Santoro, … Narly Golestani

Introduction

A number of studies support the theory that sound identity information is processed in anterior and spatial features in posterior aspects of non-primary auditory cortices (ACs)^1,2,3,4, which project to distinct inferior frontal ‘what’ and posterior parietal/superior prefrontal ‘where’ areas^5,6,7,8,9,10, respectively. In humans, posterior AC regions, encompassing the planum temporale (PT) and posterior superior temporal gyrus (STG), are activated by a range of audiospatial cues^{3,11,12,13,14}, whereas AC areas anterolateral to Heschl’s gyrus, extending to the anterior STG and the planum polare, are sensitive to a various sound identity features^{3,10,15,16,17,18,19,20}. Yet, several neuroimaging studies suggest that the posterior non-primary ACs also respond to non-spatial sounds, such as ‘spectral motion’²¹ and/or phonetic features^22,23, which has led to an alternative interpretation that the model of separate sound identification versus localization regions in human ACs is inaccurate (see, for example, refs 22, 24, 25). However, when interpreting the existing evidence in humans, it is noteworthy that previous studies have almost exclusively used observational neuroimaging techniques that reveal correlations, but cannot prove causality, that is, that these AC areas are required for sound identification versus localization.

A powerful way to causally corroborate neuroimaging-based models is to turn the participating brain areas off one by one and observe how brain functions and behaviours change. Classically, cognitive neuropsychologists have applied this kind of approach in humans by seeking double dissociations in functional deficits between subjects with different kinds of lesions. Pioneering studies in patients with lesions extending to different aspects of the temporal cortex have provided support for the AC dual-pathway model in humans^6,26,27. However, the specificity of this approach is limited because it is rare to find patients with lesions focused to certain part of the human AC only. For example, in one previous human lesion study on the dual-pathway model²⁷, the damage extended to extra-auditory frontoparietal regions, as well as to subcortical grey and white matter in almost all studied patients. Lesion studies are, further, complicated by potential differences in premorbid abilities between subjects and compensatory changes in other brain areas related to the long-term effects of the lesion. A more accurate approach, available in animal models, is reversible inactivation of individual brain regions by cooling. A recent dual-pathway study⁴ showed that local cooling of posterior versus anterior auditory fields of cat AC causes selective deficits in sound localization versus identity discrimination tasks, respectively. However, analogous confirmation in humans is still lacking. Further, although the aforementioned cooling study provided strong longitudinal evidence for the lack of long-term changes in the manipulated cat AC areas⁴, other studies using the cooling technique^28,29,30, or alternative means of local deactivation³¹ or deafferentation^32,33 imply that a complete deactivation of any cortical region will, within minutes³⁰, result in profound changes in other areas that are connected to the manipulated region. For example, pharmacological temporary deactivation of AC leads to profound changes at the preceding subcortical stages of sound processing³⁴. This raises the question whether the observed behavioural changes reflect the properties of the cooled area alone or also that of other areas. Unfortunately, there are no prior studies in any species that have investigated processing of sound identity versus location information using very transient disruption of functionality (for example, brief cortical electrical stimulation) that is less likely to induce wider spread network changes or reorganization.

Instantaneous (<50 ms) and local modulations of human brain activity can be non-invasively induced using transcranial magnetic stimulation (TMS). The paired-pulse TMS (ppTMS) technique is especially powerful in this regard, as applying two TMS pulses at about 1–4 ms apart with a specific intensity ratio causes short-interval intracortical inhibition (SICI), effectively creating a relatively focal ‘functional lesion’ lasting only tens of milliseconds^35,36. ppTMS therefore allows one to create short-lived functional deactivations in the human brain with high spatial and temporal accuracy, and in the context of specific behavioural paradigms, to determine the functional role of the targeted areas. This technique can be applied to healthy subjects, and the short duration of the effects allows subjects to serve as their own controls, and any compensatory functional reorganization is unlikely to occur. SICI effects emerge only when a brain area receives two TMS pulses at a specific intensity ratio, which limits them to occur locally at the TMS target area³⁷. In comparison with repetitive TMS, in which the target area receives trains of pulses for a longer period of time, or cooling deactivations used in animal models, ppTMS probably results in less pronounced effects in remote areas connected to the TMS target. Further, recent advances in magnetic resonance imaging (MRI)-guided online navigation systems and focal stimulation coils allow reasonably accurate targeting, influencing spatially much smaller regions than most naturally occurring lesions.

To our knowledge, there are no prior studies applying TMS to different subregions of the supratemporal AC. However, we stipulated that such an approach might be fruitful, given that TMS to sound processing areas outside the superior temporal AC has separately modulated processing of auditory object identity and spatial localization. Specifically, a recent study³⁸ suggested that TMS of the left inferior frontal gyrus (IFG) that is associated with the auditory ‘what’ stream¹ impairs identification of sound patterns associated with words, whereas TMS of occipital and posterior parietal cortex areas, partially overlapping with the putative auditory posterior-dorsal ‘where’ auditory stream¹, modulates sound localization performance^39,40. Recently, it was also shown that repetitive TMS to inferior parietal lobule produces selective impairments in auditory motion processing⁴¹.

Here we utilize MRI-guided ppTMS to validate the model^1,2,3,4 postulating separate areas for spatial versus sound identity processing in human AC. Our results demonstrate a double dissociation that supports this parallel processing model, with ppTMS to anterior AC delaying reaction times (RTs) most prominently during sound identity discrimination and ppTMS to posterior AC delaying RTs most prominently during sound location discrimination. Cortical electric-field (E-field) estimates confirm that TMS pulses reached the intended targets in all subjects.

Results

TMS effects on sound location and identity discrimination

Ten normal-hearing participants were presented with Reference/Probe sound pairs (Fig. 1). In the spatial task, subjects discriminated whether Probe arrived from 5° to the left or 5° to the right relative to Reference (25° to the right). In the identity task, subjects discriminated whether the amplitude-modulation (AM) frequency of Probe was one-sixth octaves lower (35.6 Hz) or higher (44.9 Hz) than that of Reference (40 Hz). The experiment was divided into eight runs, each containing 80 trials. Fifty percent of the trials did not include TMS and therefore provided baseline RT data. For the other 50% of trials, bilateral TMS was delivered 55–145 ms after the Probes, at either the anterior or the posterior non-primary ACs.

RT delays caused by TMS were analysed using a contemporary ‘real-time’ linear mixed modelling approach^42,43 (for details, see Methods) that is becoming increasingly popular amongst behavioural and cognitive scientists. This linear mixed approach models individual task responses in each task condition/subject and brings longitudinal temporal dependencies into the statistical model, thus allowing for better control for a number of biases that have been difficult to model in more traditional analyses where task responses are first aggregated within subjects^42,43. For RT lags, we found task-specific modulations for the anterior versus posterior non-primary AC TMS. According to Markov chain Monte Carlo (MCMC) simulations, there was a highly significant interaction between the task-type and the TMS target location (t=3.3, P_MCMC=0.001; N_Subjects=10, N_{Trials/Subject}=640; note that conventional t/F-distributions and degrees of freedom are not directly applicable to linear mixed models). Figure 2 shows the estimated marginal means of RT lags caused by TMS in each task condition and at each target location, as derived from the linear mixed model. These data show that the RT lags were more pronounced (a) after posterior than anterior AC TMS during the spatial task (P<0.05, Tukey adjusted; N_Subjects=10, N_{Trials/Subject}=640) and (b) after anterior than posterior AC TMS during the identity task (P<0.05, Tukey adjusted; N_Subjects=10, N_{Trials/Subject}=640). There was also a slightly weaker but significant main effect of TMS target location (t=−2.2, P_MCMC<0.05; N_Subjects=10, N_{Trials/Subject}=640). The main effect of task was non-significant, showing that there were no significant difficulty differences between the spatial and identity tasks. Group averages of RTs in different task conditions, separately for different TMS latencies, are shown in Table 1. An additional analysis did not find any significant interactions between the task, target location and TMS pulse latency.

**Figure 2: TMS-induced RT lags during spatial and identity tasks.**

Table 1 Group-average RTs in milliseconds.

Full size table

For hit rates (HRs), the linear mixed model did not indicate significant interaction effects between the task, target location and TMS treatment. TMS resulted in an overall decrease of 9% in HR (estimated marginal means±s.e.: 77±4% after TMS, 86±4% with no TMS; t₆₃=4.1, P<0.001; N_Subjects=10, N_{Trials/Subject}=640). The group estimates of HR values in the individual task conditions are shown in Table 2. Finally, Fig. 3 shows the variability of task performance, quantified as the s.d. of RT, in each individual subject during the baseline trials, TMS trials and across all trials.

Table 2 Group-average HR percentages.

Full size table

**Figure 3: The s.d. of individual-trial RTs in the population of 10 subjects.**

Verifying TMS targeting by cortical E-field estimates

The results of electromagnetic modelling analyses, which were conducted to verify the foci of E-fields induced by TMS in each subject, are shown in Fig. 4. These data show that the intended targets were successfully stimulated during both posterior and anterior AC TMS conditions. The anterior AC stimulation reached maximal levels at areas anterolateral to Heschl’s gyrus, including anterior STG. The supratemporal areas stimulated during TMS of posterior AC included posterior STG and PT. Further, the E-field distributions were quite focal, resulting in that collateral fields (surrounding the intended target areas), which are largely unavoidable with TMS, did not extend to areas that have been associated with frontal or parietal auditory ‘what’ or ‘where’ pathways. For example, during the anterior TMS pulses, the supra-Sylvian fields reaching the threshold were centred at the central sulcus, far from IFG areas presumed to be associated with the auditory ‘what’ stream beyond AC. During posteriorly targeted TMS, the supra-Sylvian fields were strongest in inferior aspects of parietal cortex. The mean±s.d. distance along the supratemporal surface of peak E-field values between the anterior versus posterior stimulation sites was 34±10 mm (N_Subjects=10, N_{Pulse pairs/Location}=160) in the left and 37±10 mm in the right hemisphere (N_Subjects=10, N_{Pulse pairs/Location}=160).

**Figure 4: Modeling of TMS-induced E-fields in the cortex.**

Continuous tracking of the coils and head position with the navigation device indicated that movement was minimal during the experiment (group level mean of individually computed SDs for coil/head movements was 1.2 mm). This excludes the possibility that the differences between runs would have been caused by head/coil movements leading to stimulation of different brain areas.

Discussion

Here we studied the effects of transient focal AC deactivations induced by TMS⁴⁴ on processing of object identity versus spatial aspects of auditory stimuli. TMS pulse pairs targeted at anterior non-primary AC areas produced more pronounced RT lags during sound identity than location discrimination performance. In contrast, TMS targeted at posterior non-primary AC areas delayed RTs significantly more during the audiospatial than identity task. The linear mixed modelling analyses showed a corresponding significant interaction between the auditory task and anatomical target location on TMS-induced RT lags. These results provide direct causal support for distinct human AC regions for sound location and identity processing, which so far has been studied in healthy humans mainly using neuroimaging methods. The study also offers proof-of-principle for using TMS for studies on human AC functional anatomy.

Previous studies have suggested that delivering ppTMS at the presently used intrapair interval of 2.5 ms triggers transient intracortical inhibition that lasts for up to a few tens of milliseconds³⁶. Here the TMS pulses were delivered at four latencies between 55 and 145 ms after sound onset. This encompasses the time window during which the MEG/EEG response N1 typically ascends, peaks and descends in the posterior and anterior non-primary ACs (for example, refs 3, 45, 46). Notably, stimulus-locked neuronal processes occurring at these latencies at non-primary ACs are also believed to coincide with the emergence of conscious percepts and formation of neuronal representation of sound objects^45,46. One might thus speculate that the present behavioural effects of TMS were caused by increased local inhibition that delayed sound-feature processing in the underlying areas. However, further studies are needed to verify the exact neuronal mechanisms of the presently observed effects.

A large number of investigations in humans (for example, refs 1, 2, 3, 11, 12, 13, 14) and animal models^1,2,4, even some of those supporting distributed coding instead of topographical representation of acoustic space^47,48, are in line with the view that neurons processing spatial features are most abundantly populating the posterior auditory fields. As for the ‘what’ pathway, contradictory findings have been reported, for instance, in recent human neuroimaging studies that have found spread of pitch-related neuroimaging activations also to posterior AC⁴⁹. This has been interpreted to contradict the existence of a pitch-specific region in the anterolateral AC, previously reported in monkeys⁵⁰ and humans (for example, refs 15, 16). The present study, in which sound identity processing was measured using a task that required discrimination of AM differences (that is, temporal pitch), however, suggests that anterior AC areas may, indeed, be behaviourally relevant for pitch tasks. However, it is noteworthy that the present AM frequencies were slightly closer to the boundary of pitch versus flutter perception than the typical fundamental frequencies (f₀) used in previous temporal-pitch functional MRI studies; for example, f₀=62.5 Hz in ref. 51 and 83.1 Hz in ref. 15. Thus, future studies may be needed to see how the present results generalize across different stimulus and task types.

Although previous studies^3,45,46 have suggested a latency difference of about 10–30 ms between the N1 activation peaks in the posterior ‘where’ versus anterior ‘what’ regions, the present results regarding TMS latencies were inconclusive. That is, no significant interactions between the task type, TMS target and TMS latency emerged. Further studies could illuminate whether the interruption of local AC processes have characteristic critical latencies for different task types and subanatomical areas in humans. On the same note, the behavioural effects induced by TMS in the present study were, as fully expected because of the more transient nature of neuronal manipulations, more subtle than those produced by recent cooling deactivation studies⁴, which resulted in complete inactivation of larger extents of AC (and probably also in areas connected to the deactivated regions^28,29). Instead of a major decrease of HR, we observed RT delays <100 ms. Note also that the current extent of RT lags is logical, given the presumed short duration of neuronal effects of ppTMS.

The present study demonstrates the utility of electromagnetic modelling of the TMS-induced E-fields at the individual cortical surfaces and the value of being able to compute surface-based across-subjects averages of the E-fields. To our knowledge, this is the first applied TMS study to report such data. Our modelling, specifically, showed that anterior and posterior TMS targeted separate areas in the intended regions, extending from the supratemporal cortex inside the Sylvian fissure to lateral aspects of STG that are more accessible to TMS than the core regions inside the sulcus. It is, however, noteworthy that beyond coil design and selection, TMS does not allow customized shaping of the E-fields, and volume conduction leads to that not only the maximum but also the immediately surrounding areas receive stimulation. For example, despite the fact that our study was based on an MRI-guided online navigation system and quite small stimulation coils that allow relatively focal targeting of cortical tissue, not only STG but also the adjacent gyri (albeit with lesser intensities) were stimulated. Yet, our electromagnetic modelling showed that, in the case of anterior AC stimulation, such ‘collateral’ TMS effects in supra-Sylvian regions were clearly more posterior than the inferior frontal ‘what’ stream (including the IFG areas pars opercularis and triangularis), as they actually peaked in the postcentral (that is, parietal lobe) regions. Similarly, in the case of the posterior AC target, the supra-Sylvian areas stimulated by ppTMS were restricted to the inferior aspects of parietal lobe and did not extend into the vicinity of the intraparietal sulcus, which is a key anatomical locus of the posterior parietal ‘where’ processing stream^1,52. Most importantly, our estimates of TMS-induced E-fields showed clearest and most consistent foci across subjects in the anticipated AC areas.

TMS coil discharges produce both auditory and somatosensory stimulations, which may cause auditory masking effects or nonspecific distractions on cognitive performance beyond the neuronal mechanisms of interest. For example, in the present study, ppTMS resulted also in nonspecific task modulations (RT increases to ppTMS, irrespective of target location) that might have been partially affected by these biases. (Note that, here, we used relatively small figure-of-eight of coils that produce less pronounced clicks than larger coils, as well as earpieces that dampen the click.) In previous studies, researchers have therefore applied several types of control conditions to separate the TMS effects of interest from artifacts. For example, one can apply either ‘sham TMS’ or real TMS to a control brain region that does not, presumably, participate in the activity of interest. A complication for using an independent ‘non-auditory’ control region is, however, that a wide network of areas beyond the supratemporal cortex is, either directly or through polymodal associations, activated during active auditory task performance. Indeed, auditory RTs have been shown to be affected by TMS delivered to a great variety of such ‘extra-auditory’ areas^38,39,40,41. Therefore, in the present study, we applied a factorial design where each of the AC regions of interest, anterior versus posterior, acted as their own control during the two different auditory task domains. It is also noteworthy that the relatively high spatial focality and short (2.5-ms) duration of ppTMS would suggest that the behavioural effects emerge from local inhibition in the target area where SICI occurs. Although spreading influences to connected areas cannot be completely excluded, if present, they are probably much less prominent than those observed with deactivations lasting several orders of magnitude longer^{30,31,32,33,34} used in animal models⁴. Finally, it is possible that certain extracerebral artifacts, such as scalp muscle stimulation, may result in feedback activities that might potentially cause more pronounced effects on processing of sound identity, particularly in the case of speech sounds³⁸. Here such biases are probably less likely, as the present identity dimension was artificial, not directly linked to speech features.

In summary, TMS to posterior AC regions delayed RTs significantly more during sound location than identity discrimination, whereas TMS to anterior AC regions delays RTs significantly more during sound identity than location discrimination. This double dissociation provides causal support for the parallel processing model^1,2,3,4 postulating that sound identity features are processed in anterior AC and sound location is encoded in posterior AC in humans. These results also demonstrate the potential usefulness of ppTMS for studying functional anatomy of the AC in humans, especially when combined with forward modelling of the TMS-induced E-fields on the cortical surface. Further studies are needed to verify the exact neuronal mechanisms that underlie the ppTMS cortical effects.

Methods

Subjects and task design

Ten healthy right-handed subjects (five women, age 22–51 years) with self-reported normal hearing, screened for TMS and MRI contraindications (metal in the body, implanted medical devices, medications affecting the central nervous system, pregnancy, history of seizures/convulsions/fainting/syncope or significant head trauma)⁵³ participated in the study. Human subjects’ approval was obtained and voluntary informed consents approved by the Massachusetts General Hospital Institutional Review Board were signed before the start of the experiment. TMS targeted at AC regions might stimulate temporal muscles, the intensity and discomfort of which varies across individuals. The subjects were informed that they could interrupt the experiment at any time, and their willingness to continue was confirmed repeatedly during the session. In each forced-choice RT trial (Fig. 1a), subjects were delivered a pair of binaural 300-ms white-noise bursts (50-dB sensation level, 10-ms on/off ramps) at a 1-s onset-to-onset interval (S14 Insert headphones, Sensimetrics, Malden, MA). The first sound of each pair was Reference, and the second was Probe. In the identity task, Probe was amplitude modulated (AM) at either one-sixth octaves lower (35.6 Hz) or higher (44.9 Hz) frequency than Reference (40-Hz AM frequency). After the lower-AM Probe, subjects were to press the leftmost of two buttons using their right-hand index finger, and after the higher-AM Probe the rightmost of these buttons with the right-hand middle finger. In the spatial task (Fig. 1a), References were simulated from 25° and Probes from either 20° or 30° to the right along the azimuth by convolving the Reference sound of the identity task (40 Hz AM) with generic head-related transfer functions⁵⁴. The subjects were instructed to press the leftmost button with the right-hand index finger to the left Probes, and the rightmost button with the right-hand middle finger to the right Probes. Subjects received feedback in both tasks (‘OK’ or ‘Wrong’ shown during 0.9–1.9 s after each button press). The subsequent trial started 700 ms after the feedback stimulus ended. The trial duration (average across subjects 5.5 s) was, thus, paced by the subjects’ performance. The stimuli were presented and behavioural responses recorded on a PC running Presentation 14.2 (Neurobehavioral Systems, Albany, CA), which sent timing information to the TMS stimulators to synchronize the pulses with auditory stimuli. The difficulty of each task was matched a priori. Our statistical model (see below) showed no significant main effects of the task (identity versus spatial) for RTs or HRs, suggesting that the baseline difficulty levels of each task were consistent.

ppTMS design

TMS experiments were conducted in a dimly lit sound-attenuated low-reverberation chamber. The TMS coil navigation system was co-registered with each subject’s structural MRI (3T Siemens TimTrio, multi-echo MPRAGE pulse sequence, TR=2,510 ms; 4 echoes with TEs=1.64, 3.5, 5.36 and 7.22 ms; 176 sagittal slices, 1-mm isotropic voxels, 256 × 256 matrix; flip angle=7°) with respect to the fiduciary landmarks (nasion, two preauricular points) and additional scalp points using a 3D digitizer (Nexstim NBS, Helsinki, Finland). Two-channel TMS was guided with an infrared navigation system that calculates the location and strength of the induced E-field and displays this on the subject’s MRI in real time (Nexstim NBS). Subjects’ head movements were minimized by using a headrest and a vacuum pillow.

During auditory stimulation/tasks, ppTMS (80/120% motor threshold of hand first dorsal interosseous; interpolated from previously reported optimal values^35,36,37) with 2.5-ms inter-pulse interval, biphasic waveform, was delivered simultaneously to both hemispheres with two stimulators (MagPro X100 w/MagOption, MagVenture, Falun, Denmark) and two figure-of-eight coils (MagPro C-B60, MagVenture). In separate runs, TMS was targeted either to anterior or posterior non-primary AC areas. TMS coil orientation was perpendicular to the individual local curvature of Sylvian fissure, (that is, about vertical). Bilateral TMS was chosen because sounds are processed simultaneously in both hemispheres, and because previous studies⁴ suggest more consistent effects after bilateral than unilateral AC deactivations. Bilateral TMS, presumably, also helped avoid lateralized distraction effects during Spatial task. On the basis of previous studies^3,46, the TMS pulse pairs were applied randomly at four different latencies (55, 85, 115 and 145 ms) after the Probe sound onset, covering the typical N1(m) activity in ACs.

The experiment was divided into eight runs, each containing 80 trials and lasting on average 7 min 18 s. In each run, TMS was applied in 50% of trials in random order. Each session, thus, included 320 ppTMS events. On average, ppTMS was delivered about every 10–13 s. To avoid making the experiment excessively long, all conditions in one TMS location were always conducted in a row (coil re-targeting may require extra time when two coils are used). The order of TMS locations and task conditions was counterbalanced across subjects.

Anatomical ppTMS target definition

Each subject’s anatomical MRIs were segmented and co-registered to volume-based and surface-based standard brain representations using FreeSurfer 5.1 (http://www.surfer.nmr.mgh.harvard.edu). The TMS target locations were selected based on a sample of previously published maximally activated functional MRI^{10,11,13,14,15,16} and positron emission tomography⁵¹ voxels during audiospatial or sound-object identity processing. More specifically, the posterior AC targets were defined by averaging the peak-voxel Talairach coordinates of contrasts reflecting sound movement versus rest¹¹, sound direction changes versus constant stimulation¹³, sound distance versus intensity changes¹⁴ and spatially shifting versus stationary sounds¹⁰. When needed, Montreal Neurological Institute coordinates were converted to Talairach coordinates. The anterior AC targets were defined similarly, based on peak voxels in contrasts reflecting varying versus fixed pitch^10,15,16, fixed pitch versus noise¹⁵, fixed Huggins pitch versus noise¹⁶, fixed binaural band pitch versus noise¹⁶ and pitch-strength-by-melody interactions⁵¹. For both targets, average loci were calculated first within, and then across studies. The resulting Talairach coordinates (mm) of initial targets were {x,y,z}={−52,−30,12} for the left posterior, {x,y,z}={−54,0,−5} for the left anterior, {x,y,z}={60,−30,11} for the right posterior and {x,y,z}={55,−4,−6} for the right anterior AC (Fig. 1b). These locations were then transformed to each individual subject’s brain representations. Finally, before the TMS experiment, an optimized entry point closer to the inner skull was selected, as guided by the E-field estimates produced by the navigation system, to allow the TMS effects to be maximally focused to the target area.

TMS target post hoc confirmation

Electromagnetic modelling analyses utilizing realistic anatomy were conducted to estimate the actual cortically induced E-field distributions in each subject. The inner skull surface obtained from Freesurfer MRI reconstructions was used to create a single-layer boundary element model. The intracranial space was considered as a homogenous isotropic volume conductor. The position and orientation of the TMS coil was exported from the navigator computer in the MRI coordinates and a model for the wire winding geometry was constructed according to the manufacturer’s specifications (MagPro C-B60, MagVenture). TMS-induced E-field amplitudes were computed at the white matter surfaces (extracted using Freesurfer) according to the well-established physical principles, as described by Nummenmaa et al.⁵⁵. The custom implementation of the numerical methods was done in Matlab (R2012a, The Mathworks, Inc., Natick, MA) utilizing the core routines from the boundary element model toolbox of ref. 56. For comparisons across subjects, the cortical activation estimates were co-registered via spherical morphing to a surface-based standard brain representation⁵⁷, thresholded at 80% of the individual’s maximum and normalized across individuals.

Data analysis

A full factorial design was utilized to control for potential biases caused by the TMS side effects (acoustic clicks and muscle stimulation). Behavioural data, including RTs (correct responses only) and HRs, were recorded separately for each task condition. At the initial screening, trial responses faster or slower than two s.d. of each subject’s average RT were excluded as outliers (physiologically unreasonable responses, that is, RT<50 ms or >4 s, were excluded before this). Statistical analyses were conducted using the R packages lme4, languageR and lsmeans^58,59,60. Instead of averaging RTs first within subjects, we utilized a linear mixed approach^42,43 that models individual responses in each task condition/subject and brings the potential longitudinal biases between successive trials into the statistical model specification. To further mitigate within-session fluctuations (for example, related to the blocked TMS design), we specifically examined the time series of TMS-induced RT lags obtained by subtracting the within-condition average RT_Baseline from each individual RT_TMS event. The RT lag time series were finally entered into a linear mixed model: The random effects included the subject and trial type (sound direction nested within spatial task; AM frequency nested within identity task); the fixed effects included the task (identity versus spatial), TMS target location (anterior versus posterior), task-by-TMS-target–location interaction, TMS pulse-pair latency (55–145 ms), age, gender and the trial-specific sequential predictors (task-block number, trial numbers and RT to preceding baseline trial) controlling for temporal dependencies/autocorrelations. We controlled for potential biases caused by non-normality and homogeneity by examining the model residuals against fitted values. Multicollinearity was addressed by residualizing predictors correlating with each other. The model was weighted by the inverse variances of each subject’s behavioural performance.

Our main hypothesis regarded the interaction between the task and TMS target location: we hypothesized that RT lags are significantly larger with TMS targeted to the anterior versus posterior AC during the identity task, and vice versa during the spatial task. Statistical significances were presented as MCMC-estimated P-values. A priori comparisons were computed based on the main models using the R lsmeans package⁶⁰. Finally, HR results were analysed with a linear mixed model with the task, target location, treatment (TMS versus no TMS), age and gender as fixed-effect factors and the subject as a random-effect factor.

Additional information

How to cite this article: Ahveninen, J. et al. Evidence for distinct human auditory cortex regions for sound location versus identity processing. Nat. Commun. 4:2585 doi: 10.1038/ncomms3585 (2013).

References

Rauschecker, J. P. & Tian, B. Mechanisms and streams for processing of ‘what’ and ‘where’ in auditory cortex. Proc. Natl Acad. Sci. USA 97, 11800–11806 (2000).
Article CAS ADS Google Scholar
Tian, B., Reser, D., Durham, A., Kustov, A. & Rauschecker, J. P. Functional specialization in rhesus monkey auditory cortex. Science 292, 290–293 (2001).
Article CAS ADS Google Scholar
Ahveninen, J. et al. Task-modulated ‘what’ and ‘where’ pathways in human auditory cortex. Proc. Natl Acad. Sci. USA 103, 14608–14613 (2006).
Article CAS ADS Google Scholar
Lomber, S. G. & Malhotra, S. Double dissociation of 'what' and 'where' processing in auditory cortex. Nat. Neurosci. 11, 609–616 (2008).
Article CAS Google Scholar
Romanski, L. M. et al. Dual streams of auditory afferents target multiple domains in the primate prefrontal cortex. Nat. Neurosci. 2, 1131–1136 (1999).
Article CAS Google Scholar
Clarke, S., Bellmann, A., Meuli, R. A., Assal, G. & Steck, A. J. Auditory agnosia and auditory spatial deficits following left hemispheric lesions: evidence for distinct processing pathways. Neuropsychologia 38, 797–807 (2000).
Article CAS Google Scholar
Alain, C., Arnott, S. R., Hevenor, S., Graham, S. & Grady, C. L. ‘What’ and ‘where’ in the human auditory system. Proc. Natl Acad. Sci. USA 98, 12301–12306 (2001).
Article CAS ADS Google Scholar
Maeder, P. P. et al. Distinct pathways involved in sound recognition and localization: a human fMRI study. Neuroimage 14, 802–816 (2001).
Article CAS Google Scholar
Bushara, K. O. et al. Modality-specific frontal and parietal areas for auditory and visual spatial localization in humans. Nat. Neurosci. 2, 759–766 (1999).
Article CAS Google Scholar
Barrett, D. J. & Hall, D. A. Response preferences for ‘what’ and ‘where’ in human non-primary auditory cortex. Neuroimage 32, 968–977 (2006).
Article Google Scholar
Warren, J. D., Zielinski, B. A., Green, G. G., Rauschecker, J. P. & Griffiths, T. D. Perception of sound-source motion by the human brain. Neuron 34, 139–148 (2002).
Article CAS Google Scholar
Brunetti, M. et al. Human brain activation during passive listening to sounds from different locations: an fMRI and MEG study. Hum. Brain Mapp. 26, 251–261 (2005).
Article CAS Google Scholar
Deouell, L. Y., Heller, A. S., Malach, R., D’Esposito, M. & Knight, R. T. Cerebral responses to change in spatial location of unattended sounds. Neuron 55, 985–996 (2007).
Article CAS Google Scholar
Kopčo, N. et al. Neuronal representations of distance in human auditory cortex. Proc. Natl Acad. Sci. USA 109, 11019–11024 (2012).
Article ADS Google Scholar
Patterson, R. D., Uppenkamp, S., Johnsrude, I. S. & Griffiths, T. D. The processing of temporal pitch and melody information in auditory cortex. Neuron 36, 767–776 (2002).
Article CAS Google Scholar
Puschmann, S., Uppenkamp, S., Kollmeier, B. & Thiel, C. M. Dichotic pitch activates pitch processing centre in Heschl's gyrus. Neuroimage 49, 1641–1649 (2010).
Article Google Scholar
Binder, J. R. et al. Human temporal lobe activation by speech and nonspeech sounds. Cereb. Cortex 10, 512–528 (2000).
Article CAS Google Scholar
Scott, S. K., Blank, C. C., Rosen, S. & Wise, R. J. Identification of a pathway for intelligible speech in the left temporal lobe. Brain 123, 2400–2406 (2000).
Article Google Scholar
Obleser, J. et al. Vowel sound extraction in anterior superior temporal cortex. Hum. Brain Mapp. 27, 562–571 (2005).
Article Google Scholar
DeWitt, I. & Rauschecker, J. P. Phoneme and word recognition in the auditory ventral stream. Proc. Natl Acad. Sci. USA 109, E505–E514 (2012).
Article CAS ADS Google Scholar
Thivard, L., Belin, P., Zilbovicius, M., Poline, J. B. & Samson, Y. A cortical region sensitive to auditory spectral motion. Neuroreport 11, 2969–2972 (2000).
Article CAS Google Scholar
Griffiths, T. D. & Warren, J. D. The planum temporale as a computational hub. Trends Neurosci. 25, 348–353 (2002).
Article CAS Google Scholar
Zatorre, R. J., Evans, A. C., Meyer, E. & Gjedde, A. Lateralization of phonetic and pitch discrimination in speech processing. Science 256, 846–849 (1992).
Article CAS ADS Google Scholar
Belin, P. & Zatorre, R. J. ‘What’, ‘where’ and ‘how’ in auditory cortex. Nat. Neurosci. 3, 965–966 (2000).
Article CAS Google Scholar
Recanzone, G. H. & Cohen, Y. E. Serial and parallel processing in the primate auditory cortex revisited. Behav. Brain Res. 206, 1–7 (2010).
Article Google Scholar
Adriani, M. et al. Sound recognition and localization in man: specialized cortical networks and effects of acute circumscribed lesions. Exp. Brain Res. 153, 591–604 (2003).
Article Google Scholar
Clarke, S. et al. What and where in human audition: selective deficits following focal hemispheric lesions. Exp. Brain Res. 147, 8–15 (2002).
Article Google Scholar
Payne, B. R. & Lomber, S. G. A method to assess the functional impact of cerebral connections on target populations of neurons. J. Neurosci. Methods 86, 195–208 (1999).
Article CAS Google Scholar
Vanduffel, W., Payne, B. R., Lomber, S. G. & Orban, G. A. Functional impact of cerebral connections. Proc. Natl Acad. Sci. USA 94, 7617–7620 (1997).
Article CAS ADS Google Scholar
Clarey, J. C., Tweedale, R. & Calford, M. B. Interhemispheric modulation of somatosensory receptive fields: evidence for plasticity in primary somatosensory cortex. Cereb. Cortex 6, 196–206 (1996).
Article CAS Google Scholar
Wilke, M., Kagan, I. & Andersen, R. A. Functional imaging reveals rapid reorganization of cortical activity after parietal inactivation in monkeys. Proc. Natl Acad. Sci. USA 109, 8274–8279 (2012).
Article CAS ADS Google Scholar
Calford, M. B. & Tweedale, R. Interhemispheric transfer of plasticity in the cerebral cortex. Science 249, 805–807 (1990).
Article CAS ADS Google Scholar
Werhahn, K. J., Mortensen, J., Kaelin-Lang, A., Boroojerdi, B. & Cohen, L. G. Cortical excitability changes induced by deafferentation of the contralateral hemisphere. Brain 125, 1402–1413 (2002).
Article Google Scholar
Popelar, J., Nwabueze-Ogbo, F. C. & Syka, J. Changes in neuronal activity of the inferior colliculus in rat after temporal inactivation of the auditory cortex. Physiol. Res. 52, 615–628 (2003).
CAS PubMed Google Scholar
Kujirai, T. et al. Corticocortical inhibition in human motor cortex. J. Physiol. 471, 501–519 (1993).
Article CAS Google Scholar
Oliveri, M. et al. Paired transcranial magnetic stimulation protocols reveal a pattern of inhibition and facilitation in the human parietal cortex. J. Physiol. 529 Pt 2, 461–468 (2000).
Article CAS Google Scholar
Ilic, T. V. et al. Short-interval paired-pulse inhibition and facilitation of human motor cortex: the dimension of stimulus intensity. J. Physiol. 545, 153–167 (2002).
Article CAS Google Scholar
Gough, P. M., Nobre, A. C. & Devlin, J. T. Dissociating linguistic processes in the left inferior frontal cortex with transcranial magnetic stimulation. J. Neurosci. 25, 8010–8016 (2005).
Article CAS Google Scholar
Collignon, O. et al. Time-course of posterior parietal and occipital cortex contribution to sound localization. J. Cogn. Neurosci. 20, 1454–1463 (2008).
Article Google Scholar
At, A., Spierer, L. & Clarke, S. The role of the right parietal cortex in sound localization: a chronometric single pulse transcranial magnetic stimulation study. Neuropsychologia 49, 2794–2797 (2011).
Article Google Scholar
Lewald, J., Staedtgen, M., Sparing, R. & Meister, I. G. Processing of auditory motion in inferior parietal lobule: evidence from transcranial magnetic stimulation. Neuropsychologia 49, 209–215 (2011).
Article Google Scholar
Baayen, R. H., Davidson, D. J. & Bates, D. M. Mixed-effects modeling with crossed random effects for subjects and items. J. Mem. Lang. 59, 390–412 (2008).
Article Google Scholar
Baayen, R. H. & Milin, P. Analyzing reaction times. Int. J. Psychol. Res. 3, 12–28 (2010).
Article Google Scholar
Pascual-Leone, A., Walsh, V. & Rothwell, J. Transcranial magnetic stimulation in cognitive neuroscience--virtual lesion, chronometry, and functional connectivity. Curr. Opin. Neurobiol. 10, 232–237 (2000).
Article CAS Google Scholar
Lu, Z. L., Williamson, S. J. & Kaufman, L. Behavioral lifetime of human auditory sensory memory predicted by physiological measures. Science 258, 1668–1670 (1992).
Article CAS ADS Google Scholar
Jääskeläinen, I. P. et al. Human posterior auditory cortex gates novel sounds to consciousness. Proc. Natl Acad. Sci. USA 101, 6809–6814 (2004).
Article ADS Google Scholar
Stecker, G. C., Mickey, B. J., Macpherson, E. A. & Middlebrooks, J. C. Spatial sensitivity in field PAF of cat auditory cortex. J. Neurophysiol. 89, 2889–2903 (2003).
Article Google Scholar
Salminen, N. H., May, P. J., Alku, P. & Tiitinen, H. A population rate code of auditory space in the human cortex. PLoS One 4, e7600 (2009).
Article ADS Google Scholar
Hall, D. A. & Plack, C. J. Pitch processing sites in the human auditory brain. Cereb. Cortex 19, 576–585 (2009).
Article Google Scholar
Bendor, D. & Wang, X. The neuronal representation of pitch in primate auditory cortex. Nature 436, 1161–1165 (2005).
Article CAS ADS Google Scholar
Griffiths, T. D., Buchel, C., Frackowiak, R. S. & Patterson, R. D. Analysis of temporal structure in sound by the human brain. Nat. Neurosci. 1, 422–427 (1998).
Article CAS Google Scholar
Rauschecker, J. P. & Scott, S. K. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat. Neurosci. 12, 718–724 (2009).
Article CAS Google Scholar
Rossi, S., Hallett, M., Rossini, P. M. & Pascual-Leone, A. & Safety of T.M.S.C.G. Safety, ethical considerations, and application guidelines for the use of transcranial magnetic stimulation in clinical practice and research. Clin. Neurophysiol. 120, 2008–2039 (2009).
Article Google Scholar
Algazi, V. R., Duda, R. O., Thompson, M. & Avendano, C. in IEEE Workshop on Applications of Signal Processing to Audio and Electroacoustics, 99–102 (Mohonk Mountain House, 2001).
Nummenmaa, A. et al. Comparison of spherical and realistically shaped boundary element head models for transcranial magnetic stimulation navigation. Clin. Neurophysiol. 124, 1995–2007 (2013).
Article Google Scholar
Stenroos, M., Mantynen, V. & Nenonen, J. A Matlab library for solving quasi-static volume conduction problems using the boundary element method. Comput. Methods Programs Biomed. 88, 256–263 (2007).
Article CAS Google Scholar
Fischl, B., Sereno, M. & Dale, A. Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system. Neuroimage 9, 195–207 (1999).
Article CAS Google Scholar
Bates, D. M. & Maechler, M. lme4: Linear mixed-effects models using S4 classes. R package version 0.999999-0 (2009).
Team, R.D.C. R: a language and environment for statistical computing R Foundation for Statistical Computing (2009).
Lenth, R. V. lsmeans: least-squares means. R Package version 1.06-05 (2013).

Download references

Acknowledgements

We thank Mary O’Hara, Nancy Shearer, Chinmayi Tengshe and Lawrence White, as well as Drs Wei-Tang Chang, Sharon Furtak, Matti Hämäläinen and Norbert Kopčo. This research was supported by the National Institutes of Health (NIH) Grants R21DC010060, R01MH083744, R01HD040712, R01NS037462, R01NS048279 and K99EB015445. The research environment was supported by the NIH/National Institute of Biomedical Imaging and Bioengineering (NIBIB) Grant P41EB015896 (Center for Functional Neuroimaging Techniques, CFNT), NIH Shared Instrumentation Grant S10-RR024694, and the Harvard Clinical and Translational Science Center (Harvard Catalyst; NCRR-NIH UL1 RR025758; NCRR-NIH UL1 TR000170). I.P.J. was supported by the Academy Of Finland Grant 130412, and J.P.R was supported by the NIH Grant R56NS052494 and by the National Science Foundation Grant NSF PIRE-OISE-0730255. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funding agencies.

Author information

Authors and Affiliations

Department of Radiology, Harvard Medical School—Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Building 149, 13th Street, Charlestown, 02129, Massachusetts, USA
Jyrki Ahveninen, Samantha Huang, Aapo Nummenmaa, John W. Belliveau, An-Yi Hung, Stephanie Rossi & Tommi Raij
Harvard-MIT Division of Health Sciences and Technology, Cambridge, Massachusetts, USA
John W. Belliveau
Department of Biomedical Engineering and Computational Science (BECS), Aalto University School of Science, Espoo, Aalto, FIN-00076, Finland
Iiro P. Jääskeläinen & Hannu Tiitinen
Department of Neuroscience, Laboratory of Integrative Neuroscience and Cognition, Georgetown University Medical Center, New Research Building, Room WP 15, 3900 Reservoir Road, Northwest Washington, 20057-1460, District of Columbia, USA
Josef P. Rauschecker

Authors

Jyrki Ahveninen
View author publications
You can also search for this author in PubMed Google Scholar
Samantha Huang
View author publications
You can also search for this author in PubMed Google Scholar
Aapo Nummenmaa
View author publications
You can also search for this author in PubMed Google Scholar
John W. Belliveau
View author publications
You can also search for this author in PubMed Google Scholar
An-Yi Hung
View author publications
You can also search for this author in PubMed Google Scholar
Iiro P. Jääskeläinen
View author publications
You can also search for this author in PubMed Google Scholar
Josef P. Rauschecker
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Rossi
View author publications
You can also search for this author in PubMed Google Scholar
Hannu Tiitinen
View author publications
You can also search for this author in PubMed Google Scholar
Tommi Raij
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.A., I.P.J, J.P.R. and T.R. designed the study; J.A., S.H., A.Y.H., S.R. and T.R. performed the experiments; A.N., J.W.B. and T.R. contributed new analysis tools; J.A., S.H., A.Y.H., S.R. and T.R. analysed data; J.A., S.H., A.N., J.W.B., A.Y.H., I.P.J., J.P.R., S.R., H.T. and T.R. wrote the manuscript.

Corresponding author

Correspondence to Jyrki Ahveninen.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ahveninen, J., Huang, S., Nummenmaa, A. et al. Evidence for distinct human auditory cortex regions for sound location versus identity processing. Nat Commun 4, 2585 (2013). https://doi.org/10.1038/ncomms3585

Download citation

Received: 20 May 2013
Accepted: 10 September 2013
Published: 14 October 2013
DOI: https://doi.org/10.1038/ncomms3585

This article is cited by

Understanding rostral–caudal auditory cortex contributions to auditory perception
- Kyle Jasmin
- César F. Lima
- Sophie K. Scott
Nature Reviews Neuroscience (2019)
Cortical mechanisms of spatial hearing
- Kiki van der Heijden
- Josef P. Rauschecker
- Elia Formisano
Nature Reviews Neuroscience (2019)
Decoding auditory spatial and emotional information encoding using multivariate versus univariate techniques
- James H. Kryklywy
- Ewan A. Macpherson
- Derek G. V. Mitchell
Experimental Brain Research (2018)
Rehabilitating the addicted brain with transcranial magnetic stimulation
- Marco Diana
- Tommi Raij
- Antonello Bonci
Nature Reviews Neuroscience (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.