Introduction

It has been suggested that the amygdala classifies sensory input according to its emotional and motivational relevance1,2 and modulates ongoing sensory processing, leading to enhanced representations of emotionally relevant stimuli3,4. Social signals, such as emotional vocal and facial expressions, typically represent environmental aspects of high social and personal relevance (e.g., indicating other persons’ intentions or pointing towards relevant environmental changes), and highly intense expressions are associated with higher arousal ratings than less intense expressions5. It has been shown that the amygdala responds to both emotional vocal6,7,8,9 and facial expressions10. However, despite a large body of imaging studies on this issue, previous research does not provide an unequivocal answer regarding the factors that drive amygdalar responses to emotionally expressive voices and faces. In particular, the specificity of amygdalar responding, that is, the proneness to respond to negative, threat-related emotional information, has been a matter of debate11,12,13,14. Furthermore, it is uncertain whether emotional signals from different sensory domains are processed in an analogous fashion or whether modality-specific idiosyncrasies exist15,16.

Regarding the mentioned specificity of amygdalar activation to negative as compared to positive stimuli, findings have been mixed. Several studies employing emotional facial expressions suggest a heightened sensitivity to negative, particularly threat-related, stimuli17,18,19,20,21,22,23. Unfortunately, most of these studies do not clarify whether this ‘threat sensitivity’ reflects effects of stimulus valence and/or stimulus arousal19. Several studies indicate that the amygdala is sensitive to positive and negative stimuli24,25,26,27 and might code general effects of motivational relevance and, therefore, general arousal, irrespective of valence11,12,14,28,29. With regard to facial expressions, enhanced amygdalar activation has been observed for various types of facial expressions, including happy and surprised faces23,30,31,32. Previous research from our own lab provides evidence for amygdalar modulation as a function of stimulus arousal irrespective of stimulus valence, using positive and negative expressions of varying emotional intensity5. Here, intensity refers to the entirety of aspects constitutive of the emotional experience as a whole33,34 and is highly correlated with emotional arousal5. Interestingly, some other studies also report modulation by expression intensity (and corresponding stimulus arousal), but in the inverse direction, that is, enhanced amygdalar activation to low-intensity/low-arousal expressions35 (see discussion below).

With regard to affective voice processing, findings are also mixed. Using verbal and non-verbal vocalizations, some studies show valence-specific amygdalar responses (e.g., responding to angry, fearful, and disgusted but not happy vocalizations)36,37, while others indicate valence-independent enhancements reflecting stimulus arousal38,39 or combined effects of stimulus valence and stimulus arousal5. In general, many studies only provide a dichotomous experimental manipulation (e.g., neutral versus negative expressions) and therefore do not provide information on the separate contributions of stimulus valence and stimulus arousal40,41,42.

With respect to potential parallels between the processing of emotional vocalizations and facial expressions, it remains uncertain whether the amygdala responds in a domain-general way across visual and auditory modalities. Recent reviews suggest that the amygdala is more important in affective face processing than in the processing of emotional vocalizations15,16. On the other hand, a large number of imaging studies demonstrate enhanced amygdalar activation to emotional signals from the auditory domain6,8,40,42,43. In a similar vein, lesion studies report impaired processing of emotional prosody in amygdala-lesioned patients37,44,45,46,47. Finally, there are some bimodal studies which suggest analogous response patterns across the visual and auditory domains36,48,49. Aubé and colleagues (2015)49, for instance, demonstrated enhanced amygdalar activation in response to fear-related facial expressions, vocalizations, and music, thus indicating parallels in the processing of emotional signals from different modalities50. Taken together, previous studies indicate that the amygdala responds to emotional signals from visual and auditory channels, although it is uncertain whether asymmetries in affective voice and face processing exist.

Several aspects might be relevant with regard to the heterogeneous findings of previous research. In particular, many of the above-mentioned studies neither assessed stimulus valence/arousal nor controlled for comparable arousal levels across valence categories22,36,49,50,51. Positively valenced facial expressions and voices may tend to be perceived as less arousing since they are frequently encountered in everyday life28,52. Importantly, several of the above-mentioned studies might have failed to create highly arousing positive signals, especially those which did not manipulate the intensity of emotional expressions20,22. These issues might attenuate the arousal effect of positive expressions on amygdalar responding, resulting in an apparent valence effect or a combined effect of stimulus valence and arousal. In addition, only a few studies used bimodal experimental designs including emotional signals from the visual and auditory domains36,49,50, allowing for a test of modality effects. Therefore, it remains uncertain whether the observed discrepancies reflect fundamental asymmetries in affective voice and face processing or methodological differences.

The present study aimed at systematically investigating the role of stimulus valence and stimulus arousal in the processing of emotional expressions from the visual and auditory modalities. More precisely, we were interested in clarifying whether amygdalar responding to affective voices and faces is driven by stimulus valence, stimulus arousal, or the interaction of both factors. In addition, we aimed to answer the question whether effects depend on the visual or auditory modality of emotional expressions. In order to circumvent the above-mentioned limitations of previous research, we employed stimuli which provided different levels of emotional intensity for positive and negative expressions and therefore comprised varying levels of stimulus arousal and valence. Stimulus arousal was comparable between negative and positive expressions. In addition, rating data reflecting stimulus valence/arousal were used as parametric predictors, modeling brain activation based on stimulus-specific mean arousal or valence ratings, in order to identify brain activation varying along these dimensions. Finally, we used a bimodal design in order to directly test potential domain-specific response patterns within the same experimental framework. Overall, we hypothesized that (1) amygdalar responses reflect modulation of neuronal activation as a function of stimulus arousal, with stronger activation for highly arousing/highly intense expressions, (2) potential effects of stimulus arousal and expression intensity do not depend on stimulus valence, and (3) amygdalar responding as a function of stimulus arousal and stimulus intensity is analogous across the visual and auditory modalities, with no modality-specific idiosyncrasies.

Methods

Participants

Twenty healthy undergraduate and postgraduate students (19–28 years, M = 22.30, SD = 2.54; 10 females) were recruited from the University of Jena, Germany. Participants were right-handed as determined by the Edinburgh Handedness Inventory53. All participants had normal or corrected-to-normal vision and no history of neurological or psychiatric disease. The study was conducted in accordance with the ethical standards of the Declaration of Helsinki and was approved by the Ethics Committee of the University of Jena. Written informed consent was obtained from all participants prior to participation.

Stimuli

Facial and vocal expressions were selected from our newly developed stimulus sets, the Jena 3D Face Database (J3DFD) and the Person Perception Research Unit – EmoVoice (PPRU – EmoVoice) database, respectively. The J3DFD contains 32 Caucasian individuals showing angry, fearful, sad, disgusted, happy, and surprised expressions at three intensity levels plus neutral expressions5,54. The PPRU – EmoVoice database consists of twenty-four neutral bisyllabic nouns spoken in angry, fearful, sad, disgusted, happy, and surprised prosody at three intensity levels, plus a neutral prosody, by five female and five male speakers. Stimuli were recorded and digitized through an audio interface at a 44,100 Hz sampling rate and 16-bit resolution, and utterances were normalized in amplitude. The facial and vocal stimuli had been rated by independent samples of 44 and 50 participants, respectively, with respect to physiological arousal (ranging from 1 = very low to 9 = very high) and valence (ranging from 1 = very unpleasant to 9 = very pleasant). Emotional expressions were additionally rated with respect to emotional expression intensity (ranging from 1 = very low to 7 = very high).

For the present study, we selected 50 facial and 50 vocal stimuli. Facial stimuli portrayed ten identities (5 females, 5 males) showing angry and happy expressions at high and low intensity levels plus neutral expressions. Vocal stimuli comprised ten nouns spoken by 5 females and 5 males in angry, happy, and neutral prosodies, matched for stimulus duration across emotional categories (mean: 658 ms, range: 415 ms − 917 ms, F (4, 45) = 0.84, p = 0.509, partial η² = 0.07). Mean ratings of emotional valence, arousal, and intensity for facial and vocal stimuli are shown in Table 1. For arousal ratings, an ANOVA revealed no significant main effect of expression (F (4, 90) = 5.90, p = 0.057, partial η² = 0.86) or modality (F (1, 90) = 2.72, p = 0.175, partial η² = 0.41), but an interaction between expression and modality (F (4, 90) = 10.91, p < 0.001, partial η² = 0.34). Regarding valence ratings, there was a significant main effect of expression (F (4, 90) = 8.83, p = 0.029, partial η² = 0.90) but not modality (F (1, 90) = 0.68, p = 0.457, partial η² = 0.15). Moreover, the interaction between expression and modality reached significance (F (4, 90) = 12.11, p < 0.001, partial η² = 0.35).

Table 1 Mean rating data on intensity (1 to 7), arousal (1 to 9) and valence (1 to 9) with respect to facial and vocal stimuli employed in the present study.

Procedure

Auditory stimuli were presented binaurally via headphones that were specifically adapted for use in the fMRI environment (commander XG MRI audio system, Resonance Technology, Northridge, USA). During auditory stimulation, a blank screen was presented. Visual stimuli were shown on a back-projection screen viewed via an overhead mirror. Scanning was conducted in two runs (run duration 12 min). Overall, there were 10 conditions (2 modalities [faces vs. voices] × 5 expressions [angry high, angry low, neutral, happy low, happy high]). Each condition was presented in one block (see Fig. 1 for a schematic presentation of the procedure), consisting of ten trials (i.e., each facial/vocal identity [5 females, 5 males, see also the Stimuli section] was presented once per block). The presentation sequence of identities was randomized across blocks and participants. Each block was presented twice, resulting in 20 blocks per run and, overall, 400 trials (5 expressions × 2 modalities × 10 identities × 2 repetitions × 2 runs). Blocks were separated by an 18-second pause. Visual stimuli were presented for 658 ms, while acoustic stimuli were presented for 658 ms on average (see stimulus description), with a stimulus onset asynchrony of 2000 ms. The sequence of blocks was counterbalanced between runs and across participants. Participants were instructed to perform a gender judgment task in order to ensure that they paid attention to the presented voices and faces. The instructions emphasized both speed and accuracy. Responses were given via button press of the index and middle finger of the right hand, using a fiber optic response box (LUMItouch; Photon Control). Response assignments to index and middle finger were counterbalanced across participants. Only key presses during stimulus presentation were considered valid responses. Stimulus presentation and response recording were accomplished with Presentation software (Neurobehavioral Systems, Inc., Albany, California).
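
To make the trial bookkeeping explicit, the following minimal Python sketch enumerates the block structure described above and verifies the resulting trial count. It is not the original Presentation script; condition labels, identity codes, and the random seed are illustrative placeholders.

```python
import itertools
import random

# Illustrative sketch of the block design (placeholder labels, not the original script):
# 2 modalities x 5 expressions = 10 conditions, each block holding one trial per identity.
MODALITIES = ["face", "voice"]
EXPRESSIONS = ["angry_high", "angry_low", "neutral", "happy_low", "happy_high"]
IDENTITIES = [f"id_{i:02d}" for i in range(1, 11)]  # 5 female, 5 male
N_RUNS = 2
BLOCK_REPETITIONS = 2  # each condition block shown twice per run

conditions = list(itertools.product(MODALITIES, EXPRESSIONS))  # 10 conditions

def build_run(rng: random.Random):
    """Return a list of blocks; each block is a list of (modality, expression, identity) trials."""
    blocks = conditions * BLOCK_REPETITIONS            # 20 blocks per run
    rng.shuffle(blocks)                                # block order varies across runs/participants
    run = []
    for modality, expression in blocks:
        trial_order = IDENTITIES.copy()
        rng.shuffle(trial_order)                       # identity order randomized within block
        run.append([(modality, expression, ident) for ident in trial_order])
    return run

rng = random.Random(0)
runs = [build_run(rng) for _ in range(N_RUNS)]
n_trials = sum(len(block) for run in runs for block in run)
print(n_trials)  # 5 expressions x 2 modalities x 10 identities x 2 repetitions x 2 runs = 400
```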

Figure 1

Schematic illustration of the procedure. Each condition was presented in one block consisting of ten trials. Visual stimuli were presented for 658 ms, while acoustic stimuli were presented for 658 ms on average, with a stimulus onset asynchrony of 2000 ms. During auditory stimulation, a blank screen was presented. Each block was presented twice, resulting in 20 blocks per run. The sequence of blocks was counterbalanced between runs and across participants. Participants performed a gender judgment task in order to ensure that they paid attention to the presented voices and faces.

Behavioral data recording and analysis

Accuracy and reaction times were analyzed with within-subject repeated-measures analyses of variance (ANOVAs) with the factors Modality (face vs. voice) and Expression (angry high, angry low, neutral, happy low, and happy high) using IBM SPSS 22 (SPSS Inc., Chicago, Illinois). Greenhouse-Geisser and Bonferroni corrections were applied where appropriate. Results were regarded as statistically significant at p < 0.05.
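
For readers who prefer a scriptable alternative to SPSS, the sketch below shows how an equivalent 2 × 5 repeated-measures ANOVA with Bonferroni-corrected post hoc comparisons could be run in Python with the pingouin package. The input file and column names are hypothetical; this is not the analysis pipeline actually used.

```python
import pandas as pd
import pingouin as pg

# Hypothetical long-format table: one row per participant x condition cell,
# with mean accuracy and reaction time (file and column names are illustrative).
df = pd.read_csv("behavioral_condition_means.csv")
# expected columns: subject, modality ('face'/'voice'),
#   expression ('angry_high', 'angry_low', 'neutral', 'happy_low', 'happy_high'),
#   accuracy, rt

# 2 (Modality) x 5 (Expression) repeated-measures ANOVA on reaction times.
aov = pg.rm_anova(data=df, dv="rt", within=["modality", "expression"],
                  subject="subject", detailed=True)
print(aov)

# Bonferroni-corrected pairwise comparisons (pg.pairwise_ttests in older pingouin versions).
posthoc = pg.pairwise_tests(data=df, dv="rt", within=["modality", "expression"],
                            subject="subject", padjust="bonf")
print(posthoc)
```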

FMRI data acquisition and analysis

Scanning was performed in a 1.5-Tesla magnetic resonance scanner (Magnetom Vision Plus; Siemens Medical Systems). Following the acquisition of a T1-weighted anatomical scan, two runs of 245 volumes were obtained for each participant using T2*-weighted echo-planar images (TE = 50 ms, flip angle = 90°, matrix = 512 × 512, field of view = 200 mm, TR = 2973 ms). Each volume comprised 30 axial slices (thickness = 3 mm, gap = 1 mm, in-plane resolution = 3 × 3 mm). The slices were acquired parallel to the line between anterior and posterior commissure with a tilted orientation to reduce susceptibility artifacts in inferior parts of the anterior brain55. Before imaging, a shimming procedure was performed to improve field homogeneity. The first four volumes of each run were discarded from analysis to ensure steady-state tissue magnetization.

Preprocessing and analyses were performed using BrainVoyager QX (Brain Innovation, Maastricht, the Netherlands). Volumes were realigned to the first volume to minimize effects of head movement. Further preprocessing comprised spatial (8 mm full-width at half-maximum isotropic Gaussian kernel) and temporal (high-pass filter: three cycles per run, linear trend removal) filtering. The anatomical and functional images were co-registered and normalized to Talairach space. The expected BOLD signal change for each predictor was modeled with a canonical double-gamma haemodynamic response function. The GLM was calculated with the factors Modality (face and voice) and Expression (angry high, angry low, neutral, happy low, and happy high) as predictors of interest.
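
As an illustration of the predictor construction, the following sketch builds a block predictor by convolving a boxcar with a canonical double-gamma HRF. The HRF parameters are SPM-like defaults assumed for illustration; BrainVoyager's exact implementation may differ, and the block onset in the example is arbitrary.

```python
import numpy as np
from scipy.stats import gamma

TR = 2.973          # repetition time in seconds (see acquisition parameters)
N_VOLUMES = 241     # volumes per run after discarding the first four

def double_gamma_hrf(dt=0.1, peak=6.0, undershoot=16.0, ratio=6.0, length=32.0):
    """Canonical double-gamma HRF sampled at resolution dt (SPM-like defaults; assumption)."""
    t = np.arange(0, length, dt)
    hrf = gamma.pdf(t, peak) - gamma.pdf(t, undershoot) / ratio
    return hrf / hrf.sum()

def block_predictor(onsets_s, duration_s, dt=0.1):
    """Boxcar for one condition (onsets/duration in seconds), convolved with the HRF
    and resampled to the TR grid."""
    n_fine = int(N_VOLUMES * TR / dt)
    boxcar = np.zeros(n_fine)
    for onset in onsets_s:
        boxcar[int(onset / dt):int((onset + duration_s) / dt)] = 1.0
    convolved = np.convolve(boxcar, double_gamma_hrf(dt))[:n_fine]
    scan_times = np.arange(N_VOLUMES) * TR
    fine_times = np.arange(n_fine) * dt
    return np.interp(scan_times, fine_times, convolved)

# Example: one 20-s block (10 trials at a 2-s stimulus onset asynchrony) of a single
# condition, starting at an illustrative onset of 18 s.
predictor = block_predictor(onsets_s=[18.0], duration_s=20.0)
```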

Valence and arousal effects were investigated using a parametric approach involving balanced contrast weights derived from the normative valence and arousal ratings reported in Table 1. The analysis comprised two main contrasts (valence and arousal) and their interactions with modality. For the first main contrast, ‘arousal’, the arousal rating data for faces and voices were used as contrast weights, describing a U-shaped function with higher values for high-intensity than for low-intensity expressions and neutral expressions at the lowest point of the U-shape. Contrast weights were zero-centered. The second main contrast modeled valence effects by using the normative valence ratings for faces and voices (see Table 1). This contrast modeled a linear function across expression predictors, with positive values for positive valence. The two interaction contrasts of modality (visual vs. auditory) with stimulus arousal and valence, respectively, were modeled using inverted contrast weights for voices. Interactions of arousal and valence were investigated with the mean-centered product of the mean-centered valence and arousal ratings. This parametric approach was chosen since rating data reflecting stimulus valence/arousal were regarded as the most accurate predictors of the expected effects on amygdalar responses. Since contrast weights modeled brain activation separately for both modalities, we also controlled for potential differences across modality conditions.
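
A minimal sketch of how such contrast vectors can be assembled is given below. The rating values are placeholders, not the values from Table 1; only the construction logic (zero-centering, inverted weights for voices, mean-centered product for the arousal × valence interaction) mirrors the description above.

```python
import numpy as np

# Placeholder normative ratings per condition (the real values are in Table 1).
# Order of expression predictors: angry_high, angry_low, neutral, happy_low, happy_high.
arousal = {"face":  np.array([6.5, 4.8, 2.6, 4.5, 6.3]),   # U-shaped over expressions
           "voice": np.array([6.2, 4.6, 2.9, 4.4, 6.0])}
valence = {"face":  np.array([2.2, 3.4, 5.0, 6.6, 7.8]),   # roughly linear, higher = more pleasant
           "voice": np.array([2.5, 3.6, 5.1, 6.4, 7.5])}

def centered(weights):
    return weights - weights.mean()

# Main contrasts: zero-centered rating weights, concatenated over [face, voice] predictors.
arousal_main = np.concatenate([centered(arousal["face"]), centered(arousal["voice"])])
valence_main = np.concatenate([centered(valence["face"]), centered(valence["voice"])])

# Arousal x modality / valence x modality interactions: weights inverted for voice predictors.
arousal_x_modality = np.concatenate([centered(arousal["face"]), -centered(arousal["voice"])])
valence_x_modality = np.concatenate([centered(valence["face"]), -centered(valence["voice"])])

# Arousal x valence interaction: mean-centered product of the mean-centered ratings.
product = np.concatenate([centered(arousal[m]) * centered(valence[m]) for m in ("face", "voice")])
arousal_x_valence = centered(product)
```

In this scheme, a positive loading on arousal_main indicates activation that increases with normative arousal in both modalities, whereas arousal_x_modality captures differences in that arousal slope between faces and voices.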

Since the present study focuses on amygdalar response properties, data analysis was conducted as a region-of-interest (ROI) analysis for the amygdala. Additionally, to make the study more comprehensive, a whole-brain analysis was performed without a priori defined ROIs. The amygdala ROI was defined according to probabilistic cytoarchitectonic maps56,57 and contained the superficial, basolateral, and centromedial groups as subregions58. Anatomical maps were created using the Anatomy Toolbox in Matlab (MATLAB 2014, The MathWorks, Inc., Natick, Massachusetts, USA) and transformed into Talairach space using ICBM2TAL59,60. Significant clusters were identified through cluster-based permutation (CBP) testing with 1000 permutations. The non-parametric CBP framework was chosen in order to obtain precise false discovery rates without assumptions regarding the distribution of the test statistic61. The voxel-level threshold was set to p < 0.005. For each permutation, the individual beta maps representing activation in the single experimental conditions were randomly reassigned without replacement to the tested experimental conditions. For example, to test the parametric arousal effect, the five beta maps corresponding to the five expressions were randomly assigned to these five conditions, separately for each subject. This approach rests on the null hypothesis that activation is equal across the five expressions within a given subject. Cluster mass was assessed by summing all t-values in neighboring significant voxels, where voxels were defined as neighbors if they share a face (i.e., each voxel has six neighbors). Cluster masses larger than the 95th percentile of the permutation distribution were considered statistically significant.
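
The following condensed Python/SciPy sketch illustrates the cluster-mass permutation logic described above (voxel-level thresholding, face-connectivity clustering, within-subject shuffling of condition labels, 95th-percentile cutoff). It is a simplified re-implementation for illustration, not the BrainVoyager-based analysis code; array shapes and function names are assumptions.

```python
import numpy as np
from scipy import ndimage, stats

def cluster_mass(t_map, t_thresh):
    """Sum of t-values within each suprathreshold cluster (6-connectivity: shared faces)."""
    structure = ndimage.generate_binary_structure(3, 1)   # face neighbors only
    labels, n = ndimage.label(t_map > t_thresh, structure=structure)
    return [t_map[labels == i].sum() for i in range(1, n + 1)], labels

def permutation_test(betas, weights, n_perm=1000, alpha_voxel=0.005, seed=0):
    """betas: (n_subjects, n_conditions, x, y, z) beta maps within the ROI;
    weights: parametric contrast weights over conditions (e.g., zero-centered arousal ratings)."""
    rng = np.random.default_rng(seed)
    n_sub, n_cond = betas.shape[:2]
    t_crit = stats.t.ppf(1 - alpha_voxel, df=n_sub - 1)

    def contrast_t(b):
        scores = np.tensordot(weights, b, axes=([0], [1]))        # per-subject contrast maps
        return scores.mean(0) / (scores.std(0, ddof=1) / np.sqrt(n_sub))

    observed, _ = cluster_mass(contrast_t(betas), t_crit)

    null_max = np.empty(n_perm)
    for p in range(n_perm):
        # Reassign each subject's condition beta maps without replacement (label shuffling).
        shuffled = np.stack([betas[s, rng.permutation(n_cond)] for s in range(n_sub)])
        masses, _ = cluster_mass(contrast_t(shuffled), t_crit)
        null_max[p] = max(masses, default=0.0)

    threshold = np.percentile(null_max, 95)                        # cluster-mass cutoff
    return [(mass, mass > threshold) for mass in observed]
```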

Results

Behavioral results

Accuracy

Results revealed a significant main effect of Expression (F (4, 72) = 3.84, p = 0.007, partial η² = 0.18), which was further qualified by a significant two-way interaction between Modality and Expression (F (4, 72) = 5.37, p = 0.001, partial η² = 0.23). The main effect of Expression was significant for both voices (F (4, 76) = 5.45, p = 0.001, partial η² = 0.22) and faces (F (4, 76) = 3.35, p = 0.014, partial η² = 0.15). Bonferroni-corrected post hoc t-tests revealed higher accuracy for angry high as compared to happy low expressions (p ≤ 0.001) in the visual domain, and higher accuracy for happy low as compared to angry low and neutral expressions (p’s ≤ 0.004) in the auditory domain (see Table 2). No further contrast reached the Bonferroni-corrected level of significance (all p’s > 0.05). There was no significant main effect of Modality (p = 0.299).

Table 2 Mean accuracy in percent and response times (RTs) in milliseconds for each experimental condition.

Response times

Results revealed main effects of Modality (F (1, 18) = 114.91, p < 0.001, partial η² = 0.87) and Expression (F (4, 72) = 10.80, p < 0.001, partial η² = 0.38, corrected), which were further qualified by a significant interaction between the two factors (F (4, 72) = 8.48, p < 0.001, partial η² = 0.32). The main effect of Expression was significant for voices (F (4, 76) = 12.92, p = 0.001, partial η² = 0.41), but not for faces (F (4, 76) = 1.91, p = 0.118, partial η² = 0.09). Within the auditory domain, Bonferroni-corrected post hoc t-tests revealed shorter response times for happy low as compared to angry low, angry high, and neutral expressions, and for happy high as compared to angry high expressions (all p’s < 0.05).

FMRI results

ROI analysis

For the arousal contrast vector, a significant activation cluster within the right amygdala was revealed, showing responses as a function of stimulus arousal (peak voxel coordinates: x = 25, y = −4, z = −10; tmax = 3.39, cluster mass = 18.68, p < 0.001, CBP-corrected, cluster size = 6 voxels or 162 mm3, see Fig. 2). Importantly, there was no significant interaction between stimulus arousal and modality (p > 0.05). Furthermore, there were no significant clusters for the main contrast of valence or for its interaction with stimulus modality (all p’s > 0.05).

Figure 2

Enhanced activation in the right amygdala (x = 25, y = −4, z = −10) as a function of stimulus arousal for visual and auditory stimuli (CBP-corrected statistical map, initial voxel-level threshold p = 0.005). Bar plots show parameter estimates for visual (left side) and auditory (right side) stimuli. Parameter estimates refer to the mean cluster value; error bars indicate standard errors.

In order to additionally analyze whether there was an overall interaction between stimulus valence and stimulus arousal independent of modality, we used the mean-centered product of the mean-centered valence and arousal ratings as a contrast vector. No voxel reached the initial voxel-level threshold. Finally, we also investigated potentially bimodal responses to valence62 by comparing all negative stimuli with all other stimuli and all positive stimuli with all other stimuli. Again, no voxel survived the voxel-level threshold.

Whole brain analysis

Several brain regions responded as a function of stimulus arousal, most importantly the mid superior temporal sulcus (STS, including the transverse temporal gyrus), postcentral gyrus, posterior occipital cortex, insula, cingulate gyrus, and parts of the lateral frontal cortex (see Table 3 for a complete listing and Fig. 3 for main clusters).

Table 3 Significant activations modelled by the parametric arousal effect irrespective of visual and auditory modalities.
Figure 3

Significant activation cluster in the posterior superior temporal sulcus (x = 48, y = −53, z = 18) as revealed by the arousal contrast weights, and significant activation clusters in the mid superior temporal sulcus (x = 54, y = −16, z = 6) and fusiform gyrus (x = −39, y = −40, z = −8) as revealed by the arousal × modality interaction contrast. Bar plots represent parameter estimates for arousal-driven effects in SMG, mSTS, and FG. Parameter estimates refer to peak voxels; error bars indicate standard errors.

Clusters in the mid STS (x = 54, y = −16, z = 6) reflected modulation by vocal expression, while effects in the fusiform gyrus (x = −39, y = −40, z = −8) reflected modulation by facial expression (see Fig. 3). Congruently, significant arousal × modality interactions were observed for these and several other brain regions, including the supramarginal gyrus and anterior cingulate cortex, indicating preferred responses to either voices or faces (see Table 4 for a complete listing).

Table 4 Significant activations modelled by the parametric interaction of arousal and modality.

With regard to stimulus valence, significant clusters were mainly revealed in multi- and supramodal regions (e.g., insula, posterior STS, supramarginal gyrus, middle frontal gyrus), in visual areas (e.g., fusiform gyrus), and in somatosensory areas (e.g., postcentral gyrus; see Table 5 for a complete listing of brain regions and Fig. 3 for main clusters). There were several significant valence × modality interactions, which reflected a dominance of visually driven valence effects (see Table 6 for a complete listing).

Table 5 Significant activations modelled by the parametric valence effect irrespective of visual and auditory modalities.
Table 6 Significant activations modelled by the parametric interaction of valence and modality.

Discussion

The present study investigated whether amygdalar responses to affective vocal and facial expressions reflect modulation by stimulus valence and/or stimulus arousal. Furthermore, it was of interest whether potential modulation of the amygdala by valence and/or arousal would rely on analogous mechanisms for vocal and facial stimuli. To examine these questions, we used voices and faces of varying emotional intensity across stimulus valence categories. BOLD responses were modeled based on normative rating data on stimulus valence and arousal. Our results revealed amygdalar responses as a function of stimulus arousal and emotional intensity, crucially, irrespective of stimulus valence. In addition, arousal-driven effects in the amygdala were independent of the modality of incoming emotional information, reflecting common response patterns across the visual and auditory domains.

The proneness of the amygdala to respond to negative, threatening stimuli has been controversially debated12,13. Although enhanced amygdalar activation to negative, threat-related stimuli has frequently been observed17,18,20,23,48, only a few studies provide convincing evidence in favor of valence-driven amygdalar responding (but see e.g., Kim et al.19). On the other hand, there is strong empirical support for the notion that positive, negative, and ambiguous stimuli can elicit amygdalar responding, indicating that the amygdala shows general responsiveness to any salient emotional information1,12,30 and to stimuli related to personal goals2,25,26,27. The present study adds to this observation, indicating that amygdalar responses might code general stimulus relevance irrespective of stimulus valence and threat relation.

There is also accumulating evidence that emotional intensity impacts amygdalar responding for several categories of emotional stimuli (e.g., scenes34,63,64 and odors65,66). In line with these studies, we found a significant positive relationship between amygdalar activation and stimulus arousal, and thus also between amygdalar activation and the emotional intensity of the presented expressions. Regarding facial expressions, several other studies found effects of emotional intensity on amygdalar responding5,29,35, which, however, varied in direction. Interestingly, Gerber and colleagues35 observed inverse intensity effects, that is, enhanced amygdalar responding for weak, possibly ambiguous expressions. It is possible that the amygdala is sensitive to both stimulus intensity (signaling a need for prioritized processing) and stimulus ambiguity (signaling a need for gathering more sensory information), resulting in combined intensity and ambiguity effects29.

Even though many studies have investigated whether amygdalar responses to vocal and facial expressions reflect modulation by stimulus valence or stimulus arousal, findings have been inconsistent so far11,12,13,14. Unfortunately, the majority of affective face and voice processing studies neither provide orthogonal manipulations of the two factors nor include rating data on stimulus valence and arousal (but see e.g., Kim et al.19; Lin et al.5, for exceptions). In contrast to previous research, the present study provided highly arousing negative and positive expressions and systematically varied stimulus arousal and emotional intensity across emotional valence categories. Furthermore, the statistical models were directly derived from rating data on stimulus valence and arousal. Thus, our findings provide strong evidence that amygdalar responses to vocal and facial expressions reflect effects of emotional intensity and associated stimulus arousal and do not depend solely on stimulus valence.

Importantly, the present study also investigated whether amygdalar responses to stimulus arousal and expression intensity depend on the visual or auditory modality of incoming information. The results provide evidence that the amygdala responds in an analogous fashion to social signals from the visual and auditory modalities. These results are in line with earlier findings by Aubé and colleagues49, which suggest that the amygdala processes emotional information from different modalities in an analogous fashion. Our findings are also partly in line with those of Phillips and colleagues36, who found analogous amygdalar responses to fearful voices and faces (for disgusted expressions, however, amygdalar enhancements were only observed in response to facial expressions). Interestingly, recent reviews have proposed asymmetries in affective voice and face processing15,16. It is still uncertain, however, whether these asymmetries reflect a minor relevance of subcortical structures in affective voice processing (as suggested by the authors) or methodological differences between the two research fields (e.g., less arousing vocal stimuli, smaller sample sizes, less sensitive statistical approaches in auditory studies). The present study manipulated stimulus modality as a within-subject factor and provided stimuli of comparable emotional properties across modalities. Controlling for such methodological differences, we found parallel amygdalar response patterns for emotionally salient voices and faces. Thus, our results indicate that the amygdala responds in a domain-general fashion to emotional signals across the visual and auditory domains, with no modality-specific idiosyncrasies.

Besides the amygdala, our results provide evidence for domain-general, arousal-driven effects in several multimodal brain regions, including the posterior STS, possibly indicating that these regions play an important role in the processing of stimulus arousal across the visual and auditory modalities. A recent study by Lin and colleagues (2016)5 showed that stimulus arousal strongly impacts activation of the posterior STS in response to facial expressions. Several researchers have proposed that the posterior STS is involved in the representation of facial information, particularly the representation of emotional expressions67,68, and have demonstrated coupling with other face processing areas such as the fusiform gyrus69,70. Moreover, parts of the STS have been suggested to be the vocal analogue of the fusiform face area9,71,72, representing vocal features of varying complexity depending on their emotional significance8,9,71,73. In addition, the posterior STS and supramarginal gyrus have been reported to be involved in the integration of audio-visual information and to respond to multiple types of social signals74,75. The results of the present study extend the findings of Lin and colleagues5 and indicate arousal-driven modulation of the posterior STS by both facial and vocal expressions.

In addition, modality-specific arousal effects were observed in unimodal primary and secondary cortices, such as the lateral occipital cortex and the mid STS (mSTS), which showed enhanced activation in response to highly arousing faces and voices, respectively. Modality-specific valence effects were also observed in some regions (see Table 5); these were primarily driven by visual stimulation and reflected stronger activation to angry as compared to happy expressions. It is possible that these advantages for the visual domain reflect a higher degree of specialization for representations of visual stimuli, in line with the dominance of visual representations in human perception. In most cases, modulation by stimulus valence did not reflect valence effects in isolation, but mixed effects of stimulus valence and stimulus arousal, indicating limited empirical support for the valence model (see also Lindquist et al.12 for a recent meta-analysis on the plausibility of valence-driven brain responses).

There are several limitations of the present study. Since the fMRI results were based on a 1.5 Tesla scanner, future work should investigate these issues with 3 or even 7 Tesla scanners, which offer potentially increased sensitivity for more nuanced effects76,77,78. We do not suggest that the amygdala might not also code valence; however, the resolution of most fMRI studies makes it difficult to investigate this question in sufficient detail. Single-unit studies also provide evidence for highly overlapping units with valence and arousal responses79. Future high-resolution studies are needed to investigate potentially spatially distinct responses in small voxels due to valence, arousal, but also modality and other factors in more detail. Furthermore, the fact that the utilized auditory stimuli have no emotional meaning beyond prosody might be regarded as detrimental to the comparative validity of the employed stimuli. Importantly, several studies demonstrate that it is prosody rather than meaning that causes an emotional reaction80,81,82,83. In addition, it should be noted that both stimulus categories provide affective and, to a large extent, non-affective information such as basic visual/auditory features related to gender, age, and identity. Considering these aspects, we regard the parallelism between the employed voices and faces as relatively far-reaching15,16. The present study used one specific negative emotion (i.e., anger) and a specific class of socially relevant stimuli. Thus, in order to ensure the generalizability of our findings to other types of negative expressions and emotional stimuli, the inclusion of a broader range of expressions30 and further emotional stimuli (e.g., biological emotional stimuli84) would be highly desirable. Finally, the present study found a valence-independent and modality-independent effect of arousal on amygdalar responding using an implicit emotion task (i.e., a gender judgment task). However, explicit emotion tasks (e.g., emotion discrimination tasks) are often used in studies on emotion processing, and several studies have employed both explicit and implicit tasks to investigate the effect of task on the processing of emotional facial and vocal expressions81,85,86. Future studies might use both explicit and implicit tasks to investigate whether these tasks differentially affect arousal- and valence-dependent amygdalar activations.

Conclusion

Based on normative rating data on stimulus valence and arousal, the present fMRI study suggests enhanced amygdalar activation as a function of stimulus arousal, which does not depend on stimulus valence. Furthermore, the present findings support the hypothesis of the amygdala as a common neural substrate in affective voice and face processing, evaluating emotional relevance irrespective of the visual and auditory modalities. Finally, whole-brain data provided evidence for modality-specific representations of emotional expressions in auditory and visual cortices, which again mainly reflected the impact of emotional intensity and associated stimulus arousal. Future high-resolution studies, however, should further investigate potentially overlapping and distinct activations in the amygdala depending on arousal, valence, stimulus modality, and specific task contexts.