Abstract
Vagus nerve stimulation (VNS) is widely used to treat drug-resistant epilepsy and depression. While the precise mechanisms mediating its long-term therapeutic effects are not fully resolved, they likely involve locus coeruleus (LC) stimulation via the nucleus of the solitary tract, which receives afferent vagal inputs. In rats, VNS elevates LC firing and forebrain noradrenaline levels, whereas LC lesions suppress VNS therapeutic efficacy. Noninvasive transcutaneous VNS (tVNS) uses electrical stimulation that targets the auricular branch of the vagus nerve at the cymba conchae of the ear. However, the extent to which tVNS mimics VNS remains unclear. Here, we investigated the short-term effects of tVNS in healthy human male volunteers (n = 24), using high-density EEG and pupillometry during visual fixation at rest. We compared short (3.4 s) trials of tVNS to sham electrical stimulation at the earlobe (far from the vagus nerve branch) to control for somatosensory stimulation. Although tVNS and sham stimulation did not differ in subjective intensity ratings, tVNS led to robust pupil dilation (peaking 4–5 s after trial onset) that was significantly higher than following sham stimulation. We further quantified, using parallel factor analysis, how tVNS modulates idle occipital alpha (8–13Hz) activity identified in each participant. We found greater attenuation of alpha oscillations by tVNS than by sham stimulation. This demonstrates that tVNS reliably induces pupillary and EEG markers of arousal beyond the effects of somatosensory stimulation, thus supporting the hypothesis that tVNS elevates noradrenaline and other arousal-promoting neuromodulatory signaling, and mimics invasive VNS.
SIGNIFICANCE STATEMENT Current noninvasive brain stimulation techniques are mostly confined to modulating cortical activity, as is typical with transcranial magnetic or transcranial direct/alternating current electrical stimulation. Transcutaneous vagus nerve stimulation (tVNS) has been proposed to stimulate subcortical arousal-promoting nuclei, though previous studies yielded inconsistent results. Here we show that short (3.4 s) tVNS pulses in naive healthy male volunteers induced transient pupil dilation and attenuation of occipital alpha oscillations. These markers of brain arousal are in line with the established effects of invasive VNS on locus coeruleus–noradrenaline signaling, and support that tVNS mimics VNS. Therefore, tVNS can be used as a tool for studying how endogenous subcortical neuromodulatory signaling affects human cognition, including perception, attention, memory, and decision-making; and also for developing novel clinical applications.
Introduction
Since 1988, vagus nerve stimulation (VNS) has been successfully used to reduce epileptic seizures in patients with drug-resistant epilepsy (Krahl and Clark, 2012), and has demonstrated clinical effectiveness for many patients treated with invasive VNS (Boon et al., 2018; Kwon et al., 2018). VNS is also applied as a treatment for drug resistant major depression (Nemeroff et al., 2006).
VNS modulates vagal afferent inputs to the brainstem nucleus tractus solitaris, which subsequently activate the locus coeruleus–noradrenaline (LC–NE) system. Indeed, in rats, VNS increases LC neuronal discharges (Takigawa and Mogenson, 1977; Groves et al., 2005; Hulsey et al., 2017) and elevates NE levels in the hippocampus and cortex (Dorr and Debonnel, 2006; Roosevelt et al., 2006). The effects of VNS on LC–NE are considered key to reducing seizures. This is because of the strong positive correlation observed of the noradrenergic and anticonvulsive effects of VNS (Raedt et al., 2011), and because of the elimination of the anticonvulsive effects by means of the chemical lesions of the LC (Krahl et al., 1998). VNS also modulates signaling in other neuromodulatory pathways such as the serotonergic, dopaminergic, and cholinergic systems (Dorr and Debonnel, 2006; Manta et al., 2009; Mridha et al., 2019). However, some of these effects are likely to be secondary (i.e., occur later and with mediation through the LC–NE system; Dorr and Debonnel, 2006).
In humans, invasive VNS induces markers of brain arousal that are consistent with LC–NE activity. This includes pupil dilation (Desbeaumes Jodoin et al., 2015), which is tightly linked with LC–NE activity (Joshi et al., 2016; Reimer et al., 2016; Gelbard-Sagiv et al., 2018; Hayat et al., 2020). VNS may also lead to EEG desynchronization, but the effects are subtler than in pupil dilation, at least regarding the clinical parameters that typically use long (30–60 s) stimulation epochs. Accordingly, early studies with <10 patients each, did not find VNS effects on spontaneous intracranial EEG (Hammond et al., 1992) or scalp EEG (Salinsky and Burchiel, 1993). In contrast, a more recent study with 19 participants, which analyzed separately VNS “responders” and “nonresponders,” observed EEG desynchronization in the alpha and delta bands (Bodin et al., 2015).
Noninvasive transcutaneous vagal nerve stimulation (tVNS) applies electrical current at a high frequency (typically, 25 Hz) through the left ear, targeting the auricular branch of the vagus nerve at the cymba conchae (Fig. 1; for anatomic evidence, see Van Bockstaele et al., 1999; Bermejo et al., 2017). tVNS has been shown to mimic the anticonvulsive and antidepressant effects of invasive VNS (Stefan et al., 2012; He et al., 2013; Hein et al., 2013; Bauer et al., 2016; Rong et al., 2016; Trevizol et al., 2016), and has demonstrated safety and tolerability (Redgrave et al., 2018). Beyond the clinical efficacy of tVNS, interest has grown regarding its use in healthy individuals for basic neuroscience research (Van Leusden et al., 2015). However, the literature is inconsistent as to the extent that tVNS mimics the effects of invasive VNS on EEG or pupil dilation; such evidence would suggest LC–NE involvement (Ventura-Bort et al., 2018; Keute et al., 2019; Warren et al., 2019). We suspect that the discrepancies stem from using long (e.g. 30 s) stimulation epochs as in clinical applications, and because of the indirect focus on the P300 component in which LC–NE activity is assumed to play a key role.
Here, we set out to examine whether short-term tVNS induces EEG and pupillary markers of arousal, as is in the context of VNS-induced activation. We used short (3.4 s) stimulation pulses during task-free rest conditions in healthy naive male volunteers (to avoid long-term changes associated with therapeutic effects; Follesa et al., 2007; Manta et al., 2013). We hypothesized that, if indeed tVNS increases LC and neuromodulatory activities, it should lead to pupil dilation, as has been observed across multiple species (Joshi et al., 2016; Reimer et al., 2016; Hayat et al., 2020). In addition, we hypothesized that tVNS would attenuate alpha oscillations that are anticorrelated with arousal during rest (Torsvall and Akerstedt, 1987; Drapeau and Carrier, 2004; Schomer et al., 2017) and that are attenuated by invasive VNS (Bodin et al., 2015). In line with these predictions, we found that tVNS induces pupil dilation and alpha desynchronization above and beyond the effects of sham (somatosensory) stimulation.
Materials and Methods
Participants
High-density (256-channel) EEG and pupillometry were recorded in 25 healthy young male adults (mean age, 28.08 ± 5.84 years; 2 were left handed). Written informed consent was obtained from each participant. The study was approved by the Medical Institutional Review Board (IRB) at the Tel Aviv Sourasky Medical Center. Females of child-bearing age were not included, per guidelines of the approved IRB. Participants reported being healthy and without a history of neuropsychiatric disorders; they indicated their dominant eye for pupillometry. One participant was excluded from the analysis because of excessive blinking, after which 24 participants remained (mean age, 28.3 ± 1.2 years). Data from an additional three participants were excluded from the EEG analysis because of a lack of alpha activity, after which 21 participants remained for the EEG analysis (mean age, 28.01 ± 1.3 years).
Experimental design
Main experiment
After the EEG setup (see below), participants performed a short “method of limits” procedure to select tVNS/sham stimulation intensities while sitting. This procedure systematically identifies the maximal comfortable stimulation levels for each individual, as in the studies by Kraus et al. (2013), Yakunina et al. (2017), and Ventura-Bort et al. (2018). We applied 5-s-long stimulation trials, starting at 0.1 mA, and increasing in each trial by 0.2 mA. After each trial, participants rated the subjective intensity on a scale of 0–9 (0 = no sensation; 3 = light tingling; 6 = strong tingling; 9 = painful). We continued increasing the current until either reaching a level rated as 9 or a maximal level of 5 mA. This procedure was conducted twice for each stimulation location (real tVNS at the cymba conchae vs sham stimulation at the ear lobe). The mean currents corresponding to a subjective rating of 8 (just below painful) were selected for each stimulation location separately. Thus, tVNS intensity was adjusted for each participant and location separately, as above the detection threshold and below the pain threshold, as is done in clinical settings (Ellrich, 2011). Participants were then instructed to position their heads in a chin-rest apparatus for adjusting and calibrating the eye tracker (see below). Subsequently, participants were instructed to fixate on a white cross on a background of a gray computer screen (positioned 80 cm from the participants' eyes; model 2311x, HP), throughout experimental “blocks” lasting 5 min. Each block included 11 trials of 3.4 s stimulation epochs (in each trial, tVNS intensity was ramped up gradually, to the level defined above), separated by interstimulus intervals of 26 s (±1 s jitter). We performed two blocks of either tVNS or sham, and then switched to position the stimulating electrode in the alternate location (the order was counterbalanced) to reach a total of eight blocks per session (Fig. 1C). Before changing the electrode location, participants answered questions regarding their subjective experience of stimulation (Table 1). Participants were free to rest between the blocks ad libitum. Data acquired during these “breaks” were used to characterize alpha activity in each individual during nonexperimental conditions (see below).
Pilot experiment
A similar experiment using the same device with a separate group of 29 male participants (mean age, 26.82 ± 1.1 years; 4 were left handed) used the default clinical stimulation mode (30 s ON, 30 s OFF) during fixation at rest, while recording high-density EEG (n = 15) and pupillometry (n = 29).
Transcutaneous vagus nerve stimulation
tVNS was delivered using NEMOS (Cerbomed), now tVNS technologies (Fig. 1B). In the tVNS condition, the electrodes were placed at the left cymba conchae, which is heavily innervated by the auricular branch of the vagus nerve (Peuker and Filler, 2002; Safi et al., 2016; Badran et al., 2018; Fig. 1A). In the sham condition, the electrodes were placed at the left earlobe (Figure 1A), which is not expected to induce brainstem or cortical activation (Kraus et al., 2007; Sellaro et al., 2015; Steenbergen et al., 2015), were placed at the left earlobe (Fig. 1A). Pulses (width, 200–300 µs) were delivered at a rate of 25 Hz (duty cycle, ∼7% ON time) for 3.4 s. This included ramping up of intensity (as set by the device) to a level experienced as just below painful, adjusted for each participant and condition separately (the method of limits procedure described above), as is often set clinically in patients (Vonck et al., 2014). To achieve 3.4 s stimulation trials, we controlled the NEMOS stimulation device using linear actuators (Actounix) that pressed the ON/OFF button automatically according to programmable times. These actuators were controlled by Arduino Mega (Arduino), directed by PsychoPy Python package (Peirce, 2007). Two additional measures verified good electrode contact throughout and consistent effectiveness of the stimulation, as follows: (1) the NEMOS device stops stimulation automatically whenever good physical contact with the participant's skin is disrupted; and (2) the experimenter verified in each participant the presence of a visually evident 25 Hz stimulation artifact in EEG electrodes close to the left ear.
Pupillometry
Data acquisition
Eye movements/gaze and pupil size were recorded monocularly from the dominant eye using an infrared video-oculographic system with a chin rest (Eyelink 1000 Plus, SR Research). Gaze and pupil data were sampled at 500 Hz, and positions were converted to degrees of visual angle based on a 9 point calibration performed at the beginning of the experiment (on middle gray background). The experiment was conducted in a room with constant ambient light.
Data analysis
Pupil data were low-pass filtered, using a 10 Hz fourth-order Butterworth filter with a zero-phase shift. Periods of blinks were detected using the manufacturer standard algorithms, with default settings. The remaining data analyses were performed using custom-made MATLAB scripts (MathWorks). Blinks were removed by linear interpolation of values measured 100 ms before and after each identified blink (de Gee et al., 2017). Peritrial data were segmented by extracting pupil data (–10 s to +13.4 s) around each stimulation trial. Trials in which interpolated data accounted for >50% of the data points were excluded (van Steenbergen and Band, 2013). After excluding one participant who had no trials remaining, the process yielded a mean of 42.12 ± 1.79 trials in the tVNS condition and 42.16 ± 1.79 in the sham condition (of 45 trials; range, 42–45 trials for both). To enable averaging across participants with different pupil sizes while avoiding arbitrary units, we converted pupil data to “percent change” values relative to a 10 s baseline before stimulation: [(x – baseline/baseline) * 100], as in the studies by Reimer et al. (2016) and Liu et al. (2017). Baseline pupil values did not differ significantly between the tVNS and sham conditions. In both conditions, the smaller the pupil before a specific trial, the higher the chance of observing significant pupil dilation (Spearman's correlation R = −0.27, p < 10–20; see Discussion).
The resulting pupil time courses were the mean values across trials for each participant and condition separately, as depicted in Figure 2A (for visualization only, single-participant traces were bandpass filtered again between 0.01 and 10 Hz; Fig. 2C). To present the individual participant data, we reduced the pupil data for each participant and condition to a scalar value (Fig. 2B) by averaging the time course across trials in the interval between the two points of full duration at half-maximum (FDHM; 3.2–10.4 s) following stimulation onset (Fig. 2A, dashed bar).
Gaze data and blink rate were also inspected and compared between conditions. Gaze was extracted, interpolated, and averaged using the same procedure described above for pupil size. Data points marked as blinks were summed across participants to produce a blink rate that was time locked to stimulation onset.
EEG
Data acquisition
High-density EEG was recorded continuously using a 256-channel hydrocel geodesic sensor net [Electrical Geodesics, Inc. (EGI)]. Each carbon-fiber electrode, consisting of a silver chloride carbon fiber pellet, a lead wire, and a gold-plated pin, was injected with conductive gel (Electro-Cap International). Signals were referenced to Cz, amplified via an AC-coupled high-input impedance amplifier with an antialiasing analog filter (NetAmps 300, EGI), and digitized at 1000 Hz. Electrode impedance in all sensors was verified to be <50 kΩ before starting the recording.
EEG data analysis
EEG preprocessing was performed in MATLAB using custom-written code and the FieldTrip toolbox (Oostenveld et al., 2011). First, we used a subsample of 192 electrodes placed directly on the skull (avoiding cheek electrodes with higher muscle artifacts). Continuous data from these electrodes were segmented to 33s epochs (–15 s to +18 s) around each stimulation onset. To enable effective visual inspection, data epochs were initially detrended linearly, notch filtered (at 50 Hz), and high-pass (>0.1 Hz) filtered using a second-order Butterworth filter to remove DC shifts. We then visually confirmed that all sham and tVNS trials showed a 25 Hz stimulation artifact around the left ear. Trials without the artifact were excluded (a mean of 14.75 ± 3.08% of trials were excluded). To focus on alpha oscillations, data were further bandpass filtered at two frequencies. The first filter was applied at 5–15 Hz using a third-order two-pass Butterworth filter, as in previous parallel factor analysis (PARAFAC) studies (Barzegaran et al., 2017). An additional notch filter at 25 Hz (stimulation frequency) was used, with harmonics up to 475 Hz, to remove any residual artifact stemming from stimulation and not removed by previous filters. Then, we removed the minimal number of channels or trials whose data crossed an absolute amplitude threshold of 100 µV in an automatic iterative process—that is, each 3 s epoch (x) in each channel had a Boolean value [max(abs(x)) > 100]. Subsequently, in each iteration, either a channel or a trial was excluded, such that a minimal number of channel × trial 3 s data epochs was discarded (the code is available at: https://github.com/sharomer/eeg_2d_minimal_rejection). This process removed large movement artifacts, but not all blink artifacts, which were separated later using the parallel factor analysis.
This preprocessing resulted in identifying a mean number of 18.76 ± 2.86% unacceptable channels per participant (of 192; data were interpolated using a linear distance weighted interpolation), and a mean number of 22 ± 3.42% unacceptable trials per participant (discarded from subsequent analyses). Only then were trials classified to the tVNS or sham condition to avoid any bias in preprocessing. The mean number of valid trials in the tVNS condition was 35.61 ± 1.09, and in the sham condition, 35.61 ± 1.09 (of 44; the number of trials did not differ significantly between the conditions). Next, data from each trial were transformed to the time–frequency domain using the fast Fourier transform, after multiplying by a moving Hamming window of 3 s. This yielded a frequency resolution of 0.33 Hz and a temporal resolution of 50 ms.
Parallel factor analysis
We first extracted data from “break” periods (between stimulation blocks) to identify each participant's alpha topography and frequency in an unbiased manner with respect to the study objectives. These data were segmented to 5 s epochs, with 1 s overlap with the preceding epochs and 1 s overlap with the subsequent epochs. The epochs were bandpass filtered (as described above for stimulation data) and reduced to 3 s trials (discarding the overlap) to avoid filtering artifacts at the edges. Then, break data epochs were cleaned as described for stimulation data, using the same procedure described above (resulting in 117.84 ± 7.15 trials, on average, per participant, with 175.28 ± 2.71 clean channels on average). These 3 s time–frequency epochs were used to identify each participant's alpha topography and precise frequency range using the PARAFAC analysis (Harshman, 1970), as implemented in the N-way toolbox (Andersson and Bro, 2000) and as presented in Figure 3. The type of constraint for each dimension was set to non-negativity. The proper number of components was determined by using the Core Consistency Diagnostic (CCD), in which the number of components is the highest when the minimal value of CCD is 55% and 90.60 ± 3.18% on average (Bro and Kiers, 2003).
Next, to assess the changes in alpha oscillations during stimulation, the individual weights for alpha component topography (Fig. 4A) and frequency (Fig. 4C) were derived from the break data, and multiplied by the spectrum of all channels, such that a single channel representing the weighted activity was achieved. We then subtracted the mean baseline activity in (−1 s to 0 s) relative to stimulation onset, for each participant and for each trial, and calculated the mean activity across participants (N = 21); this yielded the results depicted in Figure 4D.
To assess more carefully the brain activity following stimulation, beyond the a priori electrode and frequency band of interest, we conducted the following analyses. (1) We rigidly set the alpha topography (to investigate time–frequency changes in the entire spectrogram). To this end, we used the topography of interest derived from the PARAFAC decomposition of the break data (Fig. 4A), ignored the frequency of interest, plotted the entire spectrogram at 5–15 Hz in percentage change relative to the same baseline ((−1 s to 0 s) before stimulation onset; Fig. 4E,G,H), and used a cluster permutation test (see below). We also confirmed differences between tVNS and sham conditions by means of a post hoc direct comparison using Wilcoxon signed-rank tests (see Results). (2) Alternatively, we rigidly set the frequency band of interest (to investigate changes in all electrodes). To this end, we used the frequency-of-interest derived from the PARAFAC decomposition of the break data (Fig. 4C) and ignored the topography of interest. We plotted the entire topographical changes in voltage around times of stimulation, while subtracting the activity (−1 s to 0 s) before stimulation (Fig. 4F). We then performed a topographical cluster permutation test (Fig. 4F, yellow points). We also confirmed the difference between the tVNS and sham conditions using a post hoc direct comparison on the electrodes marked in blue in Figure 4A, using Wilcoxon signed-rank tests (see Results).
Statistical analyses
Unless stated otherwise, all statistical tests were conducted using a Wilcoxon signed-rank test (Wilcoxon, 1945). This included the significance of the pupil time course, which was corrected for multiple comparisons using false discovery rate (FDR) correction (Benjamini and Yekutieli, 2011). The significance for alpha attenuation in the EEG spectrogram was assessed using a cluster permutation test with the Monte Carlo method and a dependent-samples t statistic with 10,000 permutations, as implemented in the fieldtrip toolbox (Oostenveld et al., 2011). An α of 0.05 was considered significant after FDR correction for clusters (Benjamini and Yekutieli, 2011). In Figure 4, we plotted alpha attenuation at the individual participant level to facilitate, for the reader, the assessment of effect size across participants; these per subject values were also tested using the Wilcoxon signed-rank test (Fig. 4B). Data are expressed as the mean ± SEM throughout.
Data availability
The code and data is available upon a reasonable request.
Results
To investigate the short-term effects of tVNS in naive humans, we compared pupil dynamics and EEG alpha oscillations in healthy young male volunteers (n = 24) induced by multiple trials of short (3.4 s) electrical stimulation at the cymba conchae (tVNS) and at the earlobe (sham; Fig. 1). Stimulation was applied at a frequency of 25 Hz, with the intensity ramping up during the trial, to a maximal value selected per participant and location (see Materials and Methods).
First, we verified that the sham and tVNS conditions did not differ in any of the parameters of subjective averseness examined. Indeed, we did not find any significant differences between the tVNS and sham conditions in subjective reports such as pain and irritation (Table 1; p > 0.05 for all comparisons after FDR correction). Regarding objective current intensity, the mean values of the currents applied were 2.20 ± 0.24 mA in the tVNS condition and 2.79 ;± 0.27 mA in the sham condition. The higher current intensity in the sham condition was statistically significant (p = 0.0125, Wilcoxon signed-rank test). This is likely because of lower sensitivity at the earlobe, and “works against” our a priori hypothesis (larger effects in tVNS despite higher current intensity in sham condition; see Discussion). This finding shows that earlobe stimulation provides good somatosensory control, which distills the changes related specifically to tVNS.
Transcutaneous vagus nerve stimulation induces pupil dilation
tVNS led to robust pupil dilation that increased gradually (consistent with the ramping up of the stimulation intensity), reaching half-maximum at 2.53 s after stimulation onset, peaking at 4.25 s after stimulation onset, decreasing back to half-maximum at 8.17 s, and returning to baseline levels 10 s after stimulation. During peak pupil dilation, the mean pupil size (in pixels) was 4.05 ± 0.92% above baseline (Fig. 2A).
In contrast, sham stimulation led to only modest pupil dilation (mean, 1.67 ± 0.63%) and peaking around the same time. This dilation level was significantly weaker than following tVNS (p < 0.05 between 2.88 and 5.96 s, repeated Wilcoxon signed-rank test across all time points, and FDR correction for multiple comparisons; Fig. 2A, red bar). These results were largely consistent across individual participants (Fig. 2B) and evident in most (21 of 24) participants (Fig. 2C, examples).
Baseline pupil values did not differ significantly between the tVNS and sham conditions (p = 0.5). In both conditions, the smaller the pupil before a specific trial, the higher the stimulation-evoked response (Spearman's correlation R = −0.27, p < 10−20). We found no significant differences between the conditions, in blink rate or gaze position (p > 0.6 for all comparisons, using the same statistical procedure). In addition, the higher pupil dilation following on tVNS remained significant and robust across individuals, when the blink data were discarded (rather than interpolated). To verify that the effect of pupil dilation was not mediated by the difference in objective currents, we calculated the correlation between differences in pupil dilation (across the tVNS and sham conditions), and the difference in current (across the tVNS and sham conditions). This did not reveal a significant correlation (Spearman's correlation R = −0.12, p = 0.56).
In a pilot experiment that used 30 s ON/30 s OFF “clinical-like” stimulation, we observed only a modest trend for greater pupil dilation for tVNS than sham stimulation (p = 0.053, n = 23), and pupil size did not differ significantly between ON and OFF periods. Thus, short tVNS pulses lead to significantly greater pupil dilation than following sham stimulation. This indicates that tVNS promotes arousal above and beyond somatosensory stimulation at the ear.
Transcutaneous vagus nerve stimulation attenuates alpha oscillations
Alpha oscillations exhibit considerable interindividual variability in frequency and scalp topography (Haegens et al., 2014). To discern the effects of tVNS and sham stimulation on alpha activity, we first identified the frequency and topography of alpha oscillations in each participant separately, using PARAFAC analysis (Harshman, 1970). PARAFAC provides a unique solution for decomposing the EEG signal to three factors (time, frequency, channel; Fig. 3A) and may enhance sensitivity. This analytic technique was previously applied to electrophysiological recordings (Miwakeichi et al., 2004; Yanagawa et al., 2013; Meij et al., 2016), and specifically to assessing individual alpha oscillations (Barzegaran et al., 2017; Knyazeva et al., 2018; for detailed review of EEG applications, see Cong et al., 2015). We identified the regions and frequencies of interest for alpha oscillations in each participant separately, using unbiased break data between stimulation blocks (Fig. 3B). Figure 3, C and D, presents the result of this process in representative participants, and Figure 4A shows the median region and Figure 4C shows the frequency profile of alpha oscillations across all participants. PARAFAC successfully identified alpha activity (Fig. 3C,D, examples), capturing each individual's specific alpha frequency at ∼7–13 Hz, with the typical occipital topography.
After identifying alpha activity for each individual, we quantified the extent that this activity may be decreased by tVNS or sham stimulation in each participant separately. We found that tVNS-attenuated alpha activity (mean, 94.35 ± 2.2% of baseline) to a greater extent (p = 0.0027, Wilcoxon signed-rank test) than did sham stimulation, which was not associated with significant alpha attenuation (mean, 103.55 ± 2.4% of baseline). Baseline alpha activity was not significantly different between the conditions (p = 0.3 via the Wilcoxon signed-rank test). Greater alpha attenuation following tVNS was evident in most (19 of 21) participants (Fig. 4B).
We found a significant negative correlation between the differences in alpha attenuation (tVNS vs sham conditions) in each individual and the differences in applied current (tVNS vs sham conditions; Spearman's correlation R = −0.49, p = 0.02). Accordingly, participants with stronger sham stimulation current showed less difference in alpha attenuation. Along this line, we repeated the analysis for alpha attenuation while removing one-third of the participants with the highest difference in current between the conditions. This analysis revealed a difference in alpha attenuation that was even more significant for the remaining 14 participants, despite the fewer number of participants (p = 0.0001, Wilcoxon signed-rank test). The implication is that the alpha attenuation we observed following on tVNS constitutes a lower bound (an equivalent objective current intensity in the two conditions leads to a stronger difference in alpha attenuation between the tVNS and sham conditions, see also Discussion).
tVNS-induced alpha attenuation was not observed in our pilot experiment, which used 30 s ON/30 s OFF clinical-like stimulation (p > 0.05, n = 15). Neither experiment revealed a significant correlation between alpha attenuation and individual subjective (or objective) scores of stimulation intensities or a significant correlation between alpha attenuation and pupil dilation at the individual level (all p ≥ 0.1).
To complement the PARAFAC-based analysis and to better understand the precise time–frequency dynamics and topographical changes of alpha attenuation, we used the weighted alpha topography from the break data as a “weighted region of interest.” This reduced the data to two dimensions (time and frequency). Such an approach ignored the frequency of interest and inspected the induced power changes in the 5–15 Hz frequency range for the (weighted) occipital region derived from the PARAFAC decomposition (Fig. 4A). In line with the previous results, we found that tVNS significantly attenuated activity in the alpha band (8–12 Hz; Fig. 4G). Similarly, examining the effects of stimulation on EEG dynamics using cluster-based permutation (Maris and Oostenveld, 2007) revealed a significant (p = 0.0063; Fig. 4G,H, white contour) cluster at ∼8–12 Hz in the seconds following stimulation onset. During this time interval, the mean alpha power was 90.84 ± 2.77% in the tVNS condition, significantly lower than the mean 106.66 ± 2.70% observed in the sham condition (p < 0.0001, in a direct comparison). We also compared the two conditions using the classical alpha frequency range (8–12 Hz), during stimulation (0–4 s; mean: tVNS condition, 94.41 ± 2.15%; sham condition, 105.25 ± 2.41%; p = 0.0012, Wilcoxon signed-rank test for both).
Finally, we examined the extent to which the observed alpha attenuation was specific to occipital electrodes. We inspected the topographical changes in voltage around stimulation relative to baseline (Fig. 4F). This analysis was conducted by focusing on the a priori frequency band of interest derived from the PARAFAC decomposition (Fig. 4C), while ignoring the topography of interest derived from the break. We examined topographical effects of stimulation on EEG dynamics using topographical cluster-based permutation (Maris and Oostenveld, 2007). This revealed a significant (p < 0.05) cluster over occipital electrodes, which exhibited tVNS attenuation > sham attenuation (Figure 4F; yellow points). The implication is that alpha attenuation was specific to occipital areas (Fig. 4F, mean attenuation in yellow electrodes; during 0–4 s: −3.75 and −1.78 µV in the tVNS and sham conditions, respectively; p = 0.007 via the Wilcoxon signed-rank test). We also compared the two conditions directly using the occipital electrodes marked in Figure 4A, during 0–4 s (mean, −3.75 and −1.78 µV in the respective conditions; p = 0.007 via the Wilcoxon signed-rank test). Importantly, the regions showing tVNS-induced alpha attenuation overlapped electrodes showing alpha activity in the independent break intervals between stimulation blocks (compare Fig. 4A, blue dots, Fig. 4F, yellow dots).
Altogether, the EEG data establish that short tVNS pulses, but not sham stimulation, attenuate occipital alpha activity.
Discussion
We examined the effects of short tVNS pulses (and sham stimulation at the ear lobe) on pupil dynamics and EEG alpha activity in naive healthy men. While subjective stimulation intensities were not significantly different in the two conditions (Table 1), we found that short tVNS pulses induce pupil dilation (Fig. 2) and EEG alpha attenuation (Fig. 4) to a greater extent than does sham stimulation. These effects support the hypothesis that tVNS activates endogenous arousal-promoting neuromodulatory signaling such as LC–NE activity, as is known to occur in invasive VNS (Hulsey et al., 2017; Mridha et al., 2019). This suggests that tVNS mimics VNS.
Validity and limitations
Our results were obtained during fixation at rest. Although they may be applicable to other conditions, future studies are needed to determine the effects of short-pulse tVNS during other states, such as drowsiness and sleep, and during the performance of specific cognitive tasks. For example, high arousal at baseline could create a ceiling effect for pupil dilation and alpha attenuation; conversely, during decreased vigilance, EEG effects may attenuate idle activity at different frequency bands (e.g., changing the theta/alpha ratio during drowsiness, or suppressing slow-wave activity in sleep). Another limitation is that we could only study tVNS in male volunteers. Since there may be sex-specific differences in LC–NE and neuromodulatory activity (Bangasser et al., 2016), future studies with females are warranted.
Our experimental design equated the subjective intensity of tVNS and sham stimulation. Our data revealed that a significantly higher current at the earlobe (sham condition) was necessary to achieve equal subjective intensity. We did not find a significant correlation between differences in current (tVNS vs sham) and in pupil dilation (tVNS vs sham), but current differences were negatively correlated with differences in alpha attenuation (tVNS vs sham). Accordingly, among participants with a higher current intensity in the sham than the tVNS condition, the difference in alpha attenuation was smaller between the conditions. Indeed, restricting the analysis to a subset of 14 individuals, such that the significant difference in current intensity was eliminated, revealed a stronger effect of alpha attenuation in the tVNS versus sham condition. Thus, our results represent a conservative lower bound of the actual difference between alpha attenuation in tVNS and sham, which would be even greater in the context of comparable currents in tVNS and sham. More generally, this issue is relevant to an inherent limitation of using earlobe sham stimulation as a control condition. Despite its extensive use (Yap et al., 2020) and advantages, the earlobe, with its lower sensitivity, requires higher currents to produce a comparable subjective intensity. Future studies should apply additional control conditions (e.g., stimulation at other frequencies) to mitigate this limitation.
While our study used a single stimulation intensity per condition below that perceived as painful, as well as a single stimulation duration of 3.4 s, future studies could systematically compare several intensity levels and durations of both sham stimulation and tVNS to conduct a parametric investigation. With respect to intensity, stronger stimulation currents are naturally expected to produce stronger sensations and this should be accounted for (as is the case with other brain stimulation techniques; Woods et al., 2016), especially for therapeutic approaches. However, stronger stimulation may not necessarily produce stronger effects on neuronal activity and behavior, since some effects may actually elicit “U-shape” profiles (for example, see Clark et al., 1999). With respect to stimulation duration, future studies can reveal the minimal duration necessary to induce robust effects on arousal indices, brain activity, and behavior.
Previous tVNS studies
Our finding that tVNS attenuates alpha oscillations is compatible with the findings of a number of studies (Bodin et al., 2015; Lewine et al., 2019), while earlier studies reported mixed results or did not detect EEG effects (Hammond et al., 1992; Salinsky and Burchiel, 1993). Our use of short tVNS pulses and long baseline periods likely contributed to our ability to reveal alpha attenuation and pupil dilation. In addition, the sensitive analysis that was afforded by the use of PARAFAC enabled identifying alpha effects in many, but not all, of the participants.
In contrast to our focus on ongoing EEG and pupillometry, most previous studies attempted to demonstrate the effectiveness of tVNS by focusing on the EEG P300 or on salivary α-amylase as readouts. The P300 is a positive deflection with maximal amplitude in electrodes placed over the centroparietal midline, 300–500 ms after stimulus onset. The amplitude of this deflection is modulated by the probability of stimulus appearance regardless of sensory modality (Desmedt et al., 1965; Sutton et al., 1965). The P300 has been hypothesized to be a marker of LC–NE activity (Nieuwenhuis et al., 2005). This is because LC neurons are likewise activated by infrequent stimuli, independent of sensory modality (Aston-Jones et al., 1991); deviant stimuli elicit greater pupil dilation than standard stimuli (Murphy et al., 2011). However, the P300 may not constitute a straightforward test of tVNS efficacy since the link between P300 and LC–NE activity is still debated (Nieuwenhuis et al., 2011), and the dopaminergic (Glover et al., 1988) and glutamatergic (Hall et al., 2015) systems could also substantially affect the P300. Ventura-Bort et al. (2018) demonstrated that tVNS amplifies the parietal component of the P300 effect (P3b), selectively, for easy targets in their task. However, this effect was modest and could not be replicated using weaker fixed currents (0.5 mA) and a simpler classical P300 task (Warren et al., 2019). Another study by Keute et al. (2019) focused on the difference in pupil dilation between deviant and standard stimuli, using a classical auditory oddball task. The use of a constant 3 mA tVNS in all participants did not reveal any effect of the stimulation on event-related or baseline pupil size. A possible explanation is that 30 s tVNS modulates tonic NE levels but does not affect phasic stimulus-evoked changes in NE that are associated with the P300. In agreement with this possibility, the use of clonidine (an α2-adrenergic receptor agonist that reduces NE signaling) provided similar mixed results (Pineda and Swick, 1992; Halliday et al., 1994; Pineda et al., 1997; Brown et al., 2015). Future studies that will use short tVNS pulses, as used here, may help to elucidate the effects on the P300.
Both Warren et al. (2019) and Ventura-Bort (2018) showed that tVNS increases levels of salivary α-amylase, which has served as a peripheral measure of sympathetic activity associated with LC–NE signaling (Rohleder and Nater, 2009). However, this measure has poor temporal resolution and can only reveal differences between time intervals before versus after stimulation blocks that last several minutes. This approach does not leverage the superior temporal resolution of tVNS compared with pharmacological manipulations of NE in humans; such manipulations are highly effective in studying the effects of slower NE dynamics (Gelbard-Sagiv et al., 2018). By contrast, the transient (within seconds) tVNS-mediated effects revealed here offer considerable advantages over the slow modulations elicited by NE drugs.
Pupil dilation and alpha attenuation as indices of arousal and LC–NE activity
Pupil diameter was suggested as a proxy for noradrenergic signaling since Aston-Jones and Cohen (2005) first provided an example of correlated dynamics in simultaneous pupil and LC single-unit activities in a monkey (for recent review, see Joshi and Gold, 2020). Since this initial report, the relation between pupil diameter and noradrenergic signaling has been established in monkeys (Varazzani et al., 2015; Joshi et al., 2016), rats (Liu et al., 2017; Hayat et al., 2020), and mice (Reimer et al., 2016; Breton-Provencher and Sur, 2019), as well as in human BOLD fMRI (Murphy et al., 2014). The tVNS-induced pupil dilation time course that we observed (Fig. 3) resembles pupil dynamics in response to LC electrical stimulation in monkeys (Joshi et al., 2016) and optogenetic stimulation in rats (Hayat et al., 2020). This supports the hypothesis that tVNS activates the LC, as has been established for invasive VNS.
Alpha oscillations are abundant during detachment from the sensory environment in wakefulness. These are considered an index of low arousal (Torsvall and Akerstedt, 1987; Drapeau and Carrier, 2004). Alpha oscillations are believed to represent an “idling” state of cortical activity (Steriade, 2001; Palva and Palva, 2007) that is expected to be anticorrelated with arousal-promoting activity, such as that of the LC–NE system. These oscillations bias sensory perception (Waschke et al., 2019). A recent study that used long 2 min tVNS on the neck also found that tVNS attenuates alpha and theta oscillation (Lewine et al., 2019).
LC–NE versus other neuromodulatory systems
While pupil dilation and EEG alpha attenuation are both compatible with noradrenergic signaling, LC–NE involvement is unlikely to be the only modulatory system involved, given the overlap and redundancy among neuromodulatory systems. Other elements such as the cholinergic system also contribute to brain arousal and are associated with both pupil dilation (Reimer et al., 2016) and EEG activation (Szerb, 1967). However, cholinergic activation alone is unlikely to drive the effects observed. This is because during rapid eye movement sleep, cholinergic activation occurs without LC–NE activity (Nir and Tononi, 2010); and the EEG is activated but pupils remain constricted (Siegel, 2005). Moreover, given that VNS robustly activates the LC, and no such relation has been reported for cholinergic nuclei, the most parsimonious interpretation is that the primary neuromodulatory effects of tVNS are noradrenergic, while cholinergic modulation (Mridha et al., 2019) is likely secondary. tVNS may engage additional subcortical neuromodulatory systems such as the dorsal raphe and the ventral tegmental area, as observed with tVNS-induced BOLD fMRI (Frangos et al., 2015). Thus, the possible relation of tVNS to other neuromodulatory systems beyond LC–NE is an important topic for further investigation.
tVNS as a novel tool for transient neuromodulation
Great interest has arisen in investigating the contribution of the LC–NE system to human cognition, including perception, learning and memory, aging and neurodegeneration, and decision making. In this context, tVNS entails important advantages over existing tools. While important advances have been made by relying on the correlation of LC–NE activity with pupil dynamics (de Gee et al., 2017), hidden factors (e.g., fluctuations in arousal and attention) could be at the basis of the observed correlations (Clewett et al., 2018; Dragone et al., 2018). Previous human studies also used causal perturbations, using NE drugs to study the effects on perception (Gelbard-Sagiv et al., 2018), memory (for review, see van Stegeren, 2008), and decision-making (Warren et al., 2017; Neuser et al., 2020). However, the systemic delivery of NE drugs is inherently limited to affecting tonic LC–NE activity and has poor temporal resolution, whereas tVNS has clear added value.
A number of studies reported a benefit of invasive VNS on memory (Clark et al., 1999; Jacobs et al., 2015; Hansen, 2017; Sun et al., 2017). However, the participants of those studies had severe epilepsy or depression. In addition, ongoing daily VNS induces complex long-term plastic changes that make interpretation difficult.
Because of limitations of the available techniques for studying cognition, the potential of tVNS has been recognized and is increasingly being realized (Van Leusden et al., 2015). However, to date, the evidence supporting the effectiveness of tVNS in mimicking invasive VNS is mixed. We applied short stimulation pulses and limited currents to a maximal value per participant. Accordingly, and by focusing on simple pupillary and ongoing EEG readouts, we showed that tVNS transiently elicits markers of brain arousal that are compatible with arousal-promoting neuromodulatory signaling such as NE/acetylcholine. This supports the hypothesis that tVNS mimics invasive VNS, thereby extending the experimental toolkit for nonpharmacological neuromodulation in humans with high temporal resolution. Therefore, tVNS can be used to further investigate the means by which transient neuromodulation contributes to human cognition.
Finally, tVNS should be conducted to further elucidate the processes mediating the clinical improvements elicited by VNS in epilepsy and depression. These include the role of arousal-promoting neuromodulatory signaling in improving mood in depressed patients (Grimonprez et al., 2015; Liu et al., 2016; Fang et al., 2017; Tu et al., 2018). In particular, tVNS-induced pupillary and EEG effects may help predict clinical efficacy of invasive VNS and thus facilitate triaging patients to receive either conservative therapy or surgical implantation of VNS stimulation devices.
Footnotes
This work was supported by Israel Science Foundation (ISF) Grant 51/11 (I-CORE cognitive sciences) and the Adelis Foundation (Y.N.); and by The Herczeg Institute on Aging, the TAU Global Research Fund, and the Naomi Foundation (O.S.). We thank Talma Hendler for continuing support at the Tel Aviv Sourasky Medical Center, Neoreet Bracha, Jonathan Nir and Noa Duckler for assistance with data acquisition, members of the Nir laboratory for discusssions and Leon Deouell and Amit Marmelstein for comments on earlier drafts of the manuscripts.
The authors declare no competing financial interests.
- Correspondence should be addressed to Yuval Nir at ynir{at}tauex.tau.ac.il