Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro
eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleResearch Article: New Research, Cognition and Behavior

Pupil Size Sensitivity to Listening Demand Depends on Motivational State

Frauke Kraus, Jonas Obleser and Björn Herrmann
eNeuro 21 November 2023, 10 (12) ENEURO.0288-23.2023; https://doi.org/10.1523/ENEURO.0288-23.2023
Frauke Kraus
1Department of Psychology, University of Lübeck, 23562 Lübeck, Germany
2Center of Brain, Behavior, and Metabolism, University of Lübeck, 23562 Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Frauke Kraus
Jonas Obleser
1Department of Psychology, University of Lübeck, 23562 Lübeck, Germany
2Center of Brain, Behavior, and Metabolism, University of Lübeck, 23562 Lübeck, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jonas Obleser
Björn Herrmann
3Rotman Research Institute, Baycrest Academy for Research and Education, Toronto M6A 2E1, Ontario, Canada
4Department of Psychology, University of Toronto, Toronto M5S 3G3, Ontario, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Björn Herrmann
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Abstract

Motivation plays a role when a listener needs to understand speech under acoustically demanding conditions. Previous work has demonstrated pupil-linked arousal being sensitive to both listening demands and motivational state during listening. It is less clear how motivational state affects the temporal evolution of the pupil size and its relation to subsequent behavior. We used an auditory gap detection task (N = 33) to study the joint impact of listening demand and motivational state on the pupil size response and examine its temporal evolution. Task difficulty and a listener’s motivational state were orthogonally manipulated through changes in gap duration and monetary reward prospect. We show that participants’ performance decreased with task difficulty, but that reward prospect enhanced performance under hard listening conditions. Pupil size increased with both increased task difficulty and higher reward prospect, and this reward prospect effect was largest under difficult listening conditions. Moreover, pupil size time courses differed between detected and missed gaps, suggesting that the pupil response indicates upcoming behavior. Larger pre-gap pupil size was further associated with faster response times on a trial-by-trial within-participant level. Our results reiterate the utility of pupil size as an objective and temporally sensitive measure in audiology. However, such assessments of cognitive resource recruitment need to consider the individual’s motivational state.

  • cognitive demand
  • gap detection
  • motivation intensity theory
  • pupil dilation
  • reward prospect

Significance Statement

The literature is inconclusive to what degree pupil size may be representing the interaction of listening task demand and an individual’s motivational state. Using an auditory gap detection task, we manipulated both the degree of listening demand and a person’s motivational state and investigated the temporal evolution of the pupil size response and its relation to behavior. We find that the pupil size represents the interaction of demand and motivational state. These results highlight the importance of considering a person’s motivational state when using pupil size as a clinical measure. Pupil size appears as a key tool when assessing the impact of motivational state on the recruiting of cognitive resources.

Introduction

Many listening situations in everyday life are characterized by speech that is masked by background sounds (e.g., music or other people talking). Communicating under such circumstances requires the allocation of cognitive resources, captured by the construct of listening effort (Eckert et al., 2016; Pichora-Fuller et al., 2016; Peelle, 2018; Herrmann and Johnsrude, 2020). Critically, the degree to which a person invests cognitive resources to achieve a goal increases with increasing situational demands as long as resources are available and the person is sufficiently motivated to achieve the goal (Brehm and Self, 1989; Richter et al., 2016). That is, a person may disengage and give up listening if the situational demands to achieve listening success exceed the person’s available resources (Brehm and Self, 1989; Pichora-Fuller et al., 2016; Richter et al., 2016; Herrmann and Johnsrude, 2020). A person may also disengage from listening, or not engage in the first place, if the reward associated with listening success is too low relative to the experienced or anticipated mental costs (Brehm and Self, 1989; Matthen, 2016; Pichora-Fuller et al., 2016; Richter et al., 2016; Herrmann and Johnsrude, 2020). Any scientifically and clinically useful measure of listening effort must therefore be sensitive to both cognitive load and motivation for it to be useful (Richter et al., 2016).

Listening effort can be measured through subjective self-reports and physiological measures (Pichora-Fuller et al., 2016; Carolan et al., 2022). Furthermore, behavioral outcomes such as changes in speech understanding, accuracy, and response time can be associated with listening effort (Piquado et al., 2012; Houben et al., 2013), but do not directly reflect listening effort (McGarrigle et al., 2014; Pichora-Fuller et al., 2016). Pupil size is often used as a physiological measure to index listening effort (Winn et al., 2018; Zekveld et al., 2018). Physiologically, a key driver of pupil size is noradrenergic neuromodulation emerging from the locus coeruleus (LC) in the brainstem (Joshi et al., 2016; Joshi and Gold, 2020). The LC in turn is sensitive to cognitive factors, such as attention (Vazey et al., 2018), and helps to optimize task performance and engagement (Aston-Jones and Cohen, 2005).

From a more audiological point of view, pupil-linked arousal is a relatively unspecific response to cognitive load during listening. For example, several speech manipulations have been shown to lead to increased pupil size: noise reduction of hearings aids being switched off (Ohlenforst et al., 2018); poor spectral resolution (Winn et al., 2015); lowered signal-to-noise ratio in speech (Zekveld et al., 2010; Koelewijn et al., 2012); syntactic complexity (Wendt et al., 2016); or semantic ambiguity (Kadem et al., 2020).

Investing cognitively under challenging listening conditions requires a listener to be motivated (Kahneman, 1973; Pichora-Fuller et al., 2016; Herrmann and Johnsrude, 2020). Motivation is often manipulated experimentally through social influences (evaluation threat: Carrillo-de-la-Peña and Cadaveira, 2000; Gilzenrat et al., 2010; Zekveld et al., 2019; perceived competence: DeWall et al., 2011; McAuley et al., 2012; Hodgetts et al., 2019) or monetary incentives (Bijleveld et al., 2009; Gilzenrat et al., 2010; Koelewijn et al., 2018, 2021; Mirkovic et al., 2019; Zhang et al., 2019; Carolan et al., 2021; Cole et al., 2022). Monetary reward prospect can improve speech perception (Bianco et al., 2021) and increase self-reported listening effort (Carolan et al., 2021). However, other work finds that monetary reward prospect does not affect pitch discrimination (Richter, 2016) or speech perception (Koelewijn et al., 2018, 2021).

A meta-analysis suggests that physiological measures of listening effort are more sensitive to monetary reward manipulations than subjective or behavioral measures (Carolan et al., 2022). Indeed, whereas some work shows that monetary reward prospect increases pupil size and performance (Van Slooten et al., 2018; Zhang et al., 2019), other work suggests that monetary reward prospect increases only physiologically measured listening effort but does not affect behavior (pupil size: Bijleveld et al., 2009; Koelewijn et al., 2018; cardiovascular reactivity: Richter, 2016). In addition, some research suggests that motivation may affect behavioral performance and physiological measures of listening effort, specifically under hard listening conditions, but less so under easy listening conditions (Bijleveld et al., 2009; Zhang et al., 2019), consistent with motivation–effort frameworks (Brehm and Self, 1989; Pichora-Fuller et al., 2016; Richter et al., 2016; Herrmann and Johnsrude, 2020). However, other research finds that motivation impacts listening effort measured through pupillometry under both easy and hard conditions (Koelewijn et al., 2018).

Critically, previous research has mainly relied on speech materials to investigate how motivation affects physiological measures of listening effort and behavior (Bijleveld et al., 2009; Koelewijn et al., 2018, 2021; Zekveld et al., 2019; Zhang et al., 2019, 2022). However, speech materials are inherently variable in content and temporal evolution, which makes it harder to pinpoint how motivation, listening effort, and behavioral performance interact during listening. In the present study, we therefore use an auditory gap-in-noise detection task for which the timing of listening demand and the degree of demand can be manipulated tightly (through changes in gap timing and duration).

We seek to investigate, first, how reward prospect and listening demand jointly impact pupil-linked arousal and behavioral outcomes. Second, we explore the temporal evolution of reward prospect on pupil size. Last, we aim to characterize to what extent pupil size predicts upcoming behavior.

Materials and Methods

Participants

We recruited 33 younger adults (age range, 18–31 years; mean, 24.4 years; SD, 3.78 years; 7 males and 24 females; all right handed) from the participant database of the Department of Psychology at the University of Lübeck. All of them reported no history of neural disorders and no hearing problems.

Participants gave written informed consent before participation and were financially compensated with 10€/h or received course credits. In the current study, motivation was manipulated through financial rewards and participants could thus earn additional 10€ depending on their behavioral performance. The study was conducted in accordance with the Declaration of Helsinki and was approved by the local ethics committee of the University of Lübeck.

Experimental environment

Participants were seated in a comfortable chair in a sound-attenuated booth. Participants placed their head on a chin rest positioned at ∼70 cm distance from an LED computer monitor (refresh rate, 60 Hz; catalog #TD2421, ViewSonic). The experimental stimulation was controlled by a desktop computer (Windows 7) running Psychtoolbox-3 in MATLAB and an external RME Fireface UC sound card. Sound was delivered binaurally via in-ear headphones (EARTONE 3A, 3M). Responses were given via a four-button response box (The Black Box Toolkit), using only one of the four buttons.

Experimental design

Participants listened to 5.2 s white-noise sounds that each contained one gap. The gap could occur at 1 of 70 randomly selected time points between 2.2 and 4.2 s after noise onset (linearly spaced; Fig. 1). Participants were instructed to press a button on a response box as soon as they detected the gap (Henry and Obleser, 2012; Henry et al., 2014, 2017; Herrmann et al., 2023). Noise sounds were presented at 50 dB above a participant’s sensation level estimated using a methods of limits procedure (Herrmann et al., 2018). During the presentation of the white noise, a fixation cross was presented on the screen. Participants were asked to fixate their eyes on the fixation cross during the experiment.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Experimental design. A, Auditory gap detection task: participant’s task was to detect a gap within a 5.2 s white noise sound. The gap occurred at a random time between 2.2 and 4.2 s post-noise onset (gap window, marked by the gray line). B, A 2 (task difficulty) × 2 (reward prospect) design was used. Task difficulty was manipulated by changing the duration of the gap (hard: duration was titrated to 65% detection performance; easy: twice the titrated duration). An auditory cue presented 1 s before each trial indicated whether the upcoming trial was paired with no reward or reward. C, Hypothesis: according to the motivation intensity theory (gray lines; Brehm and Self, 1989; Richter, 2016), motivation influences cognitive resource recruitment when task difficulty is high (hard), but not when task difficulty is low (easy). In hard settings, participants should only invest cognitively when being motivated to succeed (solid line) but give up when being less motivated (dashed line). Colored dots show our hypothesis.

A 2 × 2 experimental design was used in the current study, such that we manipulated task difficulty (easy, hard) and reward prospect (no reward, reward). In detail, task difficulty was manipulated by presenting participants with white noises containing a near-threshold (hard) or a suprathreshold gap (easy). For the hard condition, gap duration was individually titrated to an ∼65% gap detection performance in training blocks before the main experimental blocks (four to six training blocks of 2 min each). For the easy condition, the estimated gap duration was doubled (Kraus et al., 2023).

Monetary reward prospect was manipulated such that half of the trials were paired with no reward, whereas the other half were paired with the possibility to receive an additional financial reward based on the participant’s performance on these trials. Specifically, after the experiment, three trials for each of the two difficulty levels were chosen from the reward trials (Tusche and Hutcherson, 2018; Teoh et al., 2020; Cole et al., 2022). Participants could gain 10€ in addition to their hourly compensation rate if their average performance across these six trials was >80%.

Trials were presented in 14 blocks, each containing 20 trials. In half of the blocks, task difficulty of the 20 trials was easy, whereas in the other half of the blocks, task difficulty of the 20 trials was hard. Written information about the task difficulty of a block (easy or hard) was presented at the beginning of each block. Hence, participants had prior knowledge about whether trials contained a gap that is easy or hard to detect. Easy and hard blocks alternated, and the starting task difficulty was counterbalanced across participants.

Ten of the 20 trials per block were no-reward trials, whereas the other 10 trials were reward prospect trials. No-reward and reward prospect trials were presented in random order within each block. An auditory cue was presented 1 s before each white noise sound that indicated whether a trial was a no-reward or a reward prospect trial. Auditory cues consisted of a guitar and a flute sound. The pairing of guitar and flute sounds to no-reward and reward prospect trials was counterbalanced across participants. Training before the experiment was conducted to familiarize participants with the cue–reward association. Overall, participants listened to 70 white noise sounds per task difficulty (easy, hard) × reward prospect (no reward, reward) condition, resulting in 280 trials in total.

After the experiment, participants completed a questionnaire regarding their use of the reward cues to examine whether participants were familiar with the cue–reward association during the experiment. They first indicated which auditory cue was associated with reward trials. We then asked them to rate the following statement (translated from German): “I used the auditory cues (guitar and flute) to distinguish between important and unimportant conditions” on a 6-point scale with the following labels: “strongly disagree,” “disagree,” “somewhat disagree,” “somewhat agree,” “agree,” and “strongly agree.” Participants rated this statement twice, separately for the easy and the hard conditions.

Analysis of behavioral data

Any button press within 0.1–1 s after gap onset was defined as a hit (coded 1). Trials for which no button was pressed within this time window were considered a miss (coded 0). Response time was calculated as the time between gap onset and the button press.

Linear mixed-effect modeling using R (version 4.1.2), with the packages lme4 and sjPlot, was used to analyze the influence of task difficulty and reward prospect separately on accuracy and response time. Models included effects of task difficulty (easy, hard) and reward prospect (no reward, reward) as well as the task difficulty × reward prospect interaction. Task difficulty and reward prospect were categorial predictors and were deviation coded [i.e., −0.5 (easy, no-reward) and 0.5 (hard, reward)]. To account for and test for an expected hazard effect (faster responses with later gap time), we included gap time as a regressor (Niemi and Näätänen, 1981; Nobre et al., 2007; Herrmann et al., 2023). Gap time is a continuous variable and was z-transformed before the analysis. Response time data were log transformed to obtain values closer to normal distribution. We included participant-specific random intercepts to account for individual differences in overall accuracy or response time. This resulted in the statistical model expression of the following form: AccuracyorReactiontime[log]∼difficulty*reward + gaptime[z] + (1|participant).

The single-trial accuracy data are binary. We thus calculated a generalized linear mixed-effect model (GLMM) with a binomial distribution and a logit link function (Tune et al., 2021). Response time (log-transformed) is a continuous variable, and we thus used a linear mixed-effect model (LMM) with a Gaussian distribution and an identity link function (Tune et al., 2021).

Our main hypothesis (derived from motivational intensity theory; Fig. 1C) was that, specifically for hard trails, accuracy and response time should depend on the reward level. Moreover, for the accuracy data, we expected that performance would be at ceiling for easy trials, and we thus decided to elaborate the possible influence of reward in more detail for the hard condition. Hence, we used a reduced linear mixed model using data from the two hard conditions (no reward, reward) and tested for the effect of reward prospect on accuracy and response time: AccuracyorResponsetime[log] ∼ rewardhard + gaptime [z]+(1|participant).

Please note that it proved not possible to include random effects for all regressors of interest because of missing convergence (Barr et al., 2013). Therefore, we decided to use only random-intercept models to avoid decisions about including one random effect over the other. Nevertheless, in the Results section, we report whether fixed effects lost their significance when including the corresponding random effect.

A paired-sample t test was used to test for effects in cue-use rating. Effect sizes for t tests are reported as Cohen’s d (Cohen, 1988).

Pupil data recording and preprocessing

Eye movements and pupil size of the right eye were continuously recorded using an Eyelink 1000 Plus eye tracker (SR Research) at a sampling rate of 500 Hz. Data were preprocessed and analyzed using MATLAB (MathWorks). Time windows of missing data and their enclosing sharp signal increase and decrease were marked as blinks and set to NaN (“not a number” in MATLAB; Hershman et al., 2018). If a blink occurred within 100 ms of another blink, the data points between the two blinks were also set to NaN. NaN-coded data points were linearly interpolated. Incomplete eye lid closing can also cause artifacts in the data that do not result in missing data points (Kret and Sjak-Shie, 2019). To remove such artifacts, we calculated the median absolute deviation with a moving time window of 20 ms (Kret and Sjak-Shie, 2019). Data points that were larger than the median of the median absolute deviation + 3 * IQR (interquartile range) were marked as missing timepoints and NaN coded. If a NaN-coded time window occurred within 100 ms of another NaN-coded time window, then data points in between were also NaN-coded. NaN-coded data points were linearly interpolated.

Pupil size can appear to change when the eyes move without an actual change in pupil size, because eye movements change the angle of the pupil relative to the eye-tracking camera (van Rij et al., 2019). To account for such potential influences of eye movement on pupil size, we regressed out the linear and quadratic contributions of the x- and y-coordinates on the pupil size time course (Cui and Herrmann, 2023; Fink et al., 2023; Kraus et al., 2023). In the current study, the residual pupil size was used as the dependent measure. Pupil data were subsequently low-pass filtered at 4 Hz (Butterworth, fourth-order filter) and divided into epochs ranging from −2 to 6.2 s time locked to noise onset. An epoch was excluded if >40% of the trial had been NaN coded before interpolation. If >50% of the trials in any condition were excluded, the full dataset of the respective participant was excluded from analysis (N = 1). Pupil size data were downsampled to 50 Hz. For each trial, the mean pupil size in the −1.6 to −1.1 s time window was subtracted from the pupil size at every time point of the epoch (baseline correction). This baseline time window was chosen to avoid contamination of the auditory cue which was presented at −1 s. For each participant, single-trial time courses were averaged, separately for each condition.

Analysis of task difficulty and reward prospect on pupil size

To investigate the effects of task difficulty and reward prospect on pupil size, we used a similar model for the analysis as was used for the analysis of behavioral data. Non-baseline-corrected pupil size data were averaged across the time window ranging from 2.2 s (onset of gap window) to 6.2 s (end of trial) and were used as the dependent variable (in the literature referred to as mean pupil diameter). Mean pupil size in the baseline time window (−1.6 to −1.1 s; time locked to noise onset) was used as a predictor to account for baseline variations in the model (Alday, 2019). Task difficulty and reward prospect were deviation-coded predictors in the model as well as their interaction. The pupil size data were continuous and were thus z scored as follows: Pupilsize[z]∼baselinepupilsize[z] + difficulty * reward+(1|participant).

To visualize and investigate when in time the effects of task difficulty and reward prospect occurred, we calculated difference time courses. For time-resolved analyses of the main effect of task difficulty, we averaged time courses across the two reward prospect levels, separately for the easy and the hard condition. The averaged time course for the easy condition was then subtracted from the averaged time course for the hard condition. For time-resolved analyses of the main effect of reward prospect, we averaged time courses across the two task difficulty conditions, separately for the no-reward condition and the reward condition. The averaged time course for the no-reward condition was then subtracted from the averaged time course for the reward condition. Every time point of the difference time courses was tested against zero using a one-sample t test. To examine whether the effect of task difficulty differed from the effect of reward prospect, the two difference time courses (reflecting the task-difficulty and reward prospect effects) were compared using a paired sample t test for each time point. To account for multiple comparisons, p-values were corrected across time points using a false discovery rate (FDR) of q = 5% (Benjamini and Hochberg, 1995).

For the time-resolved analysis of the interaction between task difficulty and reward prospect, we subtracted the time course for the no-reward condition from the time course for the reward condition (capturing the reward prospect effect), separately for the easy condition and the hard condition. Furthermore, we subtracted the time course for the easy condition from the time course of the hard condition (capturing the task-difficulty effect), separately for the no-reward condition and the reward condition. Every time point of the difference time courses was tested against zero using a one-sample t test, and p-values were FDR corrected. The reward prospect effect between easy and hard task difficulty and the task difficulty effect between no-reward and reward were also compared for each time point (i.e., testing the interaction), again using FDR correction.

Analysis of the relationship of pupil size and behavior

We also investigated whether a smaller pupil size increases the probability of missing the gap. To this end, we time locked the baselined (see above) pupil size data of every trial to the exact gap time. First, we split the trials of the hard condition into hit versus miss trials. Trials for the easy condition were not used because very few gaps were missed. To analyze whether the effect of reward prospect differs between hit and miss trials, we subtracted the time course for the no-reward condition from the time course for the reward condition, separately for hit and miss trials. Each time point of the difference time courses was tested against zero using a one-sample t test to test at which time points pupil size was different between reward and no-reward condition. The time courses of the reward prospect effect for the hit and miss conditions were also compared using paired-sample t tests at each time point. To analyze further whether the effect of detection (hit vs miss) differed between reward and no-reward conditions, we subtracted the time course for miss trials from the time course for hit trials, separately for reward and no-reward conditions. Again, both difference time courses were tested against zero to test at which time points hit and miss trials differed. To test whether the detection effect differed between reward and no-reward trials, the time courses of the detection effect for reward and no-reward conditions were compared with each other using paired-sample t tests. To account for multiple comparisons, p-values were corrected across time points using an FDR of q = 5% (Benjamini and Hochberg, 1995).

To investigate whether pupil size before a gap predicts behavioral performance, we used a GLMM with the following expression: AccuracyorResponsetime[log]∼pupilsize[within;z] + pupilsize[between;z]+reward+gaptime[z] + (1|participant).

Again, only data from the hard condition were used, because of the limited number of miss trials in the easy condition. For the pupil size regressors, pupil size data (baselined to −1.6 to −1.1 s to noise onset) were time locked to the gap onset time and averaged across the 500 ms time window before the gap. To disentangle associations of pupil size and behavior at the trial-by-trial state level (i.e., within participant) from associations at the trait level (i.e., between participants), we included two separate regressors associated with changes in pupil size. The between-participants regressor contained the trial-averaged pupil size per individual, whereas the within-participant regressor contained the single-trial pupil size relative to the individual mean (Bell et al., 2019; Tune et al., 2021; Kraus et al., 2023). Similar to the LMM for the behavioral analysis, we included the gap time of each trial as a regressor into the model. All continuous data (pupil size and gap time) were z scored across participants. Reward was deviation coded.

To further test a possible mediation effect of pupil size on the effect of reward prospect on response time, we calculated a mediation analysis in R using the package mediation. We included the following two models into the mediation analysis running 10,000 iterations: Pupilsize[z]∼reward+gaptime[z]+(1|participant), (1)and Responsetime[log]∼pupilsize[z]+reward + gaptime[z]+(1|participant). (2)

For the pupil size data, we averaged the gap-locked pupil size data across the 0.5 s before the gap.

To illustrate the association of pupil size and response time (see Fig. 4D), we created the following two groups of trials: (1) for the group of fast trials, we included all trials with response times at least +0.75 SD above the mean; and (2) for the group of slow trials, we included all trials with response times at least −0.75 SD below the mean. We chose this threshold as a good compromise between too many and too few trials per group. The grouping in fast and slow trials was done separately for reward and no-reward conditions to ensure that the proportion of reward and no-reward trials does not differ between slow and fast trials. Furthermore, we only included trials such that there was no difference in gap times, relative to noise onset, between slow and fast trial responses.

Data availability

Data and analysis scripts are available at https://osf.io/yxn96/.

Results

Reward prospect impacts performance under hard conditions

As expected, participants performed better and faster in the easy compared with the hard condition [Fig. 2: accuracy, GLMM: odds ratio (OR) = 0.03, SE = 0.004, p = 2.05 × 10−153; response time, LMM: β = 0.33, SE = 0.006, p < 2.05 × 10−153]. An overall reward prospect effect was present for response time but not for accuracy (accuracy, GLMM: p > 0.2; response time, LMM: β = −0.03, SE = 0.006, p = 4.89 × 10−5). The interaction between task difficulty and reward prospect was not significant (accuracy: OR = 1.32, p > 0.2; response time: β = −0.02, p > 0.2). The hazard effect (Niemi and Näätänen, 1981; Nobre et al., 2007) was present for response time but not for accuracy (accuracy, GLMM: OR = 1.05, p > 0.2; response time, LMM: β = −0.06, SE = 0.003, p = 1.43 × 10−66). The later the gap appeared in a trial the faster participants responded, but the probability to detect the gap did not change as a function of gap time.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Behavioral results. A, Accuracy: a participant’s performance was better in the easy compared with the hard condition. In the hard condition, performance was better in the reward compared with the no-reward condition. Insets, 45° scatter plots showing the task difficulty effect (left) and the reward prospect effect (right) from linear mixed-model analysis. Difference plots (y-axis minus x-axis) are shown in top right corners. B, Response time: participants were faster in the easy compared with the hard condition and in the reward compared with the no-reward condition. In the hard condition, participants were faster in the reward compared with the no-reward condition. Insets, Same as in A but for response time. Crosshairs in insets indicate 95% confidence interval (CI). Note that all analysis of response time used log-transformed data, while figures show untransformed data for reference. * p < 0.05, ** p < 0.01, *** p < 0.001, n.s. p > 0.05.

Our main interest was whether reward prospect influences behavior under the hard listening condition. Hence, we calculated a mixed-effect model-analysis using only the data from hard trials and tested for the main effect of reward prospect. We found better and faster performance in the reward compared with the no-reward condition (accuracy, GLMM: OR = 1.33, SE= 0.092, p = 7.01 × 10−5; response time, LMM: β = −0.03, SE = 0.011, p = 3.48 × 10−3). For hard trials, later gap time was associated with a faster response and a higher probability to detect the gap (accuracy, GLMM: OR = 1.08, SE = 0.037, p = 0.029; response time, LMM: β = −0.04, SE = 0.006, p = 4.89 × 10−13).

Interestingly, participants reported after the experiment that they had used the cue indicating reward versus no-reward trials more to adjust their listening when the task was harder (t(32) = 7.17, p = 3.89 × 10−8, d = 1.25).

Sensitivity of pupil size to reward prospect depends on task difficulty

We first analyzed pupil size data over a large time window from 2.2 s (onset of gap time window) to 6.2 s (end of trial; time locked to noise onset; Fig. 3). Pupil size was averaged within this time window for each trial and used as dependent variable in an LMM. Pupil size was larger for the hard compared with the easy condition (effect of task difficulty: β = 0.30, SE = 0.02, p = 2.90 × 10−83) and for the reward condition compared with the no-reward condition (effect of reward prospect: β = 0.14, SE = 0.02, p = 1.06 × 10−18).

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

Pupil size results. A, Averaged pupil size time courses across participants per condition. Error bands reflect the within-participants error. Gray areas indicate time windows during which a gap could occur, from 2.2 to 4.2 s. B, Averaged data for a 2.2–6.2 s time window. Error bars indicate the standard error of the mean (SEM). C, Time-resolved task difficulty (hard minus easy) and reward prospect (reward minus no-reward) effects. Lines at the bottom show significant time points at which main effects were significantly different from zero (black, task difficulty; gray dashed, reward prospect; FDR thresholded) and different from each other (light blue). Error bands reflect the SEM. Gray areas indicate time windows during which a gap could occur, from 2.2 to 4.2 s. D, 45° scatter plots illustrate the interaction. Left, Data from the easy condition. Right, Data from the hard condition. Colored dots show averaged pupil data per task difficulty level, separately for each participant. The 45° line indicates no difference between conditions. Crosshairs indicate the 95% confidence interval (CI). Difference plots (y-axis minus x-axis) are shown in the top right corners. E, Left, Time-resolved difference of reward. Reward prospect difference (reward minus no-reward) was calculated for each task-difficulty level and participant. Lines at the bottom show significant time points at which Δ reward prospect was significantly different from zero (green, easy; purple, hard) and different from each other (black, FDR thresholded). Right, Time-resolved difference of task difficulty. Task difficulty difference (hard minus easy) was calculated for each reward prospect level and participant. Lines at the bottom show significant time points at which Δ task difficulty was significantly different from zero (dashed line, no-reward; solid line, reward) and different from each other (black, FDR thresholded). Error bands reflect the SEM. Gray areas indicate time windows during which a gap could occur, from 2.2 to 4.2 s.

Critically, the task difficulty × reward prospect interaction was significant, indicating a larger reward prospect effect (reward minus no-reward) when the task was hard compared with easy (LMM; β = 0.12, SE = 0.03, p = 5.97 × 10−5; easy reward prospect effect: t(31) = 2.84, p = 7.95 × 10−3, d = 0.502; hard reward prospect effect: t(31) = 5.25, p = 1.06 × 10−5, d = 0.927; Fig. 3D).

Second, we calculated difference time courses to visualize and investigate in which time window the manipulation of task difficulty and reward prospect led to changes in the pupil size response. For both task-difficulty levels, pupil size was greater in the reward compared with the no-reward condition over a wide time window (easy, 2.68–4.06 s; hard, 1–6.2 s; Fig. 3E, left). For both reward prospect levels, pupil size was greater in the hard condition compared with the easy condition over a wide time window (no-reward: −0.72 to −0.06 s, 1.32–6.2 s; reward: −1.2 to 0.08 s, 0.88–6.2 s; Fig. 3E, right). Critically, and in line with the interaction results from the temporally averaged analysis, the difference in time courses (reward minus no-reward) differed across a wide time window with the task difficulty. The reward prospect effect was larger in the hard compared with the easy condition (2.56–2.8 and 2.96–6.16 s; Fig. 3E, left, right).

Furthermore, for the analysis of the time-resolved main effects, both—the task difficulty and the reward prospect effect—were significant almost the entire time the white noise was presented (difficulty, 0.88–6.2 s; reward, 1.02–6.2 s; Fig. 3C). In addition, pupil size was larger for the hard condition compared with the easy condition in response to the cue, before the noise sound started (−1.14 to 0.08 s). Finally, the task difficulty and the reward prospect effects differed in the early cue time window (−0.44 to 0.1 s), but also toward the end of the white noise sound (4.2–6.14 s; Fig. 3C).

Larger pre-gap pupil size is associated with faster response time

We investigated whether a smaller pupil size before the gap is associated with a higher likelihood of missing a gap. To this end, we time locked the pupil size data of each trial to the respective gap time. Splitting the data in hit and miss trials enabled us to analyze whether the pupil size time course differs depending on gap detection (Fig. 4A). Pupil size was indeed larger for hit trials compared with miss trials, but only ∼1 s after gap onset, for both reward prospect levels (no-reward, 0.38–1.14 s; reward, 0.48–1.26 s). Toward the end of the noise sound, pupil size was larger for miss trials compared with hit trials for both reward prospect conditions (no-reward, 1.7–2 s; reward, 1.8–2 s; Fig. 4B). This indicates that pupil size decreased after participants detected the gap (hit), whereas, for miss trials, the pupil continued to dilate continuously until the offset of the white noise. This behavior-related effect occurred independent from reward prospect (interaction of detection and reward prospect was not significant; Fig. 4B).

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Association of pupil size and behavioral outcome. A, Averaged pupil size time courses (time locked to gap onset) for the hard condition across participants, separately for each reward prospect condition (light vs dark) and for hit and miss trials (solid vs dashed lines). Error bands reflect the within-participant error. B, Time-resolved difference of the detection effect. Detection difference (hit minus miss) was calculated for each reward prospect level (no-reward, reward) and participant. Lines at the bottom show significant time points at which Δ detection was significantly different from zero (light line, no-reward; dark line, reward). Δ Detection did not significantly differ between no-reward and reward. Error bands reflect SEM. C, Time-resolved difference of reward prospect. Reward prospect difference (reward minus no-reward) was calculated for each detection level (hit, miss) and participant. Lines at the bottom show significant time points at which Δ reward prospect was significantly different from zero (solid, hit; dashed, miss). Δ Reward prospect did not significantly differ between hit and miss trials. Error bands reflect the SEM. D, Average of split of pupil size data into trials with slow (dashed) and fast (solid) response times. Histogram shows the distribution of response times of the corresponding trials. Gray area indicates time window for LMM analysis in E. Error bands reflect the within-participant error. E, Effect of pupil size on response time in an LMM analysis. Larger pupil size is associated with faster response time. Participant-specific slope for pupil size did not improve the model, but we show it here for illustrating purposes. All significant effects are based on FDR correction. F, Mediation analysis shows that the total effect from reward on response time is partly mediated by pre-gap pupil size. *p < 0.05, **p < 0.01, ***p < 0.001.

Likewise, the reward prospect effect (reward minus no-reward) was independent of whether a gap was detected or not (hit vs miss). In both cases, pupil size was larger for the reward compared with the no-reward condition. The reward prospect effect started earlier for hit trials than for miss trials (hit, −2 to 2 s; miss, 0.52–1.1 s; interaction of detection and reward prospect was not significant; Fig. 4C).

Last, we examined whether the differences in pupil size between hit and miss trials may also translate to successful predictions of behavior on a trial-by-trial level. We calculated LMM models to predict accuracy or log response time from the pupil size averaged across the 0.5 s time window pre-gap onset (Fig. 4D). There was no significant trial-by-trial relationship between pre-gap pupil size and accuracy for the within-participant effect (GLMM: OR = 1.02, p > 0.5) or for the between-participant effect (OR = 1.13, p > 0.4). However, within participants, trials with larger pre-gap pupil size preceded shorter response times in these trials (LMM: β = −0.013, SE = 0.006, p = 0.039; no effect at the between-participants level: β = −0.015, p > 0.5; Fig. 4E).

Since we found an effect of reward prospect on response time and pupil size, and that the pupil size and response time are associated, we asked whether the effect of reward prospect on response time may be mediated by pupil size. Therefore, we ran a mediation analysis resulting in a small mediation effect of pupil size (β = −0.0028, p = 0.035; proportion mediated, 8.5%).

Discussion

In the current study, we investigated the extent to which task difficulty and a listener’s motivational state, here manipulated as reward prospect, influence pupil size and ensuing behavior. An auditory gap detection task allowed us to manipulate both the task difficulty (i.e., gap duration) and precise timing of a target (gap).

Reward prospect improved response time and accuracy under difficult listening conditions. Critically, pupil dilation was greater under higher compared with lower task demands, and especially so when reward prospect was high. This result emphasizes the fact that pupil size cannot be taken as an invariant readout of listening demand but is also sensitive to an individual’s motivational state, especially under more challenging listening conditions.

Reward prospect improves behavioral performance

Behavioral performance decreased with increasing cognitive demand, as expected based on our manipulation (but see Herrmann et al., 2023; Kraus et al., 2023). Critically, reward prospect was associated with better gap detection accuracy and response time when listening was hard (response time was also faster under easy listening conditions; Fig. 2B). Some previous work also observed reward-related performance increases in auditory tasks (Mirkovic et al., 2019; Zhang et al., 2019; Bianco et al., 2021), whereas other studies did not (Koelewijn et al., 2018; Carolan et al., 2021). Why behavioral benefits related to manipulations of motivation are observed only sometimes could be because of different tasks (mainly speech tasks compared with present study: Bianco et al., 2021; Carolan et al., 2021; Mirkovic et al., 2019; Zhang et al., 2019), different manipulation of task difficulty (speech vocoding: Carolan et al., 2021; speech rate: Zhang et al., 2019; directional microphone technique in hearing aids: Mirkovic et al., 2019; target saliency: Koelewijn et al., 2018, 2021), and/or the amount of monetary reward prospect. It appears that the maximum achievable reward has been lower in studies that did not observe reward performance benefits (₤2.50: Carolan et al., 2021; 5€: Koelewijn et al., 2018) compared with those observing such benefits (10€: present study; 20€: Mirkovic et al., 2019; 50€: Zhang et al., 2019), possibly adding to the inconstancies across studies.

We observed no statistically significant interaction between task difficulty and reward prospect on behavioral outcomes (accuracy, response speed; Fig. 2). This is not easily reconciled with motivation intensity theory (Brehm and Self, 1989; Richter, 2016) and also empirical listening effort work (Mirkovic et al., 2019; Zhang et al., 2019), which suggest that motivation should exhibit the strongest benefit under harder task conditions, and possibly no effect under easy conditions. Under hard conditions, people in a low-motivation state may give up because the effort required is not worth the reward (Fig. 1C), whereas under easy conditions, participants should have no problem to perform the task regardless of motivational incentives.

The observation that the reward effect was not significantly stronger under hard compared with easy listening condition, as suggested by the main effect and absence of an interaction, may be because of several reasons. During the “hard” condition under no reward, participants had nothing else interesting to do and may have invested cognitively. The hard condition may also not have been hard enough for people to give up under the no-reward condition on most trials (∼69%; Fig. 2A). Another reason could be that the “easy” condition still required individuals to invest effort despite ceiling performance accuracy (work showing the same behavioral performance but a different amount of listening effort: Bentler et al., 2008; Picou and Ricketts, 2014; Gagné et al., 2017). People may thus have invested more cognitively during reward than no-reward trials even when task difficulty was easy, leading to response time benefits from reward for both the easy listening and the hard listening conditions.

Interestingly, although there was no significant interaction between task difficulty and reward prospect, participants’ self-reports about cue use showed that participants used the auditory cue more in the hard condition to adjust their listening to the task. This is similar to Carolan et al. (2021), who found reward effects on self-reported listening effort but not on performance.

Pupil size sensitivity to listening demand depends on motivational state

We show increased pupil size for high compared with low task demands, consistent with previous work (Kahneman and Beatty, 1966; Zekveld et al., 2010; Koelewijn et al., 2012; Zekveld and Kramer, 2014; Winn et al., 2015; Wendt et al., 2016; Ohlenforst et al., 2018; Zhao et al., 2019; Kadem et al., 2020; Kraus et al., 2023). We also observed that monetary reward prospect for good task performance increased pupil size (Fig. 3), indicating that motivational state plays a key role in modulating pupil size. A recent meta-analysis also suggests that pupil size is sensitive to reward manipulations (Carolan et al., 2022; but see Bijleveld et al., 2009; Manohar et al., 2017; Koelewijn et al., 2018; Zhang et al., 2019), although not all studies show motivation effects on pupil size (Gilzenrat et al., 2010; Koelewijn et al., 2021).

Critically, task difficulty and reward prospect interacted. The increase in pupil size because of reward prospect was greater under hard listening conditions compared with easy listening conditions (Fig. 3), suggesting that the investment of cognitive resources is greater when a listener is highly motivated, especially in hard listening situations.

The interaction between task difficulty and reward prospect on pupil size is in line with motivation intensity theory (Brehm and Self, 1989; Richter, 2016), in that a higher influence of motivation on effort (pupil size) was expected in highly demanding compared with less demanding situations. Motivation intensity theory predicts that a person only invests cognitively when they have sufficient cognitive resources available and are motivated (Brehm and Self, 1989; Richter, 2016), otherwise they would give up listening.

Under hard conditions, people may have given up investing effort (Zekveld and Kramer, 2014; Wendt et al., 2016; Ohlenforst et al., 2018) more frequently compared with easy conditions. Under easy listening conditions, participants had sufficient cognitive resources to perform the task and were motivated to do so as indicated by the ceiling performance (Fig. 2). However, our data unexpectedly show also for the easy condition that pupil size—and thus listening effort—is larger during reward compared with no-reward trials, whereas no difference was predicted by the motivation intensity theory (Brehm and Self, 1989; Richter, 2016; Fig. 1). It is possible that participants occasionally gave up listening under the easy, no-reward condition because they were bored or underchallenged (Westgate and Wilson, 2018; Herrmann and Johnsrude, 2020), but then still detected the gap because it was perceptually sufficiently salient in the easy condition (consistent also with longer response times for the no reward than the reward condition). Yet, if this low-effort strategy was used by listeners, why they would not have used it under the reward condition—where pupil size indicates higher effort—is not clear. Future research may need to further address the impact of motivation under easy listening conditions.

Not least, the present results hold an important lesson for the increasingly common use of pupil dilation as a clinical measure. Pupil dilation cannot serve as a direct readout of purely physical listening demand. It must be taken into account that the internal motivational state of a person is not only influencing pupil dilation directly. Motivational state also gates or constrains the size of the effects that listening demand will exert on pupil size. Furthermore, a person’s motivational state can affect their pupil size even when influences on behavioral performance do not emerge.

Pupil size predicts response time

The time-resolved analysis of pupil size revealed that pupil size differed depending on whether a listener detected the gap or not, whereas pupil size did not predict accuracy on a trial-by-trial level. However, we found that a larger pupil size was associated with faster response times. These results are in line with work suggesting that response time may be a more sensitive behavioral measure compared with accuracy (Houben et al., 2013; Weis et al., 2013; Kang et al., 2017) and that response time is associated with pupil-linked arousal (Wainstein et al., 2017; Schriver et al., 2018; Van Kempen et al., 2019). Using our auditory gap detection task allowed us to investigate when in time effort is invested, revealing that pupil size predicts response time measures. This can be interpreted as higher cognitive resource investment measured by pupil size helps to optimize behavioral outcome. Time-resolved analyses may be more challenging for complex speech stimuli, for which it is less clear when effort is invested (Bijleveld et al., 2009; Gilzenrat et al., 2010; Koelewijn et al., 2018, 2021; Zhang et al., 2019). Furthermore, the present work highlights the use of single-trial models to analyze pupil–behavior associations (Tune et al., 2021; Kraus et al., 2023).

Previous work has suggested a link between LC–norepinephrine (NE) activity and performance increases (Aston-Jones et al., 1994; Usher et al., 1999; Aston-Jones and Cohen, 2005), which may be reflected in the pupil size. Our mediation analysis revealed that a small part of the influence of reward on response time is mediated via pupil size. Hence, only a small portion of the behavioral benefit under reward can be explained via enhanced LC–NE activity that is indexed by pupil size changes. Our data suggest that most of the reward effect directly impacts behavioral performance or via pathways different from the LC–NE pathway.

Conclusion

The motivational state of a person influences the extent to which they invest cognitive resources under high demands. We used an auditory gap detection paradigm that tightly controlled task demands and target timing. Pupil size reflected stronger influences of a person’s motivational state under the hard compared to the easy listening condition. This highlights the importance for future work to not see pupil size as a simple readout of task demand. Not least pupil size is also a key indicator when considering an individual’s motivational state in measuring cognitive investment.

Acknowledgments

Acknowledgment: We thank Aigerim Tuichieva for assisting with the data collection.

Footnotes

  • The authors declare no competing financial interests.

  • This work was supported by Deutsche Forschungsgesellschaft (Grant HE 7857/1-1) awarded to B.H. B.H. was supported by the Natural Sciences and Engineering Research Council of Canada (Discovery Grant RGPIN-2021-02602) and the Canada Research Chair Program (Grant CRC-2019-00156, 232733).

  • ↵* J.O. and B.H. share joint senior authorship.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.

References

  1. ↵
    Alday PM (2019) How much baseline correction do we need in ERP research? Extended GLM model can replace baseline correction while lifting its limits. Psychophysiology 56:e13451. https://doi.org/10.1111/psyp.13451 pmid:31403187
    OpenUrlCrossRefPubMed
  2. ↵
    Aston-Jones G, Cohen JD (2005) An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance. Annu Rev Neurosci 28:403–450. https://doi.org/10.1146/annurev.neuro.28.061604.135709 pmid:16022602
    OpenUrlCrossRefPubMed
  3. ↵
    Aston-Jones G, Rajkowski J, Kubiak P, Alexinsky T (1994) Locus coeruleus neurons in monkey are selectively activated by attended cues in a vigilance task. J Neurosci 14:4467–4480. https://doi.org/10.1523/JNEUROSCI.14-07-04467.1994 pmid:8027789
    OpenUrlAbstract/FREE Full Text
  4. ↵
    Barr DJ, Levy R, Scheepers C, Tily HJ (2013) Random effects structure for confirmatory hypothesis testing: keep it maximal. J Mem Lang 68:255–278. https://doi.org/10.1016/j.jml.2012.11.001
    OpenUrlCrossRefPubMed
  5. ↵
    Bell A, Fairbrother M, Jones K (2019) Fixed and random effects models: making an informed choice. Qual Quant 53:1051–1074. https://doi.org/10.1007/s11135-018-0802-x
    OpenUrlCrossRef
  6. ↵
    Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B Stat Methodol 57:289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
    OpenUrl
  7. ↵
    Bentler R, Wu YH, Kettel J, Hurtig R (2008) Digital noise reduction: outcomes from laboratory and field studies. Int J Audiol 47:447–460. https://doi.org/10.1080/14992020802033091 pmid:18698521
    OpenUrlCrossRefPubMed
  8. ↵
    Bianco R, Mills G, de Kerangal M, Rosen S, Chait M (2021) Reward enhances online participants’ engagement with a demanding auditory task. Trends Hear 25:23312165211025941. https://doi.org/10.1177/23312165211025941 pmid:34170748
    OpenUrlPubMed
  9. ↵
    Bijleveld E, Custers R, Aarts H (2009) The unconscious eye opener. Psychol Sci 20:1313–1315. https://doi.org/10.1111/j.1467-9280.2009.02443.x pmid:19788532
    OpenUrlCrossRefPubMed
  10. ↵
    Brehm JW, Self EA (1989) The intensity of motivation. Annu Rev Psychol 40:109–131. https://doi.org/10.1146/annurev.ps.40.020189.000545 pmid:2648973
    OpenUrlCrossRefPubMed
  11. ↵
    Carolan PJ, Heinrich A, Munro KJ, Millman RE (2021) Financial reward has differential effects on behavioural and self-report measures of listening effort. Int J Audiol 60:900–910. https://doi.org/10.1080/14992027.2021.1884907 pmid:33630718
    OpenUrlPubMed
  12. ↵
    Carolan PJ, Heinrich A, Munro KJ, Millman RE (2022) Quantifying the effects of motivation on listening effort: a systematic review and meta-analysis. Trends Hear 26:23312165211059982. https://doi.org/10.1177/23312165211059982 pmid:35077257
    OpenUrlPubMed
  13. ↵
    Carrillo-de-la-Peña M, Cadaveira F (2000) The effect of motivational instructions on P300 amplitude. Neurophysiol Clin 30:232–239. https://doi.org/10.1016/s0987-7053(00)00220-3 pmid:11013896
    OpenUrlCrossRefPubMed
  14. ↵
    Cohen J (1988) Statistical power analysis for the behavioral sciences, pp 20–26. Hillsdale, NJ: Lawrence Erlbaum.
  15. ↵
    Cole L, Lightman S, Clark R, Gilchrist ID (2022) Tonic and phasic effects of reward on the pupil: implications for locus coeruleus function. Proc R Soc B Biol Sci 289:20221545. https://doi.org/10.1098/rspb.2022.1545
    OpenUrl
  16. ↵
    Cui ME, Herrmann B (2023) Eye movements decrease during effortful speech listening. J Neurosci 43:5856–5869. https://doi.org/10.1523/JNEUROSCI.0240-23.2023 pmid:37491313
    OpenUrlAbstract/FREE Full Text
  17. ↵
    DeWall CN, Baumeister RF, Mead NL, Vohs KD (2011) How leaders self-regulate their task performance: evidence that power promotes diligence, depletion, and disdain. J Pers Soc Psychol 100:47–65. https://doi.org/10.1037/a0020932 pmid:20919772
    OpenUrlCrossRefPubMed
  18. ↵
    Eckert MA, Teubner-Rhodes S, Vaden KI (2016) Is listening in noise worth it? the neurobiology of speech recognition in challenging listening conditions. Ear Hear 37 Suppl 1]:101S–110S. https://doi.org/10.1097/AUD.0000000000000300 pmid:27355759
    OpenUrlCrossRefPubMed
  19. ↵
    Fink L, Simola J, Tavano A, Lange E, Wallot S, Laeng B (2023) From pre-processing to advanced dynamic modeling of pupil data. Behav Res Methods. Advance online publication. Retrieved June 22, 2023. https://doi.org/10.3758/s13428-023-02098-1.
  20. ↵
    Gagné JP, Besser J, Lemke U (2017) Behavioral assessment of listening effort using a dual-task paradigm: a review. Trends Hear 21:2331216516687287. https://doi.org/10.1177/2331216516687287 pmid:28091178
    OpenUrlPubMed
  21. ↵
    Gilzenrat MS, Nieuwenhuis S, Jepma M, Cohen JD (2010) Pupil diameter tracks changes in control state predicted by the adaptive gain theory of locus coeruleus function. Cogn Affect Behav Neurosci 10:252–269. https://doi.org/10.3758/CABN.10.2.252 pmid:20498349
    OpenUrlCrossRefPubMed
  22. ↵
    Henry MJ, Obleser J (2012) Frequency modulation entrains slow neural oscillations and optimizes human listening behavior. Proc Natl Acad Sci U S A 109:20095–20100. https://doi.org/10.1073/pnas.1213390109 pmid:23151506
    OpenUrlAbstract/FREE Full Text
  23. ↵
    Henry MJ, Herrmann B, Obleser J (2014) Entrained neural oscillations in multiple frequency bands comodulate behavior. Proc Natl Acad Sci U S A 111:14935–14940. https://doi.org/10.1073/pnas.1408741111 pmid:25267634
    OpenUrlAbstract/FREE Full Text
  24. ↵
    Henry MJ, Herrmann B, Kunke D, Obleser J (2017) Aging affects the balance of neural entrainment and top-down neural modulation in the listening brain. Nat Commun 8:15801. https://doi.org/10.1038/ncomms15801 pmid:28654081
    OpenUrlCrossRefPubMed
  25. ↵
    Herrmann B, Maess B, Johnsrude IS (2018) Aging affects adaptation to sound-level statistics in human auditory cortex. J Neurosci 38:1989–1999. https://doi.org/10.1523/JNEUROSCI.1489-17.2018 pmid:29358362
    OpenUrlAbstract/FREE Full Text
  26. ↵
    Herrmann B, Johnsrude IS (2020) A model of listening engagement (MoLE). Hear Res 397:108016. https://doi.org/10.1016/j.heares.2020.108016 pmid:32680706
    OpenUrlPubMed
  27. ↵
    Herrmann B, Maess B, Henry MJ, Obleser J, Johnsrude IS (2023) Neural signatures of task-related fluctuations in auditory attention and age-related changes. Neuroimage 268:119883. https://doi.org/10.1016/j.neuroimage.2023.119883 pmid:36657693
    OpenUrlPubMed
  28. ↵
    Hershman R, Henik A, Cohen N (2018) A novel blink detection method based on pupillometry noise. Behav Res Methods 50:107–114. https://doi.org/10.3758/s13428-017-1008-1 pmid:29340968
    OpenUrlPubMed
  29. ↵
    Hodgetts WE, Aalto D, Ostevik A, Cummine J (2019) Changing hearing performance and sound preference with words and expectations: meaning responses in audiology. Ear Hear 40:615–620. https://doi.org/10.1097/AUD.0000000000000634 pmid:30063475
    OpenUrlCrossRefPubMed
  30. ↵
    Houben R, Van Doorn-Bierman M, Dreschler WA (2013) Using response time to speech as a measure for listening effort. Int J Audiol 52:753–761. https://doi.org/10.3109/14992027.2013.832415 pmid:24053226
    OpenUrlCrossRefPubMed
  31. ↵
    Joshi S, Gold JI (2020) Pupil size as a window on neural substrates of cognition. Trends Cogn Sci 24:466–480. https://doi.org/10.1016/j.tics.2020.03.005 pmid:32331857
    OpenUrlCrossRefPubMed
  32. ↵
    Joshi S, Li Y, Kalwani RM, Gold JI (2016) Relationships between pupil diameter and neuronal activity in the locus coeruleus, colliculi, and cingulate cortex. Neuron 89:221–234. https://doi.org/10.1016/j.neuron.2015.11.028 pmid:26711118
    OpenUrlCrossRefPubMed
  33. ↵
    Kadem M, Herrmann B, Rodd JM, Johnsrude IS (2020) Pupil dilation is sensitive to semantic ambiguity and acoustic degradation. Trends Hear 24:2331216520964068. https://doi.org/10.1177/2331216520964068 pmid:33124518
    OpenUrlPubMed
  34. ↵
    Kahneman D (1973) Attention and effort. Hoboken, NJ: Prentice Hall. https://doi.org/10.2307/1421603
  35. ↵
    Kahneman D, Beatty J (1966) Pupil diameter and load on memory. Science 154:1583–1585. https://doi.org/10.1126/science.154.3756.1583 pmid:5924930
    OpenUrlAbstract/FREE Full Text
  36. ↵
    Kang G, Wang L, Zhou X (2017) Reward interacts with modality shift to reduce cross-modal conflict. J Vis 17(1):19, 1–14. https://doi.org/10.1167/17.1.19 pmid:28114493
    OpenUrlCrossRefPubMed
  37. ↵
    Koelewijn T, Zekveld AA, Festen JM, Kramer SE (2012) Pupil dilation uncovers extra listening effort in the presence of a single-talker masker. Ear Hear 33:291–300. https://doi.org/10.1097/AUD.0b013e3182310019 pmid:21921797
    OpenUrlCrossRefPubMed
  38. ↵
    Koelewijn T, Zekveld AA, Lunner T, Kramer SE (2018) The effect of reward on listening effort as reflected by the pupil dilation response. Hear Res 367:106–112. https://doi.org/10.1016/j.heares.2018.07.011 pmid:30096490
    OpenUrlCrossRefPubMed
  39. ↵
    Koelewijn T, Zekveld AA, Lunner T, Kramer SE (2021) The effect of monetary reward on listening effort and sentence recognition. Hear Res 406:108255. https://doi.org/10.1016/j.heares.2021.108255 pmid:33964552
    OpenUrlCrossRefPubMed
  40. ↵
    Kraus F, Tune S, Obleser J, Herrmann B (2023) Neural α oscillations and pupil size differentially index cognitive demand under competing audiovisual task conditions. J Neurosci 43:4352–4364. https://doi.org/10.1523/JNEUROSCI.2181-22.2023
    OpenUrlAbstract/FREE Full Text
  41. ↵
    Kret ME, Sjak-Shie EE (2019) Preprocessing pupil size data: guidelines and code. Behav Res Methods 51:1336–1342. https://doi.org/10.3758/s13428-018-1075-y pmid:29992408
    OpenUrlCrossRefPubMed
  42. ↵
    Manohar SG, Finzi RD, Drew D, Husain M (2017) Distinct motivational effects of contingent and noncontingent rewards. Psychol Sci 28:1016–1026. https://doi.org/10.1177/0956797617693326 pmid:28488927
    OpenUrlCrossRefPubMed
  43. ↵
    Matthen M (2016) Effort and displeasure in people who are hard of hearing. Ear Hear 37 Suppl 1]:28S–34S. https://doi.org/10.1097/AUD.0000000000000292 pmid:27355767
    OpenUrlCrossRefPubMed
  44. ↵
    McAuley JD, Henry MJ, Wedd A, Pleskac TJ, Cesario J (2012) Effects of musicality and motivational orientation on auditory category learning: a test of a regulatory-fit hypothesis. Mem Cognit 40:231–251. https://doi.org/10.3758/s13421-011-0146-4 pmid:22006580
    OpenUrlCrossRefPubMed
  45. ↵
    McGarrigle R, Munro KJ, Dawes P, Stewart AJ, Moore DR, Barry JG, Amitay S (2014) Listening effort and fatigue: what exactly are we measuring? A British Society of Audiology Cognition in Hearing Special Interest Group “white paper”. Int J Audiol 53:433–440. https://doi.org/10.3109/14992027.2014.890296 pmid:24673660
    OpenUrlCrossRefPubMed
  46. ↵
    Mirkovic B, Debener S, Schmidt J, Jaeger M, Neher T (2019) Effects of directional sound processing and listener’s motivation on EEG responses to continuous noisy speech: do normal-hearing and aided hearing-impaired listeners differ? Hear Res 377:260–270. https://doi.org/10.1016/j.heares.2019.04.005 pmid:31003037
    OpenUrlCrossRefPubMed
  47. ↵
    Niemi P, Näätänen R (1981) Foreperiod and simple reaction time. Psychol Bull 89:133–162. https://doi.org/10.1037/0033-2909.89.1.133
    OpenUrlCrossRef
  48. ↵
    Nobre A, Correa A, Coull J (2007) The hazards of time. Curr Opin Neurobiol 17:465–470. https://doi.org/10.1016/j.conb.2007.07.006 pmid:17709239
    OpenUrlCrossRefPubMed
  49. ↵
    Ohlenforst B, Wendt D, Kramer SE, Naylor G, Zekveld AA, Lunner T (2018) Impact of SNR, masker type and noise reduction processing on sentence recognition performance and listening effort as indicated by the pupil dilation response. Hear Res 365:90–99. https://doi.org/10.1016/j.heares.2018.05.003 pmid:29779607
    OpenUrlCrossRefPubMed
  50. ↵
    Peelle JE (2018) Listening effort: how the cognitive consequences of acoustic challenge are reflected in brain and behavior. Ear Hear 39:204–214. https://doi.org/10.1097/AUD.0000000000000494 pmid:28938250
    OpenUrlCrossRefPubMed
  51. ↵
    Pichora-Fuller MK, Kramer SE, Eckert MA, Edwards B, Hornsby BWY, Humes LE, Lemke U, Lunner T, Matthen M, Mackersie CL, Naylor G, Phillips NA, Richter M, Rudner M, Sommers MS, Tremblay KL, Wingfield A (2016) Hearing impairment and cognitive energy: the framework for understanding effortful listening (FUEL). Ear Hear 37 Suppl 1]:5S–27S. https://doi.org/10.1097/AUD.0000000000000312 pmid:27355771
    OpenUrlCrossRefPubMed
  52. ↵
    Picou EM, Ricketts TA (2014) The effect of changing the secondary task in dual-task paradigms for measuring listening effort. Ear Hear 35:611–622. https://doi.org/10.1097/AUD.0000000000000055 pmid:24992491
    OpenUrlCrossRefPubMed
  53. ↵
    Piquado T, Benichov JI, Brownell H, Wingfield A (2012) The hidden effect of hearing acuity on speech recall, and compensatory effects of self-paced listening. Int J Audiol 51:576–583. https://doi.org/10.3109/14992027.2012.684403 pmid:22731919
    OpenUrlCrossRefPubMed
  54. ↵
    Richter M (2016) The moderating effect of success importance on the relationship between listening demand and listening effort. Ear Hear 37 Suppl 1]:111S–117S. https://doi.org/10.1097/AUD.0000000000000295 pmid:27355760
    OpenUrlCrossRefPubMed
  55. ↵
    Richter M, Gendolla GHE, Wright RA (2016) Three decades of research on motivational intensity theory. In: Advances in motivation science (Elliot AJ, ed), pp 149–186. Cambridge, MA: Academic.
  56. ↵
    Schriver BJ, Bagdasarov S, Wang Q (2018) Pupil-linked arousal modulates behavior in rats performing a whisker deflection direction discrimination task. J Neurophysiol 120:1655–1670. https://doi.org/10.1152/jn.00290.2018 pmid:29995602
    OpenUrlPubMed
  57. ↵
    Teoh YY, Yao Z, Cunningham WA, Hutcherson CA (2020) Attentional priorities drive effects of time pressure on altruistic choice. Nat Commun 11:3534. https://doi.org/10.1038/s41467-020-17326-x
    OpenUrl
  58. ↵
    Tune S, Alavash M, Fiedler L, Obleser J (2021) Neural attentional-filter mechanisms of listening success in middle-aged and older individuals. Nat Commun 12:4533. https://doi.org/10.1038/s41467-021-24771-9 pmid:34312388
    OpenUrlCrossRefPubMed
  59. ↵
    Tusche A, Hutcherson CA (2018) Cognitive regulation alters social and dietary choice by changing attribute representations in domain-general and domain-specific brain circuits. Elife 7:e31185. https://doi.org/10.7554/eLife.31185
    OpenUrlCrossRefPubMed
  60. ↵
    Usher M, Cohen JD, Servan-Schreiber D, Rajkowski J, Aston-Jones G (1999) The role of locus coeruleus in the regulation of cognitive performance. Science 283:549–554. https://doi.org/10.1126/science.283.5401.549 pmid:9915705
    OpenUrlAbstract/FREE Full Text
  61. ↵
    Van Kempen J, Loughnane GM, Newman DP, Kelly SP, Thiele A, O’Connell RG, Bellgrove MA (2019) Behavioural and neural signatures of perceptual decision-making are modulated by pupil-linked arousal. Elife 8:e42541. https://doi.org/10.7554/eLife.42541
    OpenUrl
  62. ↵
    van Rij J, Hendriks P, van Rijn H, Baayen RH, Wood SN (2019) Analyzing the time course of pupillometric data. Trends Hear 23:2331216519832483. https://doi.org/10.1177/2331216519832483 pmid:31081486
    OpenUrlPubMed
  63. ↵
    Van Slooten JC, Jahfari S, Knapen T, Theeuwes J (2018) How pupil responses track value-based decision-making during and after reinforcement learning. PLoS Comput Biol 14:e1006632. https://doi.org/10.1371/journal.pcbi.1006632 pmid:30500813
    OpenUrlCrossRefPubMed
  64. ↵
    Vazey EM, Moorman DE, Aston-Jones G (2018) Phasic locus coeruleus activity regulates cortical encoding of salience information. Proc Natl Acad Sci U S A 115:E9439–E9448. https://doi.org/10.1073/pnas.1803716115 pmid:30232259
    OpenUrlAbstract/FREE Full Text
  65. ↵
    Wainstein G, Rojas-Líbano D, Crossley NA, Carrasco X, Aboitiz F, Ossandón T (2017) Pupil size tracks attentional performance in attention-deficit/hyperactivity disorder. Sci Rep 7:8228. https://doi.org/10.1038/s41598-017-08246-w pmid:28811624
    OpenUrlPubMed
  66. ↵
    Weis T, Puschmann S, Brechmann A, Thiel CM (2013) Positive and negative reinforcement activate human auditory cortex. Front Hum Neurosci 7:842. https://doi.org/10.3389/fnhum.2013.00842 pmid:24367318
    OpenUrlPubMed
  67. ↵
    Wendt D, Dau T, Hjortkjær J (2016) Impact of background noise and sentence complexity on processing demands during sentence comprehension. Front Psychol 7:345. https://doi.org/10.3389/fpsyg.2016.00345 pmid:27014152
    OpenUrlPubMed
  68. ↵
    Westgate EC, Wilson TD (2018) Boring thoughts and bored minds: the MAX model of boredom and cognitive engagement. Psychol Rev 125:689–713. https://doi.org/10.1037/rev0000097 pmid:29963873
    OpenUrlCrossRefPubMed
  69. ↵
    Winn MB, Edwards JR, Litovsky RY (2015) The impact of auditory spectral resolution on listening effort revealed by pupil dilation. Ear Hear 36:e153–e165. https://doi.org/10.1097/AUD.0000000000000145 pmid:25654299
    OpenUrlCrossRefPubMed
  70. ↵
    Winn MB, Wendt D, Koelewijn T, Kuchinsky SE (2018) Best practices and advice for using pupillometry to measure listening effort: an introduction for those who want to get started. Trends Hear 22:2331216518800869. https://doi.org/10.1177/2331216518800869 pmid:30261825
    OpenUrlPubMed
  71. ↵
    Zekveld AA, Kramer SE (2014) Cognitive processing load across a wide range of listening conditions: insights from pupillometry. Psychophysiology 51:277–284. https://doi.org/10.1111/psyp.12151 pmid:24506437
    OpenUrlCrossRefPubMed
  72. ↵
    Zekveld AA, Kramer SE, Festen JM (2010) Pupil response as an indication of effortful listening: the influence of sentence intelligibility. Ear Hear 31:480–490. https://doi.org/10.1097/AUD.0b013e3181d4f251 pmid:20588118
    OpenUrlCrossRefPubMed
  73. ↵
    Zekveld AA, Koelewijn T, Kramer SE (2018) The pupil dilation response to auditory stimuli: current state of knowledge. Trends Hear 22:2331216518777174. https://doi.org/10.1177/2331216518777174 pmid:30249172
    OpenUrlCrossRefPubMed
  74. ↵
    Zekveld AA, van Scheepen JAM, Versfeld NJ, Veerman ECI, Kramer SE (2019) Please try harder! The influence of hearing status and evaluative feedback during listening on the pupil dilation response, saliva-cortisol and saliva alpha-amylase levels. Hear Res 381:107768. https://doi.org/10.1016/j.heares.2019.07.005 pmid:31387070
    OpenUrlPubMed
  75. ↵
    Zhang M, Siegle GJ, McNeil MR, Pratt SR, Palmer C (2019) The role of reward and task demand in value-based strategic allocation of auditory comprehension effort. Hear Res 381:107775. https://doi.org/10.1016/j.heares.2019.107775 pmid:31401432
    OpenUrlCrossRefPubMed
  76. ↵
    Zhang M, Palmer CV, Pratt SR, McNeil MR, Siegle GJ (2022) Need for cognition is associated with the interaction of reward and task-load on effort: a verification and extension study. Int J Psychophysiol 180:60–67. https://doi.org/10.1016/j.ijpsycho.2022.07.011 pmid:35931237
    OpenUrlCrossRefPubMed
  77. ↵
    Zhao S, Bury G, Milne A, Chait M (2019) Pupillometry as an objective measure of sustained attention in young and older listeners. Trends Hear 23:2331216519887815. https://doi.org/10.1177/2331216519887815 pmid:31775578
    OpenUrlCrossRefPubMed

Synthesis

Reviewing Editor: Michael Michaelides, NIDA-NIH

Decisions are customarily a result of the Reviewing Editor and the peer reviewers coming together and discussing their recommendations until a consensus is reached. When revisions are invited, a fact-based synthesis statement explaining their decision and outlining what is needed to prepare a revision will be listed below. The following reviewer(s) agreed to reveal their identity: Drew McLaughlin.

Reviewer 1

The present study examined cognitive load via pupillometry during an auditory gap detection task. Of key interest were the effects and interaction of task difficulty (easy vs. hard trials) and motivation (manipulated by reward incentives). Results of the study are, for the most part, in line with a Motivation Intensity Theory account; the pupil data indicated that listeners devoted more cognitive resources under harder conditions and when offered larger rewards.

There is evident motivation for the present study, which nicely adds to the pre-existing literature using a non-linguistic auditory task (whereas prior work has looked at sentence perception, etc.). I believe the study was well-designed and that the results are well-interpreted by the authors. The authors do an excellent job in the discussion of the paper unpacking the complicating factors of pupillometry that many researchers overlook - the pupil response is not just a correlate of objective effort, but rather the expenditure of cognitive resources (which is influenced by motivation among other factors). I particularly like Figure 1, and how the authors frame their predictions. I would characterize all of my comments and suggestions for the authors as minor.

1. Abstract

a. Perhaps reframe the first sentence along the lines of: “Motivation is a crucial component for understanding speech under acoustically demanding conditions.”

2. Analysis

a. I have some concerns that the reduced model introduced on Line 174 is not justified by the omnibus model, where the interaction was non-significant. Perhaps expand on whether this model design was a-priori and/or pre-registered.

b. The approach to analyzing the pupil data is different from what I (and possibly other authors) will be familiar with. I am specifically referring to creating difference curves to test against 0 with FDR correction. It may be worth drawing a connection for the reader to similar approaches for generalized additive models (GAMs), which is a mixed-effects modeling approach growing in popularity. The “itsadug” package in R does something very similar to determine timepoints at which conditions significantly differ (check out van Rij et al., 2015: https://research.rug.nl/en/publications/itsadug-interpreting-time-series-and-autocorrelated-data-using-ga).

3. Results

a. Figure 3: Specify what the vertical gray band indicates (I believe it is the gap window?)

b. Line 339: Do you mean smaller pupil size at baseline?

c. Line 315: The averaged pupil size across the entire window is what many other authors refer to as mean pupil diameter (MPD). It may be helpful to the reader to note this and/or add this terminology.

d. Lines 361-364: I found the characterization of the mediation analysis a bit strange. What does it mean for effects of reward prospect on response time outcomes to be mediated by pupil size? I think it would be helpful to frame this in terms of what causes a larger pupil response (noradrenergic neuromodulation in the locus coeruleus) - this is the actual mediating factor of interest, and it is indexed by the pupil response amplitude.

i. Related to this comment, the paragraph beginning on Page 19 in the Discussion could also be unpacked more for the reader.

e. Figure 4d: The histogram in this figure is too small to interpret. Partly because the colors aren’t very distinct at that size.

4. Discussion

a. I think that one area that could be revised in the discussion is the framing of the outcomes in respect to Motivation Intensity Theory. The current discussion treats the two data types, behavioral versus pupil data, as completely separate; the conclusions by the authors point to the former not supporting MIT and the latter supporting MIT. This framing loses an opportunity to highlight the benefits of having both datatypes from the same experiment. While it’s true that the behavioral outcomes are not as predicted based on MIT, the fact that the pupil data simultaneously does fit with the MIT-based predictions may point to a critical insight. As the author’s state, the pupil response is not “an invariant read-out of listening demand but is equally sensitive to an individual’s motivational state.” Perhaps a crucial insight from the present study is that effects of motivation on allocation of cognitive resources can be observed even when differences in performance do not emerge. In other words, although the behavioral data is not as predicted, the results of the current study point to pupillometry as providing a critical and unique insight to the effect of motivation on listener engagement with a task.

5. Editorial/grammatical notes:

a. Abstract: Remove comma in line 3

b. Significance statement: Line 18 - change “how far” to “how well”, or something along those lines?

c. Line 49: optimizing -> optimize

d. Line 76: remove colon

e. Line 165: “The single-trial accuracy data...”

f. Line 166: “The response time data... are continuous...”

g. Line 362: “in how far” sounds strange here, perhaps “we asked to what degree, if any, the effect of reward...”

h. Figure 4f: It looks like the “Response time” bubble may be cut off in the figure.

i. Line 371: “was stronger” sounds strange here, perhaps instead “pupil dilation was greater”

Author Response

Reviewer #1 The present study examined cognitive load via pupillometry during an auditory gap detection task. Of key interest were the effects and interaction of task difficulty (easy vs. hard trials) and motivation (manipulated by reward incentives). Results of the study are, for the most part, in line with a Motivation Intensity Theory account; the pupil data indicated that listeners devoted more cognitive resources under harder conditions and when offered larger rewards. There is evident motivation for the present study, which nicely adds to the pre-existing literature using a non-linguistic auditory task (whereas prior work has looked at sentence perception, etc.). I believe the study was well-designed and that the results are well-interpreted by the authors. The authors do an excellent job in the discussion of the paper unpacking the complicating factors of pupillometry that many researchers overlook - the pupil response is not just a correlate of objective effort, but rather the expenditure of cognitive resources (which is influenced by motivation among other factors). I particularly like Figure 1, and how the authors frame their predictions. I would characterize all of my comments and suggestions for the authors as minor. RESPONSE: We thank the reviewer for their positive feedback. We respond to each comment individually below. 1. Abstract a. Perhaps reframe the first sentence along the lines of: “Motivation is a crucial component for understanding speech under acoustically demanding conditions.” RESPONSE: We have rewritten this sentence according to the reviewer’s suggestion. 2. Analysis a. I have some concerns that the reduced model introduced on Line 174 is not justified by the omnibus model, where the interaction was non-significant. Perhaps expand on whether this model design was a-priori and/or pre-registered. RESPONSE: According to the Motivation Intensity Theory (Brehm & Self, 1989; Richter et al., 2016) we expected to see a reward effect under hard but not under easy listening (see Figure 1C). Furthermore, we expected performance being on ceiling in the easy conditions. With the minimized model we only wanted to see whether the reward effect is present in the hard condition. We explain this in more detail now in the methods. Page 7: “Our main hypothesis (derived from Motivation Intensity Theory; see Figure 1C) was that, specifically for hard trails, accuracy and response time should depend on the reward-level. Moreover, for the accuracy data, we expected performance being at ceiling for easy trials and we thus decided to elaborate the possible influence of reward in more detail for the hard condition. MOTIVATIONAL STATE AND LISTENING DEMANDS · Author rebuttal · Page 2 Hence, we employed a reduced linear mixed model using data from the two hard conditions (no reward, reward) and tested for the effect of reward prospect on accuracy and response time.” b. The approach to analyzing the pupil data is different from what I (and possibly other authors) will be familiar with. I am specifically referring to creating difference curves to test against 0 with FDR correction. It may be worth drawing a connection for the reader to similar approaches for generalized additive models (GAMs), which is a mixed-effects modeling approach growing in popularity. The “itsadug” package in R does something very similar to determine timepoints at which conditions significantly differ (check out van Rij et al., 2015: https://research.rug.nl/en/publications/itsadug-interpreting-time-series-and-autocorrelated data-using-ga). RESPONSE: Thank you for bringing up the generalized additive models. We would like to note that it is relatively common, especially in neurophysiological analyses, to use difference time courses and testing them against each other and/or against zero (Herrmann et al., 2023; Krebs et al., 2013; Limbach & Corballis, 2016; Magosso et al., 2019). The time-resolved analysis of the pupil size data provides an elaboration of the model analysis described just prior to the time-course analysis, where we averaged the pupil size data over time. The analysis does thus not aim to reveal a new effect, just when the effect occurs in time. Critically, we use FDR to correct for multiple comparisons, which is a common approach to control the risk of false positives at a pre specified level. 3. Results a. Figure 3: Specify what the vertical gray band indicates (I believe it is the gap window?) RESPONSE: We had previously specified this only at the end of the figure caption. It is now stated for each panel in which we display the gray band. b. Line 339: Do you mean smaller pupil size at baseline? RESPONSE: We meant a smaller pupil size in the 500-ms time window right before the gap. For this analysis, we locked the single-trial pupil-size data to the gap onset time. The gap-locked pupil size data were still baseline-corrected relative to the time window prior to noise onset (-1.6 to - 1.1 s), as for the data time-locked to noise onset. Page 14 “We investigated whether a smaller pupil size prior to the gap is associated with a higher likelihood of missing a gap. To this end, we time-locked the pupil-size data of each trial to the respective gap time.” MOTIVATIONAL STATE AND LISTENING DEMANDS · Author rebuttal · Page 3 c. Line 315: The averaged pupil size across the entire window is what many other authors refer to as mean pupil diameter (MPD). It may be helpful to the reader to note this and/or add this terminology. RESPONSE: We now include this term in the method section. Page 8: “Non-baseline corrected pupil-size data were averaged across the time window ranging from 2.2 s (onset of gap window) to 6.2 s (end of trial) and used as dependent variable (in the literature referred to as mean pupil diameter (MPD)).” d. Lines 361-364: I found the characterization of the mediation analysis a bit strange. What does it mean for effects of reward prospect on response time outcomes to be mediated by pupil size? I think it would be helpful to frame this in terms of what causes a larger pupil response (noradrenergic neuromodulation in the locus coeruleus) - this is the actual mediating factor of interest, and it is indexed by the pupil response amplitude. RESPONSE: This is very helpful feedback, and we of course agree: the pupil size itself is not a potential mediator but only a proxy of the true potential mediator, the LC-NE system. We will elaborate more on this point now in the discussion. See next point. i. Related to this comment, the paragraph beginning on Page 19 in the Discussion could also be unpacked more for the reader. RESPONSE: In the revised manuscript, we describe more explicitly how the mediation analysis can be interpreted. Page 19: “Previous work has suggested a link between LC-NE activity and performance increases (Aston-Jones et al., 1994; Aston-Jones & Cohen, 2005; Usher et al., 1999), which may be reflected in the pupil size. Our mediation analysis revealed that a small part of the influence of reward on response time is mediated via pupil size. Hence, only a small portion of the behavioral benefit under reward can be explained via enhanced LC-NE activity that is indexed by pupil-size changes. Our data suggest that most of the reward effect directly impacts behavioral performance or via pathways different from the LC-NE pathway.” e. Figure 4d: The histogram in this figure is too small to interpret. Partly because the colors aren’t very distinct at that size. RESPONSE: We changed the size of the histogram in the figure. 4. Discussion a. I think that one area that could be revised in the discussion is the framing of the outcomes in respect to Motivation Intensity Theory. The current discussion treats the two data types, MOTIVATIONAL STATE AND LISTENING DEMANDS · Author rebuttal · Page 4 behavioral versus pupil data, as completely separate; the conclusions by the authors point to the former not supporting MIT and the latter supporting MIT. This framing loses an opportunity to highlight the benefits of having both datatypes from the same experiment. While it’s true that the behavioral outcomes are not as predicted based on MIT, the fact that the pupil data simultaneously does fit with the MIT-based predictions may point to a critical insight. As the author’s state, the pupil response is not “an invariant read-out of listening demand but is equally sensitive to an individual’s motivational state.” Perhaps a crucial insight from the present study is that effects of motivation on allocation of cognitive resources can be observed even when differences in performance do not emerge. In other words, although the behavioral data is not as predicted, the results of the current study point to pupillometry as providing a critical and unique insight to the effect of motivation on listener engagement with a task. RESPONSE: We included this important point into the discussion section. Page 18: “Not least the present results hold an important lesson for the increasingly common use of pupil dilation as a clinical measure. Pupil dilation cannot serve as a direct readout of purely physical listening demand. It must be taken into account that the internal motivational state of a person is not only influencing pupil dilation directly. Motivational state also gates or constrains the size of the effects that listening demand will exert on pupil size. Furthermore, a person’s motivational state can affect their pupil size even when influences on behavioral performance do not emerge.” 5. Editorial/grammatical notes: a. Abstract: Remove comma in line 3 b. Significance statement: Line 18 - change “how far” to “how well”, or something along those lines? c. Line 49: optimizing -> optimize d. Line 76: remove colon e. Line 165: “The single-trial accuracy data...” f. Line 166: “The response time data... are continuous...” g. Line 362: “in how far” sounds strange here, perhaps “we asked to what degree, if any, the effect of reward...” h. Figure 4f: It looks like the “Response time” bubble may be cut off in the figure. i. Line 371: “was stronger” sounds strange here, perhaps instead “pupil dilation was greater” RESPONSE: We performed these edits as suggested by the reviewer. Literature Brehm, J. W., & Self, E. A. (1989). The intensity of motivation. Annual Review of Psychology, 40(1973), 109-131. https://doi.org/10.1146/annurev.ps.40.020189.000545 Herrmann, B., Maess, B., Henry, M. J., Obleser, J., & Johnsrude, I. S. (2023). Neural signatures of MOTIVATIONAL STATE AND LISTENING DEMANDS · Author rebuttal · Page 5 task-related fluctuations in auditory attention and age-related changes. NeuroImage, 268, 119883. https://doi.org/10.1016/j.neuroimage.2023.119883 Krebs, R. M., Boehler, C. N., Appelbaum, L. G., & Woldorff, M. G. (2013). Reward Associations Reduce Behavioral Interference by Changing the Temporal Dynamics of Conflict Processing. PLoS ONE, 8(1). https://doi.org/10.1371/journal.pone.0053894 Limbach, K., & Corballis, P. M. (2016). Prestimulus alpha power influences response criterion in a detection task. Psychophysiology, 53(8), 1154-1164. https://doi.org/10.1111/psyp.12666 Magosso, E., De Crescenzio, F., Ricci, G., Piastra, S., & Ursino, M. (2019). EEG alpha power is modulated by attentional changes during cognitive tasks and virtual reality immersion. Computational Intelligence and Neuroscience, 2019. https://doi.org/10.1155/2019/7051079 Richter, M., Gendolla, G. H. E., & Wright, R. A. (2016). Three Decades of Research on Motivational Intensity Theory. 149-186. https://doi.org/10.1016/bs.adms.2016.02.001

Back to top

In this issue

eneuro: 10 (12)
eNeuro
Vol. 10, Issue 12
December 2023
  • Table of Contents
  • Index by author
  • Masthead (PDF)
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Pupil Size Sensitivity to Listening Demand Depends on Motivational State
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Pupil Size Sensitivity to Listening Demand Depends on Motivational State
Frauke Kraus, Jonas Obleser, Björn Herrmann
eNeuro 21 November 2023, 10 (12) ENEURO.0288-23.2023; DOI: 10.1523/ENEURO.0288-23.2023

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
Pupil Size Sensitivity to Listening Demand Depends on Motivational State
Frauke Kraus, Jonas Obleser, Björn Herrmann
eNeuro 21 November 2023, 10 (12) ENEURO.0288-23.2023; DOI: 10.1523/ENEURO.0288-23.2023
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Acknowledgments
    • Footnotes
    • References
    • Synthesis
    • Author Response
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • cognitive demand
  • gap detection
  • motivation intensity theory
  • pupil dilation
  • reward prospect

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Article: New Research

  • Independent encoding of orientation and mean luminance by mouse visual cortex
  • Neck Vascular Biomechanical Dysfunction Precedes Brain Biochemical Alterations in a Murine Model of Alzheimer’s Disease
  • Alpha-2 Adrenergic Agonists Reduce Heavy Alcohol Drinking and Improve Cognitive Performance in Mice
Show more Research Article: New Research

Cognition and Behavior

  • Neck Vascular Biomechanical Dysfunction Precedes Brain Biochemical Alterations in a Murine Model of Alzheimer’s Disease
  • Spontaneous oscillatory activity in episodic timing: an EEG replication study and its limitations
  • Neural Signatures of Engagement and Event Segmentation during Story Listening in Background Noise
Show more Cognition and Behavior

Subjects

  • Cognition and Behavior
  • Home
  • Alerts
  • Follow SFN on BlueSky
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Notice
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2026 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.