Abstract
Previous studies emphasize the importance of prestimulus neural oscillations in shaping endogenous brain states that substantially impact perceptual outcomes. However, what features in such oscillations drive perception remains unknown. Furthermore, research has shown that non-oscillatory activity is also important for cognitive processing. However, their interaction prior to perceiving a multisensory stimulus remains unexplored. In this human EEG study (n = 18, 10 males and 8 females), we investigated the role of prestimulus periodic power and aperiodic activity in modulating perception of the widely studied McGurk illusion on a trial-by-trial basis. Using logistic mixed-effects models, we reveal that the illusion perception is associated with reduced prestimulus alpha (8–12 Hz) and beta (15–30 Hz) power over frontal and occipital regions; increased theta (4–7 Hz) power in parietal, central, and occipital regions; and increased gamma (31–45 Hz) power across the scalp. Furthermore, lower aperiodic offset and exponent values in central, parietal, and occipital regions also predicted illusory responses. Our logistic mixed interaction models revealed that the aperiodic exponent and periodic power jointly influence the perception of upcoming McGurk stimuli. Specifically, a decrease in occipital theta and global beta power and an increase in occipital and parietal gamma power were associated with a steeper slope. We conclude that the predominant source of variations in the prestimulus state is the aperiodic activity and that fluctuations in both periodic and aperiodic activity account for inter-trial variability in the perception of the McGurk illusion.
- aperiodic activity
- AV speech perception
- EEG
- McGurk
- logistic mixed-effects model
- periodic power
- prestimulus
Significance Statement
Prestimulus brain oscillations and aperiodic activity are fundamental to understanding individual perceptual and cognitive processing during multisensory speech perception. However, during multisensory integration between auditory and visual streams, how periodic and aperiodic activity sculpts inter-individual and inter-trial differences in multisensory perception remains largely unknown. In this EEG study, we discovered that lower aperiodic offset and exponent (slope) values in central, parietal, and occipital regions predicted illusory responses. Using statistical interaction models, we further show that the mechanisms of susceptibility to illusory speech perception arise from the significant interplay between aperiodic background activity and oscillatory features. This interplay between periodic and aperiodic activity accounts for inter-trial variability in the perception of the McGurk illusion.
Introduction
Perception is driven by incoming stimuli and brain's internal state (Van Dijk et al., 2008). The role of internal state in shaping perception has been investigated in both unisensory (Kayser et al., 2016; Krasich et al., 2022) and multisensory domains (Kaiser et al., 2019; Rohe et al., 2019). Evidences suggest that spontaneous neural (prestimulus) activity is related to variability in multisensory speech perception (Keil et al., 2012; Spadone et al., 2020). The significance of prestimulus activity in perception has been studied by examining the oscillatory properties including spectral power and phase synchrony in canonical frequency bands (Senkowski and Engel, 2024). However, studies suggest that traditional oscillatory activity is often confounded with aperiodic (scale-free) activity that complicates the computation and interpretation of spectral properties across frequency bands (Donoghue et al., 2020a). The aperiodic activity follows a 1/f-like distribution with exponentially decreasing power across increasing frequencies (Miller et al., 2009; He, 2014). It can further be parameterized with a slope (exponent) and intercept (offset) parameter (Donoghue et al., 2020b). These two parameters are known to reflect certain physiological information. Offset has been associated with underlying neuronal spiking (Manning et al., 2009) and exponent with excitation–inhibition (E/I) balance (Gao et al., 2017). Aperiodic components have also been linked to distinct cognitive states (Podvalny et al., 2015) that serve as neural makers for neuropathologies (Levin et al., 2020; Belova et al., 2021) and aging (Voytek et al., 2015; Thuwal et al., 2021). Although often considered distinct from oscillatory activity, some evidence suggests that both might originate from the same underlying neuronal population dynamics. Gireesh and Plenz (2008) demonstrated that neuronal avalanches in developing cortical networks organize as nested theta and beta/gamma oscillations. Similarly, Haegens et al. (2011) reported that alpha (8–14 Hz) oscillations are negatively correlated with neuronal spiking rates in the somatosensory cortex, underscoring their interdependence. Together, these findings suggest that oscillations and aperiodic activity, whether originating from shared or distinct mechanisms, represent complementary facets of brain dynamics whose balance might shape cognition.
Even within oscillatory (periodic) activity, parameters such as center frequency, aperiodic adjusted power, and bandwidth carry meaningful cognitive information. For instance, shifts in alpha center frequency are associated with increased cognitive demands (Haegens et al., 2014) and perceptual influences (Mierau et al., 2017). Although the role of bandwidth remains underexplored, studies suggest that together with center frequency, it provides insights into neural synchrony and circuit dynamics (Cole and Voytek, 2019). Lowet et al. (2022) suggest that variability in oscillatory parameters might reflect changes in neural computations across cognitive tasks. Therefore, examining changes in these parameters are crucial for understanding cognitive differences. However, in multisensory perception, it remains unclear how prestimulus periodic and aperiodic activity together influence perception in lower-level areas (occipital and temporal) and possibly higher-level areas (frontal and parietal). Therefore, this study aimed to identify the separate and interactive contributions of prestimulus periodic power and aperiodic components to perceptual integration.
Studies on multisensory speech perception have predominantly employed the McGurk effect, wherein participants report an illusory percept when presented with a mismatching AV stimulus (e.g., auditory /ba/ dubbed over visual /ga/ producing /da/; McGurk and MacDonald, 1976). The likelihood of illusory perception varies across individuals and is not consistent across trials (Nath and Beauchamp, 2012; Kumar et al., 2020). This makes it a lucrative paradigm for investigating the role of prestimulus activity in multisensory perception. Keil et al. (2012) reported an increased beta power (14–30 Hz) in the left superior temporal gyrus (lSTG), precuneus, and right frontal cortex before perceiving the McGurk illusion. However, the difference in the spectral power were computed without separating the aperiodic activity from the periodic power and activity changes were observed for averaged trials, limiting insights into trial-by-trial perceptual variability.
Summing up a consensus emerges that there lacks a comprehensive understanding of how prestimulus spectral features from different brain regions influence trial-wise perceptual responses. To address this, we reanalyzed EEG data from Kumar et al. (2020), examining the differences in prestimulus periodic and aperiodic components between illusory and nonillusory McGurk trials, thereby capturing trial-wise variability. We used logistic mixed-effects models to interpret which spectral parameters, periodic (center frequency, aperiodic adjusted power, bandwidth) and aperiodic (offset, exponent), predicted the perceptual outcome across sensor regions. We also fitted logistic mixed-effects interaction models to understand the synergistic relationship between prestimulus aperiodic adjusted power and aperiodic exponent that affected subsequent illusory percept at single trial level. Overall, we showcase how prestimulus periodic power and aperiodic activity distinctly and together index the veracity and experience of multisensory illusory speech perception.
Materials and Methods
Participants
For this study, we have used previously recorded EEG and behavioral data described in Kumar et al. (2020), where 18 right-handed healthy participants (eight females) with a mean age of 24.9 (SD = 2.8 years) were recruited for the study. All participants had normal or corrected-to-normal vision and had normal hearing. A written informed consent was obtained from the participants under the experimental protocol approved by the Institutional Human Ethics Committee (IHEC).
Stimulus design and trials
Each participant viewed 600 trials of videos of a male annotator (matched by the geographic region and demographic background of the participants) articulating the syllables /pa/, /ta/, and /ka/. Participants were instructed to report their subjective perceptions of the audiovisual (AV) stimuli. The experiment included four types of AV stimuli: three congruent (audio syllables matching video articulation) syllables /pa/, /ta/, and /ka/ and one McGurk (audiovisual mismatch) syllable (auditory /pa/ with visual /ka/), which created the illusion of syllable /ta/. Participants responded to the four randomly presented stimuli by reporting whether they heard /pa/, /ta/, /ka/, or “something else,” unaware of the incongruent McGurk trial. The experiment was carried out for five blocks, where each block consisted of 120 trials (30 trials of each AV stimuli presented at random). A total of 150 incongruent McGurk trials out of 600 trials were presented to each participant. Interstimulus intervals were pseudorandomly varied between 1,200 ms (milliseconds) snd 2,800 ms to minimize expectancy effects (Minor, 1970).
EEG data acquisition and preprocessing
A Neuroscan EEG recording and acquisition system (SynAmps2, Compumedics) with 64 Ag/AgCl scalp electrodes molded on an elastic cap in a 10–20 montage was used to collect continuous EEG scans, where the reference electrode was located near to Cz. The sampling rate was 1,000 Hz, and the channel impedances were kept below 10 kΩ.
The continuous EEG data collected was reanalyzed in the current study. Before analysis, raw EEG data were preprocessed using EEGLAB toolbox (Delorme and Makeig, 2004) and custom MATLAB codes (version R2020a, https://www.mathworks.com/). A linear (zero-phase) noncausal finite impulse response (FIR) filter was applied at 0.1 and 80 Hz to the data, followed by a ninth-order two-pass Butterworth filter (notch filter) between 46 and 54 Hz to remove the line noise. Any bad channels present—defined by extreme amplitude fluctuations or high variance or flatlines—were removed by visual inspection and interpolated using spherical spline interpolation (Perrin et al., 1989). Mean number of channels interpolated was 0.17 (SD = 0.38). The data was then average rereferenced. Eyeblinks and muscle artifact components from the signal were identified and removed through a systematic independent component analysis (ICA) pipeline. First ICA was applied on the continuous EEG signals using the runica function in EEGLAB. Then, each ICA component was classified into different categories of signals such as the eye, muscle, brain, and others by using an automatic EEG independent component classifier called ICLabel (Pion-Tonachini et al., 2019). Finally, components labeled eye and muscle with a probability of 0.9 and above were flagged and then removed. Mean number of ICA components removed was 1.95 (SD = 2.1).
Epochs of 0.8 s (−0.8 to 0 s) before the McGurk stimulus onset (prestimulus) and 0.8 s post the onset (poststimulus) were extracted (Fig. 2A). The extracted epoch data were then sorted based on illusory /ta/ and nonillusory /pa/ responses to the McGurk stimuli. The sorted prestimulus and poststimulus epochs were then baseline corrected by removing the temporal mean of the EEG signal on an epoch-by-epoch basis. Finally, to remove response contamination from any other artefacts, epochs with amplitudes above and/or below ±100 µV were removed from all electrodes. The mean number of epochs in the prestimulus duration discarded after thresholding was 17.69 (SD = 24.8) and mean number of epochs discarded in the poststimulus duration after thresholding was 9.17 (SD = 19.06).
Spectral analysis
We computed the spectral power at each electrode on a trial-by-trial basis using the multitaper fast Fourier transform (mtfft) method, for both the prestimulus and the poststimulus epoch data. Power spectra were computed to extract the distribution of signal power over different frequency bands [theta (4–7 Hz), alpha (8–12 Hz), beta (15–30 Hz), and gamma (31–45 Hz)] for the incongruent McGurk stimulus condition using the Chronux toolbox function “mtspectrumc.m” (Bokil et al., 2010) and customized MATLAB codes. The time bandwidth product and the number of tapers used were set to 3 and 5, respectively.
Extracting the periodic and aperiodic components from the power spectral densities
To separate the background 1/f aperiodic component from its periodic counterpart, we used modified version of SpecParam v1.0.0 (previously called FOOOF or Fitting Oscillations and One Over f, https://github.com/fooof-tools/fooof) algorithm (Donoghue et al., 2020b). The algorithm takes the original power spectral densities (PSDs), extracts the aperiodic signal, and superimposes them on periodic components, referred to as “peaks.” These peaks are oscillations and are modeled individually as Gaussian functions. Each of these Gaussians has three parameters that are used to define a periodic oscillation: center frequency (CF) or peak frequency, power (PW) of the peak—which is the distance between the peak of the Gaussian and the aperiodic fit—and bandwidth (BW) as two standard deviations. The aperiodic component is defined by two parameters: exponent or negative slope (Freeman and Zhai, 2009) and offset, which is the y-intercept of the model fit across frequencies (Manning et al., 2009).
The periodic and the aperiodic components were extracted from the prestimulus and poststimulus power spectra of each electrode, trial-wise using the MATLAB implementation of SpecParam/FOOOF (version 1.0.0; Donoghue et al., 2020b). The following model fit settings were applied: peak width limits, 0.512 Hz; maximum number of peaks, infinite; minimum peak height, 0.0; peak threshold, 2.0 SDs; aperiodic mode, fixed; and evaluated frequency range, 1–45 Hz. The frequency range was chosen based on model fit results with different algorithm settings on a single participant data (see Extended Data Fig. 3-2 for more details). The aperiodic fit estimated by the model was computed in logarithmic space. So, to extract the periodic power, aperiodic fit was transformed back to linear space and subtracted from the original power spectrum (Gyurkovics et al., 2022). The topographical distributions of both periodic parameters (center frequency, peak power, and bandwidth) and aperiodic parameters (offset and exponent), as well as the mean periodic power across different frequency bands, were examined.
Statistical analysis
At the group level
To examine group-level differences in spectral power (both prestimulus and poststimulus) between illusory (/ta/) and nonillusory (/pa/) trials across frequency bands, a repeated-measures analysis of variance (ANOVA) was performed using MATLAB's ranova function. The analysis employed a 2 (condition: /ta/, /pa/) × 4 (frequency: theta, alpha, beta, gamma) within-subject design. For each participant, mean power values were computed for each frequency bands and condition, and these were fit into the repeated-measures ANOVA model. One participant with 100% illusory /ta/ McGurk susceptibility was excluded from the group-level pairwise analysis. Sphericity was evaluated using Mauchly's test, and Greenhouse–Geisser corrections for p values (pGG) were applied when the assumption of sphericity was violated. Partial eta-squared (η2) was calculated to estimate effect sizes for main and interaction effects. Post hoc pairwise comparisons between condition within each frequency band were performed using Tukey’s HSD test for multiple-comparison corrections. To further dissociate condition effects on periodic power and aperiodic components, a similar repeated-measures ANOVA was conducted on parameterized spectral features. This analysis used a 2 (condition: /ta/, /pa/) × 6 (component: mean theta, alpha, beta, gamma power; aperiodic offset, aperiodic exponent) within-subjects design. The same procedures for sphericity assessment, correction, effect size estimation, and post hoc testing were applied as in the power spectral analysis.
At the sensor level
To assess differences between illusory /ta/ and nonillusory /pa/ conditions at the sensor level, we used a mass univariate linear mixed-effects regression (LMER) approach implemented in the lme4 package (v1.1.32; Bates et al., 2015) in R (R Core Team, 2022). A separate model was fit at each sensor, with the spectral feature of interest—mean periodic power across frequency bands, aperiodic offset, or aperiodic exponent—modeled as a function of Condition (fixed effect), while including Subject ID as a random intercept to account for repeated measures and interindividual variability. The model formula was as follows:
Predicting response to the McGurk trials from prestimulus parameterized power
We were interested to estimate which prestimulus periodic parameters (CF, PW, BW) and aperiodic parameters (offset and exponent) predicted behavioral responses to the upcoming McGurk trials across different sensor regions (Table 1). To do this, we used several logistic mixed models to analyze both whole-brain predictors and sensor region-specific predictors that significantly explained the behavior. The mixed models were fitted using the “glmer” function from the “lme4” package v.1.1.32 (Bates et al., 2015) in R (R Core Team, 2022), with p values provided by “lmerTest” v.3.1.3 package (Kuznetsova et al., 2017). To assess topographical variations in these periodic and aperiodic parameters, and for better overall model fit, the entire EEG scalp was further divided into six distinct, nonoverlapping sensor regions: frontal (14 sensors), central (14 sensors), parietal (14 sensors), occipital (14 sensors), left temporal (5 sensors), and right temporal (5 sensors; Table 1). To control for multiple comparisons across all frequency models, p values of all the model predictors were adjusted using the Benjamini–Hochberg method (Benjamini and Hochberg, 1995), maintaining the FDR at α = 0.05. The two possible behavioral outcomes (i.e., illusion vs nonillusion /pa/) were chosen as the dependent variable. The model fitted parameters were estimated using bound optimization by quadratic approximation (BOBYQA) optimizer with a set maximum of 200,000 iterations (Powell, 2009). The periodic parameters (center frequency CF, aperiodic adjusted peak power PW, and bandwidth BW) and aperiodic parameters (offset, exponent) were chosen as fixed effects (or predictors) of the models. Since, we were interested to determine features that could individually predict the behavior, we assumed that the effect of each predictor on the dependent variable is independent of other predictors in the model. Moreover, to account for the interindividual variability, we used subject ID as the random effect. The fixed effects were mean-centered around zero (Hox et al., 2017). The general formula was defined as follows:
List of sensors categorized into respective sensor regions
Furthermore, to quantify the evidence of each predictor in the estimated logistic mixed models, we computed the Bayes factor (BF) of each independent predictor. We first conducted Bayesian analysis using the “brms” Stan package (version 2.21.0; Bürkner, 2017; Carpenter et al., 2017) in R. The Bayesian generalized (non)linear multivariate multilevel (or brm) model was fitted using weakly informative priors to avoid overfitting. Specifically, we used normal priors with a mean of 0 and a standard deviation of 3 for the intercept; followed by a normal prior with a mean of 0 and a standard deviation of 1 for fixed effects. Priors of the random effects were specified as half-Cauchy distributions with a location parameter of 0 and a scale parameter of 0.1. The model was estimated using Markov chain Monte Carlo (MCMC) sampling with four chains, each running for 2,000 iterations. To assess the strength of evidence for each predictor, we computed Bayes factors using the “bayes_factor” function within the “brms” package. Bayes factors were calculated by comparing brm models with and without each predictor of interest. Specifically, for each predictor, we compared the full model, which included all predictors, to a reduced model where the predictor of interest was removed. Following the guidelines by Jeffreys (1939) and Lee and Wagenmakers (2014), we interpreted Bayes factors as follows: a Bayes factor (BF) greater than 3 was considered moderate evidence in favor of the model with the predictor, a BF greater than 10 was considered strong evidence, and a BF greater than 100 was considered extreme evidence in favor of the model with the predictor. Conversely, BFs less than 1/3, 1/10, and 1/100 were interpreted as moderate, strong, and decisive evidence against the model with the predictor, respectively.
Correlation analysis
The association between prestimulus mean aperiodic adjusted periodic power (across frequency bands) and aperiodic parameters (offset and exponent) was examined using Spearman rank correlation. The mean aperiodic adjusted periodic power was obtained by subtracting the aperiodic fit in linear space from the original untransformed spectrum (Gyurkovics et al., 2022). Correlations were computed for illusory /ta/ and nonillusory /pa/ trial conditions separately. To compute these associations, channel and trial-averaged aperiodic adjusted periodic power and aperiodic parameters were computed for each participant. Participants with zero trials in a given condition were excluded from that condition's correlation analysis.
Logistic mixed-effects interaction models
The association between prestimulus aperiodic slope (or exponent) and aperiodic adjusted periodic power in predicting the response to upcoming stimulus was computed using logistic mixed-effects interaction models (Fisher, 1992). The model was applied across different frequency bands (theta, alpha, beta, and gamma). The frequency-wise models had the following general formula:
A schematic representation of the data processing and analyses pipeline. A, Neurophysiological signals were recorded from 18 participants during the McGurk task using a 64-channel EEG system. B, Prestimulus EEG data were extracted from a time window spanning −800 to 0 ms relative to stimulus onset. C, The raw prestimulus EEG data was filtered, rereferenced, and subjected to independent component analysis (ICA) for artefact (eye and muscle) removal. D, PSDs were estimated for all sensors and trials for all participants during illusory and nonillusory McGurk trial conditions. E, Parameterization of the PSDs to estimate aperiodic and periodic activity using SpecParam (earlier called FOOOF) model. From the model, periodic parameters, center frequency, peak power, and bandwidth, and aperiodic parameters, offset and exponent, were estimated for all 64 sensors, across different frequency bands: theta (4–7 Hz), alpha (8–12 Hz), beta (15–30 Hz), and gamma (31–45 Hz). The parameters from all 64 EEG sensors were classified into six sensor regions: frontal, central, parietal, occipital, left temporal, and right temporal (see topoplot). F, These periodic and aperiodic parameters were subsequently fitted as predictors in sensor region-wise logistic mixed-effects models to predict the behavioral response. To counter for intersubject variability, subject ID was put as random effect in the model. G, Bayes factor analysis was performed as a post hoc test to validate the evidence of significant predictors estimated in the regression model. H, Aperiodic adjusted power was estimated by subtracting the original PSD from aperiodic fit in linear space, a.u., arbitrary units. I, Frequency-wise logistic mixed-effects interaction models were fitted with frequency power (aperiodic adjusted), aperiodic exponent, and sensor regions as predictors. Subject ID was chosen as the random effect.
Code accessibility
The raw EEG data used in this study were reanalyzed from a previous study published by Kumar et al. (2020). Therefore, access to the raw EEG data can be obtained by contacting the corresponding authors of the original study. The preprocessed EEG data and all the relevant codes used in this study are available in the provided Open Science Framework (OSF) link. The code was implemented on a Dell Precision 5820 Tower using a Windows 10 Pro for Workstations operating system. The code/software described in the paper is freely available online at https://osf.io/degzm. The code is available as Extended Data.
Data 1
MATLAB and R codes for implementing FOOOF to parameterize power spectra and for fitting logistic mixed-effects models, respectively. Download Data 1, ZIP file.
Results
Variability in illusory percept observed within and between participants
We observed a high degree of interindividual variability in McGurk susceptibility (Fig. 2Bi). McGurk susceptibility for each participant was quantified as the total percentage of illusory /ta/ responses to the McGurk stimulus. Across the 18 participants, the percentage of illusory responses ranged widely from 4 to 100% (indicating an illusory percept on every McGurk trial), with a median response of 60.83%. We also calculated the response tendency (intertrial variability), which is the relative proportion of illusory responses for all the McGurk trials across all 18 participants (Bechtold, 2016). For incongruent McGurk condition, illusion /ta/ was perceived in 58.38% (SD = 32.5) of trials, whereas unisensory /pa/ (auditory) was perceived in 37.39% (SD = 31.94) and unisensory /ka/ (visual) in 1.68% (SD = 4.97) of total trials presented across all participants. The remaining 2.55% of trials were either responded as “something else” (1.53% of total trials, SD = 2.85) or no response at all (1.02% of trials). To statistically compare response distributions, we performed nonparametric Friedman's test, which revealed a significant difference among the three response categories (χ2(2) = 23.35, p < 0.001, Kendall's W = 0.65) to McGurk stimulus. Post hoc pairwise comparisons using the multcompare function in MATLAB with the Tukey’s HSD correction for multiple comparisons revealed no significance (p = 0.827) between the number of illusion and unisensory /pa/ (auditory) trials. However, significant differences between illusion /ta/ and unisensory /ka/ (visual; p < 0.001) and between unisensory /pa/ and unisensory /ka/ (p < 0.001) was observed (Fig. 2Bii). Because very few participants (8 out of 18 participants) reported the visual /ka/ response to the McGurk stimulus, we considered the auditory /pa/ response (17 out of 18 participants) as the nonillusory trial condition for subsequent analyses. This choice allowed for a more reliable within-subject comparison between illusory /ta/ and nonillusory /pa/ perception across trials. Response tendency was also estimated for the congruent trial conditions where participants correctly responded to congruent AV stimuli in 96.54% (SD = 2.66) of trials. We then proceeded with analyzing the electrophysiological data for both illusory /ta/ and nonillusory /pa/ McGurk trials.
Prestimulus window relative to trial timing and behavioral results. A, Example trial depicting three video frames from a video used in this experiment and intertrial interval (fixation cross) pseudorandomly varied between 1,200 and 2,800 ms. The red box indicates the 800 ms prestimulus epoch used in this study. B, Behavioral results: i, Bar graph representing interindividual variability–propensity of McGurk effect across 18 participants expressed as the percentage of /ta/ percept during the presentation of the McGurk stimulus. Dark blue represents participants below the median percentage of illusory response (or less prone), and light blue represents participants above the median percentage of illusory response (or more prone). ii, Violin plot showing intertrial variability–percentage of /ta/ (illusory), unisensory /pa/ (auditory), and unisensory /ka/ (visual) percept during the presentation of McGurk stimulus and the congruent AV stimuli (/pa/, /ta/, and /ka/). The white dot in the center of each violin plot represents the median. Significance levels are denoted by asterisks: ****p ≤ 0.0001 and n.s. is not significant.
No group-level differences observed in periodic power and aperiodic activity between illusory and nonillusory McGurk trials
A repeated-measures ANOVA with a 2 (condition: /ta/ and /pa/) × 4 (frequency: theta, alpha, beta, gamma) within-subjects design was computed to analyze power spectrum differences between illusory and nonillusory response conditions. During prestimulus duration, the analysis revealed no significant interaction between condition and frequency (F(3,48) = 0.239, pGG = 0.868, partial η2 = 0.015). However, post hoc pairwise comparison with Tukey’s HSD multiple correction revealed significantly higher beta power (mean difference = 0.34, 95% CI = [0.015, 0.662], p = 0.041, Cohen's D = 0.539) and higher gamma power (mean difference = 0.575, 95% CI = [0.229, 0.921], p = 0.003, Cohen's D = 0.854) before illusory perception (Fig. 3Aa). For poststimulus duration, the repeated-measures ANOVA revealed no significant interaction between condition and frequency (F(3,48) = 0.529, pGG = 0.571, partial η2 = 0.032). Post hoc tests revealed no significant condition differences across frequency bands (Fig. 3Ab). Consistent with previous findings by Keil et al. (2012), we observed higher prestimulus beta band power associated with illusory perception; however, we did not observe a reduction in theta power following illusory perception.
Grand-averaged PSDs and parameterized power distributions. A, Group-averaged PSDs computed from subject-specific spectra, averaged over all sensors and trials within each participant. Mean power spectra are shown for illusory (red) and nonillusory (blue) McGurk trials, with shaded areas representing the standard error of the mean (SEM), separately for the (a) prestimulus and (b) poststimulus periods. B, Prestimulus power distributions derived from parameterized spectra. Panel a shows grand-averaged periodic (oscillatory) power, averaged over all sensors and trials within each participant for illusory (red) and nonillusory (blue) conditions, with SEM shading. Panel b displays box-and-whisker plots overlaid with density distributions for the aperiodic parameters—offset (top) and exponent (bottom)—averaged over sensors and trials within each participant, grouped by perceptual condition (red, illusory; blue, nonillusory). All extended analysis of parametrized power distributions after perception of McGurk trials indicating interindividual (or group-level) variability is shown as Extended Data Figure 3-1 and two different model algorithms were fit on one participant prestimulus McGurk data is shown as Extended Data Figure 3-2.
Figure 3-1
Parametrized power distributions after perception of McGurk trials indicating inter-individual (or group-level) variability. (A) Poststimulus periodic power distributions for the illusory (red) versus non-illusory (blue) McGurk trials with SEM as shaded region. (B) Box and whisker plots with density distributions for comparisons between poststimulus aperiodic offset (above) and exponent (below), by response conditions (red – illusory, and blue – non-illusory). Download Figure 3-1, TIF file.
Figure 3-2
Tuning the SpecParam (or FOOOF) algorithm for better fit. To estimate the best model parameters, two different model algorithms were fit on one participant prestimulus McGurk data (n = 75). For model #1 (red), default fit parameters were chosen with frequency range of 1 – 45 Hz. For model #2 (blue), the frequency range chosen was 3 – 45 Hz. No significant difference between (A) aperiodic exponent estimated by the two model algorithms were observed (Z = -0.0113, p = .9910, Cohen’s D = 0.0837). (B) Visual inspection of mean absolute error (or MAE) indicated that both model algorithms fit the participant data well with majority of trials with MAE < 0.100. (C) Variance explained by model #1 (R2) was slightly higher (M = 0.98, SD = 0.04) as compared to model #2 (M = 0.97, SD = 0.02). Overall, model #1 demonstrated better goodness-of-fit (R2 and MAE), supporting our decision to use the 1–45 Hz frequency range in our study. Download Figure 3-2, TIF file.
We further examined the group-level differences in periodic power and aperiodic component estimated by parameterizing the power spectrum using SpecParam (FOOOF) model (Donoghue et al., 2020b). The periodic oscillations were computed after removing the aperiodic component in linear space from untransformed power spectrum from trials where peaks were detected after fitting the FOOOF model. A repeated-measures ANOVA with a 2 (condition: /ta/ and /pa/) × 6 (component: mean theta, alpha, beta, gamma power; aperiodic offset, aperiodic exponent) within-subjects design was computed to analyse differences in periodic power and aperiodic component between illusory and nonillusory response conditions. We observed no significant interaction between condition and component for either the prestimulus (F(5,80) = 0.343, pGG = 0.632, partial η2 = 0.021; Fig. 3B) or poststimulus duration (F(5,80) = 1.548, pGG = 0.227, partial η2 = 0.090; Extended Data Fig. 3-1). Moreover, post hoc tests also revealed no significant pairwise condition difference for aperiodic components (offset and exponent) and periodic power across frequency bands. These findings emphasize the importance of separating aperiodic activity from periodic oscillations to accurately characterize the spectral changes associated with the McGurk illusory percept. Importantly, the absence of group-level effects prior to McGurk perception suggests that individual differences in susceptibility may obscure perception-related neural dynamics, making intertrial analyses a more sensitive approach to uncover the underlying mechanisms before McGurk perception. We then proceeded to capture the trial-wise variability at the sensor space.
Spatial topographies of prestimulus periodic power and aperiodic component show distinct brain regions involved in illusory and nonillusory perception
To investigate differences in prestimulus spectral activity preceding illusory /ta/ and nonillusory /pa/ perceptions, we compared trial-wise spatial topographies of periodic power (theta, alpha, beta, gamma) and aperiodic components (offset, exponent) using mass univariate linear mixed-effects models with permutation-based correction (see Materials and Methods section for details). Regression models revealed significant differences across a distributed set of scalp sensors after FDR correction (p < 0.05). Band-specific patterns revealed distinct topographical profiles. A higher mean theta power was observed over frontal F4 and F6 sensors before illusory perception (Fig. 4A). A lower mean alpha power was observed over frontal, frontal-central, and frontal-temporal sensors; whereas a higher alpha power over central sensors (C5, C1, C6) were observed before illusion (Fig. 4B). A lower beta power was observed across scalp before McGurk illusory perception (Fig. 4C). A higher gamma power over frontal, central, parietal, occipital, and right temporal sensors were observed; and a lower gamma power was observed over left temporal sensors before illusory perception (Fig. 4D). For the aperiodic offset, a higher t value was observed across scalp (Fig. 4E). And, finally lower aperiodic exponent was observed across temporal and occipital sensors and a higher exponent over FC3 and FC5 sensors was observed before illusory perception (Fig. 4F). Taken together, these spatial topographical distributions of mean periodic power and aperiodic component indicate distinct prestimulus brain states preceding the perception of an impending McGurk stimulus. These differences are evident not only in the unisensory perceptual regions like the occipital and temporal sensor regions but also in higher cognitive processing regions, such as the frontal, central, and parietal areas. So, to further understand how changes in these prestimulus spectral features relate to varying perception to McGurk stimulus, we further divided the whole scalp into six distinct sensor clusters covering bilateral frontal, central, parietal, occipital, and unilateral left temporal and right temporal sensors (see Table 1 for details). And then, fitted logistic mixed-effects models across the entire brain and different sensor regions.
Topographic distribution of t values from linear mixed-effects models for periodic and aperiodic spectral components. Topoplots depicting t values from linear mixed-effects models comparing illusory (/ta/) and nonillusory (/pa/) McGurk trials for mean periodic power across (A) theta, (B) alpha, (C) beta, and (D) gamma frequency bands, as well as aperiodic components: (E) offset and (F) exponent. The color bar indicates the corresponding t values. White-marked sensors denote statistically significant differences between conditions. Periodic power was computed by subtracting the aperiodic fit from the original (untransformed) PSDs in linear space.
Prestimulus periodic and aperiodic parameters across different sensor regions can predict the response to the upcoming McGurk stimulus
We fitted region-specific logistic mixed-effects models to predict illusory responses to incongruent McGurk stimuli. The models incorporated periodic parameters—center frequency (CF), aperiodic adjusted power at identified peaks (PW), and bandwidth (BW) across the theta, alpha, beta, and gamma frequency bands—as well as aperiodic parameters, including offset and exponent, as continuous predictors. Subject ID was included as a random effect (see Materials and Methods section for design of the model). As a measure of post hoc analysis, Bayes factor was estimated for all predictors to assess the evidence of significant predictors. At whole brain level, extreme evidence (BF > 100) of beta BW, gamma BW, gamma PW, offset, and exponent predictors was observed. For theta BW and beta CF, however, moderate evidence (3 < BF < 10) was predicted (Fig. 5A). Across different sensor regions, we found evidences for certain predictors that significantly predicted the illusory response to the impending McGurk stimulus. Aperiodic offset and exponent parameters from central and parietal sensors showed extreme evidence (BF > 100) for predicting the illusory response to the upcoming McGurk stimulus. Aperiodic offset from occipital sensors showed strong evidence (10 < BF < 100) and occipital exponent showed moderate evidence (3 < BF < 10) in predicting illusory responses. The influence of periodic parameter predictors, before illusory perception, proved to be more complex than initially expected. In the frontal region, extreme evidence (BF > 100) was found for gamma BW, with strong evidence (10 < BF < 100) observed for beta PW and gamma PW and moderate evidence (3 < BF < 10) for beta BW, alpha PW, and theta BW (Fig. 5B). In the central sensor region, moderate evidence was observed for theta PW and gamma BW (Fig. 5C). In the parietal region, strong evidence was found for theta BW and gamma BW and moderate evidence for gamma CF and theta PW (Fig. 5D). In the posterior occipital region, theta BW showed strong evidence for predicting the response to McGurk trials, while alpha CF, exponent, beta CF, gamma BW, beta BW, and theta CF showed moderate evidence (Fig. 5E). Finally, in the temporal region, extreme evidence was found for gamma BW, along with moderate evidence for gamma PW in the left temporal region (Fig. 5F,G). For more details on model output summaries and estimated Bayes factor values for each predictor, refer to Extended Data Figures 5-1–5-7. Overall, the results of logistic mixed-effects models show that different components of PSDs from different sensor regions predict the response to McGurk trials. Whether these parameters contribute to the illusory effect independently or dependently is what we looked into with correlation analysis.
Bayes factor estimated for all the prestimulus periodic (CF, PW, BW) and aperiodic (offset, exponent) parameter predictors that were fitted to a logistic mixed-effects model to predict response to upcoming McGurk stimulus. The models were fit for (A) all sensor region, (B) frontal, (C) central, (D) parietal, (E) occipital, (F) left temporal, and (G) right temporal. The Bayes factor evidence scale was as follows: extreme evidence, BF > 100; strong evidence, 10 < BF < 100; moderate evidence, 3 < BF < 10; no evidence, BF < 3. For representation purposes, exact BF values have been log base 10 transformed. Summary tables of the logistic mixed-effects models fitted across the whole brain with periodic (CF, PW, BW) and aperiodic (offset, exponent) parameter as predictors is shown as Extended Data Figure 5-1. Summary tables of the logistic mixed-effects models fitted across the frontal, central, parietal, and occipital sensors are displayed as Extended Data Figures 5-2–5-5, and Extended Data Figures 5-6 and 5-7 show summary tables of the logistic mixed-effects models fitted across the left and right temporal sensors, respectively.
Figure 5-1
Summary tables of the logistic mixed-effect models fitted across the whole brain with periodic (CF, PW, BW) and aperiodic (offset, exponent) parameter as predictors. Predictors that significantly predicted the response are in bold. Download Figure 5-1, TIF file.
Figure 5-2
Summary tables of the logistic mixed-effect models fitted across the frontal sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-2, TIF file.
Figure 5-3
Summary tables of the logistic mixed-effect models fitted across the central sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-3, TIF file.
Figure 5-4
Summary tables of the logistic mixed-effect models fitted across the parietal sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-4, TIF file.
Figure 5-5
Summary tables of the logistic mixed-effect models fitted across the occipital sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-5, TIF file.
Figure 5-6
Summary tables of the logistic mixed-effect models fitted across the left temporal sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-6, TIF file.
Figure 5-7
Summary tables of the logistic mixed-effect models fitted across the right temporal sensors. Predictors that significantly predicted the response are in bold. Download Figure 5-7, TIF file.
Association between mean periodic power and aperiodic parameters before perceiving the McGurk stimulus
By fitting logistic mixed-effects models, we observed that periodic and aperiodic activity from different regions could distinctly predict the response to the upcoming incongruent McGurk stimulus. We further examined the relationship between prestimulus mean periodic power at different frequency bands (theta, alpha, beta, and gamma) and aperiodic activity (offset and exponent) using Spearman rank correlation separately for illusory /ta/ and nonillusory /pa/ trial conditions. We found significant positive correlation for illusory conditions between mean theta power and exponent (r(18) = 0.56, p = 0.017; Fig. 6A). A positive correlation was observed between mean alpha power and exponent for illusory (r(18) = 0.55, p = 0.021) conditions (Fig. 6B). No significant association between mean beta power was observed for both illusory (r(18) = 0.036, p = 0.89) and nonillusory (r(17) = 0.13, p = 0.63) conditions (Fig. 6C). And, contrastingly, a negative correlation was observed between mean gamma power and exponent for both illusory (r(18) = −0.57, p = 0.015) and nonillusory (r(17) = −0.74, p = 0.0011) conditions (Fig. 6D).
Association between aperiodic exponent and mean aperiodic adjusted power in the (A) theta, (B) alpha, (C) beta, and (D) gamma frequency bands before perception of illusory (red) and nonillusory (blue) McGurk trials averaged across all channels and trials for each participant. The aperiodic slope is represented on the x-axis and aperiodic adjusted power on the y-axis. The density of observations for each association plot is indicated on the margins for illusory (red) and nonillusory (blue) trial conditions. One participant contributed zero nonillusory trials, so the nonillusory condition includes N = 17. All the extended analysis of association between aperiodic offset and mean periodic power, in the (A) theta, (B) alpha, (C) beta, and (D) gamma frequency bands before perception of illusory (red) and nonillusory (blue) McGurk trials averaged across all trials and channels for each participant is shown as Extended Data Figure 6-1.
Figure 6-1
Association between aperiodic offset and mean periodic power, in the (A) Theta, (B) Alpha, (C) Beta, and (D) Gamma frequency bands before perception of illusory (red) and non-illusory (blue) McGurk trials averaged across all trials and channels for each participant. The aperiodic offset is represented on the x-axis and oscillatory power on the y-axis. The density of observations for each association plot is indicated on the margins for illusory (red) and non-illusory (blue) trial parameters. Download Figure 6-1, TIF file.
For associations between mean power and offset for both response conditions, significant positive correlation was observed between mean theta power and offset (illusory: r(18) = 0.67, p = 0.0031; nonillusory: r(17) = 0.68, p = 0.0037) and a positive correlation between mean alpha power and offset (illusory: r(18) = 0.73, p = 0.00086; nonillusory: r(17) = 0.73, p = 0.0012). No significant association between mean beta power and offset (illusory: r(18) = 0.17, p = 0.51; nonillusory: r(17) = 0.16, p = 0.54) as well as mean gamma power and offset for both conditions were observed (illusory: r(18) = 0.19, p = 0.45; nonillusory: r(17) = −0.3, p = 0.25; Extended Data Fig. 6-1). These results suggest that narrowband frequency periodic power and aperiodic parameters especially slope (or exponent) may influence each other in shaping the perception of the impending McGurk stimulus. We further validated this association in predicting illusion by fitting a logistic mixed-effects interaction model across different frequency bands.
Interactions between aperiodic parameters and mean periodic power predicts perception to McGurk stimulus
Given the association between prestimulus mean periodic power and aperiodic activity before perceiving the McGurk stimulus, we now examined whether the aperiodic exponent interact with power in the prestimulus duration to influence behavioral perception. We used logistic mixed-effects interaction models for each frequency band of interest (theta, alpha, beta, and gamma; see Materials and Methods section for more details on the model design) and found significant interactions between mean power and aperiodic activity (see Table 2 for ANOVA results for all interaction models). Significant interactions mean that the values of one independent predictor influence another's overall performance in prediction (Fisher, 1992). In case of the theta interaction model, we found a significant three-way interaction between theta power × exponent × sensor regions [χ2(5) = 18.822, p (adjusted) = 0.0066]. This three-way interaction was most pronounced in the occipital sensor region (Log Odds = −0.082, 95% CI = [−0.16, −0.001], p = 0.038). The model revealed that when the exponent was steeper and as theta power lowered, the probability of illusory percept was higher (Fig. 7A). For the alpha interaction model, no significant two-way interaction [mean alpha power × exponent, χ2(1) = 0.3849, p (adjusted) = 0.6795] or three-way interaction [mean alpha power × exponent × sensor regions, χ2(5) = 9.951, p (adjusted) = 0.1752] was observed. However, both mean alpha power and the exponent were individually significant predictors [mean alpha power: χ2(1) = 25.9706, p (adjusted) = <0.0001; exponent: χ2(1) = 15.3560, p (adjusted) = 0.0005]. Figure 7B (left) illustrates that the probability of an illusory percept increases as alpha power decreases, while Figure 7B (right) shows that a steeper exponent is associated with a higher probability of an illusory percept. While the interaction was not statistically significant, this trend suggests a potential relationship between the predictors. For beta interaction model, there was a significant two-way interaction between beta power × exponent [χ2(1) = 9.205, p (adjusted) = 0.0070]. As depicted in Figure 7C, when the exponent was the steepest and as beta power lowered, the likelihood of perceiving the illusion increased. Finally, for gamma interaction model, a significant three-way interaction between gamma power × exponent × sensor regions [χ2(5) = 21.586, p (adjusted) = 0.0025] was observed. The effect was most pronounced in the occipital (Log Odds = 0.19, 95% CI = [0.06, 0.32], p = 0.0036) and parietal (Log Odds = 0.17, 95% CI = [0.05, 0.29], p = 0.0041) sensor regions, where when the exponent was steeper and as the gamma power increased, the probability of illusory percept was higher (Fig. 7D). Overall, these results suggest that when there is a steeper exponent, decreased theta and beta power were associated with better predicting the illusory response. Contrastingly, a higher gamma power associated with steeper exponent had a higher probability of perceiving the McGurk illusion.
Visualization of the relationship between prestimulus aperiodic exponent and aperiodic adjusted mean power in predicting McGurk illusory response at (A) occipital theta, (B) alpha, (C) beta, and (D) occipital and parietal gamma band power. For A, C, and D frequency band models, x-axis represents the mean power (aperiodic adjusted) in the frequency range (higher values indicate higher power) and y-axis represents the probability (in percentage) of perceiving the illusion. Note that the distinction of aperiodic exponent into shallow, moderate, and steeper facets are for visualization purposes only and the aperiodic slopes were entered as continuous predictors in all the models. Also, note that B illustrates effect of individual predictors: alpha power (left; x-axis, higher values indicate higher power) and exponent (right; x-axis, absolute higher values indicate steeper slope) for prediction to response (y-axis). The shaded region indicates 83% CI.
Fixed-effects ANOVA (type III Wald chi-square tests) results for logistic mixed interaction effects in the (A) theta, (B) alpha, (C) beta, and (D) gamma frequency bands
Discussion
This study aimed to elucidate the role of prestimulus aperiodic component along with putative oscillations (or periodic power) in shaping perceptual response to upcoming McGurk stimulus. Accordingly, spectral features of EEG data obtained during the prestimulus duration were put in a statistical model where the dependent variable was the perceptual categorization of response to the McGurk stimulus. This analysis design allowed us to test the much broader hypothesis of whether the brain's internal states, further divided into periodic and aperiodic activity, are good predictors of individual perceptual experience. The prestimulus PSDs were parameterized to extract periodic and aperiodic components of the EEG signal at single trial level. We estimated the interindividual differences in the periodic power and aperiodic component in the context of McGurk effect. Subsequently, using mass univariate linear mixed-effects models with permutation-based correction, we investigated sensor region-wise differences in power and aperiodic parameters between the illusory /ta/ and nonillusory /pa/ conditions, independent of between-subject variability. Finally, using logistic mixed effect models, we evaluated whether the spectral markers of neural activity in the prestimulus period can predict the response to the upcoming McGurk trial, independent of interindividual variability. Four key findings emerged from this study: (1) no significant differences in power across frequency bands were observed across participants after adjusting for the aperiodic component prior to McGurk trials; (2) lower aperiodic exponent and offset from central, parietal, and occipital regions predict response to following McGurk stimulus on a single trial level; (3) periodic parameters [center peak frequency (CF), peak power (PW), and bandwidth (BW)] from specific sensor regions can also predict response to subsequent McGurk perception on a trial-by-trial basis; and (4) the steeper aperiodic exponent associated with lower occipital theta, global beta, and higher posterior gamma power shape the perception of the upcoming McGurk stimulus. These findings suggest that prestimulus periodic and aperiodic components reflect distinct brain states, and both individually and collectively influence the perception of upcoming AV stimuli at a single trial level of variability.
Prestimulus periodic activity across different frequency bands leads to the perception of McGurk illusion
Periodic or rhythmic neural oscillations are known to encode different cognitive and behavioral states (Engel et al., 2001; Fries, 2005; Cole and Voytek, 2019). Keil et al. (2012) previously reported that prestimulus beta power correlated with the perception of the McGurk illusion. However, when we separated aperiodic broadband activity from periodic power using the FOOOF model, our repeated-measures ANOVA did not detect significant beta power differences between illusory and nonillusory conditions at the interindividual level (Fig. 3A). This suggests that the previously reported beta effects may have been influenced by underlying aperiodic activity, highlighting the importance of disentangling aperiodic and periodic components when interpreting power spectral differences related to McGurk perception. In contrast, our trial-wise sensor-level analysis revealed significantly higher prestimulus theta power over frontal sensors prior to illusion perception (Fig. 4A). Previous studies have demonstrated that increased theta power in frontal electrodes are associated with conflict processing, which reflects enhanced cognitive control following the perception of incongruent audiovisual stimuli (Hanslmayr et al., 2008; Cavanagh and Frank, 2014; Cohen, 2014; Ergen et al., 2014; Morís-Fernandez et al., 2018). In contrast, studies on McGurk effect perception suggest that reduced theta power after exposure to the McGurk stimulus primarily represents audiovisual integration (Keil et al., 2012; Lindborg et al., 2019). We propose that the elevated frontal periodic theta power observed in the prestimulus state before the illusory perception is linked to enhanced cognitive control or a shift in sensory precision (Cavanagh and Frank, 2014), likely directed toward the visual regions, which may contribute to the illusion. This hypothesis is supported by our logistic mixed-effects models (Fig. 5), which shows that in addition to increased frontal and central periodic theta power, elevated parietal and occipital periodic theta power predicts the illusory perception. However, a key caveat to consider with narrowband theta peaks is that the peak differences observed might stem from nonoscillatory spectral features (Donoghue et al., 2021) or shifts in neighboring bands (changes in alpha peak frequency or bandwidth; Seymour et al., 2022). Although we enforced stringent inclusion criteria so that only trials with valid periodic parameters in all four canonical bands were modeled, we cannot entirely rule out subtle cross-band interactions or transient nonoscillatory events contributing to our results. Future work that combines spectral parameterization with time-domain burst detection (Wen and Liu, 2016; Seymour et al., 2022; Wilson et al., 2022) or phase-based analyses (like cycle-by-cycle methods; Cole and Voytek, 2019) will help to further disambiguate true oscillations from spectral leakage or aperiodic fluctuations.
Moreover, lower prestimulus alpha band power has been associated with higher cortical excitability (Romei et al., 2008, 2010; Mathewson et al., 2009, 2011; Samaha et al., 2017). This means that at regions with high excitability (lower alpha power), the threshold of activation of the underlying neural population is lowered. Thus, improving the neural processing of sensory input leading to better perception, which in case of incongruent McGurk stimulus, sometimes may lead to illusory percept (Van Dijk et al., 2008; Jensen et al., 2012; Lange et al., 2014). In our study, we observed lower prestimulus periodic alpha power in the frontal and central sensor regions prior to illusory perception (Fig. 4B). This might reflect increased excitability or attentional engagement of higher-order networks that might be critical for multisensory integration in McGurk illusion (Foxe and Snyder, 2011; Haegens et al., 2012; Sadaghiani and Kleinschmidt, 2016). Fluctuations in frontal and central alpha power have also been linked to top-down modulation of attention and preparatory states that influence perceptual outcomes (Jones et al., 2010; Klimesch, 2012; Samaha et al., 2015). Notably, while we did not find significant occipital alpha power differences at the sensor level, our analysis of peak alpha frequency (PAF) in the occipital region (Fig. 5E) revealed that lower PAF predicted illusory percepts that is consistent with previous findings that occipital PAF modulates audiovisual integration and susceptibility to multisensory illusions (Cecere et al., 2015; Samaha and Postle, 2015; Keil and Senkowski, 2017; Venskus and Hughes, 2021). Taken together, these results suggest that lower prestimulus periodic alpha power in frontal and central regions, along with lower occipital PAF, may jointly facilitate the attentional and integrative processes that bias perception toward the McGurk illusion. This aligns with the view that alpha oscillations reflect both local excitability and large-scale network dynamics that shape multisensory perceptual experiences (Foxe and Snyder, 2011; Sadaghiani and Kleinschmidt, 2016).
Furthermore, we observed a significantly lower periodic beta power before illusory percept across all sensor regions (Table 1, Fig. 4C); however, our model could only predict response to illusion perception from periodic beta power over frontal and occipital electrodes (Fig. 5B,E). Our findings are consistent with previous research on audiovisual simultaneity judgment tasks, where the authors showed that lower beta power over the scalp preceded visual-then-auditory sequences perceived as simultaneous trials (Yuan et al., 2016). Moreover, prestimulus lower beta band power has been associated with better sensory encoding (Griffiths et al., 2019), which might lead to illusory perception in case of incongruent AV stimulus input (for review, see Keil and Senkowski, 2018). Studies on rubber hand illusion have shown reduced central alpha and beta band power before illusory percept (Evans and Blanke, 2013; Rao and Kayser, 2017). Interestingly, however, our results deviate from multisensory illusion studies where they have reported a higher prestimulus beta power before the illusory percept (Keil et al., 2012; Keil and Senkowski, 2018; Kaiser et al., 2019). These studies have looked at beta power without removing the 1/f power (aperiodic) component, which might indicate that the aperiodic component of the power spectrum majorly drives the power in the prestimulus duration and separating the periodic power from the aperiodic component brings out the true nature of oscillatory activity (Monto et al., 2008; Becker et al., 2018; Donoghue et al., 2020b). Taking all these observations together, we propose that a lower periodic beta power in the frontal and occipital regions drives McGurk illusory percept instead of the higher beta power. Moreover, we also observed that lower peak beta center frequency (CF) in the occipital regions predicted illusory responses (Fig. 5E). These peak beta frequencies might be modulating audiovisual attention shift along with peak alpha center frequencies especially in the occipital region. However, looking at prestimulus cross-frequency coupling (CFC) is beyond the scope of this study.
Interestingly, our logistic mixed-effects model predicted significantly higher gamma bandwidth (BW) activity, with extreme evidence (BF > 100) in the frontal and temporal regions prior to illusory perception (Fig. 5B,F,G), while the role of bandwidth in multisensory perception remains rather underexplored. However, a review by Lowet et al. (2022) suggests that variability in gamma bandwidth reflects changes in neural synchronization driven by different cognitive demands, underscoring the adaptability of neural circuits to external stimuli. Based on this premise, we hypothesize that gamma band synchronization between frontal and temporal regions undergoes distinct changes before illusory perception compared with nonillusory conditions. However, examining the nature of this synchronization falls beyond the scope of the current study. Overall, the periodic activity pattern of high theta and low alpha power (in frontal, central, and occipital regions), alongside low beta (in frontal and occipital region) and high gamma power (over frontal and temporal regions), suggests a shift in attention modulation in the prestimulus duration. And, this shift appears to favor visual processing, thereby biasing perception toward the illusory interpretation of the upcoming McGurk stimulus.
Decreased aperiodic offset and exponent parameters predicts response to the illusory speech sound perception
The aperiodic component of the power spectrum (offset and exponent) has been associated with modulations in cognitive states (He et al., 2010; Podvalny et al., 2015), aging (Voytek et al., 2015; Donoghue et al., 2020b; Thuwal et al., 2021), and the excitation/ inhibition (E/I) balance of local neuronal populations (Manning et al., 2009; Gao et al., 2017; Waschke et al., 2021; Chini et al., 2022). In our sensor-level analysis, we found a significant increase in offset values across the scalp (Fig. 4E) and flatter exponent over temporal and occipital sensors before the illusory percept (Fig. 4F). In contrast, our logistic mixed-effects models estimated that lower prestimulus offset and flatter exponent from parietal, central, and occipital electrodes predicted the illusory response to the upcoming McGurk stimulus (Fig. 5C,D,E). This apparent discrepancy reflects the distinct questions addressed by the two analyses. The sensor-level LMER captures average group-level differences, highlighting a general shift toward increased cortical excitability before illusion, while the logistic model leverages trial-by-trial variability to reveal that, within this elevated state, local reductions in aperiodic features are most predictive of illusory perception in specific regions. These findings suggest that both global and local aperiodic fluctuations shape multisensory outcomes, underscoring the complex, potentially nonlinear relationship between background neural activity and perceptual experience (He, 2014; Donoghue et al., 2020b).
Moreover, the offset parameter is often referred to as signal's baseline or “background noise,” which is positively correlated to spontaneous neural spiking (Manning et al., 2009; Miller et al., 2012). Therefore, our results on decreased offset value before illusory perception could reflect a decreased spontaneous neuronal population spiking activity over the central, parietal, and occipital regions, potentially indicating on a selective cortical state that leads to illusory percept. This could further be systematically tested by tracking population firing rate using computational models of neural masses with the excitation/inhibition (E/I) balance of local neuronal populations (Gao et al., 2017; Keil and Senkowski, 2017; Kumar et al., 2020) which is beyond the scope of the current study.
Furthermore, the exponent parameter refers to the steepness of exponential power decay with increasing frequencies, which is associated with underlying synaptic currents and reflects the underlying E/I balance of neural networks. A flatter exponent—as observed prior to McGurk illusory perception—suggests increased arrhythmic background neuronal firing which is thought to be driven by increased E/I ratio (Voytek et al., 2015; Gao et al., 2017; Lendner et al., 2020). However, the extant literature does not shed a clear mechanistic understanding of this phenomenon but rather provides a mixed understanding of this correlation in relation to higher-order cognitive processing. For instance, Wiltshire et al. (2017) have reported that spectral slope becomes more steep with increasing demands on externally oriented attention in a given task. Other studies have reported flatter slopes in association with higher state of consciousness (Miskovic et al., 2019; Lendner et al., 2020), in rest to task state transitions (Podvalny et al., 2015), and in neuromodulatory processing of uncertainty (Kosciessa et al., 2021). Considering all these inferences, we propose that in case of multisensory McGurk perceptual task, a flatter exponent before illusory percept over central, parietal, and occipital sensors reflects a shift in cortical dynamics toward a more excitable state. Such a shift might arise under conditions with high uncertainty like the prestimulus state, where cortical networks transition from a rhythmically dominated regime to a predominantly aperiodic (or arrhythmic) state. This high excitable and arrhythmic state might be better suited for integrating incongruent McGurk stimulus leading to illusory perception. However, these interpretations remain under active investigation, and therefore our findings should be viewed as consistent with, but not definitive of, these proposed mechanisms.
Interactions between mean periodic power and aperiodic exponent predict multisensory speech sound illusion
Significant interactions between mean periodic power (aperiodic adjusted) and aperiodic exponent during prestimulus intervals also predicted subsequent responses to the McGurk illusion on a trial-by-trial basis. Specifically, we observed that decreased occipital theta power and increased occipital and parietal gamma power were associated with a higher likelihood of illusory perception when the aperiodic exponent was steeper (Fig. 7A,D). Similarly, decreased global beta power predicted illusory responses in the context of a steeper exponent (Fig. 7C), while decreased alpha power and a steeper exponent individually predicted illusory perception, though their interaction was not significant (Fig. 7B). These results align closely with studies linking the aperiodic exponent to the underlying excitation–inhibition (E/I) balance in cortical circuits (Voytek et al., 2015; Gao et al., 2017). A steeper exponent is indicative of a more inhibition-dominated (lower E/I) state, which has been associated with reduced population spiking (Miller et al., 2009; Voytek and Knight, 2015; Zhou and Yu, 2018; Medel et al., 2023; Diachenko et al., 2024). Moreover, in our study, the decreased low-frequency (theta, beta) power in conjunction with a steeper exponent predicts illusory percept is consistent with previous reports that low-frequency power is correlated with population spiking (Miller et al., 2007, 2009; Ball et al., 2008; Voytek and Knight, 2015), suggesting that reduced synchronous low-frequency activity, in our case, occipital theta and global beta, associated with steeper exponent (less population spiking) may facilitate prestimulus oscillatory dynamics that shapes McGurk illusory perception.
Moreover, we also observed an increase in posterior (occipital and parietal) gamma power predicted the illusory perception when the exponent was steeper. These observations support the notion that gamma activity, often linked to local cortical excitation and perceptual binding, may be more functionally relevant in a subcritical (or inhibition-dominated) state (Cohen and Maunsell, 2011; Harris and Thiele, 2011; Waschke et al., 2021). This interplay between aperiodic and mean power highlights the importance of considering both oscillatory and aperiodic neural dynamics in understanding perceptual outcomes. Our interaction model results suggest that a transient shift toward a more inhibited, and more ordered neural state, indexed by a steeper aperiodic exponent and selective modulation of mean power (lower occipital theta, lower global beta, and higher posterior gamma power), may facilitate the perceptual binding processes underlying the McGurk illusion. Overall, we speculate that while different oscillatory features govern distinct cognitive functions (Haegens et al., 2014; Mierau et al., 2017; Scally et al., 2018), they also interact significantly to modulate individual participants’ predisposition of prestimulus brain state over individual trials, which leads to experiencing illusory or nonillusory perception in case of McGurk stimulus.
Taken together, in this study we have attempted to understand the relationship between prestimulus oscillatory activity and aperiodic activity in subsequently perceiving the McGurk illusion on a trial-by-trial basis. We have demonstrated that aperiodic activity (especially steeper aperiodic exponent) greatly influences spectral power at both unisensory (temporal and occipital) and higher cognitive areas (like parietal and frontal) in shaping perceptual response. We believe that this interaction accounts for intertrial variability which is quintessentially observed in McGurk paradigms. Therefore, this study opens a new avenue for addressing the role of prestimulus aperiodic activity in influencing low- and high-frequency band oscillations across relevant brain areas. Moreover, by parameterizing the power spectrum one can study the modulation effect of one true oscillatory activity from another in the context of multisensory speech perception.
Footnotes
The authors declare no competing financial interests.
We acknowledge the support from the Department of Information Technology (DIC) NBRC. NBRC Core funds supported this study. D.R. was supported by SERB Core Research Grant (CRG) S/SERB/DPR/20230033 extramural grant from the Department of Science and Technology, Ministry of Science and Technology, Govt. India. D.R. and A.B. acknowledge the generous support of the NBRC Flagship program BT/MIDI/NBRC/Flagship/Program/2019: Comparative mapping of common mental disorders (CMD) over the lifespan.
- Received October 4, 2024.
- Revision received July 30, 2025.
- Accepted August 18, 2025.
- Copyright © 2025 Singh et al.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.








