Abstract
The posterior alpha (α) rhythm, seen in human electroencephalogram (EEG), is posited to originate from cycling inhibitory/excitatory states of visual relay cells in the thalamus. These cycling states are thought to lead to oscillating visual sensitivity levels termed the “neuronic shutter effect.” If true, perceptual performance should be predictable by observed α phase (of cycling inhibitory/excitatory states) relative to the timeline of afferentiation onto the visual cortex. Here, we tested this hypothesis by presenting contrast changes at near perceptual threshold intensity through closed eyelids to 20 participants (balanced for gender) during times of spontaneous α oscillations. To more accurately and rigorously test the shutter hypothesis than ever before, α rhythm phase and amplitude were calculated relative to each individual’s retina-to-primary visual cortex (V1) conduction delay, estimated from the individual’s C1 visual-evoked potential (VEP) latency. Our results show that stimulus observation rates (ORs) are greater at a trough than a peak of the posterior α rhythm when phase is measured at the individual’s conduction delay relative to stimulus onset. Specifically, the optimal phase for stimulus observation was found to be 272.41°, where ORs are 20.96% greater than the opposing phase of 92.41°. The perception-phase relationship is modulated by α rhythm amplitude and is not observed at lower amplitude oscillations. Collectively, these results provide support to the “neuronic shutter” hypothesis and demonstrate a phase and timing relationship consistent with the theory that cycling excitability in the thalamic relay cells underly posterior α oscillations.
Significance Statement
After accounting for neural conduction delays, we found that threshold intensity stimuli are observed at higher rates when the α wave is at a trough phase than at a peak phase, but only when α amplitude is high. Our results were derived using methods consistent with a specific hypothesis about a mechanism of visual perception, considering the structure, physiology, and transmission delays in the visual system. The results of this rigorous study design add support to the neuronic shutter hypothesis and are consistent with the theory that posterior α reflects cycling excitability in thalamic relay cells, thereby gating the flow of visual information.
Introduction
The posterior alpha (α) rhythm, a 7- to 13-Hz oscillation, is a notable characteristic of the human electroencephalogram (EEG) over the occipital cortex, especially when eyes are closed (Berry and Wagner, 2015). This is thought to assist attention regulation, as its characteristics are often found to correlate with attention over specific sensory cortices (Sauseng et al., 2005; Ben‐Simon et al., 2013), possibly a gating mechanism to limit the flow of sensory information (Cooper et al., 2003; Janssens et al., 2018). This rhythm has analogs in the sensorimotor (Pineda, 2005; Bazanova and Vernon, 2014) and auditory cortices (Tiihonen et al., 1991; Weisz et al., 2011), with both appearing to provide similar gating. This regulatory function is further demonstrated by the observation that α activity increases during introspective behavior (Ancoli and Green, 1977) and behaviors involving internally directed attention such as imagination (Cooper et al., 2003), mental arithmetic (Ray and Cole, 1985), autobiographical recall (Yue et al., 2013), and meditative practices (Tenke et al., 2017), presumably because external sensory information is distractive during these behaviors.
However, the cellular mechanism and functional significance of α oscillations remains unclear. In the 1950s, α oscillations were proposed to represent a “neuronic shutter” (Callaway and Alexander, 1960) to down-sample incoming sensory information to reduce processing load, and similar discrete perceptual processes are proposed today (VanRullen and Koch, 2003; VanRullen et al., 2014). This shutter is thought to occur at the lateral geniculate nucleus (LGN), which relays visual information to primary visual cortex (V1). At times, LGN relay cells burst-fire at 10 Hz, with a hyperpolarized refractory period between bursts (Lopes da Silva, 1991; Sherman, 2001; Timofeev and Bazhenov, 2005; Alexander et al., 2006; Timofeev and Chauvette, 2011). If many of these cells fire synchronously, visual afferentation during a widespread refractory period is less likely to be relayed to V1, resulting in a 10-Hz visual shutter. Each burst in LGN relay cells results in a large EPSP at V1, measured as a negativity in the occipital EEG (Timofeev and Bazhenov, 2005), with higher amplitudes indicating more synchronous excitation (Pfurtscheller et al., 1996), resulting, as hypothesized by this model, from LGN excitatory volleys. Thus, the α rhythm is thought to reflect cycling excitability in the LGN visual-relay cells, which would result in cycling visual sensitivity in phase with the α oscillation. Evidence of this mechanism producing α oscillations has been found in animal models with the associated phasic gating of sensory transfer (Lorincz et al., 2009; Chen et al., 2016). The hypothesis of the present study was based on this model; however, we did not directly test the underlying cellular mechanism.
To more accurately measure the α phase relationship to perceptual performance, we accounted for neural conduction delays: if a stimulus strikes the retina at t0, that information arrives at the LGN at a later time, t1. Therefore, the t1 LGN excitatory state determines whether the information is relayed to V1. Further, the LGN excitatory state is measurable even later in the EEG, as V1 EPSPs at t2. Thus, t0 to t2 represents the retina-to-V1 conduction delay: At t2, we can assess the t1 LGN excitability, the time of afferentation of the t0 stimulus. This is the first study, to our knowledge, to directly examine the relationship between visual observation and spontaneous α phase relative to each individual’s conduction delay, which advances the accuracy and rigor for testing the shutter hypothesis.
This study’s objective is to more robustly test the hypothesis that posterior α reflects cyclic excitability of the visual system with a phase and timing relationship predicted by underlying physiological model and conduction delays. Therefore, we presented visual stimuli (t0) to participants during spontaneous occipital α activity, then measured α amplitude and phase at t2 (estimated by the individual’s C1 visual-evoked potential (VEP) component peak latency; Di Russo et al., 2002). We predicted that visual stimuli at t0 would be more likely observed with α phase at a trough at t2 (assumed to indicate t1 LGN excitation) than at a peak (t1 LGN inhibition). Additionally, this effect was predicted to be greater at high α amplitudes, indicating greater cellular excitatory state synchrony. We found that participants responded to 20.96% more stimuli with α phase at a trough versus a peak at t2 when α amplitude was high. At low amplitude, no significant phase effect on observation rate (OR) was observed. These results support our hypothesis that the α rhythm reflects cyclic excitatory states in the visual system resulting in a visual shutter effect.
Materials and Methods
Participants
α Activity varies largely both within (Gonçalves et al., 2006) and between (Wieneke et al., 1980) individuals. For this study, we needed to ensure stimuli were able to be presented during times of observable (e.g., stationary) α oscillations and therefore selected participants who more readily and reliably generated observable occipital α activity. This selection was made by observing EEG activity as the participants practiced the visual sensitivity task with eyes closed during an earlier portion of the experimental session. Participants who did not readily and reliably produce occipital α oscillations during this practice period were excluded from further participation.
In total, 41 participants were recruited for this study. Based on the observable α criteria described above, 21 participants were excluded from further participation. The remaining 20 healthy participants (10 males/10 females, mean age: 22.3 years, range: 18–28 years) completed two tasks: one to estimate their individual conduction delay; the other to investigate their visual sensitivity at different phases of α oscillations. The experimental protocol was approved by the Institutional Review Boards of Wright State University and the Air Force Research Laboratory. All participants were compensated for their time.
Recording
All recordings were made using the BioSemi ActiveTwo system (BioSemi B.V.). Recordings were made with a 2048-Hz sampling rate at 64 channel locations based on the modified combinatorial nomenclature extension of the 10–10 system (American Electroencephalographic Society, 1994) excluding the inferior chain with the exception of the channels P9/P10 and Iz (Seeck et al., 2017), with bilateral electrodes on the mastoid process, infraorbital, and outer canthus locations. Participant responses were recorded using a low-latency mechanical keyboard (Cherry MX 6.0 [G80-3930], Cherry GmbH). This was combined with other task-state and visual-stimulus timing information via light sensors placed on the monitor to record events on the ActiveTwo’s 16-bit trigger line (StimTracker 1G, Cedrus Corporation).
Stimuli
All tasks and stimuli were constructed and presented using MATLAB (R2011b; The MathWorks) and the Psychophysics Toolbox (v3.0.13; Brainard, 1997; Pelli, 1997; Kleiner et al., 2007). This software was run on a Dell Precision T3610 computer (Dell Inc.) with the Windows 7 Professional operating system (Microsoft Corporation). Stimuli were presented to participants on a 24.5”, 240 Hz monitor (BenQ ZOWIE XL2540, BenQ Corporation) providing 4.2-ms temporal resolution for stimulus presentation. Where relevant, stimulus luminance was measured using a light meter (Light Meter LUX/FC 840020, SPER SCIENTIFIC). The participant’s head position was fixed using a chinrest placed 58 cm from this monitor. The experiment was conducted in a dark room with a natural sound machine (Dohm Classic, Marpac LLC) to mask noise disturbances.
Data analysis
All data analyses were performed in MATLAB (R2019b; The MathWorks) using the EEGLAB Toolbox (v2019.0; Delorme and Makeig, 2004) and the ERPLAB plugin (v7.0.0; Lopez-Calderon and Luck, 2014). Statistical analyses were conducted using MATLAB’s Statistics and Machine Learning Toolbox (R2019b; The MathWorks). This software was run on a Lenovo ThinkPad P50 computer (Lenovo) with the Windows 10 Enterprise operating system (Microsoft Corporation).
Code accessibility
Task design and data analysis code are made publicly available at https://osf.io/qnryf/; DOI 10.17 605/OSF.IO/QNRYF in addition to the recorded EEG data.
Estimation of t2
The peak latency of each participant’s C1 VEP component was used as an estimate of their individual retina-to-V1 conduction delay, t2. This component reflects the arrival of a volley of visual information to V1 from the LGN along the optic radiations. This is analogous to the α wave, in the sense that it also reflects excitatory burst volleys from the LGN to V1, according to the hypothesized model. The C1 peak, rather than onset, latency was used to estimate the middle point of these volleys rather than their onset. Because of the specific folding of the V1 cortical area around the calcarine fissure (Fig. 1A), the C1 wave shape will vary with the location of the stimulus in the visual field. Stimuli presented horizontally centered in the visual field will stimulate spatially opposing V1 neurons whose dipoles will cancel each other out, yielding no measurable C1 component in the EEG (Clark et al., 1994; Di Russo et al., 2002). This is similarly the case for stimuli presented ∼3° below the vertical center (Clark et al., 1994; Di Russo et al., 2002). However, with some horizontal spacing, bilateral stimuli presented above −3° in the visual field will produce a negative C1 wave, while those presented below −3° in the visual field will produce a positive C1 wave as shown in Figure 1B (Clark et al., 1994; Di Russo et al., 2002).
C1 waves are found to be measured maximally at electrode POz using an averaged mastoid reference, but they are still quite small in amplitude. To obtain a well-defined C1 VEP component, both negative and positive components were obtained using upper and lower visual field stimuli. Next, the difference between these two signals was taken to create a large-amplitude wave with a clear peak (Fig. 1C shows these waveforms from a single participant). This peak latency was taken as an estimate of t2.
Experimental design
The stimulus used for this task was a 17° wide, 6.38° tall black and white checkerboard made of 16 × 6 1.06° squares placed on an otherwise black screen with a fixation point placed in the center (Fig. 1B). The checkerboard was flashed with its center either 5° above or 7.5° below the fixation point and centered horizontally. The size and placement of these stimuli were selected to generate maximum amplitude waves (Clark et al., 1994). A total of 600 upper and 600 lower stimuli were presented in a mixed random order each with a duration of 33 ms with random 250- to 450-ms interstimulus intervals, resulting in a trial time of ∼7.5 min.
Similarly to Di Russo et al. (2002), participant attention and gaze were maintained on the fixation point by a sham task performed during the trial. The fixation point would occasionally flash to a brighter color for 12.6 ms, and the participant was instructed to watch for this flash and to respond by pressing the space bar.
Analysis
For the C1 VEP analysis, the originally recorded 2048-Hz sampling rate was maintained but re-referenced to averaged mastoids. All EEG signals were bandpass filtered from 0.01 to 50 Hz at −6 dB, using a second order IIR Butterworth filter as implemented in the ERPLAB toolbox (Lopez-Calderon and Luck, 2014), to achieve a zero-phase filter response. A bipolar vertical electrooculogram (vEOG) signal was created by subtracting the averaged left and right infraorbital electrodes from the averaged Fp1 and Fp2 electrodes.
The data were epoched from 50 ms before to 200 ms after stimulus presentation for both the upper and lower visual stimuli. Blinks within these epochs were detected by sliding a 150-ms window at 75-ms steps over the vEOG signal, and any window containing a peak-to-peak amplitude of 100 μV or greater was rejected from analysis. Separate average VEPs were calculated for the upper and lower stimulus trials and were baseline-corrected to the 50-ms prestimulus period. The lower visual field VEP was then subtracted from the upper field VEP to create a difference wave containing a C1 wave with a clear peak.
The C1 peak latency was measured at electrode POz by finding the most negative peak within the 0- to 110-ms time window relative to stimulus onset. This peak latency was calculated for each participant individually and was used as an estimate of their individual t2.
Measuring ORs
This task was designed to be conducted with closed eyes to evoke more frequent (Legewie et al., 1969) and higher amplitude α oscillations (Barry et al., 2007). For this purpose, stimuli were designed such that they could be observed as light flashes through closed eyelids. The brightness of these flashes was designed to be near threshold intensity (where participants reported seeing ∼50% of the stimuli) to prevent any ceiling or floor effects on performance.
Experimental design
The stimulus used for this task was an 8.5° square centered on an otherwise black (0.7 lux) screen. The stimulus brightness was defined by coloring the square as a greyscale value from 0 (completely black; 0.7 lux) to 255 (completely white; 89.2 lux). The stimulus was designed to be observable through the closed eyes of the participant. The brightness of the stimulus was adjusted to near the threshold intensity for each individual participant, the value at which the participant reported observing the stimulus half of the number of times it was presented. This threshold was estimated using the staircase method as described in Cornsweet (1962) during a calibration task before the main task. In this task, the participant kept their eyes closed as a series of flashes were presented. Each time the participant reported observing a flash (using the spacebar), the stimulus intensity was decreased in the next trial. However, if the participant did not report observing the stimulus, the intensity was increased on the next trial. Following the methods described in Cornsweet (1962), threshold intensity is defined as the average of all trials following the third reversal in trial-intensity slope. An example of this task from one participant is shown in Figure 2. In this figure, the third reversal occurred at trial 5, and threshold intensity was calculated as the average of trials 6–40.
The stimuli used were so dim that the participants’ ability to perceive them was sensitive to how well their eyes had adapted to the dark. Pilot testing of the task revealed that, without an adjustment period, the threshold brightness found during the calibration task would become easier to see over the course of the main task. Although visual adaptation to the dark will continue over many hours, the most rapid changes occur in the first 15 min, and visual acuity begins to plateau in 20 min (Bierings et al., 2018). Therefore, participants were given 25 min in the dark as an adaptation period before conducting the calibration task to estimate the threshold brightness.
In the main task, the stimuli were flashed for a period of 8.4 ms, and the participant was instructed to respond using the keyboard any time they observed a flash. The participant had to respond with the space bar within 2 s of the stimulus to be a valid response. The experimenter triggered the stimuli during the times of occipital α oscillation as observed in the real-time data throughout the task. After the experimenter triggered the stimulus, it was presented after a random interval of <750 ms (to mitigate potential confounding effects of experimenter bias in stimulus presentation timing).
Analysis
All data were down-sampled to 512 Hz and re-referenced to averaged mastoids. The signal was bandpass filtered from 0.01 to 50 Hz at –6 dB using a second order IIR Butterworth filter, as implemented in the ERPLAB toolbox (Lopez-Calderon and Luck, 2014), to achieve a zero-phase filter response. A bipolar vEOG signal was created by subtracting the averaged left and right infraorbital electrodes from the averaged Fp1 and Fp2 electrodes.
To isolate the activities of the left and right visual cortices, a left occipital (LO) signal was calculated by averaging the channels O1, PO3, and PO7, and a right occipital (RO) signal was calculated by averaging channels O2, PO4, and PO8. LO and RO were then convolved with a 10-Hz complex Morlet wavelet to obtain time-domain power and phase signals (Cohen, 2014). A peak-to-peak amplitude signal was then calculated by doubling the square root of the power signal.
The Morlet wavelet was constructed with a center frequency of 10 Hz containing 22/3 cycles (the recommended minimum; Cohen, 2019). This cycle count was chosen to optimize temporal resolution and resulted in a wavelet with a full-width at half maximum (FWHM) of 101.6 ms in the time domain and 8.87 Hz in the frequency domain. A FWHM of 8.87 Hz indicates that the wavelet convolution was effectively a bandpass filter with half-power points of 5.57 and 14.44 Hz, approximating the α frequency band.
The following analysis was then performed independently for each participant. Amplitude and phase were measured from LO and RO at the individuals’ t2 relative to stimulus onset for both observed and missed trials. Trials in which LO and RO phases differed by >90° were rejected from further analysis to control for α asynchrony between the hemispheres. For each of the remaining trials, a single amplitude and phase value was obtained by averaging the amplitude and phase of the LO and RO signals. Phase values were averaged using appropriate statistics for circular quantities (Berens, 2009). These trials were then divided by median amplitude into high-amplitude and low-amplitude bins and then further divided by phase using 90° bins centered on 0°, 90°, 180°, and 270° phase angles. The amplitude median was chosen to provide equal sample sizes in each amplitude bin, and since it is appropriate regardless of distribution normality. When the individual amplitude distributions of our 20 subjects were examined, they appeared to be unimodal and approximately normal with no point of division appearing to be more advantageous than the median (Fig. 3). ORs (the percentage of trials in which the participant observed the stimulus) were then calculated for every bin, and the overall OR across all bins (i.e., the average OR over all trials) was subtracted from these values to obtain a ΔOR value for each of the two (amplitude) × four (phase) bins. Note that ΔOR is not a measure of percent change, but simply a difference of the overall OR and the condition-specific OR.
Statistical analysis
This analysis resulted in a within-subjects 2 × 4 repeated measures design with 20 participants and two factors: amplitude (high, low) and phase (0°, 90°, 180°, 270°). These data were analyzed using a repeated measures ANOVA, testing both main effect of amplitude and phase as well as their interaction. Independent t test and Tukey’s HSD tests were used to probe any effects found to be significant. All statistical analyses were conducted using MATLAB’s Statistics and Machine Learning Toolbox (R2019b; The MathWorks).
Results
Individual retina-to-V1 conduction delay (t2) measurement
In the first part of the experiment, the goal was to estimate each participant’s retina-to-V1 conduction delay, t2. To calculate that, we measured in each participant the averaged VEPs in response to upper and lower visual stimuli and calculated the “difference waveform” (based on the difference between two signals); the peak latency of this difference waveform was taken as the participant’s t2 (Fig. 4). The mean t2 for all participants was 75.56 ms [range = [62.99, 88.87] ms, 95% confidence interval (CI) = [72.44, 78.69] ms]. These data indicate that it takes, on average, 75.56 ms for visual information to be transmitted from the retina to area V1. For each participant, the phase and amplitude measurements were made at the individual’s t2 relative to the stimulus onset to assess visual sensitivity.
OR measurement
The second part of the experiment aimed to investigate whether perceptual performance, when measured at t2, is higher at an α wave trough (i.e., phase angle of 270°) than an α wave peak (i.e., phase angle of 90°). To examine this, we presented light flashes to each participant through closed eyes. Light flashes were near individual threshold intensity estimated using the staircase method (Cornsweet, 1962) during a calibration task before the main task to prevent ceiling and floor performance effects. This resulted in the stimuli of average luminance across participants of 6.48 lux. Use of these near-threshold stimuli resulted in total ORs averaging 58.28% across participants.
On average, participants were presented 161.65 stimuli during the main task (range = [120, 200]) with an average interstimulus interval of 12.52 s. LO and RO amplitude and phase signals were measured for each participant and analyzed at the participant’s t2 time point. Trials in which LO and RO phases differed by >90° were rejected from further analysis resulting in 135.75 trials remaining per participant on average (range = [85, 183]). The amplitude measures were split by their median value into two bins (high and low), and the phase measures were split into four 90° bins centered on 0°, 90°, 180°, and 270° phase angles. A within-subjects two (amplitude) × four (phase) repeated measures ANOVA analysis was conducted on the participant ΔOR values (calculated as the OR per bin minus the overall OR). Mauchly tests indicated no significant violation in the assumption of sphericity for phase (χ2(5) = 10.169, p = 0.0706), or the interaction of phase and amplitude (χ2(5) = 3.250, p = 0.6615). The ANOVA analysis showed that phase and the interaction between phase and amplitude have statistically significant effects on ORs, as shown in Table 1. Accordingly, these ANOVA results support our hypothesis that a relationship between α phase and visual observation exists, and that this relationship is modulated by the α wave amplitude.
Multiple comparison tests were performed to investigate the precise nature of the effects found to be significant in the ANOVA. Figure 5 shows the results of independent t tests (significance indicated by #) comparing ΔOR to zero in each of the phase and amplitude levels. Only phase bins 90° and 270° in the high-amplitude condition were found to be significantly different (t(19) = −5.632, p = 0.0002; t(19) = 3.466, p = 0.0207, respectively, Bonferroni corrected for eight comparisons; Bland and Altman, 1995). Tukey’s HSD tests (indicated by *) then compared ΔOR distributions between phase bins within each amplitude level. These tests found significant differences in the phase bins only in the high-amplitude condition. Specifically, the 90° phase bin was statistically different from the 0° (p = 0.0077), 180° (p = 0.0308), and 270° (p < 0.0001) phase bins in the high-amplitude condition (Fig. 5). Importantly, the greatest difference in the high-amplitude condition was found between phase bins 90° and 270°, with a 22.20% increase in ORs (95% CI = [9.77%, 34.63%]; Fig. 5). Collectively, these results show that stimuli are observed with greatest probability when the α wave is near a trough at t2, and with lowest probably when near a peak, when α amplitude is high. The perception-phase relationship is robust only at high α wave amplitude, likely because during low-amplitude oscillations the measured phase is less representative of the underlying neural population activity.
Phase-varying analysis of visual observation
For a high-resolution characterization of the phase effect on ΔOR, we repeated the original procedure at t2 using 90° phase bins rotated in 1° increments for all 360° phase angles. Figure 6 shows ΔOR for each of the 20 participants in both the high-amplitude and low-amplitude conditions as a function of the centered α phase angle of the bins. Indicated along the circumference is the direction of the center of mass of these phase diagrams for each condition, calculated as the angle of the vector mean of each of the 360 points forming the ΔOR distribution. This angle is referred to here as the preferred phase and indicates the phase at which stimuli have the highest probability of being observed given the whole distribution. Note that in the high-amplitude condition, the preferred phase is between 180° and 360° (the negative-amplitude portion of the α cycle) in 18/20 participants. These preferred phase angles have associated magnitudes indicating how far the distribution’s center of mass is offset in that direction and the strength of the effect of phase on ORs. These preferred phase angles and magnitudes for all participants are shown in Figure 7B,D.
Figure 7A,C shows the ΔOR distributions averaged across all participants. Indicated in black along the circumference is the group’s preferred phase. Note that calculating the group’s preferred phase as the direction to the center of mass of the group’s ΔOR distributions (Fig. 7A,C) is equivalent to calculating the vector mean of the 20 individual preferred phases and associated magnitudes (Fig. 7B,D). By taking the ΔOR value at the preferred phase angle and subtracting from it the corresponding opposite phase angle (by 180°, referred to as the pessimal phase), we calculated the preferred phase effect (PPE), which indicates the increase in OR from pessimal to preferred α phase angles, relative to t2. Our results showed that the preferred phase was similar in both the high-amplitude and low-amplitude conditions (272.41° and 284.65°, respectively) with a PPE of 20.96% in the high-amplitude condition but 9.12% in the low-amplitude condition. These data further substantiate an interaction effect on observation probability between α wave phase and amplitude, which was observed in the prior ANOVA analysis.
Additionally, we estimated the 95% CIs for the group’s preferred phase, which are shown as the shaded regions along the circumference of the phase diagrams in Figure 7. These CIs could only be estimated since there is not a known method of computing an angular CI on non-unit vectors directly. To estimate these CIs, the 20 individual preferred phase vectors (with associated magnitudes) were first converted to cartesian values. Then, two 95% CIs were calculated separately on both the x and y components, indicated, respectively, as the width and height of the shaded rectangles in Figure 7B,D. The angular interval that encompassed both CIs was then used to represent the estimated 95% CI on the preferred phase in the polar space, although this interval will not necessarily be centered on the preferred phase (i.e., mean). In the high-amplitude condition, this resulted in an estimated 95% CI of [234.75°, 309.84°], and in the low-amplitude condition a much wider CI of [206.22°, 344.45°]. Together with Figure 7A,B, these results show that observations rates are highest at the trough of the α wave, and lowest at the peak. This effect is prominent in the high-amplitude condition, but also to a much lesser extent in the low-amplitude condition recalling these effects did not reach significance in the prior ANOVA.
Measuring induced activity
α Band desynchronization (ERD) and phase resetting are characteristically observed in response to relevant visual stimuli (Klimesch et al., 2011). These reflect neural processes resulting from the stimulus, not spontaneous activity, and so in our case are only expected in response to observed trials, not missed trials. To ensure that the amplitude and phase measurements made here are the result of spontaneous brain activity and not caused by the stimulus itself, we verified that a drop in amplitude or sudden change in phase coherence across trials does not occur in the observed trials at t2. Thus, we first calculated the percent change in α amplitude, relative to a prestimulus baseline period of −300 to −100 ms, for both the observed and missed trials and these are shown in Figure 8A. There was the expected characteristic drop in α amplitude (i.e., ERD) in response to the observed stimuli; however, the this was not observed until after 100 ms poststimulus, which did not drop below that of the missed stimuli until after 200 ms. This result indicates that measurements made at t2 (whose mean value is 75.56 ms and range = [62.99, 88.87] ms) were safe from stimulus induced amplitude effects.
To ensure that there was no event related phase resetting, we next calculated phase coherence across all trials, as well as just the observed and missed trials (Fig. 8B). As shown in Figure 8B, no phase resetting was apparent in the observed trials or in all trials together, especially before 150 ms. No event related phase resetting would be expected in the missed trials, and this was not seen. However, in the phase coherence across missed trials, there is a prominent peak measured at the 70.31-ms time point, marked by the arrow in Figure 8B, but this not an event related phase reset as will be described further in Discussion. In sum, Figure 8 confirmed that no event related amplitude change or phase reset occurred before 100 ms poststimulus, demonstrating that the phase and amplitude measurements taken in the present study are not the result of stimulus induced activity but of ongoing spontaneous oscillations.
t2 measurements versus prestimulus measurements
To evaluate the benefit of using the individual t2 time point for measuring the α phase effect on perceptual performance, we repeated the above analysis at a time point 100 ms before stimulus onset (t = –100 ms) and compared the results to those obtained above at t2. These measurements have been taken in previous studies in prestimulus periods or at stimulus onset to avoid measuring induced oscillatory effects as mentioned above (Mathewson et al., 2009, 2012; Milton and Pleydell-Pearce, 2016; Kizuk and Mathewson, 2017). Whereas in the previous analysis, the four phase bins centered at 0°, 90°, 180°, and 270° were chosen a priori based on their theoretical significance at the t2 time point, no prediction could be made about the relevant phase bins at this prestimulus time point. Instead, the analysis was first performed in 90° phase bins in 1° increments, as was performed previously (i.e., in Fig. 7), and the four relevant phase bins were chosen empirically based on the observed preferred phase in the high-amplitude condition. In other words, the phase bins were chosen post hoc to reveal the highest effect phase has on amplitude for this prestimulus t = –100 ms time point.
The results of this analysis performed over all 90° bins in both the high-amplitude and low-amplitude conditions are shown in Figure 9. In the high-amplitude condition, a phase effect on perceptual performance can again be observed, however in this case the preferred phase was measured at 33.53° (95% CI = [−31.69°, 80.29°]) with a 15.48% PPE. In the low-amplitude condition, the preferred phase was measured at −17.22° (95% CI = [−90.04°, 90.07°]) with a 5.22% PPE. Based on the preferred phase measured in the high-amplitude condition for prestimulus t = –100 ms, the four 90° phase bins centered at 33°, 123°, 213°, and 303° were therefore chosen for the 2 × 4 repeated measures ANOVA analysis. Mauchly tests indicated no significant violation in the assumption of sphericity for phase (χ2(5) = 5.194, p = 0.3926), or the interaction of phase and amplitude (χ2(5) = 8.861, p = 0.1147). As shown in Table 2, only the main effect of phase was found to have a statistically significant effect on ORs. Despite these phase bins showing the highest phase effects at t = –100 ms, phase, whose bins were chosen a priori based on theoretical significance, had a stronger effect on observation when the measurements were taken at t2 (with roughly twice the effect size).
Figure 10 shows the multiple comparisons result for this data collected at 100 ms before stimulus onset for better comparison to those from t2 shown in Figure 5. Bonferroni corrected independent t tests compared each ΔOR value to zero, but none were found to be significant, indicating that none of the ORs in any of these four phase bins differed from the overall mean OR either amplitude condition. Tukey’s HSD tests (indicated by *) were used to compare ΔOR distributions between phase bins within each amplitude level, but these tests only found a significant difference between 33° and 213° (p = 0.0143), where there was a 15.47% difference in observations rates (95% CI = [2.39%, 28.55%]).
In our original hypothesis, we predicted that the high-amplitude preferred phase would be 270° at t2, which was approximately t = 75 ms on average. Assuming (1) a perfectly stationary and (2) an exactly 10-Hz signal, we would have predicted that the preferred phase at t = –100 ms, as well as at t0, would have been 0°. This was approximately what was found here; however, the inexactness is expected since neither assumption is exactly met in the natural α rhythm. The non-stationarity would additionally explain the decrease in effect size from t2 to t = –100 ms. Collectively, these results show that although an α phase effect on ORs in a prestimulus period can be observed, the strength of the effect is much weaker than when measurements are taken at t2. This finding shows that measurement of α oscillation amplitude and phase have a stronger correlation with visual perception at t2 than at t = −100 ms.
Discussion
This study presents a novel, controlled method for examining the neuronic shutter effect, hypothesized to be reflected in the posterior α oscillations observed in human EEG. Our hypothesis was that perceptual performance is modulated by the α rhythm in a precise phase and timing relationship. To test this relationship, we (1) presented near-threshold intensity visual stimuli through closed eyes to participants; (2) accounted for individual retina-to-V1 conduction delays (t2); (3) controlled for asynchrony between hemispheres; and (4) measured the α phase and amplitude relative to t2. Our results confirm the hypothesized relationship between the α rhythm phase and perception, with the greatest rates of observation occurring at the α wave trough (excitatory state) and the lowest rates at the α wave peak (inhibitory state). The perception-phase relationship appears to be modulated by α amplitude as it is observed at high, but not low, α wave amplitude. This work is novel in considering the underlying neural structure and function of the visual system when predicting the exact phase relationship to perception, and in controlling for individual neural conduction delay and asynchrony between hemispheres.
While the α wave shutter effect has been previously examined (Dustman and Beck, 1965; Mathewson et al., 2009, 2012; Milton and Pleydell-Pearce, 2016; Kizuk and Mathewson, 2017), this is the first study, to our knowledge, to directly examine the relationship between perceptual performance and spontaneous α phase as measured relative to each individual’s conduction delay. Previous studies have either measured α phase at stimulus onset, t0, or during prestimulus periods (Mathewson et al., 2009, 2012; Milton and Pleydell-Pearce, 2016; Kizuk and Mathewson, 2017). Accordingly, the phase measurements were not always well aligned with the LGN excitability state according to the proposed cellular mechanism. Here, when we accounted for each individual’s conduction delay, as opposed to the group average conduction delay or measuring at stimulus onset t0, we demonstrated the existence of a hypothesized relationship between the α phase and perception. Notably, these results provide the most robust evidence consistent with cyclic LGN excitability as the cellular mechanism mediating this effect, although we do not directly examine observe this mechanism. Dustman and Beck (1965) measured reaction times, as opposed to ORs, at α phase adjusted to group average, as opposed to individual conduction delay. This study, similar to ours, found slowest reaction times at the peak (90°) of the α wave, and fastest reaction times close to the α trough (240°).
A 2 × 4 repeated measures ANOVA found that α phase at t2 had a significant effect on perceptual performance as hypothesized. The interaction effect of phase and amplitude was also found to be significant; the multiple comparisons tests in Figure 5 show that low amplitude attenuates the phase effect. This supports the hypothesis that low-amplitude α waves reflect asynchronous inhibition within the visual system. Although not part of our original hypothesis, it is somewhat surprising that no main effect of amplitude was found since increased α power has sometimes been found to predict poorer perceptual performance (Ergenoglu et al., 2004; Mathewson et al., 2009; Brüers and VanRullen, 2018). This perception-power relationship is usually attributed to more generalized early visual inhibition as evidenced by the attenuation of early (C1 and N150) VEP components with increased α power (Iemi et al., 2019). Figure 7 gives a higher phase angle resolution view of the effects seen in Figure 5. In both the high-amplitude and low-amplitude conditions, the preferred phase was near the wave trough (272.41° and 284.65° at high amplitude and low amplitude, respectively). However, in the low-amplitude condition, the 95% CI of the preferred phase was much wider, and the PPE was lower (20.96% and 9.12% at high and low amplitude, respectively). These results show how α amplitude modulates the effect of α phase on perception.
To demonstrate the significance of taking α amplitude and phase measurements at t2, we repeated our analysis at t = −100 ms and compared them to those obtained at t2. This prestimilus time point was chosen for two reasons. First, given the risk in poststimulus measurements getting altered by stimulus processing, a prestimilus time point is, therefore, “safer” than poststimulus or stimulus-onset time points. Second, given that α oscillations have a primary frequency of 10 Hz (i.e., has a period of 100 ms), the amplitude-phase relationship would repeat roughly every 100 ms. Thus, our results at t = –100 ms could help to predict this relationship at stimulus onset, which is 100 ms (or one period) away but under safer signal analysis conditions. At prestimulus time point of −100 ms (t = −100 ms), we did find a significant phase effect, but by using different phase bins more relevant to that time point (i.e., 33°, 123°, 213°, 360°). These phase bins were chosen post hoc to reveal the highest effect phase has on observation for this prestimulus time point. Despite showing the highest phase effects, a phase whose bins were chosen a priori based on theoretical significance had stronger effects on observation when the measurements were taken at t2. Specifically, the new analysis shows that while phase was found to have a statistically significant effect on ORs between the preferred and pessimal phase bins in the high-amplitude condition, the effect size was weaker and roughly half of that at t2. These results show the advantage of making phase and amplitude measurements at t2 rather than in prestimulus periods.
Poststimulus measures of ongoing spontaneous oscillations are generally avoided since there is a risk that they will be altered by stimulus processing. We verified that our t2 amplitude and phase measurements were not influenced by that by measuring α band ERD and phase coherence over the −300- to 500-ms time range relative to stimulus onset (Fig. 8). No stimulus related ERD or phase coherence were observed before 100 ms indicating that our measures at t2 were independent of stimulus processing responses and instead reflected spontaneous activity. Interestingly, there was an overall increase in phase coherence values of the observed and missed trials compared with that of all trials combined (Fig. 8B). This increase is expected since, according to the phase effects on ORs found above, trials that are observed and missed will predominantly occupy specific, opposing phase ranges; thereby resulting in relatively high phase coherence across trials when grouped by observational outcome, and low phase coherence when grouped together. It is also important to note that the missed trial phase coherence trace features a prominent peak at 70.41 ms (Fig. 8B, arrow). We do not interpret this peak as an event related phase reset since (1) it is the peak of an upward trend beginning at least 300 ms before stimulus onset, (2) a decreasing trend is observed immediately afterward, and (3) no phase reset would be expected in response to an unobserved stimulus. Instead, we interpret this peak as marking the time where phase most accurately predicts that a stimulus will not be observed, which, in support of our hypothesis, is within 5 ms of the average estimated t2 (75.56 ms), the point hypothesized to best predict observational outcome.
Neural mechanism
Understanding the biological mechanism underlying α oscillation is important for explaining its function. Although simultaneous recordings with fMRI and PET correlated α activity with fluctuations in the thalamus, thus suggesting this might be the origin of α rhythm (Hughes et al., 2004; Omata et al., 2013), so far the precise mechanism in humans is not yet known. However, sleep spindles, a ∼10 Hz α-like oscillation that occurs in 1- to 3-s bursts during transition to sleep, do have a well-established mechanism. The sleep spindle mechanism includes a negative feedback loop between the LGN and the reticular nucleus (RN; Fig. 11). Both nuclei have T-type Ca+ channels that, when activated, exhibit burst firing. Between each burst of the LGN, the Ca+ channels of the LGN are known to enter a refractory period, and the RN sends a similar, but inhibitory, burst to the LGN. During this time, sensory information from the optic nerve would be less likely to be relayed to V1. Importantly, the refractory periods of these T-type Ca+ channels are known to result in each nucleus firing at a ∼10-Hz rate (Lopes da Silva, 1991; Sherman, 2001; Timofeev and Bazhenov, 2005; Alexander et al., 2006; Timofeev and Chauvette, 2011). Each LGN burst firing generates massive EPSPs in V1, measured as a negativity in the occipital EEG (Timofeev and Bazhenov, 2005) and producing the ∼10-Hz oscillation of sleep spindles seen on EEG. Chen et al. (2016) showed that, in mice, very similar mechanisms give rise to both sleep spindles and waking α oscillations and that the mechanism is capable of phasically modulating sensory transfer through the thalamus, and on this basis, they predicted the perceptual results found here. In cats, this mechanism was also found to produce waking α oscillations with the same phasic sensory gating effect (Lorincz et al., 2009). Although additional and alternate sources, including extrastriate cortical sources, have been proposed to give rise to this oscillatory activity (Steriade et al., 1990; Connors and Amitai, 1997; Bollimunta et al., 2008, 2011; van Dijk et al., 2008), there is much evidence supporting this thalamic mechanism as the primary driver (Hughes and Crunelli, 2005). Although the present study does not directly test this thalamic mechanism, it is offered as a purely speculative explanation for our results on the basis of previous and more direct research findings (Lorincz et al., 2009; Chen et al., 2016).
Cellular shutter effect generates a behavioral graded gating effect
Although our speculation and not directly tested here, if the neural mechanism posited above were correct, the neuronic shutter effect would then best be described as occurring at the level of individual LGN relay cells, in which the effect may occur at a nearly binary level. However, the behavioral effects we observe are graded, with increasing and decreasing probabilities of stimulus observation. This may be explained given that oscillating excitatory states are not always synchronized across all LGN cells. It is not necessarily the case that all cells are simultaneously in burst-mode firing state; some may concurrently operate in the normal tonic firing state. Thus, one cell may be in a state of inhibition and not relay to V1, while others may relay visual information as usual. However, in asynchronous states, the α rhythm should be decreased in amplitude. As shown in this study, at higher α amplitudes, the differences in ORs between peak and trough phases at t2 increased, presumably indicating that the measured phase becomes more representative of the excitatory state of larger populations of LGN cells with increasing α amplitude. Asynchrony is also important to consider, not just among cells within an LGN but between the LGNs of the left and right hemispheres. As on a cellular level, one LGN may restrict the flow of visual information while the other relays it. Presumably, if all cells within and between LGNs were in perfect synchrony, behavior relative to peak/trough phase at t2 would reflect a perfect on/off visual shutter. This simple relationship was not observed in this study, and it is therefore more appropriate to describe the proposed neuronic shutter effect at the cellular level generating a graded, gating effect of perception at the behavioral level.
In sum, to our knowledge, this is the first study to examine the neuronic shutter effect with an exact phase and timing relationship to perceptual performance predicted by the underlying physiology of the visual system. The study design is novel and rigorous in controlling for individual visual conduction delays and hemispheric asynchrony. During times of high-amplitude α oscillations, we found participants on average are most likely to perceive stimuli at waveform trough (272.41°) with ORs 20.96% greater than at the opposing peak phase (92.41°). Given the rigor in our phase measurements, these results provide strong support for a mechanism that modulates perception described by the neuronic shutter effect and is reflected in the posterior α rhythm. Further studies with conduction delay-adjusted phase measurement are needed to investigate this shutter effect in the μ and tau rhythms over the sensorimotor and auditory cortices, to explore the possibility of this mechanism in other sensory systems.
Footnotes
The authors declare no competing financial interests.
S.M.E. was supported by Ball Aerospace under contract to the United States Air Force, the United States National Institute of Health–National Institute of Neurological Disorders and Stroke and National Institute on Aging Grant NS091836, and the National Academy of Sciences (NAS) and the United States Agency for International Development, NAS Subaward 2000009148. Any opinions, findings, conclusions, or recommendations expressed in this article are those of the authors alone, and do not necessarily reflect the views of USAID or NAS.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.