Abstract
Phase entrainment of cells by theta oscillations is thought to globally coordinate the activity of cell assemblies across different structures, such as the hippocampus and neocortex. This coordination is likely required for optimal processing of sensory input during recognition and decision-making processes. In quadruple-area ensemble recordings from male rats engaged in a multisensory discrimination task, we investigated phase entrainment of cells by theta oscillations in areas along the corticohippocampal hierarchy: somatosensory barrel cortex (S1BF), secondary visual cortex (V2L), perirhinal cortex (PER), and dorsal hippocampus (dHC). Rats discriminated between two 3D objects presented in tactile-only, visual-only, or both tactile and visual modalities. During task engagement, S1BF, V2L, PER, and dHC LFP signals showed coherent theta-band activity. We found phase entrainment of single-cell spiking activity to locally recorded as well as hippocampal theta activity in S1BF, V2L, PER, and dHC. While phase entrainment of hippocampal spikes to local theta oscillations occurred during sustained epochs of task trials and was nonselective for behavior and modality, somatosensory and visual cortical cells were only phase entrained during stimulus presentation, mainly in their preferred modality (S1BF, tactile; V2L, visual), with subsets of cells selectively phase-entrained during cross-modal stimulus presentation (S1BF: visual; V2L: tactile). This effect could not be explained by modulations of firing rate or theta amplitude. Thus, hippocampal cells are phase entrained during prolonged epochs, while sensory and perirhinal neurons are selectively entrained during sensory stimulus presentation, providing a brief time window for coordination of activity.
Significance Statement
Neural activity during theta oscillations (6–12 Hz) has long been considered a mechanism for interareal communication, but its temporal dynamics in relation to sensory and mnemonic processing are still poorly understood. We report how sensory neocortical and hippocampal areas temporally coordinate their activity with local field potential activity in the theta band during a behavioral task involving multisensory object discrimination and recognition. Theta phase entrainment in sensory cortical areas selectively occurred during behavioral task epochs where object information was presented in the preferred stimulus modality of a given area. This entrainment was largely independent of firing rate. These findings support the framework of theta-band synchrony as a mechanism for facilitating corticohippocampal communication during sensory and mnemonic processing.
Introduction
The rodent hippocampal local field potential (LFP) expresses prominent theta oscillations (6–12 Hz) during awake behaviors such as locomotion (Sinnamon, 2006; Kropff et al., 2021), whisking during object sampling (Grion et al., 2016), orienting behaviors (Whishaw and Vanderwolf, 1973; Buzsáki, 2009), and REM sleep (Buzsáki, 2009; Bandarabadi et al., 2019). Hippocampal theta oscillations are generated through synchronized synaptic input from the medial septal area (Brandon et al., 2014; Quirk et al., 2021) and represent the coordinated activity of neural ensembles (Buzsáki et al., 2012; Einevoll et al., 2013). Such coordination is thought to facilitate memory encoding (read-in) and retrieval (read-out; Battaglia et al., 2011; Colgin, 2013; Lisman et al., 2017; Buzsáki and Tingley, 2018). In addition, phase entrainment of cell firing by theta oscillations may facilitate communication between areas. In contrast to phase coding (e.g., the encoding of an animal's trajectory through a place field by spike timing relative to theta phase; O’Keefe and Recce, 1993; Huxter et al., 2008; Petersen and Buzsáki, 2020; Yiu et al., 2022), phase entrainment does not imply information coding in spike timing but rather suggests a mechanistic function of oscillations in communication and information transmission. The synchronized input of an entraining entity, whether internal (such as the hippocampus) or external (such as sensory input), could bias spike timing in receiver areas (Salinas and Sejnowski, 2000) and thereby increase synchrony within and between areas and improve interareal communication (Bosman et al., 2014; Singer, 2018). To clarify, different definitions of “synchrony” or “synchronization” are used in the literature. Whereas “synchrony” traditionally refers to neural events occurring simultaneously, or without a phase lag in case of periodic signals, we will use the term in the sense of “phase synchronization,” the phenomenon that periodic signals oscillate with a consistent phase angle relative to each other (Vinck et al., 2011).
Evidence for phase-entrainment by theta oscillations has been found in many areas along the corticohippocampal hierarchy, including somatosensory cortex (Grion et al., 2016; Vinck et al., 2016), visual cortex (Vinck et al., 2016), auditory cortex (Parto Dezfouli et al., 2019), and medial temporal lobe areas such as perirhinal (Bos et al., 2017; Ahn et al., 2019), postrhinal (Furtak et al., 2012), and entorhinal cortex (Alonso and García-Austt, 1987), as well as in subcortical regions such as the ventral striatum (Lansink et al., 2016). Phase entrainment of cells throughout the neocortex by theta oscillations might serve as a mechanism to process sensory information (Kleinfeld et al., 2016; Lakatos et al., 2019) and coordinate memory storage and retrieval required for episodic memory between neocortical areas and hippocampus (Csicsvari et al., 1999; Buzsáki, 2009; Battaglia et al., 2011).
Phase entrainment of cortical cells has been mainly studied in the context of spatial processing and only a few studies addressed phase entrainment by theta oscillations during discrete stimulus processing in freely behaving rodents (Grion et al., 2016). As a result, the temporal dynamics of neocortical phase entrainment and its association with particular behaviors elicited by precisely timed sensory stimuli remain largely unclear. Does phase entrainment of cortical cells manifest itself selectively during epochs of heightened sensory processing and object recognition? Furthermore, how does neocortical theta phase entrainment compare with entrainment in the hippocampus along the time course of sensory discrimination trials?
To answer these questions, we performed ensemble recordings in freely behaving rats during a multisensory two-alternative choice task in a T-maze. Our behavioral paradigm consisted of distinct trial epochs: (1) approach to the object sampling port, (2) sensory object discrimination and recognition, (3) approach to a reward site, and (4) trial outcome. During this experiment, we simultaneously recorded neural activity from two sensory neocortical areas (primary somatosensory barrel cortex and secondary visual cortex) and two medial temporal lobe areas (perirhinal cortex and hippocampus).
Our results show phase entrainment of neocortical cells by hippocampal and neocortical theta oscillations during the sensory discrimination and object recognition phase, while these cells were not phase entrained during navigation and consumptive behavior. In perirhinal cortex, phase entrainment was selective for sensory discrimination as well, with some cells selective for the tactile and other cells selective for the visual modality. In contrast, hippocampal ensembles showed theta phase entrainment throughout the behavioral trial, such that different cells tessellated successive task phases. Our findings support a general role for theta oscillations in temporally organizing and synchronizing spiking activity in the hippocampus, whereas the entrainment in the sensory and perirhinal cortices is especially enhanced during the restricted period of sensory object processing.
Materials and Methods
Experimental design and statistical analysis
Experimental animals
Four male Lister Hooded rats (aged 7–10 months) were housed in pairs during behavioral training. Following implantation, they were individually housed in transparent cages (40 × 40 × 40 cm). All rats were maintained on a reversed 12 h light/dark schedule and tested in the dark phase. During all experiments, rats were food deprived, maintaining body weight of at least 85% of their free-feeding body weight. The experiments were performed in accordance with the National Guidelines on Animal Experiments and were approved by the Animal Experimentation Committee of the University of Amsterdam.
Behavioral apparatus
Behavioral training was performed in a darkened room on a T-shaped, elevated platform [Fig. 1A; width 30 (base) × length 60 (arms left to right) × height 60 cm]. A pneumatic door controlled access to the sampling area, the sampling area itself was marked by a gap wherein the rat could only reach with its head, but not any other body parts. The presence of the animal at the sampling area was tracked using an infrared sensor detected by a phototransistor (Fig. 1A, IR sensor 1). Similarly, entry of the animal into the reward ports was tracked (Fig. 1A, IR sensor 2 and 3). Sucrose solution (15% in water, 60–70 µl) was delivered as reward through fluid wells positioned at the end of the side arms of the maze. Nose pokes by the rat into the well, and licks in the reward well, were tracked using an infrared sensor and phototransistor.
The sampled objects consisted of two duplicated pairs of Lego configurations mounted on a rectangular rotatable platform controlled by a stepper motor (Fig. 1C). To control for motor sounds during object rotation, the objects were rotated randomly at the start of the intertrial interval (ITI). Objects mounted on the long side of the platform were within the rat's whisking range and presented during tactile and multisensory trials (∼15–16 cm away from the maze). Objects mounted on the short side of the rectangle were outside the rat's whisking range, behind a 22–23 cm gap, and were presented during visual and probe trials. During visual and multisensory trials, the object was illuminated using white light LEDs.
Behavioral paradigm
Each trial of the two-alternative forced choice multisensory discrimination task began with an ITI of 12 s in duration, which was followed by the opening of the door giving access to the sampling area. Rats entered the sampling area, where one out of two objects was presented in the tactile, visual, or multisensory condition. These objects remained the same during the training and experiment period and were familiar to the animal during recording. During the tactile condition the rat could only perceive the object using its whiskers (the experimental setup was in a dark room), during the visual condition the rat could only see, but not touch, the object and during the multisensory condition the rat could sample the object both with its whiskers and by way of visual access. Each of the objects was associated with one response side, that is, the left or right arm of the maze, and if the rat would poke into the fluid well at the correct side, it was rewarded with sucrose solution. Thus, in addition to object discrimination, the task involved object recognition based on which rats had to choose the side arm associated with the object presented in the current trial. For instance, when recognizing object A, the rat had to make a response into the left arm to be rewarded at the left fluid well, whereas the opposite arm had to be chosen after sampling object B. The object-side coupling remained constant across all sessions of a given rat. In case of an incorrect response, the ITI was extended by 8 s. Behavioral training spanned several months and was complete once stable performance of 70% correct for each modality during seven consecutive days was reached.
Definition of behavioral epochs
Five types of behavior were defined: baseline behavior, approach, sampling, navigation, and outcome. For each of these epochs, a trigger emitted by the behavioral apparatus was used as reference time point; for analysis we used a 500 ms epoch following and aligned to this trigger to segment the data. The epochs were defined as follows (Fig. 1B):
Baseline: 6–6.5 s following the nose poke for reward (trigger: IR sensor 2 or IR sensor 3) in the foregoing trial. The animal rests, grooms, or wanders around the platform.
Object approach: −0.5 to 0 s before passing IR sensor 1, during which the animal navigates toward the sampling area.
Sensory discrimination: 0–0.5 s following the passage of IR sensor 1, which triggered light onset in the sampling area (during visual and multisensory trials) and allowed for tactile palpation with the whiskers of the stimulus (during tactile and multisensory trials). The animal engages in sensory sampling of the object (trials wherein the animal withdrew its head before 200 ms after passing IR sensor 1 were excluded from analysis).
Reward site approach: −1 to −0.5 s before the nose poke for reward (trigger: IR sensor 2 or IR sensor 3). The animal chooses a side and navigates to one of the reward locations. One of these wells is associated with the sampled object and is rewarded when chosen.
Outcome: 0–0.5 s following the reward poke (trigger: IR sensor 2 or IR sensor 3). The animal pokes into the reward well and receives a reward in case of a correct response.
Electrode implantation and surgery
Tetrodes were constructed from four twisted 13 µm coated nichrome wires (California Fine Wire). The electrode tips were gold plated to reduce electrode impedances to 300–800 kΩ at 1 kHz. Tetrodes were loaded into a custom-built microdrive containing 36 individually movable tetrodes (Lansink et al., 2007; Bos et al., 2017). In Rats 1 and 2, the drives targeted eight tetrodes to the perirhinal cortex (area 35/36; target coordinates in mm: −6.0 AP, 7.0 ML, 6.5 DV; Paxinos and Watson, 2013), eight tetrodes to dorsal hippocampal CA1 and CA3 area (−3.48 AP, 2.0 ML, 2.5 DV, aimed at the hippocampal pyramidal cell layer), eight tetrodes to secondary visual cortex (V2L; −6.0 AP, 5.8 ML, 2.8 DV), and eight tetrodes to the somatosensory cortex (S1 Barrel Field; −3.0 AP, 5.0 ML, 2.8 DV). For Rats 3 and 4, the drives were modified such that 13 tetrodes targeted the perirhinal cortex, 13 tetrodes the secondary visual cortex, and six tetrodes the somatosensory cortex, while maintaining the same target coordinates as in the previous drive.
Prior to surgery, rats received a subcutaneous injection of buprenorphine (Buprecare, 0.04 mg kg−1), meloxicam (Metacam, 2 mg kg−1) and enrofloxacin (Baytril, 5 mg kg−1). Anesthesia was induced by placing the animal in a closed plexiglass box filled with isoflurane vapor (3.0%). During surgery, the animals were mounted in a stereotaxic frame where anesthesia was maintained using isoflurane (1.0–2.0%) and body temperature was maintained between 35 and 36°C using a heating pad. Local anesthetic (lidocaine) was applied directly on the periosteum before exposing the skull. The skull was thoroughly cleaned with a 3% hydrogen peroxide solution to roughen the skull surface. The hydrogen peroxide was removed by rinsing the skull three times using saline. Six screws, from which one in the occipital bone served as ground, were inserted into the skull to improve the stability of the implant. After craniotomy and durotomy were performed, the drive was positioned over the craniotomies. The craniotomies were sealed using silicone adhesive (Kwik-Sil), the skull was covered with a layer of dental adhesive (OptiBond, Kerr) and primer (3M Transbond), and finally the drive was anchored using dental cement (Kemdent). Finally, all tetrodes were lowered into the superficial layers of the cortex. From the third day, tetrodes were gradually lowered toward their target location regions over a span of 7–10 d. Postoperative care included a subcutaneous injection of meloxicam once per day for 2 d following surgery and a single injection of enrofloxacin on the day following surgery.
Data acquisition
For data collection, the rat was connected to the recording equipment (Digital Lynx SX, 144 channels, NeuraLynx) via operational amplifiers (HS-36-LED, NeuraLynx) and a tether cable connected to a commutator (Saturn-5, NeuraLynx). During the days following surgery, tetrodes were lowered to their target location while their current location in the brain was estimated based on the distance the tetrode was moved in the drive and assessing the LFP and spike signals. During experiments, the neurophysiological signals were acquired continuously at 32 kHz, 0.1 Hz high-pass filtered, and stored on a hard drive for further processing. An infrared camera (EVI-D100P, Sony) positioned above the behavioral setup recorded behavioral activity illuminated by infrared LEDs at 25 frames per second. Per trial a high-speed video camera (M3, IDT) recorded whisker motion, illuminated by an infrared backlight (wavelength, 850 nm; Sygonix), at 500 frames per second. The training apparatus was controlled using a Field Programmable Gate Array and custom-written MATLAB software (MathWorks). Programmed commands and events recorded by sensors were sent to and recorded by the recording system as TTL pulses.
Histology
After the final recording session, currents (12 µA, 10 s) were applied to one lead of each tetrode to mark its endpoint with a small lesion. Twenty-four hours after lesioning, the animals were deeply anesthetized with Pentobarbital (Nembutal, Ceva Sante Animale, 60 mg ml−1, 1.0 ml intraperitoneal) and transcardially perfused with saline followed by a 4% paraformaldehyde solution, pH 7.4 (phosphate buffered). After postfixation, the brain was sectioned into 40–50-µm-thick slices using a vibratome. A Nissl staining (cresyl violet) was set to mark cell bodies. The sections were imaged and aligned to the 3D Waxholm reference atlas (Papp et al., 2014), using the QUINT workflow (Yates et al., 2019). Cells were excluded from analysis if their reconstructed recording locations were not within the anatomical borders of our regions of interest.
Spike sorting
Recorded spikes were detected and clustered offline using the Klusta package; spike clusters were manually curated using the Phy gui (Rossant et al., 2016). The quality of the curated clusters was assessed using SpikeInterface (Buccino et al., 2020). Single units were included for analysis if they met the following criteria: signal-to-noise ratio (of the mean waveform amplitude vs the background noise on the same channel) >2.5, <0.5% of spikes violating the refractory period, isolation distance >5, and firing rate over the whole session >0.1 Hz. Waveform characteristics were extracted using the SpikeInterface toolbox and used to classify cells as putative excitatory (pyramidal, PE) or putative inhibitory, fast spiking (FS) cells. First, cells were excluded from classification if no prominent spike waveform was detected (minimum required amplitude was 50 µV relative to background). Thereafter the distribution of the trough-to-peak latency was inspected, and a bimodal distribution, split at 400 µs, was found. Cells were then classified as PE if their trough-to-peak latency was larger than 400 µs or as FS if their trough-to-peak latency was smaller than 400 µs.
Multiple-comparisons correction
Reported results are corrected for multiple comparisons, using Bonferroni’s correction, except for the surrogate permutation tests (bootstrap estimation and permutation shuffling, see below), in which case an alternative to cluster-based correction was used.
Analysis of LFPs
For each session, one lead per recording area was selected for LFP analysis. LFP data recorded from CA3 in Rat 2 were inverted. First, these channels were included based on quality metrics: signal amplitude, signal standard deviations, and lack of 50 Hz artifact in the LFP, throughout recording sessions (Li et al., 2020). From the selected channels, for each session and each area, the channel with the strongest theta oscillation was selected based on manual curation, selecting for regular oscillations and the highest amplitude. The LFP data were visually inspected for noise artifacts, and trials with noise artifacts were excluded from the analysis. Finally, the leads were downsampled from 32 kHz to 1,000 Hz to improve processing speed using the SciPy decimate function.
LFP power and LFP–LFP coherence
We investigated the LFP dynamics by computing the time-frequency dependency of LFP power via the Morlet wavelet transform (7 cycles, computed using the MNE toolbox, tfr_array_morlet; Gramfort, 2013). For the 1/f correction of the power spectral density (PSD), the 1/f distribution was estimated using the IRASA method (Donoghue et al., 2020). To detect whether LFP amplitude was significantly increased and at which frequencies compared with baseline, we used Friedman's test for repeated measurements (p < 0.01) and a post hoc Wilcoxon signed-rank test (p < 0.01).
We focused on theta oscillations as we were interested in interareal communication between dHC and neocortex, and theta oscillations have been implicated in such communication during sensory and mnemonic operations (Buzsáki, 2009). To confirm this, we measured the LFP–LFP coherence between neocortical areas S1BF, V2L, and PER LFPs and dHC LFPs using the Weighted Phase Lag Index (WPLI; Vinck et al., 2011). This measure compensates for volume conduction and approaches zero in case of low coherence and 1 in case of high coherence. Statistical significance of the WPLI was evaluated using a permutation test shuffling the LFP phases (n = 500 shuffles).
Spike–LFP phase entrainment
For the spike phase analysis, the theta-band phase and power were estimated by first bandpass filtering the LFP [6–12 Hz, 4th order Butterworth filter, Elephant toolbox (Ulianych et al., 2021), butter], computing the analytic signal using the Hilbert transform (Elephant toolbox, hilbert) and finally taking its angle and magnitude to quantify the LFP phase and power, respectively. While the LFP was subsampled at 1,000 Hz and the spike data at 32 kHz, we interpolated the LFP phase data onto the spikes to retrieve the spike-triggered phase distribution. When computing spike–LFP coherence for a given cell, we ensured to select an LFP signal from a different tetrode than the tetrode on which that cell was recorded.
First, we asked what fraction of each cell population was significantly entrained during subsequent behavioral epochs (Fig. 5). Cells were considered significantly entrained if their spike-triggered phase distributions were nonuniform, as measured using the Rayleigh's test for uniformity (Astropy toolbox; The Astropy Collaboration et al., 2022; stats.rayleightest; with p value lower than α = 0.05). The same measure was used to detect whether cells were phase entrained by same-area LFP or dHC LFP. The significance of the difference between the fractions of cells locking was assessed using a bootstrap approach: the fraction of cells locking was recomputed by resampling from the cell population with replacement (1,000 resamples); based on the reshuffled data, the 95% confidence intervals were constructed. Fractions of cells were considered significantly different if the confidence intervals did not overlap. To compare the strength of phase entrainment on same-area LFP and dHC, we used the pairwise-phase consistency (Vinck et al., 2012); the significance of the difference between these types of LFP signal was assessed with Wilcoxon's signed-rank test.
Correlation between firing activity and theta power
To investigate the modulation of firing activity by theta power, we grouped the spiking data in deciles based on theta power derived from same-area LFP: per trial 1 s segments of theta-band filtered LFP power were sampled from the period −2 to 10 s around trial start. Over the full session, these segments were ranked in deciles based on their power. For the same segments, we computed the average firing rate for each recorded cell. We visualized the cell's firing rate versus theta power (Fig. 6). Then we determined whether a cell was significantly modulated by theta power by comparing the bootstrap estimated average firing rate in the first decile versus the bootstrap estimated average in the last decile (bootstrap resampling from firing rates per 1 s segment, 1,000 resamples, 95% confidence intervals). Cells were considered significantly modulated (i.e., their firing rate increases or decreases as a function of theta power) when the confidence intervals from the estimated averages did not overlap.
Temporal dynamics of phase entrainment
Next, we investigated the temporal dynamics of phase entrainment (Figs. 7, 8). We computed the cell's phase-locking strength over LFP phases in all trials, binned using a 200 ms wide window shifted by 10 ms steps over a −0.5 to +1.5 s interval relative to sample start. We chose this time window as it included three epochs of interest: object approach, discrimination, and reward site approach. As the behavior was self-paced, the duration of each trial phase varied between trials, and taking a longer time window would introduce more variance in the behavior the further away it is measured from the aligning trigger (sample onset). Because of the narrow window (200 ms), and the resulting low spike counts, we used the pairwise-phase consistency (PPC; Vinck et al., 2012) to evaluate the locking strength, as the PPC is not biased by low sample size. We used permutation tests (n = 500 samples) to assess the significance of the measured PPC in each time bin. For each time bin, the PPC was computed on Nspikes LFP phases which were randomly sampled out of all LFP phases recorded during a session in the −0.5 to +1.5 s time window around sample start, where Nspikes is the number of spikes in the bin under scrutiny. The p value was measured as the empirical datapoint's rank in the shuffled distribution divided by the number of permutations (i.e., if 2 out of 500 shuffled datapoints were higher than the empirical data point, then p < 0.01). This method corrects for the nonuniformity of the LFP phase distribution (hippocampal theta oscillations may take the form of a “sawtooth” pattern) and the number of spikes occurring during a bin. To further reduce noise in this analysis, cells were excluded if, during any of the behavioral epochs, <50 spikes were measured throughout the full recording session. Cells were deemed significantly phase entrained if p < 0.01 but were excluded from being significant if bins were not marked as significant for at least 100 consecutive milliseconds ( = 10 consecutive bins) during the full analyzed time window of 2 s duration. This approach was derived from cluster correction, a multiple-comparisons correction for permutation testing over many samples (Maris and Oostenveld, 2007). We visualized the timing of phase entrainment over the recorded population by normalizing the PPC per cell to its maximum value (compare Figs. 7B, 8A4–B3).
To report phase entrainment across all behavioral epochs (the outcome and baseline epochs were not included in the −0.5 to +1.5 s time window), we also report the phase entrainment on data aligned to the start of each behavioral epoch (baseline, object approach, discrimination, reward site approach, and reward). This approach corrects for the variance introduced into the data by the self-paced free behavior of the animal (i.e., variance in sampling duration, locomotion, etc.). Significance between the PPC values measured in these behavioral epochs was assessed using a pairwise t test (p < 0.05; Bonferroni’s corrected).
Modality selectivity of phase entrainment
We assessed whether cells were selectively phase entrained during tactile, visual, and/or multimodal stimulus conditions. To this end, we computed the PPC during the sensory discrimination epoch per modality. Spike-triggered phases were grouped by the modality of the presented trials. A threshold for the significance of a cell's phase entrainment by a modality was determined using random permutation testing: the PPC was measured 500 times on random LFP phases drawn without repetition from the sampling epochs. The p value was measured as the rank of the empirical PPC in the shuffled distribution (i.e., rank = 2 out of 500: p = 0.04). Measured PPC values were considered significant if p < 0.05.
To verify the significance of the reported fractions of cells phase entrained per modality (Fig. 9A1–C1), the fractions were re-estimated using a bootstrap procedure (1,000 shuffles) and 99% confidence intervals of the fraction of phase-entrained cells were estimated. Reported fractions were considered significant if the estimated confidence intervals did not overlap with zero.
The average strength of phase entrainment per recording area (Fig. 9A2–C2) was computed over all cells previously marked as phase entrained. Differences between modality conditions were evaluated with the Kruskal–Wallis test and a post hoc Wilcoxon's signed-rank test (p < 0.01).
Finally, we assessed whether modulations in firing rate or theta amplitude could explain any modality-selective effects. To determine the effect of variable firing rate between modalities, we recomputed the PPC for each cell and each modality but sampling only Nspikes for each modality, where Nspikes is the number of spikes recorded in the modality with the fewest spikes. Thus any effect purely driven by an increase or decrease in firing rate should not affect the outcome of this analysis. To determine the effect of theta amplitude, we measured the theta amplitude (i.e., the power from theta-bandpassed LFP) occurring during spikes in every modality condition. Differences between groups were assessed using the Kruskal–Wallis test and considered insignificant if p > 0.01.
Results
Electrophysiological recordings in a multisensory object discrimination task
Rats were trained on a two-alternative forced choice discrimination task with solid, 3D objects (Fig. 1A–C, c.f. Fiorilli et al., 2024). The rat's behavior was self-paced and tracked using infrared (IR) light sensors in the object sampling area (Fig. 1A, IR sensor 1) and the reward sites (Fig. 1A, IR sensors 2 and 3). Following trial start, the door blocking the sampling area was lowered and the rat could detect the stimulus (Fig. 1A, sampling area). In each trial, one of two objects (Fig. 1C, object A or object B) was presented. The sensory modality wherein the object was detectable during a given trial varied between tactile (T, object reachable for whiskers but in darkness), visual (V, object not reachable for whiskers but illuminated), or both senses combined (multisensory; M, object reachable for whiskers and illuminated). In order to sample the object, the rat reached with his head into the sampling area (triggering IR 1, sample start). Following object sampling, the rat retracted its head from the sampling area and moved to one of the arms of the elevated platform to retrieve a reward (triggering IR 2 or 3, reward poke). A sucrose solution reward was delivered when the rat poked at the reward site associated with the object on display. Following a poke, an ITI of 12 s started. In case of an incorrect response, no sucrose was delivered, and the ITI was extended by 8 s. Because the experiment was self-paced, the time spent in each trial phase varied per animal and trial (see Materials and Methods).
We segmented the behavioral trials into five 500 ms epochs, using the infrared sensors to track the animal's behavior (Fig. 1B): object approach (the animal navigated to the sampling area, 0.5–0 s before sample start), sensory discrimination (the animal entered the sampling area and identified the object, 0–0.5 s following sample start), reward site approach (the animal moved from the sampling area and navigated to its site of choice, either the left or right reward site, −1 to −0.5 s before reward poke), outcome (the animal poked the reward site and was rewarded upon a correct choice, 0–0.5 s following reward poke), and baseline (taken as the interval of 6–6.5 s following the reward poke). Trials wherein the animal started withdrawing from the sampling area before 200 ms were excluded from analysis. Rats (n = 4) learned to discriminate between the two objects and reported the correct object identity more frequently in the multisensory condition M than those in the unisensory conditions (Fig. 1E; T vs M: t(24) = 6.30, p < 0.01, paired t test and V vs M: t(24) = 7.03, p < 0.01, paired t test), averaged over four animals, 25 sessions: T, 72%; V, 72%; M, 84% correct response rate (see Table 1 for trial count per animal and modality), while performance between unisensory modalities did not differ significantly (T vs V, t(24) = −0.85; p = 0.40; paired t test). For all modalities, performance was significantly higher than during probe trials (51% correct response rate during probe trials; chance level, 50%; T vs P: t(20) = 3.87, p < 0.01; V vs P: t(20) = 3.75, p < 0.01; M vs P: t(20) = 6.81, p < 0.01, paired t test). Behavioral performance was stable across recording days (example data in Fig. 1D). Ensemble activity and LFP traces were recorded from the somatosensory barrel cortex (S1BF), secondary visual cortex (V2L), perirhinal cortex (PER), and dorsal hippocampus (dHC, subfields CA1 and CA3). The recording sites were verified using histological reconstruction of tetrode tracks (Fig. 2; see Materials and Methods). Cells were only included if their reconstructed recording locations were within the region of interest.
Theta oscillations in the hippocampus and neocortex during locomotion and sensory discrimination
Hippocampal theta oscillations (6–12 Hz) have been shown to increase in power during particular behaviors such as locomotion and whisking (Grion et al., 2016; Kropff et al., 2021). We therefore expected increased HPC theta LFP power during engagement of the rat in our behavioral paradigm. For each session, we selected one lead per recorded area per session (for example data, see Fig. 3A1–D1, areas S1BF, V2L, PER, dHC; see Materials and Methods) and computed the average LFP power in the theta range across the lead [Fig. 3A2–D2; data pooled across all sessions: S1BF (all four rats); 21 sessions, V2L (all four rats); 23 sessions, PER (all four rats); 21 sessions, dHC (two out of four rats); 17 sessions]. A clear peak in the theta range was visible in all recorded areas before and during epochs of object approach, sensory discrimination, and reward site approach, which decreased following the nose poke for reward (Fig. 3A2–D2). To quantify the modulation of theta power during behavior, we computed the PSD per behavioral epoch, corrected it for the 1/f distribution, and compared it against baseline (Fig. 3A3–D3). Relative to baseline, this corrected measure of theta power was significantly enhanced during the object approach, sensory discrimination, and reward site approach epochs in all areas (p < 0.01 for all areas; one-sided Wilcoxon signed-rank test). Theta power was also significantly increased during the outcome epoch in dHC, but not in the neocortical areas (p < 0.01; one-sided Wilcoxon signed-rank test; multiple comparison corrected).
The stimulus onset was paired with a shift in the theta peak frequency in areas S1BF, V2L, and dHC (Fig. 3A2–D2,A3–D3; nonoverlapping bootstrapped 99% confidence intervals). The onset of the frequency shift nearly coincided with the onset of stimulus and reached its maximum ∼500 ms following stimulus onset. Theta frequency has been shown to depend on acceleration in body motion (Kropff et al., 2021). This likely explains the shift detected here, as the animals slow down their movement when approaching the object and afterward accelerate again.
Neocortical theta oscillations cohere with the hippocampal theta rhythm
The LFP recordings in neocortical areas S1BF, V2L, and PER showed similar temporal dynamics as dHC in the theta range (Fig. 3A3–D3). These theta oscillations could be generated locally, within the neocortical areas. On the other hand, previous research has suggested that theta oscillations recorded from rat neocortex may arise through volume conduction of the LFP from the dHC (Gerbrandt et al., 1978; Sirota et al., 2008; Vinck et al., 2016). Thus the neocortical theta oscillations could be a reflection of theta oscillations generated in dHC.
We quantified the phase coherency between dHC and neocortical areas using the WPLI (Vinck et al., 2011), a metric of phase synchronization that is less affected by volume conduction, computed between dHC and neocortical LFP theta oscillations (Fig. 3A4–C4). We found significant phase coherence during all behavioral epochs between theta oscillations recorded from all three areas of the neocortex (p < 0.05; permutation test) and those recorded from dHC. As shown in Figure 3A4–C4, the strength of phase synchronization (WPLI) differed per behavioral epoch, with the strongest WPLI values being reached during object approach, sensory discrimination, and reward site approach phases (p < 0.01, one-sided Wilcoxon signed-rank test). All three neocortical areas showed similar profiles with their LFP signals referenced to dHC, and no significant difference between areas was found (p > 0.01; Friedman test), even though only S1BF and V2L reached a significant increase of WPLI coherence compared with baseline during several epochs. Taken together, these results show a widespread presence of coherent theta oscillations throughout the hippocampus and neocortex during task engagement by the animal.
Phase entrainment of neocortical and hippocampal cells by theta oscillations
We next investigated the phase entrainment of cells along the corticohippocampal hierarchy (areas S1BF, V2L, PER and dHC) during the different behavioral epochs, as we expected cells in these areas to be phase entrained depending on the engagement of these areas during specific stimuli and behaviors. We analyzed spiking activity from single units recorded across the four areas (example data in Fig. 4; S1BF, 167 cells; V2L, 371 cells; PER, 239 cells; dHC, 220 cells) while relating this activity to same-area and dHC theta (6–12 Hz) oscillations. We will refer to phase entrainment as the biasing of a cell's activity toward a specific phase in LFP activity as a result of external input, in the case of S1BF and V2L cells: incoming sensory information or top-down input conveyed through a descending pathway (Kleinfeld et al., 2016). The statistical significance of theta phase entrainment of individual cells was determined using Rayleigh's test (cells are marked as phase entrained if p < 0.05). Phase entrainment was quantified during each behavioral epoch and during the sampling epoch separately for each modality condition. In the following sections, we consider a cell phase entrained during the discrimination epoch if it is entrained during at least one of the three presented modality conditions (we further expand on modality selectivity in the final paragraph of the Results).
In both sensory neocortical areas, we found the largest fraction of significantly entrained cells on same-area LFP during the sensory discrimination epoch (Fig. 5A1–D1 and Table 2; S1BF, 37% (29–43%) of cells phase entrained, mean plus 95% confidence intervals; V2L, 23% (18–27%) of cells phase entrained), while the fraction of entrained cells was significantly lower during the other four epochs (bootstrap test, 95% CI). Interestingly, a higher fraction of cells was detected as phase entrained for S1BF and V2L populations to same-area LFP compared with hippocampal LFP (bootstrap test, 95% CI; compare Fig. 5A2–B2, Table 3). Thus we asked whether, if cells were phase entrained by same-area LFP, they were also phase entrained by dHC LFP. To answer this question, we labeled cells as phase entrained by same-area LFP only, by dHC-only, or by both, using Rayleigh's test. Only a small fraction of phase-entrained S1BF and V2L cells were significantly locking on both dHC and same-area LFP, while the largest fraction of cells was phase entrained on same-area LFP only (S1BF, 58% of cells only on same-area LFP, 12% of cells only on dHC LFP, 31% of cells on both same-area LFP and dHC LFP; V2L, 52% of cells on same-area LFP, 31% of cells on dHC LFP, 17% of cells on same-area and dHC LFP). We validated these findings by comparing them against another, unbiased, measure of spike–LFP locking: the PPC (Vinck et al., 2012). We measured the PPC on same-area and dHC LFP during sensory discrimination for each cell that significantly locked on either LFP (Fig. 5A3–B3). The PPC was higher when measured on same-area LFP compared with dHC LFP for S1BF cells (T = 338; p < 0.01; Wilcoxon's signed-rank test) but not for V2L cells (T = 162; p = 0.10; Wilcoxon's signed-rank test), indicating that the entrainment of S1BF cells is stronger with same-area LFP than dHC LFP but, on a population level, the entrainment of V2L cells is not preferentially stronger on same-area LFP compared with dHC LFP.
A small fraction of PER cells was phase entrained on same-area theta LFP during the sampling epoch [15% (11–18%) of cells phase entrained]. Cell preference for same-area or dHC LFP was heterogeneous (30% PER-LFP only, 19% of cells on dHC LFP, and 51% on both same-area LFP and dHC LFP). The PER cell population tended to be somewhat more strongly entrained to dHC LFP than same-area LFP (Fig. 5C3; p = 0.02; Wilcoxon's signed-rank test). Thus, we did find a rather sparse phase entrainment in PER, which did occur preferentially with respect to dHC LFP.
Given its role in episodic and spatial memory, the hippocampus is likely involved in at least some of the behaviors required to perform the task (e.g., object discrimination and recognition, spatial choice, and navigation). Indeed, we found a large fraction of cells phase entrained in dHC during all behavioral epochs (with a minimum of 47% during baseline and a maximum of 63% during object approach; Fig. 5D1). We recorded cells from dorsal hippocampal subfields CA1 and CA3, the pyramidal cells of which show functional differences in firing rate and phase entrainment (Mizuseki et al., 2012). These previous findings prompted us to investigate potential differences between phase entrainment in CA1 and CA3 in our task. We split the dHC cell population by subfields CA1 and CA3 and did not find differences in the fraction of phase-entrained cells (data not shown; bootstrap test, 95% CI), although the phase entrainment of CA3 cells was stronger than CA1 cells during sensory discrimination (T = 768; p < 0.01; Wilcoxon's signed-rank test) and reward site approach epochs (T = 942; p < 0.01; Wilcoxon signed-rank test). In the subsequent analyses, we pooled CA1 and CA3 data under a common denominator (dHC) because the fractions of phase-entrained cells in both subregions were not statistically different, and no further major differences were found between these subregions.
Next, we aimed to gain insight into how same-area theta power and neural activity are related to each other. To do so, we grouped the theta power of same-area LFP in 1 s epochs and grouped them in deciles. For every cell, we then computed the average firing rate per decile. Cells were considered significantly modulated if their firing rate was higher or lower than their firing rate in the first decile (bootstrap test, 1,000 resamples, 95% confidence intervals; see Fig. 6 and Materials and Methods). Further, we distinguished between phase-entrained cells and nonphase-entrained cells, to study whether or not phase-entrained cells are more likely modulated by theta power than nonphase-entrained cells.
In areas S1BF and dHC, we found a significant increase in average firing rate for phase-entrained cells [average change in firing rate: S1BF: +3.38 Hz (2.38–4.59 Hz); dHC: +4.55 Hz (3.39–6.09 Hz), mean plus 95% bootstrap confidence intervals] but not for nonphase-entrained cells [S1BF: 1.14 Hz (0.47–2.01 Hz) Hz; dHC: −0.09 Hz (−0.67 to 0.49 Hz)]. In V2L and PER, an increase in theta power was not paired with an increase in firing rate, neither for phase-entrained cells [V2L: 0.86 Hz (0.40–1.46 Hz); PER: 0.12 Hz (−0.10 to 0.38 Hz)] nor nonphase-entrained cells (V2L: 0.83 Hz (0.59–1.13 Hz); PER: 0.01 Hz (−0.16 to 0.18 Hz)]. We further looked at individual cells. In the neocortical sensory areas (S1BF and V2L) and dHC, single-cell firing rate was modulated by theta power for a considerable fraction of phase-entrained cells [percentage of phase-entrained cells modulated by theta power: S1BF: 35% (24–44%); V2L: 15% (9–21%); dHC: 34% (26–40%); mean plus 95% bootstrap confidence intervals]. However, for most phase-entrained cells, the increase of theta power was not paired with an increase or decrease in firing rate. Only very few nonphase-entrained cells in these areas were significantly modulated by theta power [percentage of nonphase-entrained cells: S1BF: 14% (6–23%); V2L: 8% (4–11%); dHC: 6% (0–12%)]. In PER no cells were modulated by theta power. Thus, the effect of firing rate modulation by theta power was strongest in S1BF and dHC, moderate in V2L and absent in PER, and generally more likely to occur in phase-entrained cells than nonphase-entrained cells (see above for nonoverlapping confidence intervals for these respective groups).
In summary, neuronal entrainment by theta oscillations was detected in sensory neocortical areas S1BF and V2L during object discrimination. S1BF and V2L cells appeared to be strongly and selectively phase entrained when objects were being sampled. Similarly, a smaller fraction of cells was phase entrained in PER during object discrimination, whereas a large fraction of hippocampal cells was phase entrained during all behavioral epochs. High theta power was coupled to higher firing rate for about a third of phase-entrained cells in dHC and S1BF (but less or not so in V2L and PER)—an excitatory relationship that was most notable in phase-entrained cells. These findings are in line with the hypothesis that neocortical phase entrainment in the theta band is intensified when there is a high demand for sensory processing and for the comparison between sensory input and object information retrieved from memory, the latter likely requiring interareal communication.
Phase entrainment in the hippocampus is prolonged and dissociable from firing rate
Our next aim was to explore the finer temporal phase-entrainment dynamics throughout the behavioral trial, elucidating the duration and strength of phase entrainment across the trial and its relationship with the cell's firing rate. We hypothesized that phase entrainment of cells is a neural mechanism different than rate coding (O’Keefe and Burgess, 2005), which we tested by comparing the temporal patterns of phase entrainment and firing rate.
We first characterized the phase distribution of dHC spikes relative to same-area theta oscillations over time (Fig. 7). To do so, we quantified the strength of phase entrainment over time using the PPC and a sliding window (width, 200 ms). We then detected epochs of significant phase entrainment for a cell using random permutation testing (500 shuffles, p < 0.01; cells were excluded from this analysis in case of low firing rates or an absence of significant phase entrainment; see Materials and Methods). This method allowed us to measure the onset and offset of phase entrainment in single cells.
The phase entrainment of single dHC cells (n = 80 cells) on same-area theta oscillations varied between individual cells, showing a variety of firing behaviors, including cells locking during locomotion (object approach) epochs (Fig. 7A1), cells locking before and during sensory sampling (Fig. 7A2), and cells locking throughout the full behavioral trial (Fig. 7A3). Characteristic for phase-locked dHC cells was their long locking duration (990 ± 389 ms, mean ± SD; Fig. 7D; see the next section for significant differences to V2L and S1BF) and the tessellation of all behavioral epochs by the whole population. All task segments were covered by different hippocampal cells (Fig. 7B). Phase entrainment was significantly increased compared with baseline during the object approach (T = 26; p < 0.01; Wilcoxon signed-rank test), sensory discrimination (T = 151; p < 0.01), reward site approach (T = 183; p < 0.01), and outcome (T = 671; p < 0.01) epochs (Fig. 7E). Across consecutive behavioral epochs, phase entrainment decreased, that is, phase entrainment was stronger in the object approach versus discrimination epoch, and the discrimination and reward site approach epochs contained stronger phase entrainment than the outcome epoch (p < 0.01 for all comparisons; Wilcoxon signed-rank test).
Phase-entrainment dynamics could depend on modulation of a cell's firing rate. For instance, if cells are inactive outside the sensory discrimination epoch, the selectivity of phase entrainment for sensory discrimination would be a result of the absence of firing activity outside the corresponding epoch. To examine whether phase entrainment of dHC cells can be dissociated from fluctuations in firing rate, we next asked to what extent the firing rate was positively correlated with the phase entrainment of cells, as would be expected if PE strength is strongly driven by an increase in firing rate. For a considerable fraction of the dHC cell population (30% of cells), the firing rate was not significantly correlated with phase entrainment (Fig. 7C; Pearson’s correlation; p < 0.01) versus 31% cells positively and 42% cells negatively correlated (Pearson’s correlation; p < 0.01). These data reveal a variety of phase-entrainment dynamics in the dHC cell population: cells are phase entrained for prolonged periods of time, and their firing rate is not a predictor for phase entrainment per se, supporting that individual cells may contribute to changes in synchrony dissociated from firing rate (O’Keefe and Burgess, 2005).
Phase entrainment in the somatosensory, visual cortex, and perirhinal cortex is transient and selective for sensory processing
Considering the increase in phase synchronization in S1BF, V2L, and PER during sensory discrimination (Fig. 5), we next asked how phase entrainment in these areas dynamically evolves over trial time. We repeated the previous analysis for S1BF and V2L populations (Fig. 8). For this analysis, we used theta oscillations recorded from the neocortical areas (same-area LFP), as we found cells to be more strongly entrained to cortically recorded theta oscillations than dHC recorded oscillations (Fig. 5). We repeated the analysis for PER cells as well, however, due to the low firing rates of PER too few cells reached significance to allow reliable interpretations.
The overall strength of phase entrainment of S1BF and V2L cells on same-area theta oscillations was weaker than that of dHC cells (S1BF: PPC = 9.47 × 10−3 ± 2.33 × 10−3, mean ± SEM; V2L: PPC = 1.06 × 10−2 ± 2.44 × 10−3; dHC: PPC = 1.18 × 10−1 ± 2.15 × 10−2; p < 0.01; Mann–Whitney one-sided test). In contrast to dHC, where cells were phase entrained throughout behavioral epochs (including the baseline epoch), neurons in S1BF and V2L were generally entrained in a short window following stimulus onset (Figs. 7A5,B5, 8A3,B3; locking durations: S1BF: 187 ± 209 ms; V2L: 180 ± 166 ms, mean ± SEM), significantly shorter than dHC locking durations (p < 0.01; Kruskal–Wallis, post hoc Mann–Whitney test).
We further looked into the effects of firing rate modulation on phase entrainment and thus quantified the Pearson’s correlation between a cell's firing rate and phase entrainment. For about half of the cells in both S1BF and V2L, the phase entrainment was not correlated to their firing rate (Fig. 8A6–B6; S1BF: 24% positively correlated, 20% negatively correlated, 56% not correlated; V2L: 44% positively correlated, 6% negatively correlated, 50% not correlated; p < 0.01). Therefore, the selectivity of neocortical phase entrainment for the sensory discrimination epoch cannot be attributed to modulations in firing rate alone.
Overall, these results show a marked, transient phase entrainment of sensory neocortical cells selectively during object sampling. Interestingly, for large fractions of cortical cells, the phase modulation of spike timing cannot be explained by changes in firing rate, as this parameter was uncorrelated with phase locking in these cells. Thus, phase entrainment in the two sensory cortices is selectively enhanced when the animal is engaged in object sampling, even though cells remain active outside the epochs of sensory discrimination.
Phase entrainment in the sensory cortices, but not the hippocampus, is selective for stimulus modality
If phase entrainment is a mechanism supporting sensory and mnemonic processing, we expect it to occur preferably during stimulus presentation in a sensory area's conventionally associated modality (i.e., the preferred modality for S1BF being tactile and for V2L visual). For each cell that was significantly phase entrained during the discrimination epoch (Fig. 5; S1BF: n = 54 cells, V2L = 85 cells, PER = 35 cells, dHC = 122 cells), we computed the phase-locking strength (PPC) during this epoch (0–500 ms following sampling onset) with the trials split per sensory modality. We asked first whether cells were significantly phase entrained (p < 0.01, permutation test) only during tactile trials (tactile-only, not phase entrained during V trials), during visual trials (visual-only, not phase entrained during T trials), during stimulus presentation in either modality (“both,” phase entrained during both T and V stimulus conditions), or during multisensory trials only (multimodal-only, no phase entrainment during T and V trials alone). Using this classification, we excluded cells that were not significantly phase entrained during any of the modality conditions (some cells marked as phase entrained throughout the session were not during the modality conditions due to the lower trial count). We found the largest fraction of cells in sensory areas S1BF and V2L to be phase entrained only during stimuli of the preferred modality of both areas (Fig. 9A1–B1; the percentages in this and the following paragraph are estimates of the mean percentage and 99% bootstrapped confidence intervals of modality phase-entrained cells), S1BF: 48% (31–65%) of cells tactile-only, and V2L: 48% (32–61%) of cells visual-only. However, in both areas a smaller but significant fraction of cells was phase entrained selectively for the nonpreferred modality [S1BF: 17% (6–30%) visual-only; V2L: 17% (8–31%) tactile-only].
In PER a significant fraction of cells was selectively phase entrained during tactile trials, but not during visual trials, and vice versa [tactile-only: 50% (21–73%) of cells; visual-only: 25% (4–46%) of cells; both T and V: 21% but n.s. relative to 0%]. Such separation of phase-entrainment preference for either modality could be due to separation of modality processing along the rostral-caudal axis of PER (Burwell and Amaral, 1998). In dHC, fewer cells were selective for a single modality [dHC: 13% (6–23%) of cells tactile-only; 15% (7–24%) of cells visual-only] but predominantly during both types of stimulus presentation [dHC: 65% (53–75%) of cells during both stimulus conditions].
We thus find that during visual stimulus presentation, a subset of S1BF and V2L cells is firing in sync with the theta rhythm, while the same set of cells is not firing rhythmically during tactile-only input. Furthermore, another subset of cells is phase entrained only during the presentation of both tactile and visual stimuli. The increase in synchrony of many sensory cortical cells is likely correlated to the engagement of these areas in unimodal processing, although a minority of cells is entrained in the nonpreferred modality.
We further asked whether the selectivity for modality in phase-entrained cells was represented by the locking strength and if the selectivity for modality in the sensory areas could be explained by modulations in firing rate or theta power. Phase-entrained cells in the sensory cortices S1BF and V2L were more strongly phase entrained, as assessed with the PPC measure, during trials of the modality associated with the area under scrutiny, compared with the nonassociated modality (Fig. 9A2–B2; area S1BF: H = 62, p < 0.01, Kruskal–Wallis; post hoc one-sided Wilcoxon's signed-rank test: T vs V: p < 0.01, T vs M: p = 0.02, M vs V: p = 0.02; area V2L: H = 10, p < 0.01, Kruskal–Wallis; post hoc one-sided Wilcoxon's signed-rank test: V vs T: p < 0.01, V vs M: p = 0.01, M vs T: p = 0.01). Possibly, also the firing rate of these S1BF and V2L cells could be modulated by stimulus modality, which could be a confound in interpreting these findings. To investigate this, we computed the average firing rate of each cell in both tactile-only and visual-only conditions (Fig. 9A3–B3). No significant differences in firing rate were detected between modalities for the phase-entrained cells (S1BF: H = 0.77, p = 0.68, Kruskal–Wallis; V2L: H = 2.68, p = 0.26, Kruskal–Wallis). As an additional control for the potential contribution of changes in firing rate to the modality dependency of phase entrainment, we recomputed the phase entrainment of S1BF and V2L cells while sampling the same number of spikes between modalities (i.e., the modal condition marked by the higher firing rate was downsampled). Even when correcting for modulations in firing rate in this manner, S1BF and V2L showed stronger phase entrainment in their associated modalities (data not shown; S1BF: H = 15, p < 0.01; V2L: H = 27, p < 0.01, Kruskal–Wallis).
A second potential confound being at stake is whether LFP theta power had higher amplitudes in either modality. The modality selectivity of phase entrainment could be due to the absence and presence of theta oscillations during the various stimulus presentations (i.e., phase entrainment increases when theta amplitude increases). To investigate this, we extracted theta oscillation amplitudes at the moment of cell spiking and compared the average spike-triggered theta LFP amplitude between the tactile-only and visual-only conditions for each cell (Fig. 9A4–B4). In neither area was the theta power significantly higher during any of the stimulus modalities (S1BF: H = 0.42, p = 0.81, Kruskal–Wallis; V2L: H = 0.81, p = 0.67, Kruskal–Wallis). When correlating the spike-triggered theta amplitude and the PPC, a significant correlation was found for V2L cells during the visual-only condition (Pearson’s correlation; r = 0.64; p < 0.01). No correlation between theta amplitude and PPC was found in the other conditions or in the other areas (Pearson’s correlation; p > 0.01). Taken together, these results show that modulations in theta amplitude do not explain transient phase-entrainment sensory cortical cells.
These results highlight that theta phase entrainment in S1BF and V2L largely conforms to unisensory processing, while some S1BF and V2L cells are phase entrained selectively during presentation of non-preferred modalities (S1BF: visual-only, V2: tactile-only) and some V2L cells are phase entrained only during multimodal conditions. PER cells are predominantly selective for a single modality, be it tactile or visual. Locking of dHC was mostly found to be nonselective for these modality conditions.
Discussion
In our object discrimination paradigm, animals rely on the processing of sensory information, memory recall during object detection, and decision mechanisms to choose a reward site. We recorded neural activity from areas S1BF, V2L, PER, and dHC, detected theta rhythm in all four recorded areas, and found that its power depended on the type of behavioral epoch, increasing during object approach, sampling, and reward site approach epochs across all areas (Fig. 3). Furthermore, we found a strong coherence between the theta-band LFPs recorded from sensory neocortical areas and dorsal hippocampus.
The key results of our study lie in the characterization of the temporal dynamics of phase entrainment of sensory neocortical cells by theta oscillations and its dependency on processing task-relevant features. Phase entrainment of S1BF, V2L, and PER cells was prominent during the behavioral epoch of sensory discrimination (Fig. 9). At the same time, it was not significant during the epochs of object approach, reward site approach, outcome, or ITI and reward delivery, which contrasts with previous findings on theta-band phase locking in rat orbitofrontal cortex (van Wingerden et al., 2010).
The role of theta synchronization in sensory processing is further underlined by the reported modality dependence of cell entrainment: cells were more strongly phase entrained during stimulus presentation in their preferred modality as compared with the nonassociated modality (Fig. 9). The selectivity of phase entrainment of neocortical cells for object sampling contrasts with the phase entrainment of dHC cells, which tessellated the full behavioral task (Fig. 7) and were nonselective for stimulus modality (Fig. 9).
The latter finding aligns with the idea of hippocampal theta as a general reference signal for episodic memory encoding during various behaviors such as locomotion, sensory discrimination, spatial choice, and reward. In other words, the hippocampus may be “open” for extended periods to process theta-phased inputs throughout behavioral epochs, in agreement with its general role in recording lifetime events in episodic memory (Eichenbaum, 2000; Pennartz et al., 2002; Robbe and Buzsáki, 2009; Battaglia et al., 2011). In contrast, S1BF and V2L cells were phase entrained during object sampling for epochs of only a few hundred milliseconds (Fig. 8) and predominantly for their preferred modality.
Neocortical theta oscillations cohere with hippocampal theta rhythm
The exact origin of theta oscillations in areas S1BF, V2L, and PER in our data remains unclear (cf. Schneider et al., 2021). Neocortical theta oscillations probably result from multiple mechanisms interacting, such as volume conduction from the hippocampus (Sirota et al., 2008; Vinck et al., 2016), polysynaptic projections from HPC cells (operating in association with the medial septum and entorhinal cortex; Kleinfeld et al., 2016) to these neocortical areas, or theta oscillations generated by these cortical areas themselves (Ma and Leung, 2018). Therefore the question of which region is the entraining oscillator for neocortical activity cannot be answered simply.
In the case of perfect volume conduction, the phase entrainment of neocortical cells should be equally strong when referring to either same-area or dHC LFP. We found this was not the case, as fewer cells were phase entrained on dHC LFP than those on same-area LFP (Fig. 5A2–C2). Furthermore, we found that most cells were selective for either same-area or dHC LFP, but rarely for both (Fig. 5A3–C3). These findings contrast with the hypothesis that theta recorded in S1BF and V2L would be purely volume conducted. The phase-locking preference of neocortical cells for same-area theta oscillations, over dHC-recorded LFP, may arise from the distance between our dHC and neocortical recording sites, in combination with phase differences of theta oscillations across the septotemporal hippocampal axis (Lubenov and Siapas, 2009; López-Madrona et al., 2020) or from hippocampus-independent theta generators in the neocortex and possibly thalamus (Ma and Leung, 2018). Finally, we found significant theta activity in neocortical areas during the object approach epoch, while no theta phase entrainment of spiking activity was found during this epoch. To better understand the origin of the oscillations reported in this work, future experiments can use high-density silicon probes to determine current source density profiles and thus obtain more insight into the laminar origin of the oscillations. Moreover, new inactivation experiments targeting the hippocampus and sensory neocortex are expected to reveal their causal roles in the genesis of neocortical theta rhythms. Thus, even though it is unknown which inputs exactly govern spiking behavior, theta oscillations are likely not generated purely locally during this behavioral epoch; more research is required to examine which factors govern firing behavior in these areas, besides local synaptic input in the theta band.
Elucidation of the origin of sensory neocortical theta activity must await further hippocampal inactivation experiments. Regardless of this origin, neocortical theta activity showed considerable coherence with hippocampal theta (Fig. 3A3–C3), and consequently it is warranted to discuss the observed phase locking of neocortical cells in relation to functions of hippocampal theta rhythm.
Phase entrainment in sensory neocortices is driven by modality-specific sensory input
Previously Grion et al. (2016) described theta phase entrainment in S1BF during a tactile discrimination task. While not recording PER or HC spikes, they found a transient increase of hippocampal theta power during whisking paired with increased phase entrainment of S1BF cells by hippocampal theta and increased synchrony between whisking oscillations and hippocampal theta. On a related note, a mechanism for sensorimotor integration, including bidirectional coupling between the hippocampus and S1BF, was proposed (Kleinfeld et al., 2016; cf. Dorman et al., 2023). In short, before engagement in tactile discrimination, hippocampal theta oscillations may pull whisking-induced oscillations into their theta frequency range. Tactile sensory information may thus be sent through the ascending cortical pathway to the hippocampus.
Our results build on the findings of Grion et al. and add several pieces to the puzzle of corticohippocampal network functioning. First, we assessed phase entrainment of S1BF cells in the absence of a tactile stimulus (i.e., during visual-only trials), while the animal was nonetheless engaged in whisking. In the presence of whisking but in the absence of a tactile stimulus, most S1BF cells were not phase entrained, even if they were significantly phase entrained in other conditions (Fig. 9). Thus, phase entrainment in S1BF (and also V2L) is at least partially driven by sensory input. In addition, a small pool of S1BF cells was phase entrained selectively during visual-only stimuli, demonstrating cross-modal interactions in the theta band (cf. Bieler et al., 2017). Second, in contrast to Grion et al., our task incorporated a significant spatial component, allowing the animal to move around the T-shaped track freely, which likely drove the increased theta power during the approach epoch. While theta power was high during object approach in all recorded areas, phase entrainment in S1BF and V2L remained largely absent (Fig. 8), indicating that an increase in theta power alone is not enough to enhance phase entrainment in these areas. Third, we report a dissociation between rate and phase coding in S1BF and V2L, which implies that phase entrainment in S1BF and V2L may serve a different function than rate coding. Fourth, different from Grion et al., we presented objects in two sensory modalities, which allowed us to establish that stimulus-induced phase entrainment in S1BF and V2L was largely modality specific. Our results substantiate how S1BF, V2L and PER population synchrony increases during the processing of sensory information. Somewhat surprisingly, theta phase entrainment of PER cells was relatively weak relative to S1BF and V2L, which is, however, in line with recent findings on single-neuron to population coupling in the same network during a visual discrimination task set on a figure-8 maze (Dorman et al., 2023).
The behavioral relevance of our findings warrants further research. While Grion et al. did report a positive correlation between behavioral performance and increased whisking theta oscillation synchrony, we were not able to find a behavioral correlate such as faster reaction times with increased phase entrainment (data not shown). This might be due to the relative long duration of our behavioral trials and the self-paced nature of our behavioral task.
Mechanisms of phase entrainment in sensory cortices
The synchronization of two connected systems by phase entrainment may enable and modulate communication, for instance, during memory read-in and read-out depending on sensory stimuli (for review, see Colgin, 2016; Lakatos et al., 2019). The phase entrainment we describe in sensory cortices thus might act as a neural mechanism which aligns incoming sensory input and cortical population activity, both locally and with respect to remote dynamics in the hippocampus.
Phase entrainment of sensory and perirhinal cortex activity occurs primarily during sensory acquisition, while the hippocampal theta rhythm is proposed to serve as a necessary background state for encoding episodic memories during various behaviors (Fig. 10). When the neocortex is not engaged in processing information, its population firing patterns are characterized by asynchronous firing, that is, without consistent phase relationship with theta oscillations. Following stimulus onset, population synchrony increases (as reported by the increase in phase entrainment), due to the rhythmicity of the sensory input, the network's intrinsic oscillatory mechanisms and/or top-down input, for instance, from the medial temporal lobe.
Interestingly, the presence of theta power was not a predictor for phase entrainment per se (Figs. 6, 9). This could, first, be explained by the fact that the recorded oscillations are partly generated outside the area of recording or second, by the finding that theta oscillations were mostly generated by subthreshold membrane potential activity (Noguchi et al., 2023), as theta oscillations could be present without generating spikes. Third, somatic inhibition can veto spiking yet contribute to extracellular current flow and LFP magnitude (Buzsáki et al., 2012). Similarly a single cell's firing rate (Figs. 7, 8) was not predictive of phase entrainment (O’Keefe and Burgess, 2005). These possibilities call for more research on the factors determining whether a cell is phase entrained or not.
The increased synchrony of neocortical cell firing may facilitate and drive information processing in higher sensory and mnemonic areas, possibly through synchronizing interconnected areas by aligning cycles of excitation and inhibition (Colgin, 2016). The phase entrainment in sensory cortices was characterized by its brief duration of only a few theta cycles. Such short-lived phase entrainment nearly contradicts (Lakatos et al., 2019), who define entrainment as requiring rhythmicity. However, evidence has been raised for phase coding based on nonrhythmic, irregular, oscillations (Eliav et al., 2018), and the phase entrainment reported here could thus be explained as spike synchrony relative to the synaptic population activity, rather than depending on sustained oscillations. Following the offset of sensory input, the sensory regions return to an asynchronous state, whereas the hippocampus continues to linger in a theta phase-entrained state, as other event information may need to be processed at any time point in the task.
Data Availability
The experimental data used in this study is registered as a dataset in the EBRAINS database. A detailed description of the experiment and the included data are published in the EBRAINS knowledge graph: https://doi.org/10.25493/AM91-2D. The code for analyzing these data and generating the figures is publicly available: https://gitlab.com/Truikes/vita-lfp.
Footnotes
The authors declare no competing financial interests.
We thank Jeroen Bos, Gerjan Huis in ‘t Veld, and Laura Mourik Donga for their technical support for surgeries and hyperdrive assembly. We additionally thank Charlotte Oomen and Carien Lansink for their input and advice on the behavioral paradigm and training procedure and Pietro Marchesi for advice on analytical methods. We are grateful for the support for microdrives and behavioral equipment provided by the Technology Center of the University of Amsterdam, in particular by Sven Koot, Tristan van Klingeren, Gerrit Hardeman, Tjeerd Weijers, Udo van Hes, and Alix Wattjes. We thank Rhys Buckton and Maria Duque Quintero for their assistance in animal training. Finally, we thank Ingrid Reiten and Jan Bjaalie for the registration of our histological data into the 3D rat brain atlas and for curating our dataset in the EBRAINS database.
This project has received funding from the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreement No. 945539 (Human Brain Project SGA3) to C.M.A.P.
↵*T.R.R. and J.F. contributed equally to this work.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.