Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro
eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleResearch Article: New Research, Sensory and Motor Systems

Dissecting Mismatch Negativity: Early and Late Subcomponents for Detecting Deviants in Local and Global Sequence Regularities

Yiyuan Teresa Huang, Chien-Te Wu, Shinsuke Koike and Zenas C. Chao
eNeuro 3 May 2024, 11 (5) ENEURO.0050-24.2024; https://doi.org/10.1523/ENEURO.0050-24.2024
Yiyuan Teresa Huang
1International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Tokyo 113-0033, Japan
2School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei 100, Taiwan
3Department of Multidisciplinary Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chien-Te Wu
1International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Tokyo 113-0033, Japan
2School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei 100, Taiwan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shinsuke Koike
1International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Tokyo 113-0033, Japan
3Department of Multidisciplinary Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
4University of Tokyo Institute for Diversity & Adaptation of Human Mind (UTIDAHM), Tokyo 113-0033, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Zenas C. Chao
1International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Tokyo 113-0033, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Abstract

Mismatch negativity (MMN) is commonly recognized as a neural signal of prediction error evoked by deviants from the expected patterns of sensory input. Studies show that MMN diminishes when sequence patterns become more predictable over a longer timescale. This implies that MMN is composed of multiple subcomponents, each responding to different levels of temporal regularities. To probe the hypothesized subcomponents in MMN, we record human electroencephalography during an auditory local–global oddball paradigm where the tone-to-tone transition probability (local regularity) and the overall sequence probability (global regularity) are manipulated to control temporal predictabilities at two hierarchical levels. We find that the size of MMN is correlated with both probabilities and the spatiotemporal structure of MMN can be decomposed into two distinct subcomponents. Both subcomponents appear as negative waveforms, with one peaking early in the central-frontal area and the other late in a more frontal area. With a quantitative predictive coding model, we map the early and late subcomponents to the prediction errors that are tied to local and global regularities, respectively. Our study highlights the hierarchical complexity of MMN and offers an experimental and analytical platform for developing a multitiered neural marker applicable in clinical settings.

  • EEG
  • hierarchy
  • mismatch negativity
  • predictive coding
  • subcomponents

Significance Statement

Our study provides new insights into the intricate architecture of mismatch negativity (MMN), a key neural indicator for deviant detection. Using a refined oddball paradigm with dual-level temporal controls, we identified two unique MMN subcomponents, each linked to prediction errors at different brain hierarchies. This work establishes a practical platform for a multitiered neural marker, offering clinical applications for assessing brain function across various hierarchies.

Introduction

Mismatch negativity (MMN) is measured by contrasting the event-related potential (ERP) evoked by an infrequent sensory event (referred to as a deviant stimulus) to the ERP from a frequent event (a standard stimulus). This contrast shows a negative waveform peaking at 100–250 ms after event onset and over the frontal-central brain area (Sams et al., 1984; Näätänen et al., 2007). The MMN has been recognized as a key biomarker to evaluate the capacity for statistical learning and deviance detection in various disorders, such as autism spectrum disorder (ASD; Dunn et al., 2008), schizophrenia (Fisher et al., 2012; Erickson et al., 2016; Koshiyama et al., 2020), and developmental delay (Kujala and Näätänen, 2001).

Theoretically, MMN has been interpreted as a form of prediction error within the framework of hierarchical predictive coding theory (Wacongne et al., 2012). The theory postulates that bidirectional cascades operate across the hierarchy to minimize prediction errors: top-down predictions for incoming inputs are generated by learning statistical regularities, and bottom-up prediction errors are generated to refine the predictions when discrepancies between expected and actual sensory inputs occur (Friston, 2010; Bastos et al., 2012; Clark, 2013). The coding has been suggested as an underlying mechanism of the generation of MMN, which has been tested in different oddball paradigms (Garrido et al., 2007, 2008). Furthermore, as the prediction error indirectly reflects the prediction established based on the statistical regularity of stimuli, many studies have also reported that a less frequent stimulus, thus with less predictability, evokes a larger MMN (Javitt et al., 1998; Sato et al., 2000; Sabri and Campbell, 2001).

MMN is also modulated by statistical regularities at a longer timescale. This is evident in a local–global oddball paradigm (Bekinschtein et al., 2009), in which stimulus sequences are manipulated to control temporal regularities at two hierarchical levels: tone-to-tone transitions (local level) and multitone sequence structure (global level). The MMN amplitude triggered by the local deviant is notably smaller as the deviant becomes predictable at the global level (Wacongne et al., 2011), and this reduction is absent in ASD and schizophrenia (Sauer et al., 2017; Goris et al., 2018). These results suggest that different degrees of local and global prediction errors are generated and superimposed in MMN. In the current study, we aim to clearly dissociate the superimposed MMN into subcomponents showing distinct neural signatures. Specifically, we hypothesize that there are two MMN subcomponents, each responding to the local and global levels of temporal statistics and thus representing the local and global prediction-error signals.

We record human electroencephalography (EEG) during an auditory local–global paradigm where stimulus occurrences of the deviant are manipulated at the local and global level to create varying degrees of prediction error in four sequence blocks. To disentangle MMN subcomponents from ERP data, we used an unbiased data-driven decomposition method that was previously applied to isolate prediction-error signals in high-gamma frequency bands (Chao et al., 2018). Two subcomponents are extracted from the data, each exhibiting features of MMN. The first subcomponent is characterized by a negative waveform peaking at 136 ms in the central-frontal area of the brain, and the second subcomponent is also a negative waveform peaking later at 200 ms in a more frontal area. Furthermore, their activation patterns across different sequence blocks are aligned with the predictions from a hierarchical predictive coding model (Chao et al., 2022), suggesting that the early and late MMN subcomponents represent the local and global prediction-error signals, respectively. Our findings reveal the composite nature of MMN and suggest that breaking down MMN into distinct subcomponents may provide a more complete biomarker for pathological conditions associated with unusual prediction mechanisms.

Materials and Methods

Participants

Thirty participants were recruited in this study (15 males and 15 females; age, 24 ± 2.6 years old; mean ± standard deviation). The participants self-reported and were screened to have no participation in drug studies and no history of neurological and psychological conditions. This study was approved by the Research Ethics Committee of the National Taiwan University Hospital (201906081 RINA), and all participants gave written informed consent after understanding experimental procedures and before the experiment.

Local–global oddball paradigm with controlled predictabilities

We implemented an auditory local–global oddball paradigm with four distinct sequence blocks (Fig. 1). Each block was formed by varying stimulus occurrences for a unique combination of local and global regularities. Two tones were created by combining three sinusoidal waves of base frequencies: the low-pitched tone with 350, 700, and 1,400 Hz and the high-pitched tone with 500, 1,000, and 1,500 Hz. Each tone had a 100 ms duration with a 7 ms rise and fall. Each tone sequence consisted of three tones consecutively delivered with a 200 ms stimulus onset asynchrony (SOA). The intertrial interval (ITI) between the end of one sequence's last tone and the start of the next sequence's first tone was randomly set to a value between 1,000 and 1,400 ms, in 50 ms increments.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Task design. The configuration of sequence types, number of trials, transition probabilities, and sequence probabilities in four blocks. The tone icons colored in black and red represent tone x and y in different pitches, while the tone icon with the dashed outline represents omission (“o”, no tone delivered). The stimulus onset asynchrony (denoted as SOA) represents time interval between the onsets of a tone and the next. The inter-trial interval (ITI) represents time interval between the offset of one sequence's last tone and the onset of the next sequence's first tone. The probabilities were rounded to two decimal places.

For each block, there were three types of tone sequence: xxx, representing three tones with the same tone pitch; xxy, representing that the last tone differed from the preceding tones; and xxo, representing that the last tone was omitted. The numbers of trials for the sequences xxx, xxy, and xxo were set to be 96:24:24 for Block 1, 120:12:12 for Block 2, 24:96:24 for Block 3, and 12:120:12 for Block 4. Each block was sectioned into four phases in which the proportion of sequence types was maintained (e.g., in Block 1, every phase contained 24 trials of xxx, 6 trials of xxy, and 6 trials of xxo). Within each phase, the order of the 36 sequences was random. The four blocks were delivered twice: once with the low-pitched tone as tone x and the high-pitched tone as tone y and once with the high-pitched tone as tone x and the low-pitched tone as tone y. The order of eight block presentations was random.

With different sequence ratios, a unique configuration of local and global regularities was created for predictions of the last tone x or y. On the one hand, the local regularity was established by the tone-to-tone transition probability (TP), i.e., the conditional probability of an incoming tone given the previous tone. TP in each block contained three values, TP(x|x), TP(y|x), and TP(o|x), which represent the conditional probabilities of tones x, y, and o occurring, given the previous tone x, within a single block. In sequence xxx, there are two transitions from x to x and one transition from x to o. In sequence xxy, there is one transition from x to x and one transition from x to y. In sequence xxo, there is one transition from x to x and one transition from x to o. Note that the x-to-o transition was considered at the end of the tone sequence xxx, since the sequence ending cannot be known by stimulus transitions but rather by sequence structure (Chao et al., 2022). Then, based on trial numbers of the sequence types, we calculated all instances of the three transition types within one block and the corresponding transition probabilities.

On the other hand, the global regularity was established by the multitone sequence probability (SP). SP in each block contained three values, SP(xxx), SP(xxy), and SP(xxo), which represent the probabilities of sequences xxx, xxy, and xxo within a single block. The four blocks featured distinct combinations of TP and SP (Fig. 1), which allowed us to control the predictability of sensory stimuli and examine varying degrees of prediction error at both local and global levels.

During the experiment, participants were instructed to pay attention to the sound while visually fixating at a central fixation (white cross on a gray background), and no behavioral response was required. A task-irrelevant video was displayed during the break between two block presentations to minimize the influence of the learned regularities in a run being carried over to the next run. All stimulus presentations were programmed with MATLAB-based Psychtoolbox (Pelli, 1997; Kleiner et al., 2007) and presented with a monitor (resolution, 1,920*1,220 pixels; sampling rate, 60 Hz) and a pair of desktop speakers (∼60 dB).

EEG recording and analysis

Raw EEG data were recorded with a 64-channel QuickCap (Compumedics NeuroScan). During recording, the data were referenced to a reference electrode near Cz, and impedances were kept to <2 kΩ for two mastoid electrodes and 5 kΩ for the remaining electrodes. We set a bandpass filter of 0.01 to 100 Hz and a 500 Hz sampling rate.

We used MATLAB-based EEGLAB (Delorme and Makeig, 2004) to preprocess the data from each participant. First, we merged the data from all runs and rereferenced it to the average of two mastoid electrodes to eliminate systematic noise from the environment (functions: pop_mergeset.m; pop_reref.m). Second, epochs of sequences were extracted from 1.2 s before the first tone to 1.9 s after the last tone (pop_epoch.m). Third, bad epochs containing excessive fluctuations or high-frequency noise were manually removed. On average, ∼2% of the total epochs in each participant were removed at this step. Fourth, we used the ADJUST toolbox to remove eye and muscular artifacts (Mognon et al., 2011; pop_runica.m, interface_ADJ.m, pop_subcomp.m). Finally, the processed epochs were corrected with the baseline estimated within the time window of −300 to −100 ms relative to the onset of the first tone (pop_rmbase.m) and downsampled to 250 Hz.

For each channel, block, and participant, we then averaged the signals across trials to obtain the ERP for each sequence type (i.e., xxx, xxy, and xxo); see examples in Figure 2A. To measure the MMN in the local–global oddball paradigm, we compared ERPs from sequence xxy (local deviant) to ERPs from sequence xxx (local standard) (xxy – xxx) for each channel, block, and participant. Note that the contrast was done in all conditions including frequent xxy as the global standard, consistent with previous studies evaluating varied MMN evoked by the local violation and influenced by different degrees of global expectations (Wacongne et al., 2011; Sauer et al., 2017; Goris et al., 2018). Examples of the contrast responses from −200 to 800 ms relative to the onset of the last tone in the four blocks are shown in Figure 2B.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

ERPs and deviant responses from contrasts. A, The group mean ERPs of sequences xxx, xxy, and xxo in Block 1 at channel Cz. Solid vertical lines represent onsets of the stimuli of a sequence. Time zero was set to be the onset of the last stimulus. B, Contrast responses obtained by contrasting ERPs of sequences xxy and xxx in 4 blocks. The gray shade represents the time range of the MMN. C and D, Peak amplitudes and latencies of the MMN at Cz in 4 blocks. The block order was sorted from low to high probability in terms of TP(y|x) and SP(xxy). The 30 dots in each block correspond to participants. The box plot represents the median (red horizontal line), quartiles (the bottom and top edges), 95% confidence interval (notches), and the maximum and minimum (black horizontal lines). Results from repeated measures ANOVA and Spearman's rank correlation analyses are shown above the plots. The red line represents a significant difference between two blocks (*<0.05; ***<0.001).

We also estimated the omission error by contrasting ERPs between sequence xxo and xxx (xxo – xxx), where positive responses were found ∼100–200 ms (Extended Data Fig. 2-1). However, this contrast response contains the sensory response to the last tone in sequence xxx and so does not accurately reflect the true prediction error. Consequently, we did not include the omission trials for further analysis.

Figure 2-1

Download Figure 2-1, TIF file.

Analysis of the MMN amplitude and latency

We examined effects of the TP (local regularity) and SP (global regularity) on MMN amplitudes and latencies across four blocks using repeated measures ANOVA and Spearman's rank correlation coefficient. To select peak amplitudes and latencies of the MMN for each block and participant, we focused on the minimum of the contrast responses (xxy – xxx) at Cz in the range of 120–250 ms after the onset of the last tone (Näätänen et al., 2007). Firstly, we implement a repeated measures ANOVA to measure differences in MMN among four blocks, each with unique TP and SP (functions: fitrm.m and ranova.m). Also, pairwise comparisons with Bonferroni’s correction were implemented for post hoc tests (multcompare.m). Secondly, as Sato et al. (2000) showed that MMN amplitudes become larger with fewer occurrences of novel stimuli, we performed Spearman's rank correlation between the MMN and the probability ranked based on TP(y|x) and SP(xxy).

When a stimulus is presented repeatedly, the evoked neural activity is reduced, an effect known as stimulus-specific adaptation. In our study, three identical tones were consecutively presented (xxx) or the last was replaced with a different tone (xxy). As shown in a previous study (Hofmann-Shen et al., 2020), when comparing the two sequence types, the N1 as an index of the adaptation effect was observed with positive polarity between 80 and 120 ms. We further estimated effects of N1 in the blocks (Extended Data Fig. 2-2). The results of pairwise comparisons showed the size of N1 was not correlated with TP and SP, and this indicates that MMN, but not N1, is varied according to two-level probability manipulations.

Figure 2-2

Download Figure 2-2, TIF file.

Figure 2-3

Download Figure 2-3, DOCX file.

Parallel factor analysis

In our study, the total responses can be pooled in a three-dimensional tensor (62 channels × 250 time points × 4 blocks), which provides a comprehensive description of the spatiotemporal dynamics under different TP and SP combinations. To extract the subcomponents in the tensor that likely overlap in space and time, we used parallel factor analysis (PARAFAC; Harshman and Lundy, 1994) to decompose the data. This approach yields subcomponents within the three corresponding dimensions (Channel × Timecourse × Block), enabling us to quantify their spatiotemporal signatures and distinct activations across blocks.

We resampled the contrast responses of the 30 participants 100 times with the bootstrap method (functions, datasample.m). For each resampling, a new tensor was generated from the averaged contrast responses across the resampled participants. For each tensor, we performed PARAFAC analysis using the N-way toolbox (Andersson and Bro, 2000; function, parafac.m) to decompose the tensor into multiple subcomponents ranging from 1 to 8, leading to a total of 800 (100*8) decompositions. The convergence criterion was set to be 1 × 10−6, and the three dimensions were not constrained in terms of orthogonality or positivity. The optimal number of subcomponents was determined based on the Core Consistency Diagnostic (CORCONDIA; Harshman and Lundy, 1994; Bro and Kiers, 2003). Decomposition with low consistency indicates a poor appropriateness, where high interactions exist between subcomponents. A CORCONDIA of 80–90% is considered a good decomposition and below 50% considered a problematic decomposition (Bro and Kiers, 2003; Pouryazdian et al., 2016).

As the optimal number was two (see details in Results), for all 100 resampled tensors, two subcomponents were extracted and described by activation values (i.e., the score or loading matrix) in their original dimensions of Channel, Timecourse, and Block. The Block dimension represents, for example, how much the contrast response of Block 1 (Block) is contributed by Subcomponent 1 composed of spatial and temporal activation values (Channel and Timecourse). Thus, one subcomponent has four values in the Block dimension, meaning different degrees of contributions from that subcomponent to the contrast responses of the four blocks.

We observed two extracted subcomponents, one with an early negative peak and the other with a late negative peak consistently across 100 resamples. Then we assigned the former one as Subcomponent 1 and the latter one as Subcomponent 2. Examples of subcomponents from 10 resamples are shown in Extended Data Figure 3-1. For Subcomponents 1 and 2, we estimated whether the activation values were significantly different from zero across 100 decompositions in the dimensions of Channel and Timecourse (function, ft_timelockstatistics.mat). Note that values in the dimension of Channel were estimated with cluster corrections. Then, average activations are shown with nonsignificant values replaced by zero.

Figure 3-1

Download Figure 3-1, TIF file.

Functional roles of subcomponents identified by a predictive coding model

To verify the functional roles of the identified subcomponents, we used a predictive coding model which can estimate the size of prediction errors during the local–global oddball paradigm (Chao et al., 2022). The model quantitatively describes prediction and prediction-error signals at two hierarchical levels and has been found to explain predictive coding signaling during the local–global oddball paradigm better than other alternative models.

Using the formula and code provided by Chao et al. (2022), we calculated the theoretical quantities for both the local (or first-level) prediction-error signal and the global (or second-level) prediction-error signal within the deviant-minus-standard difference responses. The local prediction-error signals across the four blocks were represented by four model values (PE1), while the global prediction-error signals across the same blocks were characterized by four model values (PE2).

Our next step was to examine whether Subcomponents 1 and 2 identified from the data were associated with the local and global prediction-error signals described by the model. To achieve this, we reconstructed deviant responses from Subcomponents 1 and 2 with the model values PE1 and PE2 and compared them to the actual responses. For each of the 100 decompositions, the response at time t for channel i and block j was reconstructed (denoted as PE1_PE2) as follows:PE1_PE2(i,j,t)=Channel1(i)*PE1(j)*Timecourse1(t)+Channel2(i)*PE2(j)*Timecourse2(t), (1)where Channel1 represents the 62 activation values in the Channel dimension for Subcomponent 1 and Timecourse1 represents the 250 activation values in the Timecourse dimension for Subcomponent 1. Similarly, Channel2 and Timecourse2 represent those values for Subcomponent 2.

We also tested the reconstruction with different associations: Subcomponent 1 as PE2 and Subcomponent 2 as PE1 (denoted as PE2_PE1), Subcomponents 1 and 2 both as PE1 (denoted as PE1_PE1), and Subcomponents 1 and 2 both as PE2 (denoted as PE2_PE2).PE2_PE1(i,j,t)=Channel1(i)*PE2(j)*Timecourse1(t)+Channel2(i)*PE1(j)*Timecourse2(t), (2)PE1_PE1(i,j,t)=Channel1(i)*PE1(j)*Timecourse1(t)+Channel2(i)*PE1(j)*Timecourse2(t), (3)PE2_PE2(i,j,t)=Channel1(i)*PE2(j)*Timecourse1(t)+Channel2(i)*PE2(j)*Timecourse2(t). (4)We then calculated the mean squared difference (MSD) between reconstructed responses and the actual response, Act, for each channel i and block j. Below, we use the reconstruction of PE1_PE2 as an example:MSD(i,j)=1250∑t=1250(PE1PE2(i,j,t)−Act(i,j,t))2. (5)We also measured Pearson’s correlation coefficients of the reconstructed and actual time courses (t = 1:250) for each channel i and block j:R(i,j)=corr(PE1PE2(i,j,1:250),Act(i,j,1:250)). (6)We averaged MSD and R across 62 channels and four blocks for each of the 100 decompositions and each of four associations. To evaluate which association produced better reconstruction, average MSD and R were tested across the four associations using a repeated measures ANOVA and pairwise comparisons with Bonferroni’s correction.

Results

MMN varies across blocks

Figure 2A shows an example of ERPs, averaged across participants, from the channel Cz in Block 1. For each block, we extracted MMN by contrasting ERPs between sequences xxy and xxx (xxy – xxx). Figure 2B shows the average contrast responses from the four blocks, sorted from low TP(y|x) and SP(xxy) to high TP(y|x) and SP(xxy). To examine whether and how the local and global regularities (i.e., TP and SP) affect the peak amplitude and latency of MMN, we performed a repeated measures ANOVA across all blocks and participants. First, we found significant differences in the peak amplitude among the blocks (F(3,87) = 30.7; p < 0.001; n = 30 participants; repeated measures ANOVA; Fig. 2C). In post hoc tests (Extended Data Fig. 2-3), the amplitude in Block 1 with fewer occurrences of the deviant y was significantly more negative than that in Block 3 (p < 0.001; n = 30 participants; pairwise comparison; Bonferroni’s correction; Fig. 2C, red horizontal line) and Block 4 (p < 0.001), both of which consisted of more occurrences of the deviant. Similarly, the amplitude in Block 2 was significantly more negative than that in Block 3 (p < 0.001) and Block 4 (p < 0.001). There were nonsignificant differences in the rest comparisons between Blocks 1 and 2 and between 3 and 4. Although we did not find the significance when comparing two blocks with close deviant probabilities (e.g., TP of the deviant is 0.03 in Block 2 and 0.06 in Block 1), a trend of the increased TP and SP with the smaller amplitude of the negative peak was observed and further tested (Spearman's rank correlation coefficient: 0.52, p < 0.001; n = 30 participants).

Second, we also found significant differences in the MMN peak latency among the blocks (F(3,87) = 9.4; p < 0.001; n = 30 participants; repeated measures ANOVA; Fig. 2D). In post hoc tests (Extended Data Fig. 2-3), specifically, the latency in Block 2 was significantly greater than that in Block 1 (p = 0.01; n = 30 participants; pairwise comparison; Bonferroni’s correction), Block 3 (p < 0.001), and Block 4 (p = 0.01). There were nonsignificant differences in the rest comparisons between each two of Blocks 1, 3, and 4. A negative rank correlation between peak latency and probability was found (Spearman's rank correlation coefficient, −0.45; p < 0.001; n = 30 participants). Altogether, the contrast responses appeared differently across the blocks; in particular, MMN peaked more negatively and later as TP(y|x) and SP(xxy) became smaller.

Extracting MMN subcomponents

Upon confirming that the MMN is modulated by TP and SP, we further tested if the MMN contains two overlapping subcomponents that respectively encode TP and SP. We resampled the total data of contrast responses 100 times and used PARAFAC to decompose each of the 100 resampled datasets. Each resampled dataset was decomposed into a designated number of subcomponents, ranging from 1 to 8. The CORCONDIA was quantified to evaluate the consistency of the decomposition. We found an abrupt drop in CORCONDIA, from nearly 100 to 45%, when the decomposition changed from two subcomponents to three subcomponents (p < 0.001; n = 100 resamples; pairwise comparison; Extended Data Fig. 3-2). This indicates that the optimal number of subcomponents in the deviant responses was two.

Figure 3-2

Download Figure 3-2, TIF file.

The two subcomponents, Subcomponents 1 and 2, were identified across resampled datasets and visualized in the dimensions of space (Channel), time (Timecourse), and function (Block; Fig. 3). Spatially, Subcomponent 1 was found in the central-frontal area, while Subcomponent 2 was found in a more frontal area (Fig. 3A). Temporally, both components showed negative waveforms after the last tone but with different latencies (Fig. 3B). Subcomponent 1 peaked negatively at 136 ms, followed by a positive response during 200–400 ms. Subcomponent 2 peaked negatively at 200 ms, followed by a long-lasting negative response. Functionally, the two subcomponents had different activation values in the Block dimension (Fig. 3C, blue line). For Subcomponent 1, the different positive values in the Block dimension indicates that the spatiotemporal response pattern described by the Channel and Timecourse dimensions appeared in the contrast responses of the four blocks, each with different positive contributions. Interestingly, the contributions of Subcomponent 2 in Blocks 3 and 4, where sequence xxy was frequent, were negative. This indicates that Subcomponent 2 contributed to the total MMN in an opposite manner when the local deviant became the global standard.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

Neural signatures of prediction-error signals. We resampled 100 datasets, each containing contrast responses of 4 blocks, and then performed PARAFAC decomposition for each dataset. For the two extracted subcomponents, activations are shown in (A) Channel, (B) Timecourse, and (C) Block dimensions. The Block dimension represents activation of the subcomponents to the responses (the blue line). The error bar represents the standard deviation. The model values of the local and global prediction errors are denoted as PE1 and PE2 (the green line).

Functional roles of the MMN subcomponents

To identify functional roles of Subcomponents 1 and 2, we compared their contributions (Fig. 3C, blue line) across the four blocks (Block1 and Block2) to the theoretical values of the local and global prediction errors (PE1 and PE2) obtained from a hierarchical predictive coding model (Fig. 3C, green line). Based on their spatiotemporal characteristics, we hypothesized that Subcomponent 1, occurring earlier in a less frontal area, represented the local prediction error. On the other hand, Subcomponent 2, occurring later in a more frontal area, was hypothesized to represent the global prediction error. To validate the hypothesis, we reconstructed signals with the model values and compared them to actual contrast responses. Figure 4A shows examples of reconstructed time courses at channel Cz in Block 1. In this example, the reconstruction PE1_PE2, in which Subcomponent 1 is associated with PE1 and Subcomponent 2 with PE2, was very similar to the actual response. Among the four associations, the mean square error evaluated with the association of PE1_PE2 was significantly smaller than the other three (all p < 0.001; n = 100 resamples; pairwise comparisons; Bonferroni’s correction; Fig. 4B). Similarly, the correlation coefficient evaluated with the association of PE1_PE2 was significantly higher than the other three (all p < 0.001; Fig. 4C). These results suggest that Subcomponents 1 and 2 represented the local and global prediction-error signals, respectively.

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Reconstruction with functional assignments and their differences from the actual ERP. A, Examples of reconstructed signals at Cz in Block 1. The black line represents the actual contrast response. The blue line represents the reconstructed response with Subcomponent 1 as PE1 and Subcomponent 2 as PE2. The orange line represents the reconstructed response with Subcomponent 1 as PE2 and Subcomponent 2 as PE1. The yellow line represents the reconstructed response with Subcomponent 1 and 2 both as PE1. The purple line represents the reconstructed response with Subcomponent 1 and 2 both as PE2. Mean squared differences in panel B and Pearson correlation coefficients in panel C were calculated then averaged across channels and blocks. The 100 dots for each assignment represent resamples. Results from repeated measures ANOVA and pairwise comparisons are shown above the plots.

Discussion

In the current study, we combined an extended auditory local–global oddball paradigm with a data-driven decomposition analysis to extract subcomponents in MMN. We further demonstrated that the early MMN subcomponent in the central-frontal area represents the local prediction-error signal while the late MMN subcomponent in the more frontal area represents the global prediction-error signal. Our study applies the predictive coding framework to reveal the complex nature of MMN and establishes a robust experimental and analytical platform that can examine the functionality of multilevel deviant detection in both healthy and affected brains.

Hierarchical MMN in the predictive coding framework

We revealed that the MMN of the deviant response varied with the hierarchical probabilities of stimulus presentation and notably consisted of two MMN subcomponents with distinct spatial and temporal characteristics obtained from an unbiased decomposition.

The two subcomponents spatially comply with the results of the hierarchical network which is a three-level hierarchy of cortical cascades in response to auditory changes. The network consists of bilateral primary auditory cortices receiving inputs, superior temporal gyrus functioning as a memory trace of priors, and right inferior frontal gyrus modulating the attention allocation (Giard et al., 1990; Opitz et al., 2002; Doeller et al., 2003; Halgren et al., 2011; MacLean and Ward, 2014; Hofmann-Shen et al., 2020). It has also been applied to model data obtained in different types of oddball paradigms (Garrido et al., 2007, 2008). In our findings, the early MMN was activated in the frontal-central area, which is consistent with previous literature and may reflect dipole sources in the superior temporal area (Alho et al., 1998; Molholm et al., 2005; Schönwiesner et al., 2007). On the other hand, the late MMN was activated in the more frontal area and slightly dispersed toward the right side, which may be mapped to an anterior portion of the brain such as the right prefrontal cortex. Although EEG signals are limited in spatial resolution, the temporal patterns of the EEG signals may provide some hints regarding potential sources. Specifically, we found that the early subcomponent (i.e., Subcomponent 1) of MMN at the central-frontal area was elicited earlier and was followed by the late subcomponent (Subcomponent 2) of MMN in the frontal area, which is consistent with differences in the peak latency found in the temporal and frontal generations of MMN (Rinne et al., 2000) and bottom-up gamma oscillations (Chao et al., 2022). Furthermore, in alignment with the hierarchical predictive coding theory, the observed temporal pattern supported feedforward error propagating between low and high cortical hierarchy, and activations of MMN subcomponents closely resembled the quantitative definition of local and global prediction-error signals (Chao et al., 2022).

However, the findings and interpretation of the early and late subcomponents of MMN appear to deviate from previous studies of the local–global oddball paradigm. It has been claimed that the MMN represents the local prediction-error signal while the P300 represents the global prediction-error signal (Wacongne et al., 2011). This stands in our contrast with our finding of MMN encoding both local and global prediction errors. We argue that this discrepancy results from the difficulty of dissociating hierarchical prediction-error signals. P300 in previous literature was typically captured by contrasting responses to the global deviant and the global standard. In fact, the local prediction-error signal cannot be fully controlled in this contrast. For example, two tones are both defined as the global deviant, while one is also defined as the local deviant and the other is defined as the local standard. When responses of theses tones are combined, different transition probabilities for the tones lead to different local predictability. Therefore, the local prediction is difficult to fully cancel out and still plays a part in the contrast product. To separate subcomponents from ERP contrasts, our approach adopted a decomposition approach. The two extracted subcomponents with quantitative values of hierarchical prediction errors then led to reconstructions similar to the actual responses. The reduction of the MMN when the deviant became more predictable on the global level (i.e., Block 3 and Block 4) can also be explained by Subcomponent 2 with negative contributions in these two blocks. Moreover, in Subcomponent 2, the late MMN peaking at ∼200 ms shared similarities with the N2b component, a negative ERP in the N2 family occurring ∼200–300 ms and after MMN (also known as N2a; Näätänen and Picton, 1986; Pritchard et al., 1991; Folstein and Van Petten, 2008). It has been studied in domains of attention, response inhibition, and cognitive control, where participants are required to pay active attention or make behavioral responses (Kasai et al., 1999; Barry and De Blasio, 2015). This relatively late-stage process coincides with cognitive demands for attentional control and awareness in learning the global regularity in the local–global task (Bekinschtein et al., 2009).

Limitations and future research

Although our findings provide a more thorough understanding of hierarchical MMN, there are still some limitations to be resolved and advances to be achieved in future research: hierarchical structure composed of more than two levels, prediction updates over time, and neuronal mechanism of learning statistical regularities at two levels.

Firstly, the functional hierarchy underlying MMN is not limited to two levels, as the prediction error likely does not propagate only across two levels of the hierarchical scheme. For example, the encoding of natural images was modeled from the predictive coding perspective, and multiple layers were established to simulate visual cortical processing (Rao and Ballard, 1999; Spratling, 2017). Temporal sequences can consist of not only single-stimulus transition rules but also arbitrary regularities (Dehaene et al., 2015). In order to extract multilevel prediction-error signals in a real sequential environment, probability calculations for different regularities and quantitative values of the prediction error at each level should be determined.

Secondly, the prediction error is generated to update the precision of the prediction; that is, a prediction update follows a prediction error evoked by sensory input. In this study, we observed that the global prediction-error signals had a long-lasting negative waveform after MMN, which might be related to desynchronized beta oscillations as prediction updates (Bastos et al., 2012; Chao et al., 2018). However, the current method is incapable of dissociating the prediction update and the prediction error because of their interdependence; that is, how much the prediction error is generated leads to how much the prediction will be updated. Also, the signal quality in human EEG makes tracking the dynamics of prediction updates during learning challenging. Future research could incorporate analytical techniques with a focus on temporal order to dissociate the prediction update from the prediction error and trial-by-trial analysis to capture the dynamics.

Thirdly, for the oddball paradigm with complex sequence structure, the underlying mechanism of how multiple regularities interact is computed at the neuronal level in the proposed quantitative predictive coding model. The links between the neuronal computations of the regularities and the MMN subcomponents are critically needed for further investigation. We should consider a combination of a biologically realistic model and a microscopical approach such as calcium imaging and high-density neural recording in the future.

Data Availability

We shared raw EEG data, processed EEG data, and a script for model values of the quantitative predictive coding model on Open Science Framework. Huang, Y. T., Wu, C.-T., Koike, S., & Chao, Z. C. (2024). Dissecting Mismatch Negativity: Early and Late Subcomponents for Detecting Deviants in Local and Global Sequence Regularities. https://doi.org/10.17605/OSF.IO/8YFT4. The raw data featured in this work was previously published in Chao et al. (2022). In the previous work, we constructed a theoretical model specifically designed to quantify hierarchical prediction and prediction-error signals, without placing an emphasis on ERP and MMN.

Footnotes

  • The authors declare no competing financial interests.

  • We thank Hsuan-Chi Liu, Chin-Kun Fu, and Shih-Yao Mao for helping with participant recruitment and experiment preparation. We also thank Felix B. Kern and Amit Yaron for proofreading. This study was supported by Japan Society for the Promotion of Science, Japan (to Y.T.H), Ministry of Science and Technology of Taiwan, Taiwan (MOST 106-2420-H-002-008-MY2 and MOST 109-2410-H-002-106-MY3; to C.-T.W), and World Premier International Research Center Initiative (WPI), MEXT, Japan (to Z.C.C.).

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.

References

  1. ↵
    1. Alho K,
    2. Winkler I,
    3. Escera C,
    4. Huotilainen M,
    5. Virtanen J,
    6. Jääskeläinen IP,
    7. Pekkonen E,
    8. Ilmoniemi RJ
    (1998) Processing of novel sounds and frequency changes in the human auditory cortex: magnetoencephalographic recordings. Psychophysiology 35:211–224.
    OpenUrlCrossRefPubMed
  2. ↵
    1. Andersson CA,
    2. Bro R
    (2000) The N-way toolbox for MATLAB. Chemometr Intell Lab Syst 52:42–58. https://doi.org/10.1016/S0169-7439(00)00071-X
    OpenUrl
  3. ↵
    1. Barry RJ,
    2. De Blasio FM
    (2015) Performance and ERP components in the equiprobable go/no-go task: inhibition in children. Psychophysiology 52:1228–1237. https://doi.org/10.1111/psyp.12447
    OpenUrlCrossRef
  4. ↵
    1. Bastos AM,
    2. Usrey WM,
    3. Adams RA,
    4. Mangun GR,
    5. Fries P,
    6. Friston KJ
    (2012) Canonical microcircuits for predictive coding. Neuron 76:695–711. https://doi.org/10.1016/j.neuron.2012.10.038 pmid:23177956
    OpenUrlCrossRefPubMed
  5. ↵
    1. Bekinschtein TA,
    2. Dehaene S,
    3. Rohaut B,
    4. Tadel F,
    5. Cohen L,
    6. Naccache L
    (2009) Neural signature of the conscious processing of auditory regularities. Proc Natl Acad Sci U S A 106:1672–1677. https://doi.org/10.1073/pnas.0809667106 pmid:19164526
    OpenUrlAbstract/FREE Full Text
  6. ↵
    1. Bro R,
    2. Kiers HA
    (2003) A new efficient method for determining the number of components in PARAFAC models. J Chemom 17:274–286. https://doi.org/10.1002/cem.801
    OpenUrlCrossRef
  7. ↵
    1. Chao ZC,
    2. Takaura K,
    3. Wang L,
    4. Fujii N,
    5. Dehaene S
    (2018) Large-scale cortical networks for hierarchical prediction and prediction error in the primate brain. Neuron 100:1252–1266. https://doi.org/10.1016/j.neuron.2018.10.004
    OpenUrlCrossRefPubMed
  8. ↵
    1. Chao ZC,
    2. Huang YT,
    3. Wu C-T
    (2022) A quantitative model reveals a frequency ordering of prediction and prediction-error signals in the human brain. Commun Biol. 5:1076. https://doi.org/10.1038/s42003-022-04049-6 pmid:36216885
    OpenUrlPubMed
  9. ↵
    1. Clark A
    (2013) Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci 36:181–204. https://doi.org/10.1017/S0140525X12000477
    OpenUrlCrossRefPubMed
  10. ↵
    1. Dehaene S,
    2. Meyniel F,
    3. Wacongne C,
    4. Wang L,
    5. Pallier C
    (2015) The neural representation of sequences: from transition probabilities to algebraic patterns and linguistic trees. Neuron 88:2–19. https://doi.org/10.1016/j.neuron.2015.09.019
    OpenUrlCrossRef
  11. ↵
    1. Delorme A,
    2. Makeig S
    (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods 134:9–21. https://doi.org/10.1016/j.jneumeth.2003.10.009
    OpenUrlCrossRefPubMed
  12. ↵
    1. Doeller CF,
    2. Opitz B,
    3. Mecklinger A,
    4. Krick C,
    5. Reith W,
    6. Schröger E
    (2003) Prefrontal cortex involvement in preattentive auditory deviance detection: neuroimaging and electrophysiological evidence. NeuroImage 20:1270–1282. https://doi.org/10.1016/S1053-8119(03)00389-6
    OpenUrlCrossRefPubMed
  13. ↵
    1. Dunn MA,
    2. Gomes H,
    3. Gravel J
    (2008) Mismatch negativity in children with autism and typical development. J Autism Dev Disord 38:52–71. https://doi.org/10.1007/s10803-007-0359-3
    OpenUrlCrossRefPubMed
  14. ↵
    1. Erickson MA,
    2. Ruffle A,
    3. Gold JM
    (2016) A meta-analysis of mismatch negativity in schizophrenia: from clinical risk to disease specificity and progression. Biol Psychiatry 79:980–987. https://doi.org/10.1016/j.biopsych.2015.08.025 pmid:26444073
    OpenUrlCrossRefPubMed
  15. ↵
    1. Fisher DJ,
    2. Labelle A,
    3. Knott VJ
    (2012) Alterations of mismatch negativity (MMN) in schizophrenia patients with auditory hallucinations experiencing acute exacerbation of illness. Schizophr Res 139:237–245. https://doi.org/10.1016/j.schres.2012.06.004
    OpenUrlCrossRefPubMed
  16. ↵
    1. Folstein JR,
    2. Van Petten C
    (2008) Influence of cognitive control and mismatch on the N2 component of the ERP: a review. Psychophysiology 45:152–170. https://doi.org/10.1111/j.1469-8986.2007.00602.x pmid:17850238
    OpenUrlCrossRefPubMed
  17. ↵
    1. Friston K
    (2010) The free-energy principle: a unified brain theory? Nat Rev Neurosci 11:127–138. https://doi.org/10.1038/nrn2787
    OpenUrlCrossRefPubMed
  18. ↵
    1. Garrido MI,
    2. Kilner JM,
    3. Kiebel SJ,
    4. Stephan KE,
    5. Friston KJ
    (2007) Dynamic causal modelling of evoked potentials: a reproducibility study. NeuroImage 36:571–580. https://doi.org/10.1016/j.neuroimage.2007.03.014 pmid:17478106
    OpenUrlCrossRefPubMed
  19. ↵
    1. Garrido MI,
    2. Friston KJ,
    3. Kiebel SJ,
    4. Stephan KE,
    5. Baldeweg T,
    6. Kilner JM
    (2008) The functional anatomy of the MMN: a DCM study of the roving paradigm. NeuroImage 42:936–944. pmid:18602841
    OpenUrlCrossRefPubMed
  20. ↵
    1. Giard MH,
    2. Perrin F,
    3. Pernier J,
    4. Bouchet P
    (1990) Brain generators implicated in the processing of auditory stimulus deviance: a topographic event-related potential study. Psychophysiology 27:627–640. https://doi.org/10.1111/j.1469-8986.1990.tb03184.x
    OpenUrlCrossRefPubMed
  21. ↵
    1. Goris J,
    2. Braem S,
    3. Nijhof AD,
    4. Rigoni D,
    5. Deschrijver E,
    6. Van de Cruys S,
    7. Wiersema JR,
    8. Brass M
    (2018) Sensory prediction errors are less modulated by global context in autism spectrum disorder. Biol Psychiatry Cogn Neurosci Neuroimaging 3:667–674. https://doi.org/10.1016/j.bpsc.2018.02.003
    OpenUrl
  22. ↵
    1. Halgren E,
    2. Sherfey J,
    3. Irimia A,
    4. Dale AM,
    5. Marinkovic K
    (2011) Sequential temporo-fronto-temporal activation during monitoring of the auditory environment for temporal patterns. Hum Brain Mapp 32:1260–1276. https://doi.org/10.1002/hbm.21106 pmid:20665718
    OpenUrlCrossRefPubMed
  23. ↵
    1. Harshman RA,
    2. Lundy ME
    (1994) PARAFAC: parallel factor analysis. Comput Stat Data Anal 18:39–72. https://doi.org/10.1016/0167-9473(94)90132-5
    OpenUrl
  24. ↵
    1. Hofmann-Shen C,
    2. Vogel BO,
    3. Kaffes M,
    4. Rudolph A,
    5. Brown EC,
    6. Tas C,
    7. Brüne M,
    8. Neuhaus AH
    (2020) Mapping adaptation, deviance detection, and prediction error in auditory processing. NeuroImage 207:116432. https://doi.org/10.1016/j.neuroimage.2019.116432
    OpenUrlCrossRef
  25. ↵
    1. Javitt DC,
    2. Grochowski S,
    3. Shelley A-M,
    4. Ritter W
    (1998) Impaired mismatch negativity (MMN) generation in schizophrenia as a function of stimulus deviance, probability, and interstimulus/interdeviant interval. Electroencephalogr Clin Neurophysiol 108:143–153. https://doi.org/10.1016/S0168-5597(97)00073-7
    OpenUrlCrossRefPubMed
  26. ↵
    1. Kasai K,
    2. Okazawa K,
    3. Nakagome K,
    4. Hiramatsu K-I,
    5. Hata A,
    6. Fukuda M,
    7. Honda M,
    8. Miyauchi M,
    9. Matsushita M
    (1999) Mismatch negativity and N2b attenuation as an indicator for dysfunction of the preattentive and controlled processing for deviance detection in schizophrenia: a topographic event-related potential study. Schizophr Res 35:141–156. https://doi.org/10.1016/S0920-9964(98)00116-9
    OpenUrlCrossRefPubMed
  27. ↵
    1. Kleiner M,
    2. Brainard D,
    3. Pelli D
    (2007) What's new in Psychtoolbox-3? Available at: https://pure.mpg.de/rest/items/item_1790332/component/file_3136265/content.
  28. ↵
    1. Koshiyama D,
    2. Kirihara K,
    3. Tada M,
    4. Nagai T,
    5. Fujioka M,
    6. Usui K,
    7. Araki T,
    8. Kasai K
    (2020) Reduced auditory mismatch negativity reflects impaired deviance detection in schizophrenia. Schizophr Bull 46:937–946. https://doi.org/10.1093/schbul/sbaa006 pmid:32072183
    OpenUrlCrossRefPubMed
  29. ↵
    1. Kujala T,
    2. Näätänen R
    (2001) The mismatch negativity in evaluating central auditory dysfunction in dyslexia. Neurosci Biobehav Rev 25:535–543. https://doi.org/10.1016/S0149-7634(01)00032-X
    OpenUrlCrossRefPubMed
  30. ↵
    1. MacLean SE,
    2. Ward LM
    (2014) Temporo-frontal phase synchronization supports hierarchical network for mismatch negativity. Clin Neurophysiol 125:1604–1617. https://doi.org/10.1016/j.clinph.2013.12.109
    OpenUrlCrossRefPubMed
  31. ↵
    1. Mognon A,
    2. Jovicich J,
    3. Bruzzone L,
    4. Buiatti M
    (2011) ADJUST: an automatic EEG artifact detector based on the joint use of spatial and temporal features. Psychophysiology 48:229–240. https://doi.org/10.1111/j.1469-8986.2010.01061.x
    OpenUrlCrossRefPubMed
  32. ↵
    1. Molholm S,
    2. Martinez A,
    3. Ritter W,
    4. Javitt DC,
    5. Foxe JJ
    (2005) The neural circuitry of pre-attentive auditory change-detection: an fMRI study of pitch and duration mismatch negativity generators. Cereb Cortex 15:545–551. https://doi.org/10.1093/cercor/bhh155
    OpenUrlCrossRefPubMed
  33. ↵
    1. Näätänen R,
    2. Picton TW
    (1986) N2 and automatic versus controlled processes. Electroencephalogr Clin Neurophysiol Suppl 38:169–186. PMID: 3466775
    OpenUrlPubMed
  34. ↵
    1. Näätänen R,
    2. Paavilainen P,
    3. Rinne T,
    4. Alho K
    (2007) The mismatch negativity (MMN) in basic research of central auditory processing: a review. Clin Neurophysiol 118:2544–2590. https://doi.org/10.1016/j.clinph.2007.04.026
    OpenUrlCrossRefPubMed
  35. ↵
    1. Opitz B,
    2. Rinne T,
    3. Mecklinger A,
    4. von Cramon DY,
    5. Schröger E
    (2002) Differential contribution of frontal and temporal cortices to auditory change detection: fMRI and ERP results. NeuroImage 15:167–174. https://doi.org/10.1006/nimg.2001.0970
    OpenUrlCrossRefPubMed
  36. ↵
    1. Pelli DG
    (1997) The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spat Vis 10:437–442. https://doi.org/10.1163/156856897X00366
    OpenUrlCrossRefPubMed
  37. ↵
    1. Pouryazdian S,
    2. Beheshti S,
    3. Krishnan S
    (2016) CANDECOMP/PARAFAC model order selection based on reconstruction Error in the presence of kronecker structured colored noise. Digit Signal Process 48:12–26. https://doi.org/10.1016/j.dsp.2015.08.014
    OpenUrl
  38. ↵
    1. Pritchard WS,
    2. Shappell SA,
    3. Brandt ME
    (1991) Psychophysiology of N200/N400: a review and classification scheme. In: Advances in psychophysiology: a research annual, (Jennings JR, Ackles PK, Coles MGH, eds), pp 43–106. Jessica Kingsley Publishers.
  39. ↵
    1. Rao RP,
    2. Ballard DH
    (1999) Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci 2:79–87. https://doi.org/10.1038/4580
    OpenUrlCrossRefPubMed
  40. ↵
    1. Rinne T,
    2. Alho K,
    3. Ilmoniemi RJ,
    4. Virtanen J,
    5. Näätänen R
    (2000) Separate time behaviors of the temporal and frontal mismatch negativity sources. NeuroImage 12:14–19. https://doi.org/10.1006/nimg.2000.0591
    OpenUrlCrossRefPubMed
  41. ↵
    1. Sabri M,
    2. Campbell KB
    (2001) Effects of sequential and temporal probability of deviant occurrence on mismatch negativity. Cogn Brain Res 12:171–180. https://doi.org/10.1016/S0926-6410(01)00026-X
    OpenUrlCrossRefPubMed
  42. ↵
    1. Sams M,
    2. Alho K,
    3. Näätänen R
    (1984) Short-term habituation and dishabituation of the mismatch negativity of the ERP. Psychophysiology 21:434–441. https://doi.org/10.1111/j.1469-8986.1984.tb00223.x
    OpenUrlPubMed
  43. ↵
    1. Sato Y,
    2. Yabe H,
    3. Hiruma T,
    4. Sutoh T,
    5. Shinozaki N,
    6. Nashida T,
    7. Kaneko S
    (2000) The effect of deviant stimulus probability on the human mismatch process. Neuroreport 11:3703–3708. https://doi.org/10.1097/00001756-200011270-00023
    OpenUrlCrossRefPubMed
  44. ↵
    1. Sauer A, et al.
    (2017) Impairment in predictive processes during auditory mismatch negativity in ScZ: evidence from event-related fields. Hum Brain Mapp 38:5082–5093. https://doi.org/10.1002/hbm.23716 pmid:28677252
    OpenUrlPubMed
  45. ↵
    1. Schönwiesner M,
    2. Novitski N,
    3. Pakarinen S,
    4. Carlson S,
    5. Tervaniemi M,
    6. Näätänen R
    (2007) Heschl’s gyrus, posterior superior temporal gyrus, and mid-ventrolateral prefrontal cortex have different roles in the detection of acoustic changes. J Neurophysiol 97:2075–2082. https://doi.org/10.1152/jn.01083.2006
    OpenUrlCrossRefPubMed
  46. ↵
    1. Spratling MW
    (2017) A hierarchical predictive coding model of object recognition in natural images. Cognit Comput 9:151–167. https://doi.org/10.1007/s12559-016-9445-1 pmid:28413566
    OpenUrlPubMed
  47. ↵
    1. Wacongne C,
    2. Labyt E,
    3. van Wassenhove V,
    4. Bekinschtein T,
    5. Naccache L,
    6. Dehaene S
    (2011) Evidence for a hierarchy of predictions and prediction errors in human cortex. Proc Natl Acad Sci U S A 108:20754–20759. https://doi.org/10.1073/pnas.1117807108 pmid:22147913
    OpenUrlAbstract/FREE Full Text
  48. ↵
    1. Wacongne C,
    2. Changeux J-P,
    3. Dehaene S
    (2012) A neuronal model of predictive coding accounting for the mismatch negativity. J Neurosci 32:3665–3678. https://doi.org/10.1523/JNEUROSCI.5003-11.2012 pmid:22423089
    OpenUrlAbstract/FREE Full Text

Synthesis

Reviewing Editor: Christine Portfors, Washington State University

Decisions are customarily a result of the Reviewing Editor and the peer reviewers coming together and discussing their recommendations until a consensus is reached. When revisions are invited, a fact-based synthesis statement explaining their decision and outlining what is needed to prepare a revision will be listed below. The following reviewer(s) agreed to reveal their identity: Jinxing Wei. Note: If this manuscript was transferred from JNeurosci and a decision was made to accept the manuscript without peer review, a brief statement to this effect will instead be what is listed below.

This manuscript provides new insights into the intricate architecture of mismatch negativity (MMN), a key neural indicator for deviant detection. Using a refined oddball paradigm with dual-level temporal controls, they identified two unique MMN subcomponents, each linked to prediction errors at different brain hierarchies. This work establishes a practical platform for a multi-tiered neural marker, offering clinical applications for assessing brain function across various hierarchies.

The reviewers have only have some minor comments.

(1). In the introduction part, the author should explicitly state the specific aim or objective of your study at the end of the introduction. This will provide readers with a clear understanding of what you intend to achieve with your research.

(2). Additionally, while you briefly touch upon the theoretical framework of hierarchical predictive coding theory, consider providing a bit more context or explanation for readers who may not be familiar with this theory. This could involve defining key terms or concepts and explaining how they relate to MMN.

(3). In the line 5, where is the block 5 in figure 2A?

(4). The description for figures 2C and 2D lacks clarity and should provide more details regarding the statistical methods employed. Specifically, the author should elaborate on the statistical tests utilized to analyze the data presented in these figures.

(5). Regarding figure 2C, it is unclear whether there is a difference between block 1 and block 2. The author should explicitly state whether any statistical analysis was conducted to compare the data between these blocks. If there is a difference, the nature of this difference should be described in detail. Similarly, in figure 2D, it is unclear whether there is a difference between block 3 and block 4. The author should provide clarification on whether statistical tests were performed to compare the data between these blocks. Additionally, if a difference exists, the author should explain the implications of this difference.

Author Response

Synthesis Statement for Author (Required):

This manuscript provides new insights into the intricate architecture of mismatch negativity (MMN), a key neural indicator for deviant detection. Using a refined oddball paradigm with dual-level temporal controls, they identified two unique MMN subcomponents, each linked to prediction errors at different brain hierarchies. This work establishes a practical platform for a multi-tiered neural marker, offering clinical applications for assessing brain function across various hierarchies.

We thank the reviewers for understanding the key points of our paper.

The reviewers have only some minor comments. (1). In the introduction part, the author should explicitly state the specific aim or objective of your study at the end of the introduction. This will provide readers with a clear understanding of what you intend to achieve with your research.

We emphasize our specific research aim in Introduction section: "In the current study, we aim to clearly dissociate the superimposed MMN into subcomponents showing distinct neural signatures. Specifically, we hypothesize that there are two MMN subcomponents, each responding to the local and global levels of temporal statistics and thus representing the local and global prediction-error signals." (page 6, lines 8-12) (2). Additionally, while you briefly touch upon the theoretical framework of hierarchical predictive coding theory, consider providing a bit more context or explanation for readers who may not be familiar with this theory. This could involve defining key terms or concepts and explaining how they relate to MMN.

We further emphasize the cores of the theory: two bi-directional signaling, hierarchical structure, and prediction-error minimization. In addition, we add recent evidence showing that MMN evoked in different oddball paradigms can be explained by computational models based on the predictive coding principle. We also add that the prediction error can be influenced by the statistical regularity, indicated by studies showing MMN changing with the probability manipulation.

We revise and add the details in the second paragraph of Introduction section: "The theory postulates that bi-directional cascades operate across the hierarchy to minimize prediction errors: top-down predictions for incoming inputs are generated by learning statistical regularities, and bottom-up prediction errors are generated to refine the predictions when discrepancies between expected and actual sensory inputs occur (Bastos et al., 2012a; Clark, 2013; Friston, 2010). The coding has been suggested as an undelaying mechanism of the generation of MMN, which has been tested in different oddball paradigms (Garrido et al., 2007, 2008). Furthermore, as the prediction error indirectly reflects the prediction established based on the statistical regularity of stimuli, many studies have also reported that a less frequent stimulus, thus with less predictability, evokes a larger MMN (Javitt et al., 1998; Sabri & Campbell, 2001;

Sato et al., 2000)." (page 5, lines 12-21) (3). In the line 5, where is the block 5 in figure 2A? This is a misspelling, and we delete it. (4). The description for figures 2C and 2D lacks clarity and should provide more details regarding the statistical methods employed. Specifically, the author should elaborate on the statistical tests utilized to analyze the data presented in these figures.

We add details of statistical tests in Result section. "..., we performed a repeated measures ANOVA across all blocks and participants. First, we found significant differences in the peak amplitude among the blocks (F[3,87] = 30.7, p < 0.001, n = 30 participants, repeated measures ANOVA, Figure 2C)." (page 18, lines 8-11) "... (p < 0.001, n = 30 participants, pairwise comparison, Bonferroni correction, Figure 2C, the red horizontal line)..." (page18, lines 13-14) "... (Spearman's rank correlation coefficient: 0.52, p < 0.001, n = 30 participants)." (page18, lines 20-21) "... (F[3,87] = 9.4, p < 0.001, n = 30 participants, repeated measures ANOVA, Figure 2D)." (page19, line 2) "...(p = 0.01, n = 30 participants, pairwise comparison, Bonferroni correction)..." (page19, line 4) "...(Spearman's rank correlation coefficient: -0.45, p < 0.001, n = 30 participants)." (page19, lines 7-8) We add statistical methods in Figures 2C and 2D, Figures 4B and 4C, and Figure 2-2. (5). Regarding figure 2C, it is unclear whether there is a difference between block 1 and block 2.

The author should explicitly state whether any statistical analysis was conducted to compare the data between these blocks. If there is a difference, the nature of this difference should be described in detail. Similarly, in figure 2D, it is unclear whether there is a difference between block 3 and block 4. The author should provide clarification on whether statistical tests were performed to compare the data between these blocks. Additionally, if a difference exists, the author should explain the implications of this difference.

We address this by adding labels for significant pairwise comparisons in Figures 2C and 2D, and summarizing pairwise comparison results in Figure 2-3. I will explain this in more detail below.

- In Figure 2C, which displays the differences in the peak amplitude among four blocks, there is no significant difference between Block 1 and Block 2 as well as between Block 3 and 4. The statistics were obtained using pairwise comparisons with Bonferroni correction, after the significant difference among the four blocks was obtained using repeated measures ANOVA.

Figure 2-3 shows results of the pairwise comparisons (below). The statistic for the non- significant difference between Block 1 and Block 2 was highlighted in yellow. We observed non-significant differences between two blocks with close deviant probabilities (e.g., TP of the deviant is 0.03 in Block 2 and 0.06 in Block 1), but a trend of increased deviant probabilities with smaller MMN size, which was also tested using Spearman's rank correlations.

- In Figure 2D, which displays the differences in the peak latency among four blocks, there is no significant difference between Block 3 and Block 4 as well as between Blocks 1 and 3 and between Blocks 1 and 4. The statistics were obtained as the above-mentioned methods and shown in the table. We observed non-significant differences among Blocks 1, 3, and 4, but a trend of increased deviant probabilities with earlier MMN onset, which was also tested using Spearman's rank correlations.

We add detailed results in Result section. "In post-hoc tests (Figure 2-3), ..." (page 18, lines 11-12; page 19, lines 2-3) "There were non-significant differences in the rest comparisons between Blocks 1 and 2 and between 3 and 4. Although we did not find the significance when comparing two blocks with close deviant probabilities (e.g., TP of the deviant is 0.03 in Block 2 and 0.06 in Block 1), a trend of the increased TP and SP with the smaller amplitude of the negative peak was observed and further tested (Spearman's rank correlation coefficient: 0.52, p < 0.001, n = 30 participants)." (page 18, lines 16-21) "There were non-significant differences in the rest comparisons between each two of Blocks 1, 3, and 4." (page 19, lines 5-6)

Back to top

In this issue

eneuro: 11 (5)
eNeuro
Vol. 11, Issue 5
May 2024
  • Table of Contents
  • Index by author
  • Masthead (PDF)
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Dissecting Mismatch Negativity: Early and Late Subcomponents for Detecting Deviants in Local and Global Sequence Regularities
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Dissecting Mismatch Negativity: Early and Late Subcomponents for Detecting Deviants in Local and Global Sequence Regularities
Yiyuan Teresa Huang, Chien-Te Wu, Shinsuke Koike, Zenas C. Chao
eNeuro 3 May 2024, 11 (5) ENEURO.0050-24.2024; DOI: 10.1523/ENEURO.0050-24.2024

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
Dissecting Mismatch Negativity: Early and Late Subcomponents for Detecting Deviants in Local and Global Sequence Regularities
Yiyuan Teresa Huang, Chien-Te Wu, Shinsuke Koike, Zenas C. Chao
eNeuro 3 May 2024, 11 (5) ENEURO.0050-24.2024; DOI: 10.1523/ENEURO.0050-24.2024
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Data Availability
    • Footnotes
    • References
    • Synthesis
    • Author Response
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • EEG
  • hierarchy
  • mismatch negativity
  • predictive coding
  • subcomponents

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Article: New Research

  • Numbers of granule cells and GABAergic boutons are correlated in shrunken sclerotic hippocampi of sea lions with temporal lobe epilepsy
  • Breaching the blood-brain interface: Vasoactive neurons contact capillary vessels of the brain clock in the suprachiasmatic nucleus
  • Transcriptional Changes Fade Prior to Long-Term Memory for Sensitization of the Aplysia Siphon-Withdrawal Reflex.
Show more Research Article: New Research

Sensory and Motor Systems

  • Robust representation and nonlinear spectral integration of harmonic stacks in layer 4 of mouse primary auditory cortex
  • Changes in Palatability Processing across the Estrous Cycle Are Modulated by Hypothalamic Estradiol Signaling
  • Automatic, but not autonomous: Implicit adaptation is modulated by goal-directed attentional demands
Show more Sensory and Motor Systems

Subjects

  • Sensory and Motor Systems
  • Home
  • Alerts
  • Follow SFN on BlueSky
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Notice
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2026 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.