Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro
eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleResearch Article: Methods/New Tools, Novel Tools and Methods

Assessing Cross-Contamination in Spike-Sorted Electrophysiology Data

Jack P. Vincent and Michael N. Economo
eNeuro 2 August 2024, 11 (8) ENEURO.0554-23.2024; https://doi.org/10.1523/ENEURO.0554-23.2024
Jack P. Vincent
1Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
2Center for Neurophotonics, Boston University, Boston, Massachusetts 02215
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Jack P. Vincent
Michael N. Economo
1Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
2Center for Neurophotonics, Boston University, Boston, Massachusetts 02215
3Center for Systems Neuroscience, Boston University, Boston, Massachusetts 02215
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michael N. Economo
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Abstract

Recent advances in extracellular electrophysiology now facilitate the recording of spikes from hundreds or thousands of neurons simultaneously. This has necessitated both the development of new computational methods for spike sorting and better methods to determine spike-sorting accuracy. One long-standing method of assessing the false discovery rate (FDR) of spike sorting—the rate at which spikes are assigned to the wrong cluster—has been the rate of interspike interval (ISI) violations. Despite their near ubiquitous usage in spike sorting, our understanding of how exactly ISI violations relate to FDR, as well as best practices for using ISI violations as a quality metric, remains limited. Here, we describe an analytical solution that can be used to predict FDR from the ISI violation rate (ISIv). We test this model in silico through Monte Carlo simulation and apply it to publicly available spike-sorted electrophysiology datasets. We find that the relationship between ISIv and FDR is highly nonlinear, with additional dependencies on firing frequency, the correlation in activity between neurons, and contaminant neuron count. Predicted median FDRs in public datasets recorded in mice were found to range from 3.1 to 50.0%. We found that stochasticity in the occurrence of ISI violations as well as uncertainty in cluster-specific parameters make it difficult to predict FDR for single clusters with high confidence but that FDR can be estimated accurately across a population of clusters. Our findings will help the growing community of researchers using extracellular electrophysiology assess spike-sorting accuracy in a principled manner.

  • electrophysiology
  • spike sorting

Significance Statement

High-density silicon probes are widely used to record the activity of large populations of neurons while animals are engaged in complex behavior. In this approach, each electrode records spikes from many neurons, and “spike-sorting” algorithms are used to group the spikes originating from each neuron together. This process is error-prone, however, and so the ability to assess spike-sorting accuracy is essential for properly interpreting neural activity. The rate of interspike interval (ISI) violations is commonly used to assess spike-sorting accuracy, but the relationship between the ISI violation rate (ISIv) and sorting accuracy is complex and poorly understood. Here, we describe this relationship in detail and provide guidelines for how to properly use ISIv to assess spike-sorting accuracy.

Introduction

Extracellular electrophysiology has become an increasingly popular method for studying neuronal activity at the population level. Silicon probes containing dozens or hundreds of densely packed electrode sites can be used to observe neuronal action potentials in many neurons simultaneously. This multiplexed signal acquisition, however, often necessitates the assignment of observed action potentials to individual neurons—“spike sorting”—as a critical first step prior to many analyses (Todorova et al., 2014; Rey et al., 2015). Approaches for spike sorting vary, though all generally involve comparisons of observed action potential waveforms within and across electrodes and then grouping similar spikes—thought to be produced by the same neuron—together, forming clusters (Gibson et al., 2012; Quiroga and Panzeri, 2013). Ideally, each cluster is composed principally of true-positive (TP) spikes from a single neuron making it a “well-isolated” cluster. Spikes from this single neuron can be erroneously excluded from the cluster, resulting in false negatives (FNs). The primary focus of this work, however, is false positives (FPs), wherein spikes are misassigned to a cluster whose activity is meant to correspond to a different neuron. Clusters with substantial “contamination” by FPs thus represent the combined activity of multiple neurons.

FPs are a persistent problem in spike sorting that results from frequently unavoidable similarities in action potential waveforms, occurrence of spikes at overlapping times, nonstationarity in waveform shape, and recording noise. Contamination can distort the activity of a cluster, potentially leading to incorrect conclusions about how single neurons encode information. For instance, a population of neurons recorded during a task with two cues may contain neurons responsive to just one cue or the other. However, poor sorting may lead to cluster cross-contamination between these two phenotypes, giving the impression that neurons in this region respond to both cues. The prevalence of FPs in a single cluster, recording session, or dataset can be described by the false discovery rate (FDR), a value that ranges between 0 and 1 and reports the proportion of sorted spikes that have been misassigned. While FNs can also be a concern, as they can decrease recorded spike frequencies and thus reduce statistical power in subsequent analyses (Hill et al., 2011), FNs should not generally alter the overall patterns of activity associated with individual clusters and are thus of less concern than FPs.

Algorithmic approaches to spike sorting for high-density silicon probe electrophysiology have seen concentrated development over the last decade, with researchers now selecting from a number of competing options for high-throughput automated spike sorting (Bestel et al., 2012; Pachitariu et al., 2016; Chung et al., 2017; J. H. Lee et al., 2017; Jun et al., 2017; Chaure et al., 2018; Buccino et al., 2020; Saif-ur-Rehman et al., 2021; Toosi et al., 2021), yet concomitant techniques for post hoc analysis of spike-sorted data (Pouzat et al., 2002; Hill et al., 2011; Neymotin et al., 2011; Barnett et al., 2016; Magland et al., 2020) have been given comparatively less attention. Sorting quality metrics generally fall into two categories: assessment of cluster overlap using dimensionally reduced representations of sorted spike waveforms, e.g., L-ratio (Schmitzer-Torbert et al., 2005), isolation distance (Harris et al., 2001), D-prime (Hill et al., 2011), and silhouette score, or empirical measures known to be related to cluster isolation and sorting difficulty, e.g., signal-to-noise ratio, presence ratio, firing range, and interspike interval (ISI) violations. Among all these metrics, ISI violations are unique in that they are tightly linked to the occurrence of FPs. Biophysical limitations prevent neurons from producing consecutive spikes within their absolute refractory period, meaning the presence of action potentials spaced by less than the absolute refractory period, an ISI violation, is always the result of at least one FP.

ISI violations are typically reported as a fraction of the total number of spikes assigned to a cluster. The ISI violation rate (ISIv)—the number of ISI violations divided by the total number of spikes assigned to a cluster—is often interpreted subjectively with only a general understanding that a lower ISIv is associated with a lower FDR. Often, although not always, it is appreciated that most FP spikes do not produce ISI violations and so ISIv << FDR. Previous work has estimated the relationship between ISIv and FDR (Hill et al., 2011; Llobet et al., 2022) using simplifying assumptions, but the accuracy and limitations of predicting cluster FDR on the basis of ISI violations under realistic experimental conditions have not been assessed.

Here, we develop a comprehensive model explaining the relationship between ISIv and FDR following spike sorting with respect to cluster contamination, neuronal firing frequency, the temporal relationships between neurons, and the number of neurons contributing FPs. We benchmark the accuracy of this model in silico through Monte Carlo simulation and explore limitations in the accuracy of FDR estimation imposed by in vivo recording conditions. Finally, we apply this model to publicly available spike-sorted electrophysiology data to estimate FDRs from ISI violations in published studies and provide researchers with intuition about the range of FDRs expected in silicon probe electrophysiology.

Materials and Methods

Monte Carlo simulation of neural spike trains

Stochastic neural spike trains were simulated using Elephant (Electrophysiology Analysis Toolkit; Yegenoglu et al., 2018). Specifically, neurons were modeled as either homogeneous or inhomogeneous Poisson processes (van Vreeswijk, 2010) using either the StationaryPoissonProcess or the NonStationaryPoissonProcess functions of the spike_train_generation module. Custom Python scripts were used for subsequent simulation and analysis. While Poisson point processes were used for all simulations of neural spike trains, post hoc custom simulations of gamma, inverse Gaussian, and log-normal point processes indicated that observed results did not seem to depend on any particular point process or ISI coefficient of variation (CV; Extended Data Fig. 3-1). A refractory period of 2.5 ms was assumed for all simulations as well as all calculations in Table 2. All datasets examined were recorded in mice of both sexes, and most recordings targeted the neocortex. However, for data collected from different brain regions or species, this parameter would need to be adjusted accordingly. Simulating “infinite” contaminant neurons was accomplished by simulating a single contaminant neuron with no refractory period. Simulated recording durations varied depending on the desired level of certainty in ISIv and need to emulate realistic recording conditions. These times included ∼28 h (Fig. 3A,B), ∼17 h (Fig. 3C), 12 h (Fig. 3D), 30 min (Fig. 2B), and 10 min (Figs. 2A,C, 4B,C).

Monte Carlo simulation of cluster populations

In some simulations, all parameters were varied simultaneously to produce populations of clusters with a wide range of physiologically feasible parametric combinations (Figs. 3D, 4B,C). For each cluster, the following multistep process was used: (1) FDR, N, and ft were randomly sampled from bounded probability distributions. Specifically, FDR was sampled from bounded Cauchy distributions as these were found to most accurately replicate ISIv distributions found in electrophysiological data when simulated. Different population median and mean FDRs were obtained by adjusting the location and scale of the Cauchy distribution. ft was randomly selected from a bounded uniform distribution: [4, 20] (Fig. 3D), [1, 10] (Fig. 4B), and [4, 16] (Fig. 4C). Possible values of N were uniformly chosen from among the following values: [1, 2, 5, ∞]. This was found to be the most balanced way to sample N, as the effect of contaminant neuron count on ISI violation occurrence increases logarithmically. (2) Unit vector peristimulus time histograms (PSTHs) were randomly selected from one of the datasets in Table 2 to serve as f^TP and f^FP . These were selected uniformly from among all clusters present in the dataset. (3) These PSTHs were then scaled appropriately and combined to reach the desired ft and FDR. The correct scaling was determined using a function optimization routine (minimize_scalar, SciPy) that sought to minimize the difference between each scaled PSTH's current and desired average firing frequency. TP–FP covariance obtained using this method varied from −49.2 to 113.7 Hz2. Longer simulated recording times were attained through repeated simulation of these PSTHs.

Calculation of PSTHs

PSTHs of cluster responses for in vivo electrophysiology data were calculated using bin sizes of 50 ms. Predicted median and mean FDR were found to be largely unaffected by bin size selection in trial-averaged PSTHs. For continuously recorded data, the greatest length of time that could be extracted around each cue without overlapping between “trials” was used to generate trial-averaged PSTHs.

Estimating unknown cluster parameters

When predicting FDR using recorded data, only limited information is available about each sorted cluster. f→t and ISIv can be computed directly. τ can be obtained through prior knowledge of electrophysiological properties of neurons in the organism and brain region being recorded from. f^FP can be estimated through examination of other sorted clusters present in the recording session. Estimating N is not straightforward.

To account for uncertainty in N, two approaches can be taken. A reasonable guess (e.g., 2, 3) can be chosen, and simply plugged directly into the equation, or multiple values representing extreme cases can be input and their results averaged. For example, FDR can be calculated by assuming N equals either 1 or ∞ and then taking the average of results from these two cases. Empirically, we found this equivalent to assuming a single N of ∼2–3. Our results indicate that FDR estimates are not highly sensitive to the particular choice of N (Fig. 2C).

Depending on the assumed N, f^FP can be estimated from either other single clusters or averaged combinations of clusters. For example, if an N of 1 is assumed, f^FP can be directly obtained from other random clusters, while if an N of 2 is assumed, f^FP can be obtained by taking the average of 2 other random clusters. If an N of ∞ is assumed, f^FP can be obtained from a global average PSTH. Clusters for deriving f^FP can be restricted to those on the same or nearby electrode sites.

Estimation of FDR in simulated cluster populations and electrophysiology datasets

For FDR predictions in both simulated cluster populations (Figs. 3D, 4B,C) and electrophysiology datasets (Table 2), the following methodology was used. An N of both 1 and ∞ were assumed: in the former case, a given sorted cluster was compared with every other cluster in the simulation or recording session, while in the latter, it was compared with a single global f^FP . The final FDR is the mean of the (N = 1) cases averaged with the (N = ∞) case.

For estimating FDR in electrophysiology datasets (Table 2), clusters for deriving f^FP were not restricted to those on the same or nearby electrode sites due to uncertainty about each dataset's probe geometry. Probe geometry was also not implemented in simulations of cluster populations (Figs. 3D, 4B,C). A single τ of 2.5 ms was used for every dataset, all of which were comprised of recordings from mouse brains. This refractory period was modified by the censor period, when necessary. Censor periods were determined through visual inspection of aggregated per-cluster ISI histograms across the entire dataset. For selecting datasets to examine, only papers published in the last 10 years with publicly available spike-sorted electrophysiology data were considered. No limits or minimums were placed on cluster count, and no sorting methods were specifically included or excluded. In some datasets, spontaneous activity was recorded without a cue to calculate PSTHs around. In such cases, FDR was initially predicted using firing frequency vectors calculated across the entire session. However, these predictions were found to be sensitive to the bin size used; therefore the predicted FDR assuming homogeneous firing is given instead (Table 1; Eq. 9).

View this table:
  • View inline
  • View popup
Table 1.

Symbols used

Theoretical limits of predicted FDR

Due to stochasticity associated with ISIv estimates, observed ISIv values sometimes exceed theoretical bounds given by the chosen parameters, producing imaginary predicted FDRs. In such cases, the predicted FDR is capped at its theoretical maximum for a given (assumed) number of contaminant neurons (Eq. 1). This maximum is derived from the fact that given a certain N, if predicted FDR exceeds FDRmax, a lower FDR could be attained by simply selecting a different neuron within the sorted cluster to be the TP neuron. When FDR is being calculated by averaging the (N = 1) and (N = ∞) cases, FDRmax is the average of these two cases’ maximums, 0.75, as follows:FDRmax=NN+1.

Code availability

The code for calculating FDR with user-provided spike–sorted electrophysiology data, reproducing figures, and simulating neuronal firing and spike sorting is available via GitHub (https://github.com/economolab/DCISIv).

Results

The relationship between ISIv and FDR is complex

We sought to understand how the occurrence of ISI violations depends on underlying cluster FDR and how other underlying characteristics of neuronal activity might affect this relationship. To this end, we focused on three variables likely to have a meaningful effect on the occurrence of ISI violations: neuronal firing frequency (Fig. 1A), temporal correlation of activity among the recorded population of neurons (Fig. 1B), and the number of contaminant neurons (Fig. 1C).

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Factors affecting the relationship between ISIv and FDR. Schematic representation of occurrence of ISI violations for a cluster with varying firing frequencies (A) TP–FP covariance (B) and numbers of contaminant neuron(s) (C). In all cases, underlying FDR between the two cases is the same, while observed ISIv varies as a consequence of changes in these characteristics of neuronal activity. Blue corresponds to TP spikes, and red corresponds to FP spikes. Overhead dots indicate observed ISI violations. τ represents the neuronal refractory period. ISI violations can occur between TPs and FPs (TP-to-FP) or, if multiple contaminant neurons are present, FPs and other FPs (FP-to-FP).

To determine how these variables affect the relationship between ISIv and FDR, we used Monte Carlo simulations of neural spiking to examine how the relationship between ISIv and FDR might change as a consequence of varying each parameter in isolation. Total cluster firing frequency was found to have a dramatic effect on ISI violation production, since overall firing frequency played a large role in determining the likelihood that any given FP spike would produce an ISI violation. Critically, clusters with lower firing frequencies and clusters with higher firing frequencies could both present with the same ISIv, even when they had markedly different underlying FDRs (5–50%; Fig. 2A). These results indicated a nonlinear relationship between ISI violation production and firing frequency not accounted for by simply dividing the number of ISI violations by the number of spikes (ISIv). The temporal overlap between TPs and FPs was also found to strongly modulate ISI violation production, although not as strongly as cluster firing frequency (Fig. 2B). Variable TP–FP covariance (0.8 to −0.5 Hz2) altered the probability of any given FP leading to an ISI violation, resulting in three clusters with substantially different FDRs (13–36%) presenting with the same ISIv and firing frequency. Lastly, the number of contaminant neurons was also found to modulate ISI violation production, although not as meaningfully as other variables (Fig. 2C). Greater numbers of contaminant neurons increased the odds of ISI violations between pairs of FPs, meaning the same observed ISIv was associated with a slightly lower FDR in cases of multiple contaminant neurons versus just one contaminant neuron. The dependence of this phenomenon on FP-to-FP violations (Fig. 1C) means its effects only became meaningful at high FDRs (>0.25) and total firing frequencies (>10 Hz).

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Simulated ISI distributions with variable underlying neuronal characteristics. Representative ISI histograms with all factors kept constant except total cluster firing frequency (A), TP–FP covariance (B) and contaminant neuron count (C). ISIs within the red rectangle are shorter than the refractory period and correspond to ISI violations. ISIv was identical across conditions in each row. In A, ISI violations may appear to constitute a smaller proportion of total ISIs in higher firing frequency conditions, but this is only because these conditions had lower average ISIs and all conditions were plotted within the same 0–25 ms ISI domain. Simulated firing frequencies were as indicated (A), 2 Hz (B), and 10 Hz (C). Simulated firing was either homogeneous (A, C) or inhomogeneous (B). In A, outlines of the histograms in the right two panels are overlaid to scale with high transparency on the left panel. Blue corresponds to TP spikes, and red corresponds to FP spikes. FP firing frequency traces (B) are schematics only. The case of “infinite” contaminant neurons was simulated using a single contaminant neuron with no refractory period.

Analytical model of the relationship between ISIv and FDR

We next sought to derive an analytical model describing the dependence of underlying FDR on observed ISIv that incorporates each of these variables. To that end, we first considered a simplified case in which two neurons are each firing homogeneously or at a constant frequency and spikes from both neurons are being assigned to the same cluster. Each TP spike produces a double-sided “violation window” in time. If an FP spike occurs within that window, an ISI violation is observed (Eq. 2; Fig. 1C) as follows:fv=fFP(2τfTP). Here, we represent the number of ISI violations observed per second as fv, the neuronal absolute refractory period as τ, and the frequencies with which the two neurons produce TP and FP spikes as fTP (TP frequency) and fFP (FP frequency). Note that fTP and fFP may not necessarily represent the total firing frequencies of the neurons producing the TP and FP spikes, only the frequencies at which spikes from those neurons are assigned to a sorted cluster. These can be one and the same, e.g., for a neuron contributing TPs with no FNs. Also note that fTP and fFP cannot be measured experimentally, instead representing unseen parameters of spike train generation. By making substitutions based on a few simple relationships (Eqs. 3–5), Equation 2 can be solved using the quadratic formula, defining FDR as a function of the following experimentally observable variables: ISIv, τ, and ft—the total observed firing frequency of the cluster (Eq. 6; see Extended Data for intermediate steps) as follows:ISIv=fvft, FDR=fFPft, ft=fTP+fFP, FDR=12(1−1−2ISIvτft). The larger root is ignored, i.e., the term under the square root is subtracted and not added, because the neuron with the most spikes in the cluster is de facto considered the TP-contributing neuron. Some spike-sorting algorithms make use of a censor period, whereby spikes detected within a certain minimum distance, τc, of another spike are ignored. In such cases, the size of the violation window produced by each TP spike is shortened by this censor period producing a new effective refractory period: τe = τ − τc. Implementation of τe produces Equation 7, which is equivalent to a rearranged form of the equation derived in Hill et al. (2011) as follows:fv=fFP(2τefTP). At the other extreme, consider a situation where the spikes comprising a cluster are generated by an infinite number of neurons. In reality, there can never be an infinite number of contaminant neurons, but this term is a convenient shorthand for describing the limiting case where there as many contaminant neurons as FPs. In this limiting case, any FP can produce an ISI violation with any other FP, necessitating the addition of a second term wherein FP spikes now produce double-sided violation windows as well. This term is scaled by a factor of one-half to prevent double counting of FPs producing ISI violations with one another. Implementation of this term produces Equation 8, which is equivalent to a rearranged form of the equation derived in Llobet et al. (2022). This equation can be solved for FDR using substitutions and the quadratic formula as previously as the following:fv=fFP(2τefTP)+12fFP(2τefFP). For an unspecified number of contaminant neurons N, an additional scaling factor of (N − 1) / N can be added to the FP–FP ISI violations term (Eq. 9). This factor can be interpreted as the fraction of all FPs available for any given contaminant neuron's spikes to produce ISI violations with, e.g., 1/2 for N = 2 and 2/3 for N = 3. This equation, like previous iterations, can be rearranged to solve for FDR with an additional dependence added on N as follows:fv=fFP(2τefTP)+12(N−1N)fFP(2τefFP). In the more biologically relevant case of inhomogeneous firing, or neural spiking frequencies that vary over time, fTP, fFP, ft, and fv can all be considered not as constants but as functions of time (i.e., vector quantities). While this spiking nonstationarity must be taken into account, a time-varying estimate of FDR would be a needless level of granularity and also highly inaccurate given the stochastic nature of neuronal spiking and ISI violations, so of primary interest is a time-averaged estimate of the relationship between violation frequency (f¯v) and underlying variables. In this case, the frequency of violations depends not on the product of the average values of fTP and fFP but on the expected value of their element-wise product, E[f→TPf→FP] , as follows:f¯v=2τeE[f→TPf→FP]+12(N−1N)2τeE[f→FPf→FP]. For two vectors representing firing frequency over time f→TP and f→FP of length n elements, this expected value can be calculated as follows:E[f→TPf→FP]=f→TP⋅f→FPn. Equation 10 can then be solved for |f→FP| , the vector magnitude of f→FP , using the quadratic formula (Eq. 12). Unit vector f^FP can subsequently be scaled by |f→FP| , averaged, and finally divided by the mean total firing frequency of the cluster to obtain a time-averaged estimate of FDR (Eq. 13) as follows:|f→FP|=NN+1(D|f→t|−D2|f→t|2−(N+1N)(f¯tISIvnτe)), FDR=(1f¯t)1n∑(|f→FP|f^FP)=f¯FPf¯t. Here, D corresponds to the dot product of the unit vectors representing total cluster spike frequency and cluster FP spike frequency, D=f^t⋅f^FP . This can be thought of more generally as representing the degree to which the time-varying total cluster spike frequency temporally overlaps with the time-varying FP spike frequency. This final equation depends on a number of parameters specific to each cluster to obtain a single FDR estimate: (1) the effective refractory period τe, (2) the temporal distribution of activity in the cluster of interest (f→t) and (3) of other clusters contributing FP spikes (f^FP) , (4) the observed ISIv, and (5) the number of contaminant neurons N. For information on how one can estimate these parameters from experimental data, see Materials and Methods.

Prediction of FDR in silico

To both assess our model's predictive power and more generally illustrate relationships between model variables, we next simulated neural spike trains while varying all relevant parameters across a range of biologically relevant values and then attempted to predict FDR from the observed ISIv. In this case, parameters like N and f^FP that may normally have associated uncertainty are known exactly. Simulating long periods of time (up to 28 h of recording time) also enables highly accurate estimates of ISIv.

For both homogeneous and inhomogeneous firing, we found that analytical FDR predictions closely approximated the true underlying FDRs (Fig. 3). For homogeneous firing specifically, observed ISIv was found to strongly depend on FDR and total cluster firing frequency, as expected (Fig. 3A). A linear dependence was observed of ISIv on firing frequency at fixed FDR and contaminant neuron counts, despite ISIv often being assumed to already have normalized for cluster firing frequency. Furthermore, FDR was found to scale quadratically with increasing ISIv at a fixed firing frequency and contaminant neuron count (Fig. 3B). ISIv values of 0.1–1% represent typical thresholds in literature for considering a cluster well isolated (Jadhav et al., 2009; Roy and Wang, 2012; Chandrasekaran et al., 2017; Wright et al., 2021; Boucher et al., 2023; Zhao et al., 2023). Yet, our results indicate that FDRs associated with these ISIv values vary considerably with the firing frequency of the cluster in question (Fig. 3A,B). As an illustration, an ISIv of 0.5% reflected a desirable 5% FDR for a cluster firing at 20 Hz or a much higher 50% FDR for one firing at 3 Hz. In general, contaminant neuron count was of limited consequence unless the cluster in question had both a high firing frequency and high FDR, making FP–FP violations frequent enough to meaningfully affect overall ISI violation incidence.

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

Relationship between single-cluster FDR and observed ISIv. A, Dependence of ISIv on total firing frequency given varying FDRs and contaminant neuron counts. Lines correspond to analytical predictions; dots correspond to simulation results. Plotted data apply to both primary and gray axes; the latter provide an inset for the former. B, Dependence of ISIv on FDR given varying firing frequencies and contaminant neuron counts. Conventions as in A. C, Prediction of FDR from observed ISIv with temporally inhomogeneous firing frequencies using either the homogeneous model (Eq. 9) or the inhomogeneous model (Eq. 10). D, Prediction of FDR from observed ISIv for 100 total clusters simulated across a range of physiologically relevant underlying neuronal characteristics. Total firing frequency was varied between 4 and 20 Hz, N was varied between 1 and ∞, and f^FP was obtained by averaging across other clusters (see Materials and Methods for more details). The red line is the unity line, or perfect concurrence between predicted and true FDR; dashed gray line is the line of best fit. RMSE calculated with respect to the unity line. Clusters shown in D simulated using Poisson point processes. See Extended Data Figure 3-1 for validation using non-Poisson point processes. Simulated firing was either homogeneous (A, B) or inhomogeneous (C, D). The model used for FDR predictions was either homogeneous (A, B) or inhomogeneous (D), or both were used (C).

Figure 3-1

Prediction of FDR in non-Poisson point processes. Prediction of FDR from observed ISIv when simulating neural spiking by drawing ISIs from gamma distributions (A), inverse Gaussian distributions (B), or log-normal distributions (C). 100 total clusters simulated in each panel across a range of physiologically relevant underlying neuronal characteristics. Total firing frequency was varied between 4 and 20 Hz, N was varied between 1 and 10, and f^FP was obtained by averaging across other clusters (see Materials and Methods for more details). Coefficient of variation (CV) of ISI distributions varied from 0.5 to 2. Red line is the unity line, or perfect concurrence between predicted and true FDR; dashed gray line is the line of best fit. Root mean square error (RMSE) calculated with respect to the unity line. Simulated firing was homogeneous and the model used for FDR predictions was also homogeneous (Eq. 9). A gamma distribution with CV = 1 is identical to an exponential distribution, producing a Poisson point process. Bottom raster plots show gamma distributed spiking of an example 10 Hz neuron at various CVs. Stacked points indicate spikes occurring in quick succession. Download Figure 3-1, TIF file.

When the activity of clusters was inhomogeneous in time, errors in FDR predicted with the homogeneous model (Eq. 9) scaled linearly with the temporal covariance of TPs and FPs (Fig. 3C). Positive covariance increased the ISIv at the same FDR, resulting in overestimation of FDR, while negative covariance conversely decreased ISI violations, resulting in underestimation of FDR. When inhomogeneous firing was appropriately taken into account (Eq. 10), predicted FDR closely approximated true FDR regardless of the covariances in neuronal firing. When the activity of clusters contributing TP spikes and FP spikes varied independently in time (inhomogeneous but zero covariance), the probability of TP–FP violations was unchanged compared with homogeneous spiking. Therefore, under these conditions, the homogeneous and inhomogeneous predictions agree even if the generation of TPs and FPs individually may not necessarily be homogeneous.

We next attempted to predict FDR from ISIv (Eqs. 12, 13) in a population of clusters simulated inhomogeneously across a broad, randomized parameter space. Regardless of the precise combination of parameters for any given cluster, predicted FDR and true FDR agreed with low root mean square error (RMSE; Fig. 3D; RMSE, 0.02). This prediction accuracy also generalized to simulations of non-Poisson point processes across a range of possible ISI coefficients of variation (Extended Data Fig. 3-1; RMSE, 0.01–0.02).

Prediction of FDR under realistic conditions

Our model performed well when ISIv and other parameters were known exactly, although benchmarking simulations indicated that predictions of FDR are sensitive to small changes in ISIv (Fig. 3), particularly at lower firing frequencies (Fig. 3B). We next wanted to determine whether FDR could be accurately predicted from recordings of finite duration with noisy ISIv estimates and when exact values of N and f^FP are unknown. Simulating spiking for finite durations, we found that observed ISIv values were normally distributed around their true values (Fig. 4A), with increasing recording time decreasing the variance of this distribution, as expected.

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Prediction of FDR for single clusters and populations. A, Probability density functions of observed % ISIv for a prototypical 8 Hz, 15% FDR cluster recorded for varying time lengths. These parameters produce a true % ISIv of 0.5%, a value that has been used as a threshold for considering clusters well isolated (Jadhav et al., 2009; Guo et al., 2014; Chandrasekaran et al., 2017). B, Prediction of FDR in single clusters recorded for 10 min. All parameters varied simultaneously across a range of physiologically relevant underlying neuronal characteristics. Total firing frequency was varied between 1 and 10 Hz, N was varied between 1 and ∞, and f^FP was obtained by averaging across other clusters (see Materials and Methods for more details). Ceiling effect of predicted FDR at 0.75 due to theoretical limit of FDR when predicting with unknown N (see Materials and Methods). The red line is the unity line or perfect concurrence between predicted and true FDR; the dashed gray line is the line of best fit. RMSE calculated with respect to the unity line. C, Prediction of median and mean FDR in 1,000-cluster populations. Cluster FDRs are Cauchy-distributed around the population mean. Total firing frequency was varied between 4 and 16 Hz; otherwise parameters varied simultaneously and conventions as in B. D, Minimum recording time required for predictions of FDR in a single cluster to have a CV of 20%.

To assess the effect of noisy ISIv estimates on FDR predictions, we again simulated spiking while simultaneously varying all previously described parameters, but this time recording duration was restricted to 10 min (Fig. 4B,C). In this case, we found that FDR predictions for individual clusters were substantially less accurate (RMSE, 0.13), although they did not deviate systematically from true FDRs (Fig. 4B). Even with this highly restricted recording time, FDR population statistics could be estimated across a set of clusters with high fidelity so long as a sufficient number of clusters were sampled (Fig. 4C). For example, median FDR and mean FDR could be predicted with RMSEs of 0.03 and 0.02, respectively, across 1,000 clusters. Predicted median FDRs were slightly overestimated at high true median FDR (>0.25).

We next sought to determine the duration of neural recordings necessary to obtain accurate estimates of ISIv and, thus, FDR in single clusters. To accomplish this, we simulated clusters with various firing frequencies and FDRs and determined the recording time sufficient to produce a CV of the predicted FDR of 20% (Fig. 4D; e.g., 50 ± 10% or 5 ± 1%). Surprisingly, we found that clusters with firing frequencies of 1–2 Hz required observations across hundreds of minutes to produce accurate estimates of single-cluster FDR, an infeasible recording time in many common experimental paradigms. For clusters with firing frequencies >5 Hz, FDRs between 5 and 30% could be estimated accurately using tens of minutes of spiking data.

We used our model to estimate the FDRs of clusters contained within 12 publicly available datasets that included spike-sorted electrophysiology recordings (Table 2). For all datasets, the inhomogeneous model (Eq. 10) was used, with the exception of Stringer et al. (2019) and Juavinett et al. (2019). Animal behavior in these datasets was not trial-based, and so it was not straightforward to accurately estimate time-varying firing frequencies, necessitating the use of the homogeneous model (Eq. 9; see Materials and Methods). Given limitations associated with single-cluster FDR predictions (Fig. 4B), estimated median and mean FDRs were reported across all clusters present in each dataset. An average median FDR of 12.9 ± 13.5% (SD) was observed along with an average mean FDR of 24.1 ± 9.2% (SD). No obvious correlations between cluster count, spike-sorting methodology, or recording technology and dataset FDR were observed. With the exception of Juavinett et al. (2019), median estimated FDRs were consistently lower than mean estimated FDRs. This implies that cluster FDR distributions in recorded electrophysiology datasets tend to be right-skewed, composed of a large proportion of clusters possessing FDRs closer to 0 as well as a broadly distributed complement of more contaminated clusters, some of which potentially reaching FDRs well above 0.5.

View this table:
  • View inline
  • View popup
Table 2.

Median and mean FDR of publicly available spike-sorted electrophysiology datasets

Importantly, no attempts were made to curate clusters included in analysis from each dataset; all available sessions and clusters were assessed, with the exception of clusters labeled “multi” or “MUA” in datasets with cluster quality annotations. It's possible that some datasets were precurated, with clusters discarded according to some exclusion criteria prior to being uploaded, while others were shared uncurated. Depending on the nature of the scientific question being addressed, better cluster isolation is undoubtedly a larger priority for some datasets than others. For these reasons and due to uncertainty in assessing censor period in each dataset, we emphasize that these findings should serve generally as an overall survey of the range of expected contamination levels in datasets produced using widely used methods rather than as a commentary on individual datasets or spike-sorting methodologies.

Discussion

ISI violations are the most commonly employed metric of accuracy in spike sorting, serving as an indication of the FDR—the rate at which spikes are erroneously assigned to the wrong cluster. Here, we used Monte Carlo simulations to demonstrate that the ISIv is related to FDR through a complex relationship that depends on many factors, including (1) the neuronal firing frequency, (2) the temporal correlation in activity between neurons contributing to a cluster, and (3) the number of neurons contributing spikes to a cluster—in that order of descending importance. We derived an analytical model that can be used to predict FDR from ISIv that incorporates these factors and determine the accuracy with which FDR can be inferred during finite-length recordings at the level of single clusters and datasets. Finally, we used this model to assess the FDR of clusters contained in publicly available spike-sorted electrophysiology datasets to provide bounds on the accuracy that can be reasonably expected by experimenters. Our study makes four central contributions.

First, we derive an analytical model that can be used to estimate FDR from ISIv accurately across a broad parameter space.

Second, we explicitly demonstrate that FDR is not linearly related to ISIv but depends critically on the total cluster spike frequency. While this dependence can be inferred from previous work (Hill et al., 2011; Llobet et al., 2022), our results underscore the inappropriateness of using ISIv as an inclusion criterion for single clusters—which is still a common practice in many studies using spike-sorted data. Across a common range of firing frequencies (∼2 to 12 Hz), clusters with the same ISIv can be associated with both low (∼5%) and very high (∼50%) FDRs (Figs. 2A, 3A).

Third, we find that estimates of FDR at the single-cluster level are noisy due to the stochasticity of ISI violations as well as uncertainty in cluster-specific parameters (Fig. 4B)—but estimates at the population level are highly robust (Fig. 4C). As a point of reference, our results suggest that ISIv can be estimated accurately enough to predict single-cluster FDRs within 20% of their true values in 1 h recordings only when firing frequencies are greater than ∼5 Hz. Even for clusters that meet these requirements, however, experimental uncertainty in parameters like FP–TP covariance and contaminant neuron counts make single-cluster FDRs difficult to predict with high confidence. Alternatively, population-level statistics of FDR, obtained by averaging across all the clusters in a dataset, can be accurately predicted with recording durations as low as 10 min.

Finally, we predict FDR on the basis of ISI violations in publicly available datasets for the first time. FDR population statistics covered a wide range (median, 3.1–50.0%; mean, 12.5–44.3%) and could be estimated with low standard error (SE median, 0.2–1.3%; SE mean, 0.2–1.0%). Datasets with low FDR were not associated with any obvious external features of data collection, spike-sorting methodology, or recording technology, and the variance in FDRs across datasets is likely a function of the variable importance of low FDR for the scientific goals of individual studies.

Monte Carlo simulations for validating prediction of FDR

Given the absence of high cluster count spike-sorted extracellular neural recordings with associated per-neuron ground truth patch–clamp data, Monte Carlo simulation presents itself as an attractive tool for studying the theoretical mechanisms by which a given level of contamination in a sorted cluster translates into observed ISI violations. Neural spiking was simulated in this work as a collection of independent Poisson processes, a common assumption that has been validated across a number of organisms, brain regions, and behavioral contexts (Werner and Mountcastle, 1965; Tolhurst et al., 1981; Shinomoto and Tsubo, 2001; Abbott and Dayan, 2005; Roxin et al., 2008). Furthermore, the equations derived in this work were equally accurate when applied to simulations of non-Poisson point processes across a range of ISI coefficients of variation, implying that they likely generalize to any renewal process that produces independent and identically distributed ISIs with a physiological CV (Extended Data Fig. 3-1). Beyond this foundation, only aspects of neuronal firing and spike sorting thought to potentially be relevant to ISI violation production were modeled, namely, varying firing frequency amplitudes, temporally inhomogeneous firing, and differing contaminant neuron counts. If an additional aspect not accounted for in this description plays a significant role in determining the relationship between ISIv and FDR, then in silico validation may not reflect true congruence between analytical prediction and reality.

Best practices for using ISI violations in spike sorting

When attempting to determine the success of spike-sorting operations post hoc, ISI violation fraction is frequently used as a per-cluster inclusion criterion. However, unless a cluster is recorded for a long-enough time period given its firing frequency and true FDR and difficult-to-estimate cluster–specific parameters are known, it can be difficult to predict FDR using ISI violations at the single-cluster level with high confidence. The use of ISI violation fractions in this way can easily result in situations where highly contaminated clusters are erroneously kept, while less contaminated clusters are discarded. We posit that the most straightforward and robust use case for ISIv is as a tool for predicting population-level statistics of FDR when coupled with a sound theoretical understanding of how cluster contamination translates into ISI violations (Eqs. 10, 12, 13). Investigators can obtain an accurate estimate of median and mean cluster FDR across a session or dataset and then decide whether they are satisfied with these levels of cross-contamination or if additional effort to improve cluster isolation is needed.

When curating spike-sorted data, it is critical that both algorithmic and manual sorters do not specifically remove individual spikes that generate ISI violations. Typically, only a small fraction of contaminant spikes produce ISI violations; targeted removal of spikes producing ISI violations can reduce ISIv substantially without meaningfully reducing the FDR, thus producing clusters that seem well isolated based on their ISIv, even when they are not. In effect, this practice does not accomplish anything except eliminating the predictive power of ISIv for underlying FDR.

Current state of spike sorting predicted using ISI violations

This work estimates an average median FDR of ∼13% and an average mean FDR of ∼24% in publicly available electrophysiology datasets. The lower median FDR compared with mean FDR across virtually all datasets examined indicates right-skewed FDR distributions. This likely arises as a consequence of FDR having a theoretical floor of 0, with most datasets having many cluster FDRs close to this floor. It may also be a consequence of most spike-sorting datasets being composed of two distinct types of clusters: relatively easier-to-sort clusters whose FDRs typically fall close to 0 and relatively harder-to-sort clusters whose FDRs are likely to fall more broadly over the theoretical range of FDR (0–1).

In the absence of any clear rationale for variance in FDR as predicted using ISIv among publicly available spike-sorted datasets (Table 2), degree and execution of manual curation present themselves as promising explanatory candidates. While modern spike-sorting algorithms serve as an excellent basis for sorting vast quantities of electrophysiology data, many investigators still manually merge, split, and discard algorithmically obtained output clusters to further improve cluster isolation. Time, effort, and skill applied to manual curation are difficult to quantify and therefore unlikely to be reported in literature, although such differences are likely to have a material effect on the final quality of cluster isolation. The median and mean FDRs of the datasets examined here as well as the tendency toward right-skewed FDR distributions support the idea that all datasets are composed of both well and poorly isolated clusters.

Necessity of high-quality spike sorting

It has been posited that well-sorted clusters are not a necessity for many types of neural data analyses, particularly those concerned with studying population dynamics (Christie et al., 2014; Trautmann et al., 2019). In some applications, however, well-isolated clusters remain a critical precondition for answering relevant neuroscientific questions. Characterizing the responsivity of specific cell types that have been identified on the basis of genetic expression, projection target, waveform shape, or activity in vivo represents an expansive line of inquiry wherein high-quality cluster isolation is key (Estebanez et al., 2017; Deubner et al., 2019; E. K. Lee et al., 2021; Ding et al., 2022; Takatoh et al., 2022). Ultimately, the level of cluster isolation necessary for a given study is highly dependent upon the biological questions of interest. The work herein aims to clarify the relationship between ISI violations and cluster contamination, as well as provide a tool by which overall spike-sorting quality can be quickly assessed with a direct, interpretable, and accurate metric, thereby streamlining assessments of sorting performance and increasing confidence that desired cluster isolation levels have been reached.

Footnotes

  • The authors declare no competing financial interests.

  • We thank Munib Hasnain, Maria Moya, Yujin Han, Jaclyn Birnbaum, William Cunningham, and other members of the Economo Lab for their helpful feedback and support. This work was funded by National Institutes of Health Grant 1R01NS121409 (M.N.E.).

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.

References

  1. ↵
    1. Abbott LF,
    2. Dayan P
    (2005) Theoretical neuroscience: computational and mathematical modeling of neural systems. Cambridge, MA: MIT Press.
  2. ↵
    1. Barnett AH,
    2. Magland JF,
    3. Greengard LF
    (2016) Validation of neural spike sorting algorithms without ground-truth information. J Neurosci Methods 264:65–77. https://doi.org/10.1016/j.jneumeth.2016.02.022
    OpenUrlCrossRef
  3. ↵
    1. Bestel R,
    2. Daus AW,
    3. Thielemann C
    (2012) A novel automated spike sorting algorithm with adaptable feature extraction. J Neurosci Methods 211:168–178. https://doi.org/10.1016/j.jneumeth.2012.08.015
    OpenUrlCrossRefPubMed
  4. ↵
    1. Boucher PO,
    2. Wang T,
    3. Carceroni L,
    4. Kane G,
    5. Shenoy KV,
    6. Chandrasekaran C
    (2023) Initial conditions combine with sensory evidence to induce decision-related dynamics in premotor cortex. Nat Commun 14:6510. https://doi.org/10.1038/s41467-023-41752-2 pmid:37845221
    OpenUrlPubMed
  5. ↵
    1. Buccino AP,
    2. Hurwitz CL,
    3. Garcia S,
    4. Magland J,
    5. Siegle JH,
    6. Hurwitz R,
    7. Hennig MH
    (2020) Spikeinterface, a unified framework for spike sorting. Elife 9:e61834. https://doi.org/10.7554/eLife.61834 pmid:33170122
    OpenUrlCrossRefPubMed
  6. ↵
    1. Chandrasekaran C,
    2. Peixoto D,
    3. Newsome WT,
    4. Shenoy KV
    (2017) Laminar differences in decision-related neural activity in dorsal premotor cortex. Nat Commun 8:614. https://doi.org/10.1038/s41467-017-00715-0 pmid:28931803
    OpenUrlCrossRefPubMed
  7. ↵
    1. Chaure FJ,
    2. Rey HG,
    3. Quian Quiroga R
    (2018) A novel and fully automatic spike-sorting implementation with variable number of features. J Neurophysiol 120:1859–1871. https://doi.org/10.1152/jn.00339.2018 pmid:29995603
    OpenUrlCrossRefPubMed
    1. Chinta S,
    2. Pluta SR
    (2023) Neural mechanisms for the localization of unexpected external motion. Nat Commun 14:6112. https://doi.org/10.1038/s41467-023-41755-z pmid:37777516
    OpenUrlPubMed
  8. ↵
    1. Christie BP,
    2. Tat DM,
    3. Irwin ZT,
    4. Gilja V,
    5. Nuyujukian P,
    6. Foster JD,
    7. Ryu SI,
    8. Shenoy KV,
    9. Thompson DE,
    10. Chestek CA
    (2014) Comparison of spike sorting and thresholding of voltage waveforms for intracortical brain–machine interface performance. J Neural Eng 12:016009. https://doi.org/10.1088/1741-2560/12/1/016009 pmid:25504690
    OpenUrlCrossRefPubMed
  9. ↵
    1. Chung JE,
    2. Magland JF,
    3. Barnett AH,
    4. Tolosa VM,
    5. Tooker AC,
    6. Lee KY,
    7. Shah KG,
    8. Felix SH,
    9. Frank LM,
    10. Greengard LF
    (2017) A fully automated approach to spike sorting. Neuron 95:1381–1394.e6. https://doi.org/10.1016/j.neuron.2017.08.030 pmid:28910621
    OpenUrlCrossRefPubMed
  10. ↵
    1. Deubner J,
    2. Coulon P,
    3. Diester I
    (2019) Optogenetic approaches to study the mammalian brain. Curr Opin Struct Biol 57:157–163. https://doi.org/10.1016/j.sbi.2019.04.003
    OpenUrlCrossRefPubMed
  11. ↵
    1. Ding L,
    2. Balsamo G,
    3. Chen H,
    4. Blanco-Hernandez E,
    5. Zouridis IS,
    6. Naumann R,
    7. Preston-Ferrer P,
    8. Burgalossi A
    (2022) Juxtacellular opto-tagging of hippocampal CA1 neurons in freely moving mice. Elife 11:e71720. https://doi.org/10.7554/eLife.71720 pmid:35080491
    OpenUrlCrossRefPubMed
    1. Economo MN, et al.
    (2018) Distinct descending motor cortex pathways and their roles in movement. Nature 563:79–84. https://doi.org/10.1038/s41586-018-0642-9
    OpenUrlCrossRefPubMed
  12. ↵
    1. Estebanez L,
    2. Hoffmann D,
    3. Voigt BC,
    4. Poulet JFA
    (2017) Parvalbumin-expressing GABAergic neurons in primary motor cortex signal reaching. Cell Rep 20:308–318. https://doi.org/10.1016/j.celrep.2017.06.044 pmid:28700934
    OpenUrlCrossRefPubMed
    1. Finkelstein A,
    2. Fontolan L,
    3. Economo MN,
    4. Li N,
    5. Romani S,
    6. Svoboda K
    (2021) Attractor dynamics gate cortical information flow during decision-making. Nat Neurosci 24:843–850. https://doi.org/10.1038/s41593-021-00840-6
    OpenUrlPubMed
    1. Gao Z,
    2. Davis C,
    3. Thomas AM,
    4. Economo MN,
    5. Abrego AM,
    6. Svoboda K,
    7. De Zeeuw CI,
    8. Li N
    (2018) A cortico-cerebellar loop for motor planning. Nature 563:113–116. https://doi.org/10.1038/s41586-018-0633-x pmid:30333626
    OpenUrlCrossRefPubMed
  13. ↵
    1. Gibson S,
    2. Judy JW,
    3. Marković D
    (2012) Spike sorting: the first step in decoding the brain: the first step in decoding the brain. IEEE Signal Process Mag 29:124–143. https://doi.org/10.1109/MSP.2011.941880
    OpenUrl
    1. Guo ZV,
    2. Inagaki HK,
    3. Daie K,
    4. Druckmann S,
    5. Gerfen CR,
    6. Svoboda K
    (2017) Maintenance of persistent activity in a frontal thalamocortical loop. Nature 545:181–186. https://doi.org/10.1038/nature22324 pmid:28467817
    OpenUrlCrossRefPubMed
  14. ↵
    1. Guo ZV,
    2. Li N,
    3. Huber D,
    4. Ophir E,
    5. Gutnisky D,
    6. Ting JT,
    7. Feng G,
    8. Svoboda K
    (2014) Flow of cortical activity underlying a tactile decision in mice. Neuron 81:179–194. https://doi.org/10.1016/j.neuron.2013.10.020 pmid:24361077
    OpenUrlCrossRefPubMed
  15. ↵
    1. Harris KD,
    2. Hirase H,
    3. Leinekugel X,
    4. Henze DA,
    5. Buzsáki G
    (2001) Temporal interaction between single spikes and complex spike bursts in hippocampal pyramidal cells. Neuron 32:141–149. https://doi.org/10.1016/s0896-6273(01)00447-0
    OpenUrlCrossRefPubMed
  16. ↵
    1. Hill DN,
    2. Mehta SB,
    3. Kleinfeld D
    (2011) Quality metrics to accompany spike sorting of extracellular signals. J Neurosci 31:8699–8705. https://doi.org/10.1523/JNEUROSCI.0971-11.2011 pmid:21677152
    OpenUrlFREE Full Text
    1. Inagaki HK,
    2. Chen S,
    3. Ridder MC,
    4. Sah P,
    5. Li N,
    6. Yang Z,
    7. Hasanbegovic H,
    8. Gao Z,
    9. Gerfen CR,
    10. Svoboda K
    (2022) A midbrain-thalamus-cortex circuit reorganizes cortical dynamics to initiate movement. Cell 185:1065–1081.e23. https://doi.org/10.1016/j.cell.2022.02.006 pmid:35245431
    OpenUrlCrossRefPubMed
  17. ↵
    1. Jadhav SP,
    2. Wolfe J,
    3. Feldman DE
    (2009) Sparse temporal coding of elementary tactile features during active whisker sensation. Nat Neurosci 12:792–800. https://doi.org/10.1038/nn.2328
    OpenUrlCrossRefPubMed
  18. ↵
    1. Juavinett AL,
    2. Bekheet G,
    3. Churchland AK
    (2019) Chronically implanted neuropixels probes enable high-yield recordings in freely moving mice. Elife 8:e47188. https://doi.org/10.7554/eLife.47188 pmid:31411559
    OpenUrlCrossRefPubMed
  19. ↵
    1. Jun JJ,
    2. Mitelut C,
    3. Lai C,
    4. Gratiy SL,
    5. Anastassiou CA,
    6. Harris TD
    (2017). Real-time spike sorting platform for high-density extracellular probes with ground-truth validation and drift correction (p. 101030). bioRxiv.
  20. ↵
    1. Lee EK,
    2. Balasubramanian H,
    3. Tsolias A,
    4. Anakwe SU,
    5. Medalla M,
    6. Shenoy KV,
    7. Chandrasekaran C
    (2021) Non-linear dimensionality reduction on extracellular waveforms reveals cell type diversity in premotor cortex. Elife 10:e67490. https://doi.org/10.7554/eLife.67490 pmid:34355695
    OpenUrlPubMed
  21. ↵
    1. Lee JH,
    2. Carlson DE,
    3. Shokri Razaghi H,
    4. Yao W,
    5. Goetz GA,
    6. Hagen E,
    7. Batty E,
    8. Chichilnisky EJ,
    9. Einevoll GT,
    10. Paninski L
    (2017) YASS: yet another spike sorter. Adv Neural Inf Process Syst 30:4002–4012. https://doi.org/10.1101/151928
    OpenUrl
    1. Li N,
    2. Daie K,
    3. Svoboda K,
    4. Druckmann S
    (2016) Robust neuronal dynamics in premotor cortex during motor planning. Nature 532:459–464. https://doi.org/10.1038/nature17643 pmid:27074502
    OpenUrlCrossRefPubMed
  22. ↵
    1. Llobet V,
    2. Wyngaard A,
    3. Barbour B
    (2022). Automatic post-processing and merging of multiple spike-sorting analyses with Lussac (p. 2022.02.08.479192). bioRxiv.
  23. ↵
    1. Magland J,
    2. Jun JJ,
    3. Lovero E,
    4. Morley AJ,
    5. Hurwitz CL,
    6. Buccino AP,
    7. Garcia S,
    8. Barnett AH
    (2020) Spikeforest, reproducible web-facing ground-truth validation of automated neural spike sorters. Elife 9:e55167. https://doi.org/10.7554/eLife.55167 pmid:32427564
    OpenUrlCrossRefPubMed
  24. ↵
    1. Neymotin SA,
    2. Lytton WW,
    3. Olypher AV,
    4. Fenton AA
    (2011) Measuring the quality of neuronal identification in ensemble recordings. J Neurosci 31:16398–16409. https://doi.org/10.1523/JNEUROSCI.4053-11.2011 pmid:22072690
    OpenUrlAbstract/FREE Full Text
  25. ↵
    1. Pachitariu M,
    2. Steinmetz NA,
    3. Kadir SN,
    4. Carandini M,
    5. Harris KD
    (2016) Fast and accurate spike sorting of high-channel count probes with Kilosort. Adv Neural Inf Process Syst 29:4448–4456.
    OpenUrlCrossRefPubMed
  26. ↵
    1. Pouzat C,
    2. Mazor O,
    3. Laurent G
    (2002) Using noise signature to optimize spike-sorting and to assess neuronal classification quality. J Neurosci Methods 122:43–57. https://doi.org/10.1016/S0165-0270(02)00276-5
    OpenUrlCrossRefPubMed
  27. ↵
    1. Quiroga RQ,
    2. Panzeri S
    (2013) Principles of neural coding. Boca Raton, FL: CRC Press.
  28. ↵
    1. Rey HG,
    2. Pedreira C,
    3. Quian Quiroga R
    (2015) Past, present and future of spike sorting techniques. Brain Res Bull 119:106–117. https://doi.org/10.1016/j.brainresbull.2015.04.007 pmid:25931392
    OpenUrlCrossRefPubMed
  29. ↵
    1. Roxin A,
    2. Hakim V,
    3. Brunel N
    (2008) The statistics of repeating patterns of cortical activity can be reproduced by a model network of stochastic binary neurons. J Neurosci 28:10734–10745. https://doi.org/10.1523/JNEUROSCI.1016-08.2008 pmid:18923048
    OpenUrlAbstract/FREE Full Text
  30. ↵
    1. Roy S,
    2. Wang X
    (2012) Wireless multi-channel single unit recording in freely moving and vocalizing primates. J Neurosci Methods 203:28–40. https://doi.org/10.1016/j.jneumeth.2011.09.004 pmid:21933683
    OpenUrlCrossRefPubMed
  31. ↵
    1. Saif-ur-Rehman M, et al.
    (2021) Spikedeep-classifier: a deep-learning based fully automatic offline spike sorting algorithm. J Neural Eng 18:016009. https://doi.org/10.1088/1741-2552/abc8d4
    OpenUrl
  32. ↵
    1. Schmitzer-Torbert N,
    2. Jackson J,
    3. Henze D,
    4. Harris K,
    5. Redish AD
    (2005) Quantitative measures of cluster quality for use in extracellular recordings. Neuroscience 131:1–11. https://doi.org/10.1016/j.neuroscience.2004.09.066
    OpenUrlCrossRefPubMed
  33. ↵
    1. Shinomoto S,
    2. Tsubo Y
    (2001) Modeling spiking behavior of neurons with time-dependent Poisson processes. Phys Rev E 64:041910. https://doi.org/10.1103/PhysRevE.64.041910
    OpenUrl
    1. Steinmetz NA,
    2. Zatka-Haas P,
    3. Carandini M,
    4. Harris KD
    (2019) Distributed coding of choice, action and engagement across the mouse brain. Nature 576:266–273. https://doi.org/10.1038/s41586-019-1787-x pmid:31776518
    OpenUrlCrossRefPubMed
  34. ↵
    1. Stringer C,
    2. Pachitariu M,
    3. Steinmetz N,
    4. Reddy CB,
    5. Carandini M,
    6. Harris KD
    (2019) Spontaneous behaviors drive multidimensional, brainwide activity. Science 364:eaav7893. https://doi.org/10.1126/science.aav7893 pmid:31000656
    OpenUrlAbstract/FREE Full Text
    1. Sylwestrak EL, et al.
    (2022) Cell-type-specific population dynamics of diverse reward computations. Cell 185:3568–3587.e27. https://doi.org/10.1016/j.cell.2022.08.019 pmid:36113428
    OpenUrlCrossRefPubMed
  35. ↵
    1. Takatoh J, et al.
    (2022) The whisking oscillator circuit. Nature 609:560–568. https://doi.org/10.1038/s41586-022-05144-8 pmid:36045290
    OpenUrlPubMed
  36. ↵
    1. Todorova S,
    2. Sadtler P,
    3. Batista A,
    4. Chase S,
    5. Ventura V
    (2014) To sort or not to sort: the impact of spike-sorting on neural decoding performance. J Neural Eng 11:056005. https://doi.org/10.1088/1741-2560/11/5/056005 pmid:25082508
    OpenUrlCrossRefPubMed
  37. ↵
    1. Tolhurst DJ,
    2. Movshon JA,
    3. Thompson ID
    (1981) The dependence of response amplitude and variance of cat visual cortical neurones on stimulus contrast. Exp Brain Res 41:414–419. https://doi.org/10.1007/BF00238900
    OpenUrlCrossRefPubMed
  38. ↵
    1. Toosi R,
    2. Akhaee MA,
    3. Dehaqani M-RA
    (2021) An automatic spike sorting algorithm based on adaptive spike detection and a mixture of skew-t distributions. Sci Rep 11:13925. https://doi.org/10.1038/s41598-021-93088-w pmid:34230517
    OpenUrlPubMed
  39. ↵
    1. Trautmann EM, et al.
    (2019) Accurate estimation of neural population dynamics without spike sorting. Neuron 103:292–308.e4. https://doi.org/10.1016/j.neuron.2019.05.003 pmid:31171448
    OpenUrlCrossRefPubMed
  40. ↵
    1. van Vreeswijk C
    (2010) Stochastic models of spike trains. In: Analysis of parallel spike trains (Grün S, Rotter S, eds), pp 3–20. New York, NY: Springer US.
  41. ↵
    1. Werner G,
    2. Mountcastle VB
    (1965) Neural activity in mechanoreceptive cutaneous afferents: stimulus-response relations, weber functions, and information transmission. J Neurophysiol 28:359–397. https://doi.org/10.1152/jn.1965.28.2.359
    OpenUrlCrossRefPubMed
  42. ↵
    1. Wright NC,
    2. Borden PY,
    3. Liew YJ,
    4. Bolus MF,
    5. Stoy WM,
    6. Forest CR,
    7. Stanley GB
    (2021) Rapid cortical adaptation and the role of thalamic synchrony during wakefulness. J Neurosci 41:5421–5439. https://doi.org/10.1523/JNEUROSCI.3018-20.2021 pmid:33986072
    OpenUrlAbstract/FREE Full Text
    1. Xu D,
    2. Dong M,
    3. Chen Y,
    4. Delgado AM,
    5. Hughes NC,
    6. Zhang L,
    7. O’Connor DH
    (2022) Cortical processing of flexible and context-dependent sensorimotor sequences. Nature 603:464–469. https://doi.org/10.1038/s41586-022-04478-7 pmid:35264793
    OpenUrlCrossRefPubMed
  43. ↵
    1. Yegenoglu A,
    2. Denker M,
    3. Grün S
    (2018) Collaborative HPC-enabled workflows on the HBP collaboratory using the elephant framework. INM-ICS Retreat 2018.
  44. ↵
    1. Zhao S,
    2. Tang X,
    3. Tian W,
    4. Partarrieu S,
    5. Liu R,
    6. Shen H,
    7. Lee J,
    8. Guo S,
    9. Lin Z,
    10. Liu J
    (2023) Tracking neural activity from the same cells during the entire adult life of mice. Nat Neurosci 26:696–710. https://doi.org/10.1038/s41593-023-01267-x
    OpenUrl

Synthesis

Reviewing Editor: Nicholas J. Priebe, The University of Texas at Austin

Decisions are customarily a result of the Reviewing Editor and the peer reviewers coming together and discussing their recommendations until a consensus is reached. When revisions are invited, a fact-based synthesis statement explaining their decision and outlining what is needed to prepare a revision will be listed below. The following reviewer(s) agreed to reveal their identity: Lauri Nurminen, Ivo Vanzetta. Note: If this manuscript was transferred from JNeurosci and a decision was made to accept the manuscript without peer review, a brief statement to this effect will instead be what is listed below.

SYNTHESIS

Two reviewers evaluated the manuscript "Assessing cross-contamination in spike-sorted electrophysiology data" and both reviewers agree that the study provides a valuable evaluation of the quality of spike sorting from extracellular electrophysiology recordings. Specifically, the reviewers agreed that the authors provide a better understanding of how ISI violations and spike cluster contamination relate to each other. Both reviewers indicated a number of changes should be made to improve the clarity of the manuscript, including providing more details of the equations and simulations employed in the manuscript. These issues, and others, are detailed below, which you should find useful in revising your manuscript, if you decide to resubmit it to eNeuro or to another journal.

1) The Poisson assumption.

A major component of the simulations that the authors use is that neurons are Poisson. Do the results in the present manuscript depend on that assumption?

2) Line 201: "The dependence of this phenomenon on...": the meaning of this sentence is unclear. Please explicit as early as possible in the text what you mean by FP-to-FP violation (as well as FP-TP, TP-TP being obviously ruled out). One possibility would be to write it near the example with the arrows in Panel C(right) of Fig. 1.

3) Even if "rate" is customarily used both in the sense of "per total number", e.g., falsely detected spikes/total spikes (FDR) and in the temporal sense of "per second", here, the usage of one and the same word in different meanings quite confusing and the reader should be warned/helped here. We suggest using two different words: if "rate" shall be used for temporal frequency (as in equation (2), then it shouldn't be used for ISI_v and FDR (even if, unfortunately, it appears in the customarily used acronym). Alternatively (and perhaps preferably), the term "frequency" or similar could be used when "rate" is used in the sense of "per second".

4) Inclusion of details in the Methods section

Line 227: The intermediary steps that are taken to link equations 2 and 3 need to be described, perhaps in the Methods section. It might be more logical to permute the order of equations 2 3 4 5 6 as 2 5 6 4. It might also help to explicitly state that the Rs in eq. 2 are to be understood as the parameters of the Poisson processes generating the spike trains (and not as the result of an actual measurement).

Line 114: Please describe the parameters N, Rt, n and RFP in the Methods.

The description of how the inhomogeneous Poisson spike-trains were constructed needs more detail. How were the PSTHs selected exactly?

Line 114-126: The process of simulating the cluster populations is unclear. Is the process such that first target FDR and Rt are sampled, then for a given randomly sampled N, the PSTHs are sampled and scaled to achieve the target FDR and Rt? This section would benefit from significant revision.

We would appreciate a more detailed explanation and maybe a little help for the derivation of eq. 13. This could perhaps be put into the Methods section or into an Appendix in order not to hamper the flow of the main text.

5) What exactly is presented in Figure 2C? The figure legend and the manuscript text state that it shows simulated ISI distributions. How might it be possible to simulate the spiking activity of a cluster with ∞ contaminating neurons. For simulations, to the best of our understanding, N cannot be unbounded.

6) The process for estimating RFP, a necessary parameter for applying the authors' method in real data, should be described in detail,

Details:

1) A table summarizing the definitions of the various symbols (with their units) would be of great help sparing the reader the hassle of searching the text.

2) Eq. 13: the index "i" disappears.... I suppose it runs over the elements of Rfp?

3) Was the ISIv in Fig. 4A was chosen to be 0.5 % because of customary usage? Maybe this could be specified?

4) Why, in the table, was the homogeneous equation used in some cases and in others not?

5) Line 360: "Median estimated FDRs were consistently lower than mean estimated FDRs." This is not true for Juavinett (last line of the table).

6) Figure issues:

Fig 2:

It would be preferable to use the same y-axis scale in all panels, except in the leftmost one in A. Even in A, however, it would help to overlay, in high transparency, the ISI counts for the refractory period of the leftmost panel onto the other two panels - rescaled to their same scale. In order to be able to directly compare the counts between the different conditions.

Please add the information that the model used was the inhomogeneous one to the legend of Fig. 2D.

Fig. 3:

Please specify (preferably in the legend), for each panel, whether firing was homogeneous or inhomogeneous, as well as what was the used equation.

Fig. 4:

What were the N and RFP values that were used for computing the analytical predictions in Figure 4? The figure legend would be a good place to show them.

7) Lines 312-319: I find this part a bit oddly written. I'd put the sentence "when inhomogeneous..." first. Only afterwards I'd mention the fact that, when TP and FP spike rates do not covary in time, the homogeneous equation does a good job, even if firing was inhomogeneous. BTW, do you have any idea why this is the case?

8) It seems to me that you didn't define RMSE. I would do so, even if this is quite a standard acronym.

9) Minor suggestions for language

Line 14: "spikes are misassigned to the wrong cluster": One might write "spikes are assigned to the wrong cluster"

Line 286: as in the rest of the manuscript, keep "data" plural

Author Response

Response to Reviewers Summary - Review 2 We thank the Reviewers for their helpful additional feedback and have addressed their two comments below.

Major Issues 1. Mathematical notation. The mathematical notation should be uniform throughout the manuscript. The scalar product between two vectors is sometimes written with a dot and sometimes without a dot. Further, it appears at times that the dot is used to indicate multiplication between scalar quantities. In addition, it is unclear what the basis was for dropping the index in equation 13.

This is a good point, and one that we had not considered when adding multiplication dots for visual clarity. All dots in mathematical notation have now been removed in favor of implicit multiplication or "multiplication by juxtaposition" except when representing the dot product between two vectors, and parentheses have been added when needed to enhance visual clarity.

In regards to Eq. 13, we agree that the operations being performed in this equation may be a little unclear. It's our understanding that a simple summation of a 1D vector across all elements is commonly represented through omission of summation index and bounds. This summation divided by n just corresponds to calculating the average value of f ⃗_FP or f ̅_FP (fFP-bar). It's expanded in Eq. 13 to make clear that this average is taken after first scaling f ̂_FP by the magnitude |f ⃗_FP | obtained in Eq. 12. To make these points clearer, we have added a second equality in Eq. 13: f ̅_FP/f ̅_t .

2) Computing ISIv. In figure 2, the authors show simulations with different FDR% (columns), but similar ISIv, but the basis for the ISIv was not clear. In particular, the plots seem to be in contradiction with the numbers. ISIs falling within the red/pink rectangle correspond to ISI-violations. They are supposed to be 0.5% of all ISIs, for all spiking frequencies (f_t = 20Hz, 5Hz and 4Hz). However, the ratio between the mean taken over the red rectangle and that taken over the rest of the distribution does not look the same in the three panels. This ratio seems much larger at 4hz (near 1/3?) than at 20Hz (near 1/10?), whereas it should stay the same for all conditions. Also, in C (leftmost panel in particular), this ratio seems to be more like 5% and not 1%. Is there a problem with the computation of ISIv? Addressing this issue is important because a central claim of the manuscript is that that ISI violations can stay the same under different spiking rates, even if the FDR is radically different.

This is a very valid concern, and we thank the reviewers for drawing our attention to a potential source of confusion about a central conclusion of our work in Fig. 2A. The reason that ISI violations may appear to constitute a smaller proportion of total ISIs in this panel for higher firing frequency conditions is because these conditions have lower average ISIs and all conditions are plotted within the same 0-25 ms ISI domain. The average ISI for a 20 Hz cluster is 50 ms, while for a 4 Hz cluster, it's 250 ms. This means that, for a 20 Hz cluster, there are proportionally many more ISIs < 25 ms than for a 4 Hz cluster. Therefore, ISI violations may appear to correspond to a smaller proportion of total ISIs in the 20 Hz cluster's histogram, but calculated across the full possible range of ISIs, ISI violation rates are the same across conditions. FDRs and firing frequencies were specifically chosen and verified through simulation to ensure this would be the case. We have added two sentences clarifying this point in the caption for Fig. 2.

Summary - Review 1 We thank the Reviewers for their close reading of our manuscript, as well as their helpful comments which will both improve readability as well as clarify important methodological details. We are pleased that the Reviewers found that our study provides "a valuable evaluation of the quality of spike sorting from extracellular electrophysiology recordings," as well as "a better understanding of how ISI violations and spike cluster contamination relate to each other." We have addressed each comment made by the Reviewers below, in the order they were provided within the synthesis of reviews.

General Comments 1. The Poisson assumption.

A major component of the simulations that the authors use is that neurons are Poisson. Do the results in the present manuscript depend on that assumption? This is an excellent question and one that we had not fully considered while preparing the initial submission. After simulating non-Poisson point processes across a range of physiological coefficients of variation and then attempting to predict FDR from observed ISIv, we can confirm that our results do not depend on Poisson simulation of neural spiking. FDR predictions remained highly accurate (RMSE = 0.01-0.02) when attempting to predict FDR for neural spiking simulated with ISIs drawn from gamma, inverse Gaussian, and log-normal distributions with coefficients of variation of 0.5, 1, and 2. These results imply that our model likely generalizes to any renewal process that produces independent and identically distributed ISIs with a physiological coefficient of variation. These results have been organized and presented in Extended Data Figure 3-1. Discussion of the model's ability to generalize to non-Poisson point processes, referenced to Extended Data Figure 3-1, is now provided in the text at the start of Materials and Methods, the section of the Results discussing Fig. 3D, and in the Discussion.

2. Line 201: "The dependence of this phenomenon on...": the meaning of this sentence is unclear. Please explicit as early as possible in the text what you mean by FP-to-FP violation (as well as FP-TP, TP-TP being obviously ruled out). One possibility would be to write it near the example with the arrows in Panel C(right) of Fig. 1.

The TP-to-FP and FP-to-FP violations in Fig. 1C have now been labeled appropriately in the figure with a brief explanation also included in the figure legend. A figure citation for Fig. 1C has also been added to the sentence starting "The dependence of this phenomenon on..." after mentioning "FP-to-FP violations." 3. Even if "rate" is customarily used both in the sense of "per total number", e.g., falsely detected spikes/total spikes (FDR) and in the temporal sense of "per second", here, the usage of one and the same word in different meanings quite confusing and the reader should be warned/helped here. We suggest using two different words: if "rate" shall be used for temporal frequency (as in equation (2), then it shouldn't be used for ISI_v and FDR (even if, unfortunately, it appears in the customarily used acronym). Alternatively (and perhaps preferably), the term "frequency" or similar could be used when "rate" is used in the sense of "per second".

In all instances where rate is used in the sense of "per second," e.g. firing rate, true positive spike rate, this term has been replaced with frequency. The symbols in figures and equations have also accordingly been adjusted. Instances where rate is used in the sense of "per total number," have been preserved as rate, e.g. inter-spike-interval violation rate, false discovery rate.

4. Inclusion of details in the Methods section Line 227: The intermediary steps that are taken to link equations 2 and 3 need to be described, perhaps in the Methods section. It might be more logical to permute the order of equations 2 3 4 5 6 as 2 5 6 4. It might also help to explicitly state that the Rs in eq. 2 are to be understood as the parameters of the Poisson processes generating the spike trains (and not as the result of an actual measurement).

A section has been added in the Appendix outlining the derivation of Eq. 6 from previous equations in greater detail. The equations have also been permuted such that the substitutions and relationships needed to derive Eq. 6 are introduced before deriving Eq. 6. Finally, it has been clarified in the text that fTP and fFP represent unseen parameters of spike train generation that are not measured experimentally.

Line 114: Please describe the parameters N, Rt, n and RFP in the Methods.

These parameters are now described in Table 1, which defines and provides units for all the symbols used throughout the manuscript.

The description of how the inhomogeneous Poisson spike-trains were constructed needs more detail. How were the PSTHs selected exactly? PSTHs were selected randomly from among all clusters contained within a dataset. The trial-averaged PSTH was then taken to be the time-varying Poisson rate used to generate spike trains. Longer simulated recording times were attained through repeated simulation of these PSTHs. This is now explained in greater detail in the section titled "Monte Carlo simulation of cluster populations" in Materials and Methods.

Line 114-126: The process of simulating the cluster populations is unclear. Is the process such that first target FDR and Rt are sampled, then for a given randomly sampled N, the PSTHs are sampled and scaled to achieve the target FDR and Rt? This section would benefit from significant revision.

The process described by the reviewers is correct, although this workflow was difficult to understand in the previous manuscript version. As suggested, the section titled "Monte Carlo simulation of cluster populations" in Materials and Methods has been expanded and reworked to improve the clarity and flow of these points. Specifically, each step in the process is now presented with a numeric label in the order it was performed, and significant additional detail has been added at each stage.

We would appreciate a more detailed explanation and maybe a little help for the derivation of eq. 13. This could perhaps be put into the Methods section or into an Appendix in order not to hamper the flow of the main text.

Eq. 13 describes the relatively straightforward calculation of FDR once the magnitude of f ⃗_FP has been determined using Eq. 12. Given that Eq. 12 is much more complex and intermediate parts of its derivation were omitted in the original submission, we assume that the reviewers were referring to Eq. 12 in this comment. Accordingly, a section has been added in the Appendix outlining the derivation of Eq. 12 from previous equations in greater detail to guide interested readers through intermediate steps from Eq. 10 to Eq. 12.

5. What exactly is presented in Figure 2C? The figure legend and the manuscript text state that it shows simulated ISI distributions. How might it be possible to simulate the spiking activity of a cluster with ∞ contaminating neurons. For simulations, to the best of our understanding, N cannot be unbounded.

The reviewers are correct that a cluster with an infinite number of "contaminant" neurons cannot be directly simulated. This was a convenient shorthand we used to describe the limiting case where there are as many contaminant neurons as FP spikes. In that case, any FP can produce an ISI violation with any other FP. This limiting case can be simulated using a single contaminant neuron with no refractory period. We have now clarified this point at the start of Materials and Methods, in the figure legend for Fig. 2, as well as when the concept of infinite contaminant neurons is introduced in the Results (Analytical model of the relationship between ISIv and FDR).

6. The process for estimating RFP, a necessary parameter for applying the authors' method in real data, should be described in detail.

To facilitate a more detailed explanation of the process by which unknown model parameters are estimated as well as improve flow, the section in Materials and Methods previously titled Estimation of FDR in publicly available electrophysiology datasets has now been split into two sections titled Estimating unknown cluster parameters and Estimation of FDR in simulated cluster populations and electrophysiology datasets. The first contains a rigorous explanation of how all parameters needed to predict FDR can be estimated, including f ̂_FP.

Details 1. A table summarizing the definitions of the various symbols (with their units) would be of great help sparing the reader the hassle of searching the text.

Table 1 has now been inserted at the start of Materials and Methods, which defines and provides units for all the symbols used throughout the manuscript.

2. Eq. 13: the index "i" disappears... I suppose it runs over the elements of RFP? The index "i" in this equation is meant to run over the elements of fFP as suggested. Since the only term in the summation symbol is the vector fFP, and therefore the only way to conduct this summation is over the elements of fFP, we have removed the index and summation bounds from the sigma symbol in this equation (Eq. 13).

3. Was the ISIv in Fig. 4A chosen to be 0.5% because of customary usage? Maybe this could be specified? The choice of 0.5% was somewhat arbitrary, although we note that 0.5% has sometimes been used as threshold for considering units well-isolated and is a commonly found % ISIv in spike-sorted electrophysiology data. This explanation is now included in the figure legend for Fig. 4A.

4. Why, in the table, was the homogeneous equation used in some cases and in others not? The homogenous equation was used for datasets acquired during behaviors that were not trial-based in structure. In this case, we could not compute trial-averaged firing rates, which we used for the generation of inhomogeneous Poisson spike trains. One could in principle estimate the time-varying (inhomogeneous) Poisson rate from single-trial data, but we found that the degree of temporal smoothing used to estimate firing rates had a large effect on predicted FDR. This is discussed in detail in Materials and Methods. A brief explanation of the usage of the homogeneous equation has been added in Line #, as well as a pointer to Materials and Methods.

5. Line 360: "Median estimated FDRs were consistently lower than mean estimated FDRs." This is not true for Juavinett (last line of the table).

The dataset from Juavinett et al. not having a lower median than mean estimated FDR has been clarified at line #.

6. Figure issues:

Fig. 2: It would be preferable to use the same y-axis scale in all panels, except in the leftmost one in A. Even in A, however, it would help to overlay, in high transparency, the ISI counts for the refractory period of the leftmost panel onto the other two panels - rescaled to their same scale. In order to be able to directly compare the counts between the different conditions. Please add the information that the model used was the inhomogeneous one to the legend of Fig. 2D.

In each section of Fig. 2 except for Fig. 2A, the y-axis maximum has been set to the same value and the histograms have been scaled accordingly. In Fig. 2A, the right two panels have been set to the same y-axis scale, and their outlines have been superimposed to scale on the leftmost panel in high transparency to facilitate direct comparisons of the three conditions. This superimposition is described in the figure legend as well. Fig. 2D doesn't exist; we assume this comment refers to Fig. 2B and have also added a sentence in the figure legend describing whether each section was simulated with homogeneous or inhomogeneous firing.

Fig. 3: Please specify (preferably in the legend), for each panel, whether firing was homogeneous or inhomogeneous, as well as what was the used equation.

A sentence has now been added at the end of the figure legend denoting for each panel whether simulated firing was homogeneous or inhomogeneous and which equation was used for FDR predictions.

Fig. 4: What were the N and RFP values that were used for computing the analytical predictions in Figure 4? The figure legend would be a good place to show them.

Possible values for ft, N, and f ̂_FP are now listed in the figure legend for Fig. 4B and Fig. 3D along with a pointer to Materials and Methods where the process of simulating cluster populations is described in greater detail.

7. Lines 312-319: I find this part a bit oddly written. I'd put the sentence "when inhomogeneous..." first. Only afterwards I'd mention the fact that, when TP and FP spike rates do not covary in time, the homogeneous equation does a good job, even if firing was inhomogeneous. BTW, do you have any idea why this is the case? These sentences have been reordered as suggested. Additionally, explanation has been added as to why the homogeneous equation returns good predictions for TP and FP spike rates that do not covary in time, even if their firing is inhomogeneous. Briefly, when cov(fTP, fFP) = 0, their concentration in time is, on average, equivalent to the homogenous case, meaning the odds of an ISI violation occurring are unchanged. In this special case, the homogeneous and inhomogeneous predictions agree.

8. It seems to me that you didn't define RMSE. I would do so, even if this is quite a standard acronym.

Root mean square error (RMSE) is now defined both times it appears in figure legends (Fig. 3D, Fig. 4B) and the first time it appears in the text (Line #).

9. Minor suggestions for language Line 14: "spikes are misassigned to the wrong cluster": One might write "spikes are assigned to the wrong cluster" In Line #, "misassigned" has been replaced with "assigned." Line 286: as in the rest of the manuscript, keep "data" plural "Data" has been consistently pluralized throughout the manuscript.

Back to top

In this issue

eneuro: 11 (8)
eNeuro
Vol. 11, Issue 8
August 2024
  • Table of Contents
  • Index by author
  • Masthead (PDF)
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Assessing Cross-Contamination in Spike-Sorted Electrophysiology Data
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Assessing Cross-Contamination in Spike-Sorted Electrophysiology Data
Jack P. Vincent, Michael N. Economo
eNeuro 2 August 2024, 11 (8) ENEURO.0554-23.2024; DOI: 10.1523/ENEURO.0554-23.2024

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
Assessing Cross-Contamination in Spike-Sorted Electrophysiology Data
Jack P. Vincent, Michael N. Economo
eNeuro 2 August 2024, 11 (8) ENEURO.0554-23.2024; DOI: 10.1523/ENEURO.0554-23.2024
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Footnotes
    • References
    • Synthesis
    • Author Response
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • electrophysiology
  • spike sorting

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Article: Methods/New Tools

  • Adapt-A-Maze: An Open Source Adaptable and Automated Rodent Behavior Maze System
  • Generation of iPSC lines with tagged α-synuclein for visualization of endogenous protein in human cellular models of neurodegenerative disorders
  • Chronic Intraventricular Cannulation for the Study of Glymphatic Transport
Show more Research Article: Methods/New Tools

Novel Tools and Methods

  • Adapt-A-Maze: An Open Source Adaptable and Automated Rodent Behavior Maze System
  • Generation of iPSC lines with tagged α-synuclein for visualization of endogenous protein in human cellular models of neurodegenerative disorders
  • Chronic Intraventricular Cannulation for the Study of Glymphatic Transport
Show more Novel Tools and Methods

Subjects

  • Novel Tools and Methods
  • Home
  • Alerts
  • Follow SFN on BlueSky
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Notice
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2025 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.