Robust, automatic spike sorting using mixtures of multivariate t-distributions

doi:10.1016/S0165-0270(03)00120-1

Journal of Neuroscience Methods

Volume 127, Issue 2, 15 August 2003, Pages 111-122

https://doi.org/10.1016/S0165-0270(03)00120-1 Get rights and content

Abstract

A number of recent methods developed for automatic classification of multiunit neural activity rely on a Gaussian model of the variability of individual waveforms and the statistical methods of Gaussian mixture decomposition. Recent evidence has shown that the Gaussian model does not accurately capture the multivariate statistics of the waveform samples’ distribution. We present further data demonstrating non-Gaussian statistics, and show that the multivariate t-distribution, a wide-tailed family of distributions, provides a significantly better fit to the true statistics. We introduce an adaptation of a new expectation-maximization based competitive mixture decomposition algorithm and show that it efficiently and reliably performs mixture decomposition of t-distributions. Our algorithm determines the number of units in multiunit neural recordings, even in the presence of significant noise contamination resulting from random threshold crossings and overlapping spikes.

Introduction

Extracellular recordings of neural activity provide a noisy measurement of action potentials produced by a number of neurons adjacent to the recording electrode. Automatic and semiautomatic approaches to the reconstruction of the underlying neural activity, or ‘spike-sorting’ have been the subject of extensive development over the past 4 decades and reviews of early and recent efforts can be found in the literature (Schmidt, 1984, Lewicki, 1998). It is generally assumed that each neuron produces a distinct, reproducible shape, which is then contaminated by noise that is primarily additive. Identified sources for noise include: Johnson noise in the electrode and electronics, background activity of distant neurons (Fee et al., 1996b), waveform misalignment (Lewicki, 1994), electrode micromovement (Snider and Bonds, 1998) and the variation of the action potential shape as a function of recent firing history (Fee et al., 1996b, Quirk and Wilson, 1999). Given this signal+noise structure, the problem of automatically classifying the different shapes is a clustering problem and can be addressed either in the context of the full time-sampled spike-shape or of a reduced feature set, such as the principal components or a wavelet basis (Hulata et al., 2002).

While the application of general clustering methods such as k-means (Salganicoff et al., 1988), fuzzy c-means (Zouridakis and Tam, 2000), a variety of neural-network based unsupervised classification schemes (Ohberg et al., 1996, Garcia et al., 1998, Kim and Kim, 2000) and ad-hoc procedures (Fee et al., 1996a, Snider and Bonds, 1998) have been pursued by some authors, a number of other studies (Lewicki, 1994, Lewicki, 1998, Sahani et al., 1997, Sahani, 1999), attempting to provide statistically plausible, complete and efficient solutions to the waveform clustering problem, have focused their attention on clustering based on a Gaussian mixture model. The assumption underlying the latter approach is that after accounting for non-additive noise sources (e.g. misalignment, changes during neural bursts), the additive noise component is Gaussian-distributed. As a result, the waveforms resulting from each neuron are samples from a multi-dimensional Gaussian distribution with a certain mean and covariance matrix. Given this statistical structure, it is possible to construct an appropriate statistical model of the data and apply the powerful method of Gaussian mixture decomposition to solve the clustering problem (Jain et al., 2000, McLachlan and Peel, 2000). This allows estimation of model parameters such as the shape of the individual waveforms and the noise characteristics. The estimated model parameters are used to classify each ‘spike’ to one of several mixture components that correspond to different neural units (or possibly noise).

Although the statistical framework resulting from the multivariate Gaussian model is powerful and well studied, recent evidence suggests that it may provide an inaccurate description of the spike statistics (Harris et al., 2000). Examination of the distribution of Mahalanobis squared distances of spikes produced by a single unit reveals a discrepancy between the expected χ²-distribution and the empirical distribution, which exhibits wider tails. Algorithms based on the Gaussian assumption may therefore be ill suited for the task of automatic spike sorting, in particular as it is well known that they are not robust against a significant proportion of outliers. In this study, we provide additional evidence for the non-Gaussian nature of spike-shape statistics and demonstrate that an alternative model, one using multivariate t-distributions instead of multivariate Gaussians is better suited to model the observed statistics. Multivariate t-distributions have attracted some recent attention in the applied statistics literature (Lange et al., 1989), and a mixture decomposition algorithm for multivariate t-distributions was developed (Peel and McLachlan, 2000), based on the expectation-maximization (EM) algorithm. This algorithm requires computation of twice as many hidden variables as in Gaussian mixture decomposition algorithms, and involves an additional computational step for adapting the ‘degrees of freedom’ (DOF) parameter.

In addition to the choice of a statistical model for the mixture components, practical EM-based mixture decomposition algorithms need to address a number of issues including the determination of the number of components, the choice of an initialization procedure and avoiding convergence to local likelihood maxima or parameter singularities. Determination of the number of components in a mixture model has been the subject of extensive research (reviewed by Sahani, 1999, McLachlan and Peel, 2000, Figueiredo and Jain, 2002). The methods most widely used for this task are based on selecting the best mixture models from a set of candidates with different numbers of components. After fitting the parameters of the different models (using the EM algorithm) the different models are compared using a penalized-likelihood function, which penalizes the likelihood for ‘complexity’ (i.e. a larger number of components) and an ‘optimal’ model is found. This class of methods has the disadvantage of requiring estimation of the parameters of multiple mixture models. Other approaches include the use of stochastic model estimation using model-switching Markov-Chain–Monte-Carlo methods (Richardson and Green, 1997), and deterministic annealing based approaches (Sahani, 1999), which we have recently adapted to the case of the multivariate t-mixture model (Shoham, 2002). These approaches suffer from significant computational complexity, and, in addition, annealing approaches are quite sensitive to the specific choice of an annealing schedule. A recently introduced algorithm (Figueiredo and Jain, 2002), provides a new strategy where a process involving competitive elimination of mixture components drives a modified EM algorithm towards the optimal model size, simultaneously with the model parameter estimation. This approach appears currently to offer the best overall profile in terms of computational simplicity, efficiency and selection accuracy, and tends to avoid the usual difficulties of initialization sensitivity and convergence to singularities associated with the EM algorithm. We provide an adaptation of this algorithm for the case of multivariate t-distributed components. Our final algorithm is statistically plausible, simple and well-behaved and can effectively deal with many real data sets.

Section snippets

Theory: statistics of spike-shape variability

In mixture modeling we assume that each sample x_i (in general, a p-dimensional vector) originates from one of g components. In spike sorting, x_i represents a sampled spike waveform or a vector of features, and the different components correspond to g different units. Assuming that each unit accounts for a proportion π_j of the n spikes, and that the distribution of spikes from unit j has parameters θ_j, the likelihood of the data (the probability of obtaining the given data set from this model)

Algorithms: clustering with mixtures of multivariate t-distributions

The most widely used method for estimating the parameters of mixture models is through an iterative loglikelihood maximization procedure called the EM algorithm (Dempster et al., 1977, Jain et al., 2000, McLachlan and Peel, 2000). The EM algorithm for mixtures of Gaussian distributions has been widely used for over three decades. Recently, an EM algorithm for estimating the parameters of mixtures of multivariate t-distributions was presented (Peel and McLachlan, 2000). As noted in the

Experimental methods

The extracellular signals analyzed were recorded with a 100-microelectrode array (Jones et al., 1992) (Bionic Technologies, LLC, Salt Lake City, Utah). The array consists of a rectangular grid of silicon electrodes with platinized tips (200–500 kΩ impedances measured with a 1 kHz, 100 nA sine wave). The array was chronically implanted in the arm region of a macaque monkey's (M. mulatta) primary motor cortex using surgical implantation procedures described elsewhere (Maynard et al., 2000), with

Spike waveform statistics

Fig. 2 shows data collected from a well-isolated unit with signal-to-noise ratio of 16.9 (peak to peak/noise RMS), which was selected for much of the analysis below. Of the nearly 200 000 threshold-crossing events recorded in one behavioral session, 10 000 were selected. Random threshold-crossing events, which constituted nearly one half of the events, were easily identifiable and manually removed using amplitude windows. This left approximately 5300 events to be considered as unit waveforms.

Discussion

One of the most promising recent advances in basic and applied neuroscience research is the fabrication of arrays of electrodes that allow multiple site recording and stimulation in various neural systems (Jones et al., 1992, Hoogerwerf and Wise, 1994, Rousche et al., 2001). Neural activity recorded with such arrays can be used to address a multitude of basic neuroscience questions, and has also been suggested as a brain-computer interface for use by paralyzed individuals (Shoham, 2001,

Acknowledgements

We wish to thank Professors Sri Nagarajan, Mario Figueiredo, and John Donoghue for valuable input and support during the preparation of this manuscript. We thank the two anonymous reviewers for their insightful comments. The work was supported by a State of Utah Center of Excellence contract #95-3365 to R.A.N., and NIH grant # R01NS25074 to Professor Donoghue.

References (41)

M.S. Fee et al.
Automatic sorting of multiple unit neuronal signals in the presence of anisotropic and non-Gaussian variability
J. Neurosci. Methods
(1996)
P. Garcia et al.
Unsupervised classification of neural spikes with a hybrid multilayer artificial neural network
J. Neurosci. Methods
(1998)
K.S. Guillory et al.
A 100-channel system for real time detection and storage of extracellular spike waveforms
J. Neurosci. Methods
(1999)
E. Hulata et al.
A method for spike sorting and detection based on wavelet packets and Shannon's mutual information
J. Neurosci. Methods
(2002)
E.M. Maynard et al.
A technique to prevent dural adhesions to chronically implanted microelectrode arrays
J. Neurosci. Methods
(2000)
F. Ohberg et al.
A neural network approach to real-time spike discrimination during simultaneous recording from several multi-unit nerve filaments
J. Neurosci. Methods
(1996)
M.C. Quirk et al.
Interaction between spike waveform classification and temporal sequence detection
J. Neurosci. Methods
(1999)
M. Salganicoff et al.
Unsupervised waveform classification for multi-neuron recordings: a real-time, software-based system. I. Algorithms and implementation
J. Neurosci. Methods
(1988)
E.M. Schmidt
Computer separation of multi-unit neuroelectric data: a review
J. Neurosci. Methods
(1984)
R.K. Snider et al.
Classification of non-stationary neural signals
J. Neurosci. Methods
(1998)

G. Zouridakis et al.

Identification of reliable spike templates in multi-unit extracellular recordings using fuzzy clustering

Comput. Methods Prog. Biomed.

(2000)

J.D. Banfield et al.

Model-based Gaussian and non-Gaussian clustering

Biometrics

(1993)

G. Celeux et al.

A component-wise EM algorithm for mixtures

(1999)

A.P. Dempster et al.

Maximum likelihood from incomplete data using the EM algorithm (with discussion)

J. R. Stat. Soc. B

(1977)

J.P. Donoghue

Connecting cortex to machines: recent advances in brain interfaces

Nat. Neurosci.

(2002)

M.S. Fee et al.

Variability of extracellular spike waveforms of cortical neurons

J. Neurophysiol.

(1996)

M. Figueiredo et al.

Unsupervised learning of finite mixture models

IEEE Trans. PAMI

(2002)

A.P. Georgopoulos et al.

On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex

J. Neurosci.

(1982)

K.D. Harris et al.

Accuracy of tetrode spike separation as determined by simultaneous intracellular and extracellular measurements

J. Neurophysiol.

(2000)

A.C. Hoogerwerf et al.

A three-dimensional microelectrode array for chronic neural recording

IEEE Trans. Biomed. Eng.

(1994)

Cited by (278)

Classification of overlapping spikes using convolutional neural networks and long short term memory
2022, Computers in Biology and Medicine
Spike sorting is one of the key techniques to understand brain activity. In this paper, we propose a novel deep learning approach based on convolutional neural networks (CNN) and long short term memory (LSTM) to implement overlapping spike sorting. The results of the simulated data demonstrated that the clustering accuracy was greater than 99.9% and 99.0% for non-overlapping spikes and overlapping spikes, respectively. Moreover, the proposed method performed better than our previous deep learning approach named “1D-CNN”. In addition, the experimental data recorded from the primary visual cortex of a macaque monkey were used to evaluate the proposed method in a practical application. It was shown that the method could successfully isolate most overlapping spikes of different neurons (ranging from two to five). In summary, the CNN + LSTM method proposed in this paper is of great advantage for overlapping spike sorting with high accuracy. It lays the foundation for application in more challenging works, such as distinguishing the simultaneous recordings of multichannel neuronal activities.
Perceptual learning as a result of concerted changes in prefrontal and visual cortex
2021, Current Biology
Our perceptual ability remarkably improves with training. Some studies on visual perceptual learning have shown refined neural representation of the trained stimulus in the visual cortex, whereas others have exclusively argued for improved readout and decision-making processes in the frontoparietal cortex. The mixed results have rendered the underlying neural mechanisms puzzling and hotly debated. By simultaneously recording from monkey visual area V4 and ventrolateral prefrontal cortex (PFC) implanted with microelectrode arrays, we dissected learning-induced cortical changes over the course of training the monkeys in a global form detection task. Decoding analysis dissociated two distinct components of neuronal population codes that were progressively and markedly enhanced in both V4 and PFC. One component was closely related to the target stimulus feature and was subject to task-dependent top-down modulation; it emerged earlier in V4 than PFC and its enhancement was specific to the trained configuration of the target stimulus. The other component of the neural code was entirely related to the animal’s behavioral choice; it emerged earlier in PFC than V4 and its enhancement completely generalized to an untrained stimulus configuration. These results implicate two concurrent and synergistic learning processes: a perceptual process that is specific to the details of the trained stimulus feature and a cognitive process that is dependent on the total amount of learning experience in the trained task. When combined, these two learning processes were well predictive of the animal’s learning behavior.
Slow Drift of Neural Activity as a Signature of Impulsivity in Macaque Visual and Prefrontal Cortex
2020, Neuron
Citation Excerpt :
Experimental procedures were approved by the Institutional Animal Care and Use Committee of the University of Pittsburgh and were performed in accordance with the United States National Research Council’s Guide for the Care and Use of Laboratory Animals. Voltage signals were spike sorted with semi-supervised sorting algorithms (Shoham et al., 2003) and visually inspected using custom MATLAB software (Kelly et al., 2007), taking into account waveform shapes and inter-spike interval distributions (https://github.com/smithlabvision/spikesort). Our data consisted of both well-isolated single units and multi-units, and we refer to each unit as a “neuron.”
An animal’s decision depends not only on incoming sensory evidence but also on its fluctuating internal state. This state embodies multiple cognitive factors, such as arousal and fatigue, but it is unclear how these factors influence the neural processes that encode sensory stimuli and form a decision. We discovered that, unprompted by task conditions, animals slowly shifted their likelihood of detecting stimulus changes over the timescale of tens of minutes. Neural population activity from visual area V4, as well as from prefrontal cortex, slowly drifted together with these behavioral fluctuations. We found that this slow drift, rather than altering the encoding of the sensory stimulus, acted as an impulsivity signal, overriding sensory evidence to dictate the final decision. Overall, this work uncovers an internal state embedded in population activity across multiple brain areas and sheds further light on how internal states contribute to the decision-making process.
Dynamic Modeling of Spike Count Data With Conway-Maxwell Poisson Variability
2023, Neural Computation
A Bayesian approach for simultaneous spike/LFP separation and spike sorting
2023, Journal of Neural Engineering
An Unsupervised and Refractoriness-supported Algorithm Design for Spike Sorting
2023, TIPTEKNO 2023 - Medical Technologies Congress, Proceedings

View all citing articles on Scopus

View full text

Robust, automatic spike sorting using mixtures of multivariate t-distributions

Abstract

Introduction

Section snippets

Theory: statistics of spike-shape variability

Algorithms: clustering with mixtures of multivariate t-distributions

Experimental methods

Spike waveform statistics

Discussion

Acknowledgements

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

J. Neurosci. Methods

Comput. Methods Prog. Biomed.

Model-based Gaussian and non-Gaussian clustering

Biometrics

A component-wise EM algorithm for mixtures

Maximum likelihood from incomplete data using the EM algorithm (with discussion)

J. R. Stat. Soc. B

Connecting cortex to machines: recent advances in brain interfaces

Nat. Neurosci.

Variability of extracellular spike waveforms of cortical neurons

J. Neurophysiol.

Unsupervised learning of finite mixture models

IEEE Trans. PAMI

On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex

J. Neurosci.

Accuracy of tetrode spike separation as determined by simultaneous intracellular and extracellular measurements

J. Neurophysiol.

A three-dimensional microelectrode array for chronic neural recording

IEEE Trans. Biomed. Eng.