Detecting precise firing sequences in experimental data

https://doi.org/10.1016/S0165-0270(01)00364-8Get rights and content

Abstract

A precise firing sequence (PFS) is defined here as a sequence of three spikes with fixed delays (up to some time accuracy Δ), that repeat excessively. This paper provides guidelines for detecting PFSs, verifying their significance through surrogate spike trains, and identifying existing PFSs. The method is based on constructing a three-fold correlation among spikes, estimating the expected shape of the correlation by smoothing, and detecting points for which the correlations significantly protrude above the expected correlation. Validation is achieved by generating surrogate spike trains in which the time of each of the real spikes is randomly jittered within a small time window. The method is extensively tested through application to simulated spike trains, and the results are illustrated with recordings of single units in the frontal cortex of behaving monkeys. Pitfalls which may cause false detection of PFSs, or loss of existing PFSs, include searching for PFSs in which the same neuron participates more than once, and attempting to produce a surrogate with some fixed statistical property.

Introduction

Since 1981 there have been numerous studies on firing structures which report that neurons fire at precise times after long delays (Klemm and Sherry, 1981, Abeles, 1982, Abeles et al., 1983, Dayhoff and Gerstein, 1983b, Landolt et al., 1985, Legendy and Salcman, 1985, Lestienne and Strehler, 1987, Frostig et al., 1990, Villa and Abeles, 1990, Villa and Fuster, 1992, Lestienne, 1996, Prut et al., 1998, Lestienne et al., 1999, Nadasdy et al., 1999, Villa et al., 1999, Tetko and Villa, 2001b). Some of these findings were the result of a systematic search for such structures, while other studies reported occasional observations. Several methods for detecting such precise firing sequences (PFSs) have been published. Dayhoff and Gerstein (1983a) were the first to suggest such a method, based on long sequences of inter-spike intervals. However, their method was limited to patterns of a single spike train. Abeles and Gerstein (1988) suggested a way of finding such patterns among many neurons recorded simultaneously. However, their method was based on the assumption that all time intervals within a pattern are equally likely. This tends to introduce false positives when pairs of neurons show high correlations and when the same neuron participates twice (or more) in the same pattern. They also pooled together all patterns of the same complexity (say quintuplets), which tends to generate false negatives when only few of the neurons generate PFSs.

This method was improved and tested by numerous simulations of Poissonian spike trains by Tetko and Villa (1997). In this method too, all patterns of a given complexity and repetitions are pulled together.

In parallel recordings of five or more single units, one may find a large number of excessively recurring triplets, a smaller number (with fewer repetitions) of quadruplets and an even smaller number of quintuplets, etc. (Abeles and Gerstein, 1988, Abeles et al., 1993, Tetko and Villa, 1997, Baker and Lemon, 2000). Our experience has been that in order to study the relevance of such sequences, it is advisable to look at triplets: they repeat to a degree which enables examination of their relationship to behavior. Prut et al. (1998) suggested a method for detecting excessively repeating triplets of spikes which overcame the limitations mentioned above, and revealed several unexpected relationships to behavior. However, their method was based on multiple, partially dependent tests of the same data. This may result in an unknown level of confidence, and may introduces false positives. On the other hand, their method of estimating the probability of seeing a triplet by chance suffered from overfitting, which tends to produce false negatives.

A variant on the above methods was used by Tetko and Villa (2001a). They only considered patterns that appeared to be significant in several searching algorithms. They also allowed for variable precision of the various spikes in a pattern. They reported an excess of precise patterns both in the auditory cortex and in the thalamus. The drawback here is that multiple testing is carried out without evaluating the effect of such testing on the probability of obtaining false positives.

Other definitions of precise patterns have also been put forward. Oram et al. (1999) defined a PFS as a sequence of three spikes that repeated twice within a single response. Nadasdy et al. (1999) defined a pattern as a sequence of ordered firing (A followed by B, followed by C, …) disregarding the exact intervals between spikes. This paper is restricted to triplets composed of spikes from three different neurons and repeating with precisely (up to some accuracy Δ) the same intervals.

The methods and findings described above, confronted two closely related lines of critique: first, when deciding whether a certain pattern reoccurs excessively, what is the null hypothesis which must be rejected? Secondly, how does one demonstrate that such patterns do not occur by chance? Oram et al. (1999), recorded single unit activity from monkey V1 and LGN, and found many repeating triplets. However, when they generated surrogate spike trains with the same firing rate fluctuations, the same number of spikes, and the same refractoriness as in the original data, they found just as many repeating triplets. Similar findings for the primary and supplementary motor cortices were reported by Baker and Lemon (2000).

Date et al., 1998, Date et al., 1999 suggested a solution. Their solution may be phrased as follows. The null hypothesis is the assumption that spike timing accuracy does not matter up to some value W (ms). If the null is true, the timing of every spike may be randomly jittered around its true time, without affecting the relevant statistics derived from the spike train. Jittering may be accomplished within a box-car window of width W, or with other window-shapes. This approach will henceforth be referred to as the BDG approach. They further suggested testing the significance of a deviation from the null hypothesis using four steps: The first step involves computing an appropriate statistic from the real data. The second step involves jittering the spike times within a window W and re-computing the same statistic for the jittered spike train. The jittering is then repeated and the statistic computed many times. In the third step, the probability density function of the statistics as derived from the jittered spike train is estimated. Finally, if the statistic derived from the real data is located at the tail of the distribution, where the area under the tail is less than P, we can reject the null hypothesis at confidence level P. Thus the BDG approach solves both problems described above.

The BDG approach does not, however, tell us how to choose an appropriate statistic! The present study concentrates on finding just that. Identification of excessively recurring triplets of spikes is a by-product of this process.

Section snippets

Methods

The methods of computation and testing will be described in the results section. This section presents the methods used to obtain the data. They were either the outcome of simulations, or taken from recordings in behaving monkeys.

Results

We developed a set of techniques to assess the probability of seeing so many triplets, repeating that many times, by chance. These are presented below, in six sections. First we show how to build a histogram of all possible triplets, then we show the conditions under which the process of counting in this histogram is Poissonian. The third section shows how to attach a probability-like value to each bin in the histogram. This requires an estimation of the form the histogram may take by chance.

Discussion

The procedure described above yields identification of the existence of precise timing and identification of PFSs. The BDG method of jittering is able to test whether our conclusions are valid. The power of their test depends entirely on the statistics we extract from the data (distribution of P-values, and the probability of the least likely event). We first discuss conditions under which our procedure may fail. Following that, we discuss other methods for obtaining surrogate spike trains.

Acknowledgements

We wish to thank A. Aertsen, H. Bergman, D.R. Brillinger, S. Geman, R. Guetig, T. Kumar, Y. Ritov for discussion and suggestions. This research was supported in part by grants from the German Israeli Binational fund (GIF), and the USA Israel Binational Fund (BSF).

References (33)

  • S. Baker et al.

    Precise spatiotemporal repeating patterns in monkey primary and supplementary motor areas occur at chance level

    J. Neurophysiol.

    (2000)
  • L.E. Baum

    An inequality and associated maximization technique in statistical estimation of probabilistic functions of Markov processes

    Inequalities

    (1972)
  • Brillinger DR. Statistical inference for stationary point processes, In: Puri ML, editor. Stochastic processes and...
  • Date A, Bienenstock E, Geman S. On the temporal resolution of neural activity. Technical report, Division of Applied...
  • A. Date et al.

    A statistical technique for detection of fine temporal structure in multi neuronal spike trains

    Soc. Neurosci. Abstr.

    (1999)
  • J. Dayhoff et al.

    Favored patterns in spike trains. I. Detection

    J. Neurophysiol.

    (1983)
  • Cited by (75)

    • Revealing cell assemblies at multiple levels of granularity

      2014, Journal of Neuroscience Methods
      Citation Excerpt :

      Here we introduce a technique to identify such neuron assemblies directly from multivariate spiking data, based on two steps: the definition of a simple biophysically-inspired similarity measure obtained from the observed spiking dynamics, followed by its analysis using a recent framework for multiscale community detection in weighted, directed graphs. A variety of techniques have been proposed to cluster spike-train groups to date, and have shown promising results in particular settings (Fellous et al., 2004; Feldt et al., 2009; Humphries, 2011; Lopes-Dos-Santos et al., 2011, 2013; Quiroga and Panzeri, 2009; Abeles and Gat, 2001; Laubach et al., 1999; Peyrache et al., 2010; Gansel and Singer, 2012). In contrast to these techniques, our methodology provides a dynamics-based framework, in which both the similarity measure and the community detection method are geared toward incorporating key features of neural network dynamics.

    • Detecting cell assemblies in large neuronal populations

      2013, Journal of Neuroscience Methods
    • Detecting synfire chains in parallel spike data

      2012, Journal of Neuroscience Methods
    View all citing articles on Scopus
    View full text