Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro
eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleResearch Article: Methods/New Tools, Novel Tools and Methods

SaLSa: A Combinatory Approach of Semi-Automatic Labeling and Long Short-Term Memory to Classify Behavioral Syllables

Shuzo Sakata
eNeuro 21 November 2023, 10 (12) ENEURO.0201-23.2023; https://doi.org/10.1523/ENEURO.0201-23.2023
Shuzo Sakata
Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow G4 0RE, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Article Figures & Data

Figures

  • Extended Data
  • Figure 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1.

    SaLSa. After recording videos and processing them with DeepLabCut (“pose estimation”), spatial and temporal features are extracted from the egocentric coordinates of tracked body parts (feature extraction). Based on a set of videos, an LSTM classifier is trained (model training). This component consists of two parts. First, through fully automated unsupervised processes, including dimensionality reduction and clustering, behavioral syllable candidates are identified. Using a graphical user interface, the identified candidates are manually labeled (semi-automatic labeling). For the training and evaluation of an LSTM classifier, labeled data will be used. Once the classifier is trained, the entire sequence of extracted features is processed to classify behavioral syllables.

  • Figure 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2.

    Semi-automatic labeling. A, A frame with tracked body parts. Although 10 body parts were tracked, the mid-tail and tail tip were excluded for quantitative analyses. B, The categories of extracted features. C, A chunk of normalized feature values. Each feature was Z-scored for a visualization purpose. Dotted lines separate different feature categories. The feature category number corresponds to the number indicated in B. D, Cumulative distribution of explained variance across principal components (PCs). The threshold for including PCs was set at 85% variance explained. E, UMAP representation of the entire video sequences and example frame sequences of labeled behavioral syllables. Data reduced by principal component analysis (PCA) was further reduced to two dimensions by UMAP. A spectral clustering algorithm was applied. By removing fragmented (<0.25 s) sequences, each cluster was manually annotated. The example sequences indicate a down-sampled sequence of each labeled behavioral syllable. Please note that although the mid-tail and tail tip were shown in sample frames, because the tail shape did not reflect behavioral syllables, the mid-tail and tail tip were excluded from all quantitative analyses including the UMAP analysis.

  • Figure 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3.

    LSTM training data and performance. A, The proportion of each behavioral syllable for LSTM training. B, Receiver operating characteristic curves for each behavioral syllable based on independently labeled 10 videos. FA, false alarm. C, The area under the curve (AUC) values across behavioral syllables. In Extended Data Figure 3-1, a range of parameters in the LSTM model were systematically assessed. The evaluation performance of a multiclass support vector machine is shown in Extended Data Figure 3-2.

  • Figure 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4.

    Benchmark testing of SaLSa. A, Confusion matrix between manually labeled data (y-axis) and SaLSa outputs (x-axis). The values indicate what fraction of manually labeled data were classified as each syllable by SaLSa. B, Confusion matrix between manually labeled data and keypoint-MoSeq (kpms) outputs. Syllable 16 is a subthreshold (0.5% cutoff) class. C, Confusion matrix between outputs from SaLSa and kpms. The values indicate what fraction of frames classified by SaLSa were classified by kpms. D, Confusion matrix between manually labeled data and re-assigned kpms syllables. The original kpms syllables were re-assigned to one of six syllables based on comparison to manually labeled data (B). E, Confusion matrix between SaLSa output and the re-assigned kpms syllables.

  • Figure 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5.

    Quantification of behavioral syllables. A, The median speed per episode of each video across behavioral syllables. B, The median turning angle per episode of each video across behavioral syllables. C, The median compactness per episode of each video across behavioral syllables. D, The fraction of each behavioral syllable. E, The average episode duration of each video across behavioral syllables. Inset, p-values of post hoc pair-wise comparisons (n = 43, two-way ANOVA with post hoc Tukey’s HSD test). L, locomotion; T, turning; R, rearing; G, grooming; P, pause.

  • Figure 6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 6.

    Age-dependent effects of genotype and sex on behavioral contents in 5xFAD mice. The fraction of each behavioral syllable as a function of age. Behavioral syllables are (A) locomotion, (B) turning, (C) rearing, (D) grooming, and (E) pause. p-values of ANCOVA are shown.

Extended Data

  • Figures
  • Extended Data Figure 3-1

    Systematic comparison of three parameters for evaluation accuracy and training duration. A, B, The evaluation accuracy of LSTM models with variable maximum numbers of epochs and hidden units. The gradient threshold was set at 1 in A and 2 in B. C, D, Training duration across conditions. Download Figure 3-1, TIF file.

  • Extended Data Figure 3-2

    Performance of a multiclass support vector machine. A, Receiver operating characteristic curves for each behavioral syllable based on independently labeled 10 videos. FA, false alarm. B, The area under the curve (AUC) values across behavioral syllables. Download Figure 3-2, TIF file.

Back to top

In this issue

eneuro: 10 (12)
eNeuro
Vol. 10, Issue 12
December 2023
  • Table of Contents
  • Index by author
  • Masthead (PDF)
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
SaLSa: A Combinatory Approach of Semi-Automatic Labeling and Long Short-Term Memory to Classify Behavioral Syllables
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
SaLSa: A Combinatory Approach of Semi-Automatic Labeling and Long Short-Term Memory to Classify Behavioral Syllables
Shuzo Sakata
eNeuro 21 November 2023, 10 (12) ENEURO.0201-23.2023; DOI: 10.1523/ENEURO.0201-23.2023

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
SaLSa: A Combinatory Approach of Semi-Automatic Labeling and Long Short-Term Memory to Classify Behavioral Syllables
Shuzo Sakata
eNeuro 21 November 2023, 10 (12) ENEURO.0201-23.2023; DOI: 10.1523/ENEURO.0201-23.2023
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Acknowledgments
    • Footnotes
    • References
    • Synthesis
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • aging
  • Alzheimer’s disease
  • classification
  • computational ethology
  • deep learning
  • long short-term memory

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Article: Methods/New Tools

  • Building an Ecosystem of Seizure Localization Methods: Neural Fragility as the First Step
  • Open-source platform for adjustable training regimes in freely moving and head-fixed mice
  • TriNet-MTL: A Multi-Branch Deep Learning Framework for Biometric Identification and Cognitive State Inference from Auditory-Evoked EEG
Show more Research Article: Methods/New Tools

Novel Tools and Methods

  • Representation Biases: Variance Is Not Always a Good Proxy for Importance
  • Building an Ecosystem of Seizure Localization Methods: Neural Fragility as the First Step
  • What Is My Neuron Doing? Commentary on Huang et al. (2026)
Show more Novel Tools and Methods

Subjects

  • Novel Tools and Methods
  • Home
  • Alerts
  • Follow SFN on BlueSky
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Notice
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2026 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.