Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT

User menu

Search

  • Advanced search
eNeuro
eNeuro

Advanced Search

 

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Blog
    • Collections
    • Podcast
  • TOPICS
    • Cognition and Behavior
    • Development
    • Disorders of the Nervous System
    • History, Teaching and Public Awareness
    • Integrative Systems
    • Neuronal Excitability
    • Novel Tools and Methods
    • Sensory and Motor Systems
  • ALERTS
  • FOR AUTHORS
  • ABOUT
    • Overview
    • Editorial Board
    • For the Media
    • Privacy Policy
    • Contact Us
    • Feedback
  • SUBMIT
PreviousNext
Research ArticleResearch Article: New Research, Cognition and Behavior

Pupil Correlates of Decision Variables in Mice Playing a Competitive Mixed-Strategy Game

Hongli Wang, Heather K. Ortega, Huriye Atilgan, Cayla E. Murphy and Alex C. Kwan
eNeuro 15 February 2022, 9 (2) ENEURO.0457-21.2022; https://doi.org/10.1523/ENEURO.0457-21.2022
Hongli Wang
1Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Heather K. Ortega
1Interdepartmental Neuroscience Program, Yale University School of Medicine, New Haven, CT 06511
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Huriye Atilgan
2Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Cayla E. Murphy
2Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Cayla E. Murphy
Alex C. Kwan
2Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06511
3Department of Neuroscience, Yale University School of Medicine, New Haven, CT 06511
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alex C. Kwan
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

Article Figures & Data

Figures

  • Extended Data
  • Figure
    • Download figure
    • Open in new tab
    • Download powerpoint
  • Figure 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1.

    Performance of head-fixed mice in a matching pennies game. A, A schematic illustration of the competitive game. Head-fixed mouse makes a left or right choice by licking the spouts. A computer monitors the mouse’s past choices and outcomes, generating its own left or right choice every trial. B, The payoff matrix of the game. The mouse receives a water reward if it chooses the same choice as the computer does in the same trial. C, Trial timing: the mouse waits for a go cue, licks a spout to indicate its response, and the outcome is delivered immediately. A random ITI follows the outcome. D, An example session. The reward rate for this session was 52.1%. Top, The mouse’s choices and outcomes. Bottom, The computer’s choices. Blue and red bars indicate right (R) and left choices (L), respectively. Black bars indicate rewards. E, Cumulative number of different three-choice patterns detected as the mouse progressed in the session shown in D. F, Summary from 81 sessions. Left, The average trials performed each session is 513 ± 13. Middle, The average entropy of the three-choice sequences is 2.87 ± 0.02. Right, The average reward rate is 44.0 ± 0.5%. G, The histogram of the ITI durations for all trials. ITI would range from 4 to 8 s long if the mouse did not lick to trigger additions to the ITI. H, The response times for trials in which the mouse chose left (left) or right (right).

  • Figure 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2.

    Computational modeling of the animals’ behavior during matching pennies. A, Schematic of the hybrid FQ_RPE_CK model. QL,QR denote the action values for left and right choices; KL,KR denote the choice kernels for left and right choices. The agent chooses based on a weighted sum of action-value and choice-kernel differences (ΔQ, ΔK). The chosen action is used to compute the CKE, which is used to update the choice kernels. The outcome is used to calculate the RPE, which is used to update the action values. B, Model comparison using BIC. C, An example of the time course of the latent variables and predicted behavior. Top, Long red and blue bars indicate rewarded left and right choices; short red and blue bars indicate unrewarded left and right choices. Gray line shows the observed probability to choose left, smoothed by a Gaussian kernel. Black line shows the probability to choose left predicted by the hybrid model. Middle, The action values of left (red) and right (blue) choices estimated by the hybrid model. Bottom, The choice kernels of left (red) and right (blue) choices estimated by the hybrid model.

  • Figure 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3.

    Comparison between matching pennies and two-armed bandit: behavioral results, model fitting, and computer simulations. A, A schematic diagram of the two-armed bandit task. Top, Head-fixed mouse makes a left or right choice by licking the spouts. The trials are separated into different blocks based on the high (0.7) or low (0.1) reward probability assigned to each of the two choices. Middle, The assigned reward probabilities for the left (red) and right (blue) choices in an example session. Bottom, The choices and outcomes in the same session. The reward rate was 52.8%. Blue and red bars indicate right and left choices. Black bars indicate rewards. B, Summary from 26 sessions. Top, The average trials performed each session is 702 ± 26. Bottom, The average reward rate is 43.5 ± 1.1%. C, Model comparison using BIC. D, Learning parameters extracted from fitting the hybrid model to the matching pennies data. Left, The relative weight of choice kernel, βK/(β + βK) = 0.75 ± 0.04. Middle, Sum of inverse temperature, β + βK = 2.15 ± 0.07. Right, Relative learning rate of the choice kernel. αK/α = 0.22 ± 0.04. E, Psychometric curve based on the whole dataset. Black histograms indicate the distribution of trials according to the weighted sum of the difference of action values (ΔQ) and the difference of choice kernels (ΔK). Dashed purple line shows the predicted probability to choose left according to the softmax equation used by the hybrid model; purple dots show the observed probability to choose left. F, The performance of a computational agent playing the game, where βK/(β + βK) was varied, while the learning rates and (β + βK) were fixed and set to be the median of the fitted values based on animal data. The solid and open dots indicate the median βK/(β + βK) value fitted based on animal data. G–I, Same as D–F for the two-armed bandit task. See also Extended Data Figure 3-1.

  • Figure 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4.

    Effects of choices and outcomes on pupil responses during matching pennies. A, A schematic illustration of the pupillometry set up. B, An example still frame from a video showing both human labeling (dot) and DLC labeling (cross) for five labels. Scale bar: 50 pixels. C, The pupil response at any time during a trial (−1–5 s from the cue) is the z score at the corresponding time subtracted by the baseline, which is the mean z score between −2 and −1 s before the cue onset. D, A schematic diagram of the multiple linear regression model, i.e., Equation 9, that was fit to the pupil response in each 100-ms time bin. E, The fraction of sessions with significant regression coefficient for choice in the next trial cn+1, choice in the current trial cn, choice in the previous trial cn–1, and choice in the trial before the previous trial cn–2. Red shading indicates the p-value from the χ2 test, without correcting for multiple comparison. F, Same as E for trial outcomes. G, Same as E for the interactions of choice and outcome. H, Same as E for recent reward rate, calculated as a moving average over last 20 trials, and the cumulative reward from start of session to current trial. I, The mean regression coefficients of several predictors: choice of the current trial (cn), reward of the current trial (rn). Shading indicates the 95% confidence interval estimated by bootstrap. See also Extended Data Figure 4-1.

  • Figure 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5.

    Effects of decision variables for action selection on pupil responses during matching pennies. A, A schematic diagram of the multiple linear regression model, i.e., Equation 12. B, The fraction of sessions with significant regression coefficient for the choice cn, outcome rn, and the interaction cn x rn in the current trial. Red shading indicates the p-value from the χ2 test, without correcting for multiple comparison. C, Same as B for the choice, outcome, and interaction in the previous trial. D, Same as B for the difference in action values (QnL−QnR ), the action value of the chosen action (Qnchosen ), and the difference in the choice kernel (KnL−KnR ). E, Same as B for choice kernel of the chosen action (Knchosen ), moving-average reward rate (rnMA¯ ), and cumulative reward (rnCum. ).

  • Figure 6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 6.

    Effects of decision variables for value updating on pupil responses during matching pennies. A, A schematic diagram of the multiple linear regression model, i.e., Equation 13. B, The fraction of sessions with significant regression coefficient for choice in the current trial cn, choice in the previous trial cn–1, and outcome in the previous trial rn–1. Red shading indicates the p-value from the χ2 test, without correcting for multiple comparison. C, Same as B for difference in action values (QnL−QnR ), the RPE, and the difference in choice kernels (KnL−KnR ). D, Same as B for the choice kernel error (CKE ), moving-average reward rate (rnMA¯) , and cumulative reward (rnCum.) . E, The mean regression coefficients for RPE (quantified separately for trials with positive or negative RPE) and CKE. Shading indicates the 95% confidence interval estimated by bootstrap.

  • Figure 7.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 7.

    Multiple linear regression analyses of factors influencing pupil responses during two-armed bandit. A, The fraction of sessions with significant coefficient for choice in the next trial cn+1, choice in the current trial cn, choice in the previous trial cn–1, outcome in the next trial rn+1, outcome in the current trial rn, and outcome in the previous trial rn–1. The results were obtained by fitting Equation 9 as we did in Figure 4 but onto pupil responses from two-armed bandit task. Red shading indicates the p-value from the χ2 test, without correcting for multiple comparison. B, Same as A but for the difference in action values (QnL−QnR ), the action value of the chosen action (Qnchosen ), the difference in the choice kernel (KnL−KnR ), and the choice kernel of the chosen action (Knchosen ). The results were obtained by fitting Equation 12 as we did in Figure 5 but onto pupil responses from two-armed bandit task. C, Same as A but for the RPE and the CKE. The results were obtained by fitting Equation 13 as we did in Figure 6 but onto pupil responses from two-armed bandit task.

  • Figure 8.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 8.

    Two-pupil recordings during the bandit task. A, Schematics of the two-pupil recordings setup. Two cameras were placed in front of the two pupils with the same angle while the mouse was performing the task. B, Scatter plot of the linear regression coefficients of the current choice within the 3–5 s from the cue time. The linear regression is the same as shown in Figure 4. x-axis: coefficients of the left pupil; y-axis: coefficients of the right pupil. Different colors represent different subjects. Each dot represents a 0.1-s interval within the 3- to 5-s period. The black line shows the diagonal when coefficients of the left pupil equal to that of the right pupil. C, Same as B for current outcome.

Extended Data

  • Figures
  • Extended Data Figure 3-1

    Computational modeling of the animals’ behavior during two-armed bandit. A, The switch of the reward probabilities of left (red) and right (blue) choices. B, An example of the time course of the latent variables and predicted behavior in the same session as A. Top, Long red and blue bars indicate rewarded left and right choices. Short red and blue bars indicate unrewarded left and right choices. Gray line shows the observed probability to choose left, smoothed by a Gaussian kernel. Black line shows the probability to choose left predicted by the hybrid model. Middle, The action values of left (red) and right (blue) choices estimated by the hybrid model. Bottom, The choice kernels of left (red) and right (blue) choices estimated by the hybrid model. Download Figure 3-1, EPS file.

  • Extended Data Figure 4-1

    Extracting the pupil size using DLC. A, Deviation of DLC labels from manually selected labels in the x-axis (n = 324 frames, taken from different sessions of multiple animals). B, Same as A for y-axis. C, An example trace of the z score of pupil diameter during a session of matching pennies. D, Mean traces of z score of pupil diameter for four different trial types: rewarded left choice, unrewarded left choice, rewarded right choice, unrewarded right choice. Shading indicates the SD. Download Figure 4-1, EPS file.

Back to top

In this issue

eneuro: 9 (2)
eNeuro
Vol. 9, Issue 2
March/April 2022
  • Table of Contents
  • Index by author
  • Ed Board (PDF)
Email

Thank you for sharing this eNeuro article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Pupil Correlates of Decision Variables in Mice Playing a Competitive Mixed-Strategy Game
(Your Name) has forwarded a page to you from eNeuro
(Your Name) thought you would be interested in this article in eNeuro.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Pupil Correlates of Decision Variables in Mice Playing a Competitive Mixed-Strategy Game
Hongli Wang, Heather K. Ortega, Huriye Atilgan, Cayla E. Murphy, Alex C. Kwan
eNeuro 15 February 2022, 9 (2) ENEURO.0457-21.2022; DOI: 10.1523/ENEURO.0457-21.2022

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Share
Pupil Correlates of Decision Variables in Mice Playing a Competitive Mixed-Strategy Game
Hongli Wang, Heather K. Ortega, Huriye Atilgan, Cayla E. Murphy, Alex C. Kwan
eNeuro 15 February 2022, 9 (2) ENEURO.0457-21.2022; DOI: 10.1523/ENEURO.0457-21.2022
Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Visual Abstract
    • Abstract
    • Significance Statement
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Acknowledgments
    • Footnotes
    • References
    • Synthesis
    • Author Response
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • action selection
  • arousal
  • decision-making
  • decisions
  • mouse
  • value

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Article: New Research

  • Heading and then saccades predict visual discrimination decisions in freely moving ferrets
  • Disrupting motor cortical regional activity during motor sequence skill training impairs human motor visuomotor skill acquisition and learning that is not sequence-specific
  • Whole-Brain Mapping of Neuronal Activity Associated with Vocal Socialization Behaviors in Adult Mice
Show more Research Article: New Research

Cognition and Behavior

  • Disrupting motor cortical regional activity during motor sequence skill training impairs human motor visuomotor skill acquisition and learning that is not sequence-specific
  • Whole-Brain Mapping of Neuronal Activity Associated with Vocal Socialization Behaviors in Adult Mice
  • Effect of Functionally Selective Dopamine D1 Receptor Agonists on Complex Cognitive Processes in a Rodent Touchscreen Operant Chamber Task
Show more Cognition and Behavior

Subjects

  • Cognition and Behavior
  • Home
  • Alerts
  • Follow SFN on BlueSky
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Latest Articles
  • Issue Archive
  • Blog
  • Browse by Topic

Information

  • For Authors
  • For the Media

About

  • About the Journal
  • Editorial Board
  • Privacy Notice
  • Contact
  • Feedback
(eNeuro logo)
(SfN logo)

Copyright © 2026 by the Society for Neuroscience.
eNeuro eISSN: 2373-2822

The ideas and opinions expressed in eNeuro do not necessarily reflect those of SfN or the eNeuro Editorial Board. Publication of an advertisement or other product mention in eNeuro should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in eNeuro.