Learning to select actions with spiking neurons in the Basal Ganglia

Terrence C Stewart; Trevor Bekolay; Chris Eliasmith

doi:10.3389/fnins.2012.00002

Learning to select actions with spiking neurons in the Basal Ganglia

Front Neurosci. 2012 Jan 31:6:2. doi: 10.3389/fnins.2012.00002. eCollection 2012.

Authors

Terrence C Stewart¹, Trevor Bekolay, Chris Eliasmith

Affiliation

¹ Centre for Theoretical Neuroscience, University of Waterloo Waterloo, ON, Canada.

Abstract

We expand our existing spiking neuron model of decision making in the cortex and basal ganglia to include local learning on the synaptic connections between the cortex and striatum, modulated by a dopaminergic reward signal. We then compare this model to animal data in the bandit task, which is used to test rodent learning in conditions involving forced choice under rewards. Our results indicate a good match in terms of both behavioral learning results and spike patterns in the ventral striatum. The model successfully generalizes to learning the utilities of multiple actions, and can learn to choose different actions in different states. The purpose of our model is to provide both high-level behavioral predictions and low-level spike timing predictions while respecting known neurophysiology and neuroanatomy.

Keywords: basal ganglia; neural engineering framework; reinforcement learning; two-armed bandit; ventral striatum.