Reinforcement learning and decision making in monkeys during a competitive game

Daeyeol Lee; Michelle L Conroy; Benjamin P McGreevy; Dominic J Barraclough

doi:10.1016/j.cogbrainres.2004.07.007

Brain Res Cogn Brain Res. 2004 Dec;22(1):45-58. doi: 10.1016/j.cogbrainres.2004.07.007.

Authors

Daeyeol Lee¹, Michelle L Conroy, Benjamin P McGreevy, Dominic J Barraclough

Affiliation

¹ Department of Brain and Cognitive Sciences, Center for Visual Science, University of Rochester, Rochester, NY 14627, USA. dlee@cvs.rochester.edu

PMID: 15561500
DOI: 10.1016/j.cogbrainres.2004.07.007

Abstract

Animals living in a dynamic environment must adjust their decision-making strategies through experience. To gain insights into the neural basis of such adaptive decision-making processes, we trained monkeys to play a competitive game against a computer in an oculomotor free-choice task. The animal selected one of two visual targets in each trial and was rewarded only when it selected the same target as the computer opponent. To determine how the animal's decision-making strategy can be affected by the opponent's strategy, the computer opponent was programmed with three different algorithms that exploited different aspects of the animal's choice and reward history. When the computer selected its targets randomly with equal probabilities, animals selected one of the targets more often, violating the prediction of probability matching, and their choices were systematically influenced by the choice history of the two players. When the computer exploited only the animal's choice history but not its reward history, the animal's choices became more independent of its own choice history but were still related to the choice history of the opponent. This bias was substantially reduced, but not completely eliminated, when the computer used the choice history of both players in making its predictions. These biases were consistent with the predictions of reinforcement learning, suggesting that the animals sought optimal decision-making strategies using reinforcement learning algorithms.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms
Animals
Behavior, Animal
Competitive Behavior / physiology*
Decision Making / physiology*
Feedback
Learning / physiology*
Logistic Models
Macaca mulatta
Male
Models, Psychological
Probability
Reinforcement, Psychology*
Time Factors
