Neural Circuitry of Reward Prediction Error

Annu Rev Neurosci. 2017 Jul 25:40:373-394. doi: 10.1146/annurev-neuro-072116-031109. Epub 2017 Apr 24.

Abstract

Dopamine neurons facilitate learning by calculating reward prediction error, or the difference between expected and actual reward. Despite two decades of research, it remains unclear how dopamine neurons make this calculation. Here we review studies that tackle this problem from a diverse set of approaches, from anatomy to electrophysiology to computational modeling and behavior. Several patterns emerge from this synthesis: that dopamine neurons themselves calculate reward prediction error, rather than inherit it passively from upstream regions; that they combine multiple separate and redundant inputs, which are themselves interconnected in a dense recurrent network; and that despite the complexity of inputs, the output from dopamine neurons is remarkably homogeneous and robust. The more we study this simple arithmetic computation, the knottier it appears to be, suggesting a daunting (but stimulating) path ahead for neuroscience more generally.

Keywords: arithmetic; circuitry; dopamine; learning; prediction error; reward.

Publication types

  • Review

MeSH terms

  • Animals
  • Brain / physiology*
  • Dopamine / physiology*
  • Humans
  • Learning / physiology*
  • Nerve Net / physiology*
  • Neural Pathways / physiology
  • Reward*

Substances

  • Dopamine