Abstract
Sensory cues are critical for shaping decisions and invigorating actions during reward seeking. Dopamine neurons in the ventral tegmental area (VTA) are central in this process, supporting associative learning in Pavlovian and instrumental settings. Studies of intracranial self-stimulation (ICSS) behavior, which show that animals will work hard to receive stimulation of dopamine neurons, support the notion that dopamine transmits a reward or value signal to support learning. Recent studies have begun to question this, however, emphasizing dopamine's value-free functions, leaving its contribution to behavioral reinforcement somewhat muddled. Here, we investigated the role of sensory stimuli in dopamine-mediated reinforcement, using an optogenetic ICSS paradigm in tyrosine hydroxylase (TH)-Cre rats. We find that while VTA dopamine neuron activation in the absence of explicit external cues is sufficient to maintain robust self-stimulation, the presence of cues dramatically potentiates ICSS behavior. Our results support a framework where dopamine can have some base value as a reinforcer, but the impact of this signal is modulated heavily by the sensory learning context.
Significance Statement
Dopamine neurons are critical for cue-based learning, and animals will work hard to activate them with brain stimulation. This self-stimulation behavior suggests dopamine transmits a reward or value signal, but recent work has emphasized dopamine's value-free functions, leaving its contribution to behavioral reinforcement somewhat unclear. We examined this question by manipulating the sensory learning context associated with dopamine neuron self-stimulation. We find that optogenetic activation of dopamine neurons in the absence of external cues is sufficient to support robust self-stimulation, but this behavior is dramatically potentiated in the presence of cues. Our results support a dual role for dopamine neurons in representing value and guiding behavior as a teaching signal.
Introduction
Environmental cues are critical for successful decision making. Through learning, sensory events associated with rewarding experiences acquire motivational impact, gaining the power to invigorate and reinforce actions on their own. Dopamine neurons in the ventral tegmental area (VTA) are critical in this process, supporting associative learning in both Pavlovian and instrumental settings (Tsai et al., 2009; Witten et al., 2011; Steinberg et al., 2013; Hamid et al., 2016; Sharpe et al., 2017; Saunders et al., 2018; Keiflin et al., 2019).
Classic studies of intracranial self-stimulation behavior demonstrate that animals work hard to receive electrical stimulation delivered to areas of the brain associated with dopamine transmission - including the ventral midbrain, medial forebrain bundle, and striatum (Olds and Milner, 1954; Mogenson et al., 1979; Corbett and Wise, 1980; Fibiger et al., 1987; Owesson-White et al., 2008). More recently, precise cell-type specific optogenetic targeting has demonstrated that direct activation of dopamine neurons can reinforce self-stimulation behavior (Witten et al., 2011; Kim et al., 2012; Rossi et al., 2013; Ilango et al., 2014; Steinberg et al., 2014; Covey and Cheer, 2019). This supports a perspective that dopamine neuron activity can signal value in a way that mimics a natural rewarding stimulus, like food, in the context of learning and reinforcement (Wise, 2004; Berridge, 2007; Steinberg et al., 2013). Recent studies have begun to question the scope of dopamine's role in this domain, suggesting more prominent value-free functions instead, including in novelty, salience, learning rate, and movement kinematics, among other processes (Panigrahi et al., 2015; Coddington and Dudman, 2019; Engelhard et al., 2019; Hughes et al., 2020; Maes et al., 2020; Sharpe et al., 2020; Kutlu et al., 2021; Jeong et al., 2022; Millard et al., 2022; Coddington et al., 2023; Markowitz et al., 2023).
While these various interpretations are not mutually exclusive, it remains somewhat unclear to what extent brief, phasic dopamine neuron activation is itself a direct reinforcer, or if an external sensory element or salient state change is needed for a clear demonstration of ICSS behavior. Determining this is made complicated by the fact that the majority of ICSS studies either include a cue or some other reward paired with stimulation (or fail to report if cues were used), apply ICSS tests after some other learning has occurred, and/or use a targeting method that is not specific to dopamine neurons. Here, we investigated the role of sensory stimuli in dopamine-mediated reinforcement, using an optogenetic ICSS paradigm in naive TH-cre rats (Witten et al., 2011). We found that while brief VTA dopamine neuron activation (1 s at 20 Hz) in the absence of any explicit external cueing stimulus was sufficient to clearly maintain self-stimulation, the presence of cues paired with dopamine activation dramatically potentiated ICSS behavior. Our results support a framework where dopamine has some base value as a reinforcer, but the impact of this signal is flexible and scalable depending on the sensory learning context. In real-world behaving agents, where dopamine neurons are not activated in isolation during learning, dopamine's role is likely often to assign, track, and update meaning to associated stimuli and states.
Materials and Methods
Subjects
Male and female Th-Cre transgenic rats (N = 22; 15M, 7F), bred on a Long Evans background, were used in these studies. These rats express Cre recombinase under the control of the tyrosine hydroxylase (TH) promoter (Witten et al., 2011). Wild-type littermates (Th-Cre) were used as controls. After surgery rats were individually housed with ad libitum access to food and water on a 0700–1,900 light/dark cycle (lights on at 0700). All rats weighed >250 g at the time of surgery and were 5–9 months old at the time of experimentation. Rats were handled by the experimenter several times before and after surgery. Experimental procedures were approved by the Institutional Animal Care and Use Committees at the University of California, San Francisco, and at Johns Hopkins University and were carried out in accordance with the guidelines on animal care and use of the National Institutes of Health of the United States. Upon initial investigation, we found no evidence of sex-biased effects, so males and females were collapsed within each ICSS group for all reported data.
Surgical procedures
Viral infusions and optic fiber implants were carried out as previously described. Rats were anesthetized with 5% isoflurane and placed in a stereotaxic frame, after which anesthesia was maintained at 1–3%. Rats were administered saline, carprofen anesthetic, and cefazolin antibiotic subcutaneously. The top of the skull was exposed, and holes were made for viral infusion needles, optic fiber implants, and four to five skull screws. A Cre-dependent virus coding for ChR2 (AAV5-Ef1α-DIO-ChR2-eYFP, titer 1.5–4 × 1012 particles/ml, University of North Carolina) was infused unilaterally (1 μl at each target site) at the following coordinates from Bregma for targeting VTA cell bodies: posterior −6.2 and −5.4 mm, lateral +0.7, and ventral −8.4 and −7.4. Viral injections were made using a microsyringe pump at a rate of 0.1 μl/min. Injectors were left in place for 5 min, then raised 200 microns dorsal to the injection site, left in place for another 10 min, and then removed slowly. Custom-made optic fiber implants (300 micron glass diameter) were inserted unilaterally just above and between viral injection sites (posterior −5.8 mm, lateral +0.7, ventral −7.5). Implants were secured to the skull with dental acrylic applied around skull screws and the base of the ferrule(s) containing the optic fiber. Headcaps were painted black, to block light transmission during laser delivery. At the end of all surgeries, topical anesthetic and antibiotic ointment was applied to the surgical site, rats were removed to a heating pad and monitored until they were ambulatory. Rats were monitored daily for 1 week following surgery. Optogenetic manipulations commenced at least 4 weeks after surgery.
Optogenetic stimulation
We used 473 nm lasers (OptoEngine), adjusted to produce output during individual 5 ms light pulses during experiments of ∼2 mW/mm2 at the tip of the intracranial fiber. Light power was measured before and after every behavioral session to ensure that all equipment was functioning properly. For all optogenetic studies, optic tethers connecting rats to the rotary joint were sheathed in a lightweight armored jacket to prevent cable breakage and block visible light transmission.
Intracranial self-stimulation training
Following recovery from surgery and at least 4 weeks for viral expression, rats were brought to the testing chambers (Med Associates) and connected to an optic cable tether to receive optogenetic stimulation. For all ICSS sessions for all groups, chamber lights were constantly illuminated, to further occlude unwanted light pollution from the laser delivery events. Two nose poke ports were positioned on one side of the chamber. For rats in the Cre+ cued and Cre− cued groups, pokes in the active port resulted in a 1 s laser train (20 Hz, 20 5 ms pulses, fixed-ratio 1 schedule with a 1 s timeout during each train), that coincided with the delivery of a cue complex – lights in the nose poke illuminated and a tone played. For rats in the Cre+ no cue group, active nose pokes produced the same 1 s laser train but with no cues. Inactive nose pokes were recorded but had no consequences. Laser stimulation parameters were chosen to align with those commonly used in other dopamine ICSS studies (Witten et al., 2011; Ilango et al., 2014; Saunders et al., 2018; Covey and Cheer, 2019; Fraser et al., 2023). This level of stimulation should evoke dopamine levels in the striatum that are comparable in magnitude and duration to a natural reward, such as sucrose (Saunders et al., 2018; Covey and Cheer, 2019; Hollon et al., 2021; van Elzelingen et al., 2022). In the no cue group, we eliminated all explicit, task-related cues that could be associated with stimulation behavior, but it remains possible that sensorimotor or interoceptive signals associated with movement and nose-poking behavior could provide some cues impacting learning.
Extinction
The day after the final ICSS training session, rats were returned to the testing chambers for the first day of extinction. During this 1 h session, rats were tethered and nose pokes were measured as before, but laser was not delivered. For rats in all groups, no cue was presented following active nose pokes. Extinction continued for five daily sessions.
Histology
Rats were deeply anesthetized with sodium pentobarbital and transcardially perfused with cold phosphate buffered saline followed by 4% paraformaldehyde. Brains were removed and postfixed in 4% paraformaldehyde for ∼24 h, then cryoprotected in a 25% sucrose solution for at least 48 h. Sections were cut at 50 microns on a cryostat (Leica Microsystems). To confirm viral expression and optic fiber placements, brain sections containing the midbrain were mounted on microscope slides and coverslipped with Vectashield containing DAPI counterstain. Fluorescence from ChR2-eYFP as well as optic fiber location was then visualized. A tissue from wild-type animals was examined for lack of viral expression and optic fiber placements. To verify localization of viral expression in dopamine neurons, we performed immunohistochemistry for tyrosine hydroxylase and GFP. Sections were washed in PBS and incubated with bovine serum albumin (BSA) and Triton X-100 (each 0.2%) for 20 min. About 10% of normal donkey serum (NDS) was added for a 30 min incubation, before primary antibody incubation (mouse anti-GFP, 1:1,500, Invitrogen; rabbit anti-TH, 1:500, Fisher Scientific) overnight at 4°C in PBS with BSA and Triton X-100 (each 0.2%). Sections were then washed and incubated with 2% NDS in PBS for 10 min and secondary antibodies were added (1:200 Alexa Fluor 488 donkey anti-mouse, 594 donkey anti-rabbit or 647 chicken anti-rabbit) for 2 h at room temperature. Sections were washed two times in PBS and mounted with Vectashield containing DAPI. Brain sections were imaged with a Zeiss Axio 2 microscope.
Statistics
Behavioral data were recorded with Med-PC software (Med Associates) and analyzed using Prism 9.0. Two-way repeated measures ANOVA was used to analyze changes in behavior among the groups across training. Unpaired t tests were used to compare some group averages, with a Welch's correction applied when unequal variances were observed. For cumulative responding analysis, nose poke data were put into minute-long bins. Effect sizes were not predetermined. Rats were included in optogenetic behavioral analyses if optic fiber tips were no more than ∼500 microns dorsal to the VTA. Statistical significance was set at p < 0.05.
Results
Sensory cues accelerate acquisition of dopamine neuron self-stimulation
ChR2-YFP was targeted to dopamine neurons of the VTA in TH-Cre+/− rats (Figs. 1, 2A). Optic fibers were implanted unilaterally above the VTA for optogenetic excitation of dopamine neurons via blue laser light. Fiber placements were clustered over the central VTA for rats in all groups (Fig. 1).
Targeting and optic fiber placements in the VTA. ChR2 was targeted to dopamine neurons in the ventral tegmental area and optic fibers were placed above the VTA for optogenetic manipulations.
Sensory cues facilitate acquisition of VTA dopamine neuron self-stimulation behavior. A, Optic fibers were targeted to the VTA for activation of ChR2-expressing dopamine neurons. B, Rats were allowed to self-administer blue laser in 10 1 h intracranial self-stimulation session. Active nose pokes resulted in a 1 s blue light delivery (20 Hz, 5 ms pulses). For rats in the cued groups, this coincided with the illumination of the cue light in the nose poke port, and a delivery of a tone for 1 s. C, Across training sessions, cued Cre+ ICSS rats acquired ICSS faster than no cue Cre+ rats and Cre− controls, as measured by active nose poke behavior. D, Cued Cre+ rats completed significantly more active nose pokes in total, compared to the other groups. No cue Cre+ rats completed significantly more active nose pokes than Cre− controls. E, Cued Cre+ acquired ICSS faster than no cue Cre+ rats and Cre− controls, as measured by stimulations received. F, Cued Cre+ rats received more stimulations in total, compared to the other groups. No cue Cre+ rats received significantly more stimulations than Cre− controls. Responding among the Cre− controls remained low across all sessions. Error represents SEM. ***p < 0.001, **p < 0.01, *p < 0.05.
Animals experienced 10 1 h intracranial self-stimulation sessions, during which they were tethered to an optic fiber connected to a 473-nm laser. Nose pokes into the active port resulted in a 1 s train of light delivery through the optic fiber (20 Hz, 5 ms, 2 mW pulses), which was paired with a 1 s cue (illumination of the active nose port + tone) in the Cre+ cue and Cre− cue groups (Fig. 2B). Cre+ no cue rats received identical laser stimulation, but with no cues. Self-stimulation behavior was substantially different across groups (Fig. 2). The presence of cues accelerated the acquisition of self-stimulation, with rats in the Cre+ cue group making more nose pokes in initial training sessions and reaching greater overall nose poke per session response levels by the end of training (Fig. 2C; two-way ANOVA interaction of group and session, F(18,171) = 5.92, p < 0.0001; main effect of group, F(2,19) = 7.24, p = 0.0046). Cre+ cue rats made significantly more active nose pokes overall, compared to the Cre+ no cue group (Fig. 2D, unpaired t test, t(8.5) = 2.43, p = 0.020). Cre+ no cue rats responded less for laser stimulation alone (Fig. 2C) but made significantly more active nose pokes than Cre− cue control rats (Fig. 2D, unpaired t test, t(6) = 2.35, p = 0.028) whose responses remained low across all sessions. Across groups, inactive nose pokes were low throughout, and there were no differences in total inactive responses (Fig. 2D inset, one-way ANOVA no effect of group, F(2,19) = 0.14, p = 0.87). Across training, Cre+ cue (p = 0.0039) and Cre+ no cue (p = 0.0078) rats, but not Cre− cue (p = 0.99) rats, discriminated between the active and inactive nose pokes.
We saw a similar pattern for stimulations taken. The Cre+ cue group had the most stimulations across training, finishing with a larger number of stimulations per session, compared to the Cre+ no cue group (Fig. 2E; two-way ANOVA interaction of group and session, F(18,171) = 3.67, p < 0.0001; main effect of group F(2,19) = 5.92, p = 0.010). Total stimulations taken were also significantly greater in the Cre+ cue versus no cue group (Fig. 2F, unpaired t test, t(12) = 2.39, p = 0.033). Cre+ no cue rats took significantly more stimulations than Cre− cue controls (Fig. 2F, unpaired t test, t(6) = 1.97, p = 0.04). Together these data indicate that dopamine neuron stimulation can support self-stimulation behavior in the absence of cues. However, acquisition of self-stimulation behavior is substantially accelerated by the presence of cues signaling dopamine neuron activation events.
We next examined the detailed pattern of ICSS by plotting cumulative responding within each session. The slope of response curves quickly differentiated and became super linear for Cre+ cue rats (Fig. 3), such that they responded at an accelerated pace consistently within each session, compared to no cue rats (Fig. 3A). The effect of cue presence on the slope of responding was apparent almost immediately in training. Cre+ cue rats were behaviorally distinguishable from Cre+ no cue rats within the first 10 min of the first training session (Fig. 3B; two-way ANOVA interaction of group and minute, F(18,171) = 3.86, p < 0.0001; main effect of group, F(2,19) = 4.71, p = 0.022). This early-in-session behavioral potentiation in Cre+ cue rats further escalated as training progressed (Day 5: Fig. 3C; two-way ANOVA interaction of group and minute, F(18,171) = 5.43, p < 0.0001; main effect of group, F(2,19) = 5.77, p = 0.011; Day 10: Fig. 3D; two-way ANOVA interaction of group and minute, F(18,171) = 7.55, p < 0.0001; main effect of group, F(2,19) = 9.83, p = 0.001). Together these data further support the notion that cues paired with dopamine neuron self-stimulation facilitate rapid acquisition of dopamine-seeking behavior.
Sensory cues rapidly escalate VTA dopamine neuron self-stimulation onset. A, ICSS data was expressed as cumulative active nose pokes completed within each 1 h training session, in 1 min bins. Faster responding is denoted by a steeper slope, which was evident for Cre+ cue rats versus Cre+ No cue rats. B, Cumulative responding for the first 10 min of session 1 was significantly higher for Cre+ cue rats. The rapid onset of self-stimulation at the beginning of sessions continued escalate for Cre+ cue rats in session 5 (C) and session 10 (D). Error represents SEM. ****p < 0.0001.
Sensory cues potentiate the vigor of dopamine neuron self-stimulation
We next quantified the rate and vigor of self-stimulation behavior, by examining the extent to which rats made “extra,” nonreinforced nose pokes. During training, in the 1 s period during which the laser was on for a stimulation event, additional active nose pokes were counted but had no additional consequence. Behavior during these “timeout” windows was significantly different across the self-stimulation groups (Fig. 4). Cre+ cue rats made many more active nose pokes during each stimulation event, as measured by subtracting the number of stimulations from the number of active nose pokes across each session (Fig. 4A; two-way ANOVA interaction of group and session, F(18,171) = 6.49, p < 0.0001; main effect of group, F(2,19) = 7.09, p = 0.005). Averaged across sessions, Cre+ cue rats had significantly greater total active nose pokes made during laser delivery timeouts, compared to Cre+ no cue rats (Fig. 4B, unpaired t test, t(7.3) = 2.72, p = 0.014). Although they made many fewer than Cre+ cue rats, there was a trend for Cre+ no cue rats to make more timeout active nose pokes compared to Cre− cue controls (Fig. 4B, unpaired t test, t(6) = 1.7, p = 0.06). We also calculated a ratio score of active nose pokes to stimulations, reflecting the rate of responding per stimulation. This revealed that only Cre+ cue rats consistently showed elevated nose poke behavior above the minimum required to receive stimulations. Across training sessions the active nose poke to stimulation ratio increased for Cre+ cue rats relative to the other groups (Fig. 4C; two-way ANOVA interaction of group and session, F(18,171) = 2.28, p = 0.0033; main effect of group, F(2,19) = 11.34, p = 0.0006). Across all sessions, the average active nose poke to stimulation ratio was significantly higher in Cre+ cue versus no cue rats (Fig. 4D, unpaired t test, t(13) = 3.02, p = 0.0049). Notably, Cre+ no cue rats were not significantly different from Cre− cue controls (Fig. 4D, unpaired t test, t(7.7) = 1.04, p = 0.17) in average active nose poke to stimulation ratio. These data indicate that the presence of cues paired with dopamine neuron activation potentiate the intensity and vigor of self-stimulation.
Sensory cues invigorate VTA dopamine neuron self-stimulation behavior. Cre+ cue rats exhibited more vigorous self-stimulation behavior, relative to Cre+ no cue rats and controls. A, This was evident in the elevated number of active nose pokes made, compared to the number of stimulations received. Cre+ cue rats made many more active nose pokes during the 1 s timeout period that coincided with laser delivery, resulting in a positive active nose pokes stimulation value across most sessions and (B) in total summed across training. In contrast, Cre+ no cue rats made few extra active nose pokes during the time out period, and the sum total of extra active nose pokes was not significantly greater than Cre− controls. C, Similarly, the ratio of active nose pokes to stimulations escalated in the Cre+ cue rats, relative to other groups. D, The average active nose poke to stimulation ratio was significantly higher in Cre+ cue rats, and not different between Cre+ no cue and Cre− controls, despite the former showing otherwise robust self-stimulation behavior. ****p < 0.0001, ***p < 0.001, **p < 0.01, *p < 0.05.
Cue history affects rate of ICSS extinction
We next examined the extinction of ICSS behavior (Fig. 5). Across five sessions, rats were again tethered to an optical cable and allowed to nose poke, where nose pokes had no consequences – no cues or laser were given (Fig. 5A). Behavior varied substantially across groups under these conditions. Rats trained with a cue present during ICSS training maintained a higher level of responding across extinction, compared to the other groups (Fig. 5B, two-way ANOVA interaction of group and session, F(8,76) = 16.18, p < 0.0001; main effect of group, F(2,19) = 65.85, p < 0.0001). However, normalizing extinction nose pokes as a percentage of the final training session nose pokes, we found that the rate of attrition of behavior in rats with a cue history was more sudden than for rats who did not have a cue during training (Fig. 5C, two-way ANOVA interaction of group and session, F(4,52) = 2.979, p = 0.027). This suggests that ICSS behavior in the Cre+ cue group had become heavily dependent on the cue's presence.
Cue history affects the rate of ICSS extinction. A, Following self-stimulation sessions, rats were returned to the testing chambers and allowed to nose poke under extinction conditions, wherein no cues or laser stimulations were delivered. B, Cre+ cue rats maintained higher levels of responding across extinction sessions, relative to Cre+ No cue rats and controls. C, When extinction nose pokes were normalized to the final day of ICSS training, however, extinction was clearly more immediate in Cre+ cue rats, relative to Cre+ no cue rats. ****p < 0.0001, *p < 0.05.
Discussion
Here, we demonstrate that dopamine neurons can support ICSS behavior independent of explicit sensory cueing. We found that brief phasic optogenetic activation of VTA dopamine neurons in the absence of any other external signaling event was sufficient to produce reliable self-stimulation behavior compared to control animals. Critically, in the presence of explicit external cues associated with the receipt of dopamine neuron activation, we observed dramatically potentiated self-stimulation behavior. Rats experiencing dopamine stimulation paired with cues acquired self-stimulation more rapidly, took more stimulations, and responded more vigorously/at a faster rate than rats receiving optogenetic stimulation alone. Removal of cues under extinction conditions also lead to more rapid attrition of behavior in rats with a history of cue paired during training, relative to no cue rats. Our results add to a long line of research showing that electrical or optogenetic activation of dopamine neurons can support intracranial self-stimulation behavior, by clarifying that an uncued, brief increase in dopamine activity can act as a valuable, reinforcing signal akin to a natural reward.
Dopamine neurons in the VTA have long been closely linked to reward learning, motivation, and reinforcement. However, important recent work has begun to question the extent to which dopamine signals reflect behavioral reinforcement as a value signal that substitutes for natural rewards, versus a value-free teaching signal transmitting prediction, salience, novelty, internal state, exploration, or other information (Maes et al., 2020; Sharpe et al., 2020; Kutlu et al., 2021, 2022; Coddington et al., 2023; Markowitz et al., 2023). This work collectively suggests that dopamine itself, divorced from cues, state changes, movement, and other factors, carries no reward-like properties: dopamine serves as a guide of behavior, rather than reinforcing it directly. Our results support a hybrid view. First, we clarify that dopamine neuron activity can directly reinforce actions in the absence of any explicit cue-related event, which supports the classic reward value interpretation. We also show strong evidence that this value signal is malleable and can be potentiated by the presence of sensory cues that have been explicitly paired with dopamine neuron activity. This suggests that dopamine neuron signals do not necessarily represent a set static value, but the impact of a given dopamine event is modulated by other features of the learning context, including additional sensory information to support the establishment of associative contingencies. Consistent with this, previous work has shown that in the context of Pavlovian learning, noncontingent, unsignaled dopamine neuron activation does not produce obvious behavioral consequences, including nonspecific movement (Coddington and Dudman, 2018; Saunders et al., 2018; Pan et al., 2021). In contrast, if that stimulation event is signaled to the animal with a sensory cue, or if it occurs following a specific action, robust learning and behavioral reinforcement can occur.
Our results suggest that phasic dopamine transmission has the ability to instantiate some basic value to directly support reinforcement, but that signal must interact with other brain processes and states evoked by cues, rewards, and actions for its impact to be fully realized. The extent to which dopamine's contribution to the net “value” within a learning and decision-making context will likely depend on the details - the identity, duration, configuration, and learning history, etc. - of those other features. This makes sense as, in real behavioral contexts, where dopamine neurons are never activated in isolation, a myriad other processes modulate dopamine neuron activity and function, including sensory, homeostatic, ingestive, and motor control systems (Redgrave and Gurney, 2006; McCutcheon, 2015; Tian et al., 2016; Coddington and Dudman, 2019; Grove et al., 2022). Thus, value is not solely the domain of dopamine, and in many cases, the value component of dopamine's contribution to a given behavior state may be negligible (Adamantidis et al., 2011; Maes et al., 2020; Sharpe et al., 2020; Zell et al., 2020; Coddington et al., 2023; Markowitz et al., 2023). Collectively, this supports a framework where VTA dopamine serves an important role in credit assignment (what led to that?) as well as evaluation (how was it?).
Our results are consistent with a recent study (Fraser et al., 2023) demonstrating that VTA dopamine neuron activity supports flexible self-stimulation behavior. Rats will engage in complex seeking tasks, including completing a chain of actions, to receive VTA dopamine neuron activation. Further, the presence of cues paired with VTA dopamine neuron activation promotes higher breakpoints on a progressive ratio schedule. While we did not manipulate the effort required to self-stimulate, collectively our data and this study suggest that VTA dopamine neurons can assign value to stimuli and actions, which facilitates both vigorous but also flexible behavior. In contrast, self-stimulation of SNC dopamine neurons, while similar to VTA levels for continuously reinforced responding, does not persist when multiple actions are required to produce stimulation (Fraser et al., 2023). Thus while VTA and SNC dopamine neurons may broadly signal a similar base value (Ilango et al., 2014; Hollon et al., 2021), the learning processes they facilitate are qualitatively different (Wise, 2009; Howe and Dombeck, 2016; Saunders et al., 2018; Keiflin et al., 2019).
In our studies, the presence of an explicit cue dramatically elevated dopamine neuron self-stimulation behavior, where some rats made thousands of responses in 1 h. Cued rats not only received many more stimulations than noncued rats, they also nose poked at more than double the rate. This particular expression of self-stimulation could be consistent with the development of compulsive-like behaviors thought to underlie addiction and related diseases. While we did not assess the extent that cued ICSS was potentially more resistant to devaluation or punishment, our data suggest that cues play a prominent role in the development of highly vigorous dopamine seeking, which could be indicative of a compulsive-like phenotype. Recent studies have shown that individual differences in the extent to which dopamine neuron self-stimulation becomes compulsive-like relate to the emergence of unique plasticity within corticostriatal circuits (Pascoli et al., 2015, 2018). Notably, individual differences in cue responsivity and dopamine activation patterns likely interact to produce different vulnerabilities in these disease states (Flagel and Robinson, 2017; Poisson et al., 2021).
The conceptual landscape for VTA dopamine's functions grows ever more complex, as new tools and models allow for greater precision in manipulation and measurement of the system during behavior. Our results are consistent with an emerging view of VTA dopamine as both reward and value-free teaching signal. This work, in combination with several other recent studies, highlights the importance of the learning context in which dopamine's functions are being probed, for determination of its function.
Footnotes
The authors declare no competing financial interest.
This work was supported by National Institutes of Health Grants R01 DA035943, F32 DA036996, R00 DA042895, R01 MH129370, and R01 MH129320.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.