External globus pallidus input to the dorsal striatum regulates habitual reward-seeking behavior

doi:10.21203/rs.3.rs-2210532/v1

Download PDF

Article

External globus pallidus input to the dorsal striatum regulates habitual reward-seeking behavior

https://doi.org/10.21203/rs.3.rs-2210532/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 12 Jul, 2023

Read the published version in Nature Communications →

Version 1

posted

You are reading this latest preprint version

The external globus pallidus (GPe) coordinates action-selection through GABAergic projections throughout the basal ganglia. GPe arkypallidal (arky) neurons project exclusively to the dorsal striatum, which regulates goal-directed and habitual reward-seeking. However, the role of GPe arky neurons in reward-seeking remains unknown. Here, we identified that a majority of arky neurons target the dorsolateral striatum (DLS). Using fiber photometry, we found that arky activities were higher during random interval (RI; habit) compared to random ratio (RR; goal) operant reward-seeking. Support vector machine analysis demonstrated that arky neuron activities have sufficient information to distinguish between RR and RI behavior. Genetic ablation of this arky^GPe◊DLS circuit facilitated a shift from goal-directed to habitual behavior. Conversely, chemogenetic activation reduced habitual seeking-behaviors, which was blocked by systemic D1R agonism. Our findings reveal a novel role of this arky^GPe◊DLS circuit in constraining habitual reward-seeking, which is relevant to addictive behaviors and other compulsive disorders.

Biological sciences/Neuroscience/Neural circuits

Biological sciences/Neuroscience/Reward

external globus pallidus

arkypallidal neuron

goal-directed behavior

habit

reward-seeking behavior

The external globus pallidus (GPe) has often been thought of as a relay nucleus in the indirect pathway of the basal ganglia, receiving input from the dorsal striatum, and projecting to downstream output targets^1–3. The importance of the globus pallidus in motor function and clinical relevance in movement disorders has long been appreciated^4–10. More recently, the GPe has also been implicated in non-motor functions such as decision-making and reward-seeking behaviors by coordinating output from the dorsomedial (DMS) and dorsolateral (DLS) striatum, which are known to regulate goal-directed and habitual reward-seeking, and compulsive disorders such as addiction^6,11−14.

This is further supported by cell- and circuit-specific characterization of GABAergic GPe neurons showing distinct projections throughout the basal ganglia^6,15. Evidently, the GPe is subdivided into primarily two types of GABAergic projection neurons, prototypic and arkypallidal (arky)^15–18. Prototypic neurons represent approximately two-thirds of GPe neurons and innervate the downstream subthalamic nucleus (STN) and other output nuclei^16,18. In contrast, arky neurons comprise approximately one-quarter of GPe neurons and project back to the dorsal striatum^16,18. While pavalbumin (PV) is the primary cellular marker for the prototypic neurons, neuronal PAS domain protein 1 (NPAS1) and forkhead box p2 (FOXP2) are two main molecular markers defining arky neurons^15,16,18,19. Interestingly, increased arky activities have been shown to reduce dorsal striatum neural activities and inhibit dorsal striatum-dependent motor behaviors^20,21. Neural activity changes in the DMS and DLS have also been shown to underly the transition between goal-directed and habitual reward seeking^{11,13,22−26}. However, whether GPe arky neurons also control behavioral inhibition in reward-seeking behaviors through this feedback circuit to the dorsal striatum has not been studied.

In the present study, we provide a novel role of GPe arky neurons in goal-directed and habitual reward-seeking. Using fiber photometry Ca²⁺ imaging, computational modelling, genetic ablation, chemogenetic, and behavioral approaches, we revealed how the arky^GPe◊DLS circuit regulates action-selection and habit suppression.

Arkypallidal neurons primarily project to the DLS. To determine if GPe arky neurons showed preferential projection to either the DMS or DLS, we injected an anterograde virus [pAAV1-CamKII(1.3).eYFP.WPRE.hGH] into the GPe. We then examined the distribution of synaptic terminals in various output regions using confocal imaging (Fig. 1a). Interestingly, GPe arky neurons preferentially projected to the DLS compared to the DMS (Fig. 1b). Prototypical projections were also observed in the STN and SNr. Next, we injected retrobeads into the DMS or DLS and examined the retrograde signal in the GPe and cortex (Fig. 2a). Then we used GPe cell-specific markers FOXP2 (arky) and PV (prototypic) to confirm the arky marker-labeled cells overlap with the retrobeads in the GPe (Fig. 2b). As previously characterized, we confirmed prefrontal cortical areas projecting to the DMS and more significant motor cortical projections to the DLS^{13,22,27−29}. Again, we observed predominant retrograde signal in the GPe from the retrobeads injected into the DLS. Consistent with previous estimates, most of the DLS-projecting neurons in the GPe expressed FOXP2 (~ 80%), but not PV^15–17.

Mice exhibit goal-directed or habitual behaviors through training on random operant schedules. Based on previous findings that GPe arky neurons are important for regulating dorsal striatum-dependent behaviors^20,21, we sought to determine the temporal dynamics of arky neurons during goal-directed and habitual behavior for a 20% sucrose reward using the genetically encoded calcium-sensitive fluorescent proteins, GCaMP6³⁰. A retrograde virus expressing GCaMP6s (AAV-hSyn1-GCaMP6s-P2A-nls-dTomato) was injected into the DLS, and we recorded intracellular Ca²⁺ signal using fiber photometry during the last training sessions of random ratio (RR; goal) and random interval (RI; habit) schedules, where the operant behaviors were presumably sufficiently learned (Fig. 3a-c). During magazine training (MT), both groups of mice showed reduced latency to the magazine from the first day to the last day. Both RR and RI groups showed increased nose poke rates across sessions and an increased likelihood to choose the active nose-poke hole compared to the inactive hole (Two-way RM ANOVA, p < 0.05; Fig. 3d). In the devaluation test, which compares nose poking between the valued and devalued conditions, RI-trained mice did not show a decrease in nose poking in the devalued condition, indicating habitual reward-seeking (Wilcoxon test, p > 0.05; Fig. 3e). RR-trained mice showed a reduction in nose poking in the devalued condition, indicating goal-directed reward-seeking (p < 0.05; Fig. 3e).

Arkypallidal neurons exhibit increased Ca ²⁺ signaling during habitual (RI) reward-seeking. After confirming that mice showed goal-directed (RR) or habitual (RI) reward-seeking in the reward-devaluation task, we examined GPe arky neural activities surrounding the six behavioral events: rewarded nose-poke (NP R+), unrewarded nose-poke (NP R-), rewarded magazine entry (M_entry R+), unrewarded magazine entry (M_entry R-), rewarded magazine exit (M_exit R+), and unrewarded magazine exit (M_exit R-). We aligned the Ca²⁺ signal data 2 seconds prior and 2 seconds following each behavioral event (4s total, 120 frames; Fig. 4a-b). Mice showed increased GPe arky neuron activities during RI120 task compared to the RR20 task for rewarded and unrewarded nose poke (Two-way RM ANOVA, p < 0.05; Fig. 4c), and rewarded magazine entry (p < 0.05; Fig. 4f). GPe arky Ca²⁺ signal was significantly higher in the RR20 task for rewarded magazine exit compared to RI 120 (p < 0.05; Fig. 4i). However, no effect of operant schedule for unrewarded magazine entry or unrewarded magazine exit was observed (p > 0.05; Fig. 4f, i). Specific time ranges for significant post hoc comparisons between operant schedules are presented in Supplementary Table 3. For RI mice, comparison of mean arky activities before and after each behavioral event showed a significant increase in Ca²⁺ signal following rewarded nose poke, rewarded magazine entry, and unrewarded magazine entry (paired t-test, p < 0.05; Fig. 4d, g). GPe arky Ca²⁺ signal was decreased following unrewarded magazine exit (p < 0.05; Fig. 4j) without differences in rewarded magazine exit or unrewarded nose poke (p > 0.05, Fig. 4d, g, j). For RR mice, we found a significant increase in Ca²⁺ signal following rewarded nose pokes (paired t-test, p < 0.05; Fig. 4d) without changes in the other 5 behavioral events (p > 0.05, Fig. 4d, g, j). The degree of change was significantly higher in RI mice compared to RR mice for rewarded nose poke, unrewarded nose poke and rewarded magazine entry (unpaired t-test, p < 0.05; Fig. 4e, h).

Additionally, we examined whether the temporal cellular activities at the time of action-selection were stable or changed across the duration of an operant session. We used a regression analysis to compare the relationship between degree of change in Ca²⁺ signal surrounding each of the behavioral events with the progression of the behavioral trial. Due to the high variability in the total number of behavioral events across individuals and operant schedules, we transformed each trial into 10 blocks, each representing 10% increments of that behavioral event for the session. GPe arky Ca²⁺ signal for RI mice is progressively increased in activity change across the duration of a trial for rewarded nose poke, unrewarded nose poke, rewarded magazine entry, and unrewarded magazine entry (linear regression, p < 0.05; Supplementary Fig. 1a-d). During the RR task, arky activities are only increased across the trial duration for rewarded nose poke and rewarded magazine entry (p < 0.05; Supplementary Fig. 1a, c). Overall, arky Ca²⁺ signal was increased across trial duration at a greater rate for RI compared to RR for unrewarded nose poke and rewarded magazine entry (p < 0.05; Supplementary Fig. 1b-c).

GPe arkypallidal neuronal activities have information sufficient to distinguish goal-directed and habitual reward seeking. To identify whether GPe arky Ca²⁺ signal can distinguish which type of reward-seeking mice exhibited, we trained a support vector machine (SVM), a supervised learning model with minimal risk of overfitting and demonstrated utility in analyzing neural activity data^31–34. To accommodate the temporal dynamics of GPe arky neurons, as opposed to the average value of GPe arky neuron activities, we utilized all the trials from the RR and RI tasks. In addition, the time window was set to 2 seconds before and after the behavioral events. Our model can successfully differentiate arky neural activities between the RR and RI tasks for all behavioral events (Fig. 5a, Supplementary Table 1). Neural activities surrounding the nose poke behavioral events especially were a strong predictor of the reward-seeking type (Post hoc Dunn’s test; Fig. 5a-b; p < 0.05 for accuracy, sensitivity, and specificity compared to other behavioral events). Together, this indicates that GPe arky Ca²⁺ signal could be a predictor of action-selection underling goal-directed and habitual reward-seeking.

Caspase3-dependent ablation of GPe arkypallidal projections to the DLS shift mice towards habitual behavior. To determine whether ablation of this arky^{GPe→ DLS} circuit modulates goal-directed or habitual reward-seeking, we used a Cre-dependent caspase 3 virus which induces cell-autonomous death with minimal toxicity to neighboring cells^35–39. We bilaterally injected an mCherry-tagged retrograde virus expressing Cre recombinase (AAV-Ef1a-mCherry-IRES-Cre) into the DLS, followed by a second injection of Cre-dependent caspase-3 (pAAV5-flex-taCasp3-TEVp; or control pAAV5-Ef1a-DIO EYFP) into the GPe (Fig. 6a). We validated a significant reduction in mCherry-positive neurons in the GPe of the caspase group (unpaired t-test, p < 0.05; Fig. 7b-c). Supporting that GPe arky ablation disinhibits DLS cellular activities, caspase mice had a significantly higher number of cFos-positive cells in the DLS compared to the control group, but not in the DMS (p < 0.05; Supplementary Fig. 2a-b).

Since previous studies have implicated GPe arky neurons being associated with motor function^20,21, we examined if GPe arky neuron ablation resulted in motor dysfunction. In the open field test, we observed no significant differences in spontaneous locomotion or velocity, indicating that partial arky^{GPe→ DLS} circuit ablation does not alter basic motor function (Mann-Whitney test, p > 0.05; Supplementary Fig. 2d). In the first 10 minutes, we found no significant changes in time in the center zone, or center zone entries (p > 0.05; Supplementary Fig. 2e), suggesting no observable impact on anxiety-like behavior. To assess dorsal striatum-dependent motor learning, we utilized an accelerated rotarod paradigm which has previously been shown to result in DLS-dependent skill acquisition⁴⁰. Both groups learned the task well indicated by a significant increase in latency to fall across training sessions (Two-way RM ANOVA, p < 0.05; Supplementary Fig. 2g) without significant overall group differences in latency to fall, nor were there any group differences in the change across sessions (p > 0.05; Supplementary Fig. 2g). However, for daily average latency to fall values, we found an interaction between group differences and the day of testing (p < 0.05; Supplementary Fig. 2h). The caspase group had a shorter latency to fall time for days 1 and 2 (p < 0.05; Supplementary Fig. 2h), but similar in the remaining three training days, indicating that partial arky^{GPe→ DLS} circuit ablation may slow the initial motor learning without long-term effects.

During magazine training, both control and caspase mice showed reduced latency to the magazine from the first day to the last for both RR and RI (Two-way RM ANOVA, p < 0.05; Fig. 6e) training without differences between caspase and control mice for RR or RI groups. During operant training, mice showed increased nose poke rates for both RR and RI (p < 0.05; Fig. 6f) schedules in both the caspase and control groups. Altogether our results demonstrate arky^{GPe→ DLS} circuit ablation does not impair overall performance or acquisition rate in the operant reward-seeking task. In the devaluation test, control mice in the RR group showed a reduction in extinction session nose pokes for the devalued state (Wilcoxon test, p < 0.05; Fig. 6g), consistent with goal-directed behavior. Interestingly, RR caspase mice exhibit habitual behavior (p > 0.05; Fig. 6g), indicating that loss of arky^{GPe→ DLS} circuit function promotes a shift from goal-directed to habitual behavior. In contrast, RI caspase mice showed no significant differences between the valued and devalued states in both sham control and caspase mice (p > 0.05; Fig. 6h).

To determine whether this shift towards habitual behavior is specific to sucrose reward-seeking, we trained an additional set of mice with a 10% sucrose and 10% ethanol solution reward (10S10E). Similar to the sucrose reward-seeking paradigm, in the devaluation extinction test, we found no difference between the valued and devalued states for caspase mice on the RR operant schedule (p > 0.05; Supplementary Fig. 3a). Also, both RI caspase and sham control mice exhibited no differences between the valued and devalued states (p > 0.05; Supplementary Fig. 3b). To determine whether this shift to habitual behavior is possibly due to a change in motivation or valuation of the reward, or a more specific reinforcement learning process, we compared 10% ethanol preference and consumption between control and caspase mice in a 24h continuous-access two-bottle choice paradigm. We found no difference in 10% ethanol preference or consumption (Two-way RM ANOVA, p > 0.05; Supplementary Fig. 3c) between the caspase and control mice, suggesting that habitual seeking behavior is not necessarily correlated to reward preference.

Chemogenetic activation of GPe arkypallidal neurons reduces overall seeking-behaviors during valuation extinction testing. To determine if activation of GPe arky neurons could inhibit or reverse RI habitual reward-seeking, we selectively expressed the Gq-coupled designer receptors exclusively activated by designer drugs (DREADD) in arky neurons by first injecting a retrograde virus expressing Cre recombinase into the DLS (pENN.AAV.hSyn.HI.eGFP-Cre.WPRE.SV40). Next, we injected a Cre-dependent hM3Dq DREADDs virus into the GPe [pAAV5-hSyn-DIO-HM3D(Gq)-mCherry; Fig. 7a] and trained mice on an RI schedule (Fig. 7b). We confirmed DREADD expression in arky neurons via the overlapping of the mCherry with the FOXP2 cellular marker (Fig. 7c). During magazine training, mice showed reduced latency to the magazine from the first day to the last and increased nose poke rates across training sessions (Two-way RM ANOVA, p < 0.05; Fig. 7d). To test the effect of arky activation on habitual behavior, C21 (1 mg/kg i.p.) was administered 30 minutes before the extinction test for the valued and devalued states. As arky neurons have been shown to inhibit both dopamine 1 receptor (D1R) and dopamine 2 receptor (D2R)-expressing neurons in the dorsal striatum^17,19−21, we sought to determine whether any behavioral changes primarily occurred via D1R- or D2R- dependent mechanisms by testing combined C21 + Raclopride (D2R antagonist; 0.1 mg/kg) and C21 + SKF38393 (D1R agonist; 1.0 mg/kg) injection groups (Fig. 7b). In the devaluation test, we observed no significant differences between the valued and devalued states for saline, C21, C21 + raclopride, nor C21 + SKF (Wilcoxon test, p > 0.05) injection groups (Fig. 7e). However, the C21 injection significantly reduced nose pokes and magazine entries (Dunn’s post hoc test, p < 0.05; Fig. 7f-g) compared to the saline injection. Addition of raclopride similarly reduced nose poke, magazine entries, and magazine duration (p < 0.05; Fig. 7f-h) compared to saline injections. Interestingly, only coadministration of the D1R agonist, SKF38393, prevented C21-induced reductions in seeking-behaviors (p > 0.05; Fig. 7f-h), indicating behavioral effects of arky activation may primarily be through D1R-expressing dMSN’s.

In the present study, we provide a novel role in the function of this arky^GPe◊DLS circuit. This is, to the best of our knowledge, the first time showing that GPe arky neurons exhibit notably distinct activity patterns between RR and RI operant schedules, which develop goal-directed and habitual behavior, respectively. In addition, targeted genetic ablation of arky^GPe◊DLS circuit promoted a transition from goal-directed to habitual reward-seeking, whereas chemogenetic activation led to reduced habitual reward-seeking, which was prevented by D1R agonism as we illustrated in Supplementary Fig. 5.

Our findings initially seemed to contradict since we observed an increased Ca²⁺ signal during the habit training. However, it is important to note that the behavioral shaping time points in RI120 and RR20 are different than the actual assessment of goal-directed and habitual behaviors through devaluation and extinction. One possibility is that GPe arky activities during the behavioral training represent a counteracting signal to DLS activities that is more prominent in habitual behavior^17,20,21,41. Consequently, higher GPe arky activities during RI habit shaping may also underlie optimization of action-selection in the distinct random operant schedules. RR operant conditioning maintains the action-outcome association and administers reward directly proportional to the animal’s effort and response rates. However, RI conditioning is solely time-interval based and does not deliver increased reward with increased nose-poke rates^{11,13,24,27,42,43}. Thus, optimal effort for reward during RI conditioning is for animals to suppress seeking-behaviors between the random time intervals. This is further supported by reduced nose-poke rates in the RI operant schedules compared to RR (Fig. 3d, 6e), as well as the suppressed reward-seeking behaviors following chemogenetic activation (Fig. 7e, Supplementary Fig. 4). On the other hand, it is also possible that GPe arky neurons could support behavioral flexibility and action-selection through an inverted U-curve relationship frequently seen in cortical dopamine signaling and cognitive control in other studies^44–46. Future work examining simultaneous activities in GPe arky and DLS medium spiny neurons (MSN) may clarify the competing striatal “go” and arky “stop” relationship in habitual reward-seeking.

Arky neurons suppress dorsal striatum-dependent behaviors through GABA release onto striatal neurons^20,21. Indeed, our study demonstrated that GPe^{Arky → DLS} circuit ablation resulted in increased cellular activity markers in the DLS, but not the DMS (Supplementary Fig. 1a-b). Similarly, previous whole-GPe lesion led to increased dopamine-induced striatal cFos expression⁴⁷. Circuit studies have shown that arky neurons project to fast spiking interneurons as well as both direct and indirect MSN (dMSN, iMSN) in the dorsal striatum^6,17,19. We showed that chemogenetic activation of GPe arky neurons reduced habitual seeking-behaviors, but it remained unknown which cellular target may primarily mediate these behavioral effects. Notably, only systemic coadministration of C21 and the D1R agonist SKF38393 prevented C21-induced reductions in habitual seeking behaviors (Fig. 7f-h). This suggests that arky neurons may suppress habitual seeking behaviors primarily through D1R-expressing dMSN’s in the DLS. This is consistent with previous findings showing that habit development is accompanied by strengthening of both dMSN and iMSN in the DLS, whereas habit suppression was solely associated with weakened dMSN output⁴¹. Our findings are limited due to the systemic injection of the D1R agonist. Future studies with pharmacologic microinjections or optogenetic manipulations of DLS dMSN and iMSN will validate the precise impact of arky neurons in the DLS. Ex vivo electrophysiology studies will also be beneficial to characterize these circuit-specific effects and determine if GPe arky activities can alter neuroplastic changes in the DLS including potentiation of corticostriatal glutamatergic synapses that accompany habit development^{11,22,24,27,29}. The current study targeted arky neurons via a projection-dependent strategy using retrograde viruses injected into the DLS and showed significant overlap with the FOXP2 cellular marker. While all GPe cells that express FOXP2 project back to the striatum, small numbers of non-FOXP2 + pallidostriatal neurons have been identified, which include NPAS1+, LHX6+, or even a small percentage of PV + neurons^15,16. It is possible that these sub-populations of arky neurons may have distinct dorsal striatum targets. Thus, future studies are warranted to complement these projection-based studies using various Cre- mouse lines or promoter-driven viruses to determine if arky subtypes preferentially interact onto dMSN, iMSN, or other interneurons in the dorsal striatum.

While we provide evidence that the arky^GPe◊DLS circuit regulates goal-directed and habitual behavior, it remains unclear whether reward prediction and action-outcome contingencies are computed in the GPe or in its afferent projections. The largest innervation of the GPe is from GABAergic iMSN’s in the dorsal striatum and glutamatergic neurons in the STN^1,11,13. While the DMS and DLS have not been documented to have direct communication, our studies provide a possibility that arky neurons could form a more direct circuit between dorsal striatal regions and directly regulate the transition between goal-directed and habitual behavior. Dorsal striatum iMSN’s have been shown to change GPe arky activities directly and indirectly through a disynaptic circuit involving GPe prototypic neurons^20,48. However, it has not been studied in the context of dorsal striatum lateralization, nor for habitual reward-seeking behaviors. A small portion of dMSN’s also reportedly project to the GPe as well and could contribute to arky neuron activities⁴⁸. Interestingly, M1 and M2 motor cortex neurons have been shown to contain glutamatergic projections onto GPe arky neurons, but its functional effects on behavior have not been investigated⁴⁹. As the DLS also receives significant glutamatergic innervation from the motor cortex, we may need to address if there is overlap or separation of motor cortex neurons that directly project to the DLS or GPe arky neurons. Modified rabies virus transsynaptic tracing could prove particularly useful in characterizing di-synaptic circuits through GPe arky neurons, as well as cell-specific tracing in Npas1- or Foxp2-Cre mice^50–52.

Increasing evidence indicates that GPe plays a critical role in non-motor functions including action selection, reward learning and prediction, and behavioral flexibility^1,6. We showed that GPe arky neurons preferentially target the DLS compared to the DMS, which is known to regulate habitual reward-seeking and motor-skill acquisition^{13,22,41,53,54}. Previous behavioral studies investigating GPe arky neurons have focused on motor function and show an essential role for behavioral and locomotive suppression^20,21. Thus, when examining the non-motor function of GPe arky neurons, it is critical to document any effects manipulations may have on basic motor function which could confound results. We show that arky^GPe◊DLS genetic ablation did not affect distance travelled or velocity in the open field test (Supplementary Fig. 2d). We did observe some minor deficits during the first two days of accelerated rotarod, however caspase animals recovered to control levels by day three and beyond (Supplementary Fig. 2h). Despite this, we importantly did not find any deficits during operant conditioning acquisition (Fig. 6e-f).

We primarily utilized sucrose for our behavioral studies as a reward. While it is not completely representative of other addictive drugs, sucrose is a highly motivating reward for operant conditioning in rodents and can produce many of the characteristics of substance use disorders including inflexible reward-seeking and an over-reliance on maladaptive habitual behaviors^{23,24,26,27,55,56}. We showed that loss of GPe arky neurons resulted in inflexible habitual reward-seeking and an insensitivity to reward devaluation. This suggests reduced or dysfunctional arky activities could contribute to substance use disorders in humans, and that their activation or restoration of function could be a potential therapeutic strategy for addiction. Like the sucrose experiments, we found the same shift towards habitual behavior with arky^GPe◊DLS deletion, and reduction in habitual seeking-behaviors following chemogenetic activation using an ethanol-containing reward (Supplementary Fig. 3a, 4a-b). This supports the generalizability of our findings for more addictive rewards and specifically alcohol use disorder. Investigating sex-specific functions of GPe arky neurons will also be critical to this generalizability given documented sex-differences in habit formation and addictive behaviors^57–59. Interestingly, we did not find any baseline differences in ethanol preference or consumption in a two-bottle choice paradigm following arky^GPe◊DLS deletion (Supplementary Fig. 3c). This indicates that arky neurons may specifically regulate habitual reward-seeking and habit development as opposed to valuation of the reward itself. Additional studies may reveal the effects of chronic stress or ethanol exposure on arky regulation of the DLS. Chronic ethanol exposure has been shown to disrupt top-down regulation of the dorsal striatum by the OFC and produce habitual reward-seeking and could similarly explain how dysfunctional bottom-up regulation by arky neurons could lead to maladaptive habitual behaviors⁶⁰.

Overall, our study found that GPe arky neurons play an important role in the regulation of goal-directed and habitual reward-seeking behaviors. Not only did we show distinct activity patterns in GPe arky activity during goal/habit shaping, but targeted manipulation of the arky^GPe◊DLS circuit revealed a role in suppressing habitual reward-seeking. Further, diminished arky^GPe◊DLS function disinhibits DLS activity and led to an inability to properly inhibit reward-seeking upon devaluation. These findings may represent a novel therapeutic strategy for treating addiction and other compulsive disorders.

Animals. All experimental procedures were approved by the Mayo Clinic Institutional Animal Care and Use Committee and performed following NIH guidelines. C57BL/6J mice mice were purchased from Jackson Laboratory (Bar Harbor, ME). Mice were housed in standard Plexiglas cages. The colony room was maintained at a constant temperature (24 ± 1°C) and humidity (60 ± 2%) with a 12 h light/dark cycle (lights on at 07:00 A.M.). We used 8- to 10-week-old male mice for all experiments. Mice were allowed ad libitum access to food and water. For the operant conditioning tests, mice were food restricted to 85% of their baseline weight, at which time they were maintained for the duration of experimental procedures.

Operant conditioning. We conducted operant conditioning using the same operant chambers/schedules as our previous studies^61–63. Briefly, mice were placed in operant chambers (Med-Associates, St Albans, VT) in which they poke a single hole for an outcome of 20% sucrose (dissolved in tap water; 10 ul per reinforcement). For some experiments, sweetened ethanol (10% sucrose, 10% ethanol) was used as the reward. Before training, mice were food restricted to approximately 85% body weight, which was maintained for the duration of experimental procedures. On the first day, mice were trained to approach the reward magazine on a random time schedule with a reward delivered for 30 minutes. Next, mice were trained on a fixed ratio 1 schedule for 1h in 3 sessions. After acquiring nose-poking behavior, mice were trained on random interval (RI30 2 days/RI60 2–3 days/RI120 4 days) to develop habitual reward-seeking or random ratio (RR2 1–2 days/RR5 2–3 days/RR10 2–3 days/RR20 2–3 days) to form goal-directed reward-seeking. Sessions were completed after 30 minutes or following 60 reward reinforcements. RI30/60/120 delivered one reward outcome on average every 30/60/120 sec after the last reward outcome to develop habitual reward-seeking. RR2/5/10/20 delivered one reward outcome on average every 2/5/10/20 response in the correct nose-poke hole to develop goal-directed reward-seeking.

Reward devaluation and extinction test. On the devalued day, mice were given 1 hour of ad libitum access to the outcome (20% sucrose or 10% ethanol/10% sucrose) previously earned by nose poking (for devaluation) or food pellets, and then underwent serial nonreinforced extinction sessions in each training context. The order of the valuation context was counterbalanced across mice and were 10 minutes in duration. Drug treatments were administered IP 30 minutes prior to each extinction session.

Stereotaxic surgery for virus injection. Mice were anesthetized with isoflurane (1.5% in oxygen gas) and placed on the digital stereotaxic alignment system (model 1900; David Kopf instruments). Hair was trimmed and the skull was exposed using 8-gauge electrosurgical skin cutter (KLS martin, Jacksonville, FL). The skull was leveled using a dual-tilt measurement tool. Holes were drilled in the skull at the appropriate stereotaxic coordinates. Viruses were infused to the DLS (AP + 0.7 mm, ML +/- 2.5 mm, DV – 3.1 mm from bregma) or GPe (AP – 0.4 mm, ML +/- 2.0 mm, DV − 3.8 mm from bregma) at 100 nl/min for 4 minutes through a 33-gauge injection needle (cat # NF33BV; World Precision Instruments) using a microsyringe pump (Model UMP3; World Precision Instruments). The injection needle remained in place for an additional 5 min following the end of the injection. All viruses were purchased from Addgene, and were injected at the following titers: pAAV1-CamKII(1.3).eYFP.WPRE.hGH (1x10¹³ vg/mL), retrograde AAV-hSyn1-GCaMP6s-P2A-nls-dTomato (7x10¹² vg/mL), retrograde AAV-Ef1a-mCherry-IRES-Cre (7x10¹² vg/mL), retrograde pENN.AAV.hSyn.HI.eGFP-Cre.WPRE.SV40 (7x10¹² vg/mL), pAAV5-flex-taCasp3-TEVp (7x10¹² vg/mL), pAAV5-Ef1a-DIO EYFP (1x10¹³ vg/mL), pAAV5-hSyn-DIO-hM3D(Gq)-mCherry (7x10¹² vg/mL). Following stereotaxic surgery, we injected buprenorphine sustained release (1 mg/kg, s.c.; ZooPharm, Laramie, WY, USA) to alleviate post-surgery pain.

Immunofluorescence. Brains were fixed with 4% paraformaldehyde (Sigma-Aldrich, St. Louis, MO) and transferred to 30% sucrose (Sigma-Aldrich) in phosphate-buffered saline at 4^oC for 72 hr. Brains were then frozen in dry ice and sectioned at 40 µm using a microtome (Leica Corp., Bannockburn, IL). Brain slices were stored at -20oC in a cryoprotectant solution containing 30% sucrose (Sigma-Aldrich) and 30% ethylene glycol (Sigma-Aldrich) in phosphate-buffered saline. Sections were incubated in 0.2% Triton X-100 (Sigma-Aldrich), 5% bovine serum albumin in phosphate-buffered saline for 1 hr, followed by incubation with the primary antibody in 5% bovine serum albumin overnight at 4^oC. After three washes in phosphate-buffered saline, the sections were mounted onto a glass slide coated with gelatin and cover-slipped with a VECTASHIELD® antifade mounting medium (Vector Laboratories, Burlingame, CA). Images were obtained using an LSM 700 laser scanning confocal microscope (Carl Zeiss, Heidelberg, Germany) using a 10x or 40x water-immersion lens.

In vivo Ca²⁺ signal with fiber-photometry. We recorded the cellular Ca²⁺ transients in real-time in vivo by fiber photometry as described previously ⁶³. Briefly, we implanted an optic cannula into the GPe (AP -0.46 mm, ML + 2.0 mm, DV -3.8 mm from bregma) of mice injected with the retrograde GCaMP in the DLS and recorded fluorescence at 30 frames per second. ΔF/F₀ = [F(t) - F₀]/F₀. F(t) is the fluorescent value at a given time. F₀ is the resting averaged fluorescence value in the 3 s preceding the time alignment.

Classification modeling. To investigate whether GPe arky activity can predict reward-seeking strategy (RR or RI), we trained a support vector machine (SVM) with 4-fold cross-validation^31–34. Average arky Ca²⁺ signal during the 2 s period prior to and following each behavioral event (NP, Mentry, Mexit) was used as input data. To balance the dataset, we performed random under sampling. Random under sampling was repeated 400 times for each analysis and an averaged prediction performance of the SVM was calculated. We used MATLAB (version R2020a) for the SVM analyses.

Caspase 3-mediated cell ablation. To ablate cells in a targeted fashion, we used a genetically engineered caspase 3, whose activation commits a cell to apoptosis³⁷. Endogenous caspase 3 exists as procaspase 3, which is cleaved into its active form by upstream apoptotic signals and other caspase proteins. This genetically engineered caspase lacks the cleavage site for upstream caspases and can only be cleaved by a tobacco etch virus protease (TEVp) which is coexpressed in an AAV³⁶. AAV-flex-taCasp3-TEVp was expressed in a cre-dependent manner in the GPe to only ablate cells that express Cre recombinase from retrograde injections in the DLS (AAV-Ef1a-mCherry-IRES-Cre). Importantly, caspase 3 triggers cell-autonomous apoptosis, minimizing the risk of off-target effects and toxicity to neighboring cells^35,38,39.

Chemogenetics and drug treatments. We purchased compound 21 from Hello Bio (C21; Princeton, NJ). Based on previous experiments⁶³, we administered C21(1 mg/kg), dopamine receptor 1 agonist SKF38393 (1.0 mg/kg), dopamine receptor 2 antagonist raclopride (0.1 mg/kg), or saline i.p. 30 minutes before the experiments in mice. These concentrations have been previously shown to alter reward-seeking behaviors with minimal motor effects or risk of seizure^44,64−68.

Open field test. The open-field test (OFT) was conducted in chambers (Med-Associates, St Albans, VT) to measure locomotor responses of mice. The session lasted 30 minutes and total distance and velocities were recorded using beam breaks. The first 10-minute bin was referenced to measure anxiety-like behavior. Time spent in the open zone (%) was measured as time spent in the open zone / total time x 100.

Rotarod. A computer-interfaced rotarod accelerating from 4–40 rotations per min over 300s was used. Animals were trained with ten trials per day for 5 days (trained every other day). This training protocol was based on previous behavioral, pharmacological, and electrophysiological studies that showed this extended training resulted in DLS-dependent skill acquisition⁴⁰.

Two-bottle choice. Oral ethanol consumption and preference were examined using a two-bottle choice test in the mouse home cage. Mice were individually housed and given 24-hour access to two bottles, water, and ethanol. The concentration of ethanol was raised from 3–6% to 10% ethanol (10E, v/v) on every 4th day to adapt ethanol intake. Every other day the bottle placement was switched to prevent place preference development. After increasing the ethanol concentration to 10% the mice had 14 days of 10E access. Ethanol and water consumption were normalized for evaporation as previously described ⁶⁹. Briefly, the total volume of the liquid evaporated was calculated by averaging 2-days of evaporation from the four control cages without mice. Then, then volume of water or ethanol that evaporated was subtracted from the total consumption of water or ethanol for each mouse.

Data analysis. All data are represented as mean ± SEM using Prism 9.0 (GraphPad Software, San Diego, CA). The statistical significance was set at p < 0.05. Detailed statistical tests and data with exact p values are listed in Supplementary Table 2.

Data availability. All data are available from the authors upon reasonable request

Code availability. All code used in this manuscript is available at https://github.com/brain-machine-intelligence/Baker_2022.

Conflict of Interest

D.S.C. is a scientific advisory board member to Peptron Inc., and the Peptron had no role in the preparation, review, or approval of the manuscript; nor the decision to submit the manuscript for publication. The remaining authors declare that the research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgment

We thank all the laboratory members for their helpful discussion and comments. Figures were created with BioRender.com. This research was supported by the Samuel C. Johnson for Genomics of Addiction Program at Mayo Clinic, the Ulm Foundation, and the National Institute of Health (AA029258, AA028968, AA027773, AG072898). This study was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (NRF-2019M3E5D2A01066267, Development of metacognitive AI for rapid learning), and Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No.2019-0-01371, Development of brain-inspired AI with human-like intelligence).

Author Contributions

MB, SK, and DSC thought of the study. MB, SK, and SH performed all behavioral and tracing experiments. MB, LP, and HE collected, processed, and imaged tissue for histology. MS, MAY, and SWL sorted and analyzed Ca²⁺signaling data and created SVM models. MB and DSC wrote the manuscript. All authors reviewed and edited the manuscript.

Dong, J., Hawes, S., Wu, J., Le, W. & Cai, H. Connectivity and Functionality of the Globus Pallidus Externa Under Normal Conditions and Parkinson's Disease. Front Neural Circuits 15, 645287, doi:10.3389/fncir.2021.645287 (2021).
Schwab, B. C. et al. Synchrony in Parkinson's disease: importance of intrinsic properties of the external globus pallidus. Front Syst Neurosci 7, 60, doi:10.3389/fnsys.2013.00060 (2013).
Wichmann, T. & Dostrovsky, J. O. Pathological basal ganglia activity in movement disorders. Neuroscience 198, 232–244, doi:10.1016/j.neuroscience.2011.06.048 (2011).
Bevan, M. D., Magill, P. J., Terman, D., Bolam, J. P. & Wilson, C. J. Move to the rhythm: oscillations in the subthalamic nucleus-external globus pallidus network. Trends Neurosci 25, 525–531, doi:10.1016/s0166-2236(02)02235-x (2002).
Chazalon, M. et al. GAT-3 Dysfunction Generates Tonic Inhibition in External Globus Pallidus Neurons in Parkinsonian Rodents. Cell Rep 23, 1678–1690, doi:10.1016/j.celrep.2018.04.014 (2018).
Gittis, A. H. et al. New roles for the external globus pallidus in basal ganglia circuits and behavior. J Neurosci 34, 15178–15183, doi:10.1523/JNEUROSCI.3252-14.2014 (2014).
Grabli, D. et al. Behavioural disorders induced by external globus pallidus dysfunction in primates: I. Behavioural study. Brain 127, 2039–2054, doi:10.1093/brain/awh220 (2004).
Liu, J. et al. Facilitation of GluN2C-containing NMDA receptors in the external globus pallidus increases firing of fast spiking neurons and improves motor function in a hemiparkinsonian mouse model. Neurobiol Dis 150, 105254, doi:10.1016/j.nbd.2021.105254 (2021).
Rajput, A. H. et al. Globus pallidus dopamine and Parkinson motor subtypes: clinical and brain biochemical correlation. Neurology 70, 1403–1410, doi:10.1212/01.wnl.0000285082.18969.3a (2008).
Vitek, J. L., Hashimoto, T., Peoples, J., DeLong, M. R. & Bakay, R. A. Acute stimulation in the external segment of the globus pallidus improves parkinsonian motor signs. Mov Disord 19, 907–915, doi:10.1002/mds.20137 (2004).
Yin, H. H. & Knowlton, B. J. The role of the basal ganglia in habit formation. Nat Rev Neurosci 7, 464–476, doi:10.1038/nrn1919 (2006).
Lovinger, D. M. & Gremel, C. M. A Circuit-Based Information Approach to Substance Abuse Research. Trends Neurosci 44, 122–135, doi:10.1016/j.tins.2020.10.005 (2021).
Lipton, D. M., Gonzales, B. J. & Citri, A. Dorsal Striatal Circuits for Habits, Compulsions and Addictions. Front Syst Neurosci 13, 28, doi:10.3389/fnsys.2019.00028 (2019).
Bogacz, R., Martin Moraud, E., Abdi, A., Magill, P. J. & Baufreton, J. Properties of Neurons in External Globus Pallidus Can Support Optimal Action Selection. PLoS Comput Biol 12, e1005004, doi:10.1371/journal.pcbi.1005004 (2016).
Abrahao, K. P. & Lovinger, D. M. Classification of GABAergic neuron subtypes from the globus pallidus using wild-type and transgenic mice. J Physiol 596, 4219–4235, doi:10.1113/JP276079 (2018).
Hernandez, V. M. et al. Parvalbumin + Neurons and Npas1 + Neurons Are Distinct Neuron Classes in the Mouse External Globus Pallidus. J Neurosci 35, 11830–11847, doi:10.1523/JNEUROSCI.4672-14.2015 (2015).
Mallet, N. et al. Dichotomous organization of the external globus pallidus. Neuron 74, 1075–1086, doi:10.1016/j.neuron.2012.04.027 (2012).
Mastro, K. J., Bouchard, R. S., Holt, H. A. & Gittis, A. H. Transgenic mouse lines subdivide external segment of the globus pallidus (GPe) neurons and reveal distinct GPe output pathways. J Neurosci 34, 2087–2099, doi:10.1523/JNEUROSCI.4646-13.2014 (2014).
Glajch, K. E. et al. Npas1 + Pallidal Neurons Target Striatal Projection Neurons. J Neurosci 36, 5472–5488, doi:10.1523/JNEUROSCI.1720-15.2016 (2016).
Aristieta, A. et al. A Disynaptic Circuit in the Globus Pallidus Controls Locomotion Inhibition. Curr Biol 31, 707–721 e707, doi:10.1016/j.cub.2020.11.019 (2021).
Mallet, N. et al. Arkypallidal Cells Send a Stop Signal to Striatum. Neuron 89, 308–316, doi:10.1016/j.neuron.2015.12.017 (2016).
Corbit, L. H., Nie, H. & Janak, P. H. Habitual alcohol seeking: time course and the contribution of subregions of the dorsal striatum. Biol Psychiatry 72, 389–395, doi:10.1016/j.biopsych.2012.02.024 (2012).
Everitt, B. J. & Robbins, T. W. From the ventral to the dorsal striatum: devolving views of their roles in drug addiction. Neurosci Biobehav Rev 37, 1946–1954, doi:10.1016/j.neubiorev.2013.02.010 (2013).
Everitt, B. J. & Robbins, T. W. Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu Rev Psychol 67, 23–50, doi:10.1146/annurev-psych-122414-033457 (2016).
Hong, S. I., Kang, S., Baker, M. & Choi, D. S. Astrocyte-neuron interaction in the dorsal striatum-pallidal circuits and alcohol-seeking behaviors. Neuropharmacology 198, 108759, doi:10.1016/j.neuropharm.2021.108759 (2021).
Luscher, C., Robbins, T. W. & Everitt, B. J. The transition to compulsion in addiction. Nat Rev Neurosci 21, 247–263, doi:10.1038/s41583-020-0289-z (2020).
Everitt, B. J. & Robbins, T. W. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat Neurosci 8, 1481–1489, doi:10.1038/nn1579 (2005).
McGeorge, A. J. & Faull, R. L. The organization of the projection from the cerebral cortex to the striatum in the rat. Neuroscience 29, 503–537, doi:10.1016/0306-4522(89)90128-0 (1989).
Voorn, P., Vanderschuren, L. J., Groenewegen, H. J., Robbins, T. W. & Pennartz, C. M. Putting a spin on the dorsal-ventral divide of the striatum. Trends Neurosci 27, 468–474, doi:10.1016/j.tins.2004.06.006 (2004).
Chen, T. W. et al. Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature 499, 295–300, doi:10.1038/nature12354 (2013).
Gupta, R., Alam, M. A. & Agarwal, P. Modified Support Vector Machine for Detecting Stress Level Using EEG Signals. Comput Intell Neurosci 2020, 8860841, doi:10.1155/2020/8860841 (2020).
Koren, V. Uncovering structured responses of neural populations recorded from macaque monkeys with linear support vector machines. STAR Protoc 2, 100746, doi:10.1016/j.xpro.2021.100746 (2021).
Parvar, H. et al. Detection of event-related potentials in individual subjects using support vector machines. Brain Inform 2, 1–12, doi:10.1007/s40708-014-0006-7 (2015).
Wang, M. et al. Support Vector Machine for Analyzing Contributions of Brain Regions During Task-State fMRI. Front Neuroinform 13, 10, doi:10.3389/fninf.2019.00010 (2019).
Patton, M. S., Heckman, M., Kim, C., Mu, C. & Mathur, B. N. Compulsive alcohol consumption is regulated by dorsal striatum fast-spiking interneurons. Neuropsychopharmacology 46, 351–359, doi:10.1038/s41386-020-0766-0 (2021).
Yang, C. F. et al. Sexually dimorphic neurons in the ventromedial hypothalamus govern mating in both sexes and aggression in males. Cell 153, 896–909, doi:10.1016/j.cell.2013.04.017 (2013).
Chelur, D. S. & Chalfie, M. Targeted cell killing by reconstituted caspases. Proc Natl Acad Sci U S A 104, 2283–2288, doi:10.1073/pnas.0610877104 (2007).
Mallet, V. O. et al. Conditional cell ablation by tight control of caspase-3 dimerization in transgenic mice. Nat Biotechnol 20, 1234–1239, doi:10.1038/nbt762 (2002).
Smart, A. D. et al. Engineering a light-activated caspase-3 for precise ablation of neurons in vivo. Proc Natl Acad Sci U S A 114, E8174-E8183, doi:10.1073/pnas.1705064114 (2017).
Yin, H. H. et al. Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill. Nat Neurosci 12, 333–341, doi:10.1038/nn.2261 (2009).
O'Hare, J. K. et al. Pathway-Specific Striatal Substrates for Habitual Behavior. Neuron 89, 472–479, doi:10.1016/j.neuron.2015.12.032 (2016).
Hilario, M. R. & Costa, R. M. High on habits. Front Neurosci 2, 208–217, doi:10.3389/neuro.01.030.2008 (2008).
Lerner, T. N. Interfacing behavioral and neural circuit models for habit formation. J Neurosci Res 98, 1031–1045, doi:10.1002/jnr.24581 (2020).
Natsheh, J. Y. & Shiflett, M. W. Dopaminergic Modulation of Goal-Directed Behavior in a Rodent Model of Attention-Deficit/Hyperactivity Disorder. Front Integr Neurosci 12, 45, doi:10.3389/fnint.2018.00045 (2018).
Cools, R. & D'Esposito, M. Inverted-U-shaped dopamine actions on human working memory and cognitive control. Biol Psychiatry 69, e113-125, doi:10.1016/j.biopsych.2011.03.028 (2011).
Uddin, L. Q. Cognitive and behavioural flexibility: neural mechanisms and clinical considerations. Nat Rev Neurosci 22, 167–179, doi:10.1038/s41583-021-00428-w (2021).
Miwa, H., Fuwa, T., Nishi, K. & Mizuno, Y. Effects of the globus pallidus lesion on the induction of c-Fos by dopaminergic drugs in the striatum possibly via pallidostriatal feedback loops. Neurosci Lett 240, 167–170, doi:10.1016/s0304-3940(97)00952-x (1998).
Ketzef, M. & Silberberg, G. Differential Synaptic Input to External Globus Pallidus Neuronal Subpopulations In Vivo. Neuron 109, 516–529 e514, doi:10.1016/j.neuron.2020.11.006 (2021).
Karube, F., Takahashi, S., Kobayashi, K. & Fujiyama, F. Motor cortex can directly drive the globus pallidus neurons in a projection neuron type-dependent manner in the rat. Elife 8, doi:10.7554/eLife.49511 (2019).
Kim, E. J., Jacobs, M. W., Ito-Cole, T. & Callaway, E. M. Improved Monosynaptic Neural Circuit Tracing Using Engineered Rabies Virus Glycoproteins. Cell Rep 15, 692–699, doi:10.1016/j.celrep.2016.03.067 (2016).
Wall, N. R., Wickersham, I. R., Cetin, A., De La Parra, M. & Callaway, E. M. Monosynaptic circuit tracing in vivo through Cre-dependent targeting and complementation of modified rabies virus. Proc Natl Acad Sci U S A 107, 21848–21853, doi:10.1073/pnas.1011756107 (2010).
Knowland, D. et al. Distinct Ventral Pallidal Neural Populations Mediate Separate Symptoms of Depression. Cell 170, 284–297 e218, doi:10.1016/j.cell.2017.06.015 (2017).
Daw, N. D., Niv, Y. & Dayan, P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 8, 1704–1711, doi:10.1038/nn1560 (2005).
Yin, H. H., Knowlton, B. J. & Balleine, B. W. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci 19, 181–189, doi:10.1111/j.1460-9568.2004.03095.x (2004).
Bobadilla, A. C. et al. Cocaine and sucrose rewards recruit different seeking ensembles in the nucleus accumbens core. Mol Psychiatry 25, 3150–3163, doi:10.1038/s41380-020-00888-z (2020).
Sieburg, M. C. et al. Reward Devaluation Attenuates Cue-Evoked Sucrose Seeking and Is Associated with the Elimination of Excitability Differences between Ensemble and Non-ensemble Neurons in the Nucleus Accumbens. eNeuro 6, doi:10.1523/ENEURO.0338-19.2019 (2019).
Barker, J. M., Torregrossa, M. M., Arnold, A. P. & Taylor, J. R. Dissociation of genetic and hormonal influences on sex differences in alcoholism-related behaviors. J Neurosci 30, 9140–9144, doi:10.1523/JNEUROSCI.0548-10.2010 (2010).
Becker, J. B. & Chartoff, E. Sex differences in neural mechanisms mediating reward and addiction. Neuropsychopharmacology 44, 166–183, doi:10.1038/s41386-018-0125-6 (2019).
Ngun, T. C., Ghahramani, N., Sanchez, F. J., Bocklandt, S. & Vilain, E. The genetics of sex differences in brain and behavior. Front Neuroendocrinol 32, 227–246, doi:10.1016/j.yfrne.2010.10.001 (2011).
Renteria, R., Baltz, E. T. & Gremel, C. M. Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits. Nat Commun 9, 211, doi:10.1038/s41467-017-02615-9 (2018).
Hong, S. I., Bullert, A., Baker, M. & Choi, D. S. Astrocytic equilibrative nucleoside transporter type 1 upregulations in the dorsomedial and dorsolateral striatum distinctly coordinate goal-directed and habitual ethanol-seeking behaviours in mice. Eur J Neurosci 52, 3110–3123, doi:10.1111/ejn.14752 (2020).
Hong, S. I., Kang, S., Chen, J. F. & Choi, D. S. Indirect Medium Spiny Neurons in the Dorsomedial Striatum Regulate Ethanol-Containing Conditioned Reward Seeking. J Neurosci 39, 7206–7217, doi:10.1523/JNEUROSCI.0876-19.2019 (2019).
Kang, S. et al. Activation of Astrocytes in the Dorsomedial Striatum Facilitates Transition From Habitual to Goal-Directed Reward-Seeking Behavior. Biol Psychiatry 88, 797–808, doi:10.1016/j.biopsych.2020.04.023 (2020).
Abraham, A. D., Neve, K. A. & Lattal, K. M. Activation of D1/5 Dopamine Receptors: A Common Mechanism for Enhancing Extinction of Fear and Reward-Seeking Behaviors. Neuropsychopharmacology 41, 2072–2081, doi:10.1038/npp.2016.5 (2016).
Alleweireldt, A. T., Weber, S. M., Kirschner, K. F., Bullock, B. L. & Neisewander, J. L. Blockade or stimulation of D1 dopamine receptors attenuates cue reinstatement of extinguished cocaine-seeking behavior in rats. Psychopharmacology (Berl) 159, 284–293, doi:10.1007/s002130100904 (2002).
Faure, A., Leblanc-Veyrac, P. & El Massioui, N. Dopamine agonists increase perseverative instrumental responses but do not restore habit formation in a rat model of Parkinsonism. Neuroscience 168, 477–486, doi:10.1016/j.neuroscience.2010.03.047 (2010).
O'Sullivan, G. J. et al. Dopamine D1 vs D5 receptor-dependent induction of seizures in relation to DARPP-32, ERK1/2 and GluR1-AMPA signalling. Neuropharmacology 54, 1051–1061, doi:10.1016/j.neuropharm.2008.02.011 (2008).
Sabioni, P., D'Almeida, V., Andersen, M. L., Andreatini, R. & Galduroz, J. C. SKF 38393 reverses cocaine-conditioned place preference in mice. Neurosci Lett 513, 214–218, doi:10.1016/j.neulet.2012.02.041 (2012).
Nam, H. W. et al. Adenosine transporter ENT1 regulates the acquisition of goal-directed behavior and ethanol drinking through A2A receptor in the dorsomedial striatum. J Neurosci 33, 4329–4338, doi:10.1523/JNEUROSCI.3094-12.2013 (2013).

Yes there is potential Competing Interest. D.S.C. is a scientific advisory board member to Peptron Inc., and the Peptron had no role in the preparation, review, or approval of the manuscript; nor the decision to submit the manuscript for publication.

NCSupplementalInformationfinal.docx
Supplementary Information

Download PDF

Journal Publication

published 12 Jul, 2023

Read the published version in Nature Communications →

Version 1

posted

You are reading this latest preprint version

External globus pallidus input to the dorsal striatum regulates habitual reward-seeking behavior

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Results

Discussion

Methods

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1