The current study examined the ability of ADHD individuals to assign and act upon internal action values across stages of a sequential decision-making task. Participants with/without ADHD made decisions across a three-step deterministic tree task. First, we used reinforcement learning modeling to gain estimates of the latent and internal action values for each individual, trial and step. We replicated previous findings showing that across both groups, estimates of choice accuracy and RT variability improved when the two available actions were well differentiated in terms of their latent values. Second, we found that ADHD individuals were less sensitive to initiatory actions value differences across both choice accuracy and RT variability estimates compared with their HC. Specifically, a larger difference between the values of the two available first stage actions had smaller benefit on ADHD choices and RTs compared to HC. Remarkably, these group differences diminished and disappeared when making terminal third stage actions, to a point where the two groups were remarkably indistinguishable. Our results therefore suggested a striking attenuated sensitivity to value actions in ADHD, only in initiatory but not terminal task action.
The current finding extends previous literature in several ways. First, previous studies have already demonstrated that individuals with ADHD are uniquely characterized by inconsistent decision-making, as apparent in both choice and reaction times estimates 52,53. These studies further suggested that these behavioral inconsistencies in ADHD tend to wash out under conditions of immediate reward. For example, ADHD have been found to exhibit marked behavioral inconsistency in reaction times (i.e., increased RT variability) when performing sustained attention/executive function tasks (for review see 50,53,54). In a meta-analysis report, RT variability was further found to show the highest effect-size for group differences, thus the mechanisms that contribute to this ADHD phenotype can be of much importance to how we understand ADHD. The sustained attention/executive functions tasks that reported increased RT variability in ADHD were mostly performed in the absence of any immediate reward or feedback. Remarkably, when immediate feedback or reward was delivered, ADHD demonstrated a significant improvement in RT variability estimates. Improved RT variability with delivery of immediate reward was found in a variety of paradigms including go/no-go 54, choice reaction time task 55,56, simple reaction time 57, and time discrimination task 58,59. However, these studies are characterized almost exclusively with choice-feedback coupling, such that outcome is given after a single choice and use between-block manipulations (blocks with/without feedback). Here, we show both increased RT variability in ADHD compared to HC, and the absence of group difference in RT variability estimates within a single trial. This is important since it refutes claims that a general motivation induced across a block is the sole reason for improved performance in ADHD under conditions of immediate reward/feedback. Instead, our finding points towards a more refined value-action mechanism that leads to increased inconsistency in ADHD.
Second, we find that only for first stage choices, ADHD showed attenuated sensitivity to action values in terms of RT variability. Therefore, one explanation for both increased inconsistency in RTs in ADHD 50,53,54, and previous findings regarding the improvement in this estimate under immediate reward 54–57,59 might be an attenuated ability to assign and follow internal action-values. Specifically, when performing a task in the absence of immediate reward/feedback, the individual is required to hold in mind the overall value of task completion. This might lead to noisier action values in ADHD compared with HC, which in turn is reflected in noisier reaction times. Therefore, individuals need to hold an overall value of task completion to keep interest and guide the selection of accurate actions. Since individuals with ADHD have known deficits in reward neural processing, and tend to discount future outcomes 59, they may encounter particular difficulty in assigning value to current actions based on a distant goal. Indeed, reinforcement learning studies have confirmed both in simulations and empirical studies that a reduced ability to assign value to actions lead to prolonged and more inconsistent RTs, a finding that we also replicated in the current study across groups. Therefore, we speculate that the process of internally assigning value to actions, can be a major contributing mechanism to increased RT variability in ADHD.
Why do ADHD individuals exhibit noisier choices during first stage choices? The current study cannot offer a clear explanation, yet we will discuss a few explanations. First, it might be that ADHD individuals are more explorative in their behavior 6,25,52,60,61. Specifically, a few reinforcement learning studies suggested that less value-directed behavior in ADHD might be the results of directed decision to explore, rather than reduced ability to follow action values. For example, Dubois et al., 62 examined explore/exploit decision making between children, early and late adolescence. They found that participants with high scores in self-report symptoms of ADHD made more exploratory decisions. Next, Dubois & Hauser 63 further linked between exploration and the impulsivity component within ADHD, in an explore/exploit decision making task. Addicott et al., 64 also examined explore/exploit decision making using the 6-armed bandit task where on each trial, six bandit options were depicted on a computer screen and participants selected one and received a reward in various probabilities. They compared ADHD and HC groups of adults on explore/exploit decisions that were modeled using reinforcement learning algorithms. They found that ADHD participants made more exploratory decisions (i.e., chose options without the highest expected reward value) and earned fewer points than HC in all three study days. Frank et al., 25 examined go/no-go decision making between ADHD and HC groups of young adults. They additionally used computational models in order to predict participants' behavior. They found that ADHD participants were more likely to display inconsistent choices from trial to trial, and that increased inconsistent behavior was correlated with increased variability in reaction times.
Another explanation of the reduced tendency to act according to value actions in first stage choice in ADHD might be the temporal distance from the outcome. That is, in the current task we cannot rule out the fact that it is not the actual sequence or the fact that first stage choices were initiatory actions in a sequence of three decisions. Rather it might be that the temporal delay per-se is the reason for the reduced ability to follow action values. We note that a previous study suggested that counter intuitively, ADHD individuals improve in their ability to learn from feedback when action outcome epoch is prolonged 65. However, dedicated studies should carefully disentangle temporal delay from task stages to account for this explanation. Finally, it might be the complexity of the task that was difficult for the ADHD participants during first stage choices. Studies have shown reduced working memory in ADHD 36,40,66–70, which in turn might reduce their ability to hold in mind and follow the task structure during first stage choices.
Finally, a few limitations should be mentioned. First, our design does not allow us to differentiate whether ADHD individuals adopted different speed-accuracy response criteria in first stage choices. Further studies can address this by using dedicated models (e.g., evidence accumulation). Second, our study cannot disentangle whether temporal distance from the outcome, unique properties of first stage actions, or both are behind the effect. Finally, our study was performed in a college student population, and further studies are needed to demonstrate the generalization to the children population.
In conclusion, individuals with ADHD exhibit reduced choice accuracy and higher RT variability compared to HC individuals for first-stage, initiatory action, but not for the third stage, terminal action. Our computational modelling uncovered that individuals with and without ADHD assign and act upon internal action values in sequential decision-making tasks, and as shown here there are differences between the groups in the ability to do so. This highlights the need to address and deeper understand the internal action values as an important mechanism in ADHD.