Participants
Participants were recruited from a university subject pool, with a total of 356 individuals. Due to a computer error during the experiment, one participant's data was excluded, resulting in a final sample of 355, including 118 with GAD (and low depression), 113 with depression (and low GAD), and 124 healthy controls (HC). Participants were assigned to the GAD group if they met the full diagnostic criteria on the Generalized Anxiety Disorder Questionnaire-IV (GAD-Q-IV; Newman et al., 2002) and scored 13 or less on the Beck Depression Inventory-II (BDI-II; Beck, Steer, & Brown, 1996). They were included in the depression group if they scored 20 or above on the BDI-II and did not meet the diagnostic criteria on the GAD-Q-IV. Participants were categorized as HC if they neither met the diagnostic criteria on the GAD-Q-IV nor scored 14 or above on the BDI-II. Participants were randomly assigned to scripted (either scripted worry or scripted rumination) or personalized inductions (either personalized worry or personalized rumination), with stratification by group (GAD, depression, and HC)[1]. The sample was predominantly female (282 women, 79.4% vs. 73 men, 20.6%). The racial composition included 267 White (75.2%), 37 Asian (10.4%), 22 Hispanic (6.2%), 18 Black or African-American (5.1%), and 11 from other racial backgrounds (3.1%). The mean age was 18.23 (SD = 2.83). The average score for the GAD-Q-IV (using continuous scoring) was 4.88 (SD = 3.88), whereas the average BDI-II score was 12.35 (SD = 10.42). Further details on participant characteristics can be found in Table 1.
Table 1
Demographic characteristics
Variable | Total (N = 355) | GAD (n = 118) | Depression (n = 113) | HC (n = 124) | Statistics |
Categorical variables | | | | | χ² | p | V |
Gender, n (%) | | | | | 2.16 | .340 | .08 |
Women | 282 (79.4) | 99 (83.9) | 87 (77.0) | 96 (77.4) | | | |
Men | 73 (20.6) | 19 (16.1) | 26 (23.0) | 28 (22.6) | | | |
Race, n (%) | | | | | 9.08 | .336 | .11 |
White | 267 (75.2) | 93 (75.0) | 81 (65.3) | 93 (75.0) | | | |
Asian | 37 (10.4) | 13 (10.5) | 13 (10.5) | 11 (8.9) | | | |
Hispanic | 22 (6.2) | 6 (4.8) | 9 (7.3) | 7 (5.6) | | | |
African-American | 18 (5.1) | 1 (0.8) | 8 (6.5) | 9 (7.3) | | | |
Others | 11 (3.1) | 5 (4.0) | 2 (1.6) | 4 (3.2) | | | |
Continuous variables | | | | | F | P | 𝜂2 |
Age, M (SD) | 18.23 (2.83) | 18.30 (3.21) | 18.34 (2.85) | 18.06 (2.42) | .377 | .687 | .00 |
GAD-Q-IV, M (SD) | 4.88 (3.88) | 9.10 (1.58) | 4.40 (3.10) | 1.30 (1.35) | 846.47 | < .001 | .89 |
BDI-2, M (SD) | 12.35 (10.42) | 8.92 (2.93) | 25.75 (6.19) | 3.39 (3.50) | 570.8 | < .001 | .84 |
Note. GAD, Generalized Anxiety Disorder; HC, Healthy Controls; GAD-Q-IV, Generalized Anxiety Disorder Questionnaire-IV-Continuous Score; BDI-2, Beck Depression Inventory-2 |
Procedure
This study was approved by the Institutional Review Board at the authors' affiliated institution. Prior to the experiment, participants provided written consent, and the concepts of worry and rumination were explained to them. Following Borkovec et al. (1983) and Nolen-Hoeksema (1991), worry was defined as “a chain of uncontrollable thoughts and images about things that might happen in the future,” whereas rumination was described as “passively and repetitively thinking about possible causes, implications, and consequences of stressful events and negative feelings as opposed to its solutions.” The experiment began with a 5-minute resting baseline to help participants acclimate to the setting. They then completed the assigned perseverative thought induction task. In the two personalized induction groups, they engaged either with their most worrisome or ruminative topics as vividly as possible for two minutes. In the two scripted induction groups, participants read a list of either worrisome or ruminative topics for eight minutes at their own pace. After the baseline and induction phase, participants rated their levels of worry and rumination. They received research credits for participating. All procedures were implemented using E-Prime 2.0 (Psychology Software Tools inc., 2002).
Instruments
Generalized Anxiety Disorder Questionnaire-IV (Newman et al., 2002)
The GAD-Q-IV is a nine-item self-report measure assessing GAD symptoms following diagnostic criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 2013). Diagnoses can be established either through a dimensional cutoff or by determining if individuals meet full diagnostic criteria. In this study, we opted for criterion scoring based on a prior study (Moore et al., 2014) demonstrating that it provided higher sensitivity (89%) and specificity (82%) than dimensional scoring. The internal consistency of the current sample was great (Cronbach’s α = .82)[2].
Beck Depression Inventory-II (Beck, Steer, & Brown, )
The BDI-II is a 21-item self-report questionnaire that assesses major depressive disorder symptoms. It demonstrated good convergent and discriminant validity (Beck, Steer, & Brown, 1996; Steer et al., 1999) and high retest reliability (Beck et al., 1988). A cutoff score of 18 has high sensitivity (94%) and specificity (92%) (Arnau et al., 2001). We used a more rigorous cutoff − 20 or above for moderate to severe depression, and 13 or less for minimal depression, as suggested by the original study (Beck, Steer, Ball, et al., 1996) - to screen participants. Internal consistency in our sample was strong (Cronbach’s α = .92).
Subjective Emotion Scales
During both the resting baseline and induction phases, participants were asked to rate their levels of worry and rumination using a 9-point Likert scale. The scale ranged from 0, indicating “not at all”, to 8, indicating “extremely”.
Scripted Induction Methods
For script-based induction of worry, we presented items from the Worry Domains Questionnaire (WDQ; Tallis et al., 1994). The WDQ comprises 25 items selected for their high intensity and frequency from the General Worry Questionnaire (GWQ; Tallis et al., 1992). The WDQ covers a wide range of worry-provoking topics, such as relationships, an uncertain future, work-related concerns, and financial issues. Following previous studies (e.g., Freeman et al., 2013; Ikani et al., 2022), we instructed participants to read through the worry-inducing topics at their own pace for eight minutes.
For the scripted rumination induction, we used the method developed by Nolen-Hoeksema and Morrow (1993), consisting of 45 statements that encourage reflection on oneself, along with the causes and consequences of current feelings and situations. Participants read these statements at their own pace for eight minutes. Additional information is available in Method S1.
Personalized Induction Methods
For personalized worry and rumination inductions, we adapted Borkovec and Inz (1990)’s induction method. Before the experiment, participants wrote five scenarios that triggered the most intense worry or rumination for them. They then practiced thinking about each scenario for one minute and rated their level of worry and rumination on a 9-point Likert scale from 0 (not at all) to 8 (extremely). To ensure the scenarios were suitable for the experiment, we pre-screened them using a set of standards. To qualify for emotion manipulation, the target emotion had to score above 4 on the scale and at least 3 points higher than the non-target emotions. Additionally, the temporal orientation had to align with the induction type: for worry, it needed to be future-oriented (above 4), and for rumination, past-oriented (below 4). During the induction period, participants engaged with their most intense worry or rumination scenario for two minutes. Further details are provided in Method S2.
Analytic plan
All analyses were conducted using RStudio version 2023.12.0 + 369 (RStudio Team, 2023). We computed descriptive statistics for demographic characteristics and GAD-Q-IV and BDI-II scores. To assess differences across groups and experimental conditions, we used ANOVA and chi-square tests, followed by pairwise comparisons when results were significant. To compare different induction methods (scripted vs. personalized) and types (worry induction vs. rumination induction), we used random-intercept linear mixed models. These models included group, induction (either method or type), time (resting baseline vs. induction phase), their interactions, and demographic covariates (age, gender, and race). When a significant two-way interaction between induction and time, or a three-way interaction among induction, time, and group was found, we conducted simple slope analyses to examine the significance of the slopes and the differences between them.