Study setting
The cRCT will be conducted at worksites in Japan from September 2023. Approximately 2,700 companies that have contracts with IID Co. Ltd. to outsource the National Stress Check Program will be recruited. This program assesses psychosocial stress in employees at least once a year.29 The potential company participants will cover all regions of Japan, namely Hokkaido, Tohoku, Kanto, Chubu, Kinki, Chugoku, Shikoku, Kyushu, and Okinawa. Based on the Japan Standard Industrial Classification, the potential company participants will cover a wide range of industries, except mining.30
Invitation flyers will be sent to the companies explaining the background and scientific evidence for organizational interventions to prevent mental illness in the workplace as well as the objectives of the present study. Stressful worksites will be prioritized for recruitment to clearly detect the effects of the interventions using the percentage of “high-stress” employees from the previous results of the National Stress Check Program. The definition of “high-stress” is based on a combination of high scores in stress response and job stressors, and low scores in social support. The predictive validity of “high-stress” employees for long-term sickness absence at a one-year follow-up has been reported previously.31
Figure 1 presents the participant flowchart of this study. When the company agrees to participate, a list of worksites will be provided to the research team with information on location, worksite size, industry, and contact details. All employees nested within the worksites meeting the eligibility criteria will be recruited. Informed consent will be obtained from all employees at the baseline survey via paper-signed or online forms. After completion of the baseline survey, the worksites will be randomly allocated to one of the five arms. The intervention programs will last for 6–12 months. The follow-up surveys will be conducted 6, 12, and 18 months after the baseline survey. Of the included employees, 800 will be individually recruited for quantification of cortisol levels extracted from fingernails. These employees will be asked to clip extended fingernails into a zip-lock plastic bag and answer a questionnaire about lifestyle habits associated with fingernails in each of the surveys.
----------------------Insert Fig. 1 here----------------------
Participants
This cRCT will recruit multi-level participants, including worksites and employees. The inclusion criteria for worksites are 1) conducting the National Stress Check Program; 2) analyzing the results of the National Stress Check Program in sufficiently large groups for work environment improvement29; 3) employing 5–299 workers subject to the National Stress Check Program; 4) having at least one supervisor working with at least one subordinate. There are no exclusion criteria for worksites. The inclusion criteria for employees are 1) being aged ≥ 18 years; 2) being able to answer the questionnaires written in Japanese; 3) having access to the Internet and the intervention programs offered online; 4) having a private smartphone available; 5) working at a workplace (or workplaces) where at least one supervisor and one subordinate work together. The exclusion criterion for employees is absence from the workplace because of health problems in the 12 months before baseline. The inclusion/exclusion criteria for employees at the participating worksites, including teleworkers, do not involve other work parameters, such as employment, shift, and occupational types. Additional exclusion criteria are set for 800 employees who will be providing samples for quantification of cortisol levels extracted from fingernails: 1) participants with fingernails shortened using only a nail file (not cutting fingernails) and 2) participants using false nails.
Interventions
This cRCT will offer four intervention programs and an active control program at the worksite level. The interventions include psychoeducation for workers and supervisors, work environment improvement, and physical activity promotion.
Psychoeducation for workers
Workers in the worksites allocated to this intervention group will be asked to use a web-based app that provides psychoeducation on job crafting for six months (Supplementary File 1). The participants will access the contents by smartphones, tablets, or computers. The contents of the app were based on the theory of job crafting, “the physical and cognitive change individuals make in the task or relational boundaries of their work,” as defined by Wrzesniewski and Dutton.32 It was developed mainly based on a previous face-to-face job crafting program,33 and includes three types of job crafting (i.e., task, relation, and cognition).
The education program on job crafting contains four sessions, each lasting 10–15 minutes. First, the app provides lectures about the concept and importance of job crafting. Second, it introduces various examples of job crafting, such as how workers change their tasks or relationships in their working lives to be more desirable. Third, it requires participants to create their own job crafting plans and try them out. After that, they look back at their plan and make a new improved version of the plan. Each session can be taken at any time and at the participants’ convenience. During the intervention period, participants will be recommended to try various job crafting plans by using the app. If they have any questions or comments about the app, they will be able to contact researchers by e-mail. To encourage participants to use the app, a reminder e-mail by the researcher will be sent once a week for the first month, and once a month from the second to the sixth month.
Psychoeducation for supervisors
Supervisors at the worksites allocated to this arm will be asked to take an online-based training program named “Maximize Team Capabilities! Workplace Psychological Safety Navigation” for six months. The program is mainly based on the rationale of team psychological safety by Edmondson.34 Psychological safety is defined as a shared belief that the team is safe for interpersonal risk-taking (i.e., doing learning behavior that may place workers at risk, including seeking feedback, sharing information, asking for help, talking about errors, and experimenting).
This program provides an explanation of the rationale and importance of team psychological safety at work, how to promote team psychological safety, and specific actions for supervisors to build team psychological safety. The program also provides information on how to support subordinates who may be experiencing mental problems or concerns (i.e., how to notice changes in their health, approach them, and encourage consultation with mental health professionals). This program comprises four video lessons (Supplementary File 2) and e-newsletters. The videos will be released on YouTube, one per week during the first month. The videos are approximately 15 minutes each and can be viewed as many times as participants like. The e-newsletter will be delivered three times a week during the first month, and then once a week from the second month to the sixth month. The e-newsletters offer lesson summaries, encourage viewing for those who have not watched, and provide specific examples of actions to promote team psychological safety in the workplace.
Work environment improvement
Workers in the worksites in this intervention group will participate in the Participatory Work Environment Improvement Program (PWIP)35 for 7–12 months, which focuses on improving the psychosocial factors of the workplace. It is based on participatory organizational interventions and participatory approaches that are undertaken to respond to local needs at each workplace.36 This program will be conducted by internal facilitators who will be selected in each workplace and will assess, facilitate, and provide suggestions for the PWIP. The external coordinators (i.e., the research team) will contact, train, and support the internal facilitators to conduct the PWIP program.
First, a single, 120-minute training session for the internal facilitators will be held immediately after allocation. Next, the internal facilitators will assess the readiness of the PWIP by using the Basic Organizational Development for Your workplace (also known as BODY) checklist.37 This checklist consists of five items and classifies readiness of the workplace into four levels (from 0–4). The PWIP will be tailored based on the proportion of workers with affirmative answers to the Basic Organizational Development for Your workplace items, as follows: for a workplace classified as level 2 or 3, the standard PWIP program will be conducted; for a workplace classified as level 0 or 1, the external coordinators increase the frequency of contact and support for the internal facilitators. The planning and application of the PWIP will be supported by a web-based app (Supplementary File 3). Workers will access the app to identify the favorable conditions and conditions that need improvement in the workplace, participate in group work, and contribute to the improvement of the workplace.38 The PWIP consists of three sessions, as described herein: 1) a kick-off work (first time), encompassing a brief workplace-level workshop, which in turn uses a) presentations of local good examples, b) action checklists listing typical low-cost actions,39 c) group work guidance materials on the website for job stress prevention; 2) follow-up/planning and actions (1–3 months later), including workplace-level feasible multifaceted improvement plans, which in turn will be devised by encouraging consensus building and facilitating actions at each work unit or workplace; 3) report and shared achievements (6–12 months later), emphasizing locally-achieved good practices that improved working conditions and worker safety and health.
Physical activity promotion
Workers in the worksites in this intervention group will be asked to install a smartphone app on their smartphone and use it for six months. The app is named ASHARE, and is a native app for iOS and Android smartphones, supporting version 12.0 or later in iOS and version 5.0 or later in Android (Supplementary File 4).40 The iOS and Android versions of the app are distributed via App Stores and Google Play, respectively.
The app was developed to promote physical activity and prevent depression and anxiety in a healthy working population, and is based on basic behavioral change techniques (i.e., self-monitoring, feedback, and sharing of users’ physical activity data). Moreover, the needs of workers identified through qualitative interviews in a past study were reflected in the functions and interface of the app.41 When users install the app, they are initially asked to register their username and email address and input their age, gender, occupation, employment status, shift type, working hours per week, holiday patterns, and preferred activities. The app works with Apple Healthcare (iOS) and Google Fit (Android) and obtains data on the duration of moderate-to-vigorous physical activity every 15 minutes.42 These data are stored on a cloud server. The duration of the overall physical activity on the previous day is depicted as a graph on the top screen of the app.
In addition, the deep-learning model embedded in the app accesses the data for the previous day on cloud servers when the user starts the app and predicts a score of psychological distress for the day, which in turn is measured using the six-item Kessler Psychological Distress Scale (K6).43 Every day, participants are provided with feedback on predicted scores and levels of distress, which are presented as weather conditions, accompanied by rule-based comments from a monkey anime character (mental health forecast). A previous study on the predictive performance of the deep-learning model underlying the ASHARE app reported that the correlation coefficient between the predicted and measured values for psychological distress was 0.679 (R2 = 0.463), and that the overall classification accuracy for levels of psychological distress was 76.3%.44 The predicted scores and levels of psychological distress are recorded and monitored on a monthly basis.
The ideal usage of the app, as expected by the researchers, is for the participants to take about 5 minutes every day to open the app, check their physical activity patterns and the results of their mental health forecast, and to then increase the amount of any type of moderate-to-vigorous physical activity. As a function of data sharing, users with a high percentage of sunny days are ranked as “best performers.” Rankings can be modified freely based on categories such as age, gender, occupation, and preferred activities. If users are reluctant to share data with other users, they can decline to participate in the ranking process. The app has a notification function that notifies users on the smartphone of new information and the result of the mental health forecast (provided at six o’clock every morning). The reminders for app use will be sent once per week using this function.
Active control
Workers in the worksites in this group will be asked to download and read a booklet with basic information about stress, and to complete a simple individual work for six months. The booklet is a 14-page PDF file (Supplementary File 5) comprising explanations about differences between stressors and stress reactions (or strain); stress-performance curve (i.e., Yerkes–Dodson law)45; stressful life events46; and National Institute for Occupational Safety and Health (also known as NIOSH) model of job stress.47 It also includes a worksheet to write down the acute reactions (i.e., psychological, physiological, and behavioral) that are likely to occur when the participants themselves are placed in stressful situations. Since this is a control group, as a rule, no assistance other than the distribution of the booklet will be provided.
Strategies to improve adherence to intervention protocols
To improve adherence (completion rates) to the programs, reminders will be sent to participating worksites and employees via email and/or applications during the program duration. The reminders will include greetings, a recommendation to work on the contents of assigned programs, and the sharing of the statistics of the completion rates at other worksites within the arms (e.g., mean rates and champion worksite). Completion rates will be monitored using the digital records of the intervention and active control programs. At each survey point, reminders to complete the questionnaire will be sent weekly. These process evaluations of the cRCT will be summarized immediately after each survey (at the 6-, 12-, and 18-month follow-ups) and sent to the participating worksites and employees with a letter thanking them for their cooperation.
Outcomes
Table 1 presents a list of the outcomes according to the timeline of this cRCT. All outcomes will be measured at the employee level. As the primary outcomes, depression and anxiety will be measured at baseline and at the 6-, 12-, and 18-month follow-ups. As secondary outcomes, psychosocial factors at work, work engagement, health-related QOL, well-being, economic outcomes, physiological outcomes from a health checkup, and levels of cortisol extracted from fingernails will be measured. The process and implementation outcomes will be assessed as well.
Table 1
A list of outcome measurements
Outcomes | Scales | Timeline |
---|
Baseline | 6-month follow-up | 12-month follow-up | 18-month follow-up |
---|
Primary outcome | | | | | |
Depression and anxiety | K6 | × | × | × | × |
Secondary outcomes | | | | | |
Psychosocial factors at work | | | | | |
Job stressors | BJSQ | × | × | × | × |
Social support | BJSQ | × | × | × | × |
Stress responses | BJSQ | × | × | × | × |
Psychological safety | Psychological safety scale | × | × | × | × |
Work engagement | UWES-9 | × | × | × | × |
Health-related quality of life | EQ-5D-5L | × | × | × | × |
Well-being | | | | | |
Job and life satisfaction | BJSQ | × | × | × | × |
Seligman’s PERMA | Workplace PERMA Profiler | × | × | × | × |
Economic outcomes | | | | | |
Work performance | WHOHPQ | × | × | × | × |
Absenteeism | Data from the worksites | × | - | × | - |
Turnover from worksites | Data from the worksites | - | - | × | - |
The amount of monthly salary | Data from the worksites | × | - | × | - |
The amount of the yearly bonus | Data from the worksites | × | - | × | - |
Physiological outcomes | Annual health checkups | × | - | × | - |
Cortisol extracted from fingernails | Nail sampling | × | × | - | - |
Process and implementation outcomes | | |
Process evaluation | Completion rates of the programs | Immediately after each intervention |
Implementation outcomes | iOSDMH | Immediately after each intervention |
BJSQ: Brief Job Stress Questionnaire; EQ-5D-5L: the Japanese five-level version of the scale by the EuroQoL group; UWES, Utrecht Work Engagement Scale; iOSDMH: Implementation Outcome Scales for Digital Mental Health; K6: Kessler Psychological Distress Scale; WHOHPQ: World Health Organization Health and Work Performance Questionnaire.
Depression and anxiety
Depression and anxiety (non-specific psychological distress) will be measured using the Japanese version of the K6 scale.43 It comprises six items that assess the frequency of experience of non-specific psychological distress, which are rated on a five-point Likert scale (0 = none of the times and 4 = all the times). The reliability and validity of the Japanese version of the K6 have been confirmed previously.43 In this study, the total score for the K6 scale will be considered an indicator of depression and anxiety.
Psychosocial factors at work
Job stressors, stress responses, and psychological safety will be measured as indicators of psychosocial factors at work. Job stressors include job demands, job control, and social support from supervisors and coworkers, which will be measured using subscales of the Brief Job Stress Questionnaire (BJSQ).48 The BJSQ was developed in 2000 in Japan and has been widely used in occupational health studies and practice.49 The subscales for job stressors comprise three items each and are job demands, job control, supervisor support, and coworker support. All items are rated on a four-point Likert scale (for job demands and job control: 1 = not at all, 4 = very much so; for supervisor and coworker support: 1 = not at all, 4 = extremely). The total subscale scores will be calculated.
Stress responses will also be measured using the BJSQ, and the subscales are vigor (three items), anger-irritability (three items), fatigue (three items), anxiety (three items), depression (six items), and physical complaints (11 items). All items are rated on a four-point Likert scale (1 = almost never, 4 = almost always). The total subscale scores will be calculated.
Psychological safety will be measured using the Japanese version of the Psychological Safety Scale (also known as PSS).50 It assesses workers’ perceptions of the consequences of interpersonal risks in the workplace and consists of three subscales: team leader (nine items, e.g., “If I had a question or was unsure of something in relation to my role at work, I could ask my team leader”), peers (seven items, e.g., “I can communicate my opinions about work issues with my peers”), and team as a whole (three items, e.g., “It is easy to ask other members of this team for help”).51 All items are rated on a seven-point Likert scale (1 = strongly disagree, 7 = strongly agree). The total subscale scores will be calculated.
Work engagement
The nine-item Japanese version of the Utrecht Work Engagement Scale (also known as UWES) will be used to assess work engagement.52 It consists of three subscales, namely vigor (three items, e.g., “At my job, I feel strong and vigorous”), dedication (three items, e.g., “I am enthusiastic about my job”), and absorption (three items, e.g., “I am immersed in my work”). All items are rated on a seven-point Likert scale (0 = never, 6 = always). The reliability and unidimensional validity of the Japanese version of this scale have been confirmed previously.52 The average scores from each of the nine items will be used for analyses.
Health-related QOL
Health-related QOL will be measured using the Japanese five-level version of the scale by the EuroQoL group (also known as EQ-5D-5L).53 It consists of five dimensions of QOL (mobility, self-care, usual activities, pain or discomfort, and anxiety or depression). Each dimension has five levels for perceived problems, and a higher score indicates a higher QOL. We will use the value set of the Japanese version of this scale that has been developed in previous research.54
Subjective well-being
Job and life satisfaction will be measured as indicators of subjective well-being using questions from the BJSQ,48,49 with one item for each: “I am satisfied with my job” and “I am satisfied with my family life,” respectively. The two items are rated on a four-point Likert scale (1 = dissatisfied, 4 = satisfied), with higher scores indicating higher satisfaction.
The Japanese version of the Workplace PERMA Profiler will also be used as a multidimensional concept of well-being.55 This scale is based on Seligman’s PERMA model, and consists of five domains: positive emotion (P), engagement (E), relationships (R), meaning (M), and accomplishment (A).56 An overall score of well-being at work is calculated as the average of 15 items (three items each for five dimensions) and happiness (one item). All items are rated on an 11-point Likert-type scale (ranging from 0–10; e.g., for the first item of positive emotion, 0 = never and 10 = always). The reliability and validity of the Japanese Workplace PERMA Profiler have been confirmed previously.55
Economic outcomes
Economic outcomes include work performance, absenteeism, worksite turnover, and monthly salary and yearly bonuses. Work performance will be assessed using an item for the absolute score of work performance from the Japanese version of the World Health Organization Health and Work Performance Questionnaire short version (also known as WHOHPQ).57 The item rates an individual’s overall job performance in the past month on a scale of 0‒10 (0 = worst job performance, 10 = best job performance). The ratings are multiplied by 10 to calculate work performance according to the scoring guidelines.
Regarding absenteeism, data on absent days in the year before the surveys, monthly salary, and yearly bonus will be obtained from the worksite database at the time of the baseline survey and the 12-month follow-up survey. Regarding worksite turnover, data on the employees leaving the worksite will be obtained at the 12-month follow-up survey.
Physiological outcomes from a health checkup
Data on physiological outcomes, based on the annual health checkups conducted by the participating worksites, will also be collected at baseline and at the 12-month follow-up surveys. These outcomes include height, weight, waist circumference, blood pressure, blood lipids, and blood glucose levels.
Cortisol extracted from fingernails
In total, 800 employees will be asked to clip extended fingernails into a zip-lock plastic bag for one month, and cortisol levels will be measured from the nail samples. This procedure will be conducted at baseline and at the 6- and 18-month follow-ups. During sample collection, health conditions and lifestyle topics associated with cortisol levels and fingernails will be assessed using a self-reported questionnaire. The questionnaire will include items on marital status, educational background, income, medical history, current medications, pregnancy/childbirth in the previous year, smoking and drinking habits, frequency of cutting nails, frequency of using manicures, how to shorten nails (cut only or cut and file), habit of biting nails, frequency of soap use, frequency of hand disinfection, and frequency of kitchen detergent use.
Process and implementation outcomes
For process evaluation, the completion rates of the intervention and active control programs will be collected from the digital records of the programs. Implementation outcomes will be assessed using the Implementation Outcome Scales for Digital Mental Health (also known as iOSDMH), which was shown to have acceptable reliability and validity.58 This assessment tool was developed to measure the implementation outcomes of mental health interventions delivered by digital and telecommunication technologies, and comprises the following five subscales: acceptability (three items), appropriateness (four items), feasibility (six items), overall satisfaction (single item), and harm (five items). These evaluations will be conducted immediately after each intervention.
Sample size calculation
The required sample size has been calculated according to the guidelines of the Consolidated Standards of Reporting Trials (also known as CONSORT) for cRCTs,59 considering the intraclass correlations of the outcomes nested by the worksites. The sample sizes in cRCTs should be multiplied by the design effect (1+[m-1]ρ), where m is the average cluster size and ρ is the intraclass correlations.60 In the dataset from a cohort of healthy Japanese workers,61,62 the intraclass correlation for depression and anxiety measured by the K6 scale was 0.013. Therefore, a very small intraclass correlation (0.05, 5% of the outcome would be shared at the worksite level) was estimated for the primary outcome in this study. The average cluster size (the number of employees within one worksite) was set to 130 based on the information about potentially recruiting 2,700 companies from IID Co. Ltd.
The effect size of the four intervention programs on the primary outcome, that is, depression and anxiety, was estimated to be 0.20 in the standardized mean difference (d). A previous RCT that offered an Internet-based cognitive behavioral approach to prevent depression among nonclinical workers reported the effect size (d) of the intervention as 0.20 at the 6-month follow-up.63 The effect sizes of work environment improvement and physical activity promotion on mental health among nonclinical workers were reported to be larger.35,64 An alpha error probability (α) has been set to 0.05/4 = 0.0125 (one-tail) because we plan four tests between the intervention programs and control programs. The statistical power (1-β) has been set to 0.80.
Based on these parameters, the required sample size has been calculated as 3,554 for each arm using G* Power version 3.1.9.2.65 Considering that the average number of employees within one worksite is assumed to be 130, 27.3 worksites will be needed in each arm. Therefore, we have set the required number of worksites and employees to 28 and 3,640 in each arm and to 140 and 18,200 in total.
The required number of worksites (140) is 5.2% of the recruited companies (2,700). Previous RCTs on nonclinical workers at multiple worksites in Japan reported acceptance rates of approximately 4%.63,66 In this cRCT, the acceptance rate is likely to be higher and to reach the target sample size by prioritizing the recruitment of stressful worksites, which may in turn require countermeasures for stress management.
Randomization
Each included worksite will be randomized to one of the five arms. The randomization will be stratified into 3×2×2 = 12 strata, based on worksite size (three strata, 5–49, 50–99, and 100–299 employees), industries (two strata, manufacturing/construction/transportation, and other industries), and the percentage of “high-stress” employees in the worksites (two strata, < 15% and ≥ 15%). Companies that engage in manufacturing/construction/transportation are considered qualitatively different from other companies because they exert a physical demand on the workers and focus on reducing work-related injuries. The cutoff point for the percentage of “high-stress” employees in the worksites was decided according to previous results of the National Stress Check Program in the potentially participating companies. Permuted block randomization (block size = 5) will be adopted for equal randomization. A stratified permuted block random table will be created by an independent biostatistician at the Department of Clinical Research, Kitasato University Hospital. This table will be managed by another research assistant in the Department of Public Health, Kitasato University School of Medicine, who will be blinded to researchers. The results of randomization will be open to the participating worksites and employees because the contents of all intervention programs will be notified before enrollment. The assessment of the outcomes will also be open because of the self-assessment of depression and anxiety. Data analysis will be blinded because it will be conducted by the second author (HH), who will not participate in the recruitment, enrollment, allocation, and delivery of the intervention programs.
Statistical analysis
For the main analysis, repeated-measures multi-level mixed modeling will be used to test the effects of each intervention compared with the control at the 6-, 12-, and 18-month follow-ups. The interaction effect of the group (1 = intervention, 0 = control) by time will be considered an indicator of the intervention effect. The multi-level is based on employees (level 1) and worksites (level 2). The variables used in the stratified randomization (i.e., worksite size, industries, and the percentage of “high-stress” employees in the worksites) will be used as covariates. Intention-to-treat analysis will be adopted by including all employees who completed the baseline survey in the analysis. Multiple imputations will be conducted for missing values owing to dropouts. Effect sizes will be calculated as the standardized mean differences (d) of the scores on the K6 scale, based on the estimated mean differences.
The primary endpoint of the primary outcomes is set at the 6-month follow-up, and the corresponding statistical significance will be the primary indicator of effectiveness. If recruitment does not achieve the target sample size (140 worksites and 18,200 employees in total), we will assess the intervention effectiveness based on a minimally important difference. As the estimated effect size of the interventions is 0.20, the distribution-based approach67 is useful for estimating the minimally important difference of depression and anxiety measured by the K6 scale. In a previous RCT among nonclinical workers in Japan,63 the standard deviation of the change in the K6 score at the 6-month follow-up was 4.16. From the distribution, 20% of the standard deviation is estimated to be 0.832. Therefore, a 0.85 score change in the K6 score at the 6-month follow-up is considered to be the minimally important difference.
The secondary and continuous outcomes will be analyzed using the same method as the primary outcomes, that is, multi-level mixed modeling. At the 12-month follow-up, the prevalence rate of worksite turnover in each intervention group will be compared with that in the control group using the logistic regression model. The means of the completion rates of the programs and the scores of implementation outcomes will be compared among the five arms using analysis of covariance. Per protocol set analyses will be conducted assuming that some participants may not complete all protocols of the intervention programs. All statistical analyses will be conducted using STATA version 17 (StataCorp LLC, Texas, USA).
Data monitoring
A data monitoring committee will be established comprising the corresponding author (KW), the manager of enrollment (KY), and intervention program providers (AS, KI, TY, KW, and AI). The data monitoring committee meeting will be held every six months, in line with the surveys. The purpose of the meetings will be to review the recruitment process, program completion rates, attrition rates, and any harm caused by the interventions.