This was a scoping bibliometric review of 367 behavioral-focused preliminary intervention studies published between 2018 and 2020. We examined the usage of guidelines, checklists, frameworks, and recommendations (GCFRs) related to preliminary studies and determined associations between GCFR usage and the reporting of feasibility indicators and feasibility-related characteristics. Citing two or more GCFRs was associated with reporting a greater number of feasibility indicators and with framing the study findings from a feasibility perspective. These data demonstrate that the use of GCFRs, as inferred from citations within reference lists, has a clear positive impact on the overall comprehensiveness of the information presented. This information, in turn, should lead to greater transparency in reporting and more informed decisions regarding the viability of a behavioral intervention in a larger-scale trial.
To the authors’ knowledge, this is the first review to document and analyze the usage of GCFRs in preliminary health behavior intervention studies. Literature on Reporting, Design and Interpretation, and Defining Pilot and Feasibility Studies made up the majority of citations, and the three most cited GCFRs were the Medical Research Council (MRC) Guidance for Developing and Evaluating Complex Interventions (13, 20), the CONSORT Extension for Pilot and Feasibility Studies (8), and the Template for Intervention Description and Replication (TIDieR) (21). According to the Web of Science (accessed April 14, 2022), the MRC guidance has been cited over 4,000 times, the CONSORT Extension over 500 times, and TIDieR over 3,000 times, although preliminary studies do not account for all of these citations. It is not surprising that these are the top cited GCFRs in our sample of preliminary studies, as each is widely supported in the behavioral sciences and by funding agencies. These GCFRs can also be found on the EQUATOR Network, which makes them easily accessible to a larger audience of researchers developing intervention studies.
Less commonly cited were GCFRs related to Adaptations and Scale-Up, and the least commonly cited GCFR category was Progression Criteria. This might explain the very low presence of progression criteria in our included sample of preliminary studies, a pattern that has been found in other reviews as well (23, 24). The low usage of GCFRs related to Scale-Up is notable, especially since the majority of preliminary studies are presumably designed with future scale-up in mind. Several frameworks, including the National Institutes of Health Stage Model (25) and the Obesity-Related Behavioral Intervention Trials (ORBIT) model (26), outline the sequential and iterative processes for developing large-scale interventions, and specific attention is devoted to preliminary studies as a foundational piece of this scale-up process. Many of the GCFRs highlighted in this review can provide useful guidance on the scale-up process.
The reporting of feasibility indicators and feasibility-related characteristics in the sample of preliminary studies included in this review is similar to that of other reviews of health behavior intervention studies that are not preliminary studies (11, 24, 27). Most studies did not report feasibility indicators such as treatment fidelity, data collection feasibility, cost, participant compliance and attendance, and adverse events. Overall, most studies reported two or fewer feasibility indicators, typically recruitment and retention of participants. While trial-related feasibility indicators (i.e., recruitment and retention) are important to assess, intervention-related feasibility indicators (e.g., treatment fidelity, the ability to collect data on participants) are equally important to measure and report, both to highlight the potential viability of an intervention and to identify which aspects of the intervention need to be addressed and altered to ensure the intervention is successfully scaled up in the future.
While most studies did mention feasibility (or used similar language) in the title and purpose statement, over half mentioned efficacy in the purpose statement and conducted statistical testing for efficacy. It is well established that researchers should exercise caution when conducting efficacy testing within preliminary intervention studies and should not depend on these estimates to inform the design of a larger-scale trial, which would likely be conducted with hypothesis testing in mind (28–30). In fact, many of the GCFRs identified for this review caution against the use and interpretation of preliminary study effect sizes. This could explain why the use of GCFRs was associated with lower odds of many efficacy-related study characteristics in our sample. There are also inherent issues (both Type I and Type II error) with using preliminary study effect sizes for sample size estimation for a larger-scale trial, although we did not identify whether this was being done in our sample of preliminary studies. It is worth noting that just over half of the included preliminary studies did advise caution regarding interpretations of efficacy, and most studies framed feasibility indicators as the primary outcomes of interest. This is a promising finding and shows that many authors of recently published preliminary studies are 1) acknowledging the pitfalls of using preliminary studies to demonstrate the efficacy of an intervention and 2) prioritizing feasibility indicators as the main outcome of interest.
While this study is not the first to explore reporting quality in preliminary studies (11, 24, 27), it is the first to investigate associations between the usage of GCFRs and preliminary study reporting. Some studies have broadly explored the impact of guidelines (mostly CONSORT and TIDieR) on study reporting quality, though not necessarily in preliminary studies exclusively, and the results of these studies are heterogeneous (31–33). There are also several other examples of these types of reviews for a variety of disciplines and study designs (34–38), but most use somewhat ambiguous proxies for the usage of guidelines. In other words, most reviews have assessed reporting quality before and after the publication of a guideline, with the assumption that the presence of the published guideline might influence reporting quality. Our study utilized text mining to identify preliminary studies that cited GCFRs, using the presence of a GCFR citation as the indicator of usage, and found that usage was significantly and positively associated with the reporting of most feasibility indicators and feasibility-related characteristics. These findings provide compelling support for the use of GCFRs as a tool to improve the reporting quality of preliminary intervention studies.
Not only is the usage of GCFRs (via citation) associated with better reporting, but our results in the Tertiary Citer group show that citing a different preliminary study that itself cited a GCFR was associated with increased odds of reporting acceptability and treatment fidelity, as well as decreased odds of conducting statistical testing for efficacy. These results demonstrate a possible diffusive nature of the information published in GCFRs, whereby authors use other published preliminary studies (which cited a GCFR) as a model for their own work. Thus, the true beneficial impact of GCFRs related to preliminary studies may extend beyond improving the reporting quality of only those studies that cite them directly.
Strengths and Limitations
This review included a large sample (N = 367) of preliminary intervention studies published between 2018 and 2020, capturing some of the most recently published health behavior preliminary studies in the literature. Preliminary studies were not excluded based on location, design, or health behavior topic, which means results are generalizable to a large audience of health behavior researchers who conduct preliminary intervention studies. However, several limitations need to be considered. Included studies came from a sample of the 25 journals publishing the largest number of preliminary studies. Sampling from a wider variety of journals may have produced different results. Because the number of studies not citing any GCFR (Non-Citers) was so large, we opted to randomly sample 200 of them. Although random sampling is an unbiased approach and no differences between the subsample and the larger sample were found in our study, the possibility of sample selection bias remains. Limitations regarding the coding of studies are also important to mention. First, the presence/absence of GCFR usage and the reporting of feasibility indicators and feasibility-related characteristics were identified via a combination of text-mining and manual approaches. It is possible that some items may have either not been identified or been improperly coded. While we used an extensive list of GCFRs, it is possible that some GCFRs were missed and therefore not searched within our sample, which could influence results by miscoding studies that cited a GCFR we did not include in our search. Finally, we recognize that what is reported in a study is not completely identical to the actual conduct of the study. The failure to cite a GCFR or report a feasibility indicator does not mean the authors of a study had not considered preliminary study guidance or measured feasibility indicators during their preliminary intervention study.