Using Trial Sequential Analysis for estimating the sample sizes of further trials: example using smoking cessation intervention

doi:10.21203/rs.3.rs-35669/v1

Research article

https://doi.org/10.21203/rs.3.rs-35669/v1

This work is licensed under a CC BY 4.0 License

Journal publication: published 30 Nov, 2020 in BMC Medical Research Methodology

Background

Assessing benefits and harms of health interventions is resource-intensive and often requires feasibility and pilot trials followed by adequately powered randomised clinical trials. Data from feasibility and pilot trials are used to inform the design and sample size of the adequately powered randomised clinical trials. When a randomised clinical trial is conducted, results from feasibility and pilot trials may be disregarded in terms of benefits and harms.

Methods

We describe using feasibility and pilot trial data in the Trial Sequential Analysis program to estimate the required sample size for one or more trials investigating a behavioural smoking cessation intervention. We show how data from a new, planned trial can be combined with data from the earlier trials using Trial Sequential Analysis to assess the intervention’s effects.

Results

We provide a worked example to illustrate how we successfully used Trial Sequential Analysis methods to argue for the research funds needed to undertake a randomised clinical trial.

Conclusions

Trial Sequential Analysis can utilise data from feasibility and pilot trials as well as other trials, to estimate a sample size for one or more future randomised clinical trials. As this method uses available data, estimated sample sizes may be smaller than they would have been using conventional sample size estimation methods.

Health Economics & Outcomes Research

Meta-analysis

Trial Sequential Analysis

Sample size

Information size

Smoking

Pregnancy

Randomised clinical trial

Pilot trial

Feasibility trial

Demonstrating that health interventions work requires substantial resources. Often feasibility and pilot randomised clinical trials are conducted before larger-scale randomised clinical trials (RCTs) are designed to determine benefits and harms (1–3). Feasibility trials are used to ascertain information such as intervention acceptability, feasibility of intervention delivery, and recruitment likelihood to help design more decisive RCTs (1). A pilot trial is a smaller version of a large-scale RCT, and is used to test whether the main components of the trial, such as recruitment, randomisation, treatment, and follow-up assessments can all work together (1). Moreover, their data can be used to inform sample sizes for large-scale RCTs (2, 3).

Trial Sequential Analysis is a methodology that can be used in systematic reviews and meta-analyses to control random errors, and to assess whether further trials need to be conducted (4). Here we employ Trial Sequential Analysis and combine data from feasibility and pilot RCTs testing a text message-based smoking cessation intervention for pregnant women (‘MiQuit’) (5, 6) to estimate the sample size that one or more future RCTs would need to recruit, to provide a more decisive answer regarding intervention benefit.

Conventional meta-analysis

Meta-analyses often influence future research; when planning future trials, investigators frequently use meta-analysis to provide an accurate summary of an intervention’s likely effect. If all available RCTs are included, systematic reviews with meta-analyses are considered the best available evidence, because power and precision of the estimated intervention effect are maximal (7, 8). However, this does not necessarily mean that the available evidence is either sufficient or strong. Conventional meta-analysis methods do not consider the amount of the available evidence in relation to the required sample size (9–11). The reliability of a statistically significant intervention effect generated by meta-analysis is often overvalued, particularly where sparse data (number of events and participants) or repetitive analyses (type I errors) are seen (8, 12–14). In other situations, intervention effects that are not statistically significant are often interpreted as showing that the intervention has no effect, and it is assumed that no more evidence is required (type II errors) (15, 16).

In conventional meta-analysis, there is no way to differentiate between an underpowered meta-analysis and a true finding of an intervention being ‘ineffective’. However, it is imperative that a conclusion as to whether an intervention is truly ineffective or truly effective is made as soon as possible after trials are completed, in order to guide investigators’ decisions as to whether further trials could be informative or not (12). Trial Sequential Analysis is a method and a statistical program that can overcome this issue by distinguishing whether meta-analyses provide evidence for either beneficial or harmful intervention effects, lack of effect (futility), or insufficient evidence for evaluation of the intervention effect (12, 17).

Trial Sequential Analysis

Meta-analyses aim to discover the benefit or harm of an intervention as early and as reliably as possible. As a result, they tend to be updated when new trials are published (18). When intervention evaluation has just begun and only a few small trials are available, meta-analyses may be conducted on sparse data and are at high risk of random errors (19). As meta-analyses are updated, they are subjected to repeated significance testing, which increases the risk of type I errors (20). Trial Sequential Analysis addresses both issues by applying more stringent thresholds for statistical significance, known as monitoring boundaries, when few data are available. Monitoring boundaries also take into account how many significance tests have been undertaken, by adjusting the thresholds used to define whether or not results are considered statistically significant (12).

Trial Sequential Analysis is also able to assess when an intervention has an effect smaller than what would be considered clinically minimally important (12). Futility boundaries, originally developed for interim analysis in RCTs, can be estimated and used to provide a threshold below which an intervention would be considered to have no clinically important effect (12). Thus, performing further trials is considered futile as the intervention does not possess the postulated clinically minimally important effect (12).

In Trial Sequential Analysis, when neither the monitoring boundaries nor the futility boundaries are crossed, further information is required. Trial Sequential Analysis can also inform how much more information is required to get a conclusive answer regarding the effect of the intervention versus its comparator – this is called the distance between the accrued information and the required information.

Required information size: For RCTs, an estimation of the required sample size is performed to ensure the number of participants included is enough to detect or reject a minimum clinically important effect size (17). For binary outcomes, such as death, the sample size estimation is based on the expected proportion of deaths in the control group, the expected relative risk reduction of the intervention, and the selected maximum risks of both type I and type II errors (18). Similarly, for meta-analyses to produce adequately powered findings regarding intervention efficacy, sufficient numbers of participants need to be included. This number is referred to as the ‘required information size’ (or ‘optimal information size’ or ‘meta-analytic sample size’) (21, 22). The meta-analytic required information size can be estimated using similar parameters as those used in sample size estimation for a single trial if one uses a fixed-effect model. If one intends to use a random-effects model, then one needs to consider adjusting for any between-study heterogeneity measured by diversity (D²) (17). Heterogeneity between studies is likely to be observed in meta-analyses due to the magnitude of the intervention effect varying when used in different study populations, in studies with different methodological characteristics, or due to variations in the intervention itself (11). Thus, sample size estimations need to be increased to allow for this between-trial heterogeneity (17).
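The heterogeneity adjustment described above can be sketched in a few lines. This is a minimal illustration, assuming the definition of diversity used in the Trial Sequential Analysis literature, D² = (v_R − v_F) / v_R, where v_R and v_F are the variances of the pooled estimate under the random-effects and fixed-effect models; under that assumption a fixed-effect information size is multiplied by 1 / (1 − D²). The function names and the example variances are hypothetical and are not taken from the MiQuit trials.

```python
def diversity(var_random: float, var_fixed: float) -> float:
    """D^2: share of the random-effects variance due to between-trial variation."""
    if var_random <= 0 or var_fixed <= 0:
        raise ValueError("variances must be positive")
    # The random-effects variance is never smaller than the fixed-effect one,
    # so the ratio is clamped at zero to guard against rounding noise.
    return max(0.0, (var_random - var_fixed) / var_random)


def inflation_factor(d2: float) -> float:
    """Multiplier applied to a fixed-effect information size: 1 / (1 - D^2)."""
    if not 0.0 <= d2 < 1.0:
        raise ValueError("D^2 must lie in [0, 1)")
    return 1.0 / (1.0 - d2)


# Hypothetical pooled-estimate variances, for illustration only.
d2 = diversity(var_random=0.20, var_fixed=0.15)
print(f"D^2 = {d2:.2f}, inflation factor = {inflation_factor(d2):.2f}")
```

With D² = 0, as in the two-trial MiQuit meta-analysis, the factor is 1 and no inflation is applied.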

In Trial Sequential Analysis, trials are chronologically ordered, and interim analyses are conducted as each trial is added. In a Trial Sequential Analysis where the ‘required information size’ has not been reached, monitoring boundaries make the threshold for statistical significance more stringent to account for sparse data and multiple testing across the interim analyses; thus, the conventional 95% confidence interval does not cover the real uncertainty, and the cut-off for determining statistical significance is below the usual nominal figure of 0.05 (17).
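To give a feel for how much stricter the threshold becomes early on, the sketch below computes the cumulative type I error that may be "spent" at a given fraction of the required information size. It assumes the Lan-DeMets O'Brien-Fleming-type alpha-spending function, which the Trial Sequential Analysis user manual (12) describes; this paper does not itself name the spending function, so treat that choice as an assumption. The function name is hypothetical. Only the cumulative alpha is shown: deriving the boundary at a second or later look also requires accounting for the correlation between successive analyses, which is not attempted here.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def obf_alpha_spent(information_fraction: float, alpha: float = 0.05) -> float:
    """Cumulative two-sided alpha spent at a given information fraction
    (accrued information / required information size)."""
    if not 0.0 < information_fraction <= 1.0:
        raise ValueError("information fraction must lie in (0, 1]")
    z_final = _STD_NORMAL.inv_cdf(1.0 - alpha / 2.0)
    return 2.0 * (1.0 - _STD_NORMAL.cdf(z_final / information_fraction ** 0.5))


for fraction in (0.25, 0.50, 0.75, 1.00):
    print(f"information fraction {fraction:.2f}: alpha spent {obf_alpha_spent(fraction):.5f}")
```

Under this assumption almost none of the 5% is available when a quarter of the information has accrued, and the full 5% is reached only at the required information size.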

In the worked examples below, we show how Trial Sequential Analysis methods can be used to estimate the sample size required for one or more new trials to add further data to a meta-analysis to provide more firm evidence for an intervention either having or not having the postulated effect.

In this section, we provide an example of how Trial Sequential Analysis successfully used data from feasibility and pilot RCTs that tested MiQuit, a text-message, self-help smoking cessation intervention for pregnant women, to justify research funds to undertake a third, more adequately powered RCT.

Previous MiQuit trials

Smoking during pregnancy increases the risk of miscarriage, stillbirth, low birth-weight, premature birth, perinatal morbidity and mortality, sudden infant death, as well as adverse infant behavioural outcomes (23, 24). Pregnancy is a life event which motivates cessation attempts amongst smokers, and over 50% of pregnant women who smoke attempt to quit during this time (25); consequently, pregnancy is an opportune moment to offer smoking cessation support. Text message-based, self-help smoking cessation programmes developed for non-pregnant smokers are effective, but such programmes are inappropriate for use during pregnancy (26–28). To address the lack of acceptable self-help cessation support programmes for pregnant smokers in the UK, MiQuit was developed (5). MiQuit delivers individually-tailored text messages to pregnant smokers, with the aim of encouraging them to stop smoking (5). Further details on MiQuit can be found elsewhere (5).

A MiQuit feasibility RCT was conducted, including 207 women. Biochemically-validated, 7-day point prevalence cessation at 12 weeks post randomisation (~ 6 months gestation) was 12.5% in the MiQuit group, compared with 7.8% in the control group (odds ratio (OR) 1.68, 95% confidence interval (CI) 0.90 to 3.16) (5). Although the trial was small, and the cessation period brief, the trial provided an estimate suggesting that MiQuit could have a positive impact in addition to routine care.

Next, we conducted a pilot RCT to investigate the feasibility of undertaking a fully-powered multi-centre RCT in UK National Health Service (NHS) settings (6). The pilot MiQuit RCT recruited 407 pregnant smokers and the prolonged abstinence rate from smoking, validated in late pregnancy was 5.4% in the MiQuit group versus 2.0% in the control group (OR 2.70, 95% CI 0.93 to 9.35) (6). This trial also suggested a beneficial effect of MiQuit.

As MiQuit is a cheap intervention and can be disseminated widely, we anticipated that even a 1–2% absolute effect on smoking cessation in pregnancy could be clinically important and cost effective (6). The results from the feasibility and pilot trials suggested that an impact of this size was attainable; however, an adequately powered RCT would still be needed to determine whether MiQuit is effective and to guide future routine clinical practice.

Conventional meta-analysis

The conventional way to determine if an intervention is effective or not is to use the naïve alpha of 5% and the naïve 95% confidence interval (8). Since both the feasibility and pilot trials used virtually the same design as that which would be used in any new RCT, they can be considered as pilots, and it is appropriate to meta-analyse their findings together. Using a random-effects model, a traditional meta-analysis of the pilot and feasibility trial data found that women randomised to MiQuit were more than twice as likely to be abstinent in late pregnancy (pooled OR 2.26, 95% CI 1.04 to 4.93; I² = 0%, p = 0.041). This result is statistically significant according to conventional assessment (p < 0.05). However, it should be interpreted with caution because, as described above, meta-analyses based on only two small RCTs can produce spurious findings due to type I error (9, 10, 21).

In the next sections, we use conventional sample size estimation methods to estimate the sample size for an RCT which, on its own would have enough power to show whether MiQuit might be effective, using a plausible treatment effect estimate derived from the conventional meta-analysis above. We also calculate a second sample size estimate for one or more further RCTs, which when pooled with data from feasibility and pilot trials using Trial Sequential Analysis methods, would be similarly decisive.

Conventional sample size estimation

As the pilot trial (6) was considered at lower risk of bias compared to the feasibility trial (5), a traditional sample size calculation using smoking cessation rate estimates derived from the pilot trial suggests a new trial would require a total sample size of 1292 participants. This estimate has 90% power (10% type II error) and 5% significance (2-sided test; type I error) to detect a 3.4% absolute difference in prolonged abstinence from smoking from 4 weeks after enrolment until 36 weeks gestation between the MiQuit and control groups (5.4% versus 2.0%) (6).
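The sketch below shows one way a figure of this size can be obtained. It assumes the standard normal-approximation formula for comparing two independent proportions, with the pooled proportion used for the variance under the null hypothesis and no continuity correction; the paper does not state which formula or software was used, so this is an illustration rather than the authors' calculation. The function name is hypothetical.

```python
import math
from statistics import NormalDist


def per_group_sample_size(p_control: float, p_intervention: float,
                          alpha: float = 0.05, power: float = 0.90) -> int:
    """Participants per group for a two-sided test of two proportions."""
    norm = NormalDist()
    z_alpha = norm.inv_cdf(1.0 - alpha / 2.0)   # two-sided type I error
    z_beta = norm.inv_cdf(power)                # power = 1 - type II error
    p_bar = (p_control + p_intervention) / 2.0
    sd_null = math.sqrt(2.0 * p_bar * (1.0 - p_bar))
    sd_alt = math.sqrt(p_control * (1.0 - p_control)
                       + p_intervention * (1.0 - p_intervention))
    delta = abs(p_intervention - p_control)
    return math.ceil(((z_alpha * sd_null + z_beta * sd_alt) / delta) ** 2)


# Pilot trial cessation rates: 2.0% (control) versus 5.4% (MiQuit).
n_per_group = per_group_sample_size(0.020, 0.054)
print(n_per_group, 2 * n_per_group)  # 646 1292
```

Under these assumptions the total of 1292 matches the figure quoted above.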

Trial Sequential Analysis

The z-score is the test statistic used to decide whether or not to reject the null hypothesis. Very high positive or very low negative z-scores are associated with very small p-values. The critical z-score values when using a 95% confidence level, known as the ‘conventional test boundaries’, are −1.96 and +1.96, and these relate to a two-sided p-value of 0.05. If the z-score is between −1.96 and +1.96, the p-value will be larger than 0.05, and the null hypothesis of no difference between intervention groups is not rejected. The z-curve represents the cumulative z-score as each RCT is added to the analysis. In Fig. 1.I, when trial B is added to the analysis, the z-curve crosses the conventional test boundary (p = 0.05). This is consistent with the results from the conventional meta-analysis for MiQuit, where we found p = 0.041.
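As a rough check of the link between the z-score and the p-value, the cumulative z-score can be recovered from the pooled odds ratio and its 95% confidence interval. The sketch assumes the interval is symmetric on the log odds ratio scale, which is the usual construction; the function name is hypothetical, and the inputs are the rounded values reported for the conventional meta-analysis, so the result is approximate.

```python
import math
from statistics import NormalDist


def z_and_p_from_odds_ratio(odds_ratio: float, ci_lower: float, ci_upper: float,
                            level: float = 0.95) -> tuple[float, float]:
    """Recover the z-score and two-sided p-value from an OR and its CI."""
    norm = NormalDist()
    z_crit = norm.inv_cdf(1.0 - (1.0 - level) / 2.0)
    # Standard error of log(OR), backed out from the width of the interval.
    se_log_or = (math.log(ci_upper) - math.log(ci_lower)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se_log_or
    p = 2.0 * (1.0 - norm.cdf(abs(z)))
    return z, p


z, p = z_and_p_from_odds_ratio(2.26, 1.04, 4.93)
print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

The recovered z-score is a little above 1.96 and the p-value is close to 0.04, in line with the reported p = 0.041 given the rounding of the inputs.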

The required information size is represented by the vertical red line in Fig. 1. The required information size was estimated using the same parameters as used for the conventional sample size estimation above (90% power, 5% significance, to detect a 3.4% absolute difference) (6); although this estimate could take into account observed heterogeneity, there was none in this meta-analysis (I² = 0% and D² = 0). Consequently, the estimated required information size of 1296 participants is only slightly different to that using conventional sample size estimation due to rounding errors; the estimates would be larger if heterogeneity were present.
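A fixed-effect required information size for a binary outcome can be sketched as follows. The formula, 4(z₁₋α/₂ + z₁₋β)² · p̄(1 − p̄) / δ² with p̄ the mean of the two event proportions and δ their difference, is the form commonly given in the Trial Sequential Analysis literature; the paper itself does not print the formula, so its use here is an assumption, and the function name is hypothetical. No diversity adjustment is applied because D² = 0 in this meta-analysis.

```python
import math
from statistics import NormalDist


def required_information_size(p_control: float, p_intervention: float,
                              alpha: float = 0.05, power: float = 0.90) -> int:
    """Total participants (both groups) under a fixed-effect model."""
    norm = NormalDist()
    z_sum = norm.inv_cdf(1.0 - alpha / 2.0) + norm.inv_cdf(power)
    p_bar = (p_control + p_intervention) / 2.0   # mean event proportion
    delta = p_intervention - p_control           # absolute difference to detect
    return math.ceil(4.0 * z_sum ** 2 * p_bar * (1.0 - p_bar) / delta ** 2)


print(required_information_size(0.020, 0.054))  # 1296
```

With the pilot trial rates (2.0% versus 5.4%), 90% power and a two-sided 5% significance level, this assumed formula returns the 1296 participants quoted above.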

As the cumulative z-curve does not cross the upper trial sequential monitoring boundary which indicates MiQuit being effective, this Trial Sequential Analysis shows that further information is required before any firm conclusion can be reached about MiQuit efficacy. Although the conventional meta-analysis suggested, with borderline significance, that pregnant women randomised to MiQuit were more than twice as likely to be abstinent from smoking in late pregnancy, Trial Sequential Analysis indicates that this finding is not sufficiently robust. The Trial Sequential Analysis-adjusted confidence intervals for cessation using MiQuit (pooled OR 2.26, Trial Sequential Analysis-adjusted CI 0.66 to 7.70), are much wider than those of the conventional meta-analysis (pooled OR 2.26, 95% CI 1.04 to 4.93).

Without Trial Sequential Analysis having been undertaken, an interpretation of the conventional meta-analysis would have been that MiQuit is effective. However, Trial Sequential Analysis indicates that one cannot be secure in this interpretation and further trial data should be collected to eliminate the possibility that this is a false positive result, which can occur early in intervention evaluation when small trials are undertaken.

Calculating sample size for a third MiQuit RCT

Trial Sequential Analysis has demonstrated that further RCT data are required before a firm conclusion about MiQuit efficacy can be determined. As the initial two trials were sufficiently similar to be combined in Trial Sequential Analysis, we will now demonstrate how Trial Sequential Analysis methods can be used to estimate the sample size for (a) further trial(s) – data from which, when combined with the previous two trials in the Trial Sequential Analysis, would be expected to provide a more decisive answer regarding MiQuit efficacy. We will also demonstrate how exemplar theoretical findings from future trials which are both in favour and against MiQuit having a positive effect would impact the Trial Sequential Analysis result.

Trial Sequential Analysis sample size estimation: The Trial Sequential Analysis estimated the required information size as 1296 participants. From the feasibility and pilot studies, 605 women have already been recruited and randomised; therefore, the required sample size for further RCTs can be estimated as the difference between the required information size and the number of women already recruited into the previous trials. Thus, a sample size of 691 women (346 per intervention group) would be needed, assuming a 1:1 ratio.

When a theoretical third trial (D) with a negative outcome is included in the Trial Sequential Analysis (Fig. 1.III), we observe a different output. This third trial, with a sample size of 630, was intentionally given a negative outcome (absolute difference of -0.63% in favour of control). The z-curve drops below the conventional test boundary, and in a conventional meta-analysis we would have concluded that MiQuit was not effective. However, in the Trial Sequential Analysis, the futility boundary is not crossed, so we are unable to say decisively that MiQuit is not as effective as control for smoking cessation. Because adding this trial introduces diversity between trials, the required information size has increased to 1941, meaning future trials will need a further 706 participants.
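The arithmetic behind the two "further participants" figures above is simply the gap between the required and the accrued information. The helper below is a minimal sketch with a hypothetical function name; all the numbers are those quoted in the text, and a 1:1 allocation ratio is assumed when splitting the total between groups.

```python
import math


def additional_participants(required_information_size: int, accrued: int) -> tuple[int, int]:
    """Return (total still needed, number per group under 1:1 allocation)."""
    remaining = max(0, required_information_size - accrued)
    return remaining, math.ceil(remaining / 2)


# Two existing trials only: 605 women accrued against 1296 required.
print(additional_participants(1296, 605))        # (691, 346)

# After adding the theoretical negative trial of 630 participants,
# with the required information size raised to 1941.
print(additional_participants(1941, 605 + 630))  # (706, 353)
```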

A conservative approach to sample size estimation: In the above example, the required information size was derived using the smoking cessation rate from the pilot trial (6). It can therefore be contested whether data from the pilot trial should also be included in the subsequent Trial Sequential Analysis. To address this, one could exclude the pilot trial data from the Trial Sequential Analysis and re-estimate the total number required (Fig. 2.I). Using this approach, to provide a conclusive result, either a single trial of 1098 participants (549 per intervention group, assuming a 1:1 ratio) or multiple trials cumulating to a total of 1098 participants would be needed. This figure, although conservative, is still less than the estimate from the conventional sample size calculation.
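The conservative figure can be reproduced from numbers already given in the text, on the assumption that the required information size stays at 1296 and that the 407 pilot trial participants are simply removed from the 605 women accrued. The function name is hypothetical.

```python
def conservative_additional_participants(required_information_size: int,
                                         accrued_total: int,
                                         design_trial_size: int) -> int:
    """Gap to the required information size when the trial that informed
    the design parameters is not counted as accrued information."""
    accrued_independent = accrued_total - design_trial_size
    return required_information_size - accrued_independent


total = conservative_additional_participants(1296, accrued_total=605, design_trial_size=407)
print(total, total // 2)  # 1098 549
```

Equivalently, the conservative estimate is the standard gap of 691 participants plus the 407 participants of the pilot trial.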

Sensitivity analysis

The modelled scenario, in which there is no heterogeneity between trials in a meta-analysis, is rare; in most situations where the described approach is used, some heterogeneity between studies might be expected. Trial Sequential Analysis provides 95% confidence intervals for heterogeneity (I-square) within meta-analyses. One way to fully allow for heterogeneity is to perform a sensitivity analysis using the upper boundary for heterogeneity. This would increase the required information size. In our example, the program could not calculate the 95% confidence interval surrounding the I-square of 0% as there were fewer than three included studies. In this case it is possible to input an estimate for heterogeneity into the TSA software.
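A sensitivity analysis of this kind can be sketched outside the software by sweeping over assumed levels of heterogeneity. The example assumes the adjustment takes the form of dividing the fixed-effect required information size by (1 − D²), with diversity D² as the heterogeneity measure; the assumed D² values of 25% and 50% are purely illustrative and are not estimates from the MiQuit trials. The constant and function names are hypothetical.

```python
import math

FIXED_EFFECT_INFORMATION_SIZE = 1296  # reported for the two-trial meta-analysis
ACCRUED_PARTICIPANTS = 605            # women already randomised


def sensitivity_table(assumed_diversities):
    """Rows of (assumed D^2, adjusted information size, participants still needed)."""
    rows = []
    for d2 in assumed_diversities:
        adjusted = math.ceil(FIXED_EFFECT_INFORMATION_SIZE / (1.0 - d2))
        rows.append((d2, adjusted, max(0, adjusted - ACCRUED_PARTICIPANTS)))
    return rows


for d2, adjusted, remaining in sensitivity_table([0.0, 0.25, 0.50]):
    print(f"D^2 = {d2:.0%}: required {adjusted}, still needed {remaining}")
```

Even moderate assumed heterogeneity increases the number of further participants markedly, which is why the choice of the heterogeneity input deserves explicit justification.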

The above example demonstrates how Trial Sequential Analysis can be used to determine the required sample size for one or more additional RCTs to make a meta-analysis more conclusive. This sample size would be considered underpowered in comparison to a traditional RCT sample size calculation. By using Trial Sequential Analysis in such a way, future trials could be planned using significantly fewer resources and with less cost than trials planned using traditional sample size calculations.

In the worked example, data from the pilot trial were used in the Trial Sequential Analysis to estimate the required information size. Ignoring that the same data are being used twice (for the estimation and for the meta-analysis) could mean that the estimate generated is not sufficiently conservative. Thus, we present a modification which attempts to overcome this issue. This approach increases the difference between the required information size and the accrued information by the sample size of the trial used in the estimation.

It is important to note that in the example, the meta-analysis of the existing two MiQuit trials quantified heterogeneity as 0%, thereby indicating that none of the variation in the meta-analysis was due to heterogeneity. However, it is unlikely that this will be the case for meta-analyses of other interventions aimed at changing addictive behaviours (29, 30); therefore, trial sequential analysis methods have been developed to account for this (21). In Trial Sequential Analysis, the estimated information size and monitoring boundaries vary with the level of heterogeneity in the meta-analysis: the greater the level of heterogeneity, the larger the sample size needed for firm conclusions.

In the example presented, odds ratios were used instead of relative risks, as the feasibility study was powered using an odds ratio from a meta-analysis investigating mobile phone interventions for smoking cessation in the general population (5). Moreover, the quit rates are relatively low, so there is very little difference between the odds ratio and relative risk. In other trial sequential analyses, it may be advisable to use relative risks instead of odds ratios, to avoid overestimates. Additionally, it may be inappropriate to use the odds ratio used to power the feasibility trial to estimate sample sizes for future MiQuit trials, since data now exist from the feasibility and pilot trials. In our example, the stipulated intervention effect was derived from the pilot trial (‘internal data’), and it may be argued that such adaptive data should not be used in meta-analysis (31).

Kulinskaya and Wood argued that in an underpowered meta-analysis, not only is it necessary to assess the gap from the accrued information size to the required information size (i.e. the number of additional participants you need to randomise), but also the number of trials that should be conducted to randomise this number of participants (32). Using multiple trials to reach the required information size may be beneficial in meta-analyses where heterogeneity occurs (32). Smaller trials have more imprecise estimates of intervention effects; hence heterogeneity is reduced in the meta-analysis of such trials. However, setting up more than one trial can be more expensive and this may not be realistic in practice.

Recently, Cochrane evaluated and updated their guidance on using sequential approaches in meta-analysis in their reviews (8, 33). The Cochrane Handbook authors concluded that sequential methods should not be used in primary analyses or to draw conclusions, but could be used as secondary analyses in reviews if they are prospectively planned and the assumptions underlying the design are clearly justified (8). In their guidance, the evidence synthesis group state that authors' interpretations of evidence should be based on the estimated magnitude of effect of an intervention and its uncertainty rather than on binary conclusions, and that decisions should not be influenced by plans for future updates of meta-analyses (8). These criticisms of sequential approaches in meta-analyses apply to the traditional use of Trial Sequential Analysis, whereas our paper demonstrates an alternative use of the method.

Another reason given by the Cochrane Handbook authors against using sequential methods as a primary analysis in reviews is the argument that a meta-analyst does not have any control over the design of trials that are eligible for meta-analysis (8). It would therefore be impossible to construct a set of stopping rules (8). In our example, the opposite is the case. Both the feasibility and pilot trials were conducted by the same group of investigators, and any future trials would be designed with the desired properties of a stopping rule in mind.

Finally, the Cochrane Handbook authors also highlight that there are methodological limitations to sequential methods when heterogeneity is present (8). In the example described in this paper, heterogeneity was not present and therefore these limitations are not relevant in this case. However, we do discuss how the presence of heterogeneity can be overcome in trial sequential analysis by performing a sensitivity analysis.

In conclusion, Trial Sequential Analysis is a method and a freely available program that can utilise data from feasibility and pilot trials as well as other trials, in order to estimate a sample size for one or more future RCTs, to provide an adequately powered conclusion regarding an intervention’s benefits and harms. This simple use of expensively-collected trial data could be usefully exploited by researchers evaluating other interventions.

CI: Confidence interval

NHS: National Health Service

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Availability of data and materials

Trial Sequential Analysis software, user manual and further information regarding the mathematics behind the method are available at http://www.ctu.dk/tsa/ for free.

All data generated or analysed during this study are included in the following published articles:

Naughton F, Prevost AT, Gilbert H, Sutton S. Randomized controlled trial evaluation of a tailored leaflet and SMS text message self-help intervention for pregnant smokers (MiQuit). Nicotine & Tobacco Research. 2012;14(5):569-77.

Naughton F, Cooper S, Foster K, Emery J, Leonardi‐Bee J, Sutton S, et al. Large multi‐centre pilot randomized controlled trial testing a low‐cost, tailored, self‐help smoking cessation text message intervention for pregnant smokers (MiQuit). Addiction. 2017;112(7):1238-49.

Competing interests

RC, CG, IB and TC declare that they have no competing interests.

JLB reports fees from undertaking independent statistical review for Danone Nutricia Research, and in relation to providing statistical expertise to the Food Standards Agency, both outside the subject of the submitted work.

Funding

This study is funded by the National Institute for Health Research (NIHR) Applied Research Collaboration East Midlands (ARC EM). Professor Coleman is a NIHR Senior Investigator. The views expressed are those of the author(s) and not necessarily those of the NIHR, the Department of Health and Social Care, or Rigshospitalet.

Authors' contributions

RC, JLB, IB and TC conceived the idea for this manuscript. RC input all data into the software, and produced the results. RC, JLB and CG all contributed to the interpretation of the data. RC produced an initial draft of the manuscript, and all authors made substantial revisions to the work. All authors commented on the final draft of the manuscript and RC finalised the text. All authors read and approved the final manuscript.

Acknowledgements

Not applicable.

Arain M, Campbell MJ, Cooper CL, Lancaster GA. What is a pilot or feasibility study? A review of current practice and editorial policy. BMC medical research methodology. 2010;10(1):67.
Wittes J, Brittain E. The role of internal pilot studies in increasing the efficiency of clinical trials. Statistics in medicine. 1990;9(1-2):65–72.
Thabane L, Ma J, Chu R, Cheng J, Ismaila A, Rios LP, et al. A tutorial on pilot studies: the what, why and how. BMC medical research methodology. 2010;10(1):1.
Brok J, Thorlund K, Gluud C, Wetterslev J. Trial sequential analysis reveals insufficient information size and potentially false positive results in many meta-analyses. J Clin Epidemiol. 2008;61(8):763–9.
Naughton F, Prevost AT, Gilbert H, Sutton S. Randomized controlled trial evaluation of a tailored leaflet and SMS text message self-help intervention for pregnant smokers (MiQuit). Nicotine Tob Res. 2012;14(5):569–77.
Naughton F, Cooper S, Foster K, Emery J, Leonardi-Bee J, Sutton S, et al. Large multi‐centre pilot randomized controlled trial testing a low‐cost, tailored, self‐help smoking cessation text message intervention for pregnant smokers (MiQuit). Addiction. 2017;112(7):1238–49.
Garattini S, Jakobsen JC, Wetterslev J, Bertelé V, Banzi R, Rath A, et al. Evidence-based clinical practice: Overview of threats to the validity of evidence and how to minimise them. European journal of internal medicine. 2016;32:13–21.
Higgins J, Thomas J, Chandler J, Cumpston M, Li T, Page M, et al. Cochrane Handbook for Systematic Reviews of Interventions: Cochrane; 2019. Available from: .
Imberger G, Thorlund K, Gluud C, Wetterslev J. False-positive findings in Cochrane meta-analyses with and without application of trial sequential analysis: an empirical review. BMJ open. 2016;6(8):e011890.
Thorlund K, Imberger G, Walsh M, Chu R, Gluud C, Wetterslev J, et al. The number of patients and events required to limit the risk of overestimation of intervention effects in meta-analysis—a simulation study. PLoS ONE. 2011;6(10):e25491.
Imberger G, Gluud C, Boylan J, Wetterslev J. Systematic reviews of anesthesiologic interventions reported as statistically significant: problems with power, precision, and type 1 error protection. Anesthesia Analgesia. 2015;121(6):1611–22.
Thorlund K, Engstrøm J, Wetterslev J, Brok J, Imberger G, Gluud C. User manual for trial sequential analysis (TSA). Copenhagen Trial Unit, Centre for Clinical Intervention Research. Copenhagen Denmark. 2011;1:1–115.
Harrison W, Angoulvant F, House S, Gajdos V, Ralston SL. Hypertonic Saline in Bronchiolitis and Type I Error: A Trial Sequential Analysis. Pediatrics. 2018;142(3):e20181144.
Simmonds M, Salanti G, McKenzie J, Elliott J, Agoritsas T, Hilton J, et al. Living systematic reviews: 3. Statistical methods for updating meta-analyses. J Clin Epidemiol. 2017;91:38–46.
Moher D, Tetzlaff J, Tricco AC, Sampson M, Altman DG. Epidemiology and reporting characteristics of systematic reviews. PLoS Med. 2007;4(3):e78.
Jackson D, Turner R. Power analysis for random-effects meta‐analysis. Research synthesis methods. 2017;8(3):290–302.
Wetterslev J, Jakobsen JC, Gluud C. Trial Sequential Analysis in systematic reviews with meta-analysis. BMC Med Res Methodol. 2017;17(1):39.
Brok J, Thorlund K, Wetterslev J, Gluud C. Apparently conclusive meta-analyses may be inconclusive—trial sequential analysis adjustment of random error risk due to repetitive testing of accumulating data in apparently conclusive neonatal meta-analyses. Int J Epidemiol. 2009;38(1):287–98.
Nguyen T-L, Collins GS, Lamy A, Devereaux PJ, Daurès J-P, Landais P, et al. Simple randomization did not protect against bias in smaller trials. J Clin Epidemiol. 2017;84:105–13.
Borm GF, Donders ART. Updating meta-analyses leads to larger type I errors than publication bias. J Clin Epidemiol. 2009;62(8):825–30.
Wetterslev J, Thorlund K, Brok J, Gluud C. Trial sequential analysis may establish when firm evidence is reached in cumulative meta-analysis. J Clin Epidemiol. 2008;61(1):64–75.
Pogue JM, Yusuf S. Cumulating evidence from randomized trials: utilizing sequential monitoring boundaries for cumulative meta-analysis. Controlled clinical trials. 1997;18(6):580–93.
Batstra L, Hadders-Algra M, Neeleman J. Effect of antenatal exposure to maternal smoking on behavioural problems and academic achievement in childhood: Prospective evidence from a Dutch birth cohort. Early Human Dev. 2003;75(1–2):21–33.
Turner-Warwick M. Smoking and the Young: A report of a working party of the Royal College of Physicians. Tob Control. 1992;1(3):231–5.
McAndrew F, Thompson J, Fellows L, Large A, Speed M, Renfrew MJ. Infant Feeding Survey 2010. Health and Social Care Information Centre; 2012. Available from: http://www.hscic.gov.uk/catalogue/PUB08694/Infant-Feeding-Survey-2010-Consolidated-Report.pdf and http://digital.nhs.uk/catalogue/PUB08694.
Abroms LC, Ahuja M, Kodl Y, Thaweethai L, Sims J, Winickoff JP, et al. Text2Quit: results from a pilot test of a personalized, interactive mobile health smoking cessation program. Journal of health communication. 2012;17(sup1):44–53.
Abroms LC, Boal AL, Simmens SJ, Mendel JA, Windsor RA. A randomized trial of Text2Quit: a text messaging program for smoking cessation. Am J Prev Med. 2014;47(3):242–50.
Free C, Whittaker R, Knight R, Abramsky T, Rodgers A, Roberts IG. Txt2stop: a pilot randomised controlled trial of mobile phone-based smoking cessation support. Tobacco control. 2009;18(2):88–91.
Higgins JPT. Commentary. Heterogeneity in meta-analysis should be expected and appropriately quantified. Int J Epidemiol. 2008;37(5):1158–60.
Thorlund K, Imberger G, Johnston BC, Walsh M, Awad T, Thabane L, et al. Evolution of heterogeneity (I2) estimates and their 95% confidence intervals in large meta-analyses. PLoS ONE. 2012;7(7):e39471.
Bauer P, Bretz F, Dragalin V, König F, Wassmer G. Twenty-five years of confirmatory adaptive designs: opportunities and pitfalls. Stat Med. 2016;35(3):325–47.
Kulinskaya E, Wood J. Trial sequential methods for meta-analysis. Research synthesis methods. 2014;5(3):212–20.
Schmid C, Senn S, Sterne J, Kulinskaya E, Posch M, Roes K, et al. Should Cochrane apply error-adjustment methods when conducting repeated meta-analyses? Cochrane Scientific Committee; 2018. Available from: https://methods.cochrane.org/sites/default/files/public/uploads/tsa_expert_panel_guidance_and_recommendation_final.pdf.


Editorial decision: Major revision (13 Aug, 2020)
Review #2 received at journal (10 Aug, 2020)
Review #1 received at journal (27 Jul, 2020)
Reviewer #2 agreed at journal (22 Jul, 2020)
Reviewer #1 agreed at journal (29 Jun, 2020)
Reviewers invited by journal (25 Jun, 2020)
Editor invited by journal (16 Jun, 2020)
Editor assigned by journal (15 Jun, 2020)
Submission checks completed at journal (14 Jun, 2020)
First submitted to journal (12 Jun, 2020)


Status:

Journal Publication

Version 1
