SurvExtrap: A tool to obtain precise parametric survival extrapolations

doi:10.21203/rs.3.rs-1975445/v1

Download PDF

Software

SurvExtrap: A tool to obtain precise parametric survival extrapolations

https://doi.org/10.21203/rs.3.rs-1975445/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background

Economic evaluation of emerging health technologies is mandated by agencies such as the National Institute of Health and Care Excellence (NICE) to ensure their cost is proportional to their benefit. To avoid bias, NICE stipulate that the benefit of a treatment is assessed across the lifetime of the patient population, which can be many decades. Unfortunately, follow-up from a clinical trial will not usually cover the required period and the observed follow-up will require extrapolation. For survival data this is typically done by selecting a preferred model from a set of candidate parametric models. This approach is limited in that the choice of model is restricted to those originally fitted. What if none of the models are consistent with clinical prediction or external data?

Results

This paper introduces SurvExtrap, a tool that estimates the parameters of common parametric survival models which interpolate key survival time co-ordinates specified by the user, which could come from external trials, real world data or expert clinical opinion. This is achieved by solving simultaneous equations which are rearranged from the survival functions of the parametric models. The application of SurvExtrap is shown through two examples where traditional parametric modelling did not produce models that were consistent with external data or clinical opinion.

Conclusions

SurvExtrap allows precise parametric survival models to be estimated and carried forward into economic models. It provides access to extrapolations that are consistent with multiple data sources such as observed data and clinical predictions, opening the door to exploration of regions of uncertainty/disagreement. SurvExtrap could avoid the need for post-hoc adjustments such as applying background mortality or treatment switching often applied to obtain a plausible survival model. Phase III clinical trials are not designed with extrapolation in mind, and so it is sensible to consider alternative approaches that incorporate external information.

Emerging health technologies are mandated to demonstrate their clinical and cost-effectiveness by agencies such as the National Institute of Health and Care Excellence (NICE) to ensure their cost is proportional to their benefit. NICE has established thresholds which it compares treatments against to ensure fairness across the consideration of different health technologies and disease areas, and that the National Health Service (NHS) obtains value for money and is able to sustainably provide optimal healthcare.

To avoid bias when appraising a health technology, NICE stipulate that the benefit of a treatment is captured across the lifetime of the patient population, which can be many decades.¹ Unfortunately, follow-up from a clinical trial will not usually provide data for this lifetime period and the observed follow-up will require extrapolation in order for the treatment benefit to be estimated. For a time-to-event outcome, such as death, this is typically done by fitting a parametric model or other model type to the observed data, and extrapolating the model until virtually all patients are predicted to have had the event of interest.² A set of candidate models will be fitted to the data, and a preferred model will be selected by an assessment of their goodness-of-fit and the plausibility of their extrapolations. Alternative plausible models may be explored as a form of sensitivity analysis. Uncertainty around a certain may be explored in a probabilistic sensitivity analysis, by sampling randomly around the mean parameter estimates using the 95% confidence intervals around the parameters and their correlation.³

Limitations of this approach include the lack of options if no plausible extrapolations are yielded, forcing the modeller to pursue alternative approaches, or to compromise on a sub-optimal model. Such an approach assumes that the observed data will be representative of the routine use of the health technology. This may not be true, particularly in the case where the data has come from a clinical trial with strict inclusion criteria or other carefully controlled conditions. It also assumes that the follow-up data are sufficient to produce a model that accurately predicts the future survival of patients, despite the potential for a clear distinction between patients who respond well to therapy and those who do not. It is plausible that neither of these assumptions hold due to the uncertainty of future real-world efficacy and decreasing maturity of trial data included in technology appraisal submissions. Simulations have shown that extrapolation with parametric models can contain bias and/or high uncertainty.^4–6

Existing approaches of obtaining an extrapolation perhaps place too strong an emphasis on the fit of a survival model to the observed data, and do not account for external predictions of real-world efficacy. When a clinical trial was not designed with extrapolation in mind, it raises the question of whether current approaches are suitable for providing reliable estimates of effectiveness which contribute to the assessment of cost-effectiveness of a health technology. In fact, simulation studies have shown it may not be.

Implementation

This paper presents SurvExtrap (https://dgallacher.shinyapps.io/survextrap/), a tool produced using R Shiny which allows the user to specify population survival at key points and obtain a parameters for a survival model that produce a model with estimates consistent with those specified by the user. Instead of fitting to survival data, which may not represent real-world use nor be consistent with clinical predictions, SurvExtrap provides a means of obtaining a parametric survival model that is consistent with specified points that could be based on data from data based on trial or real-world evidence, or expert opinion.

Selecting a single point estimate from observed data for which there is sufficient confidence, such as the median survival or earlier, may be more than adequate to inform an extrapolation where as much as 94% of the treatment benefit is estimated from the extrapolated period and not supported by any observed data.³

SurvExtrap rearranges the survival function and solves it as a series of simultaneous equations using the rootSolve package, interpolating the points specified by the user. SurvExtrap is currently able to estimate parameters for the exponential, Weibull, log-logistic and Gompertz survival models, using the parameterisations as described in the flexsurv package. These forms allow a range of varying survival curve shapes and should provide the user with at least one model that is consistent with their data. The exponential distribution requires the user to specify a single point to interpolate, whilst the other parametric models have two parameters to estimate and so require the user to specify two points. In addition, SurvExtrap provides a visual representation of the resulting survival mode, demonstrating the successful interpolation of the specified point(s). It also allows the user to upload event-time and -type data to overlay a Kaplan-Meier plot to the parametric models to assess visually which model best represents the data.

SurvExtrap has at least two areas of application, which are demonstrated through two examples.

Example 1: Obtaining Consistency With An External Data Source

There is increasing demand for ways to incorporate into technology assessments information from data registries which boast much larger sample sizes and longer follow-up than clinical trials. However, access to patient level survival data may not be available. Conference abstracts are a common example where patient survival may be minimally reported, e.g. only be reported at 5 and 10 year milestones, without reporting any further information on survival rates at other times.⁷ Using SurvExtrap this information can easily be turned into a range of potential survival extrapolations, or it can be combined with a point estimate taken from an alternative data source, e.g. combining a clinical trial and a historical cohort.

In the technology appraisal TA519 of pembrolizumab for previously treated advanced or metastatic urothelial cancer, one key discussion point was the survival of the comparator population who received best-supportive care (BSC).⁸ The company’s extrapolations of the BSC data from their KEYNOTE 045 trial produced estimates that disagreed with the 5 year survival rates reported by Cancer Research UK (CRUK). Figure 1 demonstrates a visual representation of the problem, showing the inconsistency of the extrapolations and the CRUK data. This problem persisted even after the company applied an adjustment for the treatment switching that had occurred in the control arm. Whilst this could be explained by differences in baseline characteristics, there was still a desire to use a model that was consistent with the CRUK report, however it was not possible to get a reasonable extrapolation.

Using SurvExtrap and interpolating the median survival time as estimated by the Kaplan-Meier curve of the recreated unadjusted data for the BSC arm from KEYNOTE 045 (7.7 months) and the 5-year CRUK estimate (10%) provides a simple way of obtaining a model that is consistent with the data and with the external source. On this occasion a Gompertz model provided the best visual fit to the data (Fig. 2). The Gompertz model obtained using SurvExtrap appears an equivalent fit to the models fitted to the data. Any difference in the life-years estimated for the observed period would be negligible, and the reliability of the life-years estimated for the extrapolated period has improved considerably.

Example 2: Exploring Uncertainty

Consider the case where the uncertainty associated with the long-term efficacy of a therapy is high, with a wide disparity of estimates made by clinical experts about the survival of patients beyond the observed period. New and emerging cell gene therapies are relevant example of this. Typically, the uncertainty could be explored by exploring the uncertainty around the parameters of a particular model, or by varying the choice of survival model. However, in such a case, these may be unsatisfactory and fail to fully explore the uncertainty expressed by clinical experts.

Such an approach assumes that the unobserved survival outcomes can be predicted by those that have been observed, yet there might be a clear distinction between the mortality rates of these two groups that cannot be accurately estimated from the data.

Using hypothetical data, we show a range of parametric extrapolations (dashed – black) fitted to observed data show by the pink Kaplan Meier curve (Fig. 3). Beyond the observed period, there are three differing opinions on the long-term survival of the patient population. The Weibull model may be selected as it best suits the neutral opinion, but the possibility of the other opinions being right should also be considered. In this case, the Gompertz model fitted to the observed data could be a considered satisfactory to explore a pessimistic scenario, and the two log-models acceptable for an optimistic scenario. The problem with both of these assumptions is that neither are consistent with the opinions provided by the clinical experts. The curves for both scenarios overestimate survival relative to the opinions, the worst violation being the long-term prediction of the log models exceeding the expert’s predictions.

Using the SurvExtrap, and specifying interpolation of the points S(1.46) = 0.691 and either S(4.73) = 0.101 for the pessimistic scenario or S(12.7) = 0.108 for the optimistic scenario produced estimates of Weibull curve parameters that allowed modelling of the curves seen in Fig. 4. A comparison of the two shows that the models coming from SurvExtrap are much closer fits to the predictions made, whilst are also consistent to the observed data. No great care was taken when selecting these points, and the user could prioritise better fits to earlier or later points, depending on their preference and convergence of the solving algorithm.

SurvExtrap provides the modeller with greater flexibility and freedom to consider any potential extrapolation, releasing them from the typically limited set of parametric models fitted to observed data. This paper has shown two cases where SurvExtrap can markedly improve the available survival extrapolations which will result in more informative economic analyses. SurvExtrap cannot tell you which survival model is most appropriate, and this must be assessed through careful consideration of the visual fit to the data and plausibility of the extrapolation. Selection of interpolation points and model shapes should be performed in cooperation with robust evidence sources and expert clinical opinion. Understanding the underlying hazard rate of each model type is also key in selecting the optimal model. As SurvExtrap is not fitting to data there are no goodness-of-fit statistics to utilise, however the utility of statistics such as AIC and BIC has been shown to be limited.^{4, 5} Alternative methods such as dynamic modelling and mixture models may prove to yield improved extrapolations compared to traditional parametric techniques, but still require sufficient follow-up in order for an accurate extrapolation to be obtained.^{9, 10} An advantage of SurvExtrap is that it does not need access to mature follow-up from a single source to obtain a plausible model.

Planned improvements to SurvExtrap include functionality to estimate parameters that could inform a probabilistic sensitivity analysis, requiring the user to upload their data. Adding the ability to average across multiple models will give the user additional flexibility when seeking an optimal extrapolation.

Technology appraisal submissions are increasingly reliant on adjustments to populations to account for baseline differences or treatment switching. However, it is rare for the statistical modelling behind these approaches to be reported in sufficient detail for appraisers and decision-makers to be confident in their implementation. Reluctance to share patient data means that the analyses behind these often ad-hoc adjustments are typically more opaque than primary trial results. SurvExtrap could serve as a valuable tool to health-economists when such adjustments are not performed and reported transparently, allowing alternative scenarios to be modelled and their cost-effectiveness impact to be assessed.

It is vital to be able to estimate the benefit and value of treatments accurately, to ensure current and future healthcare is delivered sustainably. SurvExtrap offers a simple alternative to parametric extrapolation of data, allowing the exploration of uncertainty and providing a solution in cases where no plausible models are otherwise available. This is helpful when survival information comes from multiple non-combinable sources or is otherwise minimally available. SurvExtrap allows more precise modelling of treatment benefits and improves the reliability of cost-effectiveness assessments.

Availability and requirements

Project Name: SurvExtrap

Project Homepages: https://github.com/daniel-g-92/SurvExtrap https://dgallacher.shinyapps.io/survextrap/

Operating System: N/A (web application)

Programming Language: R

License: CC0 1.0 Universal

Ethics approval and consent to participate: Not applicable.

Consent for publication: Not applicable.

Availability of data and materials: Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

Competing interests: None.

Funding: No funding was received specific to this work.

Author Contributions: DG conceived the study idea, designed and created the tool, and authored the manuscript.

Acknowledgements: DG is grateful to Dr Martin Connock, Ewen Cummins and Prof Nigel Stallard who provided helpful comments in the development of the tool.

NICE. Guide to the methods of technology appraisal [PMG9]. 2013.
Bell Gorrod H, Kearns B, Stevens J, et al. A review of survival analysis methods used in NICE technology appraisals of cancer treatments: consistency, limitations and areas for improvement. Med Decis Making. 2019;39(8):899–909.
Gallacher D, Auguste P, Connock M. How Do Pharmaceutical Companies Model Survival of Cancer Patients? A Review of NICE Single Technology Appraisals in 2017. Int J Technol Assess Health Care. 2019;35(2):160–7.
Gallacher D, Kimani P, Stallard N. Extrapolating Parametric Survival Models in Health Technology Assessment: A Simulation Study. Med Decis Making. 2021;41(1):37–50.
Gallacher D, Kimani P, Stallard N. Extrapolating Parametric Survival Models in Health Technology Assessment Using Model Averaging: A Simulation Study. Med Decis Making. 2021;41(4):476–84.
Gallacher D, Kimani P, Stallard N. Biased Survival Predictions When Appraising Health Technologies in Heterogeneous Populations PharmacoEconomics. 2022;40(1):109–20.
Nio M, Ohi R, Miyano T, et al. Five- and 10-year survival rates after surgery for biliary atresia: a report from the Japanese biliary atresia registry. J Pediatr Surg. 2003;38(7):997–1000.
Gallacher D, Armoiry X, Auguste P, et al. Pembrolizumab for Previously Treated Advanced or Metastatic Urothelial Cancer: An Evidence Review Group Perspective of a NICE Single Technology Appraisal. PharmacoEconomics 2019;37(1):19–27.
Kearns B, Stevenson MD, Triantafyllopoulos K, et al. Comparing current and emerging practice models for the extrapolation of survival data: a simulation study and case-study. BMC Med Res Methodol. 2021;21(1):263.
Klijn SL, Fenwick E, Kroep S, et al. What Did Time Tell Us? A Comparison and Retrospective Validation of Different Survival Extrapolation Methods for Immuno-Oncologic Therapy in Advanced or Metastatic Renal Cell Carcinoma. PharmacoEconomics 2021.

Download PDF

Version 1

posted

You are reading this latest preprint version

SurvExtrap: A tool to obtain precise parametric survival extrapolations

Status:

Version 1

Abstract

Figures

Background

Implementation

Results

Example 1: Obtaining Consistency With An External Data Source

Example 2: Exploring Uncertainty

Discussion

Conclusions

Declarations

References

Status:

Version 1