Indirect comparison of survival data based on the Shiny method: the role of control groups in the assessment of heterogeneity

doi:10.21203/rs.3.rs-2006322/v1

Download PDF

Short Report

Indirect comparison of survival data based on the Shiny method: the role of control groups in the assessment of heterogeneity

https://doi.org/10.21203/rs.3.rs-2006322/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Objective: New techniques have recently been developed to reconstruct patient-level data from Kaplan-Meier curves. Based on these “reconstructed patients”, indirect comparisons can be made to rank the effectiveness of the main treatments available. In these studies, the assessment of heterogeneity is a crucial phase. Our objective was to set up an approach to evaluate heterogeneity when reconstructed patients represent the clinical material.

Results: Two data sets are analyzed: the first regards treatments for triple-negative breast cancer (3 trials, 3 treatments), while the second focuses on nonmetastatic prostate cancer (3 trials, 3 treatments and placebo). Heterogeneity has been quantified according to the likelihood ratio test. In the first case, a significant degree of heterogeneity is found (likelihood ratio test, 12.94; df, 1; p=0.0003); consequently, the results of indirect comparisons risk being misleading and therefore require a quite complex interpretation. By contrast, in the second case, the lack of heterogeneity (likelihood ratio test, 1.17; df 2, p=0.60) suggests that the results of indirect comparisons are reliable. Further research is needed to define the most appropriate cut-off in the likelihood ratio test to identify cases where heterogeneity has a relevant impact.

indirect comparisons

time-to event endpoints

Kaplan-Meier curves

Shiny method

reconstructed individual patient data

heterogeneity assessment

A growing literature has accumulated over the past two years regarding indirect between-treatment comparisons based on reconstructed patient data [1]. The typical setting refers to a small number of clinical trials (typically, 3 to 6) that have separately evaluated different treatments aimed at the same disease condition. The clinical endpoint must be time-to-event (e.g. overall survival [OS} or progression-free survival); the original studies must have reported a Kaplan-Meier graph in the presentation of clinical results; the inclusion criteria must be essentially the same across included trials. In these cases, an artificial intelligence online tool (IPDfromKM [2], sometimes denoted as Shiny method) can be used to generate reconstructed patient data from the Kaplan-Meier graph. In this context, the final result of the analysis is a new and original Kaplan-Meier curve that evaluates all treatments in the same graph along with relevant statistics.

Initial experiences with this approach [1] have shown that the main weakness in these analyses is represented by the risk of heterogeneity across included trials. The assessment of this risk can be based on different approaches:

a careful evaluation of the criteria for enrolling patients in the different trials
employing a more conservative statistical approach to perform indirect comparisons (Messori et al., unpublished observations)
performing a separate heterogeneity assessment in which all control groups of included trials are compared with one another so that the overall statistical variability of these data sets can be estimated. For this purpose, we propose herein to use the likelihood ratio test; this test has a general validity in meta-analysis and can be used as a tool to assess homogeneity/heterogeneity across studies [3]

In the present article, we examine the third of these options by reporting two case studies.

Two experiences are described in which the heterogeneity assessment has been applied to data sets obtained from reconstructed patient-level data. In the first case, the high degree of heterogeneity, identified by the likelihood ratio test, was found to influence the results of indirect comparisons. In the second case, the degree of heterogeneity was low, thus suggesting full reliability of the results of indirect comparisons.

Case study n°1: first-line treatments for triple-negative advanced breast cancer

Both atezolizumab and pembrolizumab are known to significantly improve overall survival (OS) in patients with PDL-1 positivity at values of CPS ≥ 10 [4–6]. On the other hand, the controversy about the relative effectiveness of these two agents arises when we consider all PDL-1 positive patients [7]. In this population, pembrolizumab did not determine any significant survival benefit, whereas atezolizumab induced a significant prolongation of OS [4–6]. This result has found confirmation in a patient-level pooled analysis of all PDL-1 positive patients enrolled in KEYNOTE-355, IM-PASSION-130 and IM-PASSION-131 trials [7]; this analysis found a significant difference in OS in favor of atezolizumab vs pembrolizumab (hazard ratio [HR], 0.73; 95% confidence interval [CI], 0.61 to 0.87, p < 0.001; median, 20.4 vs 15.5 months). This result was obtained through an indirect comparison performed according to the Shiny method combined with the IPDfromKM tool [2]).

One hypothesis to explain this finding in the absence of a true difference between the two agents is that the population given pembrolizumab had worse prognostic characteristics than that treated with atezolizumab (or vice-versa). To assess this hypothesis, one suitable method is to perform an indirect comparison across the three control groups of the three trials. In more detail, the controls of the KEYNOTE-355 trial received chemotherapy (N = 211) whereas, in the two trials on atezolizumab (IM-PASSION-130 and IM-PASSION-131), the controls received nab-paclitaxel (N = 184) and paclitaxel alone (N = 101), respectively.

In a preliminary assessment (detailed data not shown), we verified that there was a very similar survival pattern between the controls treated with nab-paclitaxel (N = 184) in the IM-PASSION-130 trial and those treated with paclitaxel in the IM-PASSION-131 trial (HR in favor of the former control group, 0.91; 95%CI, 0.64 to 1.28; P = 0.57). Hence, these two patient groups were pooled into a single control group of 285 patients.

Thereafter, in comparing the controls of the two atezolizumab trials (N = 285) vs those of the pembrolizumab trial (N = 211), the hazard ratio (HR) in favor of the former control group was estimated to be 0.67 (95%CI, 0.54 to 0.83, p < 0.001). Figure 1A shows the Kaplan-Meier curves of this indirect comparison. Figure 1B shows the heterogeneity assessment based on the analysis of the control groups. The likelihood ratio test is 12.94 (df, 1; p = 0.0003) and the p-value is lower than threshold of 0.05; this indicates that a homogeneous model is inappropriate or, in other words, that a significant heterogeneity is present.

Case study n°2: first-line treatments for advanced or metastatic prostate cancer.

Numerous treatments have been developed for nonmetastatic castration-resistant prostate cancer [8–10]. Because direct comparisons between these treatments are not available, indirect comparisons can be of interest. The analysis conducted by Rivano et al. [11] evaluated second-generation hormone treatments proposed for this disease condition (namely, apalutamide, darolutamide, and enzalutamide). Three phase-III studies were studied; details about these studies are reported in Supplementary Table 2.

As shown in Fig. 2A, apalutamide (HR, 0.75, 95%CI: 0.64–0.88), darolutamide (HR: 0.70, 95%CI: 0.58–0.84) and enzalutamide (HR, 0.77, 95%CI: 0.65–0.90) were all significantly more effective than the controls given placebo. Our results showed no difference in OS between any of these three active agents.

To assess heterogeneity, comparisons across the controls of the 3 included trials are shown in Fig. 2B. The likelihood ratio test was 1.17 (df 2, p = 0.60); this result clearly shows that there is no significant heterogeneity in these data sets. Furthermore, using the controls of the apalutamide trial as common comparator, the following values of HR were estimated: i) controls of the darolutamide trial, HR = 1.09 (95%CI, 0.85 to 1.40; p = 0.48); ii) controls of the enzalutamide trial, HR = 1.13 (95%CI, 0.90 to 1.43; p = 0.29).

In estimating heterogeneity from survival data sets obtained from reconstructed patient data, the approach described herein is, to our knowledge, the first reported in the literature. Regarding the limitations, one should keep in mind that the two experiences described above are the first in which a formal method of heterogeneity assessment (the likelihood ratio test) has been employed in the context of the Shiny analysis. While further experiences in this area are needed, the two case studies described in this paper anyhow suggest a specific operational procedure in this area, which deserves to be tested in further analyses.

The Shiny method, combined with a design of indirect comparisons, is increasingly used to interpret effectiveness data based on time-to-event endpoints [1]. On the other hand, the likelihood ratio test, already used for other purposes in survival statistics, is proposed herein for application together with the Shiny method.

All in all, the combined application of these two techniques will need a more thorough assessment in terms of pros and cons. Interestingly enough, a recent paper [12] has proposed a new strategy to apply the Shiny method (called one-to-many approach), which will likely determine a further increase in the use of reconstructed patient-level data.

HR, hazard ratio

CI, confidence interval

df, degrees of freedom

OS, overall survival

CPS, Combined Positive Score

Ethics approval and consent to participate: this analysis is based on previously published secondary data.
Consent for publication: not applicable.
Availability of data and materials: the data sets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Competing interests: the author declares that he has no competing interests.
Funding: not applicable.
Authors' contributions: single-author paper.
Acknowledgment: not applicable.

Messori A. Application of the Shiny method in the analysis of survival curves: a synopsis of 16 references (preprint). Open Science Framework, 2022, published 15 August, url https://osf.io/4nyah
Liu N, Zhou Y, Lee JJ. IPDfromKM: reconstruct individual patient data from published Kaplan-Meier survival curves. BMC Med Res Methodol. 2021 Jun 1;21(1):111. doi: 10.1186/s12874-021-01308-8. PMID: 34074267; PMCID: PMC8168323
Hu D, Wang C, O'Connor AM. A likelihood ratio test for the homogeneity of between-study variance in network meta-analysis. Syst Rev. 2021 Dec 9;10(1):310. doi: 10.1186/s13643-021-01859-3. PMID: 34886897; PMCID: PMC8662889.
Rugo H. Final results of KEYNOTE-355 (LBA16): A randomized, double-blind, phase-3 study of pembrolizumab + chemotherapy vs placebo + chemotherapy for previously untreated locally recurrent inoperable or metastatic triple negative breast cancer. OncologyPRO – ESMO Congress 2021, url https://oncologypro.esmo.org/oncology-news/esmo-videos/esmo21-highlights-on-combination-of-pembrolizumab-plus-chemotherapy-in-mtnbc-the-keynote-355-study ex 1
Emens LA, Adams S, Barrios CH, et al. First-line atezolizumab plus nab-paclitaxel for unresectable, locally advanced, or metastatic triple-negative breast cancer: IMpassion130 final overall survival analysis Ann Oncol. 2021;32(8):983–993. doi:10.1016/j.annonc.2021.05.355 [published correction appears in Ann Oncol. 2021 Dec;32(12):1650].
Miles D, Gligorov J, André F, et al. Primary results from IMpassion131, a double-blind, placebo-controlled, randomised phase III trial of first-line paclitaxel with or without atezolizumab for unresectable locally advanced/metastatic triple-negative breast cancer. Ann Oncol. 2021;32(8):994–1004. doi:10.1016/j.annonc.2021.05.801
Di Spazio L, Rivano M, Cancanelli L, Chiumente M, Mengato D, Messori A. The Degree of Programmed Death-Ligand 1 (PD-L1) Positivity as a Determinant of Outcomes in Metastatic Triple-Negative Breast Cancer Treated With First-Line Immune Checkpoint Inhibitors. Cureus. 2022 Jan 9;14(1):e21065. doi: 10.7759/cureus.21065, https://pubmed.ncbi.nlm.nih.gov/35028245/ url https://assets.cureus.com/uploads/review_article/pdf/82270/20220110-16971-mzer54.pdf
Smith MR, Saad F, Chowdhury S, et al. Apalutamide and Overall Survival in Prostate Cancer. Eur Urol. 2021;79:150–8. doi: 10.1016/j.eururo.2020.08.011
Fizazi K, Shore N, Tammela TL, et al. Nonmetastatic, Castration-Resistant Prostate Cancer and Survival With Darolutamide. N Engl J Med. 2020;383:1040–9. doi: 10.1056/ NEJMoa2001342
Sternberg CN, Fizazi K, Saad F, et al. Enzalutamide and Survival in Nonmetastatic, Castration-Resistant Prostate Cancer. N Engl J Med. 2020;382:2197–206. doi: 10.1056/ NEJMoa2003892
Rivano M, Cancanelli L, DI Spazio L, Mengato D, Chiumente M, Messori A. Survival with novel hormonal therapies in patients with nonmetastatic castration-resistant prostate cancer: indirect comparison of three randomized Phase III trials. World Journal of Urology 2022 (in press), available at http://www.osservatorioinnovazione.net/papers/worldjournalurology2022.pdf
Messori A, Rivano M, Cancanelli L, et al. (August 25, 2022) The “One-to-Many” Survival Analysis to Evaluate a New Treatment in Comparison With Therapeutic Alternatives Based on Reconstructed Patient Data: Enfortumab Vedotin Versus Standard of Care in Advanced or Metastatic Urothelial Carcinoma. Cureus 14(8): e28369. doi:10.7759/cureus.28369, url https://www.cureus.com/articles/110065-the-one-to-many-survival-analysis-to-evaluate-a-new-treatment-in-comparison-with-therapeutic-alternatives-based-on-reconstructed-patient-data-enfortumab-vedotin-versus-standard-of-care-in-advanced-or-metastatic-urothelial-carcinoma?medium=email&src=email_share&utm_campaign=share&utm_medium=email&utm_source=email_share_mailer

No competing interests reported.

Supplementarymaterial.pdf

Download PDF

Version 1

posted

You are reading this latest preprint version

Indirect comparison of survival data based on the Shiny method: the role of control groups in the assessment of heterogeneity

Status:

Version 1

Abstract

Figures

Introduction

Main Text

LIMITATIONS

List of abbreviations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1