Safety analysis of new medications in clinical trials: A simulation study to assess the differences between cause-specific and subdistribution frameworks in the presence of competing events

doi:10.21203/rs.3.rs-2475247/v1

Download PDF

Research Article

Safety analysis of new medications in clinical trials: A simulation study to assess the differences between cause-specific and subdistribution frameworks in the presence of competing events

https://doi.org/10.21203/rs.3.rs-2475247/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 12 Jul, 2023

Read the published version in BMC Medical Research Methodology →

You are reading this latest preprint version

Safety is an essential part of the evaluation of new medications and competing risks that occur in most clinical trials are a well identified challenge in the analysis of adverse events. Two statistical frameworks exist to consider competing risks: the cause-specific and the subdistribution framework. To date, the application of the cause-specific framework is the standard practice in safety analyses. Here we analyze how the safety analysis results of new medications would be affected if instead of the cause-specific the subdistribution framework was chosen. We conducted a simulation study with 600 participants, equally allocated to verum and control groups and a 30 months follow-up period. Simulated trials were analyzed for safety in a competing risk (death) setting using both the cause-specific and subdistribution frameworks. Results show that comparing safety profiles in a subdistribution setting is always more pessimistic than in a cause-specific setting. For the group with the longest survival and a safety advantage in a cause-specific setting, the advantage either disappeared or a disadvantage was found in the subdistribution analysis setting. These observations are not contradictory but show different perspectives. To evaluate the safety of a new medication over its comparator, one needs to understand the origin of both the risks and the benefits associated with each therapy. These requirements are best met with a cause-specific framework. The subdistribution framework seems better suited for clinical prediction, and therefore more relevant for providers or payers, for example.

Competing risks

Drug development

Cause-specific versus subdistribution hazard ratio

Safety analysis

Simulation study

Safety data are an essential part of the clinical evaluation of new medicinal products and regulatory submissions. However, their analysis might be challenged by the existence of competing risks. These are intercurrent events, defined as mutually exclusive events (death, other adverse events, change of treatment, noncompliance, end of study, etc.) whose occurrence precludes the event of interest from happening (Allignol, Beyersmann, & Schmoor, 2016). Competing risks are common. They are present in the vast majority of clinical trials (Koller, Raatz, Steyerberg, & Wolbers, 2012; van Walraven & McAlister, 2016) and might bias the results (Schuster, Hoogendijk, Kok, Twisk, & Heymans, 2020; van Walraven & McAlister, 2016). They represent a well-recognized problem in the analysis of adverse events (Stegherr, Beyersmann, et al., 2021; Stegherr, Schmoor, et al., 2021) and general recommendations urge the use of survival techniques that methodically account for the presence of competing risks (European Medicines Agency, 2020; Koller et al., 2012; Stegherr, Beyersmann, et al., 2021; Stegherr, Schmoor, et al., 2021). These techniques acknowledge that for a given adverse event there are other types of risks that occur at the same time.

The standard survival data situation corresponds to a Markov process with the two states: “event-free” and “event”. Splitting the “event” state into more states corresponding to different causes (“event 1”, “event 2”, “dead”, etc.) results in a Markov model for competing risks (Borgan, 1997). The analytical object in the presence of competing risks is the same as in standard marginal survival analysis: to estimate the probabilities, also named risks, and hazard rates of the event of interest over time and, if relevant, to assess whether there are differences between groups. However, a competing risks setting that extends the capabilities of analysis of two state survival models to deal with multi-state models (cf. Figure 1) is required, when subjects can experience more than one event (Therneau, Crowson, & Atkinson, 2020). The risk of the event of interest over time is estimated among the risk of other competing events whose occurrence precludes it from happening. The concepts of risks and rates generalize easily to the competing risk situation: hazard rates become cause-specific hazard rates and risks become cumulative incidences (Andersen, Geskus, de Witte, & Putter, 2012).

Two statistical frameworks exist to perform survival analysis in the presence of competing risks: the cause-specific and the subdistribution settings. All standard methods for survival data apply to the cause-specific setting (Geskus, 2016; Putter, Fiocco, & Geskus, 2007) which focuses on the cause-specific hazard function. This function estimates the probability of each type of event separately, right-censoring individuals at the time of the competing event, as well as for loss of follow-up, withdrawal, or at the end of the observation time. For the subdistribution setting, specific approaches were developed that based on the cumulative incidence function (Fine & Gray, 1999; R. J. Gray, 1988). This function focuses on the cumulative incidence (or “subdistribution”) from a particular cause and does not treat competing events as censored observations.

These settings differ in their definitions. The aim of this study is to compare their properties and to recommend how to perform safety analyses in clinical research and regulatory submissions. We investigate whether systematic differences exist between the estimates obtained with each approach and define to what extent the interpretation of the results of survival analysis depends on the choice of one or the other setting. For both settings non-parametric approaches (Borgan, 1997; Edwards, Hester, Gokhale, & Lesko, 2016; R. J. Gray, 1988; Schuster et al., 2020) as well as regression models (Fine & Gray, 1999; Klein, van Houwelingen, Ibrahim, & Scheike, 2013) exist. Classical hazard-based methods for survival data apply when analyzing cause-specific hazards: Kaplan-Meier and Nelson-Aalen estimators as well as the Cox proportional hazards regression model. These methods, however, do not allow to draw inference for subdistribution functions of competing risks. Specific approaches were developed: the Aalen-Johansen estimator and the Fine and Grey model. This paper focuses on (semi-) parametric approaches: cause-specific (Cox regression) and subdistribution hazard regression (Fine & Gray model). Both offer two major advantages in comparison to the non-parametric approaches. First, they allow to adjust for covariates when assessing and comparing event probabilities over time and thus provide more insight into the mechanisms that lead to the occurrence of an event. Second, they allow to use a fitted model to make predictions (e.g., for certain attributes of the population under study).

In the methods section, we provide a brief, nontechnical description of the cause-specific and subdistribution settings in survival analysis. Detailed technical descriptions can be found elsewhere (Borgan, 1997; Fine & Gray, 1999; Klein et al., 2013; Therneau & Grambsch, 2000). A short introduction to the non-parametric estimators can be found in the additional file 1. To examine and to compare the properties of the cause-specific and subdistribution settings in survival analysis, a simulation study was conducted. It covers all possible practical outcomes: from superiority to inferiority of the medical intervention and from small to large effect sizes. In the results section, we report the results of safety analyses performed on each simulated dataset with a cause-specific and a subdistribution setting. Finally, the results are discussed in terms of their practical implications and relevance for safety analyses of new medicinal products and regulatory submissions.

2.1 Cause-specific hazard regression

The Cox proportional hazards model (Cox, 1972) is expressed by the hazard function $h\left(t\right)$ presented in Eq. 1. In this model, $h\left(t\right)$ is determined by a set of covariates and expressed as:

$h\left(t\right)= {h}_{0}\left(t\right) \times {e}^{\beta {\prime }{X}}$ 1

where $t$ is the time, ${X}$ is a vector of covariates, $\beta {\prime }$ is the vector of regression coefficients that measures the effect size of each covariate on the hazard and ${h}_{0}\left(t\right)$ is the baseline hazard, under the assumption that all explanatory variables are either set to zero (${X}$=0) or represent average values. The quantities of interest in the Cox model are the hazard ratios (HR) ${e}^{{\beta }_{j}}$, where j $\in$ $\left\{1, 2, \dots , c\right\}$represents the $c$ covariates considered in the analysis. HR are relative measures of an effect between different values taken by a covariate. They do not provide any information about the absolute risks. A very common covariate investigated in medical and pharmacological research is the group attribution. In this case, in a binary setting with two groups, the HR associated with this covariate is the ratio of the rates of occurrence of an event in both groups. A value equal to one indicates no differences between the groups, a value of less than one indicates a higher and a value of more than one a lower rate of occurrence in the reference group. The cause-specific Cox proportional hazard model is a natural extension of the standard Cox proportional hazard regression where a model is fitted separately to each cause-specific hazard by censoring all individuals who experienced one of the competing risks before the event of interest.

2.2 Subdistribution hazard regression

While in the Cox model the hazard for the event of interest only depends on its own (cause-specific) hazard, Fine and Gray (1999) proposed a model that expresses an instantaneous hazard function $h\left(t\right)$ by the cumulative (subdistribution) hazard function $F\left(t\right)$ that is described in Eq. 2. The subdistribution model contains an additive component and the instantaneous risk of occurrence of the event of interest $k$, ${\text{F}}_{k}\left(t\right)$, depends on all cause-specific risks. It can be expressed as:

${\text{F}}_{k}\left(t\right)= {\text{F}}_{0,\text{k}}\left(\text{t}\right) \times {\text{e}}^{\sum _{i}{{\gamma }}_{i}{\mathbf{X}}_{{i}}}$ 2

where $k$ is the event of interest while i$\in \left\{1, 2, \dots , n\right\}$ represents all the competing events (including $k$) considered in the analysis. Analogous to the expression of the Cox model presented in Eq. 1, $t$ is the time and, ${X}$ is a vector of covariates. Similar to$\beta$ in the Cox model, $\gamma$ is the vector of regression coefficients measuring the effect size of each covariate on the cumulative hazard. ${F}_{0, k}\left(t\right)$ is the baseline cumulative hazard, that is the cumulative hazard under ${X}$=0. The regression coefficients $\gamma$ can be interpreted in a similar way as the $\beta$ from a Cox model, except that they are relative measures of risk between the values of certain covariate, taking into account competing events occur that preclude the occurrence of the event of interest. This means that the size of the effect due to each competing event on the HR for the event of interest cannot be isolated. It should be noted that the model considers an extended risk set where individuals are still at risk for the event of interest even after they experienced the competing risk. Fine and Gray acknowledged that this is unnatural but necessary in order to get a model that correctly predicts cumulative incidence functions (Fine & Gray, 1999).

2.3 Simulation study

A simulation study was conducted to investigate the differences in the results of safety analysis performed in presence of competing risks when the subdistribution setting is chosen instead of the standard cause-specific setting. Three possible outcomes were considered, i. e. (1) superiority of the verum group compared to the control group, (2) inferiority of the verum group compared to control group, and (3) equivalence between both groups.

As commonly done in biometrics, HR was estimated to compare the risk of occurrence of the adverse event of interest between the verum and control groups (Collet, 2015; Zwiener, Blettner, & Hommel, 2011). HR in the cause-specific setting (HR_cs) was fitted by a Cox regression model and in the subdistribution setting (HR_sd) by a Fine and Gray model.

The following assumptions were made:

i. Each study comprised 600 patients allocated into two study groups (verum and control) in a 1:1 ratio.

ii. Two competing event types (the adverse event of interest and death as competing risk) were simulated with event times for both types following an exponential distribution. We selected a common and simple one-parameter event time distribution that implies a time constant hazard rate $h$(t)= $h$ that makes it easy to control the characteristics of the simulated data (Klein & Moeschberger, 2003). The hazard ratesof the exponential distributions were defined according to the targeted median time to event ${t}_{\text{0,5}}$ for a given treatment group and a given event type:

$$h=\frac{log\left(2\right)}{{t}_{\text{0,5}}}$$

iii. Administrative censoring occurred after 30 months if neither the primary event (adverse event of interest) nor the competing event (death) had occurred in a patient by then.

iv. The characteristics of the distribution of the competing event death were kept constant across all simulated scenarios. Median survival was set to 20 and 10 months for the verum and control groups, respectively, which corresponds to a HR of 0.5, in favor of the verum group.

v. Median time to first adverse event was incremented in one-month intervals between 1 and 20 months in the verum and control groups resulting in 400 patterns (i. e. 20x20), hereafter referred to as “conditions of interest”.

The following statistics were reported for the competing risk analysis of each condition of interest:

i. Median time to adverse event (AE), median time to death and their corresponding standard deviations;

ii. HR_cs and HR_sd, their corresponding 95% confidence intervals and two-sided p-values to investigate for group differences.

The simulation of each of these conditions of interest was repeated 1 000 times. The statistics were separately assessed on each of the 1 000 datasets generated for each condition on interest and then pooled together according to Rubin’s rule (Rubin, 1987; cf. Figure 2).

2.4 Presentation of results

For an initial assessment of whether changing the setting from cause-specific to subdistribution leads to a change in the three possible outcomes of the safety analysis performed in a competing risk setting, results were classified into nine possible categories. These categories were defined in a two-step process:

First, starting from a cause-specific setting, the results of survival analysis of the 400 conditions of interest were classified into the possible outcome categories:

(a) Superiority if HR_cs<1 and p-value ≤ 0.05

(b) Inferiority if HR_cs>1 and p-value ≤ 0.05

Second, for each condition of interest, we assessed whether HR_sd fell in the same outcome category as the HR_cs or in one of the two possible alternative categories. This resulted in nine possible outcome categories when switching from the cause-specific to the subdistribution setting. The proportion of the 400 conditions of interest falling in each of these nine possible categories was reported.

Heat maps provide a graphical overview of the results, from the classification of the 400 conditions of interest to the observed differences between the true HR and the outcomes of the survival analysis performed in both the cause-specific and subdistribution settings.

2.5 Software

All analyses were conducted using R version 3.6.1 (R.CoreTeam, 2019). The R package survival was used to fit the Cox proportional hazards model (Therneau & Grambsch, 2000; Therneau, Lumley, Elizabeth, & Cynthia, 2021). The R package cmprsk was used to fit the Fine and Gray model (B. Gray, 2020). The pooled-analysis of the parameters of the 1 000 simulated repetitions for each of the 400 condition of interest was done by the R package mice (van Buuren & Groothuis-Oudshoorn, 2011). Heat maps for the graphical presentation of results were created with the R package ggplot2 (Wickham, 2016). Detailed information on how to use these R packages can be found in the original publication for each package.

The true value of the HR (HR_true) for safety analysis, defined as the entry value given to simulate the occurrence of safety events, is known. The HR of the competing risk death was kept constant to 0.5 in favor of verum over all simulations. Each condition of interest was categorized according to HR_true: superiority (HR_true<1), inferiority (HR_true>1) or equivalence (HR_true=1) of the verum group compared to the control group. In 47.5% of the simulated conditions of interest, verum was safer than control (superiority). The amount of the conditions of interest where verum was less safe than control (inferiority) was the same. In the remaining 5% of simulated conditions of interest, verum and control were equally safe (equivalence) (cf. Figure 3).

Estimated competing risks HR_cs of the 400 conditions of interest ranged from 0.05 to 20.16 and HR_sd from 0.15 to 11.36.

As is detailed in Fig. 3, 35.8% of the competing risks safety analysis performed in a cause-specific setting resulted in superiority, 35.8% in inferiority and 28.4% in equivalence of the verum group compared to the control group. The slight differences between HR_cs and HR_true can be easily explained. Unlike HR_true, HR_cs is calculated in a competitive risks setting where patients who experienced the competing event (death) before the event of interest were censored. Censoring leads to a reduced number of patients, especially towards the end of the observation period which may as well reduce the statistical power needed to detect existing differences between the treatment groups.

Among the conditions of interest that show superiority of verum in a cause-specific setting (n = 143), 62.2% still showed superiority when analyzed in a subdistribution setting (category 1). For the remaining 37.8%, the superiority of the verum group disappeared in the subdistribution setting. Statistical tests were not significant and the outcome category changed from superiority of verum to equivalence (category 2). A change from superiority of verum in the cause-specific setting to inferiority of verum in the subdistribution setting was not observed (category 3; cf. Figure 3).

Of the conditions that showed equivalence between verum and control in a cause-specific setting (n = 114), none showed superiority when analyzed in a subdistribution setting (category 4). Equivalence between the two treatment groups remained in about half of the conditions of interest (47.4%; category 5) while the other half (52.6%, category 6) turned to inferiority of verum when analyzed in a subdistribution setting (cf. Figure 3).

Among the conditions of interest that resulted in inferiority of verum in a cause-specific setting (n = 143), all remained significantly disadvantageous for verum (category 9) when analyzed in a subdistribution setting, the other possible outcomes of category 7 and category 8 were not observed in the study (cf. Figure 3).

The heat map of panel (A) in Fig. 4 presents the HR_true of the entry values for the 400 conditions of interest in the simulated safety study with a constant HR of the competing risk (death) of 0.5 in favor of the verum group. The verum group is superior to the control group if the median time to first adverse event is longer than in the control group (HR_true<1, green shadings, lower right part). Conversely, the verum group is inferior to the control group if the median time to first adverse event is shorter in the verum group than in the control group (HR_true>1, red shadings, upper left part). If the median time to first adverse event is the same for both groups, they are considered equivalent (HR_true=1, yellow shadings, the diagonal separating lower right and upper left parts). Figure 4, panel (B) shows for each condition of interest in the simulation study in the cause-specific setting the ratios of HR_cs and HR_true. Accordingly, a ratio around the value of 1 (yellow shadings) indicates no difference between HR_cs and HR_true. This is observed for most ratios of the 400 conditions of interest in the simulated safety study in a cause-specific setting; independent of median time to first adverse event in verum and control groups. For ratios with values less than 1 (green shadings), HR_cs is lower than HR_true. This is observed for some conditions of interest, especially when the median time to first adverse event is much higher in the verum than in the control group. Only in these cases is a deviation in HR_cs observed in favor of the verum group. Ratios with values above 1 indicates higher HR_cs than HR_true values which is not observed in the simulated data.

Finally, the heat map in Fig. 5 indicates the categories into which the conditions of interest are classified according to the HR_sd from simulated safety analysis after switching from the cause-specific to the subdistribution setting (see also Fig. 3).

The safety analysis in the subdistribution setting, as in the cause-specific setting, resulted in superiority of the verum group over the control group if the median time to first adverse event in the control group is short and occurs earlier than in the verum group (red shading in Fig. 5; corresponds to category 1 in Fig. 3). However, when the median time to first adverse event in the control group increases, but is still shorter than in the verum group, results in the subdistribution setting no longer show superiority, but equivalence between both groups (light brown shading in Fig. 5; corresponds to category 2 in Fig. 3).

For the conditions of interest, for which in the cause-specific setting the safety in both groups was equal, the analysis in the subdistribution setting also shows equivalence, if the median time to first adverse event for both groups is close to each other (dark brown shading in Fig. 5; corresponds to category 5 in Fig. 3). If the median time to first advent event however is earlier in the verum than in the control group, the outcome changes from equivalence in the cause-specific setting to inferiority of the verum group in the subdistribution setting (light blue shading, in Fig. 5; corresponds to category 6 in Fig. 3).

For all conditions of interest, for which in the cause-specific setting an inferiority of the verum group was the result of safety analysis, this is also confirmed in the subdistribution setting (dark blue shading in Fig. 5; corresponds to category 9 in Fig. 3).

All other possible outcome categories (3), (4), (7), and (8) of safety analysis when switching from cause-specific to subdistribution setting are not present in the simulation study data (see also Fig. 3).

In sum, the results of the simulated safety analysis with death as a competing event show that comparing safety profiles in a subdistribution setting is always more pessimistic than in a cause-specific setting. For the group with the longest survival and the safety advantage there is either no more advantage or a newly found disadvantage compared to its analysis in the cause-specific setting.

4.1 Understanding the etiology of risks for clinical evaluation

Defining the benefit/risk balance of medications in comparison to that of the standard of care in a given indication, implies to understand the origin of both the risks and the benefits associated with each therapy. The decision is based on acceptable trade-offs. Addressing epidemiological questions of etiology has long been recognized as the strength of the cause-specific setting (Austin, Allignol, & Fine, 2017; Austin, Lee, & Fine, 2016; Lau, Cole, & Gange, 2009; Putter et al., 2007; Schuster et al., 2020; Van Der Pas, Nelissen, & Fiocco, 2018), because of the censoring at the competing event. Censoring equals “disallowing” competing events so that censored patients could still experience the event of interest. Considering this hypothetical population, in which the event of interest would eventually happen for everyone, prevents competing events to get in the way when one is interested in comparing instantaneous rates of occurrence of the event of interest, between an intervention and its comparator. However, this hypothetical population may not be suitable for all research questions. For instance, it may not be of interest for providers, payers, or policymakers who need to predict the burden on human and financial resources of clinical events on patients enrolled in the care system (Austin et al., 2016; Pepe & Mori, 1993). In this case, the subdistribution setting that acknowledges that the event of interest will not happen for everyone, because competing risks can preclude it, is more appropriate. The subdistribution analysis points out the treatment with the lowest probability of all types of events within a given time frame. This outcome is best suited for clinical prediction (Austin et al., 2017; Austin et al., 2016; Lau et al., 2009; Putter et al., 2007; Schuster et al., 2020; Van Der Pas et al., 2018).

4.2 Cause-specific and subdistribution framework when survival competes safety

The results of our simulation study give a clear picture of the differences between both safety analysis settings. When analyzing safety data, prolonged survival in one group will mostly translate into a higher probability of adverse events in a subdistribution setting, where the risk is assessed by combining the hazards of all competing events within a single cumulative incidence function over the entire follow-up period. Our simulation shows that the results of the subdistribution analysis are always more pessimistic than the results of the cause-specific analysis. For the group with the longest survival and the safety advantage a change of the analysis setting translates in either a smaller advantage, no more advantage, a larger or even a newly found disadvantage.

However, the outcomes of neither the cause-specific nor the subdistribution settings are biased, they just answer different research questions. The subdistribution outcome reflects the effect of treatment on both safety and survival, with no possibility to differentiate between the two, while the cause-specific analysis reflects the effect of treatment on safety only.

When very serious adverse events are considered and longer living comes at the price of unbearable safety events, the outcome of the subdistribution analysis could be used to compare the safety profiles of both medicinal products. However, in most cases, prolonged survival is still very much desirable despite the occurrence of minor or manageable adverse events. In this case, the effect of survival present in the subdistribution outcome does not allow to interpret the safety profile of the intervention.

4.3 Recommendations for clinical evaluations

As a general rule, we recommend, to first describe the competing risks as well as their expected impact on the analysis. When competing risks have been identified, competing risks analysis should be preferred to marginal analysis when the number of competing events in the study is at least equal to that of the event of interest (Berry, Ngo, Samelson, & Kiel, 2010), or when the absolute percentage of competing events is greater than 10% (Fine & Gray, 1999). When competing risks analysis is indicated, we recommend a cause-specific setting, together with a justification of the choice of the competing events considered. This recommendation is in line with the suggestions made by the Committee for Medicinal Products for Human Use of the European Medicines Agency (2020) in its Composite variable strategies.

The Cox proportional hazards model that is routinely presented in clinical study reports should remain the standard approach. The presentation of Kaplan-Meier estimates is also justified, although said to overestimate cumulative event probabilities (Berry et al., 2010; Satagopan et al., 2004; Schuster et al., 2020; Stegherr, Schmoor, et al., 2021). Kaplan-Meier in a cause-specific setting represents the absolute risk of having an event of interest, as if nothing else could happen before (Austin et al., 2016). In comparison, Aalen-Johansen estimates the fraction of patients who will experience an event of interest within the given time frame, given the presence of other precluding events. The cause-specific setting therefore allows many more subjects to experience the event of interest. This explains the observation, also made in our simulation, that Kaplan-Meier estimates are systematically larger than those derived from the Aalen-Johansen method. Although this effect should be known and understood, we do not agree with the terminology commonly used in the literature that Kaplan-Meier “overestimates” the incidence of events. This wording implies that one setting delivers correct estimates and the other not, while it is in fact a matter of context.

As an alternative estimator to Kaplan-Meier for the same function, the Nelson-Aalen estimator could be considered (Colosimo, Ferreira, Oliveira, & Sousa, 2002). Our simulation confirmed that it delivers the same information as the Kaplan-Meier estimator in comparative analysis, but its understanding is less straightforward. For this reason, the Nelson-Aalen estimator is less popular than the Kaplan-Meier estimator in time-to-event analysis since its first publication in the late 1950s (Kaplan & Meier, 1958). As clinical study reports are also meant to be reviewed by non-statisticians such as medical experts and epidemiologists within the frame of regulatory activities and clinical evaluations, the well-known and commonly presented Kaplan-Meier curve should be favored. One might argue that there is no harm in presenting both, but we do not recommend it as a standard approach. Clinical reports usually contain large amounts of analyses, and the non-essential presentation of the Nelson-Aalen estimator for each endpoint, might cause most readers to feel overwhelmed.

4.4 Limitations

In this study, we chose to keep the time to competing event constant in both study groups and across all simulated scenarios. It was therefore not possible to investigate further discrepancies between both settings on various times to occurrence of the competing events. It would be interesting to confirm that the conclusion of this work remains valid for a wide range of time to competing event. Also, the number of patients in both study groups was kept constant across the simulated scenarios. An interesting question to investigate would be how sample size influences the results. The size of the trial impacts the statistical significance that is the p-value and the breadth of the confidence interval of the estimates. Gaining deeper insights into the role of sample size is particularly interesting for the special case of rare disease and pediatric trials where only small numbers of eligible trial participants are available. Finally, event times were simulated with an exponential distribution. This simple, known, parametric distribution is widely used to simulate survival data to investigate the properties of the Cox Model (Bender, Augustin, & Blettner, 2005). It offers an easy control of the regression coefficients and has proportional hazards, which is advantageous for the implementation. However, it assumes that the baseline hazard function is constant, which is not always the case. An exponential distribution was deemed sufficient for this application, where the focus was to compare methodological approaches rather than to perform a realistic description of various survival time data. However, more complex statistical approaches have been described (Bender et al., 2005; Beyersmann, Allignol, & Schumacher, 2012; Beyersmann, Latouche, Buchholz, & Schumacher, 2009; Wan, 2017) and it would be interesting to investigate how the simulation framework influences the results.

When analyzing survival data in the presence of competing events, there is no absolute right or wrong when it comes to the choice between a cause-specific and a subdistribution setting. The decision rather depends on the research question at hand. We claim that the risk/benefit profile of a medication is better assessed in a cause-specific setting. The authorities in charge assess the effect of the intervention on the risk of experiencing adverse events. They need estimates of the instantaneous risk of adverse events while on treatment, as well as separate estimates of the effect of the intervention on the competing events. These requirements can be met in a cause-specific setting but not in a subdistribution setting where a single cumulative incidence function that includes all the risks in presence is estimated. The subdistribution setting may be relevant, however, if economic questions should be answered or when both events are similar in the clinical harm (e. g. Death and extremely serious adverse events that tremendously impact patients’ wellbeing and Quality of life). The Kaplan-Meier estimate of the survival function, or its complement, and the Cox proportional hazard model for comparative analysis should remain the standard approach in clinical study reports. In the presence of competing risks, they should be embedded in a cause-specific setting and the choice of the competing events in the analysis should be justified.

Ethics approval and consent to participate

Not applicable.

Consent for Publication

Not applicable.

Availability of data and materials

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Competing interests

All authors declare no competing interests.

Funding

Not applicable.

Authors’ contributions

AG, SB, and FL were involved in the conception of this analysis. AG, SB, FL, KB and RG discussed and agreed on the methods used for the analysis. AG conducted the literature search and computed the simulation study. AG, KB und RG analyzed the data, prepared tables and figures, and wrote the initial draft of the manuscript.

All authors revised the manuscript. All authors read and approved the final manuscript.

Acknowledgements

Not applicable.

Allignol, A., Beyersmann, J., & Schmoor, C. (2016). Statistical issues in the analysis of adverse events in time-to-event data. Pharmaceutical Statistics, 15(4), 297-305. doi:10.1002/pst.1739
Andersen, P. K., Geskus, R. B., de Witte, T., & Putter, H. (2012). Competing risks in epidemiology: possibilities and pitfalls. International Journal of Epidemiology, 41(3), 861-870. doi:10.1093/ije/dyr213
Austin, P. C., Allignol, A., & Fine, J. P. (2017). The number of primary events per variable affects estimation of the subdistribution hazard competing risks model. Journal of Clinical Epidemiology, 83, 75-84. doi:10.1016/j.jclinepi.2016.11.017
Austin, P. C., Lee, D. S., & Fine, J. P. (2016). Introduction to the Analysis of Survival Data in the Presence of Competing Risks. Circulation, 133(6), 601-609. doi:10.1161/circulationaha.115.017719
Bender, R., Augustin, T., & Blettner, M. (2005). Generating survival times to simulate Cox proportional hazards models. Statistics in Medicine, 24(11), 1713-1723. doi:10.1002/sim.2059
Berry, S. D., Ngo, L., Samelson, E. J., & Kiel, D. P. (2010). Competing risk of death: an important consideration in studies of older adults. Journal of the American Geriatrics Society, 58(4), 783-787. doi:10.1111/j.1532-5415.2010.02767.x
Beyersmann, J., Allignol, A., & Schumacher, M. (2012). Competing Risks and Multistate Models with R: Springer, New York, NY.
Beyersmann, J., Latouche, A., Buchholz, A., & Schumacher, M. (2009). Simulating competing risks data in survival analysis. Statistics in Medicine, 28(6), 956-971. doi:10.1002/sim.3516
Borgan, Ø. (1997). Three contributions to the Encyclopedia of Biostatistics: The Nelson-Aalen, Kaplan-Meier, and Aalen-Johansen.
Collet, D. (2015). Modelling Survival Data in Medical Research.
Colosimo, E., Ferreira, F., Oliveira, M., & Sousa, C. (2002). Empirical comparisons between Kaplan-Meier and Nelson-Aalen survival function estimators. Journal of Statistical Computation and Simulation - J STAT COMPUT SIM, 72, 299-308. doi:10.1080/00949650212847
Cox, D. R. (1972). Regression Models and Life-Tables. Journal of the Royal Statistical Society. Series B (Methodological), 34(2), 187-220.
Edwards, J. K., Hester, L. L., Gokhale, M., & Lesko, C. R. (2016). Methodologic Issues When Estimating Risks in Pharmacoepidemiology. Curr Epidemiol Rep, 3(4), 285-296. doi:10.1007/s40471-016-0089-1
European Medicines Agency (2020) ICH E9 (R1) addendum on estimands and sensitivity analysis in clinical trials to the guideline on statistical principles for clinical trials EMA/CHMP/ICH/436221/2017. https://www.ema.europa.eu/documents/scientific-guideline/ich-e9-r1-addendum-estimands-sensitivity-analysis-clinical-trials-guideline-statistical-principles_en.pdf
Fine, J. P., & Gray, R. J. (1999). A Proportional Hazards Model for the Subdistribution of a Competing Risk. Journal of the American Statistical Association, 94(446), 496-509. doi:10.2307/2670170
Geskus, R. B. (2016). Data Analysis with Competing Risk and Intermediate States: Chapman and Hall/CRC.
Gray, B. (2020). Subdistribution Analysis of Competing Risks. In. Gray, R. J. (1988). A Class of K-Sample Tests for Comparing the Cumulative Incidence of a Competing Risk. The Annals of Statistics, 16(3), 1141-1154.
Kaplan, E. L., & Meier, P. (1958). Nonparametric Estimation from Incomplete Observations. Journal of the American Statistical Association, 53, 457-481.
Klein, J. P., & Moeschberger, M. L. (2003). Survival Analysis - Techniques for Censored and Truncated Data (2 ed.): Springer, New York, NY.
Klein, J. P., van Houwelingen, H. C., Ibrahim, J. G., & Scheike, T. H. (2013). Handbook of Survival Analysis (1st ed.) (10.12.2013 ed.).
Koller, M. T., Raatz, H., Steyerberg, E. W., & Wolbers, M. (2012). Competing risks and the clinical community: irrelevance or ignorance? Statistics in Medicine, 31(11-12), 1089-1097. doi:10.1002/sim.4384
Lau, B., Cole, S. R., & Gange, S. J. (2009). Competing Risk Regression Models for Epidemiologic Data. American Journal of Epidemiology, 170(2), 244-256. doi:10.1093/aje/kwp107
Pepe, M. S., & Mori, M. (1993). Kaplan-Meier, marginal or conditional probability curves in summarizing competing risks failure time data? Statistics in Medicine, 12(8), 737-751. doi:10.1002/sim.4780120803
Putter, H., Fiocco, M., & Geskus, R. B. (2007). Tutorial in biostatistics: competing risks and multi-state models. Statistics in Medicine, 26(11), 2389-2430. doi:10.1002/sim.2712
R.CoreTeam. (2019). R: A Language and Environment for Statistical Computing. In. Vienna, Austria: R Foundation for Statistical Computing.
Rubin, D. B. (1987). Multiple Imputation for Nonresponse in Surveys.
Satagopan, J. M., Ben-Porat, L., Berwick, M., Robson, M., Kutler, D., & Auerbach, A. D. (2004). A note on competing risks in survival data analysis. British Journal of Cancer, 91(7), 1229-1235. doi:10.1038/sj.bjc.6602102
Schuster, N. A., Hoogendijk, E. O., Kok, A. A. L., Twisk, J. W. R., & Heymans, M. W. (2020). Ignoring competing events in the analysis of survival data may lead to biased results: a nonmathematical illustration of competing risk analysis. Journal of Clinical Epidemiology, 122, 42-48. doi:10.1016/j.jclinepi.2020.03.004
Stegherr, R., Beyersmann, J., Jehl, V., Rufibach, K., Leverkus, F., Schmoor, C., & Friede, T. (2021). Survival analysis for AdVerse events with VarYing follow-up times (SAVVY): Rationale and statistical concept of a meta-analytic study. Biometrical Journal, 63(3), 650-670. doi:10.1002/bimj.201900347
Stegherr, R., Schmoor, C., Beyersmann, J., Rufibach, K., Jehl, V., Brückner, A., . . . Friede, T. (2021). Survival analysis for AdVerse events with VarYing follow-up times (SAVVY)-estimation of adverse event risks. Trials, 22(1), 420. doi:10.1186/s13063-021-05354-x
Therneau, T. M., Crowson, C., & Atkinson, E. (2020). Multi-state models and competing risks. 1 - 29.
Therneau, T. M., & Grambsch, P. M. (2000). Modeling Survival Data: Extending the Cox Model (1 ed.).
Therneau, T. M., Lumley, T., Elizabeth, A., & Cynthia, C. (2021). Survival Analysis. In. van Buuren, S., & Groothuis-Oudshoorn, K. (2011). mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software, 45(3), 1 - 67. doi:10.18637/jss.v045.i03
Van Der Pas, S., Nelissen, R., & Fiocco, M. (2018). Different competing risks models for different questions may give similar results in arthroplasty registers in the presence of few events. Acta Orthopaedica, 89(2), 145-151. doi:10.1080/17453674.2018.1427314
van Walraven, C., & McAlister, F. A. (2016). Competing risk bias was common in Kaplan-Meier risk estimates published in prominent medical journals. Journal of Clinical Epidemiology, 69, 170-173.e178. doi:10.1016/j.jclinepi.2015.07.006
Wan, F. (2017). Simulating survival data with predefined censoring rates for proportional hazards models. Statistics in Medicine, 36(5), 838-854. doi:10.1002/sim.7178
Wickham, H. (2016). ggplot2 Elegant Graphics for Data Analysis (2 ed.): Springer, Cham.
Zwiener, I., Blettner, M., & Hommel, G. (2011). Survival Analysis Part 15 of a Series on Evaluation of Scientific Publications. Dtsch Arztebl Int. doi:10.3238/arztebl.2011.0163

No competing interests reported.

Additionalfile1.pdf

Download PDF

Journal Publication

published 12 Jul, 2023

Read the published version in BMC Medical Research Methodology →

Editorial decision: Major revision
04 May, 2023
Reviews received at journal
28 Apr, 2023
Reviewers agreed at journal
19 Apr, 2023
Reviewers invited by journal
15 Apr, 2023
Editor assigned by journal
29 Mar, 2023
Editor invited by journal
25 Jan, 2023
Submission checks completed at journal
25 Jan, 2023
First submitted to journal
13 Jan, 2023

You are reading this latest preprint version

Safety analysis of new medications in clinical trials: A simulation study to assess the differences between cause-specific and subdistribution frameworks in the presence of competing events

Status:

Journal Publication

Version 1

Abstract

Figures

1. Background

2. Methods

2.1 Cause-specific hazard regression

2.2 Subdistribution hazard regression

2.3 Simulation study

2.4 Presentation of results

2.5 Software

3. Results

4. Discussion

4.1 Understanding the etiology of risks for clinical evaluation

4.2 Cause-specific and subdistribution framework when survival competes safety

4.3 Recommendations for clinical evaluations

4.4 Limitations

5. Conclusions

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1