Overall, 94 patient records were collected. All patients had been treated with a curative intent. Patient and tumor characteristics are listed in Table 1. Mean age was 67.4 years and 47 patients (50.0%) were female. There were no patients with coexisting serious medical conditions.
Table 1
Study population characteristics
Characteristic | Total (N = 94) |
Gender | |
Female | 47 (50.0%) |
Male | 47 (50.0%) |
Age | |
Mean (SD) | 67.4 (11.5) |
Range | 22.0–87.0 |
Smoker | |
No | 51 (54.3%) |
Yes | 43 (45.7%) |
Performance status | |
0 | 87 (92.6%) |
1 | 7 (7.4%) |
Body mass index | |
≤ 35 kg/mq | 89 (94.7%) |
> 35 kg/mq | 5 (5.3%) |
Co-morbidity | |
None | 21 (22.3%) |
Cardiovascular | 61 (64.9%) |
Other | 12 (12.8%) |
Distance from anal verge (cm) | |
> 8 | 32 (34.0%) |
6–8 | 26 (27.7%) |
< 6 | 36 (38.3%) |
Clinical tumor stage (cT) | |
2 | 13 (13.8%) |
3 | 56 (59.6%) |
4 | 25 (26.6%) |
Clinical nodal stage (cN) | |
0 | 9 (9.6%) |
1 | 25 (26.6%) |
2 | 60 (63.8%) |
cT dimension (cm) | |
> 5 | 33 (35.1%) |
≤ 5 | 61 (64.9%) |
SD: standard deviation; kg/mq: kilograms per square meter; cm: centimeters |
The vast majority of patients (n = 85, 90.4%) had regional lymph node involvement at diagnosis.
Most tumors were located in the low rectum (n = 60, 63.8%).
In the sample, the minimum and the maximum observed survival time was t = 2 months and t = 82 months, respectively. In the entire cohort, 17 patients (18.1%) died and the last death event occurred 53 months after diagnosis. Of these 17 death events, one case (5.9%) was attributed to Covid-19 and was recorded after 52 months. Therefore, on the entire sample of 94 patients, there were 16 patients dead of disease (DoD), 1 patient dead of Covid-19 (DoC) and 77 cases of censoring (Cen). This is summarized in Table 2. The basic problem in this kind of samples is how to treat the DoC cases in order to obtain a usual sample composed of only DoD and Cen cases and to which the classical Kaplan-Meier estimator, as well as any statistical tool for survival analysis, can be applied.
Table 2
Details of exit causes in the entire sample
exit cause | number | min t (months) | max t (months) |
DoD | 16 | 2 | 53 |
Cen | 77 | 11 | 82 |
DoC | 1 | 52 | 52 |
Total | 94 | | |
DoD: dead of disease; Cen: censored; DoC: dead of Covid-19; min: minimum; |
t: survival time; max: maximum |
Let us consider the Kaplan-Meier estimator. Using the standard model, it is possible to estimate the survival probabilities in our sample according to three different options: i) excluding the DoC event from the data analysis (accounting for a total of 93 observations); ii) considering the DoC event as a Cen event at the same time point; iii) considering the DoC event as a DoD event at the same time point. The corresponding Kaplan-Meier survival plots are provided in Fig. 1a. In the figure, the black curve (without DoC) corresponds to option (i), the green curve (DoC as Cen) refers to option (ii) and the red curve (DoC as DoD) corresponds to option (iii). Using CoDMI algorithm an additional option is available. This essentially consists in considering the DoC event as an “incomplete data” which is adjusted by mean imputation, that is the observed lifetime of the DoC patient is extended by an additional expected lifetime. The crucial point is that this lifetime extension is computed by the algorithm using the Kaplan-Meier model itself and therefore does not introduce inconsistencies in the survival estimates.
In Fig. 1a a blue curve (DoC Imputed) is also provided corresponding to option (iv.a), where the DoC event is considered as an “incomplete data” which is adjusted by mean imputation using CoDMI algorithm in its standard form, that is assuming that it would have been a virtual DoD event at a later time point. In this case, CoDMI algorithm estimates that the DoC patient that died due to Covid-19 at time t = 52 months, without Covid-19 infection would have survived for additional 27.5 months and would have died of tumor at time t = 79.5 months (52 + 27.5 months). Therefore, the observed DoC event at time 52, indicated by a red triangle on the blue line in the figure, is changed as a virtual DoD event at time 79.5, which is indicated by a circle.
If the virtual DoC event is considered unlikely, it is possible to modify the standard-form result by applying CoDMI algorithm with an additional option, called adjustment for censoring, where the survival estimate is obtained counting the DoC event as a virtual Cen rather than a virtual DoD. If this option is used, a reverse Kaplan-Meier estimate is computed by CoDMI, providing an expected survival of 13.98 months beyond the observed DoC event. In this case, the DoC event at time 52 is then changed as a virtual Cen event at time t = 65.98 months (52 + 13.98 months). This is illustrated by the blue line in Fig. 1b, where the circle on the blue curve (DoC Imputed) corresponds now to the time point of the virtual Cen event.
The choice between the standard and the no-standard mode of CoDMI application can be a matter of the clinician’s discretion, but it can also be made directly by the algorithm, which computes the probability of a virtual DoC event vs that of a virtual Cen event.
The survival rates illustrated in Fig. 1 and Fig. 2 are reported in Table 3 for two selected time points, 2 and 5 years, together with the corresponding 95% confidence intervals (CI). For options (i), (ii) and (iii) the confidence intervals are computed using the classical Greenwood’s formula. For options (iv.a) and (iv.b), where the observed DoC time point is replaced by the expected DoD and Cen time point, respectively, the confidence intervals must be computed taking into account that changing an observed value with an expected value (i.e. applying the mean imputation) provides an increase of the estimation uncertainty. This correction is provided by CoDMI algorithm, which includes a built-in extension of Greenwood’s formula. The table shows that the 2-year OS rate was 89.8% (CI 83.7–96.4) and the 5-year OS rate was 74.8% (CI 63.7–0.87.9), when the DoC event was not included in survival analysis (option i). Considering that Covid-19 patient died after 52 months of follow-up, the 2-year OS rate was equal (81.1%, CI 72.3–91.0) in all other cases (option ii, iii, iv.a and iv.b). The 5-year OS where different in case of DoC as Cen, DoC as DoD, DoC as virtual DoD and DoC as virtual Cen, and survival rates were 75.1%, 71.9%, 75.3% and 75.3%, respectively. As concerning the confidence intervals, the differences between those provided by the classical Greenwood’s formula and those computed with the extended formula, result to be immaterial in this case with a single DoC event.
Table 3
Survival rate estimates for different treatment of Covid-19 death event
Option | 2y-OS | 5y-OS |
i) Without DoC | 89.8% (CI 83.7–96.4%) | 74.8% CI 63.7–87.9%) |
ii) DoC as Cen | 89.9% (CI 83.9–96.4%) | 75.1% (CI 64.1–88.1%) |
iii) DoC as DoD | 89.9% (CI 83.9–96.4%) | 71.9% (CI 59.9–86.2%) |
iv.a) DoC as virtual DoD | 89.9% (CI 83.9–96.4%) | 75.3% (CI 64.2–88.3%) |
iv.b) DoC as virtual Cen | 89.9% (CI 83.9–96.4%) | 75.3% (CI 64.2–88.3%) |
DoC: dead of Covid-19; Cen: censored; DoD: dead of disease; |
2y: 2 years; 5y: 5 years; OS: overall survival; CI: 95% confidence intervals |
It should be noted that, independently on these survival rate estimates, the DoC imputations provided by CoDMI algorithm under option (iv.a) or (iv.b) provide, correspondingly, an adjusted sample including a total of 94 observations (the total number of original observations), where, however, only DoD and Cen events are now present. This is summarized in Table 4 for the two options. As shown in [5], the mean imputations provided by CoDMI are roughly unbiased. Therefore, all the usual statistical tools can be applied to these “standardized” samples and the information conveyed by DoC events is then consistently used. As an example, a standard proportional hazard model was applied to our data. Table 5 summarizes results from the Cox regression obtained according to the five possible DoC options. Several variables deemed to be relevant to overall survival (including clinical T stage, clinical N stage, tumor diameter and age) were included in the multivariate analysis. As expected, given the presence of only one DoC case in the sample, even in this application the differences between the estimates with the different options are almost immaterial. Obviously, in the estimation of the regression coefficients, options (ii) and (iv.b) provide exactly the same results.
Table 4
Details of exit causes after CoDMI imputation in standard form (option iv.a) and with adjustment for censoring (option iv.b)
CoDMI option | exit cause | number | min t (months) | max t (months) |
option iv.a: DoC imputed as DoD | DoD | 17 | 2 | 79.5 |
Cen | 77 | 11 | 82.0 |
total | 94 | | |
option iv.b: DoC imputed as Cen | DoD | 16 | 2 | 53.0 |
Cen | 78 | 11 | 82.0* |
total | 94 | | |
(*) Including 1 event at t = 65.98 |
Table 5
Multivariate analysis of prognostic factors for overall survival according to DoC options
| Without DoC (option i) | DoC as Cen (option ii) | Doc as DoD (option iii) | DoC as virtual DoD (option iv.a) | DoC as virtual Cen (option iv.b) |
Prognostic factor | HR (95%CI) | p value | HR (95%CI) | p value | HR (95%CI) | p value | HR (95%CI) | p value | HR (95%CI) | p value |
cT4 (no versus yes) | 0.53 (0.16–1.77) | 0.303 | 0.56 (0.17–1.84) | 0.336 | 0.46 (0.14–1.51) | 0.200 | 0.46 (0.14–1.50) | 0.198 | 0.56 (0.17–1.84) | 0.336 |
cN2 (no versus yes) | 1.83 (0.55–6.02) | 0.323 | 1.80 (0.54–5.95) | 0.337 | 2.00 (0.61–6.59) | 0.256 | 1.90 (0.58–6.28) | 0.292 | 1.80 (0.54–5.95) | 0.337 |
Lesion diameter > 5 cm (no versus yes) | 2.90 (0.96–8.80) | 0.060 | 2.80 (0.92–8.55) | 0.069 | 2.98 (1.01–8.82) | 0.048 | 3.00 (1.00-8.98) | 0.049 | 2.80 (0.92–8.55) | 0.069 |
Age > 70 (no versus yes) | 3.58 (1.14–11.21) | 0.028 | 3.76 (1.19–11.87) | 0.024 | 3.42 (1.14–10.27) | 0.028 | 3.67 (1.18–11.41) | 0.025 | 3.76 (1.19–11.87) | 0.024 |
DoC: dead of Covid-19; Cen: censored; DoD: dead of disease; HR: hazard ratio; CI: confidence interval |