Phantom based reproducibility and robustness of radiomics features
Supplemental Table 3 shows the mean CCC values from the liver phantom radiomics studies, assessed over the 5 repeat scans, OS-EM iterations 1/2, with/without Gaussian filtering and across all conditions (scans and parameters). CCC for sphericity is always 1 because the shape feature does not depend on the PET scan. There are in total 15 features that have mean CCC > 0.85: 1 global feature sphericity, 1 GLCM feature correlation, 2 GLRLM features grey level nonuniformity (GLN), and run length nonuniformity (RLN), 7 GLSZM features large zone emphasis (LZE), grey level nonuniformity (GLN), zone size nonuniformity (ZSN), zone percentage (ZP), large zone low grey level emphasis (LZLGE), large zone high grey level emphasis (LZHGE), grey level variance (GLV), 4 NGTDM features coarseness, busyness, complexity and strength. The average CCCs for repeatability (same conditions, different scans) and reproducibility (different iterations and filtering) have similar results as shown in Supplemental Table 3. Comparing with the mean CCC for both repeatability and reproducibility, there is 1 more robust feature for repeatability (dissimilarity), 6 more robust features (variance, contrast, dissimilarity, LGRE, SRLGE, GLV_GLRLM) and 2 less robust features (LZHGE, GLV_GLSZM) for different iterations, 2 less robust features (ZSN, LZHGE) for with/without filtering.
Lesion dosimetry and outcome data
A total of 105 lesions > 2 mL were segmented. The average lesion volume was 45 mL (median:10 ml, range:2 - 833). The average lesion absorbed dose was 336 Gy (median: 265, range:1-1271). The response rate according to RECIST applied at the lesion level was 31% (32/105). The number of metastasis and primary HCC lesions are 70 and 35, respectively, with lesion specific response rate being 26% (9/35) and 33% (23/70) for the 2 groups. There are 103 lesions that have progression data, two metastatic lesions were excluded due to lack of follow-up. The number of progression events for all the lesions was 14 (4 HCC, 10 metastatic). The mean time-to-event are 322 days (median: 229 days, range: 44-1174 days). The mean time-to-event was 342 days (median: 309, range: 50-1174) for metastatic lesions and 284 days (median: 199 days, range: 44-860 days) for HCC. Kaplan Meier analysis showed that the time to progression for HCC and metastasis was not statistically significantly different (P=0.49)
Outcome models: Radiomics, absorbed dose, and combined models
Univariate analysis
The univariate results for volume, radiomics features and absorbed dose are shown in Supplemental Table 2 and Table 1 (with Supplemental Table 2 showing all the features and Table 1 showing only the 15 robust radiomics features). These are the Spearman correlation between specific features (or absorbed dose) and OR, and the univariate Cox regression results for progression. Volume has been shown to correlate with patient prognosis for different cancer types [41]. In our study, the Spearman coefficients of volume in terms of OR is -0.215 (p-value = 0.028). Among the 46 radiomics features (including volume), 10 features are significant (p-value < 0.001) for OR: 2/9 GLCM features, 3/13 GLRLM features, 4/13 GLSZM features and 1/5 NGTDM features. Among the 15 robust radiomics features, 8 features are significant for OR: LZE (p-value= 0.0005), ZP (p-value= 0.0004), LZLGE (p-value= 0.001), LZHGE (p-value= 0.002), GLV (p-value= 0.0009), Coarseness (p-value= 0.003), Busyness (p-value= 0.001), and Strength (p-value= 0.003). Absorbed dose is a significant predictor of the OR (p-value= 0.0003). In comparison, among the 46 radiomics features (including volume), no features are significant for progression. ZSN, a robust feature, is the most significant one (p-value= 0.063) for progression. Absorbed dose is a marginally significant predictor for progression (p-value= 0.005).
Inter-feature correlation is shown in the correlation heat map of Fig. 2. GLN, RLN, LZE, LZHGE are highly correlated with volume (Spearman coefficients > 0.85). In general, the radiomics features are highly correlated with each other (except sphericity). Though most of the radiomics features are still significantly correlated with dose (except sphericity, GLN, and ZSN), the correlation of radiomics features with dose is generally lower than radiomics features amongst them, as shown in Table 1.
Table 1 Summary of statistical analysis for volume, the 15 robust radiomics features and absorbed dose with Bonferroni correction.
|
Features
|
Spearman correlation with absorbed dose
|
P value for dose correlation
|
Spearman correlation with OR
|
P value for OR
|
C-index for progression
|
Hazard Ratio for progression
|
P value for progression
|
|
Volume
|
-0.262
|
0.007
|
-0.215
|
0.028
|
0.565
|
0.282
|
0.417
|
Global
|
Sphericity
|
0.061
|
0.539
|
0.142
|
0.148
|
0.590
|
0.728
|
0.313
|
GLCM
|
Correlation
|
-0.340
|
3.882e-4
|
-0.216
|
0.027
|
0.438
|
1.019
|
0.950
|
GLRLM
|
GLN
|
-0.362
|
1.45e-4
|
-0.269
|
0.006
|
0.600
|
0.297
|
0.323
|
RLN
|
-0.252
|
0.010
|
-0.236
|
0.015
|
0.639
|
0.213
|
0.201
|
GLSZM
|
LZE
|
-0.482
|
1.989e-7
|
-0.333
|
0.0005
|
0.562
|
0.415
|
0.629
|
GLN
|
-0.078
|
0.427
|
-0.121
|
0.218
|
0.734
|
0.326
|
0.088
|
ZSN
|
-0.057
|
0.565
|
-0.081
|
0.412
|
0.752
|
0.358
|
0.063
|
ZP
|
0.483
|
1.828e-7
|
0.341
|
0.0004
|
0.491
|
0.804
|
0.502
|
LZLGE
|
-0.548
|
1.485e-9
|
-0.317
|
0.001
|
0.460
|
0.872
|
0.760
|
LZHGE
|
-0.293
|
0.002
|
-0.300
|
0.002
|
0.676
|
0.006
|
0.348
|
GLV
|
0.467
|
5.104e-7
|
0.320
|
0.0009
|
0.549
|
0.491
|
0.136
|
NGTDM
|
Coarseness
|
0.379
|
6.789e-5
|
0.285
|
0.003
|
0.601
|
1.027
|
0.930
|
Busyness
|
-0.509
|
2.862e-8
|
-0.307
|
0.001
|
0.482
|
0.522
|
0.585
|
Complexity
|
0.324
|
7.596e-4
|
0.244
|
0.012
|
0.609
|
1.124
|
0.657
|
Strength
|
0.245
|
0.012
|
0.284
|
0.003
|
0.669
|
1.110
|
0.321
|
DOSE
|
Mean absorbed dose
|
NA
|
NA
|
0.345
|
0.0003
|
0.819
|
0.121
|
0.005
|
Multivariate analysis
Given the limited sample size, we included both primary and metastasis cases in the modeling. For the subset of robust features, the model order is 2 for both OR and progression endpoints, with top 2 features for OR being absorbed dose and zone percentage (ZP), and for progression being absorbed dose and ZSN. Fig. 3 shows the model order determination for the robust features and absorbed dose. The top 5 features are shown in Table 2 for OR and progression models. (Model order determination and the top 5 features using all the radiomics features and absorbed dose are presented in the supplemental materials Fig. 1 and table 4).
After the model order and top features were decided, nested cross-validation was applied to estimate the performance of the final model. The results for models with ZP only, ZSN only, absorbed dose only and the combined models (radiomics robust + dose) are listed in Table 3. When considering the entire cohort, for the combined models the average AUCs for OR (0.729 (95% CI: 0.702-0.758)), and the average c-indexes for progression (0.803 (95% CI: 0.790-0.815) are superior to the corresponding values for the absorbed dose only and ZP/ZSN only models. The results for the subgroups of primary and metastasis cases are shown in Table 3 as well. For the OR model in the subgroup of HCC, the radiomics only model shows the best performance with average AUC of 0.762 (95% CI: 0.680-0.834), and in the subgroup of metastasis, the absorbed dose only model shows the best performance with average AUC of 0.696 (95% CI: 0.654-0.737). For the progression analysis, in both subgroups the combined model outperforms the individual models although the difference was not statistically significant. The ROC curve for OR using radiomics alone, dose alone and combined models is shown in Fig. 4, and the Kaplan-Meier plot for progression for the combined models is shown in Fig. 5, respectively. Log-rank test was used for the comparison of high and low risk groups for progression. The cutoff was median value of the predicted Cox survival probability. The weights of OR model and progression models are shown below.
OR model (Generalized linear regression model):
logit(y) ~ -0.892 + 0.520 ZP + 0.488 Dose
Distribution = Binomial
Progression Cox model:
h(t) ~ h0(t) * exp(-0.530 ZSN + -1.707 Dose)
Artificially increasing the number of cases using ADASYN was evaluated for progression endpoint as well, but found no substantial difference. Last part of table 3 shows the results for ADASYN method. Fig. 6 shows the calibration curves for OR models, with the calibration curves’ slope and intercept values available.
Table 2. Top 5 features for the combined models with robust radiomics features, volume and absorbed dose
OR
|
Progression
|
Mean absorbed dose
|
Mean absorbed dose
|
ZP
|
ZSN
|
Sphericity
|
Strength
|
GLV
|
Complexity
|
Coarseness
|
Sphericity
|
Table 3. Average AUC/c-index for individual and combined models with all the lesions, HCC lesions and metastasis lesions
OR Model
|
Average AUC (95 % confidence intervals)
|
|
All (105)
|
Primary HCC (35)
|
Metastasis (70)
|
Radiomics (ZP)
|
0.713 (0.685-0.741)
|
0.762 (0.680-0.834)
|
0.658 (0.623-0.693)
|
Absorbed Dose
|
0.713(0.678-0.746)
|
0.717 (0.642-0.786)
|
0.696 (0.654-0.737)
|
Combined (Dose + ZP)
|
0.729 (0.702-0.758)
|
0.734 (0.660-0.802)
|
0.692 (0.653-0.723)
|
Progression Model
|
Average c-index (95 % confidence intervals)
|
All (103)
|
Primary HCC (35)
|
Metastasis (68)
|
Radiomics (ZSN)
|
0.694 (0.676-0.710)
|
0.565 (0.528-0.598)
|
0.656 (0.629-0.680)
|
Absorbed Dose
|
0.754 (0.742-0.766)
|
0.613 (0.585-0.635)
|
0.719 (0.700-0.737)
|
Combined (Dose+ZSN)
|
0.803 (0.790-0.815)
|
0.638 (0.610-0.661)
|
0.762 (0.740-0.780)
|
ADASYN
Progression Model
|
Average c-index (95 % confidence intervals)
|
All (103)
|
Primary HCC (35)
|
Metastasis (68)
|
Radiomics (ZSN)
|
0.712 (0.698-0.726)
|
NA
|
0.595 (0.575-0.615)
|
Absorbed Dose
|
0.771 (0.762-0.781)
|
NA
|
0.726 (0.713-0.739)
|
Combined (Dose+ZSN)
|
0.794 (0.785-0.803)
|
NA
|
0.728 (0.716-0.740)
|