Uncovering robust radiomics features is an important task for building robust models for identifying responders and non-responders and prediction of cancer progression. Thus, radiomics features extracted from repeated PET scans, different number of OS-EM PET iterations, with/without Gaussian post-filtering were evaluated for robustness using CCC. Despite the higher noise associated with 90Y PET compared with FDG PET, 15 radiomics features were identified as robust with CCC > 0.85. In general, the robust features for different scans (repeatability), OS-EM iterations 1/2 and with/without filtering largely overlap, which indicates that robust features tend to be consistent for different imaging settings. The results also showed that more features are robust to different iteration setting and less features are robust to application of Gaussian filtering. In a study of intratumor FDG PET uptake heterogeneity quantification by Hatt et al., zone percentage (ZP) was found to be robust with respect to the delineation method used and the partial volume effects. This feature also demonstrated high differentiation power for prediction of response in esophageal carcinoma [35]. In a study by Doumou et al., ZP presented substantial agreement across different segmentation and different levels of smoothing [36]. A study by Ashrafinia et al. showed that ZSN extracted from ⁹⁹ᵐTc-Sestamibi Myocardial-Perfusion SPECT (MPS) images showed high reproducibility [37]. Another recent study by Li et al. on FDG PET radiomics analysis, showed that ZSN is a stable feature [38].
The aim of this work is to find radiomics signature that can facilitate dose metrics in the prediction of tumor response. The final model order is small being 2 (dose + ZP and dose + ZSN), which is reasonable considering the high correlation between most radiomics features (Fig. 2). The correlation between ZP and absorbed dose is 0.483 (p-value = 1.828e-7) and ZSN and absorbed dose is -0.057 (p-value = 0.565) (Table 1), which indicates that ZSN could provide more complementary information to the combined model than ZP. This is consistent with the substantial higher c-index for the combined absorbed dose and ZSN model (0.803) compared with ZSN only (0.694) and absorbed dose only (0.754) models for progression, but only slightly higher AUC for the combined absorbed dose and ZP model (0.729) compared with the ZP only (0.713) and absorbed dose only (0.713) models for OR (Table 3). Further studies, such as obtaining radiomics features from FDG-PET, CT, or MRI, could potentially add more complementary information and further improve the performance.
ZP is a feature from GLSZM matrix, quantifying the coarseness of the texture by the ratio of number of zones and number of voxels. The higher the value is, the finer the texture is, and according to our results the higher the probability the tumor will respond. Figure 6 (a), (b) show example lesions with large/small ZP, that were classified as responder/non-responder; (c), (d) show lesions with large/small ZSN, that did not progress for a long follow-up time (1174 days) and progressed in a short time (44 days). Smaller ZP values correspond to coarser appearance and worse response. In another study by Ha et al, ZP was one of the features used to characterize locally advanced breast cancer [39]. The trend is consistent with what we found in our study, that larger ZP is associated with better response. ZSN measures the variability of size zone volumes in the ROIs, higher the value, larger the variance of the size zone volumes. The hazard ratio for ZSN is smaller than 1, which means the higher the ZSN, the better the lesion prognosis.
The modified LASSO method we developed was inspired by R. Bach’s work on Bolasso, which showed that the Lasso selects all the variables that should enter the model with probability tending to one exponentially fast [40]. So, if we run the Lasso for multiple bootstrapped replications of a given sample, then intersecting the supports of the Lasso (i.e., non-zero coefficients) leads to consistent model selection. However, the direct application failed since the intersection of the supports lead to null for some datasets. Bunea et al. came up with similar variants of bootstrap enhanced LASSO (BE-LASSO) [41]. The percentage of times each predictor was selected (variable inclusion probability) was recorded and user-defined threshold (50%) was used to determine the variables. V. Abram et al. built upon Bunea’s method of Be-LASSO [42]. Instead of user-defined probability for feature selection, they used the quantiles of the bootstrap distribution of the coefficients of variables to determine the significance of that variable. In our study, we developed a new way to select features, still based on the bootstrap LASSO. Instead of using predefined probability or the distribution quantile, we obtained a ranking of the features based on the frequency of being selected in the bootstrap, then, we performed cross validation to calculate the AUC/c-index vs. number of top features included in the model. In this way, we obtained the most parsimonious model, which is desired when small sample size is unavoidable.
In summary, absorbed dose is a strong predictor for tumor control, both in terms of OR at first follow-up and time to progression, which is consistent with recent reports [14–16]. The radiomics feature signals the complimentary value of texture to improve the absorbed dose only model prediction. It is interesting to explore the underlying biological mechanism of the reason for higher ZP and ZSN leading to better prognosis, which should be investigated on larger dataset in the future. The two features model can be interpreted as: given the dose being fixed, the change in ZP/ZSN will help to predict tumor control (OR/progression). Using this information, additional attentions would be given to the lesions that possess lower ZP/ZSN value, which have a higher risk of failure (in terms of OR/progression), which is potentially informative for clinical decisions. Immediate prediction of response, based on radiomics features and dose metrics both of which can be derived from 90Y PET/CT performed immediately after RE, has clinical utility. Instead of waiting for the first follow up morphologic imaging that typically occurs at > 2 months, the potential to predict non-responding lesions immediately after therapy would facilitate adaptive therapy to selected lesions where 90Y RE is followed by further treatment such as stereotactic body radiation therapy or microwave ablation. Limitation of our study include the heterogeneous patient cohort and the small sample size. Patient 90Y imaging data is scarce because post-therapy imaging is not routinely performed after RE, but studies reporting 90Y SPECT/CT and PET/CT imaging is rising and is expected to become more readily available, enabling studies with larger cohorts.