In this multicenter retrospective study, we constructed and validated MRI-based radiomics signatures for the prediction of Ki-67 expression status and GS in PCa. To Remove the unbalance of the training data set, we used up-sampling by repeating random cases to make positive/negative samples balance. We applied the normalization on the feature matrix. Without data balancing, the predictive performance of radiomics signatures was inadequate, with obviously low specificity. After data balancing by the up sampling, the synthesized performance of radiomics signatures was further improved, indicating that data balancing contributes to construct more powerful prediction models. For the Ki-67 classifier, we settled on a set of 15 features, while for the GS classifier, we narrowed it down to a set of 9. Prediction of Ki-67 expression status and the GS was best for the LASSO and SVM in this investigation, and DCA showed that this model produced good clinical net benefit. Therefore, radiomics based on MRI might aid in predicting the Ki-67 expression and the GS in PCa.
Our results have significant implications for the practical use of MRI-based radiomics to predict the prognosis of prostate patients. MRI offers significant benefits over CT and ultrasound in identifying PCa grade and directing the urologist's choice of surgical technique[17]. The prediction of PCa features (GS, PIRADS v2, risk class) and automatic prostate segmentation in a fully automated quality control system make MRI-based radiomics a promising diagnostic tool for PCa[17]. Numerous aspects of PCa, including cancer diagnosis, early biochemical recurrence, and treatment response have been predicted using radiomics[28]. NCCN recommendations elucidated that MRI aided in the staging and risk stratification of prostate cancer, and that its combination with many biomarkers might minimize needless biopsies[29].Prostate cancer's proliferative and aggressive heterogeneity can be best gauged by measuring Ki-67 and GS, although this information is presently only accessible through invasive procedures. The developing field of radiomics uses medical pictures to predict the biological activity of distinct malignancies by extracting high-throughput imaging data[23, 30, 31]. Radiomics is also being investigated more often as a prognostic marker in other solid tumor; specifically, Ki-67 has shown clinically significant outcomes in the diagnosis of bladder cancer[23], lung cancer[32], and breast cancer[33]. Non-invasive and capable of extracting more dimensional and abstract features, radiomics allows for the extraction of more concealed information in images than conventional radiologist observations, which can be used to construct predictive models analyzing the prognostic heterogeneity of PCa patients[34, 35]. As a result, the findings of our study have significant clinical implications, and the AUC of the prediction model constructed using Lasso and SVM is between 0.793 and 0.813 on the validation set. In the work by Fan X et al., samples of 252 cases from a single-center were included, and the random forest (RF) classifier created using conventional machine learning techniques predicted a Ki-67 in PCa AUC of 0.87, which is comparable to our findings[36].In addition, OSMAN et al. determined that imaging histological features of conventional CT scans could be observed non-invasively for PCa, and that the classifier of the evaluation model based on the training data set was more accurate in distinguishing high-risk patients from low-risk patients and in classifying GS = 7 versus GS > 7 and GS = 7(3 + 4) versus GS = 7(4 + 3) with high accuracy[37]. Obviously, the use of imaging histological characteristics to predict PCa pathology and scoring studies are not a replacement for conventional diagnostic methods, and additional prospective multicenter investigations are still required. Hence, our research has strengthened the evidence in this field which is recruited PCa patients from different institution.
Due to the rising uses of radiomics in oncology and its efficacy in differentiating malignancies, which is difficult with conventional radiologic interpretations, the use of radiomics is interesting[38–40]. Radiomics illuminates the invisible link between diseased outcomes and extracted variables, hence facilitating clinical decision making[19]. Radiomics postulates that the most important information is included in the spatial distribution of voxel intensities[26]. Indeed, our preliminary results demonstrated that four dominant radiomics categories from T2-WI and DWI of the selected texture 318 features could quantify intratumor heterogeneity, and that these categories were consistent with those from prior studies that reported the relevant features that are predictors for staging PCa aggressiveness[27, 41]. With the use of more potent algorithmic models, the top radiomic characteristics presented in this research may aid in giving more accurate predictions. Moreover, by exploring similarities or differences in tumor heterogeneity genes at the microscopic level, several key factors are provided for the development of improved radiomics models prediction for hall marks of PCa, proliferation and aggressiveness.
Computer-aided diagnostics (CAD) is a system that employs ML algorithms to analyze imaging and/or non-imaging medical data and assess the patient’s condition, which can be used to assist physicians in decision making[42]. RF, SVM, and LASSO algorithms are the CAD tools commonly used to classify resampled instances to analyze medical images from pictures databases[43]. In this work, we tested SVM, LDA, RF, and LASSO to find the best model, and finally selected SVM and LASSO for predicting Ki-67 expression and GS in the data based on their higher predictive accuracy. This was also an exploration of the application of CAD. As clinical urologists, we are more interested in the algorithmic maturity of ML or DL, as well as the scope and applicability of clinical applications. The SVM and LASSO models that we have chosen from the results of the existing studies have different application characteristics in the CAD of PCa. SVM has the best accuracy in distinguishing GS in image augmentation[44], the ability to handle high-dimensional data, and good generalization ability[45, 46]. However, it also has drawbacks such as high training and testing costs, sensitivity to parameters, and difficulty in interpretation[47]. On the other hand, the advantage of LASSO is that it can extract features and regularize radiomics data simultaneously to prevent overfitting, and it is suitable for correlated high-dimensional data[48, 49]. However, it may lose some important features and be sensitive to outliers. In contrast, direct comparisons between different ML or DL algorithms are scarce in terms of available research. Most of the above-mentioned studies did not compare their classifiers, so a conclusion about the best algorithm classifier to be used for the study of radiomics in PCa cannot be drawn at this time.
Our results corresponded with the prognosis of PCa patients has been the final argument. In previous retrospective clinical research, Ki-67 and GS shown a strong association with the prognosis of patients with PCa[10, 50]. Our findings are identical to those of previous research, but we emphasize the non-invasive, repeatable, and user-friendly character of radiomics. In this study, we aimed to investigate the association among PCa patient Ki-67 expression status, the GS and survival outcomes (5-year OS) predicted by radiomics using the LASSO and SVM. Our model predicted high Ki-67 expression, high GS in PCa patients with considerably reduced Ki-67 expression (Fig. 9c, d), and low GS in patients with OS (Fig. 9g, h), as shown by the data. The foregoing findings may imply that the model we built may directly predict the prognosis of patients using MRI data, therefore providing urologists with more important information for clinical decision-making.
The present study has some limitations, partly due to the exploratory nature of assessing the feasibility of the new application domain. (a) First, this is a retrospective multicenter analysis with a relatively small sample size. A prospective multicenter study with a larger sample size is needed to validate our results for future clinical applications. (b) The complementary role of radiomics signatures in the established predictive model is worth exploring, which will be our future direction to provide a more robust model. (c) Although the findings are promising and the models performed well, the inherent uncertainty should be considered before applying this information in clinical settings.