Our research indicates potential clinical application of DL model to predict expression of Ki-67 using breast cancer imaging before surgery. As Ki-67 expression is an important indicator, which also influences breast cancer subtype classification, early prediction of the Ki-67 expression through clinical images before obtaining biopsy results may have potential benefits by enabling early decision-making to determine initial treatment strategy.
Previous research reported the usefulness of radiomics analysis to predict Ki67 values of breast cancer [12, 17]. In spite of the different setting of the cut-off values for positive Ki-67 status, our DL model showed higher AUC values than previous radiomics analyses [12, 17]. Previous radiomics studies suggested that the results of radiomics analysis may vary significantly depending on the setting of the ROI due to tumor heterogeneity of breast cancer, such as the periphery or inside of the tumor [12, 17] and the robustness of radiomics analysis is highly affected by the ROI setting [17]. Whereas, DL model using CNN, as a process to convolute morphological information of image, allows to capture comprehensive information throughout the entire tumor. Therefore, our DL model inherently offers potentially more consistent and reliable insights.
Although image-based prediction of Ki-67 expression is also investigated by various imaging modalities such as MRI [10], our DL model used DBP for the imaging modality. DBT, as an advanced imaging modality of FFDM, has become a widely used and commonly employed breast imaging technique. Our DL models using widely available DBP imaging have potential advantage of clinical application when compared to MRI. In addition, as DBT provides more detailed information about morphological information of tumor when compared to FFDM, this may accurately reflect tumor heterogeneity 19.
Our result suggested that the predictive accuracy of Ki-67 expression varies among sub-dataset of radiological characteristics of breast cancer. The accuracy in the mass sub-dataset was higher compared to the other sub-datasets, whereas that in the calcification sub-dataset was lower compared to the other sub-datasets. This pattern is consistent with the prior research, where a lower accuracy of calcification compared to other findings was observed to predict the presence of stromal invasion of breast cancer [20]. In the paper, they suggested that the DL model did not represent the relationship between calcification and invasion because of the donwnsampling image processing for DBT image. We assumed the same reason for the lower accuracy to predict Ki-67 expression in the calcification sub-dataset. The accuracy in the distortion sub-dataset was lower than that in the mass sub-dataset, and lower than that in the calcification sub-dataset. Although image interpretation is sometimes difficult, presence of distortion, presence of distortion considered to be pathognomonic for malignancy, when once detected. Therefore, it is reasonable that the accuracy for distortion would fall between that of mass and calcification. FAD sub-dataset did not show any clear trends, which may be associated with the small number of cases.
The clinical utility of Ki-67 has undergone historical transitions. It was previously used for the classification of luminal breast cancer. Based on the St. Gallen International Consensus Guidelines [3] previous radiomics studies set the threshold of Ki-67 expression 14% [21]. Recently, many studies recommended clinically significant threshold of Ki-67 expression as 30% in association with determination of adjuvant chemotherapy in hormone-positive, HER2-negative early breast cancer 21. This is the reason why 30% was used as the threshold for Ki-67 classification in this research.
Our research has several limitations. First, this research conducted at a single institution using a single vendor, has a small sample size. This could impact the accuracy of the predictions and limit the generalizability of the findings. In the future, it will be necessary to conduct studies across multiple institutions and with larger sample sizes to ensure the robustness of our DL model and its applicability to broader populations. Second, we used cropped images of lesions for our DL model. For clinical application, targeted breast lesions need to automatically detect targeted breast lesions. Third, the expression of Ki-67 was obtained from a preoperative biopsy sample. Localized biopsy samples may not represent the Ki-67 expression of entire tumors due to heterogeneity.
In conclusion, the DL model utilizing DBT has the potential to accurately predict the expression of Ki-67, which can serve as a valuable non-invasive tool in determining the treatment strategy for breast cancer preoperatively.