Clustering primarily serves as an analytical technique that extracts meaningful information based on the pattern characteristics of a target data set. It divides unlabelled data into different groups, where the similarity within each group is high, and the dissimilarity between groups is significant. Currently, this approach is applied in many fields[15]. Our study adopts a hierarchical clustering method, providing a structured approach to discern intrinsic imaging phenotypes of breast tumors. Unlike previous research primarily reliant on qualitative interpretation for identifying distinct imaging patterns linked to histopathologic correlates[16], our study introduces a systematic, quantitative evaluation framework. Cluster analysis aims to categorize data points within a population into groups where members of each cluster share more similarities with each other than with those in different clusters[9]. We employed unsupervised cluster analysis to uncover natural groupings within the imaging features we extracted, leading to the identification of intrinsic phenotypes characterized by distinct and measurable imaging patterns. This method reflects the clustering techniques pivotal in uncovering the inherent molecular subtypes of breast cancer. To visualize and interpret these imaging phenotypes, we generated heat maps akin to microarray expression clustering dendrograms. This visualization aids in beginning to understand the correlation between the imaging presentation of cancer and its molecular profile, potentially offering insights into the likelihood of cancer recurrence. The study's findings reveal notable differences in several critical parameters between the two patient groups. The significant variations observed in these areas underscore the efficacy of radiomics clustering analysis in distinguishing breast cancer cases based on their intrinsic phenotypes. This differentiation is crucial, as it paves the way for more personalized treatment approaches, tailoring strategies to the specific characteristics and needs of each tumor type. The ability to categorize breast cancer more precisely based on these key parameters holds significant potential for enhancing the effectiveness of treatment plans and improving patient outcomes.
The cluster analysis results of our study, showing statistical significance in molecular subtypes, ER, PR, size, histological grading, edema, LNM, and location, align with recent advancements in breast cancer research. Lyu et al analyzed factors influencing sentinel lymph node metastasis in breast cancer, highlighting the importance of molecular subtypes, tumor size, and histological grade in predicting lymph node involvement[17]. Wang et al analyzed risk factors of axillary lymph node metastasis and prognosis in T1 breast cancer using the SEER database, highlighting the impact of tumor size, histological grade, and subtype on lymph node involvement and survival outcomes[18]. This further validates our study's emphasis on these factors in breast cancer assessment.This supports our findings on the significance of these parameters in breast cancer characterization. Xu et al investigated molecular phenotypes and clinical characteristics of familial hereditary breast cancer, emphasizing the role of clinical pathologic subtypes, tumor location, histological grade, and lymph node metastasis. The study results showed significant differencesbetween hereditary and non-hereditary breast cancers, including histological grading (grade II/III), LNM, PR, and HER2[19]. Their findings resonate with our study, underscoring the complexity and diversity of breast cancer subtypes. Our study contributes significantly to breast cancer research by providing a comprehensive cluster analysis that correlates imaging features with key clinical and pathological parameters. We offer a novel approach to breast cancer classification, integrating imaging phenotypes with molecular typing, ER, PR, tumor size, histological grading, edema, and lymph node metastasis , these may have varying influences on the prognosis and survival outcomes in breast cancer patients. This multidimensional analysis enhances the understanding of breast cancer heterogeneity and supports the development of personalized treatment strategies.
Molecular subtypes, particularly those defined by hormone receptor status, have been widely recognized for their prognostic implications. ER-positive and PR-positive tumors often suggest a better prognosis and responsiveness to hormonal therapies, which could lead to improved survival rates[20]. Conversely, tumors with negative hormone receptor status, particularly those falling into the triple-negative category, are often associated with a more aggressive course and poorer outcomes [21]. Lee et al demonstrated the potential of machine learning in radiomics for predicting prognostic biomarkers and molecular subtypes of breast cancer, emphasizing the role of tumor heterogeneity and angiogenesis properties on MRI[6]. This aligns with our observation of distinct phenotypic profiles in breast cancer, highlighting the importance of personalized treatment strategies. Our results also align with Romeo et al, who applied a machine learning-based radiomics approach to hybrid 18F-FDG PET/MRI for predicting axillary lymph node involvement in breast cancer[22]. This supports our findings on the utility of radiomics in assessing tumor aggressiveness and invasion potential. Furthermore, Araz et al explored the role of preoperative 18F-FDG PET/CT radiomics features in predicting hormone receptor positivity in primary breast tumors[23]. While their study found limited predictive value in radiomics parameters for hormone receptor status, it highlights the complexity and challenges in correlating radiomics features with specific tumor characteristics, a theme also evident in our study. Metzger Filho et al assessed the impact of HER2 heterogeneity on response to HER2-targeted therapy, revealing that HER2 heterogeneity is associated with resistance to such treatments[24]. Szep et al demonstrated that whole-tumor ADC texture analysis could predict hormonal status in breast cancer masses[25]. These findings are particularly relevant to our study, as they highlighted the importance of understanding tumor heterogeneity in treatment planning, and emphasized on the potential of radiomics in breast cancer diagnosis and treatment.
Tumor size, histological grading, and tumor location are also critical in determining the aggressiveness of the disease. Larger tumors and those with higher histological grades are generally indicative of a more advanced disease state, potentially leading to lower survival rates. Min Y et al. identified clinicopathological characteristics associated with the risk of distant metastasis and prognosis in breast cancer patients without lymph node metastasis, highlighting the importance of tumor size and histological grade[26]. Similarly, the presence of lymph node metastasis is a well-established marker of poor prognosis, as it signifies the spread of cancer cells beyond the primary tumor site. Lymph node metastasis in breast cancer patients is a critical factor that significantly influences prognosis. The presence of cancer cells in the lymph nodes, particularly the axillary lymph nodes, is often considered a hallmark of the spread of the disease beyond the primary tumor site. This metastatic spread is a key determinant in staging breast cancer, which in turn guides treatment decisions and helps predict patient outcomes. Patients with lymph node involvement generally have a poorer prognosis compared to those without. Recent studies, including the development of nomogram models, have highlighted several independent predictive factors for axillary lymph node metastasis in breast cancer patients, and these factors include tumor size, primary site, molecular subtype, and histological grading[27].
Edema, while not as extensively studied as other factors, could provide additional insights into the tumor microenvironment and its interaction with the surrounding tissues, which might have implications for disease progression and metastatic potential. In the study by Zeyan Xu et al., edema is a promising predictive factor for lymph node metastasis in breast cancer, which is very important for patients with early-stage breast cancer and may help in formulating treatment plans. Additionally, a positive correlation was observed between BES and various aggressive clinicopathological factors (p < 0.05)[28]. In future studies, it would be crucial to integrate these factors into survival analysis models to comprehensively understand their collective impact on patient outcomes. Such analyses could not only validate the clinical significance of our cluster analysis results but also contribute to the development of more tailored and effective treatment strategies, ultimately improving patient survival and quality of life in breast cancer.
Our study makes significant contributions to the field of breast cancer research by integrating radiomics clustering analysis with traditional histopathological markers. We provide a comprehensive approach to breast cancer classification, enhancing the understanding of tumor heterogeneity and its implications for personalized medicine. Our findings contribute to the growing body of evidence supporting the use of radiomics in breast cancer diagnosis and treatment planning. In evaluating conflicting explanations of our results, we defend our approach by highlighting the robustness of our methodology and the consistency of our findings with current literature. While some studies suggest limited predictive value of radiom[28]ics in certain aspects of breast cancer, our study demonstrates its utility in differentiating intrinsic phenotypes, supporting its role in personalized treatment strategies.
While our study provides valuable insights into the classification and treatment of breast cancer using radiomics, it is important to acknowledge its limitations. Firstly, the limited sample size of 194 patients, although substantial, may not fully capture the diversity and complexity of breast cancer phenotypes. Another significant limitation is the absence of survival analysis. Our study focused on the classification of breast cancer based on radiomics and traditional histopathological markers, without delving into the long-term outcomes of these patients. Future studies with larger cohorts are necessary to validate and extend our results, and include survival data to provide a more comprehensive understanding of the clinical significance of radiomics-based classification. Additionally, our study did not incorporate genetic testing results, which could provide deeper insights into the molecular underpinnings of the observed phenotypes. The lack of genetic information represents a missed opportunity to explore the interplay between genotypic and phenotypic characteristics of breast cancer. In conclusion, while our study makes significant contributions to the field of breast cancer research, addressing these limitations in future studies will be crucial for advancing our understanding of breast cancer and improving patient care. Expanding the sample size, including survival analysis, and integrating genetic testing results are key areas for future research that could significantly enhance the impact and applicability of our findings.