According to the latest cancer statistics 1, BC has replaced lung cancer as the most common cancer in the America. Consequently, BC treatment and handling the corresponding complications have brought a heavy medical burden to the society, which is a major problem associated with human cancer. In particular, the quality of life and survival rate of BC patients are significantly reduced after they develop BC. This is mainly attributed to the occurrence of skeleton-related events (SREs) after BM. Studies have revealed that the severe vertebral invasion, pathological fractures, bone pain, and other SREs pose a serious threat to the prognosis of patients with BM 20,21. A previous survey found that the cumulative incidence of SREs in patients with BM is about 45.1% 22. In the present study, the incidence of BM in BC patients was about 2.3%, which reflects the difficulty of diagnosing patients with BM and the harmfulness of BM. Therefore, it is necessary to effectively screen patients who are prone to develop BM and have poor prognosis after BM. Bone scan combined with CT is the gold standard for detecting BM, and is also the preferred method recommended in current guidelines 23–25. A recent European prospective study showed that [18F] FDG PET/MRI and MRI were significantly better than CT or bone scintigraphy for the detection of BM in newly diagnosed BC patients 26. However, these tests have some drawbacks, such as radiation damage and high cost, and not all patients are willing to undergo BM testing. Thus, to more effectively address these issues, this study aimed at developing two facile clinical models for early detection of high-risk BCBM patients and prediction of BCBM prognosis.
With the rapid development of artificial intelligence technology, machine learning is increasingly being applied in the field of biomedicine, and it also has great potential in future clinical practice 27,28. In 2022, an article published in the journal Nature by Stephen-John Sammut et al. presented a study encompassing clinical information, pathology, genomics, and transcriptomics of 168 patients with breast cancer undergoing chemotherapy. They successfully predicted the complete response of chemotherapy patients using a multi-group machine learning approach (AUC = 0.87) 29. This groundbreaking study demonstrates the significant medical value that mature machine learning models can offer in clinical practice, enabling the provision of more accurate assistance to doctors and patients through alternative methods.
Nevertheless, despite significant advancements in building and utilizing various models, there is still considerable scope for improvement. Li Li et al. developed a deep learning algorithm that predicts bone metastasis in breast cancer by incorporating MRI radiological features from 96 cases of metastatic breast tumors and 192 cases of non-metastatic breast tumors. The predictive performance of the model is evaluated using statistical morphology and grayscale characteristics, employing metrics such as AUC, sensitivity, and specificity 30. Nonetheless, due to the high demand for front-end MRI images and data, this model cannot be widely adopted. Thio et al. integrated survival data from thousands of cancer patients with extensive bone metastasis to develop a survival prediction model 31. However, the majority of the data utilized for model development and verification originates from laboratory sources, including biochemical data, blood routine data, protease data, and more. While accurately predicting the survival rate, it also imposes more stringent demands on the types of data used. Our machine learning model is specifically designed to predict the occurrence of bone metastasis in breast cancer patients and prognosticate patients with bone metastasis. All the parameters required for the model are derived from routine clinical practice, making them more accessible than specific images or laboratory data. This model can also be utilized by hospitals in remote areas or by junior clinicians to guide the comprehensive treatment planning of breast cancer patients, enabling early intervention to prevent and address potential clinical adverse events. Secondly, it employs multiple strategies, such as preventing overfitting and utilizing shrinkage and column subsampling techniques, to enhance algorithmic generalization and learning speed. The XGBoost algorithm, which has demonstrated high accuracy and ease of use in numerous studies 32–34, is referenced in this model.
This study used univariable and multivariable logistic regression analyses to screen fifteen independent risk factors, including age, race, sex, grade, T stage, N stage, surgery, radiotherapy, chemotherapy, tumor size, brain metastasis, lung metastasis, liver metastasis, breast subtype, and PR. According to the order of importance of the SHAP diagram, the features that contributed prominently were surgery, N stage, and T stage. Next, univariate and multivariate Cox proportional hazards regression analyses were applied to screen thirteen independent prognostic factors, including age, race, marital status, grade, breast subtype, surgery, radiotherapy, chemotherapy, brain metastases, liver metastases, lung metastases, ER, and PR. All features were also ranked by importance, with results showing that surgery, liver metastases, and lung metastases were the three factors strongly associated with prognosis. However, some features that were considered meaningful in the multivariable logistic regression analysis and multivariate Cox proportional hazards regression analysis had a SHAP value of zero in importance ranking. This may further reflect the superiority of machine learning. Specifically, it can better eliminate unnecessary features unlike traditional linear regression analysis, which has the problem of overfitting. Machine learning enables us to obtain more accurate predictive models by continuously improving operational efficiency and self-improvement.
This study found that BC patients who did not undergo surgery were at high risk of developing BM. Yao et al. 17 also suggested that surgery was an independent risk factor for BCBM. Despite the hazard of radiation damage, we still recommend bone scans to examine BM in unoperated BC patients. We also found that T stage and N stage were strong predictors of BM. Studies have demonstrated that the increase of T and N stages of malignant tumors indicates the increase of tumor volume, and the expansion of the degree and extent of involvement of adjacent tissues and lymph nodes, which are the manifestations of further development of malignant tumors 35,36. It is well known that the TNM staging system proposed by the AJCC is a widely used prognostic system 37. However, previous studies have shown that the accuracy of using the TNM staging system alone to predict metastases is not high, and thus researchers often obtain better prediction results through comprehensive analysis of multiple factors 38,39. Interestingly, surgery was also the most prominent feature with regard to prediction of BCBM prognosis. Although metastatic BC remains an incurable disease, surgery to remove the primary tumor is associated with improved survival in patients with distant metastatic BC at diagnosis. One study reported that patients who underwent primary surgery had significantly longer median survival than those who did not, and primary tumor resection for primary BCBM reduced the risk of death by approximately 40% 40. A randomized controlled trial conducted in Turkey found that the 3-year OS was similar in patients with and without primary BC surgery. However, at a median follow-up of 5 years, patients who underwent surgery had a prolonged median OS by approximately nine months 41. In addition, a trial conducted in India, revealed that the OS of patients with de novo metastatic BC was not improved after surgery for their primary BC 42. Scholars in Europe concluded that surgical treatment of the primary tumor in patients with de novo metastatic BC could not benefit majority of them 43. A retrospective study by Gong et al. 44 identified surgery as an independent prognostic factor for BCBM, which is consistent with our findings. Therefore, whether the primary tumor of BCBM should be operated is still controversial, which calls for further multicenter prospective studies for verification. Liver and lung metastases play an important role in predicting the prognosis of BCBM. This study found that BCBM patients with liver metastasis or lung metastasis had a poor prognosis, and their 5-year survival rate was lower than that of other types of BCBM patients. We comprehensively considered all meaningful features to predict the prognosis of BCBM and achieved good predictive performance.
The ultimate purpose of building models is to be more convenient for clinical application and help clinicians make decisions. Consequently, based on the XGBoost algorithm, we built two accessible online websites (https://share.streamlit.io/lry4000/bone_metastasis/main) and (https://share.streamlit.io/lry4000/sc5_new/main ). Specifically, a streamlined web page structure enables users to input data more efficiently. The clinical parameters mentioned in the article are displayed on the right side of the webpage, allowing users to input corresponding clinical data based on the actual condition of the patients. The system will instantly generate the predicted probability of bone metastasis for the patient. The results can be presented in various formats and shared with a broader range of clinical participants. The second web page, which predicts the survival rate, follows a similar usage process.
There are some limitations in our study. First, this is a multicenter retrospective study involving only patients from the United States, and thus it inevitably suffers from selection bias. Therefore, there is a need for external data from other countries to validate the reproducibility of our results. Second, although our model achieved good clinical performance on the basis of SEER database, it is essential to further confirm the reliability of the model through prospective studies. Third, the SEER database does not include blood routine, biochemical indicators, and Charlson Comorbidity Index (CCI), which may lead to the model missing some important features.