Comparison of different machine learning models based on ultrasound-based radiomics to predict central lymph node metastasis of papillary thyroid carcinoma

doi:10.21203/rs.3.rs-3446340/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Abstract

Background: Accurate methods to predict central lymph node metastases preoperatively are needed to improve the management of patients with papillary thyroid carcinoma. The objective of this study was to apply machine learning models based on ultrasound radiomic data to predict central lymph node metastases and to identify the best differential diagnosis model.

Methods: Clinicopathological information was retrospectively collected. All patients underwent preoperative thyroid ultrasound and postoperative lymph node pathology analysis. The regions of interest were manually drawn using 3D Slicer software, and radiomic features were extracted from each lesion. Five machine learning models were established to predict central lymph node metastases: logistic regression, support vector machine, random forest, decision tree, and adaptive boost.

Results: Patients (n=229) were randomly divided into training (n=161) and validation (n=68) cohorts at a ratio of 7:3. Sixty-four patients exhibited central lymph node metastases. Logistic regression was the preferred algorithm to predict the occurrence of central lymph node metastases. The area under the curve, sensitivity, specificity, precision, recall, accuracy, and F1-score were 0.722, 0.761, 0.682, 0.833, 0.761, 0.735, and 0.795, respectively.

Conclusions: Novel ultrasound radiomic machine learning models accurately predicted the occurrence of central lymph node metastases in patients with papillary thyroid carcinoma. The radiomic-based logistic regression model was the most effective and reliable preoperative method for the differential diagnosis of central lymph node metastases.

Keywords: central lymph node metastases, papillary thyroid carcinoma, machine learning models, ultrasound-based radiomics

Background

The incidence of thyroid cancer (TC), the most common type of endocrine tumor, has increased rapidly in recent decades, especially that of papillary TC (PTC) (1). PTC, which constitutes approximately 85–90% of all TC cases (2), has the best prognosis among all subtypes. However, the incidence of cervical lymph node metastasis in PTC, especially in the central compartment, is notably high, ranging from 30–90% (3). The central compartment is the primary site of cervical lymph node metastasis in PTC (4); metastasis to this compartment is referred to as central lymph node metastasis (CLNM).

Currently, ultrasound (US) is an important diagnostic approach for TC and CLNM because it is economical, noninvasive, and repeatable. Nonetheless, preoperative US has a low diagnostic value for CLNM, with a sensitivity below 40% (3, 5), probably owing to disease diversity. First, most CLNM are microscopic and occult and are therefore not detectable on preoperative US. Second, the precision and quality of US diagnosis rely heavily on the technician's experience and subjective judgment (6).

Whether routine prophylactic dissection of the lymph nodes of the central compartment (pCCND) should be performed in clinically negative patients (cN0) with PTC remains controversial (7, 8). Although pCCND may reduce local recurrence and improve the prognosis of this disease (9), it is also linked to a significantly higher rate of complications than non-dissection, including hypoparathyroidism and recurrent laryngeal nerve injury (7, 9). Therefore, a reasonable and prudent preoperative strategy for central cervical lymph node dissection is necessary, given that both excessive and insufficient dissection can adversely affect patient outcomes and quality of life. Consequently, there is an urgent need for a more objective and accurate method to predict CLNM in patients with PTC before attempting any surgical intervention.

US radiomics (USR) is an innovative tool used to extract data from medical images, incorporating hundreds of quantitative features to create an image-based biomarker for disease diagnosis and the assessment of tumor physiology (10, 11). Meanwhile, machine learning (ML) is a computer-based data analysis method extensively employed in the medical field and, in particular, in radiology (12, 13). Previous research has demonstrated that the performance of radiomic models can vary significantly depending on feature selection and tumor type (6, 14). However, to our knowledge, studies comparing ML models (MLMs) based on USR for predicting CLNM in patients with PTC are limited.

Methods

Study population

This retrospective study was approved by the Ethics Committee of Guangxi Medical University Cancer Hospital (approval number: LW2023145) and adhered to the Helsinki Declaration of 1975 (revised in 2013). Given the study's retrospective design, informed consent was waived. Consecutive patients who underwent initial thyroid surgery at our medical institution from June 2020 to June 2022 were selected from the electronic health records, and clinical data were subsequently reviewed retrospectively. The inclusion criteria were: (1) preoperative thyroid fine needle aspiration biopsy (FNAB) with BRAF V600E gene assessment at the center; (2) patients diagnosed with TC who underwent unilateral lobectomy plus isthmectomy with pCCND; (3) complete and clear preoperative US images of good quality; and (4) patients who had not received any form of chemoradiotherapy or other cancer treatments prior to surgery. The exclusion criteria were: (1) non-PTC or mixed TC; (2) suspected lateral lymph node metastasis on preoperative US; and (3) missing postoperative pathological results. Following the eighth edition of the American Joint Committee on Cancer TNM system, lymph node metastasis in VI and VII compartments confirmed by postoperative pathology was identified as CLNM (N1a)(15). A total of 229 patients were included in the study and were randomly allocated to either the training or validation set at a 7:3 ratio. An overview of our analysis workflow is illustrated in Fig. 1.
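
The random 7:3 allocation described above can be sketched as follows. This is only an illustration: the function name `split_cohort`, the seed, and the use of consecutive integers as patient identifiers are assumptions, and the authors' actual randomization procedure is not reported. Rounding the validation share down reproduces the reported cohort sizes (161 and 68).

```python
import random

def split_cohort(patient_ids, val_fraction=0.3, seed=0):
    """Shuffle the identifiers and split them into training and validation lists."""
    ids = list(patient_ids)
    rng = random.Random(seed)               # fixed seed (assumed) for reproducibility
    rng.shuffle(ids)
    n_val = int(len(ids) * val_fraction)    # rounds down: 229 * 0.3 gives 68
    return ids[n_val:], ids[:n_val]

# Hypothetical identifiers 0..228 standing in for the 229 enrolled patients.
train_ids, val_ids = split_cohort(range(229))
print(len(train_ids), len(val_ids))  # 161 68
```

Fixing the seed makes the split repeatable, which matters when several models are later compared on the same validation cohort.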

US and image analysis

Prior to surgery, all patients underwent a US examination performed by two well-trained technicians using the Logiq E9 and Toshiba Aplio 500 US systems with a 5–12 MHz high-frequency linear array probe. Cross-sectional and longitudinal images of the thyroid were acquired, and the size of the cervical lymph nodes was measured. The examination aimed to evaluate nodule characteristics and assess the status of the cervical lymph nodes.

Region of interest (ROI) segmentation and radiomic feature extraction

The ROI was manually segmented on US images by two radiologists experienced in thyroid disease diagnosis, using 3D Slicer software (v.4.10.2). The segmentation delineated specific areas within the thyroid gland, excluding regions with necrosis, hemorrhage, or cysts. To evaluate the consistency of ROI placements, 30 cases were randomly chosen from the entire patient cohort, and a second radiologist with expertise in thyroid US diagnosis independently positioned the ROIs over corresponding structures. Overall, 837 radiomic features were extracted using wavelet-filtered images, comprising first-order statistics and grey-level dependence matrix, grey-level co-occurrence matrix, grey-level run-length matrix, grey-level size zone matrix, and neighborhood grey tone difference matrix features.
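
To make one of these feature families concrete, the sketch below computes a grey-level co-occurrence matrix (GLCM) and two texture features derived from it (joint energy and maximum probability) on a toy 4 × 4 image. It is a simplified illustration, not the extraction pipeline used in this study: the toy image, the single horizontal offset, the four grey levels, and the function names are all assumptions, and no wavelet filtering is applied.

```python
def glcm(image, levels, dx=1, dy=0):
    """Symmetric, normalised grey-level co-occurrence matrix for one pixel offset."""
    counts = [[0.0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                i, j = image[r][c], image[r2][c2]
                counts[i][j] += 1   # count the pair in both directions
                counts[j][i] += 1   # so that the matrix is symmetric
    total = sum(sum(row) for row in counts)
    return [[value / total for value in row] for row in counts]

def joint_energy(p):
    """Sum of squared co-occurrence probabilities (high for homogeneous texture)."""
    return sum(value * value for row in p for value in row)

def maximum_probability(p):
    """Probability of the most frequent pair of neighbouring grey levels."""
    return max(value for row in p for value in row)

# Toy region of interest with grey levels 0..3 (hypothetical values).
toy_roi = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
p = glcm(toy_roi, levels=4)
print(round(joint_energy(p), 4), maximum_probability(p))  # 0.1458 0.25
```

In a real radiomics workflow the same definitions are applied to the segmented ROI after grey-level discretisation and, for wavelet features, after each wavelet decomposition of the image.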

Feature selection and radiomic signature construction

The dataset was randomly divided into training and validation sets, with 70% of the data assigned to the training set (n = 161 patients) and 30% to the validation set (n = 68 patients). The radiomic signature was created using the least absolute shrinkage and selection operator (LASSO) regression. LASSO is a regression analysis method that has been widely used in feature selection; it performs feature selection and regularization simultaneously, thereby enhancing model prediction accuracy (16). We chose LASSO for feature selection owing to its effectiveness demonstrated in previous studies (17, 18). Finally, 10 radiomic features potentially linked to CLNM were selected and assessed within both the training and validation sets.
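
The way LASSO combines shrinkage with selection can be illustrated with a minimal coordinate-descent sketch on synthetic data. This is not the authors' implementation (the software settings and the penalty value are not reported here), and it uses a linear rather than a logistic loss for brevity; the data, the penalty `lam=0.1`, and all function names are assumptions. The point of the example is that the soft-thresholding step sets uninformative coefficients exactly to zero.

```python
import random

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values within [-gamma, gamma] become exactly 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/2n) * ||y - X b||^2 + lam * ||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # y - X b, kept up to date after every coefficient change
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Correlation of feature j with the residual that ignores feature j.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_beta = soft_threshold(rho, lam) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta

# Synthetic data (hypothetical): 8 candidate features, only features 0 and 3 carry signal.
rng = random.Random(0)
X = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(120)]
y = [2.0 * row[0] - 1.5 * row[3] + rng.gauss(0.0, 0.1) for row in X]

beta = lasso_coordinate_descent(X, y, lam=0.1)
selected = [j for j, b in enumerate(beta) if b != 0.0]
print(selected)
```

In practice the penalty strength is usually chosen by cross-validation, and the features with non-zero coefficients at the chosen penalty form the selected set.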

ML techniques

Five well-established ML algorithms were employed for modeling, namely logistic regression (LR), support vector machine (SVM), random forest (RF), decision tree (DT), and adaptive boost (AdaBoost). The models were trained using the training set, with hyperparameters optimized through a grid search with five-fold cross-validation to minimize prediction errors. Model performance in both the training and validation sets was assessed using standard clinical metrics such as the area under the curve (AUC), sensitivity, specificity, precision, recall, accuracy, and F1-score (19). Performance in the validation set was then compared across the models to select the best classifier.
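
The threshold-based metrics listed above all follow from the four cells of a confusion matrix, as the sketch below shows. The counts used here are not reported by the authors in this form: they are back-calculated, as an assumption, from the training-set class sizes in Table 1 (42 CLNM-positive, 119 CLNM-negative) and the LR training-set rates in Table 2, purely to provide a worked example. Note that recall is the same quantity as sensitivity, which is why the two columns in Table 2 are identical.

```python
def classification_metrics(tp, fn, tn, fp):
    """Standard metrics derived from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)            # also called recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "recall": sensitivity,
        "accuracy": accuracy,
        "f1": f1,
    }

# Assumed counts (back-calculated, see the note above): 38 + 4 = 42 positives,
# 76 + 43 = 119 negatives, 161 patients in total.
metrics = classification_metrics(tp=38, fn=4, tn=76, fp=43)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With these counts the function returns 0.905, 0.639, 0.469, 0.708, and 0.618 for sensitivity, specificity, precision, accuracy, and F1-score, which agree with the LR training-set row of Table 2.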

Statistical analysis

Statistical analyses were conducted using R software (v.3.5.3) and IBM SPSS Statistics for Windows (v.26.0). Significance was established at a two-tailed P-value < 0.05. Continuous variables in the study were expressed as mean ± standard deviation (SD), while categorical variables were presented as frequencies. To visually represent the LR model's prediction results, we utilized a confusion matrix, a precision-recall curve, and a nomogram.

Results

Demographic data of the enrolled patients

We included 229 patients diagnosed with PTC who underwent unilateral lobectomy plus isthmectomy with pCCND. Table 1 displays the basic demographic data of these patients. The BRAF V600E mutation was detected using FNAB analysis, whereas CLNM was confirmed by pathological analysis.

Table 1

Patient characteristics of the training and validation sets
Characteristic	Total (n = 229)	Training set (n = 161)	Validation set (n = 68)	t/χ²	P
Age, mean ± SD, years	43.048 ± 10.209	43.298 ± 9.664	42.455 ± 11.451	0.570	0.570
Sex, n				0.013	0.911
Female	174	122	52
Male	55	39	16
BRAF V600E, n				3.310	0.069
Positive	167	123	44
Negative	62	38	24
CLNM, n				0.932	0.334
Positive	64	42	22
Negative	165	119	46
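
The χ² statistics in Table 1 can be checked with a short calculation. The sketch below assumes a Pearson χ² test on each 2 × 2 table without continuity correction (the exact test settings are not stated in the text); under that assumption it reproduces the reported values for sex, BRAF status, and CLNM status. The function name is illustrative.

```python
import math

def chi_square_2x2(table):
    """Pearson chi-square statistic and P-value (1 degree of freedom) for a 2 x 2 table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, observed_row in enumerate(table):
        for j, observed in enumerate(observed_row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # For 1 degree of freedom, P(X > chi2) equals erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Rows: category levels; columns: (training set, validation set), taken from Table 1.
tables = {
    "Sex": ((122, 52), (39, 16)),
    "BRAF": ((123, 44), (38, 24)),
    "CLNM": ((42, 22), (119, 46)),
}
results = {name: chi_square_2x2(t) for name, t in tables.items()}
for name, (chi2, p_value) in results.items():
    print(f"{name}: chi2 = {chi2:.3f}, P = {p_value:.3f}")
```

None of the three comparisons reaches P < 0.05, which is consistent with the training and validation sets being balanced on these characteristics.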

Feature selection and model construction

LASSO regression was employed to select the most relevant predictive features from the aforementioned characteristics. Consequently, an optimal set of 10 features for building the MLMs was identified, as depicted in Fig. 2A and 2B.

Predictive performance of ML-based models

Predictive models for PTC CLNM were established using the five aforementioned algorithms. Table 2 displays the predictive performance of these models, which includes metrics such as the AUC, sensitivity, specificity, precision, recall, accuracy, and F1-score. Notably, performance differed markedly across the models. Although the AUC of the LR model was nearly identical to that of the SVM model (0.722 vs. 0.724), the LR classifier outperformed the other four classifiers in the validation set, achieving the highest values for accuracy (0.735) and F1-score (0.795). Consequently, the LR model was considered the most accurate.

Table 2

Predictive performance of the machine learning models for the training and validation sets.
Training Set								Validation Set
	AUC	SEN	SPEC	PRE	REC	ACC	F1	AUC	SEN	SPEC	PRE	REC	ACC	F1
LR	0.831	0.905	0.639	0.469	0.905	0.708	0.618	0.722	0.761	0.682	0.833	0.761	0.735	0.795
DT	0.869	0.810	0.849	0.654	0.810	0.839	0.724	0.601	0.543	0.682	0.781	0.543	0.588	0.641
RF	1.000	1.000	1.000	1.000	1.000	1.000	1.000	0.662	0.739	0.591	0.791	0.739	0.691	0.764
SVM	0.890	0.908	0.762	0.915	0.908	0.870	0.911	0.724	0.909	0.543	0.488	0.909	0.662	0.635
Adaboost	0.983	0.952	0.941	0.851	0.952	0.944	0.899	0.663	0.565	0.773	0.839	0.565	0.632	0.675
AUC, area under the curve; SEN, sensitivity; SPEC, specificity; PRE, precision; REC, recall; ACC, accuracy; F1, F1-score; LR, logistic regression; DT, decision tree; RF, random forest; SVM, support vector machine; Adaboost, adaptive boosting.
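
The model comparison in Table 2 can be made explicit by listing, for each validation-set metric, the classifier with the highest value. The sketch below simply transcribes the validation columns of Table 2 (recall is omitted because it duplicates sensitivity); the dictionary layout and function name are illustrative choices.

```python
# Validation-set metrics transcribed from Table 2.
validation = {
    "LR":       {"AUC": 0.722, "SEN": 0.761, "SPEC": 0.682, "PRE": 0.833, "ACC": 0.735, "F1": 0.795},
    "DT":       {"AUC": 0.601, "SEN": 0.543, "SPEC": 0.682, "PRE": 0.781, "ACC": 0.588, "F1": 0.641},
    "RF":       {"AUC": 0.662, "SEN": 0.739, "SPEC": 0.591, "PRE": 0.791, "ACC": 0.691, "F1": 0.764},
    "SVM":      {"AUC": 0.724, "SEN": 0.909, "SPEC": 0.543, "PRE": 0.488, "ACC": 0.662, "F1": 0.635},
    "AdaBoost": {"AUC": 0.663, "SEN": 0.565, "SPEC": 0.773, "PRE": 0.839, "ACC": 0.632, "F1": 0.675},
}

def best_model(metric):
    """Name of the classifier with the highest validation value for one metric."""
    return max(validation, key=lambda model: validation[model][metric])

for metric in ("AUC", "SEN", "SPEC", "PRE", "ACC", "F1"):
    winner = best_model(metric)
    print(f"{metric}: {winner} ({validation[winner][metric]:.3f})")
```

No single classifier leads on every metric: LR has the highest accuracy and F1-score, SVM the highest AUC and sensitivity, and AdaBoost the highest specificity and precision.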

LR model performance

The CLNM prediction model, established using the LR algorithm, yielded the following performance metrics in the validation set: AUC 0.722, sensitivity 0.761, specificity 0.682, precision 0.833, recall 0.761, accuracy 0.735, and F1-score 0.795, as outlined in Table 2. Figure 3 displays the confusion matrix and the receiver operating characteristic (ROC) curve for both the training and validation sets of the LR model. In the confusion matrix, among the 161 patients in the training set, the model accurately classified 17 (TP) of 26 patients with CLNM and 110 (TN) of 135 patients without CLNM (Fig. 3A and 3B). Additionally, the precision-recall curve for this model is shown in Fig. 4A and 4B.
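
For readers less familiar with the AUC, the sketch below computes it directly from its rank interpretation: the probability that a randomly chosen positive case receives a higher predicted score than a randomly chosen negative case, with ties counted as one half. The labels and scores are hypothetical and serve only to illustrate the calculation; they are not patient data from this study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive-negative pairs ranked correctly (ties count 0.5)."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present to compute the AUC")
    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0
            elif pos_score == neg_score:
                wins += 0.5
    return wins / (len(positives) * len(negatives))

# Hypothetical predicted probabilities for two negative and two positive cases.
example_labels = [0, 0, 1, 1]
example_scores = [0.10, 0.40, 0.35, 0.80]
print(roc_auc(example_labels, example_scores))  # 0.75
```

Because the AUC depends only on the ranking of scores, it is unaffected by the classification threshold, unlike sensitivity, specificity, and accuracy.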

Performance of the radiomic score and nomogram

The calculation formula for the radiomic score (Rad_score) incorporated 10 radiomic features (Table 3). Figure 5 illustrates the nomogram developed using the Rad_score, which was defined as follows:

Table 3

Selected features and their coefficients
Features	Coefficient	P	HR (95% CI)
wavelet-LHL_glcm_MCC	-1.083	< 0.0001	0.339 (0.19–0.604)
wavelet-LHH_gldm_DependenceVariance	0.236	0.508	1.267 (0.629–2.551)
wavelet-LHH_glszm_SmallAreaLowGrayLevelEmphasis	0.335	0.185	1.397 (0.852–2.292)
wavelet-HLH_gldm_DependenceVariance	1.028	0.014	2.794 (1.228–6.36)
wavelet-HLH_glszm_SizeZoneNonUniformityNormalized	-0.231	0.617	0.794 (0.320–1.965)
wavelet-HLH_glszm_SmallAreaEmphasis	-0.678	0.19	0.508 (0.184–1.398)
wavelet-HHL_glcm_JointEnergy	1.416	0.032	4.119 (1.130–15.016)
wavelet-HHL_glcm_MaximumProbability	-0.568	0.318	0.567 (0.186–1.725)
wavelet-HHH_firstorder_Maximum	1.256	< 0.0001	3.512 (1.841–6.698)
wavelet-HHH_glcm_Imc1	-0.468	0.195	0.626 (0.309–1.270)
Constant	-1.408
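
One plausible reading of Table 3 is that the Rad_score is the linear predictor of the LR model: the constant plus the sum of each coefficient multiplied by its feature value. The sketch below implements that reading. It rests on several assumptions that are not confirmed in the text: that the feature values are standardized before being weighted, that the score maps to a probability through the logistic function, and that the function names are merely illustrative. The last loop checks a fact that can be verified from the table itself, namely that the exponential of each coefficient reproduces the value in the HR column.

```python
import math

INTERCEPT = -1.408  # "Constant" row of Table 3

# Coefficients and reported ratios (HR column) transcribed from Table 3.
TABLE_3 = {
    "wavelet-LHL_glcm_MCC": (-1.083, 0.339),
    "wavelet-LHH_gldm_DependenceVariance": (0.236, 1.267),
    "wavelet-LHH_glszm_SmallAreaLowGrayLevelEmphasis": (0.335, 1.397),
    "wavelet-HLH_gldm_DependenceVariance": (1.028, 2.794),
    "wavelet-HLH_glszm_SizeZoneNonUniformityNormalized": (-0.231, 0.794),
    "wavelet-HLH_glszm_SmallAreaEmphasis": (-0.678, 0.508),
    "wavelet-HHL_glcm_JointEnergy": (1.416, 4.119),
    "wavelet-HHL_glcm_MaximumProbability": (-0.568, 0.567),
    "wavelet-HHH_firstorder_Maximum": (1.256, 3.512),
    "wavelet-HHH_glcm_Imc1": (-0.468, 0.626),
}

def rad_score(features):
    """Linear predictor: constant plus coefficient-weighted feature values (assumed form)."""
    return INTERCEPT + sum(coef * features[name] for name, (coef, _) in TABLE_3.items())

def predicted_probability(score):
    """Logistic transform of the score (assumed link between Rad_score and risk)."""
    return 1.0 / (1.0 + math.exp(-score))

# Hypothetical patient whose standardized feature values are all zero (cohort average).
average_patient = {name: 0.0 for name in TABLE_3}
score = rad_score(average_patient)
print(round(score, 3), round(predicted_probability(score), 3))  # -1.408 0.197

# exp(coefficient) agrees with the reported ratio for every feature.
for name, (coef, reported) in TABLE_3.items():
    print(f"{name}: exp(coef) = {math.exp(coef):.3f}, reported = {reported}")
```

Under these assumptions, a positive coefficient (for example wavelet-HHL_glcm_JointEnergy) raises the predicted probability as the feature increases, and a negative coefficient (for example wavelet-LHL_glcm_MCC) lowers it.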

Discussion

The incidence of PTC is rising progressively, with an increasing number of patients diagnosed worldwide. Surgical resection of the thyroid tumor region is recognized as the preferred treatment method. However, whether pCCND is needed simultaneously for all patients with PTC who are clinically node negative (cN0) remains debatable. According to the 2015 American Thyroid Association (ATA) guidelines, performing pCCND for all patients with PTC is not advisable, especially for those with small (T1 or T2), noninvasive primary tumors (20). However, in China, the latest guidelines for differentiated TC recommend that at least ipsilateral pCCND be performed, provided that the parathyroid glands and the recurrent laryngeal nerve are preserved (21). In this study, 64 of the 229 patients were confirmed to have CLNM by postoperative pathology, accounting for approximately 28% of the overall sample, a percentage that is consistent with the literature (22). Therefore, even when PTC surgery is performed by highly specialized surgeons, clinicians should carefully assess the relative risks and benefits of pCCND for every patient with PTC. To guide patients accurately in selecting the appropriate surgical method, and thus avoid overtreatment and reduce the individual and medico-economic burden, there is a pressing need for a practical predictive model that improves the preoperative prediction of CLNM (20).

US is widely regarded as the best imaging modality for diagnosing PTC, although it may not reveal any abnormal findings suggestive of CLNM preoperatively. US examination is not reliable for visualizing deep anatomic structures, especially those acoustically shadowed by air and bone (23). Furthermore, US examination is subjective and depends heavily on the sonographer's experience, which can result in interobserver variability and may affect the accuracy of CLNM diagnosis (24). Radiomics is a novel, noninvasive method that extracts and analyzes medical image features reflecting tumor heterogeneity to build predictive models with improved diagnostic and predictive capacity (25). USR techniques have advanced rapidly and have been applied to the differential diagnosis of tumors, including the prediction of microvascular invasion in hepatocellular carcinoma (26), the discrimination of high-risk endometrial cancer (27), and the evaluation of breast cancer response to chemotherapy (28). These radiomic features hold promise as noninvasive biomarkers for predicting CLNM in patients with PTC. Advances in computing power have fostered the growth of artificial intelligence applications, which can provide significant assistance in complex decision-making tasks (29). However, different algorithms construct prediction models on distinct principles. Therefore, finding a suitable algorithm to improve the accuracy and diagnostic capacity of radiomic models has become a focus of attention (19, 30).

In our study, five ML models were constructed and compared for the preoperative prediction of CLNM in patients with PTC, based on US radiomic features filtered by LASSO. Ultimately, 10 radiomic features were selected using LASSO regression; they are listed in Table 3. All the chosen radiomic features were wavelet-based; wavelet filtering can uncover hidden information within medical images across multiple scales (31).

The MLMs built with the RF, DT, and AdaBoost algorithms showed marked overfitting. RF, DT, and AdaBoost are tree-based non-linear algorithms that are efficient and accurate methods for variable selection and classification (32); however, these classifiers can fit noise and outliers in the training data, resulting in overfitting (33, 34). This is in accordance with the findings reported by Yin et al. (17). The SVM and LR models were more stable across the training and validation sets than the RF, DT, and AdaBoost models. In the training set, the AUC of the SVM model (0.890) was slightly higher than that of the LR model (0.831). However, in the validation set, the LR model outperformed the SVM model, achieving the highest values for both accuracy (0.735) and F1-score (0.795) among the five classifiers. Overall, the AUC values show that the LR-based ML model could distinguish between CLNM-positive and CLNM-negative patients with PTC in both the training (AUC: 0.831) and validation (AUC: 0.722) sets, indicating satisfactory model performance. Therefore, LR was selected as the CLNM prediction model for PTC in this study. Thereafter, a nomogram based on the Rad_score was created to visually represent the LR model.
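
The degree of overfitting discussed above can be summarised as the drop in AUC from the training set to the validation set. The sketch below computes this gap from the AUC values in Table 2; using the AUC difference as an overfitting indicator is an illustrative choice rather than an analysis reported by the authors.

```python
# (training AUC, validation AUC) for each classifier, transcribed from Table 2.
auc = {
    "LR": (0.831, 0.722),
    "DT": (0.869, 0.601),
    "RF": (1.000, 0.662),
    "SVM": (0.890, 0.724),
    "AdaBoost": (0.983, 0.663),
}

# Gap between training and validation AUC; a larger gap suggests more overfitting.
gap = {model: train - val for model, (train, val) in auc.items()}
ranking = sorted(gap, key=gap.get)  # smallest gap (most stable) first

for model in ranking:
    print(f"{model}: training-validation AUC gap = {gap[model]:.3f}")
```

The gap is smallest for LR (0.109) and SVM (0.166) and largest for the tree-based models DT (0.268), AdaBoost (0.320), and RF (0.338), in line with the stability ordering described in the text.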

The LR model in this study demonstrated a higher predictive accuracy than some models based on traditional clinical features in previous studies (30). Agyekum et al. (6) developed a CLNM prediction model incorporating clinical risk factors and USR, employing the LR algorithm. The AUC of that model in the validation set (0.710) was slightly lower than that observed in the present study. Li et al. (35) created a deep learning-based model for CLNM diagnosis in patients with PTC, and the AUC in their validation set (0.794) was higher than that observed in our study (0.722). This difference could be attributed to the superior capacity of deep learning algorithms to extract high-level features from datasets compared with traditional ML algorithms. Zhou et al. (36) developed a USR nomogram for preoperatively predicting CLNM in 609 patients; the AUCs in the training and validation sets were 0.816 and 0.858, respectively. These higher AUC values could be related to the larger sample size and the integration of USR features with clinical features. Furthermore, LR models have shown excellent and consistent performance in data processing and ML prediction of various diseases, such as the prediction of malignancy in pulmonary nodules (37) and the prediction of breast cancer invasiveness (38).

This study had several strengths. First, the strict enrollment criteria avoided interference from bilateral PTC lesions leading to bilateral CLNM. Second, to mitigate selection bias, this study employed a completely randomized grouping design and conducted inter- and intra-observer consistency assessments. Lastly, the USR model based on the LR algorithm is a simple diagnostic and prognostic tool that may help patients with PTC without CLNM avoid unnecessary surgery.

Nonetheless, this study had several limitations. This was a retrospective study; therefore, the results may be influenced by case-selection bias. Furthermore, the sample size was limited, and all cases were collected from a single hospital. Additionally, while we demonstrated the potential feasibility of applying ML models to US-based radiomic data to predict CLNM in patients with PTC, our study was further limited by the absence of external validation. Therefore, future prospective studies with larger samples and more centers are required to verify this model.

Conclusions

By exploiting ML technology, we established and validated a feasible and noninvasive model, based on USR features and an LR algorithm, to predict CLNM in patients with PTC. This model can assist clinicians in devising preoperative treatment strategies for patients with PTC and potentially prevent unnecessary surgical procedures.

Abbreviations

AdaBoost, adaptive boost; AUC, area under the curve; CLNM, central lymph node metastases; cN0, clinically negative patients; DT, decision tree; FNAB, fine needle aspiration biopsy; LASSO, least absolute shrinkage and selection operator; LN, central lymph node; LR, logistic regression; ML, machine learning; MLMs, machine learning models; pCCND, prophylactic dissection of the lymph nodes of the central compartment; PTC, papillary thyroid carcinoma; Rad_score, radiomic score; RF, random forest; ROC, receiver operating characteristic; ROI, region of interest; SVM, support vector machine; TC, thyroid cancer; US, ultrasound; USR, US radiomics.

Declarations

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki (revised in 2013). This retrospective study was approved by the Ethics Committee of Guangxi Medical University Cancer Hospital (approval number: LW2023145), which waived the requirement for informed consent.

Consent for publication

Not Applicable.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Conflict of Interest:

The authors declare that they have no competing interest.

Funding:

None

Author contributions:

(I) Conception and design: Peng Zhao

(II) Administrative support: Bangde Xiang

(III) Provision of study materials or patients: Lulu Liang

(IV) Collection and assembly of data: Yulin Bao

(V) Data analysis and interpretation: Xian Wei, Yongbiao Luo, Quankun Liang

(VI) Manuscript writing: All authors

(VII) Final approval of manuscript: All authors

Acknowledgments

The authors acknowledge the contribution of individuals who participated in this study.

References

Kim J, Gosnell JE, Roman SA. Geographic influences in the global rise of thyroid cancer. Nat Rev Endocrinol. 2020;16:17–29. 10.1038/s41574-019-0263-x.
Huang Y, Yin Y, Zhou W. Risk factors for central and lateral lymph node metastases in patients with papillary thyroid micro-carcinoma: Retrospective analysis on 484 cases. Front Endocrinol (Lausanne). 2021;12. 10.3389/fendo.2021.640565.
Feng JW, Ye J, Qi GF, Hong LZ, Wang F, Liu SY, Jiang Y. LASSO-based machine learning models for the prediction of central lymph node metastasis in clinically negative patients with papillary thyroid carcinoma. Front Endocrinol. 2022;13. 10.3389/fendo.2022.1030045.
Wang Z, Qu L, Chen Q, Zhou Y, Duan H, Li B, Weng Y, Su J, Yi W. Deep learning-based multifeature integration robustly predicts central lymph node metastasis in papillary thyroid cancer. BMC Cancer. 2023;23(128). https://doi.org/10.1186/s12885-023-10598-8.
Alabousi M, Alabousi A, Adham S, Pozdnyakov A, Ramadan S, Chaudhari H, Young JEM, Gupta M, Harish S. Diagnostic test accuracy of ultrasonography vs computed tomography for papillary thyroid cancer cervical lymph node metastasis: A systematic review and meta-analysis. JAMA Otolaryngol Head Neck Surg. 2022;148(2):108–18. 10.1001/jamaoto.2021.3387.
Agyekum EA, Ren YZ, Wang X, Cranston SS, Wang YG, Wang J, Akortia D, Xu FJ, Gomashie L, Zhang Q, Zhang D, Qian X. Evaluation of Cervical Lymph Node Metastasis in Papillary Thyroid Carcinoma Using Clinical-Ultrasound Radiomic Machine Learning-Based Model. Cancers (Basel). 2022;14(21):5266. 10.3390/cancers14215266.
Alsubaie KM, Alsubaie HM, Alzahrani FR, Alessa MA, Abdulmonem SK, Merdad MA, Al-Khatib T, Marzouki HZ, Algarni MA, Alherabi AZ. Prophylactic Central Neck Dissection for Clinically Node-Negative Papillary Thyroid Carcinoma. Laryngoscope. 2022;132(6):1320–8. 10.1002/lary.29912.
Viola D, Materazzi G, Valerio L, Molinaro E, Agate L, Faviana P, Seccia V, Sensi E, Romei C, Piaggi P, Torregrossa L, Sellari-Franceschini S, Basolo F, Vitti P, Elisei R, Miccoli P. Prophylactic central compartment lymph node dissection in papillary thyroid carcinoma: clinical implications derived from the first prospective randomized controlled single institution study. J Clin Endocrinol Metab. 2015;100(4):1316–24. 10.1210/jc.2014-3825. Epub 2015 Jan 15. PMID: 25590215.
Wang Y, Xiao Y, Pan Y, Yang S, Li K, Zhao W, Hu X. The effectiveness and safety of prophylactic central neck dissection in clinically node-negative papillary thyroid carcinoma patients: A meta-analysis. Front Endocrinol (Lausanne). 2023;13. 10.3389/fendo.2022.1094012.
Limkin EJ, Sun R, Dercle L, Zacharaki EI, Robert C, Reuzé S, Schernberg A, Paragios N, Deutsch E, Ferté C. Promises and challenges for the implementation of computational medical imaging (radiomics) in oncology. Ann Oncol. 2017;28(6):1191–206. 10.1093/annonc/mdx03.
Wang X, Agyekum EA, Ren Y, Zhang J, Zhang Q, Sun H, Zhang G, Xu F, Bo X, Lv W, Hu S, Qian X. A Radiomic Nomogram for the Ultrasound-Based Evaluation of Extrathyroidal Extension in Papillary Thyroid Carcinoma. Front Oncol. 2021;11. 10.3389/fonc.2021.625646.
Park YM, Lee BJ. Machine learning-based prediction model using clinicopathologic factors for papillary thyroid carcinoma recurrence. Sci Rep. 2021;11(4948). 10.1038/s41598-021-84504-2.
Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, Bussink J, Monshouwer R, Haibe-Kains B, Rietveld D, Hoebers F, Rietbergen MM, Leemans CR, Dekker A, Quackenbush J, Gillies RJ, Lambin P. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. 2014. 10.1038/ncomms5006.
Zhang B, He X, Ouyang F, Gu D, Dong Y, Zhang L, Mo X, Huang W, Tian J, Zhang S. Radiomic machine-learning classifiers for prognostic biomarkers of advanced nasopharyngeal carcinoma. Cancer Lett. 2017;403:21–7. 10.1016/j.canlet.2017.06.004.
Amin MB, Edge SB, Greene FL. AJCC Cancer Staging Manual (8th Edition). New York: Springer; 2017.
Mahmoudian M, Venäläinen MS, Klén R. Stable iterative variable selection. Bioinformatics. 2021;37:4810–7. 10.1093/bioinformatics/btab501.
Yin P, Mao N, Zhao C, Wu J, Sun C, Chen L, Hong N. Comparison of radiomics machine-learning classifiers and feature selection for differentiation of sacral chordoma and sacral giant cell tumour based on 3D computed tomography features. Eur Radiol. 2019;29(4):1841–7. 10.1007/s00330-018-5730-6.
Hu W, Wang H, Wei R, Wang L, Dai Z, Duan S, Ge Y, Wu PY, Song B. MRI-based radiomics analysis to predict preoperative lymph node metastasis in papillary thyroid carcinoma. Gland Surg. 2020;9(5):1214–26. 10.21037/gs-20-479.
Huang J, Li Z, Zhong Q, Fang J, Chen X, Zhang Y, Huang Z. Developing and validating a multivariable machine learning model for the preoperative prediction of lateral lymph node metastasis of papillary thyroid cancer. Gland Surg. 2023;12(1):101–9. 10.21037/gs-22-741.
Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, Nikiforov YE, Pacini F, Randolph GW, Sawka AM, Schlumberger M, Schuff KG, Sherman SI, Sosa JA, Steward DL, Tuttle RM, Wartofsky L. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid. 2016;26(1):1–133. 10.1089/thy.2015.0020.
Chinese Society of Endocrinology; Thyroid and Metabolism Surgery Group of the Chinese Society of Surgery; China Anti-Cancer Association, Chinese Association of Head and Neck Oncology, et al. Guidelines for the Diagnosis and Management of Thyroid Nodules and Differentiated Thyroid Cancer (Second Edition). Int J Endocrinol Metab. 2023;43(2):149–94. 10.3760/cma.j.cn311282-20221023-00589-1.
Zheng X, Peng C, Gao M, Zhi J, Hou X, Zhao J, Wei X, Chi J, Li D, Qian B. Risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: a study of 1,587 patients. Cancer Biol Med. 2019;16(1):121–30. 10.20892.
Guo L, Ma YQ, Yao Y, Wu M, Deng ZH, Zhu FW, Luo YK, Tang J. Role of ultrasonographic features and quantified BRAFV600E mutation in lymph node metastasis in Chinese patients with papillary thyroid carcinoma. Sci Rep. 2019;9(1). 10.1038/s41598-018-36171-z.
Jiang M, Li C, Tang S, Lv W, Yi A, Wang B, Yu S, Cui X, Dietrich CF. Nomogram Based on Shear-Wave Elastography Radiomics Can Improve Preoperative Cervical Lymph Node Staging for Papillary Thyroid Carcinoma. Thyroid. 2020;30(6):885–97. 10.1089/thy.2019.0780.
Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14(12):749–62. 10.1038/nrclinonc.2017.141.
Hu HT, Wang Z, Huang XW, Chen SL, Zheng X, Ruan SM, Xie XY, Lu MD, Yu J, Tian J, Liang P, Wang W, Kuang M. Ultrasound-based radiomics score: a potential biomarker for the prediction of microvascular invasion in hepatocellular carcinoma. Eur Radiol. 2019;29(6):2890–901. 10.1007/s00330-018-5797-0.
Moro F, Albanese M, Boldrini L, Chiappa V, Lenkowicz J, Bertolina F, Mascilini F, Moroni R, Gambacorta MA, Raspagliesi F, Scambia G, Testa AC, Fanfani F. Developing and validating ultrasound-based radiomics models for predicting high-risk endometrial cancer. Ultrasound Obstet Gynecol. 2022;60(2):256–68. 10.1002/uog.24805. PMID: 34714568.
Jiang M, Li CL, Luo XM, Chuan ZR, Lv WZ, Li X, Cui XW, Dietrich CF. Ultrasound-based deep learning radiomics in the assessment of pathological complete response to neoadjuvant chemotherapy in locally advanced breast cancer. Eur J Cancer. 2021;147:95–105. 10.1016/j.ejca.2021.01.028.
Spicer J, Sanborn AN. What does the mind learn? A comparison of human and machine learning representations. Curr Opin Neurobiol. 2019;97–102. 10.1016/j.conb.2019.02.004.
Cao Y, Zhong X, Diao W, Mu J, Cheng Y, Jia Z. Radiomics in Differentiated Thyroid Cancer and Nodules: Explorations, Application, and Limitations. Cancers (Basel). 2021;13(10):2436. 10.3390/cancers13102436.
Zhou Y, Zhou G, Zhang J, Xu C, Zhu F, Xu P. DCE-MRI based radiomics nomogram for preoperatively differentiating combined hepatocellular-cholangiocarcinoma from mass-forming intrahepatic cholangiocarcinoma. Eur Radiol. 2022;32(7):5004–15. 10.1007/s00330-022-08548-2.
Lai SW, Fan YL, Zhu YH, Zhang F, Guo Z, Wang B, Wan Z, Liu PL, Yu N, Qin HD. Machine learning-based dynamic prediction of lateral lymph node metastasis in patients with papillary thyroid cancer. Front Endocrinol (Lausanne). 2022;13:1019037. 10.3389/fendo.2022.1019037.
Breiman L. Random forests. Mach Learn. 2001;45(1):5–32.
Zhang B, He X, Ouyang F, Gu D, Dong Y, Zhang L, Mo X, Huang W, Tian J, Zhang S. Radiomic machine-learning classifiers for prognostic biomarkers of advanced nasopharyngeal carcinoma. Cancer Lett. 2017;403:21–7. 10.1016/j.canlet.2017.06.004.
Li YY, Sun WX, Liao XD, Zhang MB, Xie F, Chen DH, Zhang Y, Luo YK. A Thyroid Ultrasound Image-based Artificial Intelligence Model for Diagnosis of Central Compartment Lymph Node Metastasis in Papillary Thyroid Carcinoma. Zhongguo Yi Xue Ke Xue Yuan Xue Bao. 2021;43(6):911–6. 10.3881/j.
Zhou SC, Liu TT, Zhou J, Huang YX, Guo Y, Yu JH, Wang YY, Chang C. An Ultrasound Radiomics Nomogram for Preoperative Prediction of Central Neck Lymph Node Metastasis in Papillary Thyroid Carcinoma. Front Oncol. 2020;10(1591). 10.3389/fonc.2020.01591.
She Y, Zhao L, Dai C, Ren Y, Jiang G, Xie H, Zhu H, Sun X, Yang P, Chen Y, Shi S, Shi W, Yu B, Xie D, Chen C. Development and validation of a nomogram to estimate the pretest probability of cancer in Chinese patients with solid solitary pulmonary nodules: A multi-institutional study. J Surg Oncol. 2017;116(6):756–62. 10.1002/jso.24704. Epub 2017 Jun 1.
Fang SUN, Yongbo XU, Guanghe CUI, Xinyan LI, Jingyun DONG, Yuting JIAO, Liwei TANG. Predict the Luminal type of invasive breast cancer using machine learning models based on ultrasonographic features. J Practical Med. 2022;38(18):2279–83.
