In this retrospective study, we analyzed the risk factors of LNM in 955 NSCLC patients manifested as part-solid nodules on CT findings from multiple centers. Of all the lymph nodes detected (9933) in training group, only 217 (2.2%) were cancerous, indicating a low incidence of LNM in part-solid NSCLC. According to the distribution of the examined lymph nodes in each station, it was found in this study that LNM occurred at stations 2–4 (superior mediastinal) with the highest probability, followed by the lung parenchyma, and no LNM occurred in the supraclavicular region in station 1; for N-LNM, the lymph nodes were mainly located in the N1 region, followed by the superior mediastinal region, and few lymph nodes were detected in the supraclavicular region and aortic region.
Regarding CT semantic features, the solid component proportion was an independent risk factor of LNM. In the present study, the ratio of solid component of nodule to tumor in the LNM group was mainly higher than 75% (32/53); therefore, the solid component of nodule was an adverse factor of LNM, which also been confirmed in other studies [22, 23]. Moreover, the mean value of CT attenuation (CTmean) in the LNM group (50.6 ± 26.1) was significantly higher than that in the N-LNM group (19.3 ± 98.3), which we hypothesized that this was related to the solid component proportion of nodules; there was a positive correlation between solid component and CT attenuation value, and most nodules in the LNM group had a larger proportion of solid component. Perhaps this was one of reasons why the mean value of CT lost its significance in multivariate logistic regression. In terms of pleural contact type, a study of 478 peripheral solid NSCLC patients with a largest short-diameter > 5mm found that type II pleural involvement (a linear or cord-like pleural tag or tumor abut to the pleura with a broad base observed on both lung and mediastinal window images) was an independent predictor of occult lymph node metastasis (OLNM) [24]. Similar to them, in the present study, nodules were more likely to come into direct contact with the pleural than no contact, both in patients with LNM and N-LNM. However, for patients with LNM, pleural tags were more likely to cause indentation, and the probability of directing contact with the pleural concomitant pleural tags was also higher than that of N-LNM (27.6% vs. 8.9%). There are abundant lymphatic vessels on the surface of the subpleural lymphatic plexus [25]. We inferred that when the lesion is in contact with the pleura, the tumor cells first enter the lymphatic vessels, proliferate with lymph fluid in the lymphatic circulation, and enter the lymph node station to complete the invasion.
Emphysema is characterized pathologically by the presence of diffuse chronic inflammation of the lung parenchyma, oxidative stress, and lung destruction [26]. A previous study reported [27], emphysema was an independent risk factor for lung cancer, and the more severe the emphysema was, the higher the incidence was. The correlation between emphysema and LNM was also explored in the results of this study. When it comes to pulmonary disease, the results showed that patients with LNM were more likely to suffer from concomitant emphysema or bullae than patients with N-LNM. Some studies reported [28, 29], A longer mean diameter was an independent risk factor for LNM. DuComb et al. retrospectively studied 332 patients with T1 non-small cell lung cancer and showed that among patients with LNM, the most common pathological type was adenocarcinoma, which was consistent with the results of this study (56/58). In addition, they found that neither tumor diameter nor location was a risk factor for LNM [30]. In this study, location was also not shown to be a significant factor for metastases, but in contrast, mean diameter was an independent risk factor for the occurrence of LNM, possibly due to different inclusion and exclusion criteria: Our study included no restriction on tumor size, while they enrolled patients with T1 (8-30mm).
Previous studies have shown that model-based schemes can make better utilize radiographic information to predict lymph node diseases [31]. Das et al. integrated clinical parameters and radiomics features extracted from three ROIs of gross tumor volume (GTV), peritumoral volume (PTV), and LN in different ways to create different nomograms for predicting preoperative LNM in adenocarcinoma, and compared the predictive efficiency of each model [32]. The results showed that the AUC of radiological features based on GTV, PTV and LN in the external verification cohort were 0.74,0.72 and 0.64 respectively; the AUC of integrating GTV and PTV (GPTV) was 0.75 in the external validation cohort. GPTV combined with LN showed an AUC of 0.76, and the strongest predictive power was the integrated nomogram of clinical parameters along with CT radiomics information from GTV, PTV and LN, with an AUC of 0.79 (95%CI 0.66–0.93). Our study explored the predictive ability of three combined models to identify LNM. The results showed that compared with the combined model 1 including 9 risk factors and the combined model 2 based on all independent factors, the image predictors model established by pulmonary disease, solid component proportion, pleural contact type and mean diameter had the highest diagnostic accuracy in validation group, and its efficiency was better than all the individual models. The image predictors model only incorporated CT semantic features of the significant independent factors in the multivariate logistic regression results; this model embodied fewer fusion factors but higher predictive ability, which had profound implications: CT semantic features have a good ability to predict LNM in patients with NSCLC, and can provide preoperative guidance for clinical practice. Due to the poor image quality caused by respiratory movement and heartbeat, the prediction of LNM is limited in lung MR imaging [33]. PET is another imaging method for mediastinal staging, but its high cost, relatively low rate of lymph node involvement, and high false negative rate have hindered its routine implementation [34]. Martinez-Zayas et al. conducted a multicenter prospective validation of two retrospectively developed diagnostic models, called HAL and HOMER. In their study, they took 1799 NSCLC patients who underwent EBUS-TBNA staging and PET-CT as subjects. HOMER was used to predict N0 vs N1 vs N2/3 (three-classifier) and HAL was used to predict N2/3 vs N0/1 (two-classifier). Their results showed that HAL and HOMER had good multicentre discrimination: HAL had an AUC of 0.873, HOMER yielded AUC of 0.837 for predicting N1/2/3, and AUC of 0.876 for predicting N2/3 [35].
Several potential limitations of this study merit comment: 1) As its retrospective nature, potential selection bias would hinder the comparability and reproducibility of the results. 2) In the evaluation of CT image features, this study mainly focused on the characteristics of nodules. The image evaluation of lymph nodes can be considered to be added in later research for more in-depth exploration. 3) What we have to admit was that because this study analyzed image features based on subjective description and measurement, which restricted the consistency of models based on semantic features. With the development of radiomics that makes it possible to transform images into image quantitative feature data, it will contribute to describe tumor characteristics more objectively and quantitatively, so it is expected to have more studies to explore the prognosis prediction and survival analysis of LNM by radiomics.