Ethical statement
This study was approved by the Ethics Committee of Peking University People's Hospital. Given the anonymized patient data and retrospective design, written informed consent was not required for this study.
Study Population
The clinical records of 7,931 patients diagnosed with pulmonary malignant tumors and admitted to the Department of Thoracic Surgery at Peking University People's Hospital between 1999 and 2018 (PKUPHTOI dataset) were systematically reviewed. This well-managed dataset has been used before[10-13]. Inclusion criteria were as follows: (1) Diagnosis of NSCLC; (2) Underwent surgical resection; (3) stages IIB and IIIA (the 9th edition of lung cancer TNM staging manual). Exclusion criteria comprised: (1) Age < 18 years; (2) Presence of N3 disease; (3) Presence of M1 disease; (4) R1/R2 resection; (5) Sublobar resection; (6) Non-systematic lymph node dissection; (7) Primary lung cancer; (8) Receipt of neoadjuvant therapy; (9) History of previous malignancies; (10) Unavailability of clinicopathological data. A total of 2,051 eligible patients were finally included, consisting of 224 individuals with stage IIB and 227 with stage IIIA disease. Figure 1 illustrates the patient selection process. Based on the lymph node metastasis status, the stage IIB patients were further divided into three categories: IIB N2a1, IIB N2a2, and IIB N0-1.
Data Collection
The clinicopathological dataset encompassed various parameters, including age, gender (male/female), smoking history (yes/no), family tumor history (yes/no), preoperative comorbidities (yes/no), body mass index (BMI), American Society of Anesthesiologist (ASA) classification (grades 1/2/3/4), surgical approach (thoracoscopy/thoracotomy), extent of surgical resection (lobectomy, bilobectomy, and pneumonectomy), and postoperative complications. Additionally, histopathological characteristics such as histology (adenocarcinoma, squamous cell carcinoma and other), along with factors like visceral pleural invasion (VPI) (yes/no), lymphovascular invasion (LVI) (yes/no), occurrence of postoperative complications (yes/no), and receipt of postoperative adjuvant therapy (yes/no). Patient outcomes, including mortality and time to recurrence, were also recorded. Complete data analysis was performed in this study.
Surgical procedure
To ensure the absolute accuracy of N staging, we exclusively included patients who underwent radical surgical resections and systemic mediastinal lymph node dissection. The standard protocol of surgery was similar to the one previously described by Xu et.al[14]. Systemic lymphadenectomy was defined as mediastinal lymph node dissection of at least 3 stations, including station 7 (the subcarinal lymph node), from station 4L, 5, 6, 7, 8 and 9 for the left-side NSCLCs and station 2R, 3A, 4R, 7, 8 and 9 for the right-side NSCLCs. In addition, at least 6 lymph nodes were harvested. As for N1 station lymph nodes, in routine, the station 10, 11 and 12 were dissected intraoperatively. The station 13 and 14 lymph nodes were dissected by resident doctors from the excised specimen, but this procedure was not mandatory.
Follow-up
The routine follow-up strategies for patients in our center have been previously reported[12]. Specifically,follow-up data were acquired through comprehensive review of medical records, direct patient consultations, and telephonic interviews. Our center adheres to a rigorous postoperative follow-up protocol, involving assessments every three months during the initial two years post-surgery, biannually for the subsequent three to five years, and annually thereafter. Each follow-up session includes thorough physical examinations, monitoring of serum tumor markers, and chest computed tomography scans. Additional diagnostic modalities such as brain magnetic resonance imaging and bone scans are conducted as warranted by clinical indications. The primary endpoints of this study were overall survival (OS) and disease-free survival (DFS). OS was defined as the period from the date of diagnosis to all-cause death or the date of last follow-up. DFS was defined as the period from the date of diagnosis to the date of disease recurrence, death or the last follow-up.
Statistical Analysis
Statistical analysis was conducted using IBM SPSS Statistics (version 27.0.1, IBM Corp, Armonk, NY, USA) and R version 4.3.1 (The R Foundation for Statistical Computing, Vienna, Austria; http://www.r-project.org). Categorical variables were presented as frequencies and percentages and compared using Pearson's chi-square test or Fisher's exact test. The Shapiro-Wilk test was utilized to assess whether data follows a normal distribution. Non-normally distributed continuous variables were described using medians and ranges and compared using the Mann-Whitney U test. Survival rates were assessed using the Kaplan-Meier method and the log-rank test. Bonferroni's adjustment was applied in comparisons involving multiple subgroups in the 1:1 analysis. To mitigate bias arising from disparate baseline characteristics, one-to-one propensity score matching (PSM) was performed utilizing the R package "MatchIt" (method=nearest, replace=FALSE). Univariate and multivariate Cox proportional hazards regression analysis (forced enter method) was employed to explore the prognostic factors, with hazard ratios (HR) and 95% confidence intervals (CI) serving as statistical indicators to ascertain independent prognostic factors. The proportional hazards assumption was checked using the Schoenfeld residuals. A two-tailed P value < 0.05 was considered statistically significant.