Patient characteristics
From January 2020 to November 2021, a total of 81 patients with ESCC were screened for eligibility. Eventually, 79 eligible patients were enrolled (Fig. 1). The 79 patients received two cycles of neoadjuvant camrelizumab and chemotherapy. Twenty-one patients were excluded because twenty patients did not have PET/CT scans before surgery and one patient refused surgery. Finally, 58 patients were included in this analysis. Among the 58 patients, 19 patients (19/58, 32.8%) had pathological complete response (pCR), while the other 39 patients (39/58, 67.2%) did not. The patients had a median age of 59 years old, and most patients (46/59, 79.3%) were male. There was no significant difference in baseline characteristics and between pCR and non-pCR patients in terms of age, gender, smoking history, drinking history, clinical tumor stage, clinical lymph node stage and clinical stage (Table 1). The location of tumors was different between the pCR and non-pCR group (P=0.006). Among the pCR patients, 6 patients (6/19, 31.6%) had the tumor in upper esophagus, 8 patients (8/19, 42.1%) had the tumor in middle esophagus and the other 5 patients (5/19, 26.3%) had the tumor in lower esophagus. As for the non-pCR patients, 1 patient (1/39, 2.6%) had the tumor in upper esophagus, 20 patients (20/39, 51.3%) had the tumor in middle esophagus and the other 18 patients (18/39, 46.2%) had the tumor in lower esophagus. Of the 58 patients, 42 patients took 18F-FDG PET/CT examination both at baseline (scan-1) and before surgery (scan-2), while the other 16 patients underwent 18F-FDG PET/CT examination only before surgery (scan-2).
Tumor metabolic parameters of 18F-FDG PET/CT prove a significant difference between pCR and non-pCR groups
At scan-1, there was no significant difference after Bonferroni correction in SUVmax, SUVmean, SUVTBR, MTV and TLG between the pCR and non-pCR groups (Table 2). All metabolic parameters of scan-2 were significantly lower in patients with pCR than in those with non-pCR (P<0.001, P<0.001, P<0.001, P=0.001 and P<0.001 for SUVmax, SUVmean, SUVTBR, MTV and TLG, respectively).
Delta TLG% and MTV% were significantly lower in the pCR group than that in the non-pCR group after neoadjuvant immunochemotherapy (P=0.003 and P=0.002). Delta SUVmax%, SUVmean%, and SUVTBR% showed no significant differences between the two groups (Table 2).
Tumor metabolic parameters of 18F-FDG PET/CT predicts the therapeutic response of pCR
At scan-1, TLG (AUC, 0.716) and MTV (AUC, 0.721) discriminated pCR from patients with non-pCR. The following parameters at scan-2 corelated with pathological assessment discriminated pCR: SUVmax (AUC, 0.848), SUVmean (AUC, 0.853), TLG (AUC, 0.850), MTV (AUC,0.856), and above all SUVTBR (AUC, 0.860 [95%CI: 0.760, 0.959]; cut-off value, 2.1) providing 69.6% (16/23) PPV and 91.4% (32/35) NPV (Table 3, Supplemental Fig. 1), but the pairwise comparison ROC curve analysis among them didn’t show difference in Delong’s test (Supplemental table 1). At the percentage changes of delta TLG% (AUC, 0.762) and delta MTV% (AUC, 0.772) between scan-1 and scan-2 discriminated pCR from non-pCR patients (Table 3). According to the univariate analyses and correlation tests (Supplemental Fig. 2), metabolic parameters of PET scans were selected for inclusion in logistic regression models. The MTV of scan-1, SUVmax of scan-2 and ΔMTV% were included in binary logistic regression analysis to predict pCR. Independent variables in the regression models didn’t indicate significant collinearity (variance inflation factor <10 and tolerance >0.1) in collinearity diagnostic test (Supplemental table 2). The multivariate logistic regression achieves AUC of 0.888, which is significantly higher than MTV of scan-1 (P=0.034) but didn’t show significant difference with SUVmax of scan-2 or ΔMTV% (Table 3, Supplemental table 3, Supplemental Fig. 3). Fig. 2 (a-f) showed two representative cases patient with pCR and non-pCR.
PET/CT identified lymph node (LN) involvement
A total of 484 nodal stations from 58 participants were evaluated in our study (mean number of nodal stations sampled per patient: 8.3). Of these lymph nodes, 21 lymph node stations (21/484, 4.3%) in 12 patients (12/59, 20.7%) proved positive after surgery. At scan-1, SUVmax and SAD were significantly higher after Bonferroni correction in LN involvement (+) group than LN involvement (-) group (P=0.002 and P<0.001). At scan-2, SUVmax of lymph node were higher in the LN involvement (+) group than in the LN involvement (-) group but didn’t show statistical significance after Bonferroni correction (P=0.009), while there was no significant difference in SAD between LN involvement (+) and LN involvement (-) group. Delta SUVmax% and SAD% between scan-1 and scan-2 showed no significant difference between the two groups (Table 4).
From ROC analysis of scan-1, the optimal cut-off value for SUVmax (AUC, 0.733) and SAD (AUC, 0.769) distinguishing metastatic lymph nodes from benign ones was 4.8 and 6.5mm, respectively. However, ROC curve combined SUVmax and SAD didn’t show difference with univariate ROC curves (Table 5, Supplemental table 4, Supplemental Fig. 4).At scan-2, the diagnostic performance of SUVmax (AUC, 0.746; cut-off value 1.4) to distinguish metastatic from benign nodes was significantly better than of SAD (AUC, 0.605; cut-off value 9.5) in the Delong’s test (P=0.014), but combined ROC curves of them (AUC, 0.737) didn’t show better diagnostic performance than SUVmax (P=0.333) (Table 5, Supplemental table 5, Supplemental Fig. 4). Moreover, there was no significant differences between the combined ROC curves of scan-1 and scan-2 (P=0.918). By setting the cut-off value of SUVmax of scan-2 at 1.4, the sensitivity, specificity, accuracy, PPV and NPV was 81.0% (17/21), 61.1% (283/463), 62.0% (300/484), 8.6% (17/197) and 98.6% (283/287), respectively (Table 5). Fig. 2 (g-i) showed a representative false-positive lymph node in FDG PET/CT imaging.