Genome landscapes of ESCCs before and after NCRT
WES profiling data of 22 pairs of pre- and post-NCRT tumor samples, 7 additional pre-NCRT tumor samples, and corresponding 29 pre-NCRT white blood cell (WBC) samples were obtained from 29 patients with locally advanced ESCCs who received NCRT and surgery (Fig. 1A, Supplemental Table 1). There was no significant correlation between tumor regression grades (TRGs) and NCRT chemotherapy regimens as well as baseline clinical characteristics (all P > 0.05). The median sequencing depth was 250-fold (range: 228–363) for the 29 pre-NCRT tumors, 238.5-fold (range: 221–496) for the 22 post-NCRT tumors, and 143-fold (range: 116–214) for the corresponding 29 pre-NCRT WBC samples (Supplemental Table 2).
WES of 51 tumor specimens identified a total of 3,562 somatic nonsynonymous mutations involving 2,704 genes (Supplemental Table 3). The mutation rates of known ESCC driver genes in pre-NCRT samples were similar to those in the TCGA ESCC dataset and the largest whole genome sequencing dataset of ESCC published to date14, including TP53 (90%), NOTCH1 (21%), AJUBA (14%), and NFE2L2 (14%) (Supplemental Fig. 1A). Distinct alterations in driver mutations were observed between pre- and post-treatment samples. Although the majority of these mutations vanished following treatment, a subset either persisted during NCRT or emerged after NCRT (Fig. 1B). Notably, in post-NCRT samples with histopathological purity estimation at 0% (TRG0, n = 3), WES detected low-frequency residual mutations, which might be due to sampling bias between samples used for sequencing and pathological examination. Besides, it is possible that these post-NCRT TRG0 samples contained a few cells with genomic changes but not morphological changes that could be detected by traditional histopathology, which was reported in TRG0 patients after neoadjuvant treatment in non-small cell lung cancers15. Our observation is consistent with previous reports that TRG0 ESCC patients may nonetheless experience tumor recurrence after NCRT and surgery16.
Somatic nonsynonymous mutations were detected in 22 post-NCRT samples with a median tumor mutation burden (TMB) value of 14 (range: 3–93), significantly lower than that (median: 104.5, range: 29–224) in paired pre-NCRT samples (P < 0.001, Fig. 1C). Significant decrease in tumor purity estimated by ABSOLUTE17 was also observed after NCRT (P = 0.020, Fig. 1C). To figure out whether the decrease in TMB was due to decreased tumor purity or not, we conducted a simulation analysis18 and found that the observed TMB per post-NCRT sample was lower than expected one, indicating that magnitude of TMB decrease could not simply attributed to NCRT-induced tumor shrinkage (P = 0.006, Supplemental Fig. 1B).
As for the somatic copy number variation (SCNV) analysis, we identified an average of 1,851.0 MB amplification and 53.5 MB deletion in 22 pre-NCRT tumors, which reduced to 871.4 MB amplification and 27.9 MB deletion in paired post-NCRT tumors (P < 0.001 and P = 0.272, respectively, Supplemental Fig. 1C). The most recurrent SCNVs in 22 pre-NCRT samples were similar to the results identified previously in treatment-naive ESCC genome19, including the amplifications of 11q11.3 and 8q24.3, as well as the deletions of 9q21.3 and 21q21.1 (Fig. 1D). However, SCNV patterns significantly changed in post-NCRT samples, in which the most recurrent SCNVs were amplifications of 9q24.3, 8q21.1, 4q35.2 and 1q36.22 and deletion of 12q21.31 (Fig. 1D).
Genomic predictors of response to NCRT were intended to be identified in pre-NCRT samples. In 29 pre-treatment samples, NFE2L2 mutations tended to be enriched more in patients who achieved TRG2/3 than those TRG0/1 (P = 0.076, Supplemental Fig. 1D), suggesting a potential association between NFE2L2 mutations and resistance to NCRT.
Clonal evolution of ESCC under NCRT
The CCF of a somatic nonsynonymous mutation was estimated after copy-number and purity normalization20. Using the assumption that somatic nonsynonymous mutations with similar CCFs represented a tumor clone21, unsupervised clustering of the CCFs of somatic nonsynonymous mutations identified the average of 2.40 clones (range: 1–8) in 22 pre-NCRT samples, which were much higher than the average of 1.73 clones (range: 1–7) (P = 0.039) in paired post-NCRT samples. Genomic intratumoral heterogeneity (ITH) for each sample was evaluated by Shannon index22 based on inferred clone architecture, which was positively correlated with single cell RNA sequencing (scRNA)-ITH estimated by the tumor cell-specific transcriptomic diversity scores among 8 samples with both WES and scRNA profiles available in our previous work23 (rho = 0.795, P = 0.018, Supplemental Fig. 2A). With the median of genomic ITH as the cutoff point, 29 pre-NCRT samples were classified into two groups, and the patients with high pre-NCRT genomic ITH had worse survival than those with low pre-NCRT genomic ITH [P = 0.026 and P < 0.001 for overall survival (OS) and disease free survival (DFS), respectively, Fig. 2A]. However, patients evaluated as TRG0/1 or TRG2/3 showed no statistical difference in either prognosis (P = 0.470 for OS and P = 0.157 for DFS, repectively, Supplemental Fig. 2B) or pre-NCRT genomic ITH (P = 0.295, Supplemental Fig. 2C).
The frequency variation of tumor clone in pre- and post-NCRT samples provides an evolutionary perspective. We identified two clonal dynamics in 22 patients with paired tumors: 13 patients exhibited clonal extinction, whose clones in the pre-treatment samples all disappeared after NCRT (Fig. 2B left, Supplemental Fig. 2D), while 9 patients exhibited clonal persistence, who had at least one pre-NCRT clone persisted during NCRT (Fig. 2B right, Supplemental Fig. 2D). We utilized fishplot24 to visualize temporal clonal evolution. In a representative case (P1) from the clone-extinct group, the pre-NCRT clone (Clone 1) extinguished after NCRT. And the post-NCRT-specific clone (Clone 2) seemed to be newly developed during NCRT (Fig. 2C left), although it is possible that Clone 2 existed in the pre-NCRT sample at a very low frequency, which could not be detected due to the limited WES sequencing depth (median 250-fold). However, in the clone-persistent P14 case, pre-NCRT-existed Clone 2 and 4 remained present in the post-NCRT sample with shifted clonal frequencies (Fig. 2C right). Patients experiencing these two evolution modes showed distinct outcomes, with those in clone-persistent group having a worse prognosis (P = 0.014 for OS and P = 0.014 for DFS, Fig. 2D). However, there was no significant correlation between clonal evolution modes and pre-NCRT clinical characteristics, chemotherapy regimens, post-NCRT TRGs, or the time spans from the end of NCRT to esophagectomy (Supplemental Table 4). To validate the prognostic implications of the clonal dynamics, we performed clonal dynamic analyses in another cohort of 21 ESCC patients with WES data of paired pre- and post-treatment samples available.25 Among the 21 ESCCs, 12 patients were with clonal extinction while 9 with clonal persistence after treatment (Supplemental Fig. 3A). And the clone-persistent patients also had a worse prognosis than the clone-extinct patients (P = 0.001 for OS and P = 0.033 for local regional recurrence survival, Supplemental Fig. 3B). Consistent with the results in our data, the patients with high pre-NCRT genomic ITH had worse survival than those with low pre-NCRT genomic ITH (P = 0.001 and P = 0.016 for OS and local regional recurrence survival, respectively, Supplemental Fig. 3C).
Mutational landscape was further compared between clone-extinct and clone-persistent patients. No single mutation with statistical bias towards either clone-extinct or clone-persistent groups was found in pre-NCRT samples, however, mutations in TP53, NBPF20, and NOTCH1 were only identified in post-NCRT samples from clone-persistent patients (Supplemental Fig. 4A). Post-NCRT mutations in clone-extinct group were absent in paired pre-NCRT samples (Supplemental Fig. 4B), while most of post-NCRT mutations in clone-persistent group were shared by paired pre-NCRT sample (Supplemental Fig. 4C). To dissect the functional impact of these persistent mutations in clone-persistent patients, we conducted pathway enrichment analysis using REACTOME database26, and identified the enrichment of mutations in pathways involved in NOTCH signaling pathways, apoptosis and programmed cell death, and extracellular matrix interaction, suggesting that these pathways may have an important role in the NCRT-driven clonal persistence (Fig. 2E).
Besides, the pre-NCRT genomic ITH was significantly correlated with the different clonal dynamic modes (P < 0.001, Fig. 2F). Further, the receiver operating characteristic (ROC) curve showed a good performance of pre-NCRT genomic ITH status in discriminating clonal dynamic modes with an area under the curve (AUC) of 0.906 (95% confidence interval: 0.774–1.00, Fig. 2G).
Mutational signatures related to NCRT-driven clonal evolution
Different mutational processes often generate various combinations of mutation types, termed mutational signatures, which can provide an etiological perspective on clonal evolution7. To characterize the evolution of mutational signatures during NCRT, somatic mutations were decoded into six classes of base substitution. After NCRT, significant decreases in C > T, C > G and C > A mutations and significant increases in T > C and T > G mutations were observed (all P < 0.05, Fig. 3A). Incorporating information on the bases immediately 5′ and 3′ to each mutated base, 96 mutation classes were further identified, which showed some shifts in the relative contributions between pre- and post-NCRT samples (Supplemental Fig. 5A).
A total of 6 single base substitution (SBS) signatures were extracted using the data of 96 mutations classes (Fig. 3B), including SBS1, caused by age-related accumulation of 5-methylcytosine deamination events; SBS3, a predictor of homologous recombination deficiency (HRD); SBS5, whose aetiology is unknown; SBS6, associated with defective DNA mismatch repair; SBS13, attributed to Apolipoprotein B mRNA Editing Catalytic Polypeptide-like mutagenesis; and SBS16, previously associated with alcohol consumption in ESCC27. The comparison of SBS signatures between pre- and post-NCRT samples showed that SBS1 and SBS13 decreased (P = 0.003 for SBS1, P < 0.001 for SBS13) while SBS6 and SBS5 increased after NCRT (P = 0.046 for SBS6, P = 0.015 for SBS5, Fig. 3C). Subgroup analyses showed that the SBS1 and SBS13 decreased (P < 0.001 and P = 0.003, respectively) while SBS5 and SBS6 increased (P = 0.021 and P = 0.017, respectively) in the clone-extinct group after NCRT. In the clone-persistent group, SBS13 decreased significantly after NCRT (P = 0.036), however, SBS1, SBS5, and SBS6 did not change significantly after NCRT (all P > 0.05, Supplemental Fig. 5B). We further sought to determine whether there were any biased mutational signatures between clone-extinct and clone-persistent groups. SBS16 was enriched more in the clone-persistent group than in the clone-extinct group in pre-NCRT samples (P = 0.037, Fig. 3D), while SBS5 and SBS6 were enriched more in the clone-extinct group than in the clone-persistent group in post-NCRT samples (P = 0.044 for SBS5, P = 0.016 for SBS6, Fig. 3E).
In addition to SBS signatures, other genomic alteration signatures including doublet-base substitution (DBS), short insertion and deletion (INDEL) and CNV signatures were also analyzed (Fig. 3F). We mainly focused on pre-NCRT samples as few alterations could be detected after NCRT. In terms of DBS and INDEL signatures, DBS3 (Supplemental Fig. 5C), whose proposed aetiology is polymerase epsilon exonuclease domain mutations, and ID1, which is composed predominantly of insertions of thymine (Supplemental Fig. 5D) and is left on the replicated DNA strand by slippage error during DNA replication28, were found to be highly enriched in the pre-NCRT samples from clone-extinct patients (P = 0.043 for DBS3 and P = 0.044 for ID1; Fig. 3G). In addition, analyzing the CNV signatures in pre-NCRT samples also found that HRD-related signature CN17, which consisted of loss of heterozygosity (LOH) segments with total copy numbers between 2 and 4, as well as heterozygous segments with total copy numbers between 3 and 8 (Supplemental Fig. 5E)29, were highly enriched in clone-extinct patients instead of clone-persistent ones (P = 0.045, Fig. 3G). Besides, HRD scores, which were calculated based on CNV profiling data with scarHRD30, were higher in pre-NCRT samples from clone-extinct patients than their counterparts (P = 0.045, Fig. 3G). In summary, mutational signatures provide us with an etiological perspective on clonal dynamics, including potential associations between exogenous factors like alcohol metabolism (SBS16) and clonal persistence, as well as intrinsic factors such as DNA repair deficiencies (DBS3, ID1 and CN17) and clonal extinction.
High subclonal neoantigens and persistent human leucocyte antigen (HLA)-LOH in clone-persistent patients
To investigate how tumor neoantigen landscape evolved during NCRT, we conducted analysis of neoantigens. A total of 2494 neoantigens were computationally identified and tumor neoantigens burden (TNB) was positively correlated with TMB (rho = 0.967, P < 0.001, Fig. 4A) in all 29 pre- and 22 post-NCRT samples. Although total TNB was not significantly different between clone-extinct and clone-persistent pre-NCRT samples (P = 0.442, Fig. 4B), the proportion of subclonal neoantigens in pre-NCRT samples was higher in clone-persistent group than that in clone-extinct group (P = 0.038, Fig. 4C). After NCRT, significant decrease in TNB was observed in both clone-extinct and clone-persistent group (P = 0.002 and P = 0.042) and the magnitude of clonal TNB decrease was larger in clone-extinct group than that in clone-persistent group (P = 0.047, Fig. 4D). To further investigate whether the decrease of neoantigens were simply due to overall tumor regression or selective depletion of neoantigen-bearing cells, we estimated the theoretical TNB merely caused by overall tumor regression and found that the observed number of neoantigens was significantly lower than the expected number in the clone-extinct group, but not in the clone-persistent group (P = 0.016 and P = 0.377, Fig. 4E), suggesting the critical roles of immune elimination of neoantigen-bearing cancer cells in clone-extinct patients.
Alteration of HLA, the important protein for antigen presentation, was a common way for tumor escaping from immune clearance, we therefore investigated whether HLA dysfunction contributed to distinct neoantigen changes between two clonal dynamics. Loss of an HLA haplotype resulted in reduced antigen presentation and thus facilitate immune evasion31. Fifteen out of 51 ESCCs exhibited loss of 1–3 HLA haplotypes, resulting in the failure to present 0-100% of neoantigens (Fig. 4F). There were no significant differences in the frequency of HLA-LOH or in the number of neoantigens that could not be presented due to HLA-LOH in either pre- or post-NCRT samples between clone-extinct and clone-persistent groups (all P > 0.05, Supplemental Fig. 6A-B). However, persistently existed HLA-LOH and consequent failure in neoantigen presentation were only observed in two cases (P3 and P30) of the clone-persistent group after NCRT (Fig. 4F). In P3, peptides derived from somatic nonsynonymous mutations might not be recognized by immune system with loss of HLA-B*13:01, which was used for presentation of 100% neoantigens in both pre- and post-NCRT samples in P3. In P30, loss of HLA-A*11:01 resulted in 6.3% of neoantigens being unpresented and newly occurred loss of HLA-C*04:01 after NCRT resulted in the other 6.3% neoantigens failed to be present on the surface of tumor cells. These results implied that persistence of HLA dysfunction-related immune evasion was a critical driver of clonal persistence in ESCC (Fig. 4F).
Activated immune programs and increased immune infiltrations after NCRT were pronounced in clone-extinct patients
In addition to inducing genome evolution, NCRT may sculpt tumor and tumor microenvironment transcriptomes, which might be associated with NCRT response. Gene-set enrichment analysis conducted on pre-NCRT transcriptome data revealed that pathways related to TNFα signaling via NFKB, IL6-JAK-STAT3 and inflammatory response were significantly upregulated in the clone-extinct group, indicating the presence of a pre-existing inflamed environment (Supplemental Fig. 7A). Concordantly, immune-related pathways were significantly overexpressed in post-NCRT samples from clone-extinct group (Supplemental Fig. 7B). Further analysis of differentially expressed genes (DEGs) between paired pre- and post-NCRT samples identified 6654 DEGs in clone-extinct group, and 3661 DEGs in clone-persistent group (fold change ≥ 2 & adjust P value < 0.05, Supplemental Fig. 7C). Generally, pathways of tumor proliferation, including E2F targets, MYC targets were downregulated after NCRT, while pathways of angiogenesis, epithelial mesenchymal transition, myogenesis, and immune response, such as complement, inflammatory response, and allograft rejection were enriched in post-NCRT samples from both clone-extinct and clone-persistent groups (Fig. 5A). These results suggested that immune programs were activated after NCRT, which were more pronounced in the clone-extinct group.
We next investigated the adaptive changes of microenvironment composition in response to NCRT. Immune infiltration assessed by MCPcounter33 revealed that monocytic lineage cells were enriched in clone-extinct patients before NCRT (P = 0.027, Supplemental Fig. 7D), while CD8 + T cells were enriched in clone-extinct patients after NCRT (P = 0.015, Supplemental Fig. 7E). By comparing the differential infiltration between pre- and post-NCRT samples, we identified increased infiltrations of a series of immune cells, endothelial cells and fibroblasts after NCRT regardless of the clonal dynamic modes (all P < 0.05, Fig. 5B). Furthermore, the increase of CD8 + T cells in clone-extinct patients was significantly higher than that in clone-persistent ones after NCRT (P = 0.027, Fig. 5C). Quantification of immune infiltration by other algorithms supported the pronounced elevation of CD8 + T cell infiltration in clone-extinct patients following treatment as compared to the alterations observed in clone-persistent patients (Supplemental Fig. 7F). Immunohistochemistry staining of samples with RNA sequencing in our study also confirmed that the increase in proportion of CD8 + T cell after NCRT were higher in the clone-extinct group than in the clone-persistent group (P = 0.043, Fig. 5D). Our results suggested that NCRT might induced an adaptive immune response during NCRT and the pronounced elevation of CD8 + T cell infiltration correlated significantly to clonal extinction.
Integrated T cell clone and diversity metrics associated with clonal evolution
Tumor neoantigens need to be specifically recognized by T cell receptors (TCRs) to induce an immunogenic response. Hence, complementarity-determining region 3 (CDR3) sequences in TCRs that bind specifically to neoantigens were inferred using transcriptome data. Unique CDR3 represents T cell clonetype and the number of unique CDR3 negatively correlated to genomic ITH (rho = -0.352, P = 0.035, Fig. 6A). The numerical abundance of CDR3s in pre-NCRT samples did not differ between clone-extinct and clone-persistent group (P = 0.717, Fig. 6B). However, not all T-cell clonetypes recognize the binding neoantigens, and we questioned if there was a difference in the number of T-cell clonetypes that bind neoantigens between the two groups. Applying deep-learning model pMTnet34 to predict which T cell clonetypes are able to bind neoantigens, the absolute number of TCR-neoantigen interactions were significantly higher in pre-NCRT samples from clone-extinct patients than those from clone-persistent patients (P = 0.044, Fig. 6C). We further adopted neoantigen immunogenicity effectiveness score (NIES), which is based on the product of the variant allele frequency of the neoantigen’s corresponding somatic nonsynonymous mutation and the clonal fraction of the TCRs that bind the same neoantigen, to measure this antitumor effect of interactions between neoantigens and TCRs34. The higher the NIES score, the more expanded TCRs are specific against clonal neoantigens, which is generally favorable for clinical outcomes34. We found that the NIES score was higher in pre-NCRT samples from clone-extinct patients than those from clone-persistent patients (P = 0.046, Fig. 6D). Analysis of TCR-neoantigens was not performed in post-NCRT samples as few neoantigens could be detected after NCRT.
As a surrogate for the level of T cell infiltration in the microenvironment, the numerical abundance of CDR3s undergoes augmentation in both clone-extinct and clone-persistent groups after NCRT (P = 0.003 and P = 0.078, respectively), although statistical significance was not achieved in clone-persistent group (Fig. 6E). The ability to bind neoantigens and the clonality of binding neoantigens will affect the expansion of T cell clones. Therefore, different T cell clones are likely to expand unevenly during treatment. We introduced D50 index, an indicator of T cell repertoire evenness of which higher values indicated the expansion of general clonal populations and lower values indicated the expansion of specific clonal populations35, to evaluate the change of T cell clonal distribution evenness. After NCRT, the evenness of T cell clonetype distribution decreased significantly in clone-extinct patients (P = 0.042), but increased significantly in clone-persistent patients (P = 0.016, Fig. 6F). Furthermore, reduction of clonal neoantigens were positively associated with the decrease of D50 index during NCRT, an indicator of increased skewness of T cell clonetypes, in the clone-extinct group (rho = 0.718, P = 0.017, Fig. 6G) but not in the clone-persistent group. No other significant correlations were identified between the changes of TNB and the distribution of T cell clonetypes during NCRT. In conclusion, our investigation of T cell repertoire and clonal evolution suggested that there was a more effective T cell-mediated antitumor immunity in pre-NCRT samples from clone-extinct group, which correlated to a significant reduction of tumor clones bearing clonal neoantigens.