DOI: https://doi.org/10.21203/rs.3.rs-137615/v1
Background: Immunotherapy is a new strategy for Colorectal cancer (CRC) treatment. Tumor mutation burden (TMB) may act as an emerging biomarker for predicting responses to immunotherapy. Nevertheless, no studies investigate if these gene mutations correlate to TMB and tumor-infiltrating immune cells.
Methods: Somatic mutation data for CRC samples were obtained from The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC) datasets. Then, we investigated the relationship between these mutant genes, TMB and overall survival outcomes. GSEA analysis was performed to explore the underlying mechanism of mutant gene. Finally, we further verified the connection between gene mutations and immune response.
Results: We identified 17 common mutant genes shared by both two cohorts. Further analysis found that MUC4 mutation was strongly related to higher TMB and predicted a poorer prognosis. Moreover, functional enrichment analysis of samples with MUC4 mutation revealed that they were involved in regulating the natural killer cell mediated cytotoxicity signaling pathway. Significant changes in the proportion of the immune cells of CD8+ T cells, activated NK cells, M1 macrophages and resting memory CD4+ T cells were observed using the CIBERSORT algorithm.
Conclusions: Our research revealed that MUC4 mutation significantly correlated with increased TMB, a worse prognosis and modulating the immune microenvironment, which may be considered a biomarker to predict the outcome of the immune response in colorectal cancer.
Colon adenocarcinoma (COAD) is one of the most aggressive malignancies. It is also a heterogeneous disease at the molecular level resulting from complex genetic and epigenetic alterations[1]. The incidence of COAD tends to increase year by year in China[2], Though the development at full speed of treatments, such as targeted therapies, chemotherapy, surgery and radiotherapy, high recurrence rate and poor prognosis remain a concern[3].
Breakthroughs have been made in immunotherapy for COAD. It is considered to be a promising treatment with great potential in the future, particularly for high microsatellite instability (MSI-H) or deficient mismatch repair (MMR). Researchers have recently found that the tumor mutation burden (TMB) is an emerging characteristic of cancer for assessing the effectiveness of immunotherapy[4, 5]. Tumor cells with mutations might activate T cells, causing tumor cells to be attacked and cleared by immune system[6, 7]. Patients with high TMB were highly sensitive to immunotherapy according to several previous studies[4, 8]. At present, several clinical studies have confirmed that TMB had prognostic value for COAD immunotherapy[9–11]. Increased mutation load is associated with microsatellite instability[12]. Another study result suggests that TMB has a certain correlation with chemotherapy results in COAD patients. Therefore, TMB potential to act as sensitive biomarkers or predicting immunotherapy response. Nevertheless, the immune landscape between tumors with different TMB was unclear in COAD patients.
At present, we explored the somatic mutations of COAD patients in TCGA and ICGC data portal. The common mutant genes shared by these two cohorts were identified. Then, we investigated the relationship between these mutant genes, TMB and overall survival outcomes. Finally, we further analyzed the connection between gene mutations and tumor microenvironment. Overall, our study aimed to provides additional prognostic information for immunotherapy in colon adenocarcinoma.
The mutation data were downloaded from the TCGA (https://tcga- data.nci.nih.gov/tcga/) and ICGC dataset (https://dcc.icgc.org/). The TCGA dataset including 510 American COAD samples, 434 patients had complete clinical information. The ICGC portal included 321 Chinese COAD samples.
TMB was referred to the average number of genes mutations per megabase (Mb). TSV files, including data of somatic variants for Chinese COAD samples, were annotated and visualized by the GenVisR package. The project "TCGA-COAD" including data of somatic variants for American COAD samples, which was downloaded as a MAF file that was detected using VarScan and visualized with the GenVisR package. The TMB estimate for each sample is equal to the total number of mutation frequency/38. The 38 Mb was routinely used to estimate the exome size of a human.
COAD patients from the TCGA dataset were split into two groups based on the mutation status of MUC4. c2.cp.kegg.v7.2.symbols.gmt as the reference gene sets using GSEA software (4.0.1), The cut-off criterion was P<0.05.
CIBERSORT is a deconvolution tool that can analyze immune cell landscapes[13]. It was performed to quantify the fractions of immune cells at a threshold of P-value < 0.05. The result of immune cells distribution was visualized by "vioplot" package.
"Survminer" R package was used to draw the survival curve. Kaplan-Meier(KM) method and log-rank test were performed to compare the survival outcomes. Univariate and multivariate Cox regression was utilized to investigated if the MUC4 mutation was an independent prognostic parameter. Statistical analyses were performed with R software (version 4.0.0). P-value < 0.05 was considered significant.
The waterfall diagram revealed the 30 genes with high mutation rate from the TCGA cohort (Figure 1a). At the same time, we explored 30 frequently mutated genes from the ICGC dataset (Figure 1b). The results showed that some genes frequently mutated in both two cohorts. Thus, the Venn diagram visualizes the overlapping genes, including APC, TP53, TTN, KRAS, MUC16, MUC4, SYNE1, FLG, FAT4, OBSCN, FAT3, RYR2, PIK3CA, FBXW7, DNAH11, MUC5B and ZFHX4 (Figure 1c).
The result revealed that mutation in TTN, MUC16, MUC4, SYNE1, FLG, FAT4, OBSCN, FAT3, RYR2, PIK3CA, FBXW7, DNAH11, MUC5B and ZFHX4 presented relatively higher TMB in the COAD samples (Figure 2a). Among the mutated genes, only MUC4 mutation (P= 0.002) was related to an inferior prognosis(Figure 2b). The results of univariate Cox model suggested that MUC4 mutation was correlated to a worsening of the outcome (HR=2.084 p=0.003, 95% CI [1.286-3.376]) (Table 1). Afterward, the entire variables were analyzed via multivariate Cox regression. Further statistical tests revealed that the MUC4 mutation leads to an inferior prognosis (HR=2.093, p =0.003 95% CI [1.290–3.396]), suggesting that MUC4 mutation could act as an independent prognostic parameter in COAD patients.
Factors |
Univariate |
Multivariate |
||
HR |
p-value |
HR |
p-value |
|
age(year) (<65, ≥65) |
1.57(0.99-2.50) |
0.06 |
|
|
gender (male, female) |
1.37(0.88-2.12) |
0.16 |
|
|
stage (low, high) |
2.82(1.79-4.43) |
0.00 |
2.82(1.79-4.43) |
<0.001 |
MUC4 (wide, mutant) |
2.08(1.29-3.38) |
<0.001 |
2.09(1.29-3.40) |
<0.001 |
We utilized GSEA enrichment analysis to reveal the underlying mechanism of MUC4 mutation,the result showed that natural killer cell mediated cytotoxicity were mainly involved in patients with MUC4 mutation. Thus, we concluded that MUC4 mutation regulated pathways referred to the immune system (Figure 3).
We further explored the connection between MUC4 mutation and tumor microenvironment in COAD patients. we compared the landscape of the immune cell in the MUC4 mutation group and wild group by CIBERSORT algorithm (Figure 4a). The difference analytical results showed that CD8+ T cells, activated NK cells and M1 macrophages were comparatively higher involved in tumors with MUC4 mutation. As shown in Figure 4b, resting memory CD4+ T cells were mainly enriched in the wild group. The outcome of Correlation analysis shown that CD8+T cells had a negative association with resting memory CD4+ T cells (Figure 4c).
The tumorigenesis of colon adenocarcinoma is commonly known as a multi-stage process, which involves a series of genetic changes interacting with the tumor microenvironment (TME)[14, 15]. This study was conducted to analyze mutations in COAD samples from the TCGA dataset and ICGC dataset. We found a high mutation frequency of MUC4 in both cohorts. We were able to show that mutation in MUC4 was significantly correlated with increased TMB and indicated a worse prognosis. Moreover, samples with MUC4 mutations were enriched in the NK cell mediated cytotoxicity signaling pathway. The result of immune cells infiltration demonstrated that MUC4 mutated samples were more infiltrated in memory CD8 T cells, activated NK cells and M1 macrophages, but less infiltrated in resting memory CD4 T cells.
Some researches have revealed the relationship between tumor mutation burden (TMB) and immunotherapy response.MUC4 exhibited a relatively high mutation rate in COAD patients. Mucins are important glycoproteins, falling into two categories: the membrane-bound mucins and the secreted mucins[16, 17]. MUC4 is a useful biomarker of tumor progression[18, 19]. Its biological effects are mainly used in lung cancer, pancreatic cancer, ovarian and oesophageal malignancy [20–24]. MUC4 is expressed in epithelial cells of the digestive tract, as well as some goblet cells[25]. Abnormal MUC4 expression contributes to the progression of esophageal cancer[21] and colon cancer[26, 27]. Besides, an increased expression in MUC4 demonstrates a poor survival outcome, specifically in the early stages of colon cancer [28]. Due to the size of the MUC4 gene is relatively large, this may increase its mutation probability. Thus, MUC4 is the most common mutated gene in the state of stress exposure, for example, aging or nicotine treatment[29, 30]. MUC4 mutation was significant in Renal Carcinoma and was related to survival outcomes. The mutation of MUC4 could be tightly linked with smoking in kidney cancer [13] [31]. In present study, we revealed that COAD samples with a high rate of MUC4 mutation had a significantly worse survival outcome and were highly correlated with increased TMB. TMB has attracted attention as a biomarker for predicting effectiveness of immune checkpoint inhibitors, higher TMB may respond favorably to anti-PD-1/PD-L1 immunotherapy[32, 33]. However, recent studies [33, 34] confirmed that high TMB is a poor prognostic factor. The possible reasons are that high TMB tends to be associated with chromosomal instability that could depressed immune activities[35].
We used GSEA analysis to explore the potential biological function of MUC4 mutation in COAD. The result showed that the NK cell-mediated cytotoxicity pathway was the main driving mechanism for MUC4 mutation influencing the outcome for COAD patients. Together these functional analyses suggested that the MUC4 mutation had an important function in regulating the immune response in COAD.
Previous studies have indicated that the tumor immune microenvironment(TIM)has a significant meaning in predicting prognosis and evaluating therapeutic efficacy[8]. In our study, we found that that MUC4 mutation was significantly related to increased CD8+ T cells, NK cells and M1 macrophages. We hypothesized that a relatively higher proportion of CD8+ T cells in samples with MUC4 mutated was due to an increased number of tumor neoantigens, which strongly activated immune response. Interestingly, the correlation analysis result revealed that CD8+T cells had a significant negative relation with resting memory CD4+ T cells. We also discovered a higher level of activated NK cells infiltrated in samples with MUC4 mutated, indicating that MUC4 mutation could stimulate activation of NK cells,however, more data are needed for various kinds of tumors. Studies show that NK cells were major players in the early detection, control of tumor metastasis in colon cancer[36–38]. Previous studies reported that M2-tumor-associated macrophages (M2-TAMs) were common in malignant tumors, which creates a favorable microenvironment for tumor progression[39, 40]. Meanwhile, M2-TAMs were found to be linked with poor survival outcome, while M1-TAMs point to a contrary conclusion [41–43]. M1 macrophages could be transformed into M2 with tumor progression. Interestingly, our study found that MUC4 mutated samples were more infiltrated in M1 macrophages. This is probably caused by an early staging of the majority cases in the TCGA datasets. Therefore, further studies are required to verify this hypothesis.
In summary, the present study implicates that MUC4 had a relatively high mutation rate in colon adenocarcinoma. MUC4 mutation was closely related to increased TMB and a worse prognosis. Finally, the NK cell mediated cytotoxicity signaling pathway were mainly enriched in samples with MUC4 mutations, and we also verified the association between mutations and immune cell infiltration. These findings identify that MUC4 may be considered a potential biomarker to predict the outcome of the immune response in colon adenocarcinoma.
COAD: colon adenocarcinoma; GSEA: Gene set enrichment analysis; ICGC: International Cancer Genome Consortium; M2-TAMs: M2-tumor-associated macrophages; MSI-H: high microsatellite instability; MUC4: Mucins 4; TCGA: The Cancer Genome Atlas; TIM: tumor immune microenvironment; TMB: tumor mutation burden; MMR: mismatch repair
Not applicable.
Not applicable.
The datasets analysed during the current study are available in the TCGA repository (https://portal.gdc.cancer.gov/) and ICGC dataset (https://dcc.icgc.org/).The data is publicly available.
The authors declare that they have no competing interests.
Not applicable.
TL performed data analysis and wrote the manuscript. WJ.H collected the data. FG designed and revised the manuscript. The final manuscript was read and approved by all authors.
Not applicable.