Landscape of immune cell infiltration of TME in TNBC
Using ssGSEA algorithm, we calculated 29 immune-related gene sets of TNBC datasets from TCGA database and described the TIME status of TNBC patients. By unsupervised clustering, the samples were divided into two different immune-related subgroups, including Immunity_H group (52 samples) with a high expression level of immune-related gene sets and Immunity_L group (60 samples) with a low expression level of immune-related gene sets. We also calculated the immune scores (ESTIMATEScore, ImmuneScore, StromalScore) and tumor purity between two subgroups using the ESTIMATE algorithm. The involvement of the subgroups with the immune scores and tumor purity was explored and depicted in the comprehensive heatmap (Fig. 2a). It suggested that the immune scores of Immunity_H group were generally higher than those of Immunity_L group, and the tumor purity was lower than that of Immunity_L group (P < 0.05; Fig. 2b).
The Immune Status And Prognostic Difference Between Two Subgroups
In order to further clarify and verify the accuracy of the immune differences between Immunity_H and Immunity_L groups, firstly, we compared some immune-related gene expression levels between Immunity_H and Immunity_L groups. The expression levels of MHC gene family, CD4, CD8, CD274 (PD-L1) and CTLA4 in Immunity_H group were higher than Immunity_L group (P < 0.05; Fig. 3a-b). Meanwhile, the relative subpopulations of infiltrating immune cells were estimated with CIBERSORT approach and the distribution of the proportion of immune cells in TNBC were characterized by bar plot. By comparison, the proportion of immune cells differed between Immunity_H and Immunity_L groups. Patients from Immunity_L group exhibited a remarkably higher infiltration of Macrophages M0, Macrophages M2 and activated Mast cells. Immunity_H group was marked by a significant increase in the subpopulation of the antitumor lymphocyte cell subsets, such as CD8 T cells, CD4 memory activated T cells, gamma delta T cells and Macrophages M1 (P < 0.05; Fig. 3c). Based on above results, it could be regarded that Immunity_H group was characterized an elevated inflammation response than that of Immunity_ L group. Interestingly, Immunity_H group was characterized by a higher immune score, which indicated an immunologically “hot” phenotype. Previous reports indicated that the cell composition of TME had an important impact on the prognosis of patients, including the degree of immune cells and stromal cells infiltration. In addition, the median overall survival time of Immunity_H group was longer than Immunity_L group (P = 0.035, HR = 0.34[0.13–0.93]); Fig. 3d).
Screening Of Degs And Construction Of Risk Score Model
In order to further explore the reasons of the difference in immunogenicity between Immunity_H and Immunity_L group, we analyzed the mRNA-seq data of TNBC patients from TCGA database. And the volcano plot and heat map were drawn to distinguish the DEGs between the two groups (Fig. 3e-f). Combined with LASSO regression (Fig. 3g) to reduce the risk of overfitting, three genes with the most significant different expression level were obtained, namely CXCL13, CCL5 and GZMB. And the risk score model was constructed with the following formula: risk score= -0.0060×CXCL13 expression-0.0046×CCL5 expression-0.0606×GZMB expression. Then we calculated the risk score of each sample and combined with its clinical characteristics to analyze the relationship between these various factors and OS by Cox risk regression analysis. Based on the results of the univariate Cox analysis (Fig. 3h), it showed that tumor stage (P < 0.001, HR = 9.615[3.368–27.450]), tumor size (P = 0.010, HR = 4.147[1.397–12.313]), lymph node status (P < 0.001, HR = 2.873[1.821–4.533]) and risk score (P = 0.017, HR = 4.674[1.312–16.645]) had a statistically significant impact on patient survival. The results of multivariate Cox analysis (Fig. 3i) revealed that lymph node status (P < 0.001, HR = 3.457[1.701–7.026]) and risk score (P = 0.037, HR = 5.347[1.107–25.829]) were favorable prognostic factors for survival of TNBC patients. The results suggested that risk score could be an independent prognostic factor in both univariable and multivariable Cox regression analyses.
The Correlation Of Degs With Immune Infiltration And Prognosis
We used the Kaplan-Meier Plotter to evaluate prognostic value of DEGs in TNBC. The results showed that the expression of DEGs was related to OS of TNBC patients, and their expression level was positively correlated with better OS among all TNBC patients (P < 0.05; CXCL13, HR = 0.31[0.2–0.48]; CCL5, HR = 0.36[0.24–0.54]; GZMB, HR = 0.31[0.2–0.48]; Fig. 4a). The TIMER database was performed to investigate whether DEGs were related to immune infiltration in breast cancer. The results revealed that the expression level of DEGs was positively correlated with tumor immune infiltration in breast cancer. Except for macrophages, other immune cells had significant statistical differences (P < 0.05; Fig. 4b). These results suggested that DEGs may serve as an immune-related tumor marker in breast cancer which could predict the prognosis in TNBC.
Validation Of Risk Score Model Based On Degs In Tnbc
In order to further verify the accuracy of the prognostic prediction of risk score model in TNBC, we confirmed the reliability in three TNBC cohorts, including 112 cases from TCGA database as training set (Fig. 5a-c), 107 cases from GEO database (Fig. 5d-f) and 209 cases from METABRIC database as validation sets (Fig. 5g-i). We calculated the risk score of each sample using the risk score model formula (risk score=-0.0060×CXCL13 expression-0.0046×CCL5 expression- 0.0606×GZMB expression). According to the median value of risk score, the TNBC patients were divided into RiskScore_L group with low risk score and RiskScore_H group with high risk score. Kaplan-Meier survival curve was performed. The results showed that the prognosis of RiskScore_L group was significantly better than that of RiskScore_H group in all three databases (P < 0.05; TCGA, HR = 0.24[0.09–0.63]; GEO, HR = 0.28[0.13–0.59]; METABRIC, HR = 0.67[0.47–0.97]; Fig. 5a, d, g). The ranking was based on the risk score values of the DEGs from low to high, the risk score distribution (Fig. 5b), patient survival status (Fig. 5e) and risk gene expression (Fig. 5h) in sets were shown, respectively. The time-dependent ROC curve analysis of the risk score in three databases revealed the robustness of the prognostic capability of the risk score for OS (TCGA, 3 year survival, AUC = 0.703; 5 year survival, AUC = 0.785; GEO, 3 year survival, AUC = 0.757; 5 year survival, AUC = 0.720; METABRIC, 3 year survival, AUC = 0.608; 5 year survival, AUC = 0.64; Fig. 5c, f,i). The above results indicated that the risk score model based on three immune-related DEGs was validated to be a strong predictive signature on the survival of TNBC patients.
We collected the clinical samples from our hospital to identify our study. We detected the immune cells in tumor microenvironment of 32 TNBC pathological tissue specimens using mIF, which including marker PanCK, CD8, HLA-DR, CD68, CD56 (Fig. 6a). The expression levels of three DEGs (CXCL13, CCL5 and GZMB) were performed by IHC staining on TNBC tissue samples (Fig. 6b). The risk score and immune cells fractions of 32 TNBC tumor tissue specimens were calculated (Supplementary Table 1). And the samples were divided into two groups based on the median of risk score. We analyzed the correlation between risk score and the proportions of immune cells in TME. The results displayed that the fractions of immune cells were different in two groups, especially about NK cells and macrophages. Low risk score group was associated with higher proportions of NK cells and macrophages M1/M2 ratio (Supplementary Fig. 1).
Predictive Significance For Immunotherapy And Immune-related Pathways
We checked the correlation between the risk score and six key ICB related genes, including CD274 (PD-L1), PDCD1, CTLA4, HAVCR2, IDO1 and LAG3. We observed that risk score had a remarkably negative relationship to the expression of these ICB related genes (P < 0.05; Fig. 7a), which showed that risk score might act as a significant role in the prediction of responsiveness to ICB treatment in TNBC. Furthermore, we confirmed the IPS to predict efficacy of ICB with PD-1 and/or CTLA-4 inhibitors, including the IPS, IPS-PD1 blocker, IPS-CTLA4 blocker and IPS- PD1 + CTLA4 blocker. In our study, the predictive value of risk score for ICB treatment were checked. RiskScore_H group was marked by a significantly lower IPS than RiskScore_L group (P < 0.05; Fig. 7b) which suggested that RiskScore_L group might be more suitable for immunotherapy. These results suggested the risk score model might be a potential predictive signature for response to immunotherapy in TNBC. At the same time, we performed GSEA to explore the biological processes related to risk score in TNBC. The top twenty significant pathways with comprehensive details were recorded in Supplementary Table 2. KEGG significant pathways associated with low risk score were mainly involved in immune-related pathways, including antigen processing and presentation, B cell receptor signaling pathway, cytokine-cytokine receptor interaction, chemokine signaling pathway, intestinal immune network for IgA production, leukocyte transendothelial migration, natural killer cell mediated cytotoxicity, T cell receptor signaling pathway (Fig. 7c). GO biological processes significant pathways associated with low risk score were also mainly enriched in immune-related signal pathways, such as IL-1 signaling pathway, IL-6 signaling pathway, leukocyte proliferation, lymphocyte activation involved in immune response, mononuclear cell differentiation, regulation of immune response, T cell activation, T cell differentiation and so on (Fig. 7d). Through GSEA results of risk score, the immune-related pathways were observed obvious enrichment in low risk score group which suggested that risk score was closely linked to immune cell infiltration, inflammatory reactions and TME modification of TNBC.