Establishment of IRGPI in HCC
The TCGA database was used to obtain RNA-seq transcriptomic data, and the expression levels of immune-related genes in tumor and normal tissues were analyzed. The results showed that out of 1811 immune-related genes, 555 genes were found to be differentially expressed in tumor and normal tissues (P< 0.05) (Figure 1A). These results suggested that immune-related genes play an indispensable role in the occurrence and development of HCC.
To further clarify the function of these genes, the underlined immune-related genes were selected as candidates for GO and KEGG enrichment analysis. GO analysis explains the roles of these genes using three aspects: cellular component (CC), molecular function (MF), and biological process (BP). The results of GO analysis suggest that the functions of these genes are mainly focused on the positive regulation of immunity such as cell proliferation, cytokine secretion and cell chemotaxis (Figure 1B). KEGG analysis was performed to investigate the pathways involved in the regulation of downstream genes by immune-related genes. The anti-tumor immune response is closely associated with mitogen-activated protein kinase (MAPK) and antigen processing and presentation, which are the most critical pathways (Figure 1C).
WGCNA was performed to identify the co-expressed gene modules among the 555 DEIRGs, and to investigate the association between genes and HCC. The results obtained from WGCNA show that the genes in the blue module are closely related to the occurrence of HCC (R=0.66, p-value=2e-43) (Figure 2A). Therefore, 58 genes from the blue module were selected for further study.
Univariate Cox proportional hazard regression analysis identified 13 genes that are closely associated with the prognosis of HCC (p-value < 0.05, Figure 2B). LASSO regression prevents overfitting through variable selection and regularization, which helps to improve the accuracy of the model. After minimizing overfitting by LASSO regression, 4 genes were selected as hub IRGs of the model: MAPT, CCL14, GHR and CD5L. Therefore, IRGPI was derived by multiplying hub IRGs with the univariate COX regression coefficient as follows: IRGPI score = [expression of MAPT × 0.401232] + [expression of CCL14 × -0.22304] + [expression of GHR × -0.19147] + [expression of CD5L × -0.09349].
IRGPI predicts survival for HCC patients
According to the median IRGPI score, 374 HCC patients from the TCGA database were classified into IRGPI-low and IRGPI-high subgroups. Finally, a survival-prognosis model was established, using Kaplan-Meier (K-M) method. The K-M survival analysis of the model showed that the survival period of patients in the IRGPI-low group was significantly longer than that of the IRGPI-high group, which implied a remarkable ability in differentiating good or poor clinical outcomes among the two subgroups (Figure 2C). Moreover, based on the 4 hub genes in the model, we divided patients into high and low-expression groups, followed by performing survival analysis. The results showed that patients with high expression of MAPT had a poor prognosis, while patients with high expression of the other three genes had a longer survival time. Consistent with the results of the above Univariate Cox regression analysis, the obtained results reveal that MAPT is a high-risk gene while the remaining genes are low-risk genes (Figure 2D-G).
We further used univariate and multivariate independent prognostic analyses to evaluate the predictive value of the prognostic model. All independent prognostic analysis results were consistent, which shows that the IRGPI can be independent of other clinical characteristics as an independent prognostic factor (Figure 2H, I).
IRGPI significantly related to the disease progression
The correlation analysis was performed via a chi-square test to explore the possible correlation between IRGPI and clinicopathologic factors. The results showed that patients in the IRGPI-high group have higher tumor pathological grades, which implied a higher degree of tumor malignancy (Figure 3A). Furthermore, patients in the IRGPI-high group have higher clinical stages and a higher tumor infiltration area (Figure 3B, C). This suggests that IRGPI-high patients had more aggressive tumors, faster tumorigenesis, and a poorer prognosis.
IRGPI is accompanied by a higher TMB and frequency of driver gene mutations
Herein, this study identified the 20 most often mutated genes as driver genes for HCC by studying the somatic structural variation of HCC. Moreover, the frequency of driver gene mutations in the IRGPI-high group was found to be significantly higher than that in the IRGPI-low group, with a higher TMB (Figure 3D, E). A previous study indicated that TP53 or LRP1B mutations are associated with higher TMB and worse survival in HCC [31]. The tumor microenvironment of patients with CTNNB1-mutant had significantly reduced activated immune cells and expression of immune-stimulating molecules. In contrast, the number of M2-type macrophages and active immunological depletion pathways increased significantly [32]. This shows that ICIs are less significant for the IRGPI-high group with greater CTNNB1 mutation frequencies. CTNNB1 mutation-associated aldolase A (ALDOA) phosphorylation promotes HCC cell proliferation, which can be mutually verified with the results of our survival analysis [33].
IRGPI predicts tumor infiltrating immune cells and immune function in the microenvironment
Immune infiltrating cells is one of the important factors affecting immunotherapy. It is necessary to study the abundance and function of immune infiltrating cells between the two groups [14]. CIBERSORT was used to evaluate the relative proportion of 29 types of immune cells and immune function influencing the procedure of anti-tumor immune response in HCC. It is a useful tool for analyzing immune infiltrating cells in the tumor microenvironment. The results showed that the number of tumor-infiltrating cells in the IRGPI-low group was significantly greater than that of the IRGPI-high. Among the 14 immune infiltrating cells, patients in the IRGPI-low group have a higher abundance of B cells, CD8+ T cells, mast cells, neutrophils, NK cells, and T helper cells, but the abundance of macrophages is lower. In terms of immune function, patients in the IRGPI-low group are more active than IRGPI-high group, such as cytolytic activity, inflammation-promoting, and response to interferon (IFN) (Figure 4A).
Herein, the survival of patients was analyzed based on immune cells. According to the obtained results, the effector cells in anti-tumor immune response such as B cells, CD8+ T cells, NK cells, T helper cells and TILs were associated with a good prognosis. Furthermore, macrophages that can secrete immune negative regulatory factors to promote tumor cell immune escape predicted poor survival. (Figure 4B-G) The underlined results are also supported by preliminary research [34-37]. Furthermore, the impact of immune function on the prognosis was analyzed which revealed that the pro-inflammatory factors that promote the activity of immune cells in the tumor microenvironment are related to a good prognosis. (Figure 4H-K)
The immunogenicity of more than 10,000 tumor samples of 33 cancers from TCGA has already been reported, calculating the correlation coefficients among 160 immune characteristics. Furthermore, cluster analysis was performed to obtain the immune expression characteristics of 5 core modules. According to these 5 immune expression characteristics, all non-hematological tumors in the TCGA database are clustered into 6 immune subtypes: wound healing, IFN-γ dominant, inflammatory, lymphocyte depleted, immunologically quiet, and TGF-β dominant (C1-C6). The survival analysis of 6 immune subtypes revealed that C3 had the best prognosis, followed by C1 and C2, and C4 and C6 had the worst prognosis [38]. Matching the IRGPI group with the TCGA immune subtypes, we found that half of the patients in the low-IRGPI group belonged to the C3 with the best prognosis. Three-quarters of patients in the high-IRGPI group are C1, C2, or C4 with relatively poor prognoses. (Figure 5A)
IRGPI predicts responses of ICIs
To verify whether the score can accurately predict the patient's response to ICIs, herein, the difference between traditional biomarkers and the TIDE score between the IRGPI-low group and the high were analyzed. The results showed that the TMB of the IRGPI-high group is greater than the IRGPI-low group. (Figure 5B) Regarding the expression of immune checkpoint molecules, the expression of programmed cell death-ligand 1 (PD-L1, CD274) was considerably higher in the IRGPI-low group than in the IRGPI-high group. However, the expression of cytotoxic T-lymphocyte-associated protein 4 (CTLA-4) was the opposite. (Figure 5C, D)
Recent studies have revealed that tumors have two different immune escape mechanisms. Even when a substantial number of cytotoxic T lymphocytes infiltrate the microenvironment of certain cancers, their functions are vague under the influence of immunosuppressive molecules [39]. In some tumors, immune-negative regulatory cells and factors can eliminate T cells infiltrating the tumor tissues and form a state of immune exclusion [40]. Therefore, the researchers have designed a new computing architecture: the TIDE score. It is also believed that TIDE score values can replace a single biomarker predicting the efficacy of ICI [30]. Herein, TIDE scores were performed on all patients which revealed that patients in the IRGPI-high group had higher immune exclusion levels. (Figure 5E) The finding indicated that fewer immune cells were infiltrating the tumor microenvironment, which was consistent with the outcomes of our analysis of immune cells. However, patients in the IRGPI-low group are more in an immune dysfunction state. (Figure 5F) Previous studies have shown that patients with immune exclusion are resistant to ICIs, and the treatment effect is not as good as that of immune dysfunction [41]. Furthermore, the TIDE score of the IRGPI-high group was significantly higher than the IRGPI-low group, which means that the immunotherapy effect of the high-risk group is poor. (Figure 5G)
To further validate the accuracy of the model’s ability to predict the effect of immunotherapy, we analyzed three immunotherapy datasets. The IMvigor210 cohort contains 298 patients with urothelial cancer treated with PD-L1 blockers. Each patient was scored and assigned to IRGPI-high and low groups according to the median value. Patients with complete response (CR) and partial response (PR) were defined as a response, and patients with stable disease (SD) and progressive disease (PD) were defined as a non-response. Notably, the objective response rate (ORR) of the PD-L1 blocker was higher in the IRGPI-low than in the IRGPI-high group in the IMvigor210 cohort (chi-square test, P=0.008), and the IRGPI of non-responders were significantly higher than responders (Wilcoxon, p-value < 0.001) (Figure 6A-C). A similar outcome was observed in the GSE78220 and GSE67501 cohort, which contains 39 patients with melanoma and non-small cell lung cancer (NSCLC) treated with PD-1 therapy (chi-square test, P=0.043) (Figure 6D-E). Taken together, the IRGPI can effectively predict the response to ICIs. Immunostaining on the HCC cohort further confirmed that CD8+ T cells were more abundant in IRGPI-low group, while the tumor microenvironment of IRGPI-high group presents a scene of immune desert (Figure 7A, B). Statistical analysis of the number of CD8+ T cells showed that the proportion of CD8+ T cells in the IRGPI-low group was significantly increased than that in the IRGPI-high group (Figure 7C). Survival analysis showed that the survival time of IRGPI-low group was significantly longer than that of IRGPI-high group (Figure 7D). In addition, we followed up the patients for three years. Survival analysis showed that the three-year survival rate of IRGPI-low group was significantly higher than that of IRGPI-high group. Together, these results indicate that IRGPI-low group has a unique immune ecosystem, with increased CD8+ T cells. This is one reason why patients in the IRGPI-low group have a better response to ICIs and a longer survival period.