Identification of prognostic ferroptosis and pyroptosis-related DEGs and Construction of a prognostic model
A total of 50 normal tissue samples and 374 tumor tissue samples of HCC were screened from the TCGA database. Table1 shows the clinicopathological data of the patients with HCC included in the study. A total of 60 differential genes were identified between tumor and the adjacent nontumor tissues. The DEGs expression in tumor and the adjacent nontumor tissues was represented in the heatmap(Fig.1a, blue: low expression level;red:high expression level). Also, The relationship between these genes is presented in Fig1b,which suggesting the tight relationship between these genes.
The prognostic values of DEGs were evaluated by univariate Cox analysis, and 19 genes that were significantly related to the OS of patients were identified. There are many ways to minimize overfitting, recently, Lasso penalized Cox regression has been employed frequently and it could minimize overfitting. To build a prognostic model for HCC, we used training dataset and applied the LASSO Cox regression analysis to identify stable markers from 19-survival-related genes mentioned above. To find an optimal λ, 10-fold cross validation with minimum criteria was employed, so the final model gave minimum cross validation error. Then, 4 DEGs(ABCC1, G6PD, GSDME, HMOX1)were identified(Fig.2a-2b). Using the following formula, the risk score was calculated 0.045673* expression level of ABCC1+ 0.005602* expression level of G6PD+ 0.143950* expression level of GSDME+ 0.005637* expression level of HMOX1. A median cut-off value was applied to stratify patients into a high-risk group (n=183) and a low-risk group (n =182)(Fig.3a). The PCA showed that patients in different risk groups were distributed in two directions by the constructed model cohort (Fig.3b). In the high risk score group, the risk of mortality obviously increased with the risk score increased (Fig.3c). Kaplan-Meier analysis showed that the probability of survival was significantly higher in low-risk patients than in high-risk patients(Fig.3d,p<0.05). Also, the AUC of the time-dependent ROC curve was plotted to assess the prognostic effect. the AUC of the prognostic risk assessment model was 0.826 at 1 years, 0.796 at 3 years, 0.867 at 5 years (Fig. 3e). The calibration plots showed that calibration appeared good between the predicted and observed risks(Fig. 3f).
Validation of the 4-gene prognostic model
In order to evaluate the robustness of the prognostic model constructed using data from the training cohort, its performance was also assessed using validation cohort. The risk score for each patient in validation cohort was calculated with the same formula as training group. Similar, patients in the high-risk group were significantly worse off the overall survival time compared to the low-risk group(Fig.4a). Besides, the AUC of the 4-gene signature was 0.860 at 1 years, 0.724 at 3 years, 0.769 at 5 years(Fig.4b).
Expression level of DEGs in HCC tissues
Quantitative real-time PCR (qRT-PCR) and western blotting was used to detect DEGs expression. The results showed that ABCC1, G6PD, GSDME expression in HCC was significantly higher than that in adjacent tissues. On the other hand, gene expression of HMOX1 showing significantly lower expression in tumors compared to adjacent tissues (Fig.5a). Correspondingly, the expression of ABCC1, G6PD, GSDME protein is upregulated in HCC tissue as compared to normal tissue in comparison to that in normal tissue with the help of Human Protein Atlas.
Independent prognostic value of the risk model
To investigate the prognostic value of different clinical features in HCC, univariate Cox regression analysis was performed. According to the results of univariate Cox regression analysis, the risk model showed their prognostic value in predicting OS(HR= 1.9757, 95% CI: 1.3507−2.8901, Fig.6a). The multivariable Cox regression analysis suggested risk score had prognostic value for HCC patients. Even after adjusting for confounding factors, the risk score is still a prognostic factor(HR= 1.8146, 95% CI: 1.2267−2.6841, Fig.6b) for patients with HCC. Fig.6c shows heatmaps of the pyroptosis-related DEGs modules on the clinical data and found a different distribution of patients' age and survival status between low- and high-risk subgroups.
DEGs-based tumor classification.
In addition, We explore the connections between the expression of DEGs and HCC subtypes, consensus clustering analysis was used to classify HCC patients into subtypes. By increasing the clustering variable (k) from 2 to 10, we found that the highest intra-group correlation and low inter-group correlation was observed when k=2 (Fig. 7a). The results showed that the HCC patients could be well divided into two clusters based on the DEGs.The gene expression profile were matched to the clinical data and was presented in a heatmap (Fig. 7b), We also compared the OS in 2 clusters.There were significant differences in 2 clusters (Fig. 7c).
Functional analyses and infiltrating immune cells
Meanwhile, GO function and KEGG pathway enrichment analysis were used to investigate the intrinsic pathway of DEGs. GO function analysis was performed using three ontologies: biological process (BP), cellular component (CC), and molecular function (MF). Based on GO analysis, the DEGs were most enriched in BP, and specifically in immune related processes(such as T cell activation, regulation of T cell activation, leukocyte cell−cell adhesion, regulation of mononuclear cell proliferation, regulation of leukocyte proliferation, regulation of leukocyte cell−cell adhesion, Fig.8a). In addition, KEGG pathway analyses indicated that these DEGs were highly enriched in Cytokine−cytokine receptor interaction, Neuroactive ligand−receptor interaction, Chemokine signaling pathway, Th1 and Th2 cell differentiation, Th17 cell differentiation. To further explore relationship between immune cell infiltration and risk scores in the prognostic risk mode, ssGSEA score was used to quantify the level of enrichment of immune cell functions or pathways in cancer samples(Fig.9a-b). One interesting finding is the many types of immune cells were significantly different between the two groups(such as macrophages, activated dendritic cell, Treg). Another intriguing fact was the immune cell functions were also significantly different between the two groups(such as APC_co_inhibition, CCR, Check−point, HLA, MHC_class_I, Type_II_IFN_Reponse). Complementarily, we found the expression of immune cheeckpoint have a dramatically different in different tumor grade (Fig.9c). After uploading the upregulated genes in the HCC samples and querying the Cmap database for connectivity scores, the top 3 negatively correlated compounds were filtered out: cicloheximide(-0.538), 4,5-dianilinophthalimide(-0.504), natamycin(-0.484).