3.1 DE-TEMDRGs in HCC
We obtained data of 370 tumor and 50 normal tissue samples from patients with HCC from TCGA. Using GeneCards, 413 TERGs and 2,415 MDRGs were obtained. Following the intersection of TERGs and MRGs, 135 TEMDRGs were obtained (Figure 1A, Supplementary Table S4). Of the 135 TEMDRGs, 32 TEMDRGs (29 upregulated and 3 downregulated) were differentially expressed in the TCGA-HCC cohort (Figure 1B, Supplementary Table S5). The GO term enrichment analysis revealed these TEMDRGs to mainly participate in the response to oxidative stress, negative regulation of phosphorylation, and regulation of mitochondrion organization. The KEGG pathway enrichment analysis suggested that the TEMDRGs were mainly involved in hepatitis B, the p53 signaling pathway, carbon metabolism, and the TNF signaling pathway (Figure 1C–1E). Next, we constructed a PPI network of the 32 TEMDRGs (Figure 1F).
3.2 Genetic variation of DE-TEMDRGs
We analyzed CNVs and SNVs of the 32 DE-TEMDRGs in the TCGA-HCC cohort. Among the 75 HCC samples, 56 (74.67%) exhibited genetic mutations (Figure 2A–2B). The most common variant classification, variant type, and SNV class were missense mutation, SNP, and C>T, respectively (Figure 2A). Of the 32 DE-TEMDRGs, PRKDC (29%) was the most frequently mutated gene, followed by MIK617 (15%) and BAX (7%) (Figure 2B). Figure 2C shows that the SNV mutation patterns tended to be dominated by C>T and T>C transitions. Notably, most DE-TEMDRGs exhibited CNVs in HCC (Figure 2D). Among the distribution of CNV classes, the highest heterozygous amplification rates were noted in PARP1, PRKDC, and BAK1; the highest heterozygous deletion rates were noted in TAT, CCNA2, and CXCL10; and the highest homozygous amplification rate was noted in PARP1, SOCS3, and PRKDC (Figure 2E–2F). Figure 2G shows the heatmap of CNV and mRNA expression in the TCGA cohort. BAG6 and PARP1 were the two most common genes whose mRNA expression was significantly correlated with CNV.
3.3 Identification of HCC Molecular Subtypes Based on PR-DE-MDRTEGs
The univariate analysis showed that 27 genes were associated with HCC prognosis, among which TAT and AR were protective factors, whereas the other genes were risk factors (Supplementary Table S6). The patients with HCC were divided into two subclusters using ConsensusClusterPlus (v1.54.0) (Figure 3A). A prominent clustering effect, as well as consistency and stability within subgroups, was observed (Supplementary Table S7, Supplementary Figure S1A–S1I). Cluster C1 showed high PR-DE-MDRTEG expression, indicative of a TEMD-high subtype. Conversely, cluster C2 showed low expression, indicative of a TEMD-low subtype (Figure 3B). Survival analyses demonstrated that patients with the TEMD-low subtype generally had a favorable prognosis, whereas those with the TEMD-high subtype had poorer prognoses (Figure 3C). Clinical parameters, such as stage and grade, were significantly different between the clusters (Supplementary Figure S2A–S2E).
GSEA was conducted to compare the two subgroups to further identify the activated signaling pathways. The gene sets significantly enriched in the TEMD-high groups included WP_T cell_antigen_receptor_TCR_signaling, WP_T cell_receptor_and_costimulatory_signaling, WP_oxidative_stress, WP_tgf-β_signaling_pathway, and KEGG_pathways_in_cancer (Supplementary Table S8, Figure 3D). In addition, distinct somatic mutation profiles were observed between these subtypes. Mutations in TP53 (40%), TTN (26%), and CTNNB (22%) were highly prevalent in the TEMD-high subtype, whereas CTNNB (26%), TTN (24%), and MUC16 (16%) mutations were most prominent in the TEMD-low subtype (Figure 3E).
3.4 Tumor Microenvironment Immune Landscape, and Chemotherapy Response in the TEMD subtypes
Evidence suggest that T-cell exhaustion and mitochondrial dysfunction affect antitumor immune responses. Thus, we determined the immune landscape within tumor microenvironments of the two TEMD-based subtypes. Patients with the TEMD-high subtype exhibited increased proportions of activated T regulatory cells (Tregs) and M0 macrophages, whereas patients with the TEMD-low subtype had an increased percentage of monocytes (Figure 4A). In addition, the TEMD-high subtype showed high levels of human leukocyte antigen (HLA), as well as upregulation of immune checkpoint expression, compared to the TEMD-low subtype (Figure 4B–C). The stemness score for the TEMD-high subtype was higher compared to the TEMD-low subtype (Figure 4D).
In light of the significant differences in TMEs between the subtypes, we assessed their responses to immunotherapy and chemotherapy. Compared to the TEMD-low subtype, TEMD-high tumors had a higher TIDE score and lower IPS score (Figure 4E–F), indicative of a less prominent immunotherapy response (31-33). Meanwhile, TEMD-high subtypes had a lower IC50 for chemotherapy agents (Figure 4G), indicating a better response to chemotherapy.
3.5 Construction of a TEMD-Related Prognosis Signature and Clinical Correlation Analysis
The TCGA-HCC cohort was randomly divided into training group and testing group at a ratio of 7:3. To avoid overfitting, LASSO analysis was used, with three PR-DE-TEMDRGs retained (Figure 5A–B). Subsequently, two risk genes, EZH2 and G6PD, were retained to construct the optimal LRPS (Figure 5C–D). Their expression was higher in TCGA-HCC (Figure 5E) and the ICGC-HCC (Supplementary Figure S3A) samples than in normal liver samples. Furthermore, the protein levels of these genes showed the same trend in the Human Protein Atlas database (Supplementary Figure S3B). The prognostic value of these two genes and their correlations with clinicopathological characteristics were further investigated. In both TCGA (Figure 5F–G) and ICGC (Supplementary Figure S3C–S3D) HCC cohorts, EZH2 and G6PD expression levels were positively correlated with stage and grade. According to the KM analyses, patients with high EZH2 and G6PD levels had a poor outcome in both TCGA (Figure 5H–5I) and ICGC (Supplementary Figure S3E–S3F).
3.6 Identification and Validation of the TEMD Prognostic Signature
A signature of the two TEMD genes was identified. Risk score was calculated according to the following formular: risk score = EZH2 × 0.33188 + G6PD × 0.21697. Patients of the TCGA training cohort were divided to low-risk group and high-risk group based on the median value of the risk score (1.808). A scatter plot was created to visualize the distribution of risk scores and their relationship with outcome in the training cohort (Figure 6A). The TEMD signature was further validated both internally (TCGA testing and entire TCGA-HCC cohort) and externally (ICGC-HCC cohort) (Figure 6A). The KM analysis revealed that compared to the low-risk group, the high-risk group exhibited a poor outcome in not only the TCGA training cohort (p < 0.001) but also the TCGA testing (p = 0.038), TCGA-HCC (p < 0.001), and ICGC-HCC (p < 0.001) cohorts (Figure 6B). The prognostic model showed an area under the ROC curve of 0.841, 0.738, and 0.747 at 1, 3, and 4 years, respectively, in the training cohort. Furthermore, it showed a robustness predictive value in the test cohort, entire TCGA cohort, and external ICGC cohort (Figure 6C), which indicated a good predictive value.
3.7 Correlation Between the TEMD Signature and Clinical characteristics
The correlation between the TEMD signature and clinical features was compared. In the TCGA-HCC cohort, high-risk scores were associated with advanced stage (Figure 7A), advanced grade (Figure 7B), and HBV infection (Figure 7C) but not HCV infection (Figure 7D). The stratified survival analysis revealed that advanced stages (Figure 7E) and a history of HBV (Figure 7G) or HCV (Figure 7H) were associated with poor overall survival (OS). However, the grade-specific survival analysis did not reveal any differences (Figure 7F). The high-risk score group was significantly associated with poorer outcomes across all subgroups (Supplementary Figure S4A–S4B, S4D), except in those without a history of HBV infection (Supplementary Figure S4C). In the ICGC-HCC cohort, the risk scores of HCC samples with early stage and grade were relatively less than those with advanced stage and grade (Figure 7I–7J) and also tended to have a better prognosis (Figure 7K–7L). However, no significant differences were observed among the different subtypes for prognosis between the high-risk group and low-risk group owing to the low number of cases (Supplementary Figure S4E–S4F).
3.8 Development of a Clinical Nomogram
Univariate and multivariate Cox regression analyses revealed that risk score, stage, and HCV status were independent prognostic factors in the TCGA-HCC cohort (Figure 8A–8B). In addition, risk score and stage remained independent prognostic factors in the ICGC-HCC cohort (Supplementary Figure S5A–S5B), which was consistent with the findings of the TCGA cohort.
A nomogram for OS prediction was then developed for TCGA (Figure 8C) and ICGC (Supplementary Figure S5C) cohorts. The calibration curves showed good concordance between nomogram-predicted and actual survival in TCGA (Figure 8D) and ICGC cohorts (Supplementary Figure S5D). The DCA curves revealed that the nomogram offered a greater net benefit in both cohorts (Figure 8E–8G and Supplementary Figure S5E–S5G). According to these results, the predictive nomogram for OS was better than clinical factors alone and could assist clinical management.
3.9 Analysis of T-Cell Exhaustion and Mitochondrial Dysfunction
To examine whether TEMD signatures correlate with T-cell exhaustion status in HCC samples, we compared the expression of marker genes in the low-risk group and high-risk group. STAT5B, CCR8, PDCD1, TGFB1, LAG3, CTLA4, HAVCR2, TIGIT, CXCL13, LAYN, CD274, IDO1, IDO2, PDCD1LG2, and VSIR are established marker genes of T-cell exhaustion (34-36). Figure 9A shows a higher expression of marker genes in the high-risk group, indicating greater T-cell exhaustion relative to that in the low-risk group.
The GSEA data showed that high-risk tumors exhibited significantly enriched pathways related to T-cell exhaustion (including regulation of T-cell activation (37), TGF-β signaling (38, 39), NOTCH signaling (40), and IFNG signaling (41) (Figure 9B–9E).
Both HCC groups were subjected to GSEA to assess mitochondrial dysfunction status. The analysis revealed various mitochondrial dysfunction-related pathways (including ROS metabolic process, respiratory system, ATPase activity, glycolysis, and glycerophospholipid metabolic process) as significantly enriched (Supplementary Figure S6A–S6E).
3.10 Analysis of Immune Cell Infiltration, Immune Checkpoint Expression, and Immunotherapy Response
To investigate the relationship of our TEMD prognostic signature with immunotherapy efficacy, we compared the immune infiltration levels between the high-risk group and low-risk group. High-risk scores were associated with less infiltrating activated CD4+ T cells, Tfh cells, plasma B cells, Tregs, resting myeloid dendritic cells, neutrophils, etc. (Figure 10A). In addition, the high-risk group presented upregulation of immune checkpoints and high HLA gene expression (Figure 10B–10C). Furthermore, the stemness score was higher for the high-risk group (Figure 10D).
The high-risk group presented a lower IPS score and higher TIDE score than the low-risk group (Figure 10E–10F). The TEMD signature-based risk score positively correlated with TIDE score (Figure 10H). These findings suggested a better response to immunotherapy in low-risk patients.
3.11 Chemotherapy Response Analysis and TMED-Related Small-Molecule Drug Screening
Based on GDSC data, responses to common chemotherapeutic agents were predicted for the two groups. Interestingly, the high-risk group had higher IC50 values for doxorubicin, gemcitabine, sorafenib, and 5-fluorouracil (Figure 10H), suggesting that these patients may not respond to chemotherapy.
The L1000fwd web-based tool was used to identify small-molecule targets for HCC treatment. According to the differentially expressed genes (DEGs) (Supplementary Table S9), we screened the three most common small-molecule drugs as drug candidates for patients with HCC (Supplementary Table S10). A 2D display of ephedrine (Figure 11A), digoxin (Figure 12B), and tacrolimus (Figure 11C) is available in the PubChem database, along with a 3D display of ephedrine (Figure 11D).