The landscape of immune cell infiltration in HCC as assessed by scRNA-seq analysis
Based on scRNA-seq data from GSE140228[26], we obtained gene expression profiles of 59509 high-quality cells from sixteen patients who were pathologically diagnosed with liver cancer, including thirteen males and three females, and these cells were identified for subsequent analysis after strict quality control. Fourteen cell clusters were identified by dimensionality reduction analysis with UMAP. Subsequently, the cell identity of each cluster was annotated using a reference dataset from the Human Primary Cell Atlas, and six cell types were identified: natural killer (NK) cells, T cells, mononuclear macrophages, B cells, plasma cells and endothelial cells (Fig. 1A). Next, the proportions and relationships of different cell types in each cluster were identified, as displayed in Fig. 1B; T cells, NK cells and mononuclear macrophages were the three most predominant cell types in HCC. The FindAllMarkers function was used to detect DEGs in the different cell types. The top 5 genes with high or low expression ranked according to the log2FC value for each of the six types of cells are shown in Fig. 1C, and all the differentially expressed marker genes are listed in Additional file 5. Table. S1.
We further conducted GSVA using the hallmark gene sets as reference gene sets derived from MSigDB to determine the enrichment scores of the different immune cell clusters to provide insights into the different biological functions of the groups. Excitingly, we found that mononuclear macrophages showed enrichment of the following hallmarks: inflammatory response, complement, TNF-α signalling via NF-κB, oxidative phosphorylation, adipogenesis and mTORC1 signalling; thus, these cells had a promising antitumour immunophenotype (Fig. 2). Therefore, in this study, we focused on the identification of a mononuclear macrophage marker genes (MMG) signature and further investigated the relationships between different immune subtypes and molecular subtypes.
Identification of MMG expression profiles in HCC
To unravel the underlying biological characteristics of macrophage immunophenotypes, we identified a total of 17661 genes of 574 samples from the TCGA expression profile dataset, with 371 tumour samples, and the HCCDB18 dataset, with 203 tumour samples. The limma and SVA algorithms were applied to the data for batch correction, and two datasets were separated in the PCA plots because of heterogeneity, which was eliminated after normalization (Fig. 3A). Next, we selected the top 50 MMGs with high expression in macrophages identified by the abovementioned scRNA-seq analysis, among which 48 genes were expressed in the combined data. 32 of the 48 MMGs showed a significant effect on patient survival outcomes in the combined dataset (Additional file 1. Fig. S1). Ultimately, we obtained 13 MMGs in HCC according to cut-off criteria of |logFC|>1 and adjusted p value<0.05 in the univariate Cox regression analysis (Additional file 6. Table. S2) : S100A9, CD14, GRN, NPC2, CD68, IER3, MAFB, HLA-DRB5, CPVL, IFI30, PLAUR, S100A8, and FCER1G; furthermore, the comprehensive landscape of MMG interactions, regulatory connections, and prognostic value in patients with HCC is shown in Fig. 3B. We next analysed the expression distribution of these 13 MMGs, and as expected, they were mainly specifically expressed in the macrophage cell cluster (Fig. 3C). In addition, we combined multiple single-cell liver cancer datasets from TISCH to verify the above conclusion and confirmed that the 13 identified MMGs were generally highly expressed in macrophages (Additional file 2. Fig. S2).
Three MMG-associated clusters and their functional differences
To further explore the expression profiles of tumour-infiltrating MMGs in HCC, we applied an unsupervised consensus clustering algorithm to categorize patients with HCC based on the expression of the 13 MMGs. Our results showed that k=3 was the optimal selection value for sorting the entire cohort into 3 clusters: A (n=145), B (n=194) and C (n=227) (Fig. 4A). Furthermore, we explored the prognostic implications of macrophage gene clusters by integrating them with survival information. The analysis was performed by using Kaplan‒Meier plotter, and we found that patients in gene cluster B had a better prognosis, whereas patients in gene clusters A and C had unfavourable outcomes (log rank test, p=0.028; Fig. 4B). In addition, we found that the expression levels of MMGs were mostly lower in cluster B than in clusters A and C, except for that of CD14 (Fig. 4C). Considering the role of macrophage subtypes in HCC, we also compared the clinical features of the 3 clusters of macrophages. The relationships between HCC patient clinical factors and macrophage subtypes were delineated in a heatmap (Fig. 4D).
To identify differentially enriched pathways among the 3 macrophage clusters in the HCC cohort, we downloaded the Hallmark, KEGG and Reactome pathway sets from MSigDB and carried out GSVA to identify the enriched pathways of these 3 clusters. The enrichment heatmap illustrated that cluster B had significant enrichment of inflammatory and antitumour hallmarks (including PI3K-AKT-mTOR signalling, IFN-γ response, IFN-α response, IL-6-JAK-STAT3 signalling, IL-2-STAT5 signalling, Kras signalling, TNF-α signalling via NF-κB and inflammatory response). Cluster B had significant enrichment of KEGG pathways related to immune infiltration (including NOD-like receptor signalling pathway, Toll-like receptor signalling pathway, T-cell receptor signalling pathway, B-cell receptor signalling pathway, cytokine‒cytokine receptor interaction and chemokine signalling pathway). For Reactome pathways, cluster B had significant enrichment of inflammation and immune activity (including PD-1 signalling, DAP12 interactions, IL-10 signalling, and signalling by interleukins and inflammasomes) (Fig. 5).
Estimation of TME infiltration and functional differences in distinct macrophage clusters
Macrophages play a vital role in antitumour immunity. Though a close relationship between macrophage subclusters and immune activity was identified, macrophages and the immune response are processes involving multiple steps and molecules that act in a highly united and cooperative manner. We further explored the relationship of the macrophage subclusters with immune cell infiltration in HCC patients. PCA was used for dimensionality reduction (Fig. 6A), and by using the ESTIMATE algorithm, we found that cluster B had lower StromalScores, ImmuneScores and ESTIMATE scores than clusters A and C (p<0.001) (Fig. 6B). To investigate the role of MMGs in the TME of HCC, we assessed the correlations between the 3 macrophage clusters and 23 human immune cell subsets using single-sample gene set enrichment analysis (ssGSEA). Notably, we observed significant differences in the infiltration of most immune cells between the three clusters; for example, cluster A was found to have more immune cell infiltration, but cluster B had less immune infiltration (Fig. 6C). Next, we performed differential gene analysis for the three clusters, and DEGs among the three clusters were visualized in volcano plots. As shown in Fig. 6D, the number of DEGs between cluster B and cluster A was the largest, which was consistent with the above phenotypic results. On this basis, the gene sets were filtered using the criteria |logFC|> 1 and p<0.05, and 761 DEGs were acquired. The clusterProfiler R package was adopted for GO analysis. The 761 DEGs were mainly enriched in the biological process (BP) terms cell‒cell adhesion, leukocyte activation, and cell chemotaxis; the cellular component (CC) terms collagen-containing extracellular matrix, external side of plasma membrane and vesicle lumen; and the for molecular function (MF) terms glycosaminoglycan binding, extracellular matrix structural constituent and immune receptor activity (Fig. 6E). Further KEGG pathway analysis found that the DEGs were mainly enriched in complement and coagulation cascades, phagosomes, drug metabolism (cytochrome P450), and metabolism of xenobiotics by cytochrome p450, which may imply that the macrophage subclusters have effects on antitumour drug metabolism (Fig. 6F).
Hub gene subtypes based on macrophage clusters and construction of the Macrosig scoring system
Multiple studies have shown that the immune system is related to therapy outcomes, and protumour or antitumour activity can exist [27, 28]. To clarify the prognostic value of the identified macrophage clusters in HCC, we performed a univariable Cox regression analysis using the 761 DEGs, and 12 hub genes of the 761 DEGs with p<0.000001 were identified among the three macrophage clusters (G6PD, MYBL2, LPCAT1, PKIB, ATP1B3, PON1, AFM, SLC1A5, SPP1, SLC22A1, ADH1C, and SMOX), which also demonstrated noticeable prognostic power in the Cox regression analysis (Fig. 7A, Additional file 7. Table S3). In addition, we also conducted a comprehensive analysis of 12 genes using the GSCA datasets, and they were all differentially expressed in the HCC TCGA dataset; we also analysed deleterious mutations, mutation distribution, CNV percentage, and methylation differences (Additional file 3. Fig. S3).
Given the crucial regulatory function of macrophages in the evaluation of prognosis and TME immune landscapes, we further constructed a scoring system with PCA (PC1+PC2) based on the 12 hub DEGs. We termed it the Macrosig scoring system. We divided patients into low score and high score groups to quantify macrophage regulation and the immune microenvironment in individual HCC samples. We found that a higher Macrosig score indicated a longer survival time (p<0.001) (Fig. 7B). Moreover, according to the tumour-infiltrating immunocyte analysis, we determined the levels of 23 types of infiltrating immune cells in patients with different Macrosig scores; we found that the Macrosig score was negatively correlated with the majority of infiltrating immune cells (Fig. 7C). These results preliminarily suggest that the higher the Macrosig score is, the less immune cell infiltration and the better the prognosis of HCC patients.
Correlation between the Macrosig score, immunological characteristics and clinical features
In terms of clinicopathological features, the Macrosig score was also correlated with patient stage and survival status. As shown in Fig. 7D, significant differences in the Macrosig score were observed among different HCC stages; patients with a low Macrosig score (59% stage I+II and 41% stage III+IV) presented more severe stages of disease than those with a high Macrosig score (77% stage I+II and 23% stage III+IV) (p<0.001). More importantly, we also observed that the high Macrosig score group (80% alive and 20% dead) presented a prominent survival advantage over the low Macrosig score group (55% alive and 45% dead) (p<0.001, Fig. 7E), indicating that the Macrosig score has potential prognostic value for HCC patients. Furthermore, we investigated the correlation between the Macrosig score and immune checkpoints. In line with the above negative correlation of the Macrosig score with the levels of most infiltrating immune cells, the results showed that patients with a low Macrosig score had significantly higher expression of most immune checkpoints, including CTLA-4, LAG3, PDCD1 and TIGIT (Fig. 7F), than patients with a high score, which suggested that HCC patients in the low Macrosig score group were more likely to be responsive to immunotherapy.
Increasing evidence has demonstrated that somatic mutation patterns are associated with responsiveness to immunotherapy. We thus assessed the distribution of somatic variants in HCC driver genes between the low and high Macrosig score subgroups. The HCC driver genes were identified by using the maftools R package[29], and the top 10 driver genes with the highest alteration frequency in the high (altered in 157 of 228 samples) and low (altered in 95 of 124 samples) Macrosig score groups were further analysed. The mutation annotation files revealed that the alteration frequencies of TP53, NFASC, EPHA4, DMD, CPAMD8, NLRP2, ADAMTSL3, ADARB2, IQGAP3, LPIN2 and SLC6A11 were significantly higher in the low score group than in the high score group, but this pattern did not hold for DYNC2H1 (Fig. 8A-C). These results might provide novel ideas for investigating the mechanism of gene mutation in immune checkpoint blockade therapy.
The relationship of the Macrosig score with cytokines
Chemokines, interleukins and interferons are all cytokines; they play an important role in the composition of the TME, coordinate immune cell trafficking, exert multifaceted roles in the tumour immune response and tumour biology, and have a strong influence on patient prognosis and response to therapy; therefore, the cytokine network has emerged as a potential immunotherapy target[30-32]. TAMs generally include two subtypes: M1 macrophages play an antitumour immunity role, while M2 macrophages promote tumour growth, invasion and migration and inhibit tumour immunity by secreting anti-inflammatory cytokines. However, whether the Macrosig score has an effect on cytokine expression is unknown. As such, we next examined the expression of various chemokines in the high and low Macrosig score groups and observed that the expression of most immune activity-related cytokines was significantly different between the groups with different Macrosig score, with the high score group expressing decreased levels of cytokines (Fig. 9A). In the subsequent analysis, the correlations between the Macrosig score and 50 hallmark terms were calculated, and the results were visualized via ggplot2 (Fig. 9B). The Macrosig score was positively correlated with xenobiotic metabolism, bile acid metabolism, fatty acid metabolism, and so on, and negatively correlated with MYC targets V1, G2M checkpoint, E2F targets, unfolded protein response, unfolded protein response, DNA repair, and so on.
Macrosig score-related treatment strategy for HCC
Immune checkpoint blockade therapy, which is used to block T-cell inhibitory molecules in cancer treatment, has shown promising outcomes and the potential to improve disease conditions in advanced cancers, but it is not effective in all patients[33, 34]. The higher the Macrosig score was, the lower the immune infiltration, immune checkpoint expression, cytokine expression and frequency of gene mutation. Therefore, we hypothesized that the high score group would be insensitive to immunotherapy. To test this, we used the immunotherapy database to verify the efficacy of the Macrosig score in predicting immunotherapy response. By employing a real-world dataset derived from patients with metastatic urothelial cancer treated with immune checkpoint inhibition (GSE176307)[15], we discovered that the survival probability was significantly higher in the low Macrosig score group (p=0.046); moreover, the complete response (CR) or partial response (PR) rate was also higher than that in the high Macrosig score group (29% CR+PR vs. 12% CR+PR) (Fig. 10A, B). In addition, based on data from the phase 3 JAVELIN Renal 101 trial (n = 886; NCT02684006) that aimed to evaluate the efficacy of avelumab (anti-PDL1 antibody)[16], we further revealed that patients with low Macrosig scores exhibited longer survival times and lower disease progression rates than those with high Macrosig scores (p=0.0049) (Fig. 10C, D); all of these results demonstrated that the Macrosig score can play a crucial role in predicting the immune response.
With regard to targeted immunotherapy, we comprehensively evaluated the sensitivities of targeted drugs in different Macrosig score groups using pRRophetic, with a higher IC50 indicating less treatment sensitivity. The results showed that patients in the low score group had lower IC50 values for chemotherapeutics, including AZD.2281 (olaparib), A.443654 (an Akt inhibitor), ABT.263 (navitoclax), ABT.888 (a PARP1 inhibitor), AG.014699 (rucaparib) and ATRA; that is, patients in the low Macrosig score group were more likely to benefit from these drugs. The IC50 values of AZD6482 (a PI3Kβ inhibitor), AKT inhibitor VIII, AS601245 (a JNK inhibitor), AZ628 (a Raf kinase inhibitor), AZD.0530 (saracatinib), and AZD6244 (selumetinib) were significantly lower in the patients in the high Macrosig score group than in those in the low Macrosig score group (Fig.6E), suggesting that high Macrosig score patients are more likely to be sensitive to these treatments. These results indicate that the Macrosig score can be used to predict sensitivity to chemotherapeutic drugs and predict patient treatment benefit. In addition, we used the RegNetwork database to predict the upstream miRNAs and transcription factors of the above 12 genes (Additional file 4. Fig. S4) to lay a foundation for further mechanistic studies.