A Novel Immune and Inammatory Related Gene Signature in Hepatocellular Carcinoma

Background: The prognosis of hepatocellular carcinoma (HCC) is closely related to immunity and inammation, but the value of using immune and inammation-related genes as predicting the prognosis of HCC requires further research. Methods: The Hepatocellular Carcinomar mRNA data was downloaded in the TCGA and ICGC database. The R package "limma" was used to analyze the differential expression of genes (DEGs) irelated to immune and inammatory .Univariate Cox analysis screen for immune and inammation related genes with prognostic value, then construction and verication of the prognostic model in Hepatocellular Carcinomar. The correlation between risk score with tumor immune immersion and immune cell function was assessed through tumor microensure and immune response analysis. NCI-60 cell line to explore the relationship between prognostic gene expression and drug sensitivity. Results: We evaluated 8 immune and inammatory-related genes to build a prognostic risk prediction model, riskscore is an independent risk factor affecting prognosis, closely related to histological grading and clinical staging. The immune of adCs, macrophages, Tfh cells, Treg cells and Th1 cells higher in the tumor microenvironment leads to poor prognosis of liver cancer. Using data from the NCI-60 cell line, DNASE1L3 high expression may increased resistance of liver cancer cells to bovine platinum, solanil and bovine platinum. The expression of SLC7A11 can increase the sensitivity of liver cancer to arsenic trioxide (ATO). Simultaneously constructing models and tumor microenvironment and drug resistance may provide effective and safe strategies for HCC chemotherapy and immunotherapy. Conclusion:Our study screened eight immune and inammation-related genes play an important role in HCC tumor immunity and can be used to predict the prognosis of HCC.


Background
Primary liver cancer refers to malignant tumors originating from hepatocytes and intrahepatic bile duct epithelial cells, among which hepatocellular carcinoma(HCC) is the most common. The global incidence of HCC is 10.1 patients per 100,000 people per year, 80% of which occur in East Asia and Africa [1] .The causes of hepatocellular carcinoma include chronic infection with hepatitis B virus and hepatitis C virus, alcoholism, non-alcoholic fatty liver, and dietary toxins such as a atoxin [2] . In recent years, many scholars have focused on the research of liver cancer prevention and treatment strategies, including cancer risk assessment, surgical resection, liver transplantation, comprehensive treatment, etc. However, the 5-year survival rate of liver cancer patients still does not exceed 20%, and the early diagnosis rate is low and high recurrence risk is still an important issue that plagues treatment [3][4][5][6] .
In ammation is a basic pathological process. It is an immune defense mechanism produced by the body in the face of damage caused by various in ammatory factors. It is also an important part of many diseases, including malignant tumors [7] .More and more evidences have emphasized the importance of chronic in ammation in the pathobiology of liver cancer, by changing the tumor immune microenvironment to affect its progression, metastatic potential and treatment outcome.Studying the crosstalk between immunotherapy and in ammation has clinical signi cance in Central Asia for the development of new treatments and overcoming resistance [8] . In ammation associated with cancer has been determined to be a key process in controlling tumor progression. Typical Th1-related/in ammatory mediators, such as interferon gamma, can play an important role in enhancing acquired anti-tumor immune activities in the initial stage of tumor development, and in ammation usually becomes a chronic disease that promotes tumor growth and metastasis [9] .Many malignant tumors are rich in proin ammatory cytokines and in ammatory chemokines. These different pro-in ammatory components lead to the occurrence and metastasis of primary tumors, and ultimately lead to poor survival and prognosis of patients [10] . The high in ltration of ILC2 in human melanoma is associated with a good clinical prognosis. ILC2 can coordinate the recruitment and activation of eosinophils to enhance the antitumor response. The expression of tumor-in ltrating ILC2s limits the accumulation, proliferation and antitumor effect of PD-1 in tumors. This inhibition can block PD-1 and enhance the anti-tumor response by binding to IL-33-driven ILC2 activation [11] . Epithelial Mesenchymal Transition(EMT) plays an important role in tumor progression and metastasis. A variety of in ammatory mediators, such as tumor necrosis factor, interleukin 1β, interleukin 6, interleukin 11 and interleukin 8, are effective inducers of EMT [12] .
Cytokines, chemokines and growth factors play a leading role in the formation of the in ammatory environment in Tumor Microenvironment(TME), and promote tumor progression by directly acting on tumor cells or indirectly regulating other components of TME [8] .
Tumor immunotherapy is to control and kill tumor cells by modulating the function of the body's immune system and enhancing anti-tumor immunity.The traditional method of tumor treatment is to rely on external forces to kill tumor cells, and the target of action is tumor cells, while the current target of new tumor immunotherapy is immune cells, that is, by mobilizing the body's own immune system to inhibit or eliminate tumor cells.Tumor-associated macrophages (TAMs) are the most abundant immune cells in TME, and the increase of TAM leads to poor prognosis of HCC patients. According to different environmental stimuli, TAMs can be divided into two phenotypes: pro-in ammatory (M1) and antiin ammatory (M2). M1 TAMs act as synaptic antigen-presenting cells and play an anti-tumor effect, while M2 TAMs have higher anti-in ammatory cytokines, thereby suppressing the immune response [13] .The immune system's monitoring function plays an important role in early tumors, but immune escape and tumor microenomic disorders may be involved in tumor development [14] . Natural killer cells (NK) cells are innate immune cells that exert an anti-tumor effect by releasing cell-dissolved particles and cytokine secretion to produce direct cytotoxic effects [15][16][17] . NK cells play a key role in immune monitoring of normal livers, but NK cell failure and dysfunction (NK cell phethal abnormalities, differences in the identi cation of ligand receptors, dysfunction, and interference with other immune cells) may lead to the progression of HCC [18] .
Our research aims to study the role of Immune (humoral) and in ammatory response factors in the prognosis of liver cancer, and to nd important targets related to Immune and in ammation. At the same time, we analyze the correlation between in ammatory factors and the immune system, and further verify through experimental and clinical studies, so as to better Improve the prognosis of hepatocellular carcinomar. The mRNA expression pro le of hepatocellular Carcinomar patients and the corresponding clinical data were downloaded from public databases. Then, a multi-gene prognostic model was constructed in the TCGA cohort through the differentially expressed genes related to Immune (humoral) and in ammatory response, and this model was validated in the ICGC cohort. Finally, a functional enrichment analysis was further carried out to explore its potential biological mechanism.

Results
Screen the prognostic immune and in ammation-related DEGs in the TCGA cohort In the TCGA cohort, 50 immune and in ammatory response-related genes were differentially expressed in tumor tissues and non-tumor tissues ( Figure 1A). Univariate Cox analysis showed that 11 of them were related to OS ( Figure 1B). The heatmap indicates the expression of 11 DEGs( Figure 1C). The hazard ratio of KCNQ3 gene was 3.553 (95%CI=1.552-8.130, P=0.003, Figure 1D). The correlation between these genes is shown in Figure 1E.
Construction and verify the stability of prognostic-related models Lasso-Cox regression was used to analyze the expression pro les of the above 11 genes, and a prognostic model was established according to the best value of λ, a marker containing 8 genes is determined( Figure 1F and 1G). The calculation method of the risk score is as follows: riskScore= the express of CCNF*0.13310369848214 + the express of DNASE1L3*-0.0647265921366507 + the express of ENTPD2*0.030515515064479 + the express of MFAP2 *0.00849895046403311 + the express of SLC16A3*0.0466599278032965 +the express of SLC7A11*0.204041600878428 +the express of SPP1 *0.0496893487484051 + the express of STMN1*0.122778720315189.According to the median value, patients were divided into high-risk groups and low-risk groups (Figure 2A). In the TCGA cohort, the highrisk group was found to be signi cantly related to tumor grade and stage (Table 1). In addition, the scatter plot shows that high-risk patients have a higher mortality rate ( Figure 2B). The overall survival time of patients in the low-risk group was signi cantly better than that in the high-risk group( Figure 2C). The prognostic model was used to generate a time-correlated ROC curve for survival prediction analysis. The area under the curve (AUC) reached 0.779 at 1 year, 0.724 at 2 years, and 0.695 at 3 years ( Figure   2D).PCA analysis and t-SNE analysis showed that patients in the high-and low-risk groups were distributed in two different directions ( Figure 2E-F). Verify the stability of 8 prognostic-related gene models through the ICGC cohort, and the results are consistent with the TCGA cohort, indicating that the prognostic model has good stability( Figure 2G-L).In order to explore the relationship between the expression of 8 prognostic-related genes and OS, survival analysis was performed based on the expression value of each gene. ENTPD2 has nothing to do with overall survival ( Figure 3A-H). Except for the high expression of DNASE1L3, which indicates that overall survival is higher than the low expression group, the remaining 7 genes are in tumor tissues low expression is bene cial to the overall survival of patients ( Figure 3I-P).  E).ROC curve analysis showed that in the TCGA data set (AUC=0.782) and the ICGC data set (AUC=0.730) ( Figure 4C, F), the risk score has good predictive accuracy for the prognosis in Hepatocellular Carcinomar.
The relationship between the riskScore of the prognostic model and the clinical characteristics By analyzing the correlation between the riskscore and the clinical characteristics of HCC patients (Table   1), we found that the histological grade 3-4 (P<0.001), the TNM stage III-IV (P<0.001) , the risk score was signi cantly higher than the histological grade 1-2 or TNM stages I-II ( Figure 5C, D). In addition, the risk score of TNM stages III-IV in the ICGC data set is signi cantly higher than that of TNM stage I-II (there is no data on HCC classi cation in the ICGC data set) ( Figure 5G). Extract the results of multi-factor analysis from the TCGA cohort and ICGC cohort data to establish a nomogram, verify the scores of 8 independent prognostic factors, nd the corresponding probabilities on the nomogram, and estimate the patient's 1 year Probability of survival at 2 and 3 years.The nomogram performs well in predicting 1, 2, and 3-year survival probabilities( Figure 6).

Tumor microenvironment and immune status analysis
In order to further explore whether there are differences in the immune-related status between the high-risk and low-risk groups, different immune cells and related immune pathways were scored by ssGSEA analysis. In terms of the antigen presentation process, it was found that aDCs, Macrophages, HLA, MHCclass-I, Parain ammation were signi cantly different between the high-risk and low-risk groups, and the immune score of the high-risk group was higher than that of the low-risk group(Figure7A,B). Compared with the low-risk group, the immune scores of Tfh cells, Treg cells and Th1 cells in the high-risk group were higher, indicating that there are differences in T cell regulation between the high-risk group and the low-risk group. In addition, the high-risk group had higher CCR and checkpoint scores, while the type II IFN reactivity was just the opposite. In the low-risk group, the immune scores of B cells, mast cells, neutrophils and NK cells were higher, indicating that there are differences in immune regulation between the high-risk group and the low-risk group.In the high and low risk groups, the results of the ICGC cohort are similar to the results of the TCGA cohort( Figure 7C,D) In order to understand the correlation between immune components and risk scores, we conducted a correlation analysis between immune in ltration and risk scores. There are six types of immune in ltration in human tumors, corresponding to tumors promoted to tumor suppression: C1 (wound healing), C2 (INF-γ dominant), C3 (in ammation), C4 (lymphocytes) Depletion), C5 (immune quietness) and C6 (TGF-β dominant). No patient sample in HCC belongs to the C5 immune subtype, and there is only 1 sample from C6. Therefore, the C5 and C6 immune subtypes are not included in this study. In order to further analyze the correlation between the HCC risk score and immune in ltration in the TCGA data, it was found that the high risk score was signi cantly correlated with C1, while the low risk score was signi cantly correlated with C3 ( Figure 7E). Tumor stemness can be measured by DNA stemness of DNA methylation pattern (DNAss) and RNA stemness score (RNAss) score of gene expression. The matrix score and immune score are used to evaluate the tumor immune microenvironment. In order to explore whether there is a correlation between the risk score and tumor stem cells and the immune microenvironment, the results showed that the risk score RNAss was not signi cantly correlated, and it was positively correlated with DNAss (P<0.001), immune score (P=0.024) and negatively correlated with matrix score (P=0.012) (( Figure 7F-I).
PD1/PD-L1 and CTLA4 are classic immune checkpoints, which play an inhibitory role under physiological conditions to regulate the duration and intensity of immune response, and participate in assisting tumor immune escape in the tumor microenvironment. The expression levels of PD-L1 and CTLA4 in the highrisk group were signi cantly higher than those in the low-risk group ( Figure 9A, B), and the expression levels of these immune checkpoints were positively correlated with the risk score.The expression of other immune checkpoints, such as TLT8, CXCR2, GPC3, also increased in the high-risk group, and the expression level was positively correlated with the risk score. Among them, the GPC3-derived peptide vaccine showed good anti-tumor e cacy.

Functional enrichment analysis
The GO function and KEGG pathway enrichment analysis between the high-risk and low-risk groups were completed through R. GO function enrichment analysis showed that nuclear division, chromosome segregation, steroid hydroxylase activity. KEGG enrichment analysis main pathways involved were Cell cycle, Human T−cell leukemia virus 1 infection, HIF−1 signaling pathway, PPAR signaling pathway, IL−17 signaling pathway, PI3K−Akt signaling pathway, Central carbon metabolism in cancer( Figure 10A, B).The results of the ICGC cohort are similar to the results of the TCGA cohort( Figure 10C, D) Prognostic gene expression and sensitivity of cancer cells to chemotherapy The expression of eight prognostic-related genes in the NCI-60 cell line was studied, and the relationship between the expression levels of eight prognostic genes and drug sensitivity was analyzed. We selected the rst 16 signi cant differences in the p-value to analyze the correlation between gene and drug sensitivity ( Figure 11).DNASE1L3 ENTPD2 MFAP2 SLC7A11 SPP1 STMN1 related to increased drug resistance of cancer cells to Fulvestrant Ifosfamide Nelarabine Bisacodyl, active ingredient of Viraplex Isotretinoin Oxaliplatin Fluphe0zine Imiquimod Megestrol acetate Hydroxyurea Dexamethasone Decadron Epirubicin Carmustine.
Interestingly, the increase in SLC16A3 expression is associated with the increase in the sensitivity of cancer cells to Arsenic trioxide and Ixazomib citrate.

Discussion
The incidence of primary liver cancer ranks 6th among all tumors and 4th among cancer-related deaths.
In 2018, there were approximately 841,000 new cases of primary liver cancer worldwide, and 782,000 deaths related to liver cancer [23] . Hepatocellular carcinoma (HCC) accounts for about 75-85% of primary liver cancer. Most HCC patients have reached the advanced stage when they are diagnosed, lose surgical opportunities and have a poor prognosis. The 5-year overall survival rate (OSR) is 20% [24] . The main treatments for HCC include surgical resection, liver transplantation, transarterial chemoembolization (TACE) and radiofrequency ablation (RFA), supplemented by systemic chemotherapy and targeted drug therapy [25] . Because early liver cancer no obvious symptoms or atypical clinical manifestations, more than 80% of patients are already in the advanced stage at the time of diagnosis and cannot receive radical treatment [26] .Alpha-fetoprotein (AFP) is the most commonly used biomarker in HCC monitoring and diagnosis, but it has limited speci city and sensitivity in detecting early HCC [27] . The combination of AFP and serum GP73 is better than AFP alone in the early screening of HCC [28] .Serum AFP and AFP-L3 combined with ultrasonography can signi cantly improve the sensitivity of liver cancer diagnosis (94.3%) [29] .Using serum AKR1B10 alone to diagnose early HCC has a sensitivity of 61% and a speci city of 86%, which is better than AFP alone. The combination of the two AUROC is as high as 94%, which can be vigorously promoted as a marker for early liver cancer diagnosis [30] . However, there are no relevant reports on the immune and in ammation-related markers in serum. At present, the prediction of liver cancer models related to iron death, DNA methylation, m6A methylation, long non-coding RNA, endoplasmic reticulum stress and glycolysis has good predictive effects [31][32][33][34][35][36] .The prognostic model constructed by immune and in ammation-related genes screened in this study has an AUC of 77.9% (Figure2D), which also has a good predictive effect.
In this study, we analyzed the expression of 526 immune and in ammatory response-related genes in HCC tissues, and selected 50 DEGs from the TCGA cohort ( Figure 1A). In univariate Cox analysis, 11 DEGs were associated with OS ( Figure 1C, D).Prognostic models of 8 immune and in ammatory responserelated genes were constructed through LASSO regression analysis and veri ed in the ICGC cohort. According to the median risk score, patients were divided into high-risk groups and low-risk groups. Independent prognostic analysis showed that risk score is an independent predictor of OS (Figure4).
The prognosis model established in this study includes 8 immune and in ammatory response related genes (CCNF, DNASE1L3, ENTPD2, MFAP2, SLC16A3, SLC7A11, SPP1, STMN1). Except for DNASE3L3, these genes are all up-regulated in HCC tumor tissues and are associated with poor prognosis (Figure3).Cyclin F (CCNF) has been shown to regulate the cell dNTP pool and maintain the stability of the genome by interacting with ribonucleotide reductase family member 2 (RRM2) [37] . Patients with low CCNF expression may have a shorter survival time than those with high expression (P=0.001), and have a higher tendency to relapse (P=0.037) [38] . DNASE1L3 is very important for DNA catabolism and apoptosis. It combines with DNASE1 and plays a key role in the neutrophil extracellular trap and cfDNA degradation, thereby reducing organ damage after in ammation [39] . The up-regulated expression of DNASE1L3 alleviates the accumulation of cytoplasmic DNA under DDR activation conditions, which in turn leads to cell senescence and SASP inhibition, and tumor angiogenesis is impaired. These results indicate that DNASE1L3 is a potential biomarker for predicting the prognosis of HCC [40] , and the positive expression of DNASE1L3 can be used as a key indicator of a good prognosis after liver cancer [41] . E-NTPDases are transmembrane exonucleases on the surface of CD39 superfamily cells, which regulate in ammation and tissue repair by catalyzing the phosphohydrolysis of extracellular nucleotides and regulating purine signaling [42] . IL-6 transcription down-regulates the expression of ENTPD2 in portal vein broblasts without inducing myo broblast differentiation [43] . Micro bril-associated protein 2 (MFAP2), also known as micro bril-associated glycoprotein 1 (MAGP1), is a component of extracellular elastic micro brils and interacts with brin to affect the function of micro brils [44] . MFAP2 is signi cantly up-regulated in HCC, and may promote the development and progression of HCC through its interaction with mutant brillin-1. DNMTs inhibitors can down-regulate MFAP2, and MFAP2 may become a potential immunotherapy target for HCC patients [45] .SLC16A3 has been proven to be a downstream factor of TSTA3 immune responsemediated metabolic coupling cell cycle and non-neoplastic hepatitis/cirrhosis tissue replication [46] . The SLC7A11 gene is located on human chromosome 4 and contains 14 exons. It is widely expressed in tissues and cells such as brain, liver, macrophages and retinal pigment cells. Down-regulation of SLC7A11 in HCC cells may increase intracellular ROS levels. In order to inhibit cell proliferation in vitro, the overexpression of SLC7A11 may be related to the advanced pathological stage of liver cancer [47] . Secreted phosphoprotein 1 (SPP1) In tumors, alternative splicing variants are associated with many malignant characteristics in cancer, such as epithelial-mesenchymal plasticity, cancer cell stem cell resistance, chemotherapy resistance and radioresistance, in HCC Among them, SPP1 is an effective prognostic biomarker, which may induce chemotherapy resistance by regulating the autophagy of HCC cells [48] . In HCC, the high expression of STMN1 is positively correlated with higher AFP levels, tumor size, vascular invasion and intrahepatic metastasis, lower 5-year survival rate and early recurrence rate. The MET inhibitor crizotinib can effectively inhibit the crosstalk between cancer cells and stellate cells caused by the HGF/MET pathway triggered by STMN1, and can effectively slow down the tumor growth caused by the high expression of stmn1 [49] .
In order to understand the relationship between risk scores and immune cells and functions, we studied the role of risk scores in the types of immune in ltration. The results showed that the high-risk score was signi cantly related to C1, while the low-risk score was clearly related to C3 (in ammation) (Figure?), suggesting that C1 promotes the occurrence and development of tumors, and C3 is a good protective factor. It shows that immunity and in ammation play an important role in tumors. In terms of the association between risk score and clinical characteristics, we found that the overall OS of the high-risk group, Grade3-4 and StageIII-IV was signi cantly shortened (Figure7E).In KEGG enrichment analysis, the main signaling pathways, cell cycle, human T-cell leukemia virus 1 infection, HIF-1 signaling pathway, PPAR signaling pathway, IL-17 signaling pathway, PI3K-Akt signaling pathway, cancer metabolism are closely related to tumor occurrence and development the pathways are signi cantly enriched in high risk (Figure10).
In the high-risk group, the immune scores of aDCs, macrophages, Tfh cells, Treg cells and Th1 cells were higher (Figure7). There are a large number of in ammatory cells in ltrated in tumor tissues, among which macrophages have tumor-promoting and immunosuppressive effects [50] . The immunosuppression of Tfh cells, Treg cells and Th1 cells in the tumor microenvironment leads to poor prognosis of liver cancer [51] .Programmed cell death protein-1 (PD-1) is mainly expressed on the surface of activated T cells, B cells and macrophages. PD-1 inhibits the activation of antigen-speci c T cells by binding to its ligands PD-L1 and PD-L2. When PD-L1, which is highly expressed on the surface of tumor cells, binds to PD-1 on the surface of immune cells, it will cause tumor immune escape [52] .CTLA-4 inhibitors induce T cell anergy, thereby inhibiting the anti-tumor immune response [53] . TLR7/8 agonists promote NK-DC crosstalk and enhance the anti-tumor effect of NK cells in hepatocellular carcinoma [54] . sGPC3 attenuates the antitumor activity of CAR-T cells in vitro and in vivo, possibly through the combination of membrane-bound GPC3 and CAR-T cells, which leads to immune escape [55] .
Cancer stem cells (CSCs) have been shown to be the cause of HCC [56] recurrence, metastasis, and resistance to local and systemic treatments. Among them, the HIF-1 signaling pathway, Akt signaling pathway and IL-17 signaling pathway lead to an increase in the proportion of CSCs in liver cancer tissues, leading to immunosuppression [57] . Our research found that MFAP2 is negatively correlated with RNAss and DNAss, and STMN1 is positively correlated with RNAss and DNAss. It shows that MAPF2 may inhibit the differentiation of cancer stem cells, and STMN1 promotes the differentiation of cancer stem cells, but its mechanism of promoting tumor proliferation and invasion is worthy of further study. (Figure8), but they promote tumor proliferation and invasion. The speci c mechanism is worthy of further study.
Prognostic gene expression also has a certain correlation with stromal score and immune score. CCNF, DNASE1L3, ENTPD2, MFAP2, SLC16A3, SPP1, STMN1 have a strong correlation with Stromal score, indicating that they may be secreted by stromal cells or involved in stromal correlation. DNASE1L3, MFAP2, SLC16A3, and SPP1 are closely related to the immune score, indicating that they are closely related to tumor immunity.This study found that prognostic gene expression also has a certain correlation with Stromal score, immune score and ESTIMATE score. DNASE1L3, MFAP2, SLC16A3 and SPP1 are positively correlated with interstitial score and immune score, indicating that they may be related to interstitial cells and immune cells [58][59][60] .
Using data from the NCI-60 cell line, we found that the increased expression of some prognostic genes is associated with the increased resistance of some FDA-approved chemotherapeutic drugs, such as oxaliplatin, cyclinamide, epirubicin, nelarabine and uoride. Of course, various prognostic genes are also related to the increase in drug sensitivity of some drugs.
For example, the increased expression of DNASE1L3 and the resistance of liver cancer cells to oxaliplatin, sorafenib combined with oxaliplatin, uorouracil and calcium folinate in the treatment of portal vein invasion of hepatocellular carcinoma and clinical trials [61] .The expression of SLC7A11 can increase the sensitivity of liver cancer to arsenic trioxide (ATO). Studies have reported that ATO can effectively induce the differentiation of CSCs by down-regulating CSC-related genes, inhibiting the tumorigenic ability of CSCs. The combination therapy of ATO and 5-FU/cisplatin can signi cantly improve the therapeutic effect of liver cancer cells. ATO and 5-FU/cisplatin synergistically inhibit LIF/JAK1/STAT3 and NF-kB signaling pathways are the potential molecular mechanisms of their differentiation [62] .

Conclusion
Our study screened eight immune and in ammatory response-related genes and constructed a new and stable risk model. The TCGA and ICGC cohorts veri ed that the model is closely related to the OS of HCC patients and can detect the risk level of HCC. The eight genes of the risk model may play a role in the development and progression of HCC, and the underlying mechanisms of immunity and in ammation are worthy of further study. Simultaneously constructing models and tumor microenvironment and drug resistance may provide effective and safe strategies for HCC chemotherapy and immunotherapy.

Hepatocellular Carcinomar related data and Immune and in ammation-related Genes
The Hepatocellular Carcinomar mRNA data for 370 patients and the corresponding clinical data was downloaded in The TCGA (https://portal.gdccancer.gov/repository) database in a standardized FPKM format as the training group data. Then downloaded the data from the International Cancer Genome Consortium (ICGC) Database (https://dcc.icgc.org/) of the International Union for Hepatocellular Carcinomar,including mRNA-seq data from 231 patients and clinical information.All data of TCGA and ICGC were public. We follow the data access policies and release guidelines of TCGA and ICGC. Our study obtained 546 immune(humoral) and in ammation related genes(Systematic name: M8838) were found in the molecular signature database(http://www.gsea-msigdb.org/gsea/).

Construction and veri cation of the prognostic model in Hepatocellular Carcinomar
In the TCGA cohort, the R package "limma" was used to analyze the differential expression of genes (DEGs) in tumor tissues and paracancerous tissues to screen out differential genes related to immune and in ammation. The screening threshold is |log 2 FC|>2, and FDR <0. 05. At the same time, a Univariate Cox analysis of overall survival (OS) was performed to screen for immune and in ammation related genes with prognostic value. Use the R package "venn" to intersect the iron death-related genes with differences and prognostic value. In order to achieve the purpose of minimizing the risk of over tting, the author uses Lasso Cox regression analysis and combines the R package "glmnet" to construct a prognostic model [19,20] .The independent variable in the regression analysis is the gene expression at the intersection of difference and prognosis, and the dependent variable is the overall survival rate and status of breast cancer patients in the TCGA cohort, and the gene of the optimal model and the corresponding regression coe cient are obtained.Calculate the risk score of each patient according to the standardized expression level of each gene and its corresponding regression coe cient [risk score = e (gene 1 expression × corresponding regression coe cient +… + gene n expression × corresponding regression coe cient] According to the obtained risk score to nd the median, the patients are divided into high-risk group and low-risk group. In the survival analysis, the R package "survminer" is used to construct the survival curve according to the high-risk and low-risk groups. The R package "timeROC" is used to perform ROC curve analysis to evaluate the predictive ability of its characteristic gene prognostic model. Principal component analysis (PCA) and t-distributed random neighbor embedding (t-SNE) analysis are used, and the two-dimensional visualization data is realized with the "Rtsne" and "ggplot2" packages. Univariate and multivariate Cox regression analysis were used, and the "survival" package was used to determine independent prognostic factors. The nominal graph is used to predict the survival probability of HCC patients with R package "rms".

Functional enrichment analysis
Use gene set enrichment analysis (GSEA) to perform GO and KEGG function enrichment analysis on DEGs between high-risk and low-risk groups, and perform function enrichment analysis on Hallmark genes. Single-sample gene set enrichment analysis (ssGSEA) calculated 16 immune in ltration scores and the activity of 13 immune-related pathways between high and low risk groups based on the R package "gsva" [21] .
Tumor microenvironment and immune response analysis Immune score, matrix score and comprehensive score are generated using the ESTIMATE package loaded in R language (version 4.0.3) to estimate the ratio of immunity, matrix components and the sum of the two in TME for each sample, with 3 scores formal performance: Immune Score, StromalScore and Estimate Score, which respectively represent the scores of the immune component, matrix component and the sum of the two components, that is, the higher the respective score, the greater the proportion of the corresponding component in the TME [22] . Drug sensitivity analysis NCI-60 data of 60 different cancer cell lines from 9 different tumors were collected through CellMiner (https://discover.nci.nih.gov/cellminer).Pearson correlation analysis was performed to explore the relationship between prognostic gene expression and drug sensitivity. Related analysis of the e cacy of 263 drugs or clinical trials approved by the FDA.
The establishment of forecast nomogram Page 13/27 The 8 predictive genes determined by multiple regression analysis were used as independent prognostic factors to construct a nomogram. Use survival and time-related genes to construct a speci c model and connect to the ROC curve to determine the accuracy of the prediction model for 1 year, 2 years, and 3 years. Use the calibration chart and the consistency index (C index) to correct the nomogram (through the guidance method of 1000 resampling).

Statistical Analysis
WilCoxon test was used to compare DEGs between tumor tissue and normal tissue. The test of different proportions adopts the chi-square test. The Mann-Whitney test was used to compare the ssGSEA scores of immune cells or immune pathways between the high and low risk groups, and the BH method was used to adjust the P value. The Kaplan-Meier analysis method was used to compare the differences in OS between different groups. Univariate and multivariate Cox analysis were used to screen independent predictors of OS. Spearman or Pearson correlation analysis was used to detect the correlation of prognostic model risk score or prognostic gene expression level with dryness score, interstitial score, immune score and drug sensitivity.

Figure 11
Scatter plot of the relationship between 8 related prognostic gene expression and drug sensitivity.