Ferroptosis Related Prediction Model for Hepatocellular Carcinoma Patients Sensitive to Chemotherapy Embolization Therapy Based on Bioinformatics Analysis

doi:10.21203/rs.3.rs-3088052/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Objective: The objective of this study was to develop a predictive model that can help with effective transcatheter arterial chemoembolization (TACE) in treating hepatocellular carcinoma by identifying ferroptosis-associated genes.

Methods: In this study, the GSE104580 dataset from the GEO database was analyzed to identify significantly differentially expressed genes (DEGs), which were then used to identify genes associated with chemoembolization sensitivity and ferroptosis using weighted gene co-expression network analysis (WGCNA). These genes were then used to construct a TACE treatment sensitivity prediction model using lasso regression. Immune infiltration analysis was also conducted, and a hub mRNA, hub miRNA, and hub lncRNA interaction network was established. The TCGA dataset was used to construct a prognostic model, which was validated with the ICGC dataset.

Results: Using the GSE104580 dataset, a total of 2689 DEGs were screened, resulting in the identification of 37 genes. Protein-protein interaction (PPI) network analysis was performed based on these genes, and key genes involved in predicting TACE treatment sensitivity for liver cancer were identified through GO, KEGG, and GSEA analyses. Using the lasso regression method, six hub genes were identified: GLS2, CDKN1A, GPT2, ASNS, SLC38A1, and SLC2A1. Two distinct ferroptosis patterns were identified based on these hub genes, and immune infiltration analysis was conducted to further investigate potential associations with liver cancer. Additionally, a hub mRNA, miRNA, and LncRNA interaction network was constructed using data from miRTarBase, TarBase, and Starbase databases. Utilizing a 6-gene signature, two distinct risk groups were identified. Remarkably, patients classified within the high-risk group exhibited a significant decrease in overall survival when compared to their low-risk counterparts (P < 0.001 in the TCGA cohort and P = 0.013 in the ICGC cohort). In addition, the predictive capacity of this signature was further validated by receiver operating characteristic (ROC) curve analysis.

Conclusion: This study suggests that the six hub genes identified here could serve as important targets for improving liver cancer prognosis. These genes can also be used to construct TACE sensitivity prediction models that may help clinicians treat hepatocellular carcinoma.

Transarterial chemoembolization (TACE)

Hepatocellular carcinoma

ferroptosis

immune infiltration

Primary liver cancer is a significant public health concern worldwide: it ranks seventh in incidence and is the second most common cause of cancer-related death globally[1]. Incidence is highest in Asia and Africa[2]. Hepatocellular carcinoma (HCC) is the most prevalent malignancy originating from liver cells, and its incidence and mortality have been increasing worldwide in recent years[3]. The prognosis for patients diagnosed with HCC is unfavorable because of high rates of recurrence and metastasis and a high degree of malignancy. Treatment options for HCC patients include surgery, chemotherapy, targeted therapy, and transcatheter arterial chemoembolization (TACE)[4, 5].

TACE is an interventional therapeutic approach commonly employed to treat primary liver cancer[6]. This method involves the injection of iodized oil, carrying chemotherapy drugs, into the arterial supply zone of the liver cancer through femoral artery puncture into the hepatic artery[5]. The goal is to increase the concentration of chemotherapeutic drugs within the lesion and minimize systemic toxicity. However, this therapy can cause damage to the patient's liver since it not only kills cancer cells but also destroys normal liver tissue, leading to abnormal liver function[7]. Although TACE is one of the common techniques used in clinical treatment of advanced HCC patients, its effectiveness is not entirely satisfactory. One of the main reasons for the unsatisfactory outcomes of TACE treatment is that tumor cells have a higher metabolic activity, and when they face hypoxic-ischemic conditions, they initiate a series of reactions to restore vascular supply, thus promoting the formation of new blood vessels within the tumor tissue[8].

Ferroptosis is a form of iron-dependent programmed cell death driven by lipid peroxidation[9]. Numerous studies have shown that ferroptosis plays a crucial role in the occurrence and progression of liver cancer, making it a potential target for diagnosis, treatment, and prognosis[3]. Abnormalities in multiple iron metabolism and regulation pathways, such as iron overload and deficiency of iron regulatory proteins, are closely associated with the pathophysiology of liver cancer[9]. However, the mechanisms of ferroptosis-related genes (FRGs) in HCC remain unclear. To address this issue, this study used the GEO database to analyze genes differentially expressed between liver cancer tissues from patients in whom TACE was effective and those in whom it was ineffective, and combined them with FRGs to construct a predictive model for effective TACE treatment in liver cancer, providing clues for further exploration of the significance of FRGs in HCC.

2.1 Data acquisition and differential gene expression analysis

To identify genes associated with the response to transarterial chemoembolization (TACE), we obtained tumor biopsy gene expression data for TACE-effective and TACE-ineffective patients from the GEO database (GSE104580)[10]. The dataset comprised 147 patients, who were divided into two groups: 81 TACE-effective patients (Response) and 66 TACE-ineffective patients (Non-Response). Using the R package limma[11], we performed differential expression analysis between the two groups. Genes with an adjusted P-value below 0.05 were considered up-regulated if the log fold-change (logFC) was greater than 0.3 and down-regulated if it was less than −0.3[12].
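The thresholding rule above can be illustrated with a minimal Python sketch. It is not the limma workflow itself: the function name `classify_deg` and the four example genes are hypothetical, and only the cutoffs (logFC of ±0.3, adjusted P-value of 0.05) come from the text.

```python
def classify_deg(log_fc, adj_p, lfc_cutoff=0.3, p_cutoff=0.05):
    """Label one gene as 'up', 'down', or 'ns' (not significant)."""
    if adj_p < p_cutoff and log_fc > lfc_cutoff:
        return "up"
    if adj_p < p_cutoff and log_fc < -lfc_cutoff:
        return "down"
    return "ns"

# Hypothetical limma-style output: gene -> (logFC, adjusted P-value).
results = {
    "GENE_A": (0.45, 0.001),   # passes both cutoffs, positive change
    "GENE_B": (-0.62, 0.020),  # passes both cutoffs, negative change
    "GENE_C": (0.80, 0.200),   # large change but not significant
    "GENE_D": (0.10, 0.001),   # significant but below the logFC cutoff
}

labels = {gene: classify_deg(lfc, p) for gene, (lfc, p) in results.items()}
print(labels)
```

A gene must pass both cutoffs to count as a DEG, which is why GENE_C and GENE_D are labelled "ns".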

From the FerrDb database (http://www.zhounan.org/ferrdb/current/), we obtained the Ferroptosis-associated gene set (FAG), which contains 382 genes that are related to iron-mediated cell death. We then overlapped the differentially expressed genes and the FAG gene set to obtain the TACE-effective Ferroptosis-associated gene set[13]. We visualized this intersection using a Venn diagram and performed protein-protein interaction network analysis on the overlapping genes.
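The overlap step amounts to a set intersection, as the Venn diagram suggests. The sketch below uses short toy lists rather than the real DEG list or the 382-gene FerrDb set; `overlap` is a hypothetical helper name, and `GENE_X` and `GENE_Y` are placeholders.

```python
def overlap(deg_genes, ferroptosis_genes):
    """Return the genes present in both lists, sorted for reproducibility."""
    return sorted(set(deg_genes) & set(ferroptosis_genes))

# Toy inputs for illustration only (not the study's actual gene lists).
degs = ["GLS2", "CDKN1A", "GENE_X", "SLC2A1"]
ferrdb = ["SLC2A1", "GLS2", "GENE_Y", "CDKN1A"]

shared = overlap(degs, ferrdb)
print(shared)
```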

2.2 Functional and pathway enrichment analysis of DEGs

Gene Ontology (GO)[14] and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis are commonly used methods for studying large-scale functional enrichment of genes. GO includes biological processes (BP), molecular functions (MF), and cellular components (CC). Using the R package clusterProfiler, we performed GO and KEGG pathway enrichment analysis for the TACE-effective differentially expressed genes. We considered a p-value less than 0.05 to be statistically significant[15].
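Over-representation analyses of this kind are commonly based on a hypergeometric test. The sketch below shows that calculation for a single term, assuming a background of N genes of which K are annotated to the term, and a query list of n genes of which k are annotated. The function name and the toy numbers are illustrative and are not taken from the study.

```python
from math import comb

def enrichment_p(k, n, K, N):
    """Upper-tail hypergeometric probability P(X >= k).

    N: background genes, K: background genes annotated to the term,
    n: genes in the query list, k: query genes annotated to the term.
    """
    total = comb(N, n)
    upper = min(n, K)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total

# Toy example: 10 background genes, 3 in the term, 4 queried, 2 overlapping.
p_value = enrichment_p(k=2, n=4, K=3, N=10)
print(round(p_value, 4))
```

In practice the per-term P-values would also be adjusted for multiple testing, which this sketch leaves out.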

2.3 Gene Set Enrichment Analysis (GSEA)

Gene Set Enrichment Analysis (GSEA) is a computational method used to determine whether a specific gene set shows statistical differences between two biological states[16]. It is commonly used to estimate changes in pathway and biological process activity in expression datasets. To investigate the biological differences between TACE-effective and -ineffective patients, we used the R package clusterProfiler to perform GSEA on the gene expression data from dataset GSE104580 using the "c2.all.v7.5.2.entrez.gmt" gene set downloaded from the MSigDB database[17]. A false discovery rate-adjusted p-value less than 0.05 was considered statistically significant. We also presented the adjusted p-values and normalized enrichment scores (NES) obtained from the analysis.
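The core of GSEA is a running-sum statistic over a ranked gene list. The sketch below computes only the raw enrichment score (ES), using the usual weighting of hits by the absolute ranking metric raised to a power p. The normalized score (NES) and the adjusted P-values reported here additionally require permutation testing, which is omitted. The function name and the toy ranking are assumptions for illustration.

```python
def enrichment_score(ranked, gene_set, p=1.0):
    """Raw GSEA enrichment score.

    ranked: list of (gene, metric) pairs sorted by metric, descending.
    gene_set: set of gene names; p: weight exponent for hits.
    """
    hit_weights = [abs(m) ** p if g in gene_set else 0.0 for g, m in ranked]
    hit_total = sum(hit_weights)
    n_miss = sum(1 for g, _ in ranked if g not in gene_set)
    if hit_total == 0 or n_miss == 0:
        raise ValueError("need at least one weighted hit and one miss")

    running, best = 0.0, 0.0
    for (gene, _), w in zip(ranked, hit_weights):
        if gene in gene_set:
            running += w / hit_total   # step up at a set member
        else:
            running -= 1.0 / n_miss    # step down otherwise
        if abs(running) > abs(best):   # keep the largest deviation from zero
            best = running
    return best

# Toy ranking: six genes ordered by a hypothetical ranking metric.
ranked = [("a", 3.0), ("b", 2.0), ("c", 1.0), ("d", -1.0), ("e", -2.0), ("f", -3.0)]
es_top = enrichment_score(ranked, {"a", "b"})
es_bottom = enrichment_score(ranked, {"e", "f"})
print(es_top, es_bottom)
```

A set concentrated at the top of the ranking gives a positive ES and one concentrated at the bottom gives a negative ES.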

2.4 Weighted Gene Co-expression Network Analysis (WGCNA)

Weighted Gene Co-expression Network Analysis (WGCNA) is a systems biology approach used to describe gene correlation patterns between different samples[18]. It can be employed to identify highly coordinated sets of genes and candidate biomarkers or therapeutic targets based on the interconnectivity of the gene set and its association with phenotypes. In this study, we used the R package WGCNA to analyze the GSE104580 dataset. The minimum module gene number was set to 50, cut height was set to 135, soft power was set to the optimal threshold of 5, module merge cut height was set to 0.95, and minimum distance was set to 0.2 to obtain shared expression modules.
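To make the role of the soft power concrete, the sketch below builds a weighted adjacency from pairwise Pearson correlations raised to the power β = 5. It assumes an unsigned network, where adjacency is the absolute correlation to the power β, which the text does not specify. The expression values are invented, and module detection itself is not shown.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def adjacency(expr, beta=5):
    """Unsigned soft-threshold adjacency: |cor(g, h)| ** beta for g != h."""
    genes = list(expr)
    return {
        (g, h): abs(pearson(expr[g], expr[h])) ** beta
        for g in genes for h in genes if g != h
    }

# Invented expression profiles across four samples.
expr = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.0],    # perfectly correlated with g1
    "g3": [1.0, -1.0, 1.0, -1.0],  # weakly anti-correlated with g1
}
adj = adjacency(expr)
print(adj[("g1", "g2")], adj[("g1", "g3")])
```

Raising correlations to a power greater than 1 suppresses weak correlations much more than strong ones, which is the purpose of the soft threshold.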

2.5 Protein-Protein Interaction (PPI) Network Construction

The STRING database is a comprehensive resource for searching known and predicted protein-protein interactions[19]. We used STRING to construct PPI networks for differentially expressed genes associated with TACE activity and for differentially expressed prognostic genes related to TACE activity. Cytoscape (version 3.6.1)[20], an open-source bioinformatics software program, was used to visualize the molecular interaction networks. We employed the cytoHubba plugin in Cytoscape to identify the top 10 hub genes in the PPI network, using the maximal clique centrality (MCC) algorithm. To assess the functional relatedness of key genes, we calculated the functional similarity among them using the R package GOSemSim[21].

2.6 LASSO Regression Model Construction

LASSO (least absolute shrinkage and selection operator) regression is a feature selection method that simultaneously fits a generalized linear model and performs variable selection and complexity adjustment[22]. Its L1 penalty shrinks the coefficients of less important features to exactly zero, which improves model interpretability. We used the R package glmnet to perform LASSO regression analysis on the module genes. During model construction, we selected the optimal model to build a diagnostic model for TACE activity and defined the genes retained in the model as TACE activity-related features (ferroptosis-related genes, FRGs). We used the R package pROC to validate the model with ROC curves[23]. To show the differential expression of these features between TACE-effective and TACE-ineffective patients, we used boxplots, with P-values less than 0.05 considered significant.
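The way an L1 penalty sets coefficients to exactly zero can be seen in a small coordinate-descent sketch. It is deliberately simplified: it covers the linear (squared-error) case with no intercept, whereas a response/non-response model would normally use a logistic link. The data, the function names, and the λ values are all invented for illustration, and this is not the glmnet implementation.

```python
def soft_threshold(z, lam):
    """Shrink z toward zero by lam; values within [-lam, lam] become 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso_cd(X, y, lam, n_iter=200):
    """Coordinate descent for (1/2n)*||y - X b||^2 + lam*||b||_1 (no intercept)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            # Correlation of feature j with the residual that ignores feature j.
            rho = sum(
                X[i][j] * (y[i] - sum(X[i][k] * beta[k] for k in range(p) if k != j))
                for i in range(n)
            ) / n
            scale = sum(X[i][j] ** 2 for i in range(n)) / n
            beta[j] = soft_threshold(rho, lam) / scale
    return beta

# Invented data: y depends strongly on feature 0 and weakly on feature 1.
X = [[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]]
y = [2.0, -2.0, 0.1, -0.1]

beta_unpenalized = lasso_cd(X, y, lam=0.0)
beta_penalized = lasso_cd(X, y, lam=0.1)
print(beta_unpenalized, beta_penalized)
```

With λ = 0.1 the weak feature is dropped entirely while the strong one is only shrunk, which is the selection behaviour described above.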

2.7 Molecular Subtype Analysis of TACE Efficacy

Consensus clustering, a resampling-based algorithm, was used to identify cluster members and subgroups and to assess the stability of the clustering[24]. The ferroptosis-related genes previously filtered from GSE104580 were defined as key ferroptosis-associated genes. We ran consensus clustering on their expression profiles with the R package ConsensusClusterPlus and selected the optimal number of clusters. Different ferroptosis patterns were identified from the results.

2.8 Analysis of immune infiltration

CIBERSORT is a deconvolution algorithm based on linear support vector regression that estimates immune cell infiltration from gene expression data[25]. We used CIBERSORT to compare immune infiltration between TACE-effective and TACE-ineffective patients in the GSE104580 data and to identify immune cell types whose abundance differed between the two groups. We then calculated Pearson correlation coefficients between the expression levels of the ferroptosis-related genes and immune cell abundances to evaluate the relationship between these genes and immune infiltration.

2.9 Construction of Hub-mRNA, Hub-miRNA, and Hub-LncRNA Interaction Networks

Interactions of miRNAs and LncRNAs with the hub genes were analyzed at the post-transcriptional level. miRTarBase is a database that collects experimentally supported microRNA-target interactions (MTIs). It draws on more than 8,500 papers reporting miRNA-target interaction experiments and, with the addition of new CLIP-seq datasets, its latest collection exceeds 500,000 MTIs. Improved natural language processing (NLP) has allowed more targeting pairs, together with their network functions and annotations, to be collected. We used miRTarBase 2020 (https://mirtarbase.cuhk.edu.cn/) to predict miRNAs that bind the hub genes[26]. TarBase is likewise an experimentally supported miRNA target gene database[27]. Its latest version, v8, reflects about 10 years of curation and contains miRNA target gene information for multiple species. We predicted hub gene-miRNA interactions with both miRTarBase and TarBase and took the intersection.

The Starbase database[28] identifies microRNA targets from high-throughput CLIP-Seq and degradome sequencing data and provides a variety of visualization interfaces for exploring them, including abundant miRNA-ncRNA, miRNA-mRNA, RBP-RNA, and RNA-RNA data. We used Starbase to predict interactions between miRNAs and LncRNAs. We then constructed hub-mRNA, hub-miRNA, and hub-LncRNA interaction networks, visualized them with Cytoscape, and presented them as Sankey diagrams.

2.10 Construction and Validation of a Prognostic Ferroptosis-Related Gene Signature

To construct a prognostic ferroptosis-related gene signature, we acquired level 3 RNA sequencing (RNA-seq) data and corresponding clinical information of 363 patients with hepatocellular carcinoma (HCC) from the TCGA website up to November 15, 2022 (https://portal.gdc.cancer.gov/repository)[29]. Additionally, we obtained RNA-seq data and clinical information of another 232 tumor samples primarily derived from a Japanese population with HBV or HCV infection from the ICGC portal (https://dcc.icgc.org/projects/LIRI-JP).

To normalize the gene expression profiles, we used the scale method provided in the "limma" R package[11]. Normalized read count values were used for the samples obtained from the ICGC portal. Both datasets are publicly available, and thus, this study was exempted from the approval of local ethics committees. This research follows the TCGA and ICGC data access policies and publication guidelines. We performed Cox regression analysis on the TCGA-LIHC cohort to construct a prognostic model based on FRG. We then validated this model using the ICGC (LIRI-JP) cohort.
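A gene-signature risk model of this kind is often applied by computing, for each patient, a weighted sum of expression values and then splitting patients at a cutoff. The sketch below assumes that form and a median cutoff, neither of which is stated in the text. The coefficients, gene names, and patient values are hypothetical placeholders, not results from the Cox analysis.

```python
from statistics import median

def risk_scores(expr, coefs):
    """Risk score per patient: sum over genes of coefficient * expression."""
    return {
        pid: sum(coefs[g] * values[g] for g in coefs)
        for pid, values in expr.items()
    }

def split_by_median(scores):
    """Assign 'high' to scores above the cohort median, 'low' otherwise."""
    cut = median(scores.values())
    return {pid: ("high" if s > cut else "low") for pid, s in scores.items()}

# Hypothetical coefficients and normalized expression values.
coefs = {"GENE_A": 0.5, "GENE_B": -0.25}
expr = {
    "p1": {"GENE_A": 1.0, "GENE_B": 2.0},
    "p2": {"GENE_A": 3.0, "GENE_B": 1.0},
    "p3": {"GENE_A": 2.0, "GENE_B": 2.0},
    "p4": {"GENE_A": 0.0, "GENE_B": 0.0},
}

scores = risk_scores(expr, coefs)
groups = split_by_median(scores)
print(scores, groups)
```

For external validation, the same coefficients would be applied unchanged to the second cohort's expression values.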

2.11 Statistical Analysis

All data computations and statistical analyses were performed using R programming language version 4.0.2 (https://www.r-project.org/). Two-group comparisons of continuous variables were estimated for statistical significance with the independent Student's t-test, assuming normal distribution. For non-normally distributed variables, differences between groups were analyzed using the Mann-Whitney U test (also known as Wilcoxon rank-sum test). All statistical P-values reported were two-tailed, and a P-value less than 0.05 was considered to be statistically significant.
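For readers unfamiliar with the Mann-Whitney U test, the sketch below computes its test statistic by direct pair counting, with ties contributing one half. Converting U into a P-value (exactly or by normal approximation) is left out, and the sample values are invented.

```python
def mann_whitney_u(x, y):
    """U statistic for sample x: count of pairs with x_i > y_j, ties as 0.5."""
    u = 0.0
    for a in x:
        for b in y:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u

# Invented measurements for two groups.
group_1 = [1.0, 3.0, 5.0]
group_2 = [2.0, 3.0, 4.0]

u_1 = mann_whitney_u(group_1, group_2)
u_2 = mann_whitney_u(group_2, group_1)
print(u_1, u_2)
```

The two U values always sum to the product of the group sizes, which gives a quick sanity check.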

3.1 Differential analysis results

To investigate the changes in gene expression between TACE-effective and TACE-ineffective patients, we performed differential analysis on the GSE104580 dataset using the limma package. With the criteria |logFC| > 0.3 and adjusted P-value < 0.05, we obtained 2689 differentially expressed genes (DEGs), including 1255 upregulated and 1434 downregulated genes. We generated a hierarchical cluster heatmap (Fig. 2A) and a volcano plot (Fig. 2B) highlighting the top ten upregulated and downregulated DEGs, which effectively distinguished TACE-effective from TACE-ineffective patients. Furthermore, we identified 45 common genes by intersecting the DEGs from GSE104580 with the ferroptosis-related genes, illustrated by a Venn diagram (Fig. 2C). To explore the potential interactions among these 45 genes, we constructed a protein-protein interaction network (Fig. 2D).

3.2 Functional Enrichment Analysis of Differentially Expressed Genes

To investigate the relationship between differentially expressed genes (DEGs) significantly associated with TACE activity and various biological processes, molecular functions, cellular components, biological pathways, and diseases, we conducted a functional enrichment analysis on these DEGs. We found that the DEGs significantly associated with TACE activity were mostly enriched in several biological processes including "cellular response to chemical stress," "response to nutrient levels," and "response to extracellular stimulus" (Fig. 3A). Furthermore, they were also found to be enriched in various cellular components such as "basal plasma membrane," "basal part of cell," and "melanosome" (Fig. 3B), as well as molecular functions like "neutral amino acid transmembrane transporter activity," "organic anion transmembrane transporter activity," and "L-amino acid transmembrane transporter activity" (Fig. 3C). Next, we performed pathway enrichment analysis on these DEGs and found that they were enriched in several biological pathways such as "Central carbon metabolism in cancer," "Ferroptosis," and "Bladder cancer" (Fig. 3D).

3.3 Enrichment Analysis using Gene Set Enrichment Analysis (GSEA)

The results of the GSEA enrichment analysis with an adjusted P-value of less than 0.05 yielded significant gene sets including REACTOME_PHASE_I_FUNCTIONALIZATION_OF_COMPOUNDS, REACTOME_CYTOCHROME_P450_ARRANGED_BY_SUBSTRATE_TYPE, REACTOME_BIOLOGICAL_OXIDATIONS, WP_OXIDATION_BY_CYTOCHROME_P450, and REACTOME_RESOLUTION_OF_SISTER_CHROMATID_COHESION (Fig. 4A-D).

3.4 Identification of Co-expression Modules in DEGs through WGCNA Analysis

To identify the co-expression modules present in the differentially expressed genes (DEGs) of TACE ineffective and effective groups, we performed weighted gene co-expression network analysis (WGCNA). During the WGCNA analysis of the GSE104580 dataset, we observed one outlier sample through cut height setting (Fig. 5A). By plotting a scatter plot, we determined 5 as the optimal soft threshold value for subsequent analyses (Fig. 5B). Subsequently, co-expressed genes from both groups were clustered into darkturquoise, ivory, royalblue, and sienna3 modules (Fig. 5C). Based on the expression patterns of module genes and grouping information, we assessed the correlation between the modules and TACE effectiveness. We selected the darkturquoise, ivory, royalblue, and sienna3 modules that showed positive correlation with TACE effectiveness and significant p value < 0.05 for further analysis (Fig. 5D).

3.5 Protein-protein interaction network analysis

To identify the crucial genes associated with TACE, a comprehensive analysis was conducted using GSE104580 dataset. A total of 37 intersecting genes were identified through co-expression and differential expression analysis with ferroptosis-related genes and TACE-associated genes. The Venn diagram was constructed to visualize the overlapping genes (Fig. 6A).

To further explore the protein-protein interactions among the TACE-associated genes, a protein-protein interaction network was constructed using Cytoscape software (Fig. 6B). The cytoHubba plugin's MCC algorithm was utilized to calculate the top 10 hub genes, which included GLS2, CDKN1A, SRC, GPT2, SLC7A11, SLC7A5, ASNS, SLC38A1, SLC2A1, and SLC1A5 (Fig. 6C). The correlation coefficient heatmap was generated for the hub genes to illustrate their relationships (Fig. 6D).

3.6 LASSO Regression Constructing RESPONSE Diagnostic Model and Identifying Feature Genes

To identify feature genes specific to TACE response and assess their diagnostic capability, we used LASSO regression. We randomly split GSE104580 into a training group and a testing group at a ratio of 4:1, using the training group to construct the model and the testing group to verify it. During model building, as λ increased, the number of selected features decreased and the absolute values of the coefficients shrank toward zero (Fig. 7A,B). After selecting the number of features, we constructed a model containing six feature genes (ferroptosis-related genes, FRGs): GLS2, CDKN1A, GPT2, ASNS, SLC38A1, and SLC2A1. We validated the model by plotting ROC curves and calculating the area under the curve (AUC) for both groups. The AUC values of the training and testing groups were 0.846 and 0.822, respectively (Fig. 7C). Furthermore, differential analysis between the TACE-effective and TACE-ineffective groups showed that all six feature genes (GLS2, CDKN1A, GPT2, ASNS, SLC38A1, and SLC2A1) were differentially expressed with statistical significance (p < 0.05), as shown in box plots (Fig. 7D).
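The AUC values quoted above summarize a ROC curve. As a rough illustration of how such a value is obtained, the sketch below sweeps a decision threshold over model scores, records the false-positive and true-positive rates, and integrates with the trapezoid rule. The labels and scores are invented and do not reproduce the reported AUCs; the function names are hypothetical.

```python
def roc_points(labels, scores):
    """ROC points (FPR, TPR) from high to low threshold; labels are 0 or 1.

    Assumes at least one positive and one negative label.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    points = [(0.0, 0.0)]
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for l, s in zip(labels, scores) if s >= t and l == 1)
        fp = sum(1 for l, s in zip(labels, scores) if s >= t and l == 0)
        points.append((fp / n_neg, tp / n_pos))
    return points

def auc(points):
    """Area under the piecewise-linear ROC curve (trapezoid rule)."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2.0
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )

# Invented example: 1 = responder, 0 = non-responder.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

area = auc(roc_points(labels, scores))
print(round(area, 3))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of the two groups.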

3.7 Identification of Two Ferroptosis Patterns Based on Feature Genes

We used the "ConsensusClusterPlus" package in R to perform consensus clustering on the expression of the six ferroptosis-related feature genes. A consensus matrix plot for k = 2 was generated (Fig. 8A). We also calculated the relative change in the area under the cumulative distribution function (CDF) curve from k = 2 to k = 9 (Fig. 8B), and we show the CDF of the consensus clustering results (Fig. 8C) as well as the tracking plot (Fig. 8D).

3.8 Immunoinfiltration Analysis

To investigate differences in immune infiltration between patients who responded to transarterial chemoembolization (TACE) and those who did not, we used the CIBERSORT algorithm to estimate the infiltration of 22 immune cell types in both groups using the GSE104580 dataset (Fig. 9A). The Wilcoxon rank-sum test (wilcox.test in R) showed that six immune cell types differed significantly between TACE responders and non-responders (Fig. 9B): T cells follicular helper, T cells gamma delta, Macrophages M0, Macrophages M1, Mast cells resting, and Neutrophils.

Furthermore, we analyzed the correlation between the expression levels of six ferroptosis-related genes and the abundance of 22 immune cell types. Our analysis identified significant correlations at p < 0.05 between ASNS and T cells gamma delta, NK cells activated, and Mast cells resting; CDKN1A and NK cells resting and Macrophages M0; GLS2 and T cells CD4 memory activated, T cells follicular helper, T cells regulatory (Tregs), and Macrophages M0; SLC2A1 and T cells follicular helper and Neutrophils; and SLC38A1 and T cells CD4 memory activated, T cells gamma delta, and NK cells activated (Fig. 9C).

3.9 Interactions among hub-mRNAs, hub-miRNAs, and hub-LncRNAs in a network

In this study, we constructed an interaction network among LncRNAs, miRNAs, and mRNAs. The network included two ferroptosis-related genes, CDKN1A and SLC2A1. Using the miRTarBase and TarBase databases, we predicted interactions between miRNAs and the ferroptosis-related genes and found 101 interaction pairs shared by both databases (Fig. 10A). Further screening left 75 pairs supported by experimental evidence. We then used the Starbase database to predict LncRNAs that interact with these miRNAs and built a Sankey diagram (Fig. 10B) and a network map (Fig. 10C) of the LncRNA-miRNA-mRNA network.

3.10 Construction and validation of LIHC prognostic model based on ferroptosis-related genes

In this study, we constructed a prognostic model for liver hepatocellular carcinoma (LIHC) patients based on six ferroptosis-related genes (GLS2, CDKN1A, GPT2, ASNS, SLC38A1, and SLC2A1). Using patient data from The Cancer Genome Atlas (TCGA), we classified patients into high and low risk groups. Kaplan-Meier survival analysis showed that the model had a hazard ratio of 0.52 (0.36–0.74) and P < 0.001 (Fig. 11A). Additionally, the time-dependent receiver operating characteristic (ROC) curve analysis revealed an area under the curve (AUC) of 0.705, 0.636, and 0.658 at 1, 2, and 3 years, respectively (Fig. 11B).

To validate the model, we used patient data from the International Cancer Genome Consortium (ICGC) and calculated the risk score for each patient. We then classified patients into high and low risk groups and performed Kaplan-Meier survival analysis. The results showed that the model had a hazard ratio of 0.45 (0.24–0.85) and P = 0.013 (Fig. 11C). Moreover, the time-dependent ROC curve analysis revealed an AUC of 0.728, 0.642, and 0.657 at 1, 2, and 3 years, respectively (Fig. 11D).

Hepatocellular carcinoma (HCC) is one of the most common malignant tumors worldwide, characterized by high mortality and low survival rates[1]. Transarterial chemoembolization (TACE) is widely used in the treatment of advanced HCC[5]. However, TACE therapy still faces many challenges such as sub-optimal efficacy and potential side effects. Therefore, identifying and selecting the most sensitive patients by predicting the biological markers for TACE therapy response is crucial. Ferroptosis, a novel non-apoptotic programmed cell death mechanism, has received increasing attention in recent years and has been demonstrated to have potential applications in the treatment of HCC[3]. The basic characteristics of ferroptosis include iron metabolism disorder, amino acid antioxidant system imbalance, and accumulation of peroxidized lipids. In HCC treatment, ferroptosis can inhibit the growth, proliferation, and metastasis of HCC cells through various induction mechanisms, key enzymes, and signaling pathways[3]. Therefore, inducing ferroptosis in HCC cells by targeting ferroptosis-related proteins, transcription factors, and key enzymes may be a promising direction for future specific HCC therapy. However, the role of ferroptosis molecules in TACE therapy for HCC is still unclear.

In this study, we systematically evaluated the role of ferroptosis molecules in HCC and selected 10 prognostic genes (GLS2, CDKN1A, SRC, GPT2, SLC7A11, SLC7A5, ASNS, SLC38A1, SLC2A1, SLC1A5) for further study. In addition, we constructed and validated a new set of prognostic markers based on ferroptosis molecules for predicting the therapeutic efficacy of TACE in HCC patients (GLS2, CDKN1A, GPT2, ASNS, SLC38A1, SLC2A1). In the constructed hub-mRNA, hub-miRNA and hub-LncRNA interaction networks, CDKN1A and SLC2A1 occupied the core positions. The CDKN1A gene encodes a key regulator of the cell cycle. It can inhibit various cyclin-dependent kinases that play different roles at different stages of the cell cycle, and its activity is strictly regulated[30]. The expression and regulation of CDKN1A have important effects on processes such as the cell cycle, apoptosis, and the DNA damage response and repair. Moreover, the expression of CDKN1A can be regulated by various signaling pathways such as p53, NF-κB, and TGF-β. Bin Ma reported that N-trans-feruloyldopamine can accelerate HCC cell apoptosis by regulating the CDKN1A signal[31]. Bei Li found that SNHG1 promotes the progression of HCC by epigenetically silencing CDKN1A and CDKN2B in the nucleus[30].

The SLC2A1 gene encodes glucose transporter 1 (GLUT1), one of the important glucose transporters in mammals. This protein transports glucose across the cell membrane from regions of high concentration to regions of low concentration to meet the cell's need for glucose. In humans, SLC2A1 is mainly distributed in the blood-brain barrier and other tissues and has other important physiological functions. For example, it can participate in insulin-mediated glucose metabolism and regulate neuronal energy metabolism. Ya Li reported that STAT3 directly regulates the transcription of CD47 and SLC2A1 and confirmed that STAT3 is a potential target for treating HCC by reshaping the tumor immune microenvironment[32]. Jie Yao reported that isorhamnetin-3-O-galactoside can significantly inhibit the expression of SLC2A1/GLUT1 and glucose uptake, leading to the activation of the AMPK-ULK2 axis in HepG1 cells[33]. The overexpression of SLC2A1/GLUT1 can reverse iso-induced autophagy.

Ferroptosis plays an important role in tumor immune regulation. To further explore the relationship between TACE efficacy and the tumor immune microenvironment, we compared immune cell infiltration between the TACE-effective and TACE-ineffective groups. The results showed that infiltration by immune cells such as macrophages and neutrophils was significantly increased in the TACE-ineffective group, suggesting an immune-suppressive microenvironment in these patients. An immune-suppressive microenvironment is an important mechanism by which tumor cells evade immune attack and accelerate disease progression. However, clinical studies are needed to confirm this speculation. In the future, we hope that a better understanding of the immune-suppressive microenvironment will support the development of effective treatment strategies that overcome its limitations.

In conclusion, we have established and validated a therapeutic efficacy model for transarterial chemoembolization (TACE) in hepatocellular carcinoma (HCC) patients. This model is based on a set of prognostic molecules that are closely associated with the occurrence of ferroptosis, including GLS2, CDKN1A, GPT2, ASNS, SLC38A1, and SLC2A1. Among these genes, CDKN1A and SLC2A1 were found to be central in the complex regulatory network involving long non-coding RNA (LncRNA), microRNA (miRNA), and messenger RNA (mRNA). Nevertheless, the specific molecular mechanisms underlying the function of these genes in TACE therapy and ferroptosis remain elusive, and further investigation is warranted.

Acknowledgments

This work was supported by the Science and Technology Development Fund of Nanjing Medical University (no. NMUB20210212).

Authors’ contributions: Jiang Rui, Liu Zhengli and Fu Guanqi wrote the main manuscript. Zhao Boxiang, Gong Maofeng prepared figures 1-6, Lu Zhaoxuan, Zhou Yangyi prepared figures 7-11, and Chen Liang helped with the revised picture. Jiang Rui, Fu Guanqi collected data and Kong Jie, Liu Zhengli conducted data interpretation. Kong Jie, Su Haobo, Lou Wensheng, and Chen Guoping performed data analysis, and Wang Feng performed the Immunocytochemistry. Critical revision of the manuscript and final approval for publication were done by Gu Jianping and He Xu. Gu Jianping, Kong Jie and He Xu designed, guided and funded the study. All authors reviewed the manuscript.

Data available to be shared:

The raw data required to reproduce the above findings are available to download from the databases cited in the manuscript. The processed data required to reproduce the above findings are available from Kong Jie, one of the corresponding authors.

Disclosure

The authors report no proprietary or commercial interest in any product mentioned or concept discussed in this article.


No competing interests reported.
