Single cell RNA sequence combined with bulk RNA sequence to construct and validate a novel prognostic signature based on cell developmental trajectory- related genes to predict prognosis and immunotherapeutic response in Hepatocellular Carcinoma

doi:10.21203/rs.3.rs-2293095/v1

Download PDF

Article

Single cell RNA sequence combined with bulk RNA sequence to construct and validate a novel prognostic signature based on cell developmental trajectory- related genes to predict prognosis and immunotherapeutic response in Hepatocellular Carcinoma

https://doi.org/10.21203/rs.3.rs-2293095/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background: Immunotherapy is the first-line treatment for advanced liver cancer. However, there is a dearth of validated molecular indicators to predict immunotherapy response.

Methods:Through a combined analysis of bulk RNA sequence data from 365 HCC cases and single cell sequence data from 2 HCC cases, we created a predictive signature based on cell developmental trajectory-related genes. Some comprehensive examinations, such as Microsatellite Instability (MSI), Tumor Mutation Burden (TMB), somatic mutations, and Tumor Immune Dysfunction and Exclusion (TIDE), were carried out to study the connection of immunotherapy response to prognostic signature. External datasets and immunotherapy datas were downloaded and analysed to validate the robustness and superiority of the signature.

Results:365 HCC patients were divided into high-risk group or low-risk group based on scores assessed by the prognosis signature. T stage and Tumor Grade were found to be positively correlated with risk scores. And prognosis in low-risk group was much better when compared with high-risk group. We also confirmed this prognostic signature using external independent data. Interestingly, the high-risk patients had a higher proportion of intratumoral immune cell infiltration, a higher MSI score, a higher TMB score, and a lower TIDE score. These findings suggest that high-risk patients may be more responsive and suitable for immunotherapy. IMvigor210 immunotherapy data confirmed higher response rates in the high-risk group of patients.

Conclusion : Our prognostic signature based on cell developmental trajectory-related genes showed superior performance in predicting the prognosis and immunotherapy response in HCC patients, and has the potential to be an effective tool for clinicians to screen a population suitable for immunotherapy in the future.

Biological sciences/Cancer

Biological sciences/Immunology

Biological sciences/Molecular biology

Hepatocellular carcinoma

scRNA-seq

TMB

MSI

In 2020, about 906,000 new cases of hepatocellular carcinoma (HCC) and 830,000 deaths occurred worldwide, making it the sixth most common cancer^[1]. Although radical treatment for liver cancer includes surgery, ablation, liver transplantation, etc., most patients are in advanced stage when they visit the hospital and lose the opportunity of these radical treatments, leaving few viable treatment options^[2]. The phenotypic and genetic heterogeneity of cancer increases the risk of recurrence and metastasis following potentially curative operations like excision and ablation^[3]. About fifty percent of patients were treated with systemic therapy, such as immune checkpoint inhibitors (ICIs), anti-angiogenic targeted drugs (AATDs), tyrosine kinase inhibitors (TKIs), and systemic chemotherapy, is administered to around 50% of HCC patients globally^[4]. There is an increasing attention to the phenomenon that the interaction between malignant cells and the immune microenvironment promotes the formation and development of HCC. The tumor microenvironment (TME) is a dynamic network of tumor cells interacting with diverse surrounding cellular materials^[5].Immunosuppressive microenvironment facilitates tumor cell growth, treatment resistance, invasion, and metastasis in HCC^[6]. By "normalize" the TME, immunotherapy and targeted medicines currently increase autoimmune cell destruction of malignant cells^[7]. In the last five years, immunotherapy has shown to be an increasingly important role in the perioperative treatment of HCC, and has achieved significant achievement in improving the prognosis of patients with HCC^{[8, 9]}. Although immunotherapy has made important progress, immunotherapy for HCC still faces many challenges, such as low response rate, lack of effective biomarkers to predict immunotherapy response and strong toxic side effects, which seriously affect the efficacy of immunotherapy^[10]. The limitation of immunotherapy also contains that only a subset of patients can actually benefit, and in some cases, patients even experience a hyper-progressive disease(HPD) state after immunotherapy^[11]. Therefore, screening people who are suitable for immunotherapy based on biological markers is of great value. The common clinical predictors for immunotherapy efficacy include as follows: TMB, MSI, and immune checkpoint-related proteins’ expression, and so on. However, none of these predictors can judge the immunotherapy efficacy ideally in HCC patients. Therefore, to develop ideal prognostic markers is of urgent need.

Single-cell sequencing differs from conventional sequencing in that it focuses on the sequencing of a single cell, the smallest functional unit of a living organism. Because of its high resolution, the characteristics of cell subset or single cell could be precisely identified, thereby revealing the gene signatures of a single cell and reflecting the heterogeneity between cells. Single-cell sequencing has emerged as a potent tool for studying the tumor microenvironment, directing cancer immunotherapy, and elucidating the causes of immunotherapy resistance.

In this study, we combined single-cell sequence data and bulk RNA sequence data to construct a new prognostic signature based on genes associated with cell developmental trajectory. We comprehensively analyzed the relationship between the prognostic signature and clinical features, immune cell infiltration, MSI, TMB, somatic mutations, and TIDE. Our signature not only can effectively predict the prognosis of HCC patients, but also can effectively screen the population suitable for immunotherapy, providing an important reference value for clinicians' decision making.

Data Acquisition

The single-cell sequencing data of the two HCC patients(HCC1,HCC2) included in this paper were downloaded from the GEO database (GSE146115). The bulk RNA sequencing information and clinical information of 365 HCC samples in the test set were downloaded from TCGA database (https://portal.gdc.cancer.gov/), and the bulk RNA sequencing information and clinical information of 240 HCC patients in the validation set were downloaded from ICGC database (https://dcc.icgc.org/). Immunotherapy datas of 195 bladder cancer patients (IMvigor210) were downloaded from the IMvigor210CoreBiologies R package

Process flow of scRNA-seq Data

We first used the Seurat package of R software to create Seurat objects for the two HCC samples. Next, we filtered out any cells where the average expression level of genes was below 500, above 8000, or above 5% for mitochondrial genes. Expression data normalization was performed with the NormalizeData function, and data normalization was performed with the ScaleData function. We used the FindVariableFeatures function to select the top 2000 highly variable genes for principal component analysis (PCA). The top 20 PCAs were utilized to downscale the cells into clusters using a t-distributed random neighborhood embedding (t-SNE) method. The FindAllmarker function was used to find the highly expressed genes in each cell cluster. Pseudo-temporal analysis of the cells was carried out by me using the Monocle package of R software in order to search for genes that change during cell development.

Weighted Gene Correlation Network Analysis Identifies the most relevant module for Tumor Grade

We used the WGCNA package of R software to analyze the cell development-related genes and identify the gene modules most strongly associated with Tumor Grade(Grade). WGCNA has two parts: expression clustering analysis and phenotypic association, which has four steps: gene correlation coefficients, gene module identification, co-expression network, and module-trait linkage. Soft thresholding power is the lowest power with which scale-free topology fit index reaches 0.90. After clustering, the modules were shown via dendrogram. Examining their correlation coefficients and p values, we then determined which modules were associated with Tumor Grade. Finally, we selected the genes corresponding to the pertinent modules for the subsequent round of analysis.

Differentially expressed genes associated with cell developmental trajectories and Functional enrichment analysis

To obtain differentially expressed cell developmental trajectory-related genes, we extracted the mRNA expression matrix of cell developmental trajectory-related genes and analyzed it between tumor and normal tissues in 424 patients (365 tumor tissues, 64 normal tissues) using the Limma package of R software. We screened genes with adjust.pvalue < 0.05 and log2FoldChange > 1 or log2FoldChange < -1 for downstream analysis. To explore the molecular mechanisms of genes associated with differential cell developmental trajectories, we performed Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes(KEGG) functional enrichment analysis using the clusterProfiler package of R software.

Construction and validation of a prognostic signature based on genes associated with cell developmental trajectories

We first performed Cox univariate analysis of the differentially expressed cell developmental trajectory-associated genes obtained to screen for genes that affect the prognosis of HCC patients. Subsequently, we used the glmnet package of R software to screen for genes that could be used as independent factors to influence patient prognosis. We calculated the risk score for each HCC patient in the test set using the formula: Risk score=∑(Expi*Coefi), where Expi represents the gene expression value and Coefi represents the risk coefficient. Based on the median risk score, we divided the test set of 365 HCC patients into high-risk and low-risk groups and compared the overall survival differences between the two groups using Kaplan-Meier survival curve analysis. ROC curves were used to determine the performance of risk scores for predicting 1-, 3-, and 5-year survival in HCC patients. The validation group of 240 HCC patients all had their risk scores determined using the same methodology. Patients were classified as either high-risk or low-risk based on their median risk score. The outcomes of the two groups were compared using Kaplan-Meier survival curve analysis. The validity cohort's ROC curves were used to evaluate the risk scores' ability to predict outcomes.

Identification of prognostic signature as an independent prognostic factor

To further explore the underlying mechanisms behind the risk scores, we further compared the relationships between risk scores and clinical characteristics such as age, gender, T-stage, N-stage, grade, and survival status. We first compared, the differences in the distribution of risk scores between the different clinical subgroups. To identify prognostic factors, we first ran a Cox univariate analysis of clinical parameters and risk scores. On the basis of the acquired data, we then conducted a Cox multivariate analysis to identify additional independent risk factors influencing the prognosis of HCC patients. To further enhance the clinical feasibility of the prognostic signature, we used the rms package of R software to incorporate risk scores and clinical characteristics into the analysis and construct a nomogram to more accurately predict the prognosis of HCC patients.

Gene Set Enrichment Analysis（GSEA）

Gene Set Enrichment Analysis determines if a priori chosen gene sets reveal statistically significant differences between two biological states. GSEA is not limited to differential genes and can include minor yet coordinated alterations in biological processes. We used GSEA software to compare the differences between the high-risk and low-risk groups in the enrichment pathway. The p<0.05 was used to determine whether the pathway was significantly enriched..

Mutation analysis and Microsatellite Instability(MSI)

We downloaded mutation information from TCGA for 365 HCC patients and calculated the somatic mutation frequency and tumor mutation burden for each sample using the maftools package of R software. We then compared the relationship between prognostic signature and tumor mutation burden. Microsatellite Instability (MSI) is a phenomenon of microsatellite (MS) sequence length change due to insertion or deletion mutations during DNA replication, often caused by mismatch repair (MMR) defects.

To fully explore, the relationship between prognostic signature and immunotherapy response, we MSI was introduced to the analysis. The MSI data of patients in TCGA-LIHC were obtained from the study of Bonneville^[12] et al.

Immune Cells Infiltration and Gene Sets Variation Analysis (GSVA)

CIBERSORT is based on the principle of linear support vector regression, which deconvolutes the expression matrix of human immune cell subtypes, and the content of each immune cell subpopulation is calculated by deconvolution^[13]. To fully elucidate the underlying mechanisms behind the prognostic signature, we used the CIBERSORT function of R software to deconvolute the bulk RNA sequencing data from 365 HCC patients to obtain the proportion of immune cells infiltrating the tumor microenvironment. By translating the expression matrix of genes across samples into the expression matrix of gene sets across samples, GSVA evaluates the enrichment of various pathways across samples^[14]. Therefore, with the special features of GSVA, we evaluated the proportion of 28 immune cells in each sample using a gene set containing information on the expression of 28 immune cells.

Prediction of response to immunotherapy

The degree of expression of immune checkpoint-associated proteins is the cornerstone of HCC immunotherapy. The link between prognostic signature and mRNA expression levels of immune checkpoint-related genes routinely used in immunotherapy for hepatocellular cancer was investigated initially. The TIDE score was subsequently developed to analyze the association between prognostic signature and immunotherapeutic response. TIDE employs the induction of T-cell dysfunction in tumors with high infiltrating cytotoxic T lymphocytes and the prevention of T-cell infiltration in tumors with low infiltrating cytotoxic T lymphocytes to model the mechanism of tumor immune escape and determine the efficacy of immunotherapeutic response. To validate the relationship between prognostic signature and immunotherapy, we validated our results in the IMvigor210 immunotherapy data.

Statistical analysis

Wilcoxon t-test was used to compare categorical variables between subgroups. cox univariate analysis and cox multivariate regression analysis were used to screen for independent risk factors affecting prognosis. Kaplan-Meier analysis was used to compare survival differences between subgroups and to plot survival curves. p<0.05 was determined to be statistically significant. All analyses in this study were performed in R software (version 4.1.2).

Acquisition of genes related to cell developmental trajectories

After a rigorous selection process, we obtained 1180 high-quality cells for downstream analysis. PCA results showed that the cells of HCC1 and HCC2 could be clearly distinguished(Figure 1A).Following PCA and tSNE analysis, seven distinct types of cell clusters were found(Figure 1B). We next annotated each cell cluster using the "SingleR "package of R software, and then used tSNE to display the annotated cell type(Figure 1C). We used a heatmap to show the top 10 highly expressed signature genes in each cell cluster(Figure 1D). Three distinct cell types were isolated: hepatocytes, macrophages, and T cells. Hepatocytes constitute a significant portion of them. All cells were subjected to cell trajectory and pseudo-time analyses(Figure 1E- Figure 1F,Figure S1A-Figure S1B). By pseudo-temporal analysis, we identified 442 genes associated with cell developmental trajectories for subsequent analysis.

Identification of modules associated with Tumor Grade

Tumor Grade is closely related to the prognosis of liver cancer. We utilized WGCNA to investigate 424 genes linked to cell developmental trajectories in order to find the gene modules most strongly correlated with Tumor Grade. When developing the co-expression network, we found that the soft thresholding power was 5 when the fit index of the scale-free topology reached 0.90(Figure 2A). Four modules were found using soft thresholding power and average linkage hierarchical clustering. For further study, we extracted 270 genes from the MEturquoise, MEbrown, and MEgrey modules, all of which were found to be associated with Tumor Grade(Figure 2B).

Identification of genes associated with differentially expressed cell developmental trajectories

We used mRNA expression data from TCGA-LIHC to perform differential analysis on normal (n=64) and malignant (n=365) tissues in the training set. Using the filtering criteria of p-value <0.05 and logFC > 1 or logFC < -1, we were able to determine that 75 genes were upregulated s and 21 genes were downregulated in Tumor.(Figure 2C). The heatmap revealed striking differences in gene expression between tumor and normal tissue(Figure 2D). GO enrichment analysis showed that these differential expressed genes were mainly enriched in the metabolic signaling pathways of olefin, terpenoid, pro-alcohol and isoprostanol (Figure 2E-F). KEGG enrichment analysis showed significant enrichment in steroid hormone biosynthesis, fatty acid degradation, chemical carcinogenesis-reactive oxygen species, and glycolysis/gluconeogenesis signaling pathways(Figure S2). These results suggest that the obtained differentially expressed genes related to cell developmental trajectories are engaged in cellular energy and material metabolism.

Construction and validation of a prognostic signature

Cox univariate analysis was performed on 96 differentially expressed genes, and 49 genes were found to affect HCC patients' prognoses (p<0.05). After that, we performed Lasso Cox analysis on 49 genes and selected 9 genes to construct prognostic signature(Figure 3A- Figure 3B). In the training set, we calculated the risk score for each patient using the following formula: Risk score = CCT5 mRNA expression level * 0.1426 + FLVCR1 mRNA expression level * 0.1057 + CCDC88A mRNA expression level * 0.086 + ADH4 mRNA expression level * (0.0677) + CENPE mRNA expression level *0.4481 + AKR1B10 mRNA expression level * 0.046+ ABCB6 mRNA expression level * 0.2454+ EIF2B4 mRNA expression level * 0.4178+ COMMD3 mRNA expression level * 0.1888. We used the median risk score value to classify the365 HCC patients into high-risk group or low-risk group in the training set. Analysis of Kaplan-Meier survival curves revealed that the low-risk group had a much longer overall survival time. (Figure 3C). We found that the higher the risk score, the higher the proportion of patients who died. Most gene expression expression levels were positively correlated with the risk score(Figure 3D).The ROC curve was then used to assess the risk sore’s prediction effect in judging the prognosis of HCC patients, with mean AUC values of 0.806, 0.731 and 0.690 at 1, 3 and 5 years, respectively(Figure S3A). To verify this risk signature, we used the same method to classify 240 patients in the validation set, and not surprisingly, the low-risk group survived better than the high-risk group. (Figure 3E). The ROC curves demonstrated that the risk score was a strong predictor of HCC patient survival at 1, 2, and 3 years (1-year AUC:0.735, 2-year AUC:0.735, and 5-year AUC:0.736)(Figure S3B). Similarly, we also found in our validation set that the higher the risk score, the higher the proportion of patients who died, and most gene expression expression levels were positively correlated with the risk score(Figure 3F).

GSEA enrichment analysis results

We performed GSEA enrichment analysis to further explore the differences of molecular processes or pathways enrichment between the high-risk and the low-risk group. GSEA enrichment analysis showed that OOCYTE_MEIOSIS, CELL_CYCLE, and PYRIMIDINE_METABOLISM pathways significantly enriched in the high-risk group, while COMPLEMENT_AND_COAGULATION CASCADES, FATTY_ACID_METABOLISM, and PRIMARY_BILE_ACID_BIOSYNTHESIS pathways extraordinarily enriched in the low-risk group of patients(Figure S4).

Relationship between prognostic signature and clinical characteristics

We compared the link between prognostic signature and age, gender, N stage, T stage, M stage, and Tumor Grade to investigate the underlying mechanisms underpinning the performance of prognostic signature in predicting prognosis in HCC patients. The more advanced the tumor stage, the higher the risk score(Figure 4A-4F). The results of univariate analysis of clinical characteristics and risk scores showed that M stage, N stage, T stage and risk scores had an impact on the prognosis of HCC patients(Figure 4G). The results of multivariate analysis showed that T stage and risk score were independent risk factors affecting the prognosis of HCC patients (Figure 4H). We compared the performance of clinical characteristics and risk scores for predicting the prognosis of HCC patients and found that the area under the ROC curve for risk scores (AUC=0.661) was significantly higher than other clinical characteristics, suggesting that our prognostic signature prognosis has better performance(Figure S5A). To better predict the prognosis of HCC patients in conjunction with other clinical parameters, we developed a nomogram based on clinical characteristics and risk ratings to predict the prognosis of HCC patients(Figure S5B).

Relationship between immune cell infiltration and prognostic signature

The remarkable predictive accuracy of the prognostic signature motivated us to investigate its underlying molecular characteristics further. We studied the connection between the prognostic signature and immune cell infiltration. When compared to low-risk group, the TME of high-risk group showed a remarkable prevalence in T cells CD8, T cells CD4 memory activation, T cells follicular helper, and Macrophages M0, and on the contrary, an obviously shrinkage in T cells CD4 memory resting and Macrophages M2. (Figure 5A). GSVA immune gene set enrichment analysis showed that Activated CD4 T cell, Activated dendritic cell, and Type 2 T helper cell were significantly enriched in the high-risk group of patients, whereas Effector memeory CD8 T cell, Eosinophil, Mast cell, Natural killer cell, and Type 1 T helper cell were significantly enriched in the low-risk group of patients(Figure 5B-5C). Subsequently, we further investigated the effect of infiltrating T cells CD4 memory resting in the tumor microenvironment on the prognosis of HCC patients, and we found that the overall survival time of patients in the highly infiltrated group was significantly better than that in the low infiltrated group(Figure 5D).

We calculated 365 HCC patients' StromalScore, ImmuneScore, ESTIMATEScore, and TumorPurity. High-risk patients had higher StromalScore, ESTIMATEScore, and TumorPurity scores (Figure S6A-S6D).

In summary, high-risk patients exhibited higher immune cell infiltration, and infiltrated immune cells demonstrated a negative link with patient prognosis, motivating us to further research the function of immune cells in the high-risk tumor microenvironment.

Relationship between Mutations, MSI and prognostic signature

Comparing the association between prognostic signature and somatic mutations, we discovered that the frequency of somatic mutations was much greater in the high-risk group (91.38%) than in the low-risk group (83.52%)(Figure 6A-6B). LRP1B and OBSCN were unique to the high-risk group among the top 10 mutant genes, while BAP1 and HERC2 were unique to the low-risk group. Curiously, tumor mutation burden(TMB) was not substantially different across high- and low-risk groups (p>0.05) (Figure 6C), while high-risk patients had higher median TMB values. We found that TMB was not a prognostic factor in HCC(Figure 6D). Previous studies have confirmed that MSI can be used as a biological marker for immunotherapy response assessment in a variety of tumors^[15]. In this study, a strong positive association was established between MSI score and prognostic signature(Figure 6E), and MSI score has the ability to affect the prognosis of HCC patients(Figure 6F). These findings show that high-risk individuals have a higher prevalence of somatic mutations, TMB and MSI, and may respond better to immunotherapy.

Prognostic signature can predict immunotherapy response

Expression of immune checkpoint-related genes plays an important role in HCC immunotherapy. We first compared the relationship between the expression of immune checkpoint-related genes, which has been demonstrated in a variety of cancers, and prognostic signature. We found that the expression levels of immune checkpoint-associated genes were significantly higher in patients of the high-risk group than the low-risk group(Figure 7A). Subsequently, we took a closer look at the six most studied immune checkpoint-related genes, including PD-L1, PD1, TIGIT, TIM-3, CTLA4, and PD-L2, and found that the expression levels of PD1, TIGIT, TIM-3, and CTLA4 were significantly higher in the high-risk group than in the low-risk group(Figure S7A-S7F).

We next analyzed the differences in Tumor Immune Dysfunction and Exclusion (TIDE) scores between the high-risk and low-risk groups and discovered that the high-risk group had considerably lower TIDE scores than the low-risk group(Figure 7B-7D). In order to further validate the accuracy of the prognostic signature for immunotherapy prediction, we introduced IMvigor210 immunotherapy data from a publicly available database containing detailed clinical treatment information and bulk RNA sequencing information for 195 bladder cancer patients receiving PD-L1 inhibitors. We calculated the risk scores of them by using the formula of the pre-constructed prognostic signature, and divided these 195 patients into high-risk or low-risk group based on the median value. Interestingly, we found a significant correlation between risk score and immunotherapy response in the IMvigor210 data(Figure 7E), with significantly higher CR/PR(CR: complete response; PR: partial response) rates after immunotherapy in the high-risk group than in the low-risk group(Figure 7F). Our study validated that the high-risk patients identified by our signature were more sensitive to and suitable for immunotherapy.

In summary, our prognostic signature can accurately predict the prognosis of HCC patients, and the accuracy of the signature was validated in an external dataset. Through a comprehensive analysis of immune cell infiltration, MSI, TMB, somatic mutations and TIDE, we found that patients in the high-risk group were more suitable for immunotherapy. The efficacy of the prognostic signature in predicting response to immunotherapy was confirmed in an external dataset. Therefore, we believe that our prognostic signature not only excels in predicting the prognosis of HCC patients, but also effectively predicts the response to immunotherapy in HCC patients, and has the potential to be an effective tool for clinicians to screen the population suitable for immunotherapy in the future.

HCC is a prevalent cancer and the second leading cause of cancer-related death globally^[16]. HCC accounts for 75%-90% of all diagnosed primary liver cancer cases and 830,000 deaths from this disease worldwide in 2020^[17]. Despite the significant therapeutic breakthroughs, its’ unfriendly clinical features, such as late-stage diagnosis, treatment resistance and frequent recurrence and metastasis contribute to HCC's low5-year survival rates in the United States^{[18, 19]} and Asia^[20]. It is long been confirmed by studies that the TME is highly correlated with the occurrence and development of HCC^{[21, 22]}. In the TME, tumor cells interact with numerous immune cells, including tumor infiltrating lymphocytes (TIL), CD8+ cytotoxic T lymphocytes (CTL), regulatory T lymphocytes (Treg) and tumor-associated macrophages (TMC), to form a complex communication network. In particular, TAMs and myeloid-derived suppressor cells (MDSCs) play an important role in tumor cell genesis, development, and metastasis^[23]. Although many studies have demonstrated the ability of immunotherapy on improving the clinical prognosis of HCC, only a very small percentage of patients can benefit from it. In clinical practice, immune checkpoint inhibitors are successful in 15% to 25% of patients, with most patients not responding to these drugs and even a portion of the population developing hyper-progressive disease state after immunotherapy^[24]. Unlike other types of solid tumors, hepatocellular carcinoma lacks effective markers to predict response to immunotherapy due to its unique immune microenvironment and its status quo of not being widely used in HCC. Hence, it’s urgent to explore effective biological markers to predict the response to immunotherapy in HCC patients, so as to screen the population suitable for immunotherapy, improve the response rate of immunotherapy and reduce drug toxic side effects.

In this study, we constructed prognostic signature using nine genes associated with cell developmental trajectories. This signature had a nice predictive value on the prognosis of HCC patients, and the predictive efficacy was further confirmed by an independent external dataset. To further reveal the molecular mechanisms behind the prognostic signature, we explored the relationships between prognostic signature and signaling pathways, somatic mutations, TMB, immune cell infiltration, and MSI. As we showed above, the prognostic signature significantly and positively correlated with T-cell infiltration, somatic mutation frequency, and MSI. This surprising finding prompted us to further explore the relationship between the prognostic signature and immunotherapy response. Interestingly, we found that the expression levels of immune checkpoint-related genes were significantly upregulated in the high-risk group of patients in the prognostic signature than in the low-risk group. Subsequently, we further confirmed the accuracy of the prognostic signature for predicting immunotherapy response using TIDE scores and external immunotherapy data

Other investigations have corroborated the important genes involved in signature creation in our work. The proliferation rate, transition through the cell cycle, migration, and invasion were all reduced when CCT5 was knocked down in an HCC cell line, but elevated when CCT5 was overexpressed, as reported by Liu^[25]. Shen^[26] et al concluded that FLVCR1 was connected with the metabolism of HCC cells and that its overexpression predicted a bad prognosis for HCC patients. Researchers discovered that pancreatic ductal adenocarcinoma cells' CCDC88A expression was elevated in their protrusions, and that these membrane protrusions contributed to the cells' increased motility and invasiveness^[27]. Alcohol dehydrogenase 4 (ADH4) is a major member of the alcohol dehydrogenase family that metabolizes numerous substrates, including ethanol and retinol. ADH4 expression was linked to pathology grade and serum AFP. The overall survival of HCC patients with low ADH4 level was considerably worse than patients whose ADH4 level was high^[28]. According to Xi^[29], CENPE was connected with the prognosis of HCC and might be employed as a molecular marker to predict the prognosis of HCC. Sonohara^[30] et al found that AKR1B10 mRNA levels in HCC and pericarcinomatous tissues can predict prognosis after curative hepatectomy, with low expression in HCC tissue indicating a poor prognosis. COMMD3, EIF2B4, and ABCB6 are also linked to HCC prognosis^[31-33].

The higher predictive performance of the prediction signature in the test and validation sets of HCC patients motivated us to further study the molecular causes behind it. GSEA enrichment analysis revealed that OOCYTE_MEIOSIS and CELL_CYCLE signaling pathways considerably enriched in the high-risk group. The CELL_CYCLE signaling pathway involved in the proliferation and development of tumor cells and may partially explain the poorer prognosis of individuals in the high-risk group. TME plays as an important role in tumor progression, immunotherapy, and tumor drug resistance. In this study, we found a significant correlation between the prognostic signature and the infiltration of immune cells. The percentage of infiltrated T cells and Treg cells in the tumor microenvironment was significantly higher in the high-risk group than in the low-risk group, but this was contradicted by the poor prognosis of the high-risk group. Therefore, we hypothesized that the immune cells in TME in the high-risk group were immunosuppressed. To test our speculation, we compared the prognostic signature with the expression of immune checkpoint-related genes and found that the expression levels of TIGIT, PD1, TIM-3,and CTLA4 genes were significantly higher in the high-risk group than in the low-risk group, confirming the immunosuppressed state of immune cells in the high-risk patients.

A high mutational burden in malignancies such as melanoma and non-small cell lung cancer (NSCLC) may be a response predictor for PD-1/PD-L1 inhibitors ^[34-36]. Multiple studies have indicated that MSI can be used as a biomarker for immunotherapy prediction^[37-39]. MSI can also serve as a prognostic indicator; for instance, people with MSI in colorectal cancer had a better prognosis than those without MSI^[40]. In our study, we found a positive correlation between prognostic signature and MSI scores, with higher MSI scores in patients in the high-risk group, as well as MSI scores as a factor affecting the prognosis of HCC patients. Although, there was no significant correlation between TMB score and prognostic signature(p>0.05), the median value of TMB was higher in the high-risk group than low-risk group. From the results shown above, we may conclude that the high-risk patients would be more responsive to and suitable for immunotherapy.

We then compared the Tumor Immune Dysfunction and Exclusion scores of the two groups and discovered that the high-risk group had lower scores than the low-risk group, indicating the presence of an immunosuppressive state and the increased likelihood that immunotherapy will be able to reverse this state and improve the response rate to immunotherapy in the high-risk group. Meanwhile, the expression levels of immune checkpoint-related genes were significantly higher in the high-risk groups, implying the presence of immunosuppression in this group. Combining MSI, TMB, TIDE, immune cell infiltration, and immune checkpoint-related gene expression, we infer that high-risk patients may be more responsive to and suitable for immunotherapy. Our speculation was validated in the external immunotherapy treatment data (IMvigor210).

PD-1 inhibitors prevent the binding of PDL1 to PD1, thus activating T cells and promoting their proliferation to eliminate tumor cells^{[41, 42]}. CTLA4 antibody inhibits the co-stimulatory signaling pathway mediated by CTLA-4 binding to the B7 family of antigen-presenting cells (CD86, CD80) on the surface of T-reg cells, hence deregulating T-reg cells' negative regulation on tumor-killing T cells (CD8+)^{[43, 44]}. Meanwhile, several investigations have shown that TIM-3 and TIGIT inhibitors can augment the efficacy of PD1 and CTLA-4 inhibitors^[45-47]. As the high-risk group shows upregulated levels of immune checkpoint-related genes, it is obvious that the high-risk group is resistant to PD-1 inhibitors, CTLA4 inhibitors, TIM-3 inhibitors, and TIGIT inhibitors, such as pembrolizumab, ipilimumab, tremelimumab, and sabatolimab.

Although our prognostic signature has good performance in predicting prognosis and immunotherapy response in HCC patients, limitations still remain as follows: 1. lack of a large number of clinical sequencing samples to validate our findings; 2. lack of basic experiments to validate the expression of the 9 genes that constitute the prognostic signature.

In this study, we combined the analysis of single cell sequencing data and bulk RNA sequencing data to construct a prognostic signature based on genes associated with cell developmental trajectories. The prognostic signature can effectively predict not only the prognosis of HCC patients, but also the immunotherapy response. The predictive efficacy of the prognostic signature was all validated in an external independent dataset. In the future, our prognostic signature has the potential to become a useful tool to assist clinicians in screening populations for immunotherapy and specifying individualized treatment regimens.

Acknowledgements Thanks to all those who helped in data preparation and paper writing.

Data availability Gene expression profiles, clinical information, and mutation data of HCC in this study are available from the public database (TCGA, https://portal.gdc.cancer.gov/). GSE109211, GSE213615, GSE10143, GSE14520 and GSE104580were downloaded from GEO database. The ICGC-LIHC data in the validation set were downloaded from the International Cancer Genome Consortium ( ICGC ,https://dcc.icgc. org/).

Conflict of interest The authors declare that they have no conflict of

interest.

Funding

No Funding.

Author contributions

Biao Gao and Yafei Wang performed the data analysis and wrote the manuscript. These authors contributed equally to this work. Chonghui Li and Shichun Lu reviewed and revised the manuscript. These corresponding authors contributed equally to this work.All authors read and approved the final manuscript.

Acknowledgements

We would like to acknowledge the TCGA, ICGC and GEO for providing relevant data.

Sung H, Ferlay J, Siegel R L, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries[J]. CA Cancer J Clin, 2021,71(3):209-249. DOI: 10.3322/caac.21660.
Sim H W, Knox J. Hepatocellular carcinoma in the era of immunotherapy[J]. Curr Probl Cancer, 2018,42(1):40-48. DOI: 10.1016/j.currproblcancer.2017.10.007.
Amicone L, Marchetti A. Microenvironment and tumor cells: two targets for new molecular therapies of hepatocellular carcinoma[J]. Transl Gastroenterol Hepatol, 2018,3:24. DOI: 10.21037/tgh.2018.04.05.
Llovet J M, Castet F, Heikenwalder M, et al. Immunotherapies for hepatocellular carcinoma[J]. Nat Rev Clin Oncol, 2022,19(3):151-172. DOI: 10.1038/s41571-021-00573-2.
Santhakumar C, Gane E J, Liu K, et al. Current perspectives on the tumor microenvironment in hepatocellular carcinoma[J]. Hepatology International, 2020,14(6):947-957. DOI: 10.1007/s12072-020-10104-3.
Tahmasebi B M, Carloni V. Tumor Microenvironment, a Paradigm in Hepatocellular Carcinoma Progression and Therapy[J]. Int J Mol Sci, 2017,18(2). DOI: 10.3390/ijms18020405.
Amicone L, Marchetti A. Microenvironment and tumor cells: two targets for new molecular therapies of hepatocellular carcinoma[J]. Transl Gastroenterol Hepatol, 2018,3:24. DOI: 10.21037/tgh.2018.04.05.
Noonan A, Pawlik T M. Hepatocellular carcinoma: an update on investigational drugs in phase I and II clinical trials[J]. Expert Opin Investig Drugs, 2019,28(11):941-949. DOI: 10.1080/13543784.2019.1677606.
Cheng H, Sun G, Chen H, et al. Trends in the treatment of advanced hepatocellular carcinoma: immune checkpoint blockade immunotherapy and related combination therapies[J]. Am J Cancer Res, 2019,9(8):1536-1545.
Zhang J, Dang F, Ren J, et al. Biochemical Aspects of PD-L1 Regulation in Cancer Immunotherapy[J]. Trends Biochem Sci, 2018,43(12):1014-1032. DOI: 10.1016/j.tibs.2018.09.004.
Gohil S H, Iorgulescu J B, Braun D A, et al. Applying high-dimensional single-cell technologies to the analysis of cancer immunotherapy[J]. Nature Reviews Clinical Oncology, 2021,18(4):244-256. DOI: 10.1038/s41571-020-00449-x.
Kautto E A, Bonneville R, Miya J, et al. Performance evaluation for rapid detection of pan-cancer microsatellite instability with MANTIS[J]. Oncotarget, 2017,8(5):7452-7463. DOI: 10.18632/oncotarget.13918.
Newman A M, Steen C B, Liu C L, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry[J]. Nat Biotechnol, 2019,37(7):773-782. DOI: 10.1038/s41587-019-0114-2.
Hänzelmann S, Castelo R, Guinney J. GSVA: gene set variation analysis for microarray and RNA-seq data[J]. BMC Bioinformatics, 2013,14:7. DOI: 10.1186/1471-2105-14-7.
Lorenzi M, Amonkar M, Zhang J, et al. Epidemiology of Microsatellite Instability High (MSI-H) and Deficient Mismatch Repair (dMMR) in Solid Tumors: A Structured Literature Review[J]. Journal of Oncology, 2020,2020:1807929. DOI: 10.1155/2020/1807929.
EASL Clinical Practice Guidelines: Management of hepatocellular carcinoma[J]. J Hepatol, 2018,69(1):182-236. DOI: 10.1016/j.jhep.2018.03.019.
Sung H, Ferlay J, Siegel R L, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries[J]. CA Cancer J Clin, 2021,71(3):209-249. DOI: 10.3322/caac.21660.
Altekruse S F, Henley S J, Cucinelli J E, et al. Changing hepatocellular carcinoma incidence and liver cancer mortality rates in the United States[J]. Am J Gastroenterol, 2014,109(4):542-553. DOI: 10.1038/ajg.2014.11.
Xu L, Kim Y, Spolverato G, et al. Racial disparities in treatment and survival of patients with hepatocellular carcinoma in the United States[J]. Hepatobiliary Surg Nutr, 2016,5(1):43-52. DOI: 10.3978/j.issn.2304-3881.2015.08.05.
Zhang G, Li R, Deng Y, et al. Conditional survival of patients with hepatocellular carcinoma: results from the Surveillance, Epidemiology, and End Results registry[J]. Expert Rev Gastroenterol Hepatol, 2018,12(5):515-523. DOI: 10.1080/17474124.2018.1453806.
Ringelhan M, Pfister D, O'Connor T, et al. The immunology of hepatocellular carcinoma[J]. Nat Immunol, 2018,19(3):222-232. DOI: 10.1038/s41590-018-0044-z.
Liu Z, Liu X, Liang J, et al. Immunotherapy for Hepatocellular Carcinoma: Current Status and Future Prospects[J]. Frontiers in immunology, 2021,12:765101. DOI: 10.3389/fimmu.2021.765101.
Chew V, Lai L, Pan L, et al. Delineation of an immunosuppressive gradient in hepatocellular carcinoma using high-dimensional proteomic and transcriptomic analyses[J]. Proc Natl Acad Sci U S A, 2017,114(29):E5900-E5909. DOI: 10.1073/pnas.1706559114.
Zhang J, Dang F, Ren J, et al. Biochemical Aspects of PD-L1 Regulation in Cancer Immunotherapy[J]. Trends Biochem Sci, 2018,43(12):1014-1032. DOI: 10.1016/j.tibs.2018.09.004.
Liu J, Huang L, Zhu Y, et al. Exploring the Expression and Prognostic Value of the TCP1 Ring Complex in Hepatocellular Carcinoma and Overexpressing Its Subunit 5 Promotes HCC Tumorigenesis[J]. Front Oncol, 2021,11:739660. DOI: 10.3389/fonc.2021.739660.
Shen Y, Li X, Zhao B, et al. Iron metabolism gene expression and prognostic features of hepatocellular carcinoma[J]. J Cell Biochem, 2018,119(11):9178-9204. DOI: 10.1002/jcb.27184.
Tanouchi A, Taniuchi K, Furihata M, et al. CCDC88A, a prognostic factor for human pancreatic cancers, promotes the motility and invasiveness of pancreatic cancer cells[J]. J Exp Clin Cancer Res, 2016,35(1):190. DOI: 10.1186/s13046-016-0466-0.
Wei R R, Zhang M Y, Rao H L, et al. Identification of ADH4 as a novel and potential prognostic marker in hepatocellular carcinoma[J]. Med Oncol, 2012,29(4):2737-2743. DOI: 10.1007/s12032-011-0126-3.
Xi Y B, Zhang H M, Yang B, et al. [Bioinformatics analysis of genes related to poor prognosis of human hepatocellular carcinoma and its clinical significance][J]. Zhongguo Ying Yong Sheng Li Xue Za Zhi, 2019,35(1):90-96. DOI: 10.12047/j.cjap.5764.2019.021.
Sonohara F, Inokawa Y, Hishida M, et al. Prognostic significance of AKR1B10 gene expression in hepatocellular carcinoma and surrounding non-tumorous liver tissue[J]. Oncol Lett, 2016,12(6):4821-4828. DOI: 10.3892/ol.2016.5240.
Wang X, He S, Zheng X, et al. Transcriptional analysis of the expression, prognostic value and immune infiltration activities of the COMMD protein family in hepatocellular carcinoma[J]. BMC Cancer, 2021,21(1):1001. DOI: 10.1186/s12885-021-08699-3.
Li M, Liu Z, Wang J, et al. Systematic Analysis Identifies a Specific RNA-Binding Protein-Related Gene Model for Prognostication and Risk-Adjustment in HBV-Related Hepatocellular Carcinoma[J]. Front Genet, 2021,12:707305. DOI: 10.3389/fgene.2021.707305.
Polireddy K, Chavan H, Abdulkarim B A, et al. Functional significance of the ATP-binding cassette transporter B6 in hepatocellular carcinoma[J]. Mol Oncol, 2011,5(5):410-425. DOI: 10.1016/j.molonc.2011.07.005.
Rizvi H, Sanchez-Vega F, La K, et al. Molecular Determinants of Response to Anti-Programmed Cell Death (PD)-1 and Anti-Programmed Death-Ligand 1 (PD-L1) Blockade in Patients With Non-Small-Cell Lung Cancer Profiled With Targeted Next-Generation Sequencing[J]. J Clin Oncol, 2018,36(7):633-641. DOI: 10.1200/JCO.2017.75.3384.
High TMB Predicts Immunotherapy Benefit[J]. Cancer Discov, 2018,8(6):668. DOI: 10.1158/2159-8290.CD-NB2018-048.
Goodman A M, Piccioni D, Kato S, et al. Prevalence of PDL1 Amplification and Preliminary Response to Immune Checkpoint Blockade in Solid Tumors[J]. JAMA Oncol, 2018,4(9):1237-1244. DOI: 10.1001/jamaoncol.2018.1701.
Bairwa N K, Saha A, Gochhait S, et al. Microsatellite instability: an indirect assay to detect defects in the cellular mismatch repair machinery[J]. Methods Mol Biol, 2014,1105:497-509. DOI: 10.1007/978-1-62703-739-6_35.
Dudley J C, Lin M T, Le DT, et al. Microsatellite Instability as a Biomarker for PD-1 Blockade[J]. Clin Cancer Res, 2016,22(4):813-820. DOI: 10.1158/1078-0432.CCR-15-1678.
Le DT, Durham J N, Smith K N, et al. Mismatch repair deficiency predicts response of solid tumors to PD-1 blockade[J]. Science, 2017,357(6349):409-413. DOI: 10.1126/science.aan6733.
Boland C R, Goel A. Microsatellite Instability in Colorectal Cancer[J]. Gastroenterology, 2010,138(6):2073-2087. DOI: https://doi.org/10.1053/j.gastro.2009.12.064.
Zongyi Y, Xiaowu L. Immunotherapy for hepatocellular carcinoma[J]. Cancer Lett, 2020,470:8-17. DOI: 10.1016/j.canlet.2019.12.002.
Cheng A L, Hsu C, Chan S L, et al. Challenges of combination therapy with immune checkpoint inhibitors for hepatocellular carcinoma[J]. J Hepatol, 2020,72(2):307-319. DOI: 10.1016/j.jhep.2019.09.025.
Leone P, Solimando A G, Fasano R, et al. The Evolving Role of Immune Checkpoint Inhibitors in Hepatocellular Carcinoma Treatment[J]. Vaccines (Basel), 2021,9(5). DOI: 10.3390/vaccines9050532.
Leone P, Solimando A G, Fasano R, et al. The Evolving Role of Immune Checkpoint Inhibitors in Hepatocellular Carcinoma Treatment[J]. Vaccines (Basel), 2021,9(5). DOI: 10.3390/vaccines9050532.
Das M, Zhu C, Kuchroo V K. Tim-3 and its role in regulating anti-tumor immunity[J]. Immunol Rev, 2017,276(1):97-111. DOI: 10.1111/imr.12520.
Ramos-Casals M, Brahmer J R, Callahan M K, et al. Immune-related adverse events of checkpoint inhibitors[J]. Nat Rev Dis Primers, 2020,6(1):38. DOI: 10.1038/s41572-020-0160-6.
Chiu D K, Yuen V W, Cheu J W, et al. Hepatocellular Carcinoma Cells Up-regulate PVRL1, Stabilizing PVR and Inhibiting the Cytotoxic T-Cell Response via TIGIT to Mediate Tumor Resistance to PD1 Inhibitors in Mice[J]. Gastroenterology, 2020,159(2):609-623. DOI: 10.1053/j.gastro.2020.03.074.

No competing interests reported.

Supplement.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

Single cell RNA sequence combined with bulk RNA sequence to construct and validate a novel prognostic signature based on cell developmental trajectory- related genes to predict prognosis and immunotherapeutic response in Hepatocellular Carcinoma

Status:

Version 1

Abstract

Figures

Introduction

Materials and Methods

Results

Discussion

Conclusion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1