Identification of cuproptosis-related subtypes, the establishment of a prognostic model, and exploration of drug candidates in urothelial carcinoma

doi:10.21203/rs.3.rs-2213491/v1


Research Article



This work is licensed under a CC BY 4.0 License

Version 1


Background

Urothelial carcinoma (UC) originates in the urinary tract and can arise from the renal pelvis, ureter, urinary bladder, or urethra. Cuproptosis, a recently described form of programmed cell death, is linked to tumor progression and microenvironment homeostasis. Nevertheless, the biological functions of cuproptosis-related genes (CRGs) in UC remain unknown.

Methods

Data from 972 UC patients were obtained from three Gene Expression Omnibus (GEO) datasets and The Cancer Genome Atlas (TCGA) database. R software was used to identify distinct cancer clusters and to characterize their relationships with clinicopathological characteristics and the tumor microenvironment.

Results

First, the 972 UC patients were classified into three CRG clusters based on 17 prognostic CRGs; the clusters showed significantly different prognoses, clinical features, and immune infiltration statuses (P < 0.001). Specifically, cluster C resembled an immune-inflamed phenotype, whereas cluster B was more consistent with an immune-desert pattern. Three cuproptosis gene clusters were also identified according to differentially expressed genes (DEGs), again with significantly different prognoses (P < 0.001). Second, we developed and validated a cuproptosis risk model with the formula CRG_score = (-0.0887 * HSD17B2) + (0.2014 * KDELR3) + (0.1125 * EFEMP1) + (0.1118 * TMEM45A). All samples were randomly divided into training and validation sets, and a high risk score was strongly correlated with poor prognosis in both sets. In addition, the two risk groups differed significantly in tumor microenvironment, mutation landscape, and sensitivity to anti-tumor agents. Finally, a nomogram was constructed to predict individual risk, with AUC values of 0.78, 0.79, and 0.81 at 1, 3, and 5 years, respectively.

Conclusions

This study revealed the underlying biological patterns of cuproptosis-related genes in UC. We developed a CRG risk score that could act as a practical prognostic model to predict the clinical outcome of individual UC patients.

Urothelial carcinoma

Cuproptosis

Prognostic signature

Tumor microenvironment

Drug sensitivity

Urothelial carcinoma is a heterogeneous malignancy that derives from the epithelium of the urinary tract. It includes urothelial carcinoma of the bladder (UBUC) and of the upper tract (UTUC) [1, 2]. Radical nephroureterectomy with ipsilateral bladder cuff excision is the standard treatment for UTUC [1], and patients with muscle-invasive UBUC are recommended to receive radical cystectomy and systemic chemotherapy [2]. Gemcitabine plus cisplatin (GC) is considered the gold-standard first-line systemic chemotherapy for patients with advanced or metastatic UC, but long-term survival remains poor. Although immune checkpoint inhibitors and targeted agents have emerged as promising treatments for advanced UC, the overall response rate remains unsatisfactory and varies among unselected patient populations [3]. It is therefore vital to characterize the underlying biology of UC and to find a reliable model that predicts each patient’s prognosis accurately.

Appropriate concentrations of metals are essential for maintaining cellular protein homeostasis and cell function in the human body. One such metal is copper, which acts as a cofactor of metabolic enzymes in a range of biological processes, including oxidation resistance, mitochondrial respiration, and detoxification [4]. Copper is now recognized as essential for tumor growth, angiogenesis, and metastasis, and increased cellular copper concentration is associated with tumor progression [5]. Recent research by Tsvetkov et al. established that cuproptosis is an intracellular copper-dependent type of cell death that is distinct from necrosis, apoptosis, autophagy, and ferroptosis [6]. The researchers found that elesclomol, a copper ionophore, could not kill tumor cells in the absence of copper, indicating that this mode of cell death was induced by copper toxicity. Furthermore, cuproptosis was shown to be a distinct mode of programmed cell death because it was not inhibited by drugs that target the apoptotic or necrotic pathways. More specifically, cells that relied heavily on mitochondrial respiration were remarkably vulnerable to cuproptosis, suggesting a close relationship between copper-induced cell death and the tricarboxylic acid (TCA) cycle. A previous study demonstrated that the TCA cycle can help cancer cells adapt to the metastatic microenvironment, thus promoting tumor progression [7]. The study by Tsvetkov et al. also identified seven genes that promote cuproptosis (LIAS, FDX1, DLD, LIPT1, PDHA1, DLAT, and PDHB) and three genes that inhibit it (GLS, MTF1, and CDKN2A) [6]. Few therapeutic options exist for patients with progressive or relapsed urothelial carcinoma after first-line platinum-containing chemotherapy, and chemoresistance remains a challenge for clinical oncologists. A previous study found that replacing platinum with copper in coordination complexes could resensitize tumor cells to platinum-based drugs [5]. In addition, copper transporter receptor 1 (CTR1) functions in platinum uptake, and its expression level can act as a biomarker of platinum sensitivity in UBUC patients [8]. Because inhibition of cell death is an essential factor in drug resistance, exploring the patterns of cuproptosis in UC can help us better understand its role in UC development and the tumor microenvironment.

In this study, we characterized the biological patterns of 19 cuproptosis-related genes in urothelial carcinoma, constructed a prognostic model for individual patients, and elucidated the relationship of these genes with the tumor microenvironment. Clinicians could use this scoring model to better assess each UC patient and to help determine the most effective therapy.

Data collection and preprocessing

RNA sequencing (RNA-seq) data and corresponding clinical follow-up data for bladder urothelial carcinoma were gathered from the TCGA database (https://tcga-data.nci.nih.gov/tcga/) and the GEO website (GSE13507, GSE31684, and GSE32894) (https://www.ncbi.nlm.nih.gov/geo/). For the TCGA bladder cancer (BLCA) expression data, we converted fragments per kilobase million (FPKM) values to transcripts per million (TPM). For the three GEO datasets, we downloaded the series matrix files and selected the tumor samples. We then converted probe identifiers to gene names using the annotation of the respective GEO platforms (GPL). Probes without a gene annotation were removed, and the mean expression value was calculated for probes mapped to the same gene. After excluding samples with missing follow-up information or uncertain survival status, we combined the four datasets and adjusted for batch effects using the R package “sva” [9, 10]. In total, data from 972 patients with UC were included in further analyses. Based on a previously published study [6], we chose to investigate 19 genes involved in cuproptosis.
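The two preprocessing steps described above, FPKM-to-TPM conversion and averaging of probes that map to the same gene, can be illustrated with a minimal Python sketch. This is not the authors' R pipeline; the function names and the sample values are hypothetical and serve only to show the arithmetic.

```python
from collections import defaultdict


def fpkm_to_tpm(fpkm):
    """Convert one sample's FPKM values to TPM.

    fpkm: dict mapping gene symbol -> FPKM value for a single sample.
    """
    total = sum(fpkm.values())
    if total == 0:
        raise ValueError("sample has no expressed genes")
    # TPM_i = FPKM_i / sum_j(FPKM_j) * 1e6, so every sample sums to one million.
    return {gene: value / total * 1e6 for gene, value in fpkm.items()}


def collapse_probes(probe_rows):
    """Average probes mapped to the same gene; drop probes without a gene symbol.

    probe_rows: iterable of (gene_symbol, expression_value) pairs.
    """
    grouped = defaultdict(list)
    for gene, value in probe_rows:
        if gene:  # skip probes with an empty annotation
            grouped[gene].append(value)
    return {gene: sum(values) / len(values) for gene, values in grouped.items()}


# Hypothetical values, for illustration only.
sample_fpkm = {"HSD17B2": 2.0, "KDELR3": 6.0, "EFEMP1": 12.0}
sample_tpm = fpkm_to_tpm(sample_fpkm)

probes = [("KDELR3", 1.0), ("KDELR3", 3.0), ("", 5.0), ("EFEMP1", 2.0)]
gene_level = collapse_probes(probes)
```

Because TPM rescales each sample to the same total, it is generally easier to compare across samples than FPKM, which is presumably why the conversion precedes the merge with the GEO data.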

Somatic mutations, tumor mutation burden and copy number variation of CRGs

Somatic mutation and copy number variation (CNV) data for BLCA were obtained from the TCGA database. We calculated the tumor mutation burden (TMB) of each TCGA-BLCA sample from the somatic mutation data as the number of somatic variants per million base pairs. To examine the distribution of somatic mutations in the 19 CRGs, we used the R package "maftools" [11]. We also calculated the copy number alterations of the CRGs and displayed their locations on the chromosomes. Through univariate Cox analysis, we identified cuproptosis-related genes significantly associated with prognosis (p-value < 0.05; HR > 1, risk factor; HR < 1, protective factor). For each gene, patients were divided into high- and low-expression groups at the optimal expression cutoff.
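The TMB definition above (somatic variants per million base pairs) reduces to a single division. The sketch below is a hedged illustration in Python: the function name is hypothetical, and the default of 38 megabases is an assumed exome capture size, since the text does not state which denominator was used.

```python
def tumor_mutation_burden(n_variants, covered_megabases=38.0):
    """Return somatic variants per megabase for one sample.

    covered_megabases is an ASSUMPTION (a commonly used exome size);
    the source text does not give the denominator.
    """
    if covered_megabases <= 0:
        raise ValueError("covered_megabases must be positive")
    return n_variants / covered_megabases


# Hypothetical sample with 190 somatic variants.
tmb = tumor_mutation_burden(190)
```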

Unsupervised clustering for 19 CRGs

We obtained expression data for the 19 CRGs from the merged dataset. Using the expression of the prognosis-related CRGs, we performed unsupervised clustering with the R package "ConsensusClusterPlus", with a maximum evaluated k (maxK) of 9 [12]. To achieve the optimal clustering, we chose the solution with clear separation between groups and balanced sample sizes; on this basis, all UC samples were classified into three groups according to the expression levels of the prognostic CRGs. In addition, the R package “survminer” (www.sthda.com/english/rpkgs/survminer) was used to visualize the Kaplan–Meier survival curves of the three clusters.
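The Kaplan–Meier curves referred to above rest on the product-limit estimator: at each observed death time, the running survival probability is multiplied by the fraction of at-risk patients who survive that time. The following Python sketch shows that calculation only; it is not the survminer implementation, and the function name and toy data are hypothetical.

```python
def kaplan_meier(times, events):
    """Return [(time, survival_probability)] at each distinct death time.

    times:  follow-up time per patient.
    events: 1 if the patient died at that time, 0 if censored.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Gather everyone whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        # Deaths and censored patients both leave the risk set.
        at_risk -= leaving
    return curve


# Hypothetical cohort: the second patient is censored at time 2.
curve = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
```

Censored patients do not lower the curve directly, but they shrink the risk set, which is why the drop at time 3 in this toy cohort is larger than the drop at time 1.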

Heterogeneous characteristics of the three clusters

The R package ‘pheatmap’ (https://github.com/raivokolde/pheatmap) was used to display the expression differences of the 17 prognosis-related CRGs among the three clusters, along with clinical indicators including age, N stage, and T stage. Gene set variation analysis (GSVA) was performed with the R package "GSVA" to explore the biological processes that differ across clusters. GSVA is a flexible technique for estimating variation in pathway activity across a sample population in an unsupervised manner. For the GSVA analysis, the gene set "c2.cp.kegg.symbols.gmt" was downloaded from the MSigDB database. In addition, we performed single-sample GSEA (ssGSEA) using the immune gene set "immune.gmt" to estimate the relative abundance of immune cells in each UC sample and compared the three clusters. Finally, we applied principal component analysis (PCA) for dimensionality reduction of the samples.

DEGs identification and pathways enrichment analysis

Using the R package "limma" with filter criteria of |logFC| > 0.585 and adjusted P-value < 0.05, we identified differentially expressed genes (DEGs) across the three CRG clusters. We then examined the pathways in which the DEGs were highly enriched. We used the KEGG REST API (https://www.kegg.jp/kegg/rest/keggapi.html) to obtain the most recent gene annotation for KEGG pathways. For GO analysis, the gene annotation in the R package org.Hs.eg.db (version 3.1.0) was used as the background set to which genes were mapped. Enrichment analysis was performed with the R package clusterProfiler (version 3.14.3). A p-value < 0.05 was considered statistically significant.
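The DEG filter can be sketched in a few lines of Python. The direction of the thresholds is an assumption here (absolute logFC above 0.585 and adjusted P-value below 0.05), and the function name and example rows are hypothetical. Since limma reports fold changes on a log2 scale, a cutoff of 0.585 corresponds to roughly a 1.5-fold change.

```python
LOGFC_CUTOFF = 0.585  # log2 scale; 2 ** 0.585 is approximately 1.5
ADJ_P_CUTOFF = 0.05


def filter_degs(rows):
    """Keep genes passing both thresholds.

    rows: iterable of (gene, log2_fold_change, adjusted_p_value).
    ASSUMPTION: |logFC| must exceed the cutoff and adjusted P must fall below it.
    """
    return [
        gene
        for gene, logfc, adj_p in rows
        if abs(logfc) > LOGFC_CUTOFF and adj_p < ADJ_P_CUTOFF
    ]


# Hypothetical limma-style results, for illustration only.
results = [
    ("GENE1", 1.20, 0.001),   # up-regulated, significant
    ("GENE2", -0.90, 0.010),  # down-regulated, significant
    ("GENE3", 0.30, 0.001),   # significant but effect too small
    ("GENE4", 2.00, 0.200),   # large effect but not significant
]
degs = filter_degs(results)
fold_change_at_cutoff = 2 ** LOGFC_CUTOFF
```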

Identification of cuproptosis gene clusters and construction of the cuproptosis-related prognostic risk score

First, we applied univariate Cox regression to select DEGs with a prognostic association with overall survival. Then, based on the expression of these prognosis-related DEGs, we performed unsupervised clustering with the R package "ConsensusClusterPlus" to identify three cuproptosis gene clusters [12]. The optimal number of gene clusters was determined by the consensus clustering algorithm with clustering stability taken into account. We plotted Kaplan-Meier survival curves to examine differences in survival among the three gene clusters. To evaluate the effect of the cuproptosis pattern on individual UC samples, we built a cuproptosis-related prognostic gene score as follows. Starting from the prognosis-related DEGs, we performed Lasso regression with cross-validation and a P-value threshold of 0.05 using the R package “glmnet”; the coefficients at lambda.min were used for the final model. We then applied multivariate Cox regression to select the candidate cuproptosis-related prognostic genes. Finally, four genes were retained to construct the model, with the formula CRG-score = Σ (Exp_i * coef_i), where Exp_i is the expression of gene i and coef_i is its regression coefficient. To verify the accuracy of the model, all MIBC patients (n = 866) from the TCGA and GEO datasets were randomized in a 1:1 ratio to training (n = 433) and validation (n = 433) cohorts. After determining the CRG score for each sample, we classified patients into low-risk and high-risk groups in both the training and validation cohorts. Kaplan-Meier curves were constructed to assess differences in survival in the training, validation, and whole cohorts.

Using the "timeROC" R package, we analyzed the predictive ability of the CRG score in the training, validation, and all cohorts. We calculated the area under the receiver operating characteristic (ROC) curve to determine the risk model's predictive power (0.5 = no predictive ability, 1 = perfect prediction). Besides, we summarized and visualized predicted results through risk score plots.
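For orientation, the area under the ROC curve equals the probability that a randomly chosen patient with the event has a higher score than a randomly chosen patient without it, with ties counted as one half. The Python sketch below computes this plain, non-time-dependent AUC. It is a simplification: timeROC additionally accounts for censoring through inverse-probability weighting, which is omitted here. The function name and toy values are hypothetical.

```python
def rank_auc(scores, labels):
    """Plain AUC from pairwise comparisons.

    scores: risk score per patient.
    labels: 1 if the event occurred by the time point of interest, else 0.
    NOTE: censoring is ignored; this is not the timeROC estimator.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one patient in each class")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical scores: three of the four event/non-event pairs are ordered correctly.
auc_value = rank_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```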

Establishment and evaluation of a predictive nomogram

A nomogram was developed using the predictive factors identified in the multivariate analysis, including the CRG-score and several clinical variables [13]. ROC analysis, a calibration curve, and the C-index were used to assess the reliability of the nomogram. The closer the calibration curve lay to the 45-degree line and the closer the C-index was to 1, the better the predictive performance of the nomogram.
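The C-index used above can be understood as the fraction of comparable patient pairs in which the patient who died earlier also had the higher predicted risk. A minimal Python sketch of Harrell's C-index follows; it is an illustration under simplifying assumptions (no special handling of tied survival times), not the routine used by the authors, and all names and values are hypothetical.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C-index for right-censored survival data.

    A pair (i, j) is comparable when patient i has an observed death
    and a shorter follow-up time than patient j.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if events[i] != 1:
            continue  # a censored patient cannot be the earlier death
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical cohort: the third patient is censored at time 6.
c_index = concordance_index(
    times=[2, 4, 6, 8],
    events=[1, 1, 0, 1],
    risk_scores=[0.9, 0.4, 0.5, 0.1],
)
```

In this toy cohort four of the five comparable pairs are ordered correctly, so the C-index is 0.8; a value of 0.5 would indicate no discrimination.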

Correlations of the CRG-score with immune cell infiltration and the tumor microenvironment

CIBERSORT analysis (https://cibersort.stanford.edu/) was used to assess the association between the risk score and the abundance of various immune cell types, including macrophages, neutrophils, and CD8 T cells. The R package “estimate” was used to quantify the infiltration of stromal and immune cells in each sample as the Stromal score and the Immune score [14]. The ESTIMATE score is the sum of the two and is inversely related to tumor purity. Finally, we compared the three scores between the high- and low-risk groups using violin plots.

Mutation landscape and drug sensitivity analysis

The R package “maftools” was used to display the mutation landscapes of the low-risk and high-risk groups, along with each sample’s TMB value [11]. We also analyzed the relationship between the stemness score and the CRG risk score. Furthermore, to identify potential therapeutic agents, we used the R package “pRRophetic” to estimate the IC50 values of chemotherapeutic and targeted agents commonly used in clinical practice and compared them between the two risk groups.

Statistical analysis

R software (version 4.0.0) and Perl were used for all statistical analyses in this research. Differences between groups were examined with the Student t-test, the Wilcoxon test, or the Kruskal-Wallis test, and a p-value < 0.05 was considered statistically significant.

Mutation landscape and survival-related CRGs in urothelial carcinoma

Nineteen genes closely involved in cuproptosis were included in our study (Table S1). First, we compared the expression of the 19 CRGs between tumor and normal samples in the TCGA cohort (Fig. 1A). Compared with normal tissues, tumors showed significantly higher expression of SLC31A1, LIPT2, CDKN2A, and GCSH. Conversely, the expression of NFE2L2, NLRP3, ATP7A, DLD, MTF1, and DLT was significantly lower in tumors. The overall mutation landscape showed a relatively high mutation frequency of cuproptosis genes in urothelial carcinoma: among 414 TCGA-BLCA samples, 97 (23.43%) carried a mutation in at least one cuproptosis gene (Fig. 1B). We detected 12 mutated genes with different frequencies and mutation types. NFE2L2 showed the highest mutation frequency (7%), followed by CDKN2A (6%), NLRP3 (3%), ATP7B, MTF1, and DLD (2% each), and ATP7A, DLAT, LIAS, PDHA1, DBT, and GLS (1% each). We then examined the copy number variation (CNV) of all cuproptosis genes and found that copy number alterations were common (Fig. 1C), with significant CNV gain in LIPT2 and loss in CDKN2A, ATP7B, and PDHB. In addition, we displayed the chromosomal locations of the 19 CRGs along with their CNV status (Fig. 1D).

Cuproptosis subtype identification in UC

Univariate Cox regression analysis identified 17 CRGs with potential prognostic significance in UC patients. To better understand the relationships among these 17 prognostic CRGs, we drew a regulatory network showing their interactions and prognostic roles in UC patients (Fig. 2A). Interestingly, NLRP3, a risk factor, was negatively correlated with several favorable factors, such as ATP7A, LIAS, LIPT1, and NFE2L2. Next, using the optimal cutoff value of each prognostic cuproptosis gene determined by Kaplan-Meier (KM) survival analysis, patients from all three datasets were classified into high- and low-expression groups. For PDHA1, SLC31A1, NLRP3, MTF1, and DLST, high expression was linked to worse overall survival (OS), whereas for LIPT1, ATP7B, CDKN2A, PDHB, NFE2L2, FDX1, and LIAS, high expression was associated with longer OS (Figure S1).

To further elucidate the functional patterns and clinical relevance of these prognostic cuproptosis genes, we conducted an unsupervised cluster analysis based on the expression levels of the 17 CRGs. At k = 3, the subgroups were well separated, with clear distinction and clustering stability (Fig. 2B). As a result, 972 UC patients from TCGA and the three GEO cohorts were categorized into cluster A (n = 418), cluster B (n = 233), and cluster C (n = 321). PCA demonstrated clear discrimination among the three clusters based on CRG expression (Fig. 2F). Analysis of 23 immune cell types in the TME also revealed markedly different immune infiltration landscapes among the three clusters (Fig. 2E). Specifically, cluster C exhibited a remarkably higher degree of immune infiltration, especially of effector cells, including activated CD8 T cells, natural killer T cells, activated CD4 T cells, and macrophages. Surprisingly, regulatory T cells, which play an immunosuppressive role, also showed a high infiltration level in cluster C. We also compared CRG expression levels and clinical features, such as T and N stage, among the three clusters in a heatmap (Fig. 2D). GSVA revealed differences in biological behavior between clusters. As illustrated in Fig. 2G-2I, cluster B was enriched in drug and lipid metabolism processes, such as the arachidonic acid metabolism pathway and the linoleic acid metabolism pathway. CRG cluster C, in contrast, was significantly enriched in immune-related KEGG terms, including natural killer cell mediated cytotoxicity, T cell receptor signaling pathway, chemokine signaling pathway, Toll-like receptor signaling pathway, NOD-like receptor signaling pathway, and pancreatic cancer pathway, together with cell cycle pathways such as oocyte meiosis, progesterone-mediated oocyte maturation, and the cell cycle pathway. Survival analysis showed that cluster B had the best overall survival among the three groups and cluster C the worst (Fig. 2C, P < 0.001).

Generation of cuproptosis gene clusters in UC

Using the R package “limma”, we performed differential expression analysis among the three clusters and identified 185 DEGs (Fig. 3A and Table S2). These DEGs were then subjected to functional analysis. GO analysis showed that the DEGs were markedly enriched in the biological processes of leukocyte chemotaxis, leukocyte migration, extracellular matrix organization, external encapsulating structure organization, and extracellular structure organization, as well as in the molecular functions of chemokine receptor binding, chemokine activity, and MHC class II receptor activity, which are related to the recruitment of immune cells and the shaping of the tumor immune microenvironment (Fig. 3B). KEGG analysis showed that the DEGs were enriched in pathways of inflammatory responses and autoimmune diseases, such as ECM-receptor interaction, IL-17 signaling pathway, systemic lupus erythematosus, rheumatoid arthritis, and asthma, indicating a strong relationship with the immune microenvironment (Fig. 3C).

Next, we performed univariate Cox regression again to identify DEGs significantly correlated with patient survival, and 164 prognosis-related genes (P < 0.05) were retained for further analysis. Then, to explore the potential regulatory mechanisms of these prognostic DEGs, we clustered all patients from three cohorts into gene cluster A (n = 460), gene cluster B (n = 291), and gene cluster C (n = 221) in an unsupervised manner, as described above (k = 3, Fig. 3D). Patients in gene cluster C had the worst outcome on Kaplan-Meier analysis (Fig. 3E, p < 0.001), whereas those in gene cluster B had the best. The heatmap showed a distinct distribution of DEG expression among the three gene clusters, together with other variables including CRG cluster, gender, age, N stage, and T stage (Fig. 3G). We also found that patients with stage N0 disease were concentrated mainly in gene clusters A and B, indicating better survival. The boxplot showed that the expression levels of most cuproptosis genes differed substantially among the three gene clusters, except for DLD and DLAT (Fig. 3F).

Construction and validation of a predictive scoring model in UC

Because the above analysis was population-based and individual samples are heterogeneous, we developed a scoring system to predict the prognosis of each UC patient using the LASSO method on the prognosis-related DEGs (Fig. 4A-B). To assess the model's accuracy, samples were divided randomly into training and validation sets. We obtained a scoring model that quantifies each individual’s CRG risk. The model contained four genes, and the risk score formula was as follows: CRG risk score = (-0.0887 * expression of HSD17B2) + (0.2014 * expression of KDELR3) + (0.1125 * expression of EFEMP1) + (0.1118 * expression of TMEM45A). Based on the median CRG risk score, all UC patients were then divided into low-risk and high-risk groups. The CRG risk score differed significantly across the three CRG clusters and across the three gene clusters (Fig. 4D-E, p < 2.22 × 10⁻¹⁶). Consistent with the preceding survival analysis, CRG cluster B and gene cluster B showed the lowest risk scores, whereas CRG cluster C and gene cluster C had the highest. In addition, the expression levels of many cuproptosis genes differed significantly between the two risk groups (Fig. 4F). Kaplan-Meier survival analysis demonstrated that patients with higher risk scores had a poorer prognosis in the training and whole-patient cohorts as well as in the test cohort (Fig. 4G-I, p < 0.001). The AUC values for one-year, three-year, and five-year OS were 0.687, 0.709, and 0.722 in the training cohort, respectively (Fig. 4J). The results were similar in the all-patient and validation sets, indicating the potential predictive value of the score model for both short-term and long-term follow-up (Fig. 4K-L). Overall, the distribution plots suggested that the OS of UC patients declined as the CRG risk score increased, and the high-risk group showed higher expression of KDELR3, EFEMP1, and TMEM45A in the training, validation, and whole-patient sets (Figure S2). Finally, the alluvial diagram depicted the relationships among cluster assignments, risk groups, and survival status: most individuals in the low-risk group were alive, and a large proportion of deceased patients came from the high-risk group (Fig. 4C).
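The risk score formula and the median split can be written out as a short Python sketch. The four coefficients are taken from the formula above; everything else (function names, patient identifiers, expression values) is hypothetical. How patients whose score equals the median were assigned is not stated in the text, so placing them in the low-risk group is an assumption.

```python
from statistics import median

# Coefficients as reported in the risk score formula.
COEFFICIENTS = {
    "HSD17B2": -0.0887,
    "KDELR3": 0.2014,
    "EFEMP1": 0.1125,
    "TMEM45A": 0.1118,
}


def crg_risk_score(expression):
    """Weighted sum of the four gene expression values for one patient."""
    return sum(coef * expression[gene] for gene, coef in COEFFICIENTS.items())


def split_by_median(scores):
    """Label each patient 'high' or 'low' relative to the cohort median.

    ASSUMPTION: scores equal to the median are labelled 'low'.
    """
    cutoff = median(scores.values())
    return {pid: ("high" if s > cutoff else "low") for pid, s in scores.items()}


# Hypothetical expression values for four patients, for illustration only.
cohort = {
    "P1": {"HSD17B2": 5.0, "KDELR3": 1.0, "EFEMP1": 1.0, "TMEM45A": 1.0},
    "P2": {"HSD17B2": 1.0, "KDELR3": 4.0, "EFEMP1": 3.0, "TMEM45A": 2.0},
    "P3": {"HSD17B2": 0.5, "KDELR3": 6.0, "EFEMP1": 5.0, "TMEM45A": 4.0},
    "P4": {"HSD17B2": 3.0, "KDELR3": 2.0, "EFEMP1": 1.0, "TMEM45A": 1.0},
}
scores = {pid: crg_risk_score(expr) for pid, expr in cohort.items()}
groups = split_by_median(scores)
```

HSD17B2 carries the only negative coefficient, so higher HSD17B2 expression lowers the score, while higher KDELR3, EFEMP1, or TMEM45A expression raises it.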

Development and assessment of the nomogram based on CRG risk score

An individualized nomogram was developed to predict the 1-, 3-, and 5-year OS of UC patients based on five parameters: CRG risk score, N stage, T stage, age, and gender (Fig. 5A). The lower the total score, the better the prognosis. The calibration plot showed good agreement between nomogram-predicted OS and the ideal outcome (Fig. 5C). In addition, the model's C-index was 0.723, indicating reasonable predictive ability. Furthermore, the 1-year, 3-year, and 5-year AUC values were 0.78, 0.79, and 0.81, respectively (Fig. 5B). These results suggest that the nomogram could be a reliable tool to support clinical decision-making for UC patients.

Correlations of CRG risk score with tumor microenvironment and mutation landscape of UC

The CIBERSORT algorithm showed that the prognostic CRG risk score was positively correlated with activated NK cells, resting CD4 T memory cells, M0 macrophages, Neutrophils, and M2 macrophages, and negatively correlated with regulatory T cells (Tregs), follicular helper T cells, CD4 naive T cells, Plasma cells, and naive B cells (Fig. 6A and S3).

The ESTIMATE algorithm was employed in calculating the stromal and immune scores. As shown in the violin plot, CRG risk score was significantly positively related to StromalScore, ImmuneScore, and ESTIMATEScore, indicating lower tumor purity (Fig. 6B).

We also visualized and compared the mutation landscapes of the low-risk and high-risk groups in the TCGA cohort. The low-risk group showed a slightly higher overall mutation rate than the high-risk group (94.38% vs 93.49%). The two groups shared the same top five mutated genes, namely TP53, TTN, KMT2D, MUC16, and ARID1A (Fig. 6D-E). In addition, the risk score showed a significant negative correlation with tumor stemness, as measured by RNAss (Fig. 6C, p = 2.4 × 10⁻¹⁰).

Anti-cancer drug susceptibility analysis between two risk groups

Next, we performed an anti-cancer drug susceptibility analysis to identify chemotherapeutic or targeted agents that might benefit patients in the different risk groups. As displayed in Fig. 7, patients in the low-risk group had higher IC50 values for bexarotene, bleomycin, bortezomib, cisplatin, cyclopamine, dasatinib, docetaxel, and doxorubicin. Conversely, the IC50 values of bicalutamide, erlotinib, axitinib, bosutinib, and afatinib were significantly elevated in the high-risk group.

In cellular metabolism and bioenergetics, copper is a vital transition metal that serves as a cofactor for many enzymes. However, copper ions cause cellular toxicity when present in excess. Copper toxicity and carcinogenicity have long been concerns of physicians. As copper is an important element for proliferative immortality, angiogenesis, and metastasis, an excessive cellular concentration of Cu is associated with tumor progression [4]. Recently, Tsvetkov et al. proposed a new form of programmed cell death, namely cuproptosis. It can be differentiated from other types of programmed cell death (e.g. pyroptosis, ferroptosis, apoptosis, and necroptosis) because copper-induced cell death could not be blocked by inhibitors of other known cell death pathways [6, 15]. In addition, the researchers concluded that cuproptosis is closely related to the TCA cycle, because cells that relied heavily on mitochondrial respiration were more susceptible to copper-induced cell death. Accumulating evidence indicates that copper plays a vital role in the tumor microenvironment, metabolism, and drug sensitivity of tumors, including UC [8]. Therefore, characterizing the functional biological patterns mediated by TME alterations and CRGs can aid in the identification of potential prognostic features and help doctors determine more effective therapies for individual UC patients.

Our study comprehensively analyzed variations of CRGs at the genomic and transcriptomic levels and revealed the overall landscape of a CRG signature that could predict prognosis in UC patients. In total, 19 CRGs were included according to previous studies [6, 16–20]. First, we found differences in CRG expression between normal and tumor tissues and showed that most CRGs, including CDKN2A, had prognostic value in UC patients. It is therefore plausible that CRGs play a pivotal role in the tumorigenesis and progression of urothelial carcinoma. Second, we identified three CRG molecular patterns in an unsupervised manner based on the expression of the 17 prognostic CRGs, designated clusters A, B, and C. The three clusters showed distinctly different CRG expression, immune infiltration, and signaling pathways. On the one hand, cluster C resembled an immunoinflammatory phenotype, with a high degree of immune infiltration in the TME, including activated CD8 T cells, activated CD4 T cells, macrophages, regulatory T cells, and natural killer T cells. This finding agrees with previous reports that regulatory T cells usually accompany effector T cells in the TME to maintain immune homeostasis [21]. Moreover, KEGG analysis revealed that cluster C was closely associated with immune-related pathways and cell cycle pathways. On the other hand, cluster B resembled an immune-desert phenotype, with few infiltrating immune cells, and was enriched in drug and lipid metabolism processes, such as the arachidonic acid metabolism pathway and the linoleic acid metabolism pathway. Survival analysis showed that cluster B had the best prognosis and cluster C the worst. One possible explanation is that strong immune infiltration may be part of immune escape and immunoediting processes, and that not all infiltrating effector cells are tumor-reactive [22, 23]. Next, we identified 185 DEGs whose expression levels differed significantly among the three clusters. Functional analysis suggested that these DEGs may play a role in organizing and shaping the extracellular matrix. In addition, 164 DEGs were related to the OS of UC patients in our cohort. On this basis, we determined three gene clusters with significantly different survival outcomes, DEG expression, and CRG expression, designated gene clusters A, B, and C. Gene cluster C had the worst prognosis and gene cluster B the best. Therefore, a thorough understanding of the mechanisms of CRGs and their clinical manifestations could not only support an accurate prediction model but also help clinicians choose rational treatment for each patient, since patients can be clustered according to their genetic testing results.

Further, given the complexity of UC and the heterogeneity among individuals, we constructed a scoring system to evaluate each patient and stratified all patients into low-risk and high-risk groups. Interestingly, a comprehensive relationship network showed that patients with an immunoinflammatory phenotype had higher risk scores, whereas patients with an immune-desert phenotype had lower risk scores, consistent with our survival analysis. We also noted significant correlations between the CRG risk score and the stromal score, tumor purity, immune score, cancer stem-cell-like features (RNAss), and expression of cuproptosis genes. Patients in the low-risk group had a significantly better prognosis than those in the high-risk group, and this result was validated in the training, test, and whole-patient cohorts with satisfactory 1-, 3-, and 5-year AUC values. Moreover, the CRG risk score differed significantly among the three CRG clusters and among the three gene clusters: CRG cluster C and gene cluster C showed the highest risk scores, corresponding to the poorest prognosis, whereas CRG cluster B and gene cluster B showed the opposite. Taken together, these results demonstrate the good predictive performance of the CRG risk score.

A previous study demonstrated that inhibition of copper trafficking could reduce tumor cell proliferation and remodel the tumor microenvironment [24]. Over the last decade, immune checkpoint inhibitors have made great progress in the treatment of advanced urothelial carcinoma, but the overall response rate remains low. In addition to tumor mutation burden and PD-L1 expression in tumor cells, the immune microenvironment is an important factor affecting efficacy [25]. The present study characterized CRG cluster C as an immunoinflammatory phenotype with abundant infiltration of activated CD4 T cells and activated CD8 T cells, yet these patients had the poorest prognosis, suggesting that a broader view of the UC tumor microenvironment is required. One possible explanation for this phenomenon is that CD8+ T cells can be classified into different subtypes, some of which play a suppressive role in the tumor immune response [26]. The primary elements of the tumor microenvironment include stromal cells, immune cells, and tumor cells [27]. It has been argued that stromal activation in the tumor microenvironment is associated with T-cell inhibition [28]. In the present study, we found that a higher risk score, which indicates poorer overall survival, was positively associated with the stromal score, consistent with this view. We further analyzed the connection between the tumor immune microenvironment and the CRG risk score. The risk score was significantly and positively correlated with M0 and M2 macrophages, which have been shown to display an immunosuppressive phenotype in urothelial carcinoma, suggesting that individuals with low CRG risk scores had a better prognosis [29]. Additionally, the abundance of neutrophils was markedly elevated in the high CRG risk group, and an increasing number of studies have shown that neutrophils play a crucial role in tumor metastasis and patient prognosis [30, 31]. Overall, the two CRG risk groups show distinct tumor microenvironment landscapes, and detecting cuproptosis-related genes could provide clinicians with tools for the diagnosis and treatment of UC patients.
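As a rough illustration of how an association between the risk score and the stromal score might be quantified, the sketch below computes a rank-based (Spearman) correlation. The choice of Spearman correlation is an assumption, since this section does not state which method was used, and the sample values and function names are hypothetical.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-patient values, for illustration only.
risk_scores = [0.2, 0.8, 1.9, 1.1, 0.5]
stromal_scores = [-300.0, 150.0, 400.0, 900.0, 100.0]

rho = spearman(risk_scores, stromal_scores)
```

A rank-based measure is used here because it does not assume a linear relationship between the two scores; a positive `rho` corresponds to the direction of association reported above.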

The majority of urothelial carcinomas are sensitive to cisplatin-based chemotherapy, but this sensitivity does not last. In addition, patients with advanced urothelial carcinoma have been reported to respond heterogeneously to platinum-based chemotherapy and ICI therapy [32, 33]. It is therefore crucial to identify UC patients who are suitable for cisplatin treatment, and we observed that individuals with a high risk score could benefit more from cisplatin-based chemotherapy. For patients with platinum-refractory UC, a phase II trial demonstrated that those with HER2 or ERBB3 alterations responded to afatinib, an irreversible tyrosine kinase inhibitor of the ErbB receptor family [34]. We also found that individuals with lower risk scores were more sensitive to afatinib. Hence, the CRG risk score system could help tailor treatment to the individual needs of UC patients. Finally, a nomogram incorporating the CRG risk score and clinical characteristics was developed to predict the clinical outcome of individual UC patients, and its predictive value was validated (1-, 3-, and 5-year AUC values of 0.78, 0.79, and 0.81, respectively). However, some limitations should be noted. For example, the prognostic model was developed from retrospective data in public databases, so it still needs to be verified in a large prospective study.
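The drug sensitivity comparison between the two risk groups can be illustrated with a simple rank-based statistic. This is only a sketch: the section does not state which test was applied, so the Mann-Whitney U statistic is an assumption, and the IC50 values and function names below are hypothetical. A lower estimated IC50 is read as greater sensitivity.

```python
def mann_whitney_u(a, b):
    """U statistic for sample a versus sample b.

    Counts the pairs (x, y) with x from a and y from b where x > y;
    tied pairs contribute 0.5.
    """
    return sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x in a
        for y in b
    )


def probability_of_superiority(a, b):
    """Estimated probability that a random value from a exceeds one from b."""
    return mann_whitney_u(a, b) / (len(a) * len(b))


# Hypothetical estimated cisplatin IC50 values, for illustration only.
low_risk_ic50 = [4.1, 3.8, 4.5, 3.9]
high_risk_ic50 = [3.2, 3.6, 4.0, 3.0]

u = mann_whitney_u(low_risk_ic50, high_risk_ic50)
effect = probability_of_superiority(low_risk_ic50, high_risk_ic50)
```

An `effect` well above 0.5 would indicate that low-risk patients tend to have higher IC50 values than high-risk patients, which matches the direction reported for cisplatin; a formal p-value would require the null distribution of U, which is omitted here.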

In general, we successfully developed a CRG risk score model to accurately predict the prognosis of individual UC patients. Moreover, the model was internally validated and performed well, indicating that it may serve as a useful tool in clinical practice.

UC

Urothelial carcinoma

UTUC

Upper tract urothelial carcinoma

UBUC

Urothelial carcinoma of bladder

CRGs

Cuproptosis-related genes

GC

Gemcitabine and cisplatin

TCA

Tricarboxylic acid

CTR1

Copper transporter receptor 1

TME

Tumor microenvironment

CNV

Copy number variation

TMB

Tumor mutation burden

LASSO

Least absolute shrinkage and selection operator

DEG

Differentially expressed gene

OS

Overall survival

TCGA

The Cancer Genome Atlas

GEO

Gene Expression Omnibus

PCA

Principal component analysis

ROC

Receiver operating characteristic curve

GO

Gene ontology

KEGG

Kyoto encyclopedia of genes and genomes

ssGSEA

Single sample gene set enrichment analysis

ICI

Immune checkpoint inhibitor

IC50

Half-maximal inhibitory concentration

DCA

Decision curve analysis

AUC

Area under curve

PD-L1

Programmed cell death-ligand 1

Ethics approval and consent to participate

The study design was fully in accordance with the Declaration of Helsinki. The use of data from The Cancer Genome Atlas (TCGA) and the GEO cohorts in this study fully complied with the TCGA (http://cancergenome.nih.gov/publications/publicationguidelines) and GEO (https://www.cedd.gov.hk/eng/publications/geo/) publication regulations. Study protocols were approved by Fudan University Shanghai Cancer Center (FUSCC) (Shanghai, China).

Consent for publication

Not applicable.

Availability of data and materials

The datasets used and/or analyzed during the current study were downloaded from TCGA (https://portal.gdc.cancer.gov/repository; DataSet ID: TCGA-BLCA Transcriptome Profiling and TCGA-BLCA Clinical) and the GEO database (https://www.ncbi.nlm.nih.gov/geo/; DataSet ID: GSE13507, GSE31684, and GSE32894). These databases are all publicly available.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by the Clinical Research Plan of SHDC [grant number SHDC2020CR4031].

Authors' contributions

All authors contributed to the study design and manuscript editing. YH and YX developed the study proposal. WT, YX and YH carried out data management, statistical analysis and chart making. YH drafted the manuscript. SM helped with cohort identification and data management. YJ and DW performed project administration. All authors read and approved the final manuscript.

Acknowledgements

We are grateful to Sangerbox (http://vip.sangerbox.com) for providing technical support for this manuscript. We thank Bullet Edits Limited for the linguistic editing and proofreading of the manuscript.

Authors' information

Not applicable

Footnotes

Not applicable

M. Rouprêt, M. Babjuk, M. Burger, O. Capoun, D. Cohen, E.M. Compérat, N.C. Cowan, J.L. Dominguez-Escrig, P. Gontero, A. Hugh Mostafid, J. Palou, B. Peyronnet, T. Seisen, V. Soukup, R.J. Sylvester, B. Rhijn, R. Zigeuner, S.F. Shariat, European Association of Urology Guidelines on Upper Urinary Tract Urothelial Carcinoma: 2020 Update, European urology, 79 (2021) 62–79.
J.A. Witjes, H.M. Bruins, R. Cathomas, E.M. Compérat, N.C. Cowan, G. Gakis, V. Hernández, E. Linares Espinós, A. Lorch, Y. Neuzillet, M. Rouanne, G.N. Thalmann, E. Veskimäe, M.J. Ribal, A.G. van der Heijden, European Association of Urology Guidelines on Muscle-invasive and Metastatic Bladder Cancer: Summary of the 2020 Guidelines, European urology, 79 (2021) 82–104.
M. Koufopoulou, P.A.P. Miranda, P. Kazmierska, S. Deshpande, P. Gaitonde, Clinical evidence for the first-line treatment of advanced urothelial carcinoma: Current paradigms and emerging treatment options, Cancer treatment reviews, 89 (2020) 102072.
S. Blockhuys, E. Celauro, C. Hildesjö, A. Feizi, O. Stål, J.C. Fierro-González, P. Wittung-Stafshede, Defining the human copper proteome and analysis of its expression variation in cancers, Metallomics: integrated biometal science, 9 (2017) 112–123.
D. Denoyer, S. Masaldan, S. La Fontaine, M.A. Cater, Targeting copper in cancer therapy: 'Copper That Cancer', Metallomics: integrated biometal science, 7 (2015) 1459–1476.
P. Tsvetkov, S. Coy, B. Petrova, M. Dreishpoon, A. Verma, M. Abdusamad, J. Rossen, L. Joesch-Cohen, R. Humeidi, R.D. Spangler, J.K. Eaton, E. Frenkel, M. Kocak, S.M. Corsello, S. Lutsenko, N. Kanarek, S. Santagata, T.R. Golub, Copper induces cell death by targeting lipoylated TCA cycle proteins, Science (New York, N.Y.), 375 (2022) 1254–1261.
Z. Cai, C.F. Li, F. Han, C. Liu, A. Zhang, C.C. Hsu, D. Peng, X. Zhang, G. Jin, A.H. Rezaeian, G. Wang, W. Zhang, B.S. Pan, C.Y. Wang, Y.H. Wang, S.Y. Wu, S.C. Yang, F.C. Hsu, R.B. D'Agostino, Jr., C.M. Furdui, G.L. Kucera, J.S. Parks, F.H. Chilton, C.Y. Huang, F.J. Tsai, B. Pasche, K. Watabe, H.K. Lin, Phosphorylation of PDHA by AMPK Drives TCA Cycle to Promote Cancer Metastasis, Molecular cell, 80 (2020) 263–278.e267.
D. Kilari, K.A. Iczkowski, C. Pandya, A.J. Robin, E.M. Messing, E. Guancial, E.S. Kim, Copper Transporter-CTR1 Expression and Pathological Outcomes in Platinum-treated Muscle-invasive Bladder Cancer Patients, Anticancer research, 36 (2016) 495–501.
J.T. Leek, W.E. Johnson, H.S. Parker, A.E. Jaffe, J.D. Storey, The sva package for removing batch effects and other unwanted variation in high-throughput experiments, Bioinformatics (Oxford, England), 28 (2012) 882–883.
W.E. Johnson, C. Li, A. Rabinovic, Adjusting batch effects in microarray expression data using empirical Bayes methods, Biostatistics (Oxford, England), 8 (2007) 118–127.
A. Mayakonda, D.C. Lin, Y. Assenov, C. Plass, H.P. Koeffler, Maftools: efficient and comprehensive analysis of somatic variants in cancer, Genome research, 28 (2018) 1747–1756.
M.D. Wilkerson, D.N. Hayes, ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking, Bioinformatics (Oxford, England), 26 (2010) 1572–1573.
A. Iasonos, D. Schrag, G.V. Raj, K.S. Panageas, How to build and interpret a nomogram for cancer prognosis, Journal of clinical oncology: official journal of the American Society of Clinical Oncology, 26 (2008) 1364–1370.
K. Yoshihara, M. Shahmoradgoli, E. Martínez, R. Vegesna, H. Kim, W. Torres-Garcia, V. Treviño, H. Shen, P.W. Laird, D.A. Levine, S.L. Carter, G. Getz, K. Stemke-Hale, G.B. Mills, R.G. Verhaak, Inferring tumour purity and stromal and immune cell admixture from expression data, Nature communications, 4 (2013) 2612.
F.J. Bock, S.W.G. Tait, Mitochondria as multifaceted regulators of cell death, Nature reviews. Molecular cell biology, 21 (2020) 85–100.
E.V. Polishchuk, A. Merolla, J. Lichtmannegger, A. Romano, A. Indrieri, E.Y. Ilyechova, M. Concilli, R. De Cegli, R. Crispino, M. Mariniello, R. Petruzzelli, G. Ranucci, R. Iorio, F. Pietrocola, C. Einer, S. Borchard, A. Zibert, H.H. Schmidt, E. Di Schiavi, L.V. Puchkova, B. Franco, G. Kroemer, H. Zischka, R.S. Polishchuk, Activation of Autophagy, Observed in Liver Tissues From Patients With Wilson Disease and From ATP7B-Deficient Animals, Protects Hepatocytes From Copper-Induced Apoptosis, Gastroenterology, 156 (2019) 1173–1189.e1175.
L. Aubert, N. Nandagopal, Z. Steinhart, G. Lavoie, S. Nourreddine, J. Berman, M.K. Saba-El-Leil, D. Papadopoli, S. Lin, T. Hart, G. Macleod, I. Topisirovic, L. Gaboury, C.J. Fahrni, D. Schramek, S. Meloche, S. Angers, P.P. Roux, Copper bioavailability is a KRAS-specific vulnerability in colorectal cancer, Nature communications, 11 (2020) 3701.
M.A. Kahlson, S.J. Dixon, Copper-induced cell death, Science (New York, N.Y.), 375 (2022) 1231–1232.
J. Dong, X. Wang, C. Xu, M. Gao, S. Wang, J. Zhang, H. Tong, L. Wang, Y. Han, N. Cheng, Y. Han, Inhibiting NLRP3 inflammasome activation prevents copper-induced neuropathology in a murine model of Wilson's disease, Cell death & disease, 12 (2021) 87.
X. Ren, Y. Li, Y. Zhou, W. Hu, C. Yang, Q. Jing, C. Zhou, X. Wang, J. Hu, L. Wang, J. Yang, H. Wang, H. Xu, H. Li, X. Tong, Y. Wang, J. Du, Overcoming the compensatory elevation of NRF2 renders hepatocellular carcinoma cells more vulnerable to disulfiram/copper-induced ferroptosis, Redox biology, 46 (2021) 102122.
D.S. Chen, I. Mellman, Elements of cancer immunity and the cancer-immune set point, Nature, 541 (2017) 321–330.
G.P. Dunn, A.T. Bruce, H. Ikeda, L.J. Old, R.D. Schreiber, Cancer immunoediting: from immunosurveillance to tumor escape, Nature immunology, 3 (2002) 991–998.
J. Zhang, D. Huang, P.E. Saw, E. Song, Turning cold tumors hot: from molecular mechanisms to clinical applications, Trends in immunology, 43 (2022) 523–545.
J. Wang, C. Luo, C. Shan, Q. You, J. Lu, S. Elf, Y. Zhou, Y. Wen, J.L. Vinkenborg, J. Fan, H. Kang, R. Lin, D. Han, Y. Xie, J. Karpus, S. Chen, S. Ouyang, C. Luan, N. Zhang, H. Ding, M. Merkx, H. Liu, J. Chen, H. Jiang, C. He, Inhibition of human copper trafficking by a small molecule significantly attenuates cancer cell proliferation, Nature chemistry, 7 (2015) 968–979.
J.E. Rosenberg, J. Hoffman-Censits, T. Powles, M.S. van der Heijden, A.V. Balar, A. Necchi, N. Dawson, P.H. O'Donnell, A. Balmanoukian, Y. Loriot, S. Srinivas, M.M. Retz, P. Grivas, R.W. Joseph, M.D. Galsky, M.T. Fleming, D.P. Petrylak, J.L. Perez-Gracia, H.A. Burris, D. Castellano, C. Canil, J. Bellmunt, D. Bajorin, D. Nickles, R. Bourgon, G.M. Frampton, N. Cui, S. Mariathasan, O. Abidoye, G.D. Fine, R. Dreicer, Atezolizumab in patients with locally advanced and metastatic urothelial carcinoma who have progressed following treatment with platinum-based chemotherapy: a single-arm, multicentre, phase 2 trial, Lancet (London, England), 387 (2016) 1909–1920.
L. Flippe, S. Bézie, I. Anegon, C. Guillonneau, Future prospects for CD8(+) regulatory T cells in immune tolerance, Immunological reviews, 292 (2019) 209–224.
F. Spill, D.S. Reynolds, R.D. Kamm, M.H. Zaman, Impact of the physical microenvironment on tumor progression and metastasis, Current opinion in biotechnology, 40 (2016) 41–48.
D. Malhotra, A.L. Fletcher, S.J. Turley, Stromal and hematopoietic cells in secondary lymphoid organs: partners in immunity, Immunological reviews, 251 (2013) 160–176.
Z. Chen, L. Zhou, L. Liu, Y. Hou, M. Xiong, Y. Yang, J. Hu, K. Chen, Single-cell RNA sequencing highlights the role of inflammatory cancer-associated fibroblasts in bladder urothelial carcinoma, Nature communications, 11 (2020) 5077.
L. Wu, S. Saxena, M. Awaji, R.K. Singh, Tumor-Associated Neutrophils in Cancer: Going Pro, Cancers, 11 (2019).
S.B. Coffelt, M.D. Wellenstein, K.E. de Visser, Neutrophils in cancer: neutral no more, Nature reviews. Cancer, 16 (2016) 431–446.
M.D. Galsky, G.J. Chen, W.K. Oh, J. Bellmunt, B.J. Roth, R. Petrioli, L. Dogliotti, R. Dreicer, G. Sonpavde, Comparative effectiveness of cisplatin-based and carboplatin-based chemotherapy for treatment of advanced urothelial carcinoma, Annals of oncology: official journal of the European Society for Medical Oncology, 23 (2012) 406–410.
A. Birtle, M. Johnson, J. Chester, R. Jones, D. Dolling, R.T. Bryan, C. Harris, A. Winterbottom, A. Blacker, J.W.F. Catto, P. Chakraborti, J.L. Donovan, P.A. Elliott, A. French, S. Jagdev, B. Jenkins, F.X. Keeley, Jr., R. Kockelbergh, T. Powles, J. Wagstaff, C. Wilson, R. Todd, R. Lewis, E. Hall, Adjuvant chemotherapy in upper tract urothelial carcinoma (the POUT trial): a phase 3, open-label, randomised controlled trial, Lancet (London, England), 395 (2020) 1268–1277.
N.J. Choudhury, A. Campanile, T. Antic, K.L. Yap, C.A. Fitzpatrick, J.L. Wade, 3rd, T. Karrison, W.M. Stadler, Y. Nakamura, P.H. O'Donnell, Afatinib Activity in Platinum-Refractory Metastatic Urothelial Carcinoma in Patients With ERBB Alterations, Journal of clinical oncology: official journal of the American Society of Clinical Oncology, 34 (2016) 2165–2171.

No competing interests reported.
