1. The expression of RRM2 in pan-cancer
In this study, to explore the oncogenic role of human RRM2, we first analyzed the expression pattern of RRM2 in different cancer types using the Oncomine database (Fig. 1A). The results showed that RRM2 was significantly increased in most cancer groups, including bladder, brain, breast, cervical, colorectal, esophageal, gastric, head and neck, kidney, liver, lung, lymphoma, melanoma, other, ovarian, pancreatic, prostate and sarcoma cancers. While the expression of RRM2 was decreased in leukemia. Further, we identified the expression levels of RRM2 in 33 different tumor and nontumor tissues based on the datasets of TCGA. As shown in Fig. 1B, the expression levels of RRM2 were extremely different between cancer and normal groups. The expression levels of RRM2 were significantly higher in 19 tumor tissues than the corresponding normal tissues, including BLCA (Bladder Urothelial Carcinoma), BRCA (Breast invasive carcinoma), CESC (Cervical squamous cell carcinoma and endocervical adenocarcinoma), CHOL (Cholangiocarcinoma), COAD (Colon adenocarcinoma), ESCA (Esophageal carcinoma), GBM (Glioblastoma multiforme), HNSC (Head and Neck squamous cell carcinoma), KIRC (Kidney renal clear cell carcinoma), KIRP (Kidney renal papillary cell carcinoma), LIHC (Liver hepatocellular carcinoma), LUAD (Lung adenocarcinoma), LUSC (Lung squamous cell carcinoma), PCPG (Pheochromocytoma and Paraganglioma), PRAD (Prostate adenocarcinoma), READ (Rectum adenocarcinoma ), STAD (Stomach adenocarcinoma), THCA (Thyroid carcinoma), and UCEC (Uterine Corpus Endometrial Carcinoma). Overall, these results indicated that RRM2 was highly expressed in most cancers and showed a potential oncogenic role.
2. RRM2 expression and clinical correlation in pan-cancer
We assessed the correlation between the expression of RRM2 and tumor patients’ clinical characteristics in pan-cancer, including age, race, tumor stage and status. For age, a significant correlation was observed in BRCA, ESCA KICH (Kidney Chromophobe), KIRC, KIRP, LAML (Acute Myeloid Leukemia), LGG (Brain Lower Grade Glioma), LIHC, LUAD, LUSC, PAAD, PCPG, READ, STAD, TGCT (Testicular Germ Cell Tumors) and THYM (Thymoma) (Fig. 2A-D and Fig. S1A-L). For race, RRM2 expression significantly correlated with BLCA, BRCA, KICH, KIRC, KIRP, LIHC and THYM (Fig. 2E-H and Fig. S1M-O). Regarding the tumor stage, RRM2 expression was significantly associated with ACC (Adrenocortical carcinoma), BRCA, COAD, KICH, KIRC, KIRP, LIHC, LUAD, LUSC, PAAD (Pancreatic adenocarcinoma), SKCM (Skin Cutaneous Melanoma), TGCT and THCA (Fig. 2I-L and Fig. S1P-X). In terms of tumor status, RRM2 expression levels significantly correlated with ACC, BLCA, COAD, KICH, KIRC, KIRP, LGG, LUAD, OV (Ovarian serous cystadenocarcinoma), PAAD, PCPG, PRAD and UVM (Uveal Melanoma) (Fig. 2M-P and Fig. S1Y-AG).
3. Survival analysis
For estimating the correlation between RRM2 expression with the prognosis of pan-cancer, we divided the tumor cases into two groups by the high- and low-expression levels of RRM2.
Kaplan-Meier method was applied to analyze Overall survival (years) (years) (OS), Disease-free interval (years) (DFI), Disease-specific survival (years) (DSS) and Progression-free interval (years) (PFI), respectively. As shown in Fig. 3A-E and Fig. S2A-I, OS analysis data showed high RRM2 expression was associated with poor prognosis for the TCGA cases of ACC(P < 0.001), KICH(P = 0.015), KIRC(P < 0.001), KIRP(P < 0.001), LGG(P < 0.001), LIHC(P = 0.005), LUAD(P < 0.001), MESO (Mesothelioma) (P < 0.001), PAAD(P = 0.006), PRAD(P = 0.004), SARC (Sarcoma)(P = 0.04), UCEC(P = 0.021) and UVM(P < 0.001). However, a low RRM2 expression was related to poor OS for THYM(P = 0.017). DSS analysis data indicated that a correlation between high RRM2 expression and poor prognosis for the TGCA cases of ACC(P < 0.001), KICH(P = 0.001), KIRC(P < 0.001), KIRP(P < 0.001), LGG(P < 0.001), LIHC(P = 0.028), LUAD(P = 0.004), MESO(P < 0.001), PAAD(P = 0.014), PRAD(P = 0.03) and UVM(P < 0.001) (Fig. 3F-J and Fig. S2J-O). For DFI (Fig. 3K-O and Fig. S2P-R), high RRM2 expression was linked to poor prognosis for cancers of KIRP(P = 0.007), LIHC(P = 0.038), LUAD(P = 0.005), PAAD(P = 0.022), SARC(P < 0.001), TGCT(P = 0.043) and THCA(P = 0.001). Additionally, a low RRM2 expression was related with poor DFI for OV(P = 0.022). For PFI analysis data (Fig. 3P-T and Fig. S2SA-B), the high RRM2 expression showed a correlation with poor prognosis for cancers of ACC(P = 0.002), KICH(P = 0.007), KIRC(P = 0.025), KIRP(P < 0.001), LGG(P = 0.001), LIHC(P = 0.003), LUAD(P = 0.003), MESO(P < 0.001), PAAD(P = 0.002), PRAD(P < 0.001), SARC(P = 0.001), THCA(P = 0.011) and UVM(P < 0.001). However, the low RRM2 expression was linked to poor PFI for cancers of COAD(P = 0.049) and STAD(P = 0.05).
Further, we calculated the survival data by the Cox regression models (Fig. 3U-X). The results of OS, DSS and PFI were similar to that we calculated by Kaplan-Meier method. The difference for OS was the high RRM2 expression did not show a significant correlation with poor prognosis for cancers of UCEC. Besides, high RRM2 expression was linked to poor DSS for SARC(P = 0.024). For PFI, the high RRM2 expression showed a significant correlation with poor prognosis for cancer of PCPG(P = 0.006) and did not show a significant correlation with poor prognosis for STAD. For DFI, the data indicated that the high RRM2 expression was associated with the poor prognosis for cancers of BRCA(P = 0.036), KIRP(P < 0.001), LIHC(P = 0.021), LUAD(P < 0.001), PAAD(P = 0.012), PRAD(P = 0.020), SARC(P = 0.002) and THCA(P < 0.001). These data suggested that RRM2 was differentially associated with the prognosis of cases with different cancers.
4. genetic alteration analysis data
Next, we explored the genetic alteration status of RRM2 in 10953 patients / 10967 samples in 32 studies of the TGCA cohorts. The analysis data showed that RRM2 has the highest alteration frequency (> 7%) in patients with uterine carcinosarcoma, among which "amplification" is the main type. Notably, the genomic alteration types of all DLBCL (Diffuse large B cell lymphomas) and KICH cases were deep deletion of RRM2(Fig. 4A). Then, we investigated the types, sites and case numbers of RRM2 mutation (Fig. 4B). We found that the primary genetic mutation types of RRM2 was missense mutation. R298Q/W mutation in the ribonucleotide reductase, which was detected in 3 cases of UCEC and 1 case of GBM, was able to product a missense mutation of the RRM2 gene, translation from R(Arginine) to W(Tryptophan) or Q(Glutamine) at the 298 site of RRM2 protein. We visualized the R298 site in the 3D structure of RRM2 protein (Fig. 4C).
Moreover, we explored the correlation between RRM2 expression and TMB (tumor mutational burden)/ MSI (microsatellite instability) in all cancers of TCGA. As shown in Fig. 4D, RRM2 expression had a positive correlation with TMB for ACC, BLCA, BRCA, CESC, CHOL, COAD, KICH, KIRC, LGG, LIHC, LUAD, LUSC, MESO, OV, PAAD, PRAD, READ, SARC, SKCM and STAD but a negative correlation for THYM. As shown in Fig. 4E, we found a positive correlation between RRM2 expression and MSI for cancers of COAD, LIHC, SARC, STAD, TGCT, UCEC, and UCS (Uterine Carcinosarcoma) but found a negative correlation for LAML and SKCM. The results deserved more in-depth analysis and induction.
5. immune infiltration
Immune cells, as primary components of tumor microenvironment (TME), were related with the progression and metastasis of tumors and therapy resistance(23, 27). Tumor-associated endothelial cells were reported to play a key role in sculpting immune responses, which was necessary for tumor out-growth and metastasis(28). Firstly, we explored the association between RRM2 gene expression and the infiltration level of immune cells in cancers of TCGA by the TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, XCELL, MCPCOUNTER and EPIC algorithms. As shown in Fig. 5A, the RRM2 expression level in BRCA, KIRC, LUAD, LUSC, STAD, THCA and THYM was negatively correlated with the infiltration level of tumor-associated endothelial cells, but noted a positive correlation for KIRP and LGG. Then, we analyzed the correlation between RRM2 expression level and the immune infiltration of different tumor-associated cells (B cells, Plasma cells, T cells, NK cells, Monocytes, Macrophages, Dendritic cells, Mast cells, Eosinophils and Neutrophils) as well as their subtypes in the tumors of TGCA. The results showed that RRM2 expression level significantly correlated with the immune infiltration of various tumor-associated cells in 24 tumor types. For example, RRM2 expression level in ACC was negatively correlated with the infiltration level of Mast cells resting (Fig. 5B). In BLCA, we observed that RRM2 expression significantly correlated with the infiltration of 5 types of immune cells (Fig. 5C-G). In BRCA, RRM2 expression significantly correlated with the infiltration of 10 types of immune cells (Fig. 5H-Q). In other cancers, the correlations between RRM2 expression and immune cells’ infiltration are shown in Fig. S3A-CI.
6. Immune checkpoint analysis
We explored the correlation between RRM2 expression level and immune checkpoint genes in various cancers (Fig. 6). The results showed that the expression levels of more than 40 immune checkpoint genes were significantly associated with RRM2 expression level in TGCT and THCA. Moreover, the expression level of up to 30 immune checkpoint genes in BRCA, HNSC, KIRC, KIRP, LGG, LIHC, PRAD, THYM and UVM were related to RRM2 expression level. Additionally, there were more than 20 different immune checkpoint genes related with RRM2 expression level in 10 types of cancers.
7. Enrichment analysis of SND1-related partners
To deeply understand the molecular mechanism of the RRM2 gene in tumorigenesis, we tried to search the RRM2 expression-correlated targeting genes and the RRM2-binding proteins for functional enrichment analyses. We acquired 50 RRM2-binding proteins which had been authenticated by experiments using STRING. The protein-protein interaction network was shown in Fig. 7A. Then we explored the top 100 RRM2 expression-correlated genes based on datasets of TGCA by the GEPIA2. The results showed that the expression level of RRM2 was positively associated with that of MKI67 (Marker of proliferation Ki-67) (R = 0.79), ORC1 (Origin recognition complex subunit 1) (R = 0.78), CCNA2 (Cyclin A2) (R = 0.77), PLK1 (Polo like kinase 1) (R = 0.77) and KIF11 (Kinesin family member 11) (R = 0.76) (Fig. 7B). The heatmap indicated a positive correlation between RRM2 and the four genes in virtually all the cancer types (Fig. 7C). Intersecting the above two groups demonstrated 4 common genes: PLK1, CDK1 (Cyclin dependent kinase 1), DTL (Denticleless E3 ubiquitin protein ligase homolog) and ASF1B (anti-silencing function 1B histone chaperone) (Fig. 7D). The KEGG enrichment analyses were performed based the genes in the two groups. The result indicated that “cell cycle” might play a crucial role in the effect of RRM2 on tumor pathogenesis (Fig. 7E).
8. Potential drug prediction for pan-cancer
We performed GSEA to explore the pathways that RRM2 regulates in pan-cancer (Fig. S4). The results showed that the pathway of cell cycle was the most significantly affected by RRM2 expression in pan-cancer, which was the same to KEGG enrichment analyses above (Table S1). The drug prediction analysis showed that up to 14 types of approved drugs, such as Phenylbutyrate and Romidepsin, may have therapeutic effect on pan-cancer through targeting the cell cycle-associated pathways (Table S2).