3.1 Immune infiltration analysis related to SKCM patients
We obtained the gene expression data of SKCM patients from TCGA database, and 22 different immune cell infiltrations in each sample were analyzed through the CIBERSORT algorithm (Fig. 1A). The TME cell network depicts the interactions between tumor immune cells (Supplementary Fig. 1). Comprehensive status of cell lineage and its impact on overall survival of patients with SKCM were also analysed (Fig. 1B). The TME cell network demonstrated that Cell cluster − B, C and D were positively correlated with each other, including T cells CD8, B cells naive, Plasma cells, T cells CD4 naive, NK cells activated, Monocytes, Dendritic cells resting, T cells regulatory (Tregs), Monocytes and T cells CD4 memory activated. Moreover, there was also a significant positive correlation among immune cells in Cell cluster-A, such as gamma delta T cells, NK cells resting, Neutrophils, Mast cells activated, Macrophages M0, Eosinophils and Dendritic cells activated. Meanwhile, Macrophages M0, T cells CD4 memory activated, Monocytes were performed with a significant reverse correlation. NK cells resting and NK cells activated showed a same significant negative correlation. We also performed an exploratory analysis to measure the survival benefit and potential risks. The results showed that T cells CD4, T cells CD8, NK cells activated, Tregs, Dendritic cells and Macrophages M1 associated with shortened OS while NK cells resting, Neutrophils, Macrophages M0 and Dendritic cells activated associated with prolonged OS. In order to construct the best number of clusters and classifierion, we used the ConsensusClusterPlus package to assess the stability of the clustering structure and divided SKCM patients into TMECluster-A, B and C. Unsupervised hierarchical clustering was used to analyze the normalized immune cell fractions. Heatmap showed the correlation between the infiltration abundance of 22 type of immune cells and the immune scores in the three groups (Fig. 1C). The results showed that patients with TMECluster-A had higher immune scores, while patients with TMECluster-B and C were mainly related to tumor purity and stromal scores. In addition, PCA analysis results show that based on the expression data of SKCM patients, the three TMECluster groups can be significantly distinguished (Fig. 1D). The survival analysis related to the TME phenotype showed that TMEcluster-C (N = 92) was associated with better prognosis (log-rank test, P < 0.001) (Fig. 1E).
Subsequently, we further selected and analyzed the gene expression profile data and immune cell infiltration abundance of SKCM samples in the GEO database. Also use the ConsensusClusterPlus package to evaluate the stability of clustering (Fig. 2A). Subsequently, the heatmap showed that patients with TMECluster-A showed higher immune scores (Fig. 2B), which is consistent with the results in the TCGA database. This was in line with previous studies reported that melanoma is a highly immune dependent malignant tumor.
3.2 Construction of TME-related signature gene in SKCM patients
TME has a crucial impact on tumor epigenetics, metastasis, and immune escape. We used limma package to analyze the 486 differentially expressed genes among the three groups of TMECluster in the TCGA database to determine the potential biological characteristics of different TME phenotypes. In the same way, 330 DEGs were analyzed in the GEO datasets. Unsupervised clustering using these DEGs provided three distinct clusters: GeneCluster-A,B and C (Fig. 3B). Simultaneously, according to the differential expression of DEGs in different clusters, it was further divided into Signaure gene-A and B. GO enrichment analysis showed that Signature gene-A and B exhibit different unique biological processes. Signature gene-A was associated with the overexpression of immune-activated genes (Fig. 3C), while Signature gene-B showed up-regulation of genes related to stromal and transmembrane receptors (Fig. 3D). There were significant differences in the infiltration of TME cells (Fig. 3E) and the enrichment of related biological pathways (Fig. 3F) between the three GeneClusters, which was consistent with the previous functional enrichment results.
3.3 CeRNA regulatory network construction and signature gene expression in SKAM patients
We further analyzed the genetic variants of Signature gene-A and B, including single nucleotide polymorphisms (SNPs), copy number variations (CNVs), and alternative splicing patterns. The differences in the frequency of mutations in SKCM driver genes were explored between the two clusters. SNP analysis shows that there were obvious mutations in both gene sets. Somatic SPTA1 mutations were more frequent in TMEcluster-A (Fig. 4A). And TNN was the most frequently observed somatic mutations in TMEcluster-B (Fig. 4B). Moreover, the study of CNV change frequency showed that CNV changes are common in two groups, and most of them were concentrated on the amplification of copy numbers. Then, we showed the positions of these CNV changes on the chromosome in Fig. 4C and D. In the alternative splicing analysis, the Signature gene-A was mainly manifested as exon skipping (ES) in SKCM patients (Fig. 4E), while the Signature gene-B had no obvious alternative splicing forms (Fig. 4F). In addition, based on the gene expression of Signature gene-A and B, we screened out LncRNAs that may be related to immunity with correlation coefficients > 0.4 and P < 0.05. Then, the starBase database (http://starbase.sysu.edu.cn/) was used to obtain targeted differentially expressed miRNAs to construct the regulatory ceRNA network of mRNA-miRNA-LncRNA interaction (Fig. 4G).
3.4 Construction of immune-related prognostic gene signature
To better predict the impact of immune characteristics on the prognosis of patients, we have constructed a new prognostic-related risk scoring system. The Signature gene-A and B genes were incorporated into univariate Cox analyses, and 233 genes related to prognosis were obtained (P < 0.05). The LASSO Cox analyses was further used for dimensionality reduction and model construction, and finally a total of 16 hub genes were included in the risk scoring model (Fig. 5A). A risk score (RS) formula was established by including individual normalized gene expression values weighted by their LASSO Cox coefficients. We calculated the risk score for each SKCM patient according to locking of coefficients in each gene signature. The high/low-risk groups were divided according to the median risk score. Kaplan-Meier analysis showed that patients with high-risk scores had a relatively poor prognosis (Fig. 5B). The time-dependent ROC curve analysis also showed that the risk score has a good predictive ability on OS (Fig. 5C). The area under the curve (AUC) of 1-year, 3-year, and 5-year OS wre 0.713, 0.694, and 0.734, respectively. In addition, the distribution of each patient's risk score, survival status, and gene expression map are shown in Fig. 5D.
3.5 Gene Set Enrichment Analysis (GSEA)
Subsequently, we analyzed the impact of high and low risk groups on the biologically relevant functions of SKCM patients. GSEA analysis showed that pathways related to metabolism and oxidative phosphorylation were mainly enriched in the high-risk group (Fig. 6A and C), while pathways related to immune response, including cytokine signaling pathway, JAK-STAT signaling pathway, and natural killer cell-mediated cytotoxicity were significantly enriched in low-risk patients (Fig. 6B and D). At the same time, the expression of TME cells (Fig. 6E), as well as some other pathways, such as angiogenesis, mismatch-related features, and stromal-related features (Fig. 6F), there were significant differences between the high and low-risk groups of SKCM patients (P < 0.05).
3.6 Correlation analysis of the risk score and clinicopathological characteristics
We assessed the correlation between the risk score and the clinicopathological characteristics of SKCM. The analysis results showed that the low-risk score was correlated with TMECluster-A classification (P < 0.001; Fig. 7A) and metastatic SKCM (P < 0.001; Fig. 7B); at the same time, it was correlated with gender (P = 0.031; Fig. 7E), Pathological stage (P = 0.0027; Fig. 7F) and T stage (P < 0.001; Fig. 7G) were significantly correlated. There was no significant correlation between the risk score and TMB, age, M stage and N stage (P > 0.05; Fig. 7C, D, H and I).
3.7 Application value of the risk score in pan-cancer
Correlation analysis showed that the expression of 13 hub genes in different tumors has a significant correlation (Fig. 8A). The heterogeneity between tumors caused the risk score have different prognostic effects on different cancers (Fig. 8B). Univariate and multivariate Cox analysis showed that the risk score is an independent risk factor for predicting the prognosis of SKCM patients (Table 3; Fig. 8C). We included the index of P < 0.05 in the multivariate Cox model to construct a nomogram to predict the OS of 1, 2, 3-year survival probability in SKCM patients.The C-indexes (0.732 (95% CI: 0.697–0.767)) of the nomogram was used to calculate the discriminative ability of the nomogram, showing a high degree of discrimination. Moreover, the calibration showed a great agreement between the 1-year, 2-year, and 3-year OS estimates with the actual observation values of SKCM patients by comparing the nomogram (Fig. 8E).
3.8 The prognostic value of hub genes in SKCM patients
We focused on the potential prognostic value of EDN3, CLEC4E, SRPX2, KIR2DL4, UBE2L6, IFIT2 as an immunoscore for melanoma patients.In the group of novel hub gene, low levels of UBE2L6、KIR2DL4、IFIT2 and CLEC4E expression were observed in tumor cells, while SRPX2 and EDN3 were highly expressed in SKCM. However, only UBE2L6 and IFIT2 have significant expression differences in the TCGA-SKCM cohort. This may require a larger sample size to verify the differential expression of these molecules in SKCM. The open online tool GEPIA was used to analyze the prognostic values of these novel hub gene. We found low expression of UBE2L6, KIR2DL4, IFIT2 and CLEC4E were significantly associated with poor prognosis in SKCM.
2.9 Ube2L6, SRPX2, IFIT2 were higher expression in IHC melanoma specimens
We evaluated these gene expressions in melanoma tissues. The IHC staining results showed that Ube2L6, SRPX2, IFIT2 were higher expression while CLEC4E, END3, KIR2DL4 were lower expression in 25 melanoma specimens.(Fig. 9)