3.1 DE-MRFG in LC was associated with oxidation-reduction and cancer
We screened 14,742 DEGs (Table S2), including 431 up- and 1,065 down-regulated genes, between LC and control samples in the TCGA-HNSC dataset (Fig. 1A). The top 10 DEGs for up- and down-regulated gene expressions were drawn into the heatmap (Fig. 1B). Then, we obtained 5,832 LC-related genes using WGCNA. Combined with 5,832 LC-related genes, 14,742 DEGs, and 567 ferroptosis-related genes, an intersection gene, 97 DE-FG in LC, was obtained (Fig. 1C, Table S3). Among them, 83 genes are related to m6A through correlation analysis between DE-FG and m6A-related genes, named DE-MRFG. GO and KEGG enrichment analyses demonstrated the molecular mechanism of DE-MRFG. Figures 1D–F show that DE-MRFG is enriched in iron ion transport, NADPH oxidase complex, and superoxide-generating NAD(P)H oxidase activity. Combined with the KEGG analysis results (Fig. 1G), the function and signaling pathways of DE-MRFG were enriched in oxidation-reduction and cancer, such as unsaturated fatty acid biosynthesis and thyroid cancer.
3.2 A risk model based on m6A-regulated ferroptosis genes
Univariate Cox regression results indicated that three genes, including TFRC, RGS4, and FTH1, are related to the survival state of patients with LC (Fig. 2A). Based on these three genes, LASSO regression established a risk model using the LASSO regression formula: Risk score = 0.097 × TFRC + 0.269 × RGS4 + 0.303 × FTH1. This formula assigned a risk score to each LC sample in the TCGA-HNSC dataset (108 samples with survival information). We divided all LC samples into two groups with an optimal risk score threshold of 3.44. In GSE65858, the risk score of LC samples was calculated and classified into two groups according to the optimal risk score threshold of 5.88 to verify the risk model. Additionally, KM survival analysis suggested that the low-risk group usually combined with a better survival state in the TCGA and GSE65858 datasets (Figs. 2B-C). ROC analysis demonstrated that the risk model accurately predicted OS (Figs. 2D-E). The survival state distribution maps (Figs. 2F-G) suggest that shorter survival times and more deaths are usually combined with higher risk scores. The three gene expressions are shown in the heatmap. The risk score in this heatmap was sorted ascending from the left to the right (Figs. 2H–I). These three genes showed higher expression in the high-risk groups.
3.3 Risk model was related to skin and protein interaction
To further investigate the potential influence of DEGs in the high- and low-risk groups, 5,754 DEGs, including 2,225 up- and 3,529 down-regulated genes, were found (Fig. 3A, Table S4). GO and KEGG enrichment analysis revealed that the most enriched GO items and pathways were ‘epidermis development,’ ‘collagen-containing extracellular matrix,’ ‘receptor ligand activity,’ and ‘neuroactive ligand-receptor interaction’ (Figs. 3B-C).
3.4 The performance of the risk model in different clinical characteristics
The distribution of LC samples with different risk scores in clinical characteristics indicated that the risk scores of patients with LC have no significant differences in most features, such as gender, tumor stage, pathological N stage, and pathological T stage, except for age (Fig. 4A). Figure 4B displays the TFRC, RGS4, and FTH1 expressions in different clinical characteristics. Furthermore, the KM survival curve of different groups with different clinical characteristics demonstrated that people in the low-risk group had better OS in males, older age, tumor stage III–IV, pathological N2_N3, pathological T2, and pathological T3 (Figs. 4C–H). Figure S1 depicts the results without significant differences.
3.5 Establishment of a nomogram
Univariate Cox regression analysis suggested that risk score, gender, and Pathologic_N were associated with the OS of patients with LC (Fig. 5A). According to multivariate Cox regression analysis (Fig. 5B), risk score, gender, and Pathologic_N were considered independent prognostic factors to build a nomogram (Fig. 5C) that can predict the OS of patients with LC. Calibration curves and ROC analyses proclaimed the good comportment of nomograms for predicting OS (Figs. 5D–G).
3.6 The risk model predicted the effect of immune therapy
SsGSEA was used to determine immune infiltration in the LC. Figure 6A reveals that the ssGSEA scores of six immune cells, including neutrophils, mast cells, and Memory B cells, differed significantly between the high- and low-risk groups. Additionally, Figs. 6B–D present the relationship between the three prognostic genes (TFRC, RGS4, and FTH1) and the six immune cells. Prognostic genes were positively related to the other five immune cells, except for neutrophils. Finally, stromal, immune, and ESTIMATE scores were calculated (Figs. 6E-F). The stromal and ESTIMATE scores were dramatically higher in the high-risk group than in the low-risk group.
To probe the mutation in different groups, the ‘maftools’ package was used. Figures 6G–H depict the top 20 mutant genes. The High-risk group had 3,905 mutated genes, which were present in all samples (Fig. 6G). The low-risk group contained 9,030 genes, with mutations occurring in 98.85% of the samples (Fig. 6H). TP53 had the highest mutation rate in both groups.
Based on the association analyses of immune checkpoints and risk score, only PD-L1 had a significantly positive relationship with risk score (Fig. 6I), with dramatically up-regulated expression in a high-risk group (Fig. 6J). KM survival analysis indicated better OS of patients with high-expressed PD-L1, although the result did not have statistical significance (Fig. 6K). Then, the TIDE, dysfunction, and exclusion scores of all LC samples were calculated. Moreover, the relationship between immunotherapy response and risk score was analyzed. Figures 6L-M indicate a significantly positive relationship between the risk score, TIDE, and exclusion score.
3.7 Chemotherapy drugs were related to the risk model
The IC50 values of 20 chemotherapeutics were calculated for each LC sample to estimate the sensitivity of chemotherapy drugs in different groups. Nine drugs, such as Cisplatin, had a significantly higher IC50 in the high-risk group than in the low-risk group, while 10 drugs, such as Erlotinib, had a lower one (Fig. 7). Moreover, the correlation analysis between risk score and IC50 in 19 drugs indicated a strong relationship (Fig. 8). For example, Bosutinib was positively related to the risk score, while Docetaxel was negatively related to the risk score.
3.8 Expression of TFRC, RGS4 and FTH1 in laryngeal carcinoma
In this study, we analyzed the TFRC, RGS4, and FTH1 expression levels in LC cell lines and NHBEC. Our findings revealed a significant increase in TFRC, RGS4, and FTH1 expressions in these cell lines than in NHBEC (Figs. 9A-B). Additionally, we examined 30 pairs of laryngeal carcinomas and paracancerous tissues and utilized qRT-PCR to assess TFRC, RGS4, and FTH1 expression. The results demonstrated significantly higher TFRC, RGS4, and FTH1 expression in laryngeal carcinoma tissues than in paracancerous tissues (Fig. 9C). Furthermore, KM analysis demonstrated a significant correlation between TFRC and FTH1 expression and patient prognosis, whereas RGS4 expression was not significantly correlated with patient prognosis (Fig. 9D).