2.1 Cuproptosis-related lncRNA identification and establishment of a prognostic model
Sixteen cuproptosis-related genes were determined from TCGA-LUSC samples. Based on the |R|>0.4 and P < 0.001 criteria, the 16,874 lncRNAs were filtered out to obtain 295 Cuproptosis-related lncRNAs. Visualization of the co-expression relationships of the genes and lncRNAs linked with cuproptosis was executed by Sankey diagrams (Fig. 1), where the genes and lncRNAs association was determined through their linkage. The 295 cuproptosis-related lncRNAs obtained above were subjected to one-way Cox regression analysis, yielding ten Cuproptosis-related lncRNAs with prognostic relevance (Fig. 2A). The eight lncRNAs with optimal efficacy for the prognostic model with the minimal error were obtained by Lasso regression ten-fold cross-validation (Fig. 2B, 2C). Afterward, a multivariate Cox analysis was executed to screen five cuproptosis-related lncRNAs that had the potential to predict the prognosis independently. Risk score was calculated as follows: Risk score = (0.50905619755965×AC010328.1 expression) + (0.255153440600426×LINC01740 expression) + (-0.37749067013803×AL358613.2 expression) + (0.538100881107615× MIR3945HG expression) + (-0.372555984713978×AC002467.1 expression).
2.2 Assessment of the prognostic model and value validation
The clinically relevant features did not depict any statistical difference in the training and test groups (p > 0.05), indicating no bias in clinical characteristics in the TCGA-LUSC sample subgroup (Table 3). The correlation between the cuproptosis-related genes and cuproptosis-related lncRNAs involved in the prognostic model in the TCGA dataset was assessed (Fig. 3). Kaplan-Meier survival analysis of the training group, the test group, and the whole dataset depicted that the group with lesser risk had better prognosis regarding the LUSC patients in contrast to the group with greater risk, with the two depicting considerable variations in OS and PFS (Fig. 4A-D). According to the risk curve, the risk score’s median value was 1, which could differentiate between patients of high- and low-risk (Fig. 5A, 5D, 5G). As the risk value rises in the survival status plot, the number of LUSC fatalities increases, while the survival time decreases (Fig. 5B, 5E, 5H). According to the results of the heat map, increased expression of AC010328.1, LINC01740, and MIR3945HG in the high-risk group, and increased AL358613.2 and AC002467.1 in the low-risk group were detected (Fig. 5C, 5F, 5I). The aforementioned data was also documented in the training group, the test group, and the entire dataset. Univariate and multivariate independent prognostic analyses depicted that the prognostic model, staging, and age have the potential to be utilized in prognosis predicting independent factors (Fig. 6A-B). The 1-, 3-, and 5-year survival AUC values for LUSC patients under risk score reached above 0.6, demonstrating the superior sensitivity and specificity of the model in the prediction of LUSC patients’ survival (Fig. 7A). The prognostic model had higher sensitivity and accuracy compared with the traditional clinicopathological characteristics (Fig. 7B). In addition, the C-index value of the risk score was also the largest, indicating the high agreement of the prognostic model-predicted patient survival with the actual clinical observation (Fig. 7C).
Table 3
Clinical statistical analysis of the subgroups
Covariates | Type | Total | Test | Train | P-value |
---|
Age | ≤ 65 | 189 (38.18%) | 58 (39.19%) | 131 (37.75%) | 0.828 |
༞65 | 300 (60.61%) | 88 (59.46%) | 212 (61.1%) | |
Unknown | 6 (1.21%) | 2 (1.35%) | 4 (1.15%) | |
Gender | Female | 129 (26.06%) | 43 (29.05%) | 86 (24.78%) | 0.3794 |
Male | 366 (73.94%) | 105 (70.95%) | 261 (75.22%) | |
Stage | Stage I | 242 (48.89%) | 76 (51.35%) | 166 (47.84%) | 0.2613 |
Stage II | 159 (32.12%) | 38 (25.68%) | 121 (34.87%) | |
Stage III | 83 (16.77%) | 29 (19.59%) | 54 (15.56%) | |
Stage IV | 7 (1.41%) | 2 (1.35%) | 5 (1.44%) | |
Unknown | 4 (0.81%) | 3 (2.03%) | 1 (0.29%) | |
T | T1 | 114 (23.03%) | 38 (25.68%) | 76 (21.9%) | 0.6376 |
T2 | 288 (58.18%) | 86 (58.11%) | 202 (58.21%) | |
T3 | 70 (14.14%) | 17 (11.49%) | 53 (15.27%) | |
T4 | 23 (4.65%) | 7 (4.73%) | 16 (4.61%) | |
M | M0 | 407 (82.22%) | 129 (87.16%) | 278 (80.12%) | 1 |
M1 | 7 (1.41%) | 2 (1.35%) | 5 (1.44%) | |
Unknown | 81 (16.36%) | 17 (11.49%) | 64 (18.44%) | |
N | N0 | 316 (63.84%) | 95 (64.19%) | 221 (63.69%) | 0.2314 |
N1 | 128 (25.86%) | 33 (22.3%) | 95 (27.38%) | |
N2 | 40 (8.08%) | 17 (11.49%) | 23 (6.63%) | |
N3 | 5 (1.01%) | 2 (1.35%) | 3 (0.86%) | |
Unknown | 6 (1.21%) | 1 (0.68%) | 5 (1.44%) | |
2.3 Construction of the prognostic nomogram and PCA
Seven factors including risk class and clinicopathological characteristics were included to construct a nomogram for survival prediction. A LUSC patient (ID: TCGA-98-A53C) was used as an example to plot the nomogram (Fig. 8A), wherein the 1-, 3-, and 5-year predicted survival rates of the patient were similar to the observed overall survival rate (Fig. 8B). The model constructed based on lncRNAs was further validated by PCA to have good grouping ability (Fig. 8F), and the distribution of high- and low-risk groups obtained based upon all genes as well as cuproptosis-related genes and lncRNAs was relatively dispersed (Fig. 8C-E). These data suggest that the model constructed using lncRNAs can effectively differentiate between low- and high-risk groups.
2.4 Enrichment analysis and immune-related functional analysis
GO analysis demonstrated the enrichment of DEGs in the two risk groups at three levels: biological processes, cellular components, and molecular functions. Enrichment of most of the DEGs was detected in eicosanoid metabolic process, phagocytosis, humoral immune response, recognition, and other processes. They are distributed on the plasma membrane’s external side immunoglobulin complex, the anchored component of the membrane and other cellular components, performing oxidoreductase activity, carboxylic acid binding, monocarboxylic acid binding, acting on the CH-OH group of donors, and other functions (Fig. 9A, Table 4). KEGG analysis suggests the association of these DEGs to the effects on xenobiotics metabolism by cytochrome P450, drug metabolism-cytochrome P450, tyrosine metabolism, and other pathways, suggesting the involvement of these DEGs in drug metabolism (Fig. 9B, Table 5). According to immune-related function analysis, such functions were significantly active in the group with greater risk (P < 0.05), except for the inhibition of the tumor suppressor gene (APC-co-inhibition), which was not significantly different between the risk groups (Fig. 9C).
Table 4
Category | GO ID | Term | P-values | Related genes |
---|
GO-BP | GO:0006959 | humoral immune response | 1.23E-07 | 21 |
| GO:0006690 | icosanoid metabolic process | 1.04E-06 | 12 |
| GO:0006910 | phagocytosis, recognition | 1.28E-06 | 11 |
| GO:0001523 | retinoid metabolic process | 1.56E-06 | 10 |
| GO:0008037 | cell recognition | 1.89E-06 | 16 |
| GO:0016101 | diterpenoid metabolic process | 2.17E-06 | 10 |
GO-CC | GO:0019814 | immunoglobulin complex | 8.07E-10 | 18 |
| GO:0009897 | external side of plasma membrane | 1.26E-07 | 26 |
| GO:0031225 | anchored component of membrane | 1.61E-06 | 14 |
| GO:0045177 | apical part of cell | 5.88E-05 | 20 |
| GO:0016324 | apical plasma membrane | 6.12E-05 | 18 |
| GO:0042599 | lamellar body | 0.000179 | 4 |
GO-MF | GO:0033293 | monocarboxylic acid binding | 1.21E-08 | 12 |
| GO:0031406 | carboxylic acid binding | 4.73E-08 | 16 |
| GO:0016614 | oxidoreductase activity, acting on CH-OH group of donors | 8.32E-07 | 13 |
| GO:0016616 | | 1.99E-06 | 12 |
| GO:0038024 | cargo receptor activity | 7.87E-06 | 9 |
| GO:0032052 | bile acid binding | 2.49E-05 | 4 |
Table 5
Database type | ID | Description | P-values | Count |
---|
KEGG | hsa00980 | Metabolism of xenobiotics by cytochrome P450 | 2.64E-08 | 12 |
| hsa00350 | Tyrosine metabolism | 4.66E-06 | 7 |
| hsa00982 | Drug metabolism - cytochrome P450 | 9.15E-06 | 9 |
| hsa04514 | Cell adhesion molecules | 4.95E-05 | 12 |
| hsa00590 | Arachidonic acid metabolism | 0.000162 | 7 |
| hsa00360 | Phenylalanine metabolism | 0.000205 | 4 |
| hsa05204 | Chemical carcinogenesis - DNA adducts | 0.000352 | 7 |
| hsa04640 | Hematopoietic cell lineage | 0.000638 | 8 |
| hsa00480 | Glutathione metabolism | 0.00077 | 6 |
| hsa00830 | Retinol metabolism | 0.00195 | 6 |
2.5 TMB analysis and potential drug screening
In contrast to the group with higher risk values, the less risky group depicted slightly more frequent mutation of most genes (98.41% vs 96.88%) (Fig. 10A). In addition, tumor mutation burden did not depict a considerable difference in the two risk groups (P = 0.83). Though, differences in survival rates were detected between the high- and low-TMB groups, with individuals with high TMB having a remarkably better prognosis than those with low TMB (P < 0.05) (Fig. 10B). The combined TMB and risk score depicted considerably longer median survival time in the high-TMB low-risk group (P < 0.05) (Fig. 10C). Additionally, the TIDE algorithm further revealed greater TIDE scores in the group with higher risk than in the less risky group, depicting a higher risk of immune escape and worse immunotherapeutic outcomes in patients in the high-risk group (P < 0.05) (Fig. 11A). Finally, all antitumor drugs known to be used in clinical settings were screened to analyze the sensitivity of these drugs treating LUSC. The IC50 values of quizartinib (AC220), dasatinib, phenformin, rapamycin, and sunitinib were reduced in the groups with greater risk, implying the high sensitivity of these drugs in high-risk LUSC patients (Fig. 11B-F).