Identification of Cuproptosis-related LncRNA in STAD Patients
Using the GENCODE database, we identified a total of 16,876 LncRNAs in the TCGA-STAD dataset. We referenced a total of 19 cuproptosis-related genes collected in the literature3. A total of 430 cuproptosis-related LncRNAs were screened by Pearson correlation test. In Fig. 1A, Sankey diagrams illustrate the relationship between cuproptosis-related genes and LncRNAs. Next, univariate Cox regression analysis was used to explore cuproptosis-related LncRNAs (p < 0.05), and we screened a total of 18 cuproptosis-related LncRNAs (Fig. 1B). We divided 407 STAD patients into training and validation groups, and the corresponding clinical traits of the two groups is shown in Table 2. There were no differences between the training and validation groups in clinical traits.
Table 2
Clinical characteristics of STAD patients in the TCGA database
Variable | Entire cohort (n = 371) | Training cohort (n = 186) | Validation cohort (n = 185) | p value |
Age | | | | |
≤ 65 | 163(43.94%) | 84(45.16%) | 79(42.7%) | 0.7438 |
༞65 | 205(55.26%) | 101(54.3%) | 104(56.22%) | - |
unknow | 3(0.81%) | 1(0.54%) | 2(1.08%) | - |
Gender | | | | |
FEMALE | 133(35.85%) | 70(37.63%) | 63(34.05%) | 0.5414 |
MALE | 238(64.15%) | 116(62.37%) | 122(65.95%) | - |
Grade | | | | 0.1437 |
G1 | 10(2.7%) | 8(4.3%) | 2(1.08%) | - |
G2 | 134(36.12%) | 66(35.48%) | 68(36.76%) | - |
G3 | 218(58.76%) | 105(56.45%) | 113(61.08%) | - |
unknow | 9(2.43%) | 7(3.76%) | 2(1.08%) | |
Stage | | | | 0.266 |
Stage I | 50(13.48%) | 24(12.9%) | 26(14.05%) | - |
Stage II | 111(29.92%) | 50(26.88%) | 61(32.97%) | - |
Stage III | 149(40.16%) | 77(41.4%) | 72(38.92%) | - |
Stage IV | 38(10.24%) | 24(12.9%) | 14(7.57%) | - |
unknow | 23(6.2%) | 11(5.91%) | 12(6.49%) | - |
T | | | | 0.6726 |
T1 | 18(4.85%) | 8(4.3%) | 10(5.41%) | - |
T2 | 78(21.02%) | 40(21.51%) | 38(20.54%) | - |
T3 | 167(45.01%) | 80(43.01%) | 87(47.03%) | - |
T4 | 100(26.95%) | 55(29.57%) | 45(24.32%) | - |
unknow | 8(2.16%) | 3(1.61%) | 5(2.7%) | - |
M | | | | 0.3983 |
M0 | 328(88.41%) | 161(86.56%) | 167(90.27%) | - |
M1 | 25(6.74%) | 15(8.06%) | 10(5.41%) | - |
unknow | 18(4.85%) | 10(5.38%) | 8(4.32%) | - |
N | | | | 0.6169 |
N0 | 108(29.11%) | 48(25.81%) | 60(32.43%) | - |
N1 | 97(26.15%) | 51(27.42%) | 46(24.86%) | - |
N2 | 74(19.95%) | 39(20.97%) | 35(18.92%) | - |
N3 | 74(19.95%) | 36(19.35%) | 38(20.54%) | - |
unknow | 18(4.85%) | 12(6.45%) | 6(3.24%) | - |
Construction and Validation of a Risk Score Model for STAD Patients Based on the Cuproptosis-related LncRNA Prognostic Signature
The 18 cuproptosis-related lncRNAs from the previous step were further screened by LASSO Cox regression analysis, resulting in 15 lncRNAs screened in total. The trajectory changes in regression coefficients of lncRNAs were identified, as were the cross-validation results of model construction (Fig. 2a, b). Eight lncRNA signatures linked to cuproptosis were screened using multiple stepwise Cox regression analysis, and risk score models were built using these signatures. To assess the correlation between cuproptosis-related genes and prognostic signatures, we plotted a heatmap (Fig. 2c). Risk score = (0.612310337796917* Expression AC005050.3) + (1.66740693091285* Expression C5orf66) + (0.398655774690665* Expression HAGLR) + (-0.485279771224916* Expression TDRKH−AS1) + (0.549647873930273* Expression LINC01094) + (-0.525617348023318* Expression BX890604.1) + (-0.680536241533385* Expression AC016394.2) + (-0.483203374377389* Expression AL606970.1). Based on the median risk score, we divided STAD patients into high-risk and low-risk groups and analyzed the OS of patients in the high- and low-risk groups in the training and validation cohorts using the Kaplan‒Meier method. In the high-risk group, OS was significantly lower than in the low-risk group (P < 0.05; Fig. 2d, e).
Figure 3 shows a positive correlation between the risk score and the number of deaths and a negative correlation with patient survival time. Figure 3a-c shows the relationship between the risk score and patient survival status. We see an increasing number of patient deaths as the risk score increases. The expression levels of eight cuproptosis-related prognostic signatures in the different risk groups were plotted using heatmaps (Fig. 3d-f). PCA also revealed significant differences between the high- and low-risk groups. Based on the risk model constructed with cuproptosis-related lncRNAs, we observed that STAD patients were clearly divided into two groups (Fig. 4a-d).
Cuproptosis-related LncRNA Prognostic Signatures are Independent Prognostic Factors in STAD Patients
We used univariate and multivariate Cox regression analyses to assess the prognostic value of the risk model. In univariate and multivariate Cox regression analyses, age, stage, and risk score were prognostically relevant independent risk factors (P < 0.05; Fig. 5a, b). Next, we further evaluated the accuracy of the risk score, age, sex, tumor grading and staging model in predicting patient prognosis using ROC and C-index curves. As shown in Fig. 5c and 5d, the risk score model was more accurate than the other clinical prediction models, indicating that this risk score model was more predictive of patient prognosis and was more correlated with patient prognosis. Subsequently, we further evaluated the accuracy of the model for predicting 1-, 3-, and 5-year survival in STAD patients using TimeROC curves. As shown in Fig. 5e, the AUC was 0.721 at 1 year, 0.645 at 3 years, and 0.713 at 5 years, which indicated that the model was accurate. In addition, PFS showed a statistically significant difference in progression-free survival between the high-risk and low-risk groups (P < 0.05; Fig. 5f), which further revealed that the model has good diagnostic value for predicting patient prognosis.
Nomograms have become a common predictive tool for analyzing patients' clinical outcomes. Based on the risk score and some clinicopathological features, we produced a nomogram (Fig. 6a). We found that age and risk score were significantly associated with patient prognosis (P < 0.001; Fig. 6a). In the subsequent calibration plots, the calibration plots matched well with the nomogram predictions (Fig. 6b).
Relationship Between Prognostic Signature and Clinicopathological Features in Stomach Adenocarcinoma
Using the above univariate and multivariate ANOVAs, we found that age and tumor stage were associated with patient prognosis. To explore the clinical value of the risk score model, we explored the relationship between the risk score model and age and tumor stage. The results indicated that there were significant differences between the distribution of risk score and age and tumor stage. In terms of age, there was a statistically significant difference between patients aged ≤ 65 years and > 65 years (p < 0.05; Fig. 7a, b); in terms of tumor stage, there was a statistically significant difference between stage I-II and stage III-IV (p < 0.05; Fig. 7c, d).
Functional Enrichment Analysis
We used the "Limma" package in R to screen the DEGs in the high-risk and low-risk groups for GO and KEGG enrichment analysis. There were 374 DEGs identified. Based on GO analysis, DEGs were mainly related to muscle system process, muscle contraction, and regulation of muscle system process in the category of biological process. In the category of cellular components, DEGs were mainly focused on collagen − containing extracellular matrix, immunoglobulin complex and contractile fiber. In the molecular function category, DEGs were mainly involved in glycosaminoglycan binding, heparin binding, and extracellular matrix structural constituent (Fig. 8a, c, e). KEGG enrichment analysis revealed that DEGs were enriched in vascular smooth muscle contraction, dilated cardiomyopathy, focal adhesion, and hypertrophic cardiomyopathy (Fig. 8b, d, f).
Evaluation of Immune Cell Infiltration and Immunotherapy in Stomach Adenocarcinoma
As a first step, we examined the association between cuproptosis-related lncRNA prognostic signatures and immune cell infiltration. We performed ssGESA analysis on TCGA-STAD data and plotted the results as a heatmap. As a result of a correlation analysis between immune cell populations and related functions, Type_II_IFN_Reponse, Parainflammation, Type_I_IFN_Reponse, APC_co_inhibition, T_cell_co-inhibition, Check-point, T_cell_co-stimulation, Cytolytic_activity, Inflammation-promoting, HLA, APC_co_ stimulation, and CCR were abundantly expressed in the high-risk group (Fig. 9a). This suggests that cuproptosis-related prognostic signatures are associated with immune cell infiltration. To assess the effects of immunotherapy in the high- and low-risk groups, we also used the TIDE score. Figure 9b shows that the TIDE scores were significantly higher in the high-risk group than in the low-risk group, indicating that the high-risk group was more susceptible to immune escape and weaker to immunotherapy.
Tumor Mutational Burden of the Cuproptosis-related lncRNA Prognostic Signature in Stomach Adenocarcinoma
We calculated TMB scores in different risk groups using STAD somatic mutation data from TCGA. Compared to the high-risk group, the low-risk group had a higher mutation burden (Fig. 9c). The mutation rate in the low-risk group was 91.28%, whereas in the high-risk group, it was 88.6%, with the top three mutated genes being TTN, TP53, and MUC16 (Fig. 9f, g). Analyzing the patient mutation data, we found that TMB was higher in the low-risk group than in the high-risk group, suggesting that the low-risk group population is more likely to benefit from immunotherapy. Subsequently, we classified STAD patients into "high-TMB" and "low-TMB" groups according to the median cutoff point and performed survival analysis. As shown in Fig. 9d, patients with "high TMB" also had a greater survival rate. Next, we performed a combined survival analysis based on tumor mutation load and risk score. The results showed that low-TMB patients with low risk had the highest survival probability, while high-TMB patients with low risk had the lowest survival probability (Fig. 9e).
Drug Sensitivity
To evaluate the use of the cuproptosis-related prognostic signature in the treatment of STAD, we explored the relationship between the risk score and drug IC50 in the treatment of STAD. We compared the sensitivity of 138 anticancer drugs in the high- and low-risk groups using the "pRRophetic" package in R. A total of 23 drugs were found to have different sensitivities in the high- and low-risk groups. The IC50 of 16 drugs in the low-risk group was low (Fig. 10), and the positive correlation between risk score and IC50 is shown in more detail in Additional file 1. The IC50 of seven drugs in the high-risk group was lower (Fig. 11), and the negative correlation between risk score and IC50 is described in more detail in Additional file 2. As a result, the low-risk group may be more sensitive to drug therapy, and seven drugs represented by dasatinib may play a role in the future treatment of advanced STAD.
Expression of Cuproptosis-related lncRNA Prognostic Signature in GC
To further investigate the expression of eight cuproptosis-related prognostic markers, we observed the differences in the expression of eight lncRNAs in gastric cancer cell lines (MKN-45, SGC-7901 and HCG-27) and human gastric epithelial cells (GES-1). According to quantitative real-time polymerase chain reaction (qRT‒PCR) analysis, eight cuproptosis-related prognostic markers were significantly differentially expressed in gastric cancer cells and GES-1 cells (P < 0.05) (Fig. 12). In gastric cancer cells, all cuproptosis-related prognostic markers except for LINC01094 were significantly increased. The expression of LINC01094 was decreased in MKN-45 cells compared with normal gastric epithelial cells and increased in SGC-7901 and HCG-27 cells compared with normal gastric epithelial cells. Different species of gastric cancer cells may be responsible for this difference. In conclusion, the expression profiles of the eight cuproptosis-related prognostic markers were generally consistent with previous analyses, suggesting a possible important role in stomach adenocarcinoma.