Differential expression analyses of pyroptosis-related genes in the NSCLC patients
We identified 4851 and 6513 differentially expressed genes (DEGs) in the LUAD and LUSC cohorts, respectively, of which the down-regulated genes accounted for 60.17% and 55.86%, respectively (Fig. 2A). Moreover, 40 and 45 pyroptosis-related genes were identified in the LUAD and LUSC cohorts, respectively, and 33 pyroptosis-related genes were found in both cohorts (Figs. 2B-C). Interestingly, these common pyroptosis-related genes exhibited similar differential expression patterns in the LUAD and LUSC cohorts, with 57.58% of the genes being down-regulated in tumor tissues. Previous studies have shown that the expression of GSDMC gene can be used as a prognostic indicator in LUAD patients, which can be utilized in the future treatment of lung cancers23. In addition, downregulation of GSDMD gene was found to inhibit lung cancer tumor proliferation26. Therefore, the expression of this gene is an effective prognostic indicator for NSCLC. Functional enrichment analyses showed that these genes were principally enriched in cellular production of functional proteins and regulation of transcription activity (Fig. 2D), such as positive regulation of cytokine production, regulation of interleukin-1β production, regulation of DNA-binding activities of transcription factors, and regulation of interleukin-1 production, among which interleukin-1β plays an essential role in pyroptosis.
Co-expression network analyses of pyroptosis-related genes and the DEGs identified in the LUAD and LUSC cohorts
Subsequently, we tried to identify correlated modules between the identified DEGs and the N&T tissues of the LUAD and LUSC cohorts and constructed a co-expression network using the pyroptosis-related genes and the identified DEGs with WGCNA27 tools.
Our results indicated that the N&T tissues of the LUAD and LUSC cohorts were closely correlated with the DEGs, with a Pearson correlation coefficient r of > 0.5 (Fig. 3). For the LUAD cohort, we identified three gene modules (turquoise, red, and purple modules) associated with the tumor tissues, and the corresponding Pearson correlation coefficients were r = -0.71 (P-value < 5e-06), r = 0.57 (P-value < 0.002) and r = 0.52 (P-value < 3e-08), respectively. Besides, a module (red) associated with the normal tissues (Fig. 3A) was identified for the LUAD cohort. These four modules contained eleven differentially expressed pyroptosis-related genes. For the LUSC cohort, we identified two modules (turquoise and green modules) related to the tumor tissues, and the corresponding Pearson correlation coefficients were r = -0.73 (P-value < 5e-07) and r = 0.56 (P-value < 1e-02), respectively. A blue module relevant to the normal tissues was identified for the LUSC cohort (Fig. 3B). Ten differentially expressed pyroptosis-related genes were involved in the three modules for the LUSC cohort.
The findings of our co-expression network analysis for the LUAD cohort are summarized in Fig. 3C. Four gene modules were associated with the N&T tissues in the LUAD cohort. Notably, there were three key pyroptosis-related genes (namely GSDMB, GSDME and CARD16) in the turquoise module, four key pyroptosis-related genes (namely GSDMA, GSDMD, NLRP3 and CASP3) in the red module, and four key pyroptosis-related genes in the magenta module (namely GSDMC, IL6, NLRP14 and PJVK). Most DEGs in the co-expression modules were downregulated. The proportions of downregulated genes in the red, magenta and turquoise modules were 66.7%, 58.3% and 55.6%, respectively. Notably, 63.6% of the pyroptosis-related genes were downregulated. Our co-expression network analysis of the LUSC cohort (Fig. 3D) revealed that there were three gene modules associated with the N&T tissues. There were three pyroptosis-related genes (namely TLR4, TXNIP and TLR3) in the blue module, four pyroptosis-related genes in the turquoise module (namely GSDMB, GSDMD, GSDME and PLCG1), and three pyroptosis-related genes in the green module (namely IFI16, CASP5 and GSDMC). In addition, the proportions of downregulated genes in the blue, turquoise and green modules were 60%, 64% and 55.6%, respectively.
These results indicated that downregulation of the pyroptosis-related genes and the DEGs played an essential role in the development of LUAD and LUSC. In each module, one pyroptosis-related gene interacted with almost all the DEGs. This finding indicated that the pyroptosis-related genes were key regulators of the DEGs during the development of LUAD and LUSC. Furthermore, we noted that five gasdermin genes (GSDMA/B/C/D/E) and four gasdermin genes (GSDMB/C/D/E) were expressed in the tumor tissues of LUAD and LUSC patients, respectively. This indicated that these gasdermin genes might play important roles in the development of LUAD and LUSC. These results may provide a new perspective for identification of novel prognostic biomarkers and therapeutic targets for patients with LUAD and LUSC.
Verification of expression levels of the gasdermin proteins
In order to further verify the correlations between the gasdermin family proteins and NSCLC, we analyzed the protein levels of gasdermin family proteins in the N&T tissues of LUAD and LUSC patients based on the HPA database (Fig. 4). The results showed moderate GSDMB/C/E expression, negative GSDMA expression and strong GSDMD expression in normal tissues. The numbers of NSCLC patients with gasdermin proteins being expressed in their tumor tissues were GSDMA: 11 cases (negative:10 cases and weak: 1 case); GSDMB: 8 cases (negative: 6 cases and weak: 2 cases); GSDMC: 11 cases (negative: 7 cases, strong: 2 cases, and moderate: 2 cases); GSDMD: 10 cases (negative: 3 cases, weak:3 cases, strong: 2 cases, and moderate: 2 cases); and GSDME: 12 cases (negative: 8 cases, weak: 3 cases, and negative: 1 case).
The above results showed that GSDMC was significantly upregulated in tumor tissues compared with the corresponding normal tissues, indicating a positive correlation between the expression of GSDMC and lung cancer development. Compared with GSDMC, GSDMA/B/D/E were expressed at much lower levels in tumor tissues, which is consistent with the WGCMA results. This also proved the credibility of our identification of the module genes.
Association between the aberrant expression of gasdermin genes and tumor stages in the NSCLC patients
Next, we used the UALCAN database to compare the expression levels of gasdermin genes between 515 primary tumor and 59 normal tissue samples from the LUAD cohort, and between 503 primary tumor and 52 normal tissue samples from the LUSC cohort. The expression levels of GSDMA/B/C/E genes were significantly higher in primary NSCLC tumor tissues than in the corresponding normal tissues, especially those of GSDMB/C genes (Figs. 5A-E). Furthermore, the expression level of GSDMD was higher in normal tissues than in primary tumor tissues in the LUSC cohort, and there was no significant difference in GSDMD expression between primary tumor tissues and normal tissues in the LUAD cohort. Using the GEPIA database, we also analyzed the correlations between the expression levels of gasdermin genes and the NSCLC patients’ tumor stages (Fig. S1). The expression level of GSDMB was significantly correlated with the NSCLC patients’ tumor stages, and NSCLC patients with earlier tumor stages tended to express a higher level of GSDMB. In order to comprehensively investigate the aberrant expression of the gasdermin family genes in the NSCLC patients, we also compared the relative expression levels of gasdermin family genes in the LUAD and LUSC tissues (Fig. 5F). Interestingly, GSDMD had the highest expression levels among gasdermin family genes in the LUAD and LUSC tissues.
These results indicated that the gasdermin family genes had significantly different expression levels in the LUAD and LUSC tissues. Meanwhile, the high expression of GSDMB may affect the tumor stages of the NSCLC patients.
Survival analyses of the gasdermin genes in the NSCLC patients
By using data mining methods in the Kaplan–Meier Plotter database, we investigated the correlations between the expression of gasdermin family genes and the OS, FP, and PPS of the NSCLC patients. To do this, the NSCLC patients were divided into high-expression and low-expression groups based on the cutoff values in each cohort (Fig. 6 ).
Our results indicated that the expression levels of GSDMB/C/D genes were significantly correlated with the OS, FP and PPS of the NSCLC patients. In contrast, there was no significant correlation between GSDME expression and either the OS or the FP of the NSCLC patients, whereas and increased GSDME expression was associated with the PPS of the NSCLC patients. For GSDMB, the survival rates of patients in the high-expression group were higher than those of patients in the low-expression group, and the OS, FP and PPS of the NSCLC patients were positively correlated with GSDMB expression. These results suggest that GSDMB is a potential tumor suppressor gene. Unlike GSDMB, the expression levels of GSDMC/D showed negative correlations with the OS, FP and PPS of the NSCLC patients. Unfortunately, we were not able to establish a correlation between the expression level of GSDMA and the NSCLC patients’ survival based on the Kaplan–Meier Plotter database.
Analysis of the prognostic significances of gasdermin gene mutations
Heritable mutations are crucial targets of research on clinical medicine and cancer development28,29. In order to investigate the prognostic significances of gasdermin gene mutations, we performed mutation analyses on the five genes of GSDMA/B/C/D/E based on the cBioPortal online tool.
Gasdermin gene mutations were identified in 237 out of 515 patients with LUAD (46%) (Fig. 7A). GSDMD, GSDMC and GSDME had the highest mutation rates of 20%, 18%, and 14%, respectively. Besides, based on the expression levels of gasdermin genes (RNA-Seq V2 RSEM), we also calculated the Pearson’s correlation coefficients between gasdermin family via the cBioPortal online tool for the LUAD cohort (TCGA, Firehose Legacy). The results indicated an evident positive correlation between GSDMB, GSDMC, GSDMD and GSDME genes (Fig. 7B). Finally, we analyzed the relationships between gasdermin gene mutations and the OS of the NSCLC patients. Kaplan Meier plot analysis and log-rank test results revealed that gasdermin gene mutations were associated with a shorter OS (Fig. 7C, p = 5.668E-3) in the NSCLC patients.
By analyzing the prognostic significances of gasdermin gene mutations in the NSCLC patients, we obtained 150 genes whose expression levels were significantly associated with gasdermin gene mutations by using the “co-expression” module of cBioPortal. Subsequently, we used the GO and KEGG modules in DAVID to analyze the potential functions of these 180 genes, and the results are shown in Fig. 7D. We found that gasdermin gene mutations in NSCLC were significantly associated with biological processes (BP) terms, including GO: 0006954 (inflammatory response), GO: 0006955 (immune response), GO: 0045087 (innate immune response), GO: 0002250 (adaptive immune response) and GO: 0050900 (leukocyte migration). Furthermore, we also found that cellular components (CC) terms such as GO: 0016021 (integral component of membrane), GO: 0005886 (plasma membrane), GO: 0070062 (extracellular exosome), GO: 0005887 (integral component of plasma membrane) and GO: 0005615 (extracellular space) were significantly related to gasdermin gene mutations. Among molecular functions (MF) terms, however, only GO: 0044389 (ubiquitin-like protein ligase binding) was associated with gasdermin gene mutations. The KEGG analysis revealed that hsa05150 (Staphylococcus aureus infection) and hsa04610 (Complement and coagulation cascades) were closely related to the gasdermin gene mutations in NSCLC.
The above results indicated that gasdermin gene mutations may play potentially important roles in the clinical treatment of NSCLC patients and in the development of NSCLC models. In particular, GSDMC/D/E, which had the highest mutation rates among the gasdermin family genes and an apparent positive correlation among them, may be important for the prognosis of NSCLC patients. Furthermore, functional enrichment analyses suggest that genes whose expression levels were significantly associated with gasdermin gene mutations may play important roles in inflammatory and immune responses.
Prediction and molecular docking analyses of small molecule drugs that can potentially target gasdermin family proteins
In order to identify novel therapeutic drugs for NSCLC, we used the CMAP database to match gasdermin family proteins with therapeutic small molecules. The top four most significantly enriched small molecule drugs and their corresponding enrichment values and 3D conformations are shown in Table 1 and Fig. 8A respectively. Among these small molecule drugs, cefotiam (enrichment value: -0.826), metanephrine (enrichment value: -0.739), and vorinostat (enrichment value: -0.525) were associated with significant negative fractions. These candidate small molecule drugs have the potential to reverse gene expression profiles in NSCLC, providing a guidance for the development of targeted drugs for NSCLC.
Our molecular docking analysis revealed that the four small molecules mainly interacted with GSDMB via H-bonds and hydrophobic interactions (Fig. 8B). For example, metanephrine can potentially form a H-bonds (2.0A) with the O atom of Asp65 of GSDMB. In addition, metanephrine can potentially form hydrophobic interactions with Trp230, Trp340, Glu153, and Tyr155 of GSDMB.
Table 1: The four most significantly enriched small molecule drugs.
|
Rank
|
CMAP name
|
Mean
|
N
|
Enrichment value
|
P
|
CID
|
1
|
Erastin
|
0.701
|
4
|
0.838
|
0.00107
|
11214940
|
2
|
Vorinostat
|
-0.508
|
12
|
-0.525
|
0.00136
|
5311
|
3
|
Cefotiam
|
-0.701
|
4
|
-0.826
|
0.00177
|
43708
|
4
|
Metanephrine
|
-0.691
|
5
|
-0.739
|
0.00244
|
21100
|