3.1Screening and functional enrichment of DEGs
Differential analysis showed that 4107 genes were upregulated and 2022 genes were downregulated in LUAD tumor tissues compared with normal tissues (Figure 1A) .GO enrichment analysis showed that overlapping DEGs were mainly enriched in the extracellular matrix, extracellular structures, intercellular adhesion, and cAMP-mediated signaling (Figure 1B) .GSEA showed that tumor samples with DEGs were associated with signaling pathways such as Nod-like receptor, Toll-like receptor (Figure 1C).
3.2Construction of co-expression modules
Genes with similar expression patterns in DEGs were clustered and characterized by constructing co-expression modules (Figure 2A), and a total of 26 gene modules were identified (Figure 2B). As seen in Figure 3C, turquoise module members were relatively highly correlated with tumors (coefficient = 0.54, P < 0.001), so the turquoise module was selected as the key module.
3.3Screening and identification of pivotal genes
The scatter plot of MM and GS correlations in the turquoise module (Figure 3A) revealed that DEG, which was significantly associated with Cancer status, was also an important gene member in the turquoise module (cor=0.45, P<0.001). PPI was constructed based on the STRING database, and a total of 187 points and 1287 edges were identified (Figure 3B). The MCC algorithm using CytoHubba plug-in in Cytoscape software was used for pivotal gene screening, in which the top 10 genes with the highest MCC scores were designated as pivotal genes: LDHA, TOP2A, UBE2C, TYMS, TRIP13, EXO1, TTK, TPX2, ZWINT, UHRF1 (Figure 3C). By comparing the gene expression of genes between tumor samples and normal samples, we found that all of these 10 genes were up-regulated in the LUAD tumor samples (Figure 4A). As shown in Figure 4B, the correlation between the expression of LDHA and the other 9 genes was weak.
3.4Prognostic ability and functional analysis of pivotal genes
The best model to predict the prognosis of LUAD patients was determined by univariate COX proportional risk regression analysis, which revealed an association between the expression levels of all 10 pivotal genes and the overall survival of patients (Figure 5A), followed by the identification of three independent prognostic factors (LDHA, TRIP13, TTK) based on multifactorial COX proportional risk regression analysis (Figure 5B).
Meanwhile, we ran cross-validation to construct the LASSO model by analyzing the trajectories of each independent variable (Figure 6A-B). As shown in Figure 6C, patients in the high-risk group had a higher risk of survival, and the ROC confirmed the accuracy of the prognostic risk score prediction (Figure 6D), and the results of the Kaplan-Meier analysis also indicated that low-risk patients had a better prognosis than high-risk patients (Figure 6E).
Finally, we compared the survival of patients with high and low expression of LDHA, TRIP13, and TTK by plotting Kaplan-Meier survival curves, and patients with high expression of these three key genes had a worse prognosis than their respective low expression (Figure 7), indicating that the expression levels of these three key genes were closely related to the prognosis of LUAD patients.GO analysis also showed that these three genes are associated with cancer-related processes such as NAD metabolism, para-cAMP, and are involved in signaling pathways such as pyruvate metabolism and HIF-1 (Figure 8).