Data normalization and FRDEGs analysis
In this study, bioinformatics methods were primarily employed to explore the biological characteristics of sarcopenia, and the flow chart showing the overall analysis process is illustrated in Fig. 1. In the GSE1428 and GSE8479 datasets, the subjects were divided into a sarcopenia group and a normal control group. Two groups of datasets were normalized, annotation probe and other data cleaning operations were performed, and a boxplot was drawn for data distribution before and after standardization (Fig. 2A-B).
Figure 1. Flow chart of overall analysis of sarcopenia biological characteristics explored by bioinformatics methods
Figure 2. Boxplot of GEO dataset before and after correction.
(A). Box diagram of gene expression distribution between samples of GSE1428 dataset before correction. (B). Box diagram of gene expression distribution among samples of GSE1428 dataset after correction. (C). Box diagram of gene expression distribution among samples of GSE8479 dataset before correction. (D). Box diagram of gene expression distribution among samples of GSE8479 dataset after correction.
A total of 619 FRGs were downloaded by searching the keyword "Ferroptosis" in GeneCards database. Moreover, 474 FRGs were downloaded from FerrDb database (http://www.zhounan.org/ferrdb/current/). A total of 883 FRGs were obtained after merging and dereplication.
To analyze the differential gene expression values in the Sarcopenia sample group (Sarcopenia group) in comparison with the Normal sample group (Normal group) in the dataset GSE1428 and GSE8479 respectively, “limma” R package was used to acquire genes that were differentially expressed in the two datasets. The results are expressed as follows: 12548 DEGs were obtained in Dataset GSE1428, 1149 of them meet threshold | logFC | > 0, P-value < 0.05. Under the threshold, there are 603 high expression genes in the disease group (low expression in the normal group, logFC is positive, up-regulated genes), 546 low expression genes in the disease group (high expression and logFC is negative in the normal group). Volcano maps were drawn from the differential analysis results of this dataset (Fig. 3A). Totally, 16742 DEGs were selected in the dataset GSE8479, and 5039 of them meet the threshold | logFC | > 0, P-value < 0.05. Under the threshold, there are 2485 high expression genes in the disease group ((low expression in the normal group, logFC is positive, up-regulated genes), 2554 low expression genes in the disease group (high expression in the normal group, logFC is negative). Volcano map was drawn based on the results of differential analysis of this dataset (Fig. 3B).
As to obtain FRDEGs, all the DEGs selected from the datasets GSE1428 and GSE8479 were intersected (| logFC | > 0, P-value < 0.05). Next, 591 common differentially expressed genes (Co-DEGs) were acquired, and a Venn diagram was generated (Fig. 3C). We also intersected the Co-DEGs and FRGs in the datasets, and obtained a total of 46 FRDEGs, and a Venn diagram (Fig. 3D) was generated. The 46 FRDEGs included DDIT4, FTL, FTH1, HMGB1, HSPB1, CISD1, TP63, CDKN1A, PLIN2, ZFP36, SREBF2, FZD7, BRD3, DECR1, CP, GOT1, EX1, MPC1, ABCC5, USP11, CS, ATG5, MTCH1, SLC38A1, DLD, CIRBP, STOML2, LDHA, MDH2, ANP32B, FKBP3, LPIN1, PGK1, ANXA1, YWHAZ, ACTC1, LRPPRC, MAP4, HNRNPA3, MYL6B, RUFY1, WBP11, FOXO1, PRPF8, RPS27A, MAGED2. In accordance with the results of Venn diagram, the differential expression of FRDEGs among different sample groups in GSE1428 dataset (Fig. 3E) and GSE8479 dataset (Fig. 3F) was analyzed. The heatmaps were generated using “pheatmap” R package to display the analysis results.
Figure 3. Differential expressed gene analysis of GEO datasets.
(A). Volcano map of DEGs for GSE1428 dataset. (B). Volcano map of DEGs for GSE8479 dataset. (C). Venn diagrams of DEGs in GSE1428 and GSE8479 datasets. (D). Venn diagram of co-DEGs and ferroptosis-related genes in the dataset. (E). Complex numerical heat map of differentially expressed genes associated with iron death for the GSE1428 dataset. (F). Complex numerical heat map of differentially expressed genes associated with iron death for the GSE8479 dataset.
Co-DEGs: Common differentially expressed genes.
Functional enrichment analysis underlying FRDEGs
The GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment analyses of FRDEGs were conducted first to investigate the biological processes, molecular functions, cellular components, biological pathways, as well as their relationship with sarcopenia in 46 FRDEGs. The results of GO functional enrichment analysis and KEGG enrichment analysis were presented in bubble chart (Fig. 4A-B) and bar chart (Fig. 4C-D). In addition, the correlation between 46 FRDEGs and GO/KEGG enrichment analysis results was presented as a ring network diagram (Fig. 4E-F).
The results suggest that 46 FRDEGs are significantly enriched in response to steroid hormone, response to glucocorticoid, response to corticosteroid, metabolic process in the BP category, and organelle outer membrane, ubiquitin protein ligase binding in cellular component autophagosome, secondary lysosome in the CC category, and ubiquitin-like protein ligase binding, single-stranded DNA binding, coenzyme binding in the MF category (Table 1). A total of 46 FRDEGs were remarkably enriched in Carbon metabolism, Ferroptosis, Citrate cycle (TCA cycle), Glyoxylate and dicarboxylate metabolism, Pyruvate metabolism, Cysteine and methionine metabolism, Amino Acids, HIF-1 Signaling Pathway, 2-oxocarboxylic Acid metabolism, Glycolysis/Gluconeogenesis, Propanoate metabolism, as well as other pathways (Table 2).
Table 1
GO enrichment analysis results of Ferroptosis-related differentially expressed genes
ONTOLOGY | ID | Description | GeneRatio | BgRatio | pvalue | p.adjust | qvalue | Count |
BP | GO:0048545 | response to steroid hormone | 7/46 | 385/18670 | 4.01818E-05 | 0.006702331 | 0.004754146 | 7 |
BP | GO:0051384 | response to glucocorticoid | 6/46 | 146/18670 | 1.4933E-06 | 0.002280405 | 0.001617553 | 6 |
BP | GO:0031960 | response to corticosteroid | 6/46 | 162/18670 | 2.7343E-06 | 0.002280405 | 0.001617553 | 6 |
BP | GO:0019362 | pyridine nucleotide metabolic process | 6/46 | 189/18670 | 6.64981E-06 | 0.002653115 | 0.001881927 | 6 |
CC | GO:0031968 | organelle outer membrane | 4/46 | 201/19717 | 0.001223747 | 0.061913794 | 0.050806452 | 4 |
CC | GO:0019867 | outer membrane | 4/46 | 203/19717 | 0.001269255 | 0.061913794 | 0.050806452 | 4 |
CC | GO:0005776 | autophagosome | 3/46 | 93/19717 | 0.001331479 | 0.061913794 | 0.050806452 | 3 |
CC | GO:0005767 | secondary lysosome | 2/46 | 14/19717 | 0.000475991 | 0.061913794 | 0.050806452 | 2 |
MF | GO:0031625 | ubiquitin protein ligase binding | 5/45 | 290/17697 | 0.000815692 | 0.022256728 | 0.017295115 | 5 |
MF | GO:0044389 | ubiquitin-like protein ligase binding | 5/45 | 308/17697 | 0.001067763 | 0.025492839 | 0.019809812 | 5 |
MF | GO:0003697 | single-stranded DNA binding | 4/45 | 113/17697 | 0.000191887 | 0.0091626 | 0.007120014 | 4 |
MF | GO:0050662 | coenzyme binding | 4/45 | 291/17697 | 0.006283042 | 0.085718645 | 0.066609694 | 4 |
GO: Gene Ontology;BP༚ biological process, BP༛ CC༚cellular component༛MF༚molecular function. |
Table 2
KEGG enrichment analysis results of Ferroptosis-related differentially expressed genes
ONTOLOGY | ID | Description | GeneRatio | BgRatio | pvalue | p.adjust | qvalue | Count |
KEGG | hsa01200 | Carbon metabolism | 5/30 | 118/8076 | 6.50825E-05 | 0.003514458 | 0.002945842 | 5 |
KEGG | hsa04216 | Ferroptosis | 4/30 | 41/8076 | 1.42431E-05 | 0.001538251 | 0.001289373 | 4 |
KEGG | hsa00020 | Citrate cycle (TCA cycle) | 3/30 | 30/8076 | 0.000175531 | 0.004739324 | 0.003972533 | 3 |
KEGG | hsa00630 | Glyoxylate and dicarboxylate metabolism | 3/30 | 30/8076 | 0.000175531 | 0.004739324 | 0.003972533 | 3 |
KEGG | hsa00620 | Pyruvate metabolism | 3/30 | 39/8076 | 0.000386305 | 0.008344188 | 0.006994153 | 3 |
KEGG | hsa00270 | Cysteine and methionine metabolism | 3/30 | 50/8076 | 0.000805981 | 0.014507664 | 0.01216042 | 3 |
KEGG | hsa00010 | Glycolysis / Gluconeogenesis | 3/30 | 67/8076 | 0.001887912 | 0.029127783 | 0.0244151 | 3 |
KEGG | hsa01230 | Biosynthesis of amino acids | 3/30 | 75/8076 | 0.002608454 | 0.031301446 | 0.026237079 | 3 |
KEGG | hsa04066 | HIF-1 signaling pathway | 3/30 | 109/8076 | 0.007450151 | 0.073146932 | 0.061312244 | 3 |
KEGG | hsa01210 | 2-Oxocarboxylic acid metabolism | 2/30 | 19/8076 | 0.002193384 | 0.029610689 | 0.024819876 | 2 |
KEGG | hsa00640 | Propanoate metabolism | 2/30 | 34/8076 | 0.006951451 | 0.073146932 | 0.061312244 | 2 |
KEGG: Kyoto Encyclopedia of Genes and Genomes. |
Figure 4. Ferroptosis-related DEGs functional enrichment analysis (GO) and pathway enrichment (KEGG) analysis.
(A-B) GO functional enrichment analysis of Ferroptosis-related DEGs (A) and KEGG pathway enrichment analysis (B) results Bubble diagram presentation. (C-D). Results of GO functional enrichment analysis (C) and KEGG pathway enrichment analysis (D) of Ferroptosis-related DEGs are shown in bar charts. (E-F). Annular network diagram of GO functional enrichment analysis (E) and KEGG pathway enrichment analysis (F) results of Ferroptosis-related DEGs.
Ferroptosis-related DEGs: Ferroptosis-related genes differentially expressed genes; GO: Gene Ontology; BP: Biological process; CC: Cellular component; MF: Molecular function; KEGG: Kyoto Encyclopedia of Genes and Genomes
GSEA enrichment analysis of the sarcopenia datasets
To determine differentially enriched pathways between sarcopenia and non-sarcopenia patients, the GSEA (Gene Set Enrichment Analysis) of gene expression profiles was performed. The results showed that in dataset GSE1428 (Table 3), the molecular pathways were significantly associated with oxidative phosphorylation (Fig. 5E) endoplasmic reticulum stress response in coronavirus infection (Fig. 5C), oxidative phosphorylation (Fig. 5B), glycolysis and gluconeogenesis (Fig. 5D) and other pathways (Fig. 5A-E). In dataset GSE8479 (Table 4), they were significantly enriched in reactome chaperone mediated autophagy (Fig. 5H), apoptosis-related network due to altered notch3 in ovarian cancer (Fig. 5I), mechano-regulation and pathology of YAPTAZ on the basis of hippo and non-hippo mechanisms (Fig. 5J), glycolysis gluconeogenesis (Fig. 5G) and other pathways (Fig. 5F-J).
Table 3
ID | setSize | enrichmentScore | NES | pvalue | p.adjust | qvalues |
NABA_CORE_MATRISOME | 192 | 0.359224489 | 1.591219408 | 0.001858736 | 0.168392046 | 0.16549529 |
PID_TAP63_PATHWAY | 50 | 0.559261037 | 2.013902319 | 0.001876173 | 0.168392046 | 0.16549529 |
WP_MRNA_PROCESSING | 112 | 0.409850575 | 1.696861139 | 0.001876173 | 0.168392046 | 0.16549529 |
PID_CMYB_PATHWAY | 80 | 0.425781838 | 1.659216402 | 0.001886792 | 0.168392046 | 0.16549529 |
WP_ADIPOGENESIS | 125 | 0.423679599 | 1.770682554 | 0.001908397 | 0.168392046 | 0.16549529 |
PID_P73PATHWAY | 74 | 0.492236343 | 1.886005776 | 0.001912046 | 0.168392046 | 0.16549529 |
PID_P53_DOWNSTREAM_PATHWAY | 120 | 0.382632438 | 1.593149301 | 0.001919386 | 0.168392046 | 0.16549529 |
PID_MYC_REPRESS_PATHWAY | 58 | 0.497125351 | 1.833777637 | 0.001949318 | 0.168392046 | 0.16549529 |
REACTOME_DISEASES_ASSOCIATED_ WITH_GLYCOSAMINOGLYCAN_METABOLISM | 35 | 0.581885739 | 1.932458778 | 0.001949318 | 0.168392046 | 0.16549529 |
REACTOME_PYRUVATE_METABOLISM | 28 | -0.668264848 | -2.112287759 | 0.001956947 | 0.168392046 | 0.16549529 |
KEGG_CITRATE_CYCLE_TCA_CYCLE | 30 | -0.623445311 | -2.003716563 | 0.001996008 | 0.168392046 | 0.16549529 |
WP_OXIDATIVE_PHOSPHORYLATION | 30 | -0.535266811 | -1.720316051 | 0.001996008 | 0.168392046 | 0.16549529 |
WP_ENDOPLASMIC_RETICULUM_STRESS_ RESPONSE_IN_CORONAVIRUS_INFECTION | 34 | -0.501903728 | -1.664692773 | 0.00204918 | 0.168392046 | 0.16549529 |
KEGG_OXIDATIVE_PHOSPHORYLATION | 82 | -0.548571102 | -2.184397656 | 0.002145923 | 0.168392046 | 0.16549529 |
WP_GLYCOLYSIS_AND_GLUCONEOGENESIS | 45 | -0.534061743 | -1.902402409 | 0.002150538 | 0.168392046 | 0.16549529 |
GSEA: Gene Set Enrichment Analysis. |
Table 4
ID | setSize | enrichmentScore | NES | pvalue | p.adjust | qvalues |
KEGG_SYSTEMIC_LUPUS_ERYTHEMATOSUS | 57 | 0.717866394 | 2.230238825 | 0.001865672 | 0.04807219 | 0.040628643 |
REACTOME_COMPLEMENT_CASCADE | 56 | 0.656205379 | 2.032523115 | 0.001879699 | 0.04807219 | 0.040628643 |
WP_PRIMARY_FOCAL_SEGMENTAL_ GLOMERULOSCLEROSIS_FSGS | 70 | 0.635980629 | 2.024884952 | 0.001879699 | 0.04807219 | 0.040628643 |
REACTOME_INTERLEUKIN_4_AND_ INTERLEUKIN_13_SIGNALING | 108 | 0.594235253 | 2.016724784 | 0.001808318 | 0.04807219 | 0.040628643 |
REACTOME_SMOOTH_MUSCLE_CONTRACTION | 34 | 0.72233364 | 2.006980347 | 0.001949318 | 0.04807219 | 0.040628643 |
WP_COMPLEMENT_ACTIVATION | 22 | 0.772084612 | 1.975263157 | 0.001941748 | 0.04807219 | 0.040628643 |
NABA_ECM_GLYCOPROTEINS | 175 | 0.546853662 | 1.962865961 | 0.00173913 | 0.04807219 | 0.040628643 |
REACTOME_CHAPERONE_MEDIATED_AUTOPHAGY | 22 | 0.764534312 | 1.955946841 | 0.001941748 | 0.04807219 | 0.040628643 |
PID_NOTCH_PATHWAY | 59 | 0.626456076 | 1.954493131 | 0.001872659 | 0.04807219 | 0.040628643 |
REACTOME_RESPONSE_TO_METAL_IONS | 14 | 0.854295521 | 1.943158541 | 0.001992032 | 0.04807219 | 0.040628643 |
BIOCARTA_CARDIACEGF_PATHWAY | 18 | 0.79285177 | 1.941356478 | 0.001934236 | 0.04807219 | 0.040628643 |
BIOCARTA_COMP_PATHWAY | 19 | 0.782625403 | 1.937007068 | 0.001934236 | 0.04807219 | 0.040628643 |
WP_APOPTOSISRELATED_NETWORK_ DUE_TO_ALTERED_NOTCH3_IN_OVARIAN_CANCER | 53 | 0.573101748 | 1.75137624 | 0.001890359 | 0.04807219 | 0.040628643 |
WP_MECHANOREGULATION_AND_PATHOLOGY_OF_ YAPTAZ_VIA_HIPPO_AND_NONHIPPO_MECHANISMS | 46 | 0.539252571 | 1.600002309 | 0.005671078 | 0.078188934 | 0.066082079 |
KEGG_GLYCOLYSIS_GLUCONEOGENESIS | 61 | -0.470240257 | -1.490154504 | 0.027542373 | 0.184983731 | 0.15634066 |
GSEA: Gene Set Enrichment Analysis. |
Figure 5. GSEA enrichment analysis of sarcopenia datasets.
(A). Four main biological characteristics of GSEA enrichment analysis for GSE1428 dataset. (B-E). Genes significantly enriched in WP_OXIDATIVE_PHOSPHORYLATION in GSE1428 dataset (Fig. 5E). WP_ENDOPLASMIC_RETICULUM_STRESS_RESPONSE_IN_CORONAVIRUS_INFECTION (Fig. 5C), KEGG_OXIDATIVE_PHOSPHORYLATION (Fig. 5B), WP_GLYCOLYSIS_AND_GLUCONEOGENESIS (Fig. 5D) pathway. (F). GSEA analysis of GSE8479 dataset showed four main biological characteristics. (G-J). Genes in GSE8479 dataset were significantly enriched in REACTOME_CHAPERONE_MEDIATED_AUTOPHAGY (Fig. 5H). "Wp_sisrelated_network_due_to_altered_notch3_in_ovarian_cancer" (Fig. 5I). WP_MECHANOREGULATION_AND_PATHOLOGY_OF_YAPTAZ_VIA_HIPPO_AND_NONHIPPO_MECHANISMS (Fig. 5J), KEGG_GLYCOLYSIS_GLUCONEOGENESIS (Fig. 5G) and other pathways (Fig. 5F-J).
GSEA: Gene Set Enrichment Analysis.
Protein-protein interaction network
A total of 46 FRDEGs (DDIT4, FTL, FTH1, HMGB1, HSPB1, CISD1, TP63, CDKN1A, PLIN2, ZFP36, SREBF2, FZD7, BRD3, DECR1, CP, GOT1, BEX1, MPC1, ABCC5, USP11, CS, ATG5, MTCH1, SLC38A1, DLD, CIRBP, STOML2, LDHA, MDH2, ANP32B, FKBP3, LPIN1, PGK1, ANXA1, YWHAZ, ACTC1, LRPPRC, MAP4, HNRNPA3, MYL6B, RUFY1, WBP11, FOXO1, PRPF8, RPS27A, MAGED2) were analyzed for protein-protein interaction using STRING database. The minimum required interaction score > 0.400 (medium confidence 0.400) was selected as the standard, constructing 46 FRDEGs of PPI network, using Cytoscape software to visualize the protein interaction relationship. The result indicates that only 31 FRDEGs in the PPI network are associated with other genes. Subsequently, the PPI network diagram (Fig. 6B) was plotted according to the degree scores of the above 31 FRDEGs in the PPI network. Next, cytoHubba plugin was adopted to analyze five algorithms (including MCC, MNC, EPC, Degree and Closeness). As a result, 11 common genes of the top 15 genes were selected as the hub genes (mRNA) (Fig. 6B-C).
Figure 6. Protein-protein Interaction Network (PPI Network).
(A). PPI network of Ferroptosis-related DEGs under Degree algorithm. (B). Common gene of the first 15 Ferroptosis-related DEGs under five algorithms including MCC, MNC, EPC, Degree, and Closeness selected by Wayne diagram. (C). PPI Network of 11 Hub genes in Degree algorithm.
Ferroptosis-related DEGs: Ferroptosis-related genes differentially expressed genes; PPI network: protein-protein interaction; Degree: Degree Correlation; MNC: Maximum Neighborhood Component; MCC: Maximal Clique Centrality; EPC: Edge Percolated Component; And you may say you may not so-called Centrality. The higher the Degree score, the larger the node area, the darker the color, and the more interactions there are.
Hub genes-miRNA interaction network; hub genes-transcription factor interaction network; hub genes-drug interaction network
The mRNA-miRNA data in ENCORI database and miRDB database were used to predict the correlation between 11 hub genes (mRNA) (LDHA, CS, MDH2, PGK1, DLD, YWHAZ, RPS27A, FOXO1, PLIN2, CDKN1A, HSPB1), and then the intersection part of the results from the two databases was selected to draw the mRNA-miRNA interaction network by Cytoscape software for visualization (FIG. 7A). According to the mRNA-miRNA interaction network, our mRNA-miRNA interaction network consists of 11 hub genes (mRNA) (LDHA, CS, MDH2, PGK1, DLD, YWHAZ, RPS27A, FOXO1, PLIN2, CDKN1A, HSPB1), including 189 miRNA molecules, composing 428 mRNA-miRNA interaction pairs.
We searched the CHIPBase database (Version 2.0) and hTFtarget database for transcription factors (TFS) that is binding to the hub genes. After downloading the interaction relationship found in the two databases, we took the intersection with 11 hub genes (mRNA), and finally obtained 11 hub genes (CCNB1, CEP55, KIF20A, MELK, MKI67, NCAPG, NUF2, RRM2, TTK, UBE2C, UHRF1) and 48 TFS were visualized by Cytoscape software (FIG.7B). In the mRNA-TF interaction network, hub gene YWHAZ has the most interaction relationships with TF, and YWHAZ has 40 pairs of mRNA-TF interaction relationships.
The DGIdb database was used to identify potential drugs or molecular compounds for 11 hub genes. As shown in the mRNA-Drugs interaction network (FIG.7C), we identified 56 potential drugs or molecular compounds corresponding to six hub genes (LDHA, MDH2, PGK1, FOXO1, CDKN1A, HSPB1). Among them, the drugs or molecular compounds that interact with hub gene HSPB1 are the most.
Figure 7. Hub Genes-miRNA interaction network; Hub genes-transcription factor interaction network; Hub Genes-Drug interaction network.
(A). Hub Genes-miRNA interaction network. (B). Hub genes-transcription factor interaction network. (C). Hub genes-drug interaction network. The green oval is mRNA; Yellow hexagon is miRNA; The pink quadrilateral represents transcription factors; The blue triangle is Drug.
PPI network: protein-protein interaction network; TF: Transcription factors.
Expression analysis and ROC verification of hub genes in two datasets
To further explore the differential expression of FRGs in the sarcopenia datasets, the correlation between expression levels and different groups of 11 hub genes (LDHA, CS, MDH2, PGK1, DLD, YWHAZ, RPS27A, FOXO1, PLIN2, CDKN1A, HSPB1) screened by PPI network in GSE1428 dataset (Fig. 8A) and GSE8479 dataset (Fig. 9A) was analyzed. Thus, ROC verification was performed. As depicted in Fig. 8, 11 hub genes were statistically significant among different groups in the GSE1428 dataset (Fig. 8A). Thereinto, the expression levels of hub genes FOXO1, PLIN2, RPS27A and YWHAZ among different groups were statistically significant (P < 0.05). The expression levels of hub genes CDKN1A, CS, DLD, HSPB1, LDHA, MDH2 and PGK1 among different groups were highly statistically significant (P < 0.01). Figure 8B-K demonstrated the ROC verification of hub genes in GSE1428 dataset. Hub genes CDKN1A (AUC = 0.892, Fig. 8B), CS (AUC = 0.883, Fig. 8C), DLD (AUC = 0.846, Fig. 8D), FOXO1 (AUC = 0.783, Fig. 8E), HSPB1 (AUC = 0.867, Fig. 8F), LDHA (AUC = 0.842, Fig. 8G), MDH2 (AUC = 0.854, Fig. 8H), PGK1 (AUC = 0.867, Fig. 8I), RPS27A (AUC = 0.812, Fig. 8J), YWHAZ (AUC = 0.783, Fig. 8K). The expression level in the GSE1428 dataset showed a certain correlation with the Normal sample group (Normal) and Sarcopenia sample group (Sarcopenia).
In the GSE8479 dataset (Fig. 9A), the expression levels of 10 hub genes (LDHA, CS, MDH2, PGK1, DLD, YWHAZ, RPS27A, FOXO1, CDKN1A, HSPB1) were statistically significant among different groups. Thereinto, the expression levels of hub genes LDHA, CS, MDH2, PGK1, DLD, YWHAZ, FOXO1 and CDKN1A in different groups were highly statistically significant (P < 0.001). The expression levels of HSPB1 and RPS27A in different groups were statistically significant (P < 0.05). Figure 9B-K shows the ROC verification of hub genes in the GSE8479 dataset. As depicted in the figure, the expression level of hub genes CDKN1A (AUC = 0.954, Fig. 9B), DLD (AUC = 0.915, Fig. 9D) and FOXO1 (AUC = 0.928, Fig. 9E) in the GSE8479 dataset are significantly correlated with the Normal sample group (Normal) and Sarcopenia sample group (Sarcopenia). The expression level of hub genes CS (AUC = 0.875, Fig. 9C), HSPB1 (AUC = 0.704, Fig. 9F), LDHA (AUC = 0.858, Fig. 9G), MDH2 (AUC = 0.859, Fig. 9H), PGK1 (AUC = 0.856, Fig. 9I) and YWHAZ (AUC = 0.852, Fig. 9K) in the GSE8479 dataset showed a certain correlation with Normal sample group (Normal) and Sarcopenia sample group (Sarcopenia). The expression level of hub gene RPS27A (AUC = 0.690, Fig. 9J) in the GSE8479 dataset showed low correlation with both Normal sample group (Normal) and Sarcopenia sample group (Sarcopenia).
Figure 8. Differential analysis of Hub genes expression in dataset GSE1428.
(A). Group comparison of the results of differential analysis of hub genes expression in GSE1428 dataset. (B-K). ROC curves of hub genes CDKN1A (B), CS (C), DLD (D), FOXO1 (E), HSPB1 (F), LDHA (G), MDH2 (H), PGK1 (I), RPS27A (J) and YWHAZ (K) in GSE1428 dataset.
AUC: Area Under Curve. The symbol NS is equal to P ≥ 0.05, which is not statistically significant. The symbol * is equivalent to P < 0.05, which is statistically significant; The symbol ** is equivalent to P < 0.01, which is highly statistically significant; The symbol *** is equivalent to P < 0.001, which is highly statistically significant.
Figure 9. Differential analysis of Hub genes expression in dataset GSE8479.
(A). Group comparison diagram of the results of differential analysis of hub genes expression in GSE8479 data set. (B-K). ROC curves of hub genes CDKN1A (B), CS (C), DLD (D), FOXO1 (E), HSPB1 (F), LDHA (G), MDH2 (H), PGK1 (I), RPS27A (J) and YWHAZ (K) in GSE8479 dataset.
AUC: Area Under Curve. The symbol NS is equal to P ≥ 0.05, which is not statistically significant. The symbol * is equivalent to P < 0.05, which is statistically significant; The symbol ** is equivalent to P < 0.01, which is highly statistically significant; The symbol *** is equivalent to P < 0.001, which is highly statistically significant.
Immune infiltration of hub genes
After disposing the expression profile data of sarcopenia dataset GSE1428 and GSE8479, ssGSEA algorithm was applied to count the infiltration of 29 kinds of immune cells. We plotted the infiltration of immune cells in the two datasets in the form of bar chart (Fig. 10A-B).
Afterwards, we calculated the correlation between the expression level of 11 hub genes (LDHA, CS, MDH2, PGK1, DLD, YWHAZ, RPS27A, FOXO1, PLIN2, CDKN1A) and the abundance of immune cell infiltration in dataset GSE1428 (Fig. 11A) and GSE8479 (Fig. 11I). For the combination of immune cells and hub genes with statistically significant positive and negative correlations, scatter plots of correlation between the expression level of hub genes and the abundance of immune cell infiltration in datasets GSE1482 and GSE8479 were drawn. As depicted in the figure, a moderate negative correlation was identified between the infiltration degree of DCs of immune cells and the expression level of hub gene DLD (Fig. 11B, r= -0.668, P < 0.001) in dataset GSE1482. A weak negative correlation was identified between the degree of DCs infiltration and the expression level of hub gene LDHA (Fig. 11C, r= -0.443, P-value = 0.039). A weak negative correlation was identified between the invasion degree of Para inflammation and the expression level of hub gene LDHA (Fig. 11D, r= -0.485, P-value = 0.022). A weak positive correlation was identified between the degree of Treg infiltration of immune cells and the expression level of hub gene LDHA (Fig. 11E, r = 0.451, P-value = 0.035), and a moderate negative correlation between the degree of DCs infiltration of immune cells and the expression level of hub gene PGK1 (Fig. 11F, r = -0.514, P-value = 0.015), a moderate negative correlation was identified between the infiltration degree of immune cells pDCs and the expression level of hub gene YWHAZ (Fig. 11G, r= -0.660, P-value = 0.001). A weak negative correlation was identified between the infiltration degree of immune cells Th1 cells and the expression level of hub gene YWHAZ (Fig. 11H, r= -0.459, P-value = 0.033).
In GSE8479 dataset, a moderate negative correlation was identified between the degree of DCs infiltration and the expression level of hub gene DLD (Fig. 11J, r= -0.622, P-value < 0.001). A weak negative correlation was identified between the degree of DCs infiltration and the expression level of hub gene LDHA (Fig. 11K, r= -0.469, P-value < 0.001). A weak negative correlation was identified between the invasion degree of Para inflammation and the expression level of hub gene LDHA (Fig. 11L, r= -0.318, P-value = 0.023). A weak positive correlation was identified between the degree of Treg infiltration of immune cells and the expression level of hub gene LDHA (Fig. 11M, r = 0.307, P-value = 0.028), and a weak negative correlation was identified between the degree of DCs infiltration of immune cells and the expression level of hub gene PGK1 (Fig. 11N, r = -0.405, P-value = 0.003). a weak negative correlation was identified between the infiltration degree of immune cells pDCs and the expression level of hub gene YWHAZ (Fig. 11O, r= -0.390, P-value = 0.005). A weak negative correlation was identified between the infiltration degree of immune cells Th1 cells and the expression level of hub gene YWHAZ (Fig. 11P, r= -0.379, P-value = 0.006).
Figure 10. Immune infiltration analysis in sarcopenia dataset GSE1428 and GSE8479.
(A). Histogram of immune infiltration results of immune cells in dataset GSE1428. (B). Histogram of immune infiltration results of immune cells in dataset GSE8479.
Figure 11. Correlation analysis between the expression of hub genes and the degree of immune cell infiltration.
(A). Results of correlation analysis between hub genes and the abundance of 29 types of immune cell infiltration in dataset GSE1428. (B-H). Results of correlation analysis between the infiltration degree of immune cell DCs and the expression level of hub gene DLD in dataset GSE1428 (B), results of correlation analysis between the infiltration degree of immune cell DCs and the expression level of Hub gene LDHA (C), results of correlation analysis on the degree of infiltration of immune cells Para inflammation and the expression level of hub gene LDHA (D), and results of correlation analysis on the degree of infiltration of immune cells Treg and the expression level of hub gene LDHA (E). Results of correlation analysis between the infiltration degree of immune cells DCs and the expression level of hub gene PGK1 (F). Results of correlation analysis between the infiltration degree of immune cells pDCs and the expression water of hub gene YWHAZ (G). Correlation analysis between the infiltration degree of immune cells Th1 cells and the expression level of hub gene YWHAZ (H). I. Results of correlation analysis between hub genes and the abundance of 29 types of immune cell infiltration in dataset GSE8479. (J-P). Results of correlation analysis between the infiltration degree of immune cell DCs and the expression level of hub gene DLD in dataset GSE8479 (J), and results of correlation analysis between the infiltration degree of immune cell DCs and the expression level of Hub gene LDHA (K). Results of correlation analysis on the degree of infiltration of immune cells Para inflammation and the expression level of hub gene LDHA (L), and results of correlation analysis on the degree of infiltration of immune cells Treg and the expression level of hub gene LDHA (M). Results of correlation analysis between the infiltration degree of DCs and the expression level of hub gene PGK1 (N). Results of correlation analysis between the infiltration degree of immune cells pDCs and the expression water of hub gene YWHAZ (O). Correlation analysis between the infiltration degree of immune cells Th1 and the expression level of hub gene YWHAZ (P).
The symbol NS is equal to P ≥ 0.05, which is not statistically significant. The symbol * is equivalent to P < 0.05, which is statistically significant; The symbol ** is equivalent to P < 0.01, which is highly statistically significant; The symbol *** is equivalent to P < 0.001, which is highly statistically significant. The correlation coefficient R is positive, indicating that the two variables may be positively correlated. The correlation coefficient is negative, indicating that there may be a negative correlation between the two variables. The absolute value of correlation coefficient above 0.8 is a strong correlation, and the absolute value between 0.5 and 0.8 is a moderate correlation. The absolute value between 0.3 and 0.5 indicates a general degree of correlation; An absolute value below 0.3 is weak or uncorrelated.