A total of 625 DEGs were identified, of which 314 were up-regulated and 311 were down-regulated (logFC absolute value > 1, adj. P.Val < 0.05). And accordingly, the volcano map of up-and down-regulated genes, as well as the heat map of differential genes by R software, are shown in Figure 2-a and 2-b. PCA showed no overlap between these two groups and indicated a significant difference in gene expression matrix between OS group and the healthy control group (p<0.05), Figure 2-c.
WGCNA weighted gene co-expression network analysis
According to the gene expression matrix, 32 modules were identified in the normal group or tumor group by R software, respectively. The result of the module with the highest absolute value was shown as MEgrey60, and there were 594 genes in MEgrey60 module, as shown in Figure 3-a、3-b、3-c.
Differential gene and WGCNA identification module take the intersection
The intersection of DEG and MEgrey60 was taken by R software to obtain a total of 247 candidate differential genes, which were saved in TXT files and plotted in a Wayne diagram, as in Figure 4.
GO-term and KEGG pathway enrichment analysis of candidate differential genes
GO-term and KEGG pathway enrichment analyses of candidate differential genes with adjusted p-values <0.05 were obtained, respectively. Figure 5-a, Figure 5-b, and Table II show the KEGG enrichment results of the candidate differential genes. Figure 5-c, Figure 5-d, and Table III show the results of GO enrichment analysis of candidate differential genes. the KEGG pathway analysis showed that these candidate differential genes were mainly involved in adhesive spots, Rap1, PI3K-Akt, MAPK, and insulin resistance, while the GO pathway analysis was mainly involved in cellular processes such as osteogenesis, epithelial cell proliferation, osteoblast differentiation, and growth factors, fibronectin-binding and other molecular functions. In addition, adherent spots, insulin resistance, Rap1, PI3K-Akt, and MAPK pathways were relatively significantly enriched on KEGG, so the specific pathways are shown in Figure 6, Figure 7, and Figure 8. these significantly enriched pathways may enlighten our thinking and help us to further investigate the role of DEGs in OS.
Table 2 KEGG pathway of candidate DEGs
ID
|
Description
|
p-value
|
geneID
|
Count
|
hsa04510
|
Focal adhesion
|
4.97E-05
|
rapgef1/thbs1/vegfb/spp1/vegfc/
itga7/parva/egfr/ppp1cb/pxn/rapgef1/vav2
|
12
|
hsa04931
|
Insulin resistance
|
0.000221
|
tbc1d4/il6/rps6ka1/trib3/pygb/ppp1cb/ppp1r3c/pck2
|
8
|
hsa04015
|
Rap1 signaling pathway
|
0.000328
|
RAPGEF4/FLT1/EFNA1/THBS1/VEGFB/FGFR3/
VEGFC/EGFR/FGF7/RAPGEF1/VAV2
|
11
|
hsa04010
|
MAPK signaling pathway
|
0.000489
|
flt1/map3k12/efna1/mef2c/vegfb/fgfr3/vegfc/map4k3
/rps6ka1/egfr/ntrk2/fgf7/ntf3
|
13
|
hsa04151
|
PI3K-Akt signaling pathway
|
0.000897
|
flt1/efna1/thbs1/vegfb/spp1
fgfr3/vegfc/il6/itga7/egfr/ntrk2/fgf7/pck2/ntf3
|
14
|
hsa04014
|
Ras signaling pathway
|
0.008546
|
flt1/efna1/vegfb/fgfr3/vegfc/egfr/ntrk2/fgf7/ntf3
|
9
|
hsa04142
|
Lysosome
|
0.014937
|
ppt1/npc1/ctsl/ctsa/sgsh/ctsb
|
6
|
hsa04810
|
Regulation of actin cytoskeleton
|
0.017551
|
IQGAP2/FGFR3/ITGA7/EGFR/PPP1CB
PXN/FGF7/VAV2
|
8
|
hsa00730
|
Thiamine metabolism
|
0.021043
|
ALPL/AK1
|
2
|
hsa05219
|
Bladder cancer
|
0.023855
|
THBS1/FGFR3/EGFR
|
3
|
hsa05202
|
Transcriptional misregulation in cancer
|
0.026486
|
flt1/igfbp3/mef2c/il6/jmjd1c/bcl11b
MMP3
|
7
|
hsa04020
|
Calcium signaling pathway
|
0.029179
|
flt1/vegfb/drd1/fgfr3/vegfc/egfr
NTRK2/FGF7
|
8
|
hsa05205
|
Proteoglycans in cancer
|
0.035992
|
smo/thbs1/ctsl/egfr/ppp1cb/pxn/vav2
|
7
|
hsa05144
|
Malaria
|
0.039787
|
HBB/THBS1/IL6
|
3
|
hsa04964
|
Proximal tubule bicarbonate reclamation
|
0.046882
|
CA2/PCK2
|
2
|
Table 3 GO analysis of candidate DEGs
ONTOLOGY
|
ID
|
Description
|
p.adjust
|
geneID
|
Count
|
BP
|
GO:0001503
|
ossification
|
0.000108
|
SNX10/SEMA4D/ALPL/PENK/SMO/IGFBP3/IGFBP5/FHL2/MEF2C/PDLIM7/
pth1r/spp1/fgfr3/vegfc/il6/col13a1/egfr/smad3/npr2/suco
|
20
|
BP
|
GO:0001649
|
osteoblast differentiation
|
0.000108
|
SEMA4D/ALPL/PENK/SMO/IGFBP3/IGFBP5/FHL2/MEF2C/PDLIM7
/PTH1R/SPP1/VEGFC/IL6/SMAD3/SUCO
|
15
|
BP
|
GO:0050673
|
epithelial cell proliferation
|
0.000198
|
fst/flt1/odam/smo/igfbp3/igfbp5/mef2c/thbs1/vegfb/vegfc/
IL6/CDH13/RPS6KA1/LOXL2/EGFR/SMAD3/BCL11B/FGF7/CLDN1/MMP12
|
20
|
BP
|
GO:0032963
|
collagen metabolic process
|
0.00416
|
CTSL/ADAMTS2/IL6/COL13A1/TRAM2/MMP3/SUCO/CTSB/MMP12
|
9
|
BP
|
GO:0048771
|
tissue remodeling
|
0.00416
|
SNX10/SEMA3C/IGFBP5/MEF2C/PTH1R/SPP1/CA2/IL6/ADAM8/EGFR/SUCO
|
11
|
BP
|
GO:0045453
|
bone resorption
|
0.00416
|
SNX10/PTH1R/SPP1/CA2/IL6/ADAM8/EGFR
|
7
|
BP
|
GO:0050927
|
positive regulation of positive chemotaxis
|
0.00416
|
VEGFB/VEGFC/CDH13/SMAD3/NTF3
|
5
|
BP
|
GO:0046849
|
bone remodeling
|
0.00416
|
SNX10/PTH1R/SPP1/CA2/IL6/ADAM8/EGFR/SUCO
|
8
|
BP
|
GO:0050926
|
regulation of positive chemotaxis
|
0.00419
|
VEGFB/VEGFC/CDH13/SMAD3/NTF3
|
5
|
BP
|
GO:0050920
|
regulation of chemotaxis
|
0.00419
|
sema4d/sema3e/gpr183/sema3c/thbs1/vegfb/
VEGFC/IL6/DPP4/CDH13/SMAD3/NTF3
|
12
|
MF
|
GO:0001968
|
fibronectin binding
|
0.000409
|
igfbp6/igfbp3/igfbp5/thbs1/ctsl/fbln1
|
6
|
MF
|
GO:0019838
|
growth factor binding
|
0.009811
|
flt1/igfbp6/igfbp3/igfbp5/thbs1/fgfr3/egfr/ntrk2/il11ra
|
9
|
MF
|
GO:0045499
|
chemorepellent activity
|
0.047405
|
SEMA4D/SEMA3E/SEMA3C/DPP4
|
4
|
Analysis of candidate differential genes in OS using PPI networks
The candidate DEG expression products in OS were constructed by STRING database to construct the PPI network. After removing some of the isolated and disconnected nodes, the lowest interaction score of 0.9 was chosen to construct an integrated network as shown in Figure 9-b. The 50 most important genes showing statistically significant interactions were ADAMTS2, THBS1,AOX1,MOCOS,CHI3L1,IL13RA2,CNDP2,GATM,COX7B,NDUFA4,CTSB,CTSL,DDX18,POLR1B,EFNA1,
VAV2,EGFR,PXN,SH3BGRL3,FHL2,IGFBP3,ELL,HMGN1,ENTPD1,NT5E,FGF7,FGFR3,FLT1,VEGFC,VEGFB,HEY1,HEY2,IGFBP5,
SPP1,IL6,MMP3,LOXL1,LOXL2,MEF2C,SMAD3,NRIP1,TBC1D4,NTF3,NTRK2,OPTN,SQSTM1,PARVA,PPP1CB,PPP1R3C,SH3BP2.among these genes,EGFR,FLT1,VAV2,and PXN had the highest nodality. As a result, the core gene network map was obtained by CYTOSCAPE software based on the PPI network interactions and the corresponding node files, as shown in Figure 9-c. Finally, the core ranking histogram of key genes was obtained from the above files, as shown in Figure 9-a for the top 23 genes. hub gene survival curves, except for FGF7, were not statistically different, as shown in Figure 10. hub gene expression, clinical correlation, as shown in Figures 11, 12, and 13. EGFR expression analysis in pan-cancer showed a significant difference in BRCA, CSC, CBM , PCPG differences were more pronounced, but no significant differences were seen in SARC. In the clinical correlation analysis, group comparisons with statistical differences existed only in the gender comparison, 81-100 years old group compared with 41-60 years old group and 61-80 years old group, respectively.FGF7 expression analysis in pan-cancer showed greater differences in BLCA, CESC, UCEC.FGF7 expression and clinical correlation analysis showed that in tumor tissues, TP53 mutated The analysis of FLT1 expression in pan-cancer showed a significant difference in CESC, KIRC, but not in SARC. the analysis of FGF7 expression and clinical correlation showed a significant difference in tumor tissues when comparing gender, and a statistically significant difference when comparing TP53 mutated group with the unmutated group. hub gene promoter methylation and clinical correlation, as shown in Figures 14, 15, and 16. analysis of promoter methylation levels of EGFR in SARC showed a statistically significant difference in the 61- to 80-year-old group compared with the normal group. 21- to 40-year-old group compared with the 61- to 80-year-old group. analysis of promoter methylation levels of FGF7 in SARC showed a statistically significant difference in the 21- to 40-year-old group compared with the 41- to 60-year-old group, respectively. Analysis of promoter methylation levels of FGF7 in SARC showed statistically significant differences in the 21-40 years group compared to the 41-60 years group, 61-80 years group, and 81-100 years group, respectively. analysis of promoter methylation levels of FLT1 in SARC showed significant differences in the normal group compared to the tumor group. Similarly, differences were shown in the age grouping.
Immuno-infiltration analysis
Figure 17-b Results show a positive correlation between regulatory T cells and naive CD4+ cells (value = 0.80). Neutrophils have a positive correlation with naive CD4+ T cells (value = 0.65). Regulatory T cells and γδ T cells had a positive correlation (value=0.74). Neutrophils had a positive correlation with regulatory T cells (value=0.72). Neutrophils had a positive correlation with γδ T cells (value=0.72). And naive B cells had a significant negative correlation with B cell memory cells (value = - 0.64). bar chart of 20 samples summarizes the relative percentages of 22 immune cells as shown in Figure 17-a. and Figure 18-a heatmap demonstrates the visualization of each immune infiltrating cell within each sample. The violin map of each immune infiltrating cell compared to normal tissue (Figure 18-b) shows that in OS tissue, infiltration of M2 macrophages, unactivated dendritic cells, and neutrophils is statistically greater, while infiltration of naive B cells and unactivated CD4+ memory T cells are statistically less. The correlation analysis of core genes with immune infiltrating cells is shown in Figures 19, 20, 21. The correlation analysis of core genes with each immune checkpoint is shown in Figures 22, 23.