1.Screening of autophagy-related differential genes and enrichment analysis
In the GEO cohort, we screened 129 ES and 18 normal skeletal muscle samples for differential genes and obtained a total of 5760 differential genes. 72 autophagy-related differential genes were obtained by taking intersections with 232 autophagy-related genes obtained from the HADb Human Autophagy Database (Fig2), and plotting the volcano map of autophagy-related differential genes (Fig3). We mapped the PPI protein interaction network of 72 autophagy-related genes (Fig4). GO and KEGG pathway enrichment analysis was performed, and the results of GO enrichment analysis showed that autophagy-related differential genes were mostly enriched in macroautophagy, regulation of autophagy, cellular, response to external stimulus, autophagosome assembly, autophagosome organization, and so on(Fig5). KEGG pathway enrichment analysis indicated that the genes correlated with Autophagy − animal, Autophagy − other, Kaposi sarcoma−associated, herpesvirus infection, Apoptosis, Mitophagy − animal, and so on(Fig6).
2. Distinction and Evaluation of 8 autophagy-related genes Prognostic Signature for ES
We selected 129 samples containing clinical prognostic information for the next analysis of 72 autophagy-related differential genes. By univariate cox regression in over sruviva l(OS), we found that a total of 22 genes were strongly associated with prognosis (P<0.05), of which 12 genes were prognostic protective factors and 10 genes were prognostic risk factors(Fig7). Then, 12 genes were screened by lasso regression analysis (Fig8 A, B). Finally, based on multivariate cox regression and optimized the model using the “step” function, 8 genes, including HDAC1, EIF4EBP1, DLC1, BECN1, ATG10, SIRT1, PINK1, CFLAR were included in the final model.The risk scoring formula is Riskscore=PINK1(expression)*(-0.930699593)+HDAC1(expression)*0.462381473+BECN1*(-0.897598075)+CFLAR(expression)*(-0.679324292)+DLC1(expression)*0.4698522+EIF4EBP1*0.50843151+ATG10*(-0.734256293)+SIRT1*(-0.659263165). We risk scored these 8 genes based on their risk coefficients as well as expression and divided the samples into high-risk and low-risk groups according to the median cut-off value. Kaplan-Meier (K-M) curves showed that the high-risk group had a worse prognosis (p<8.185e-07)(Fig9 A). Risk curves and scatter plots show the risk scores and survival status of all patients with ES. The mortality rate and hazard ratio were lower in the low-risk group than in the high-risk group. Mortality and risk factors were lower in the low-risk group than in the high-risk group, and the Heatmap showed that HDAC1, EIF4EBP1, DLC1 were upregulated in the high-risk group, while other genes were significantly downregulated (Fig10 A, B, C). Besides, the time-dependent receiver operating characteristic curve (ROC) curve is used to evaluate the 8 autophagy-related gene signals in predicting the total 3 to 5 years of ES patients . Precision in terms of lifetime. The area under the ROC value (AUC) for 3 years and 5 years were 0.897 and 0.849 respectively (Fig9 B). We validated these 8 prognostic gene signatures in the ICGC cohort using the same formula and plotted risk curves, scatter plots and heatmap.(Fig11 A,B,C) The survival curves also reflected a significantly worse prognosis in the high-risk group (p=0.05), with AUC values of 0.595 and 0.666 at 3 and 5 years, respectively, both reflecting the good predictive power of our model (Fig9 C,D).
3. Evaluation of 8 autophagy-associated prognostic genes as independent prognostic factors in GEO chohort
Combining clinical information age and sex, all eight autophagy-related prognostic genes signature could be used as independent prognostic factors in both univariate and multivariate cox regression (P<0.01) (Fig12 A, B).
4. GSEA enrichment analysis
To elucidate the biological pathways and functions associated with the risk score. We performed enrichment analysis on the high-risk group and the low-risk group. Interestingly, in the results of the enrichment analysis, there were enrichment results in the low-risk group concerning multiple metabolic pathways (Fig13), like heparin metabolic process,Histidine metabolism and Propanoate metabolism. To investigate the differences in metabolic pathways between high and low-risk groups, we used the metabolic pathway-related gene set obtained from the KEGG pathway online website, quantified the enrichment scores of 90 metabolic pathways by the R package "GSVA", and created box plots to visualize the differences in metabolic pathways between high and low-risk groups (Fig14), We found differences in a large number of metabolic pathways in both high and low-risk groups, including Mucin type O-glycan biosynthesis, Other types of O-glycan biosynthesis, Glutathione metabolism, etc. were significantly enriched in the high-risk group, and then Starch and sucrose metabolism,beta-Alanine metabolism ,Tryptophan metabolism, etc. were significantly enriched in the low-risk group. Among them, Histidine metabolism and Propanoate metabolism were also significantly different from the high-risk group (P<0.001), which was consistent with the results of our GSEA enrichment analysis.