LLPS-related genes
The overall flow diagram of this study was presented in Fig. 1. The detailed information of 3585 LLPS-related genes in homo sapiens was obtained from the DrLLPS, of which 85 were scaffolds (2.37%), 355 were regulators (9.9%) and 3145 were clients (87.73%; Additional file 1: Fig. S1A and Additional file 10: Table S4). Then, the transcriptome data of these 3585 LLPS-related genes were obtained from TCGA cohort and the GTEx database. The heatmap showed an obvious distinction in the expression of LLPS-related genes between LGG samples and normal samples (Additional file 1: Fig. S1B). Further differential expression analysis identified 443 LLPS-related DEGs, of which 170 were upregulated and 273 were downregulated in LGG samples compared with normal samples (Fig. 2A and Additional file 11: Table S5). By intersecting these DEGs with the prognostic LLPS-related genes obtained through univariate Cox regression analysis (Additional file 12: Table S6), 225 prognostic LLPS-related DEGs were identified (Fig. 2B), of which 3 were scaffolds, 24 were regulators and 198 were clients (Additional file 1: Fig. S1C). The top 10 significantly enriched GO terms and KEGG pathways for these prognostic LLPS-related DEGs were shown in Fig. S1D and E (see Additional file 1).
Identification of LLPS subtypes in TCGA cohort
Based on the expression profiles of 225 prognostic LLPS-related DEGs, we performed NMF to identify LLPS subtypes in TCGA cohort. We selected 4 as the optimal clustering number, which was decided by the cophenetic, dispersion and silhouette indicators (Additional file 2: Fig. S2). Then, a total of 423 LGG patients were categorized into four subtypes (Fig. 2C), namely LS1 (n=73), LS2 (n=78), LS3 (n=227) and LS4 (n=45). The tSNE showed robust differences in distribution among four LLPS subtypes (Fig. 2D). The prominent differences in the expression of 225 prognostic LLPS-related DEGs were also observed in the heatmap (Fig. 2E). The K-M survival curve revealed that there was distinct survival difference among four LLPS subtypes (Fig. 2F). LS1 had the worst survival outcome, whereas LS3 had the best survival outcome.
Subsequently, we compared the demographics and clinicopathological features of LGG patients in four LLPS subtypes. As illustrated in Fig. 3A, LS1 had more patients with age greater than or equal to 45 years compared with other subtypes. There were no significant differences among subtypes regarding gender distribution. The proportion of patients with Karnofsky Performance Score (KPS) greater than or equal to 80 was higher in LS3 than other subtypes. A higher percentage of deaths was observed in LS1 and LS4. Astrocytoma was more common in LS1, but oligodendroglioma was more common in other three subtypes. LS1 and LS4 had a significantly higher proportion of World Health Organization (WHO) grade III glioma compared with LS2 and LS3. There were also significant differences in molecular pathology among four LLPS subtypes. LGG patients with IDH wild type, or 1p19q non-codeletion, or MGMT promoter (MGMTp) unmethylated, or TERT mutant were more frequent in LS1.
To explore the underlying molecular mechanism related to the LLPS subtypes of LGG, we performed ssGSEA based on the transcriptome data of 50 gene sets retrieved from MSigDB. The ssGSEA Z-score was applied to quantify the levels of 50 hallmarks, and was visually illustrated by the heatmap (Fig. 3B). Compared with LS2 and LS3, LS1 and LS4 was more correlated with the hallmarks related to cell cycle and DNA repair, especially LS4, which may suggest an active proliferation of cancer cells. In addition, LS1 was positively associated with many cancer-related hallmarks, including glycolysis, epithelial mesenchymal transition (EMT), angiogenesis, hypoxia, apoptosis, inflammation and immunity. For further validation, we screened out a total of 507 upregulated genes (log2FC > 1 and P < 0.05) in LS1, and 121 upregulated genes (log2FC>1 and P < 0.05) in LS4 (Fig. 3C, D), which were subsequently submitted to GO enrichment analyses. The enriched biological processes of upregulated genes in LS1 and LS4 were consistent with the results of ssGSEA (Fig. 3E, F). The above results might explain the poor survival of LS1 and LS4 to some extent.
Comprehensive analyses of genomic alterations among LLPS subtypes
To gain a further insight into the disparity in the genomic layer, we compared the somatic mutation profiles and CNA landscapes among LLPS subtypes. First, LS1 and LS4 had significantly higher TMB and mutation counts than LS2 and LS3 (Fig. 4A). The somatic mutation profiles revealed that LS1 possessed specific top mutated genes compared with other three LLPS subtypes (Fig. 4B). EGFR was the most commonly mutated gene in LS1, followed by PTEN, whereas IDH1 and TP3 were the most two frequently mutated genes in LS2, LS3 and LS4. Next, we also noticed that there were clear differences in the degree of CNA burden among LLPS subtypes (Fig. 4C). It showed a trend that compared with LS2 and LS3, LS1 and LS4 had relatively higher burdens of gain and loss at both focal and arm levels. The distribution of Gistic score across all autosomes in LLPS subtypes was visualized in Fig. 4D. The results described above demonstrated an active genomic alteration in LS1 and LS4, which was likely due to their stronger proliferation ability of cancer cells.
TIME and immunotherapeutic responses in different LLPS subtypes
Growing studies have begun to characterize the potential role of LLPS in regulating TIME and sensitivity to immunotherapy [12, 27]. Hence, we tried to compare the TIME patterns and immunotherapeutic responses among different LLPS subtypes. The ESTIMATE algorithm was firstly performed to quantify the constituents of the TIME of LGG. The results revealed that LS1 had highest immune, stromal, and ESTIMATE scores, and lowest tumor purity compared with other three subtypes. An opposite trend was observed in LS2 and LS4 (Fig. 5A). Then, LGG patients were classified into high-/low- immunity subtypes based on the ssGSEA Z-scores of 29 immune signatures. LS1 consisted of more proportions of high-immunity subtype, whereas LS2, LS3 and LS4 contained mainly low-immunity subtype (P < 0.001; Fig. 5B). The distribution of ssGSEA Z-score of 29 immune signatures was presented in Fig. 5C and D. There were significant differences in immune cell infiltrations and immune functions among four LLPS subtypes. Most notably, LS1 showed higher infiltration of most immune cells, and more robust immune functions than other three subtypes. Another algorithm, CIBERSORT, was also utilized to described the proportion of 22 immune cells in different LLPS subtypes, which was displayed in Fig. 5E.
We then compared the expression levels of immune checkpoints among LLPS subtypes. Compared with other subtypes, LS1 had significantly higher expression levels of all ten immune checkpoints, including PD-1 and its ligands (PD-L1 and PD-L2), CTLA-4 and its ligands (CD80 and CD86), LAG-3, TIM-3, IDO1 and B7H3 (Fig. 5F). Currently, ICI therapy has undoubtedly been a very promising strategy of immunotherapy, which caused a major breakthrough in antitumor treatment. Thus, we further used the TIDE algorithm to predict the response to ICI therapy of different LLPS subtypes. Our results revealed that the TIDE score was significantly decreased in LS1 and LS4 (Fig. 5G). The proportions of responders in LS1 and LS4 were nearly twofold that in LS2 and LS3 subtypes (P < 0.001; Fig. 5H). We also observed that compared with LS2 and LS3, LS1 and LS4 had higher MSI scores, which has been considered to be an indicator of effective immunotherapy (Fig. 5I). Moreover, we performed subclass mapping analysis to predict the response to ICI therapy, including PD-1 and CTLA-4 inhibitors, of the four LLPS subtypes. LS1 was found to be more sensitive to PD-1 inhibitor (Bonferroni corrected P = 0.001), while LS2 showed no response to CTLA-4 inhibitor (Bonferroni corrected P = 0.044; Fig. 5J). All these findings suggested that the LLPS patterns of LGG might play a crucial role in regulating the TIME patterns and immunotherapeutic responses.
Construction and validation of a prognostic signature based on LLPS-related hub genes
To identify the LLPS-related hub genes, WGCNA analysis was performed with the transcriptome data of 225 prognostic LLPS-related DEGs. We selected 2 as the optimal soft-thresholding power based on the standard scale-free model fitting index R2 (Additional file 3: Fig. S3A). Then, a total of 10 gene modules were obtained. Based on the previous findings, there were similarities in terms of survival, genomic alteration and immune characteristic between LS1 and LS4, also between LS2 and LS3. Thus, LS1 and LS4, and LS2 and LS3, were merged together, respectively. Among these 10 models, the brown module containing 26 genes exhibited the highest correlation with LS1 and LS4, and the green module containing 17 genes showed the highest correlation with LS2 and LS3 (Fig. 6A, B). Then, the genes in these two models were deemed as LLPS-related hub genes, and were picked for subsequent analyses. Fig. S3B and C (see Additional file 3) showed the top 10 enriched GO terms and KEGG pathways for the genes of the green and brown modules. As shown in the interaction networks, there were 15 genes and 11 edges in the green module, and 22 genes and 92 edges in the brown module with a threshold weight > 0.15 (Additional file 3: Fig. S3D, E).
Next, these forty-three LLPS-related hub genes were incorporated into the LASSO Cox regression in the TCGA cohort, twelve of which stood out for the construction of a LLPS-related prognostic signature (Additional file 3: Fig. S3F and Fig. 6C). Fig. 6D exhibited the LASSO coefficients of each selected gene in this signature. There were nine protective genes and three risky genes for survival outcomes. The K-M survival curves of these 12 selected genes were shown in Fig. S4 (see Additional file 4). Then, the LPRS of each LGG patient was calculated by summing the product of the expression levels of each selected LLPS-related hub gene and corresponding LASSO coefficients. The median LPRS was used to stratify patients into high-LPRS and low-LPRS subgroups. As shown in the K-M survival curves, high-LPRS patients exhibited worse OS in TCGA cohort (Fig. 6E). Consistent results were obtained in four independent validation cohorts (Fig. 6F-I). Additionally, the high accuracy of LPRS in predicting 1-, 3- and 5-year OS was confirmed by the ROC curves (Additional file 5: Fig. S5A). According to the univariate and multivariate Cox regression analyses, LPRS was an independent prognostic indicator for OS in all cohorts (Additional file 5: Fig. S5B). Further, meta-analysis was performed, and revealed that the overall pooled HR of LPRS was 1.91 (95% CI = 1.39-2.63; Fig. 6J).
Correlation of LPRS with clinicopathological features, genomic alterations and TIME patterns
The prognostic value of LPRS has been well elucidated. Then, we sought to explore its clinical relevance in TCGA cohort. As shown in Fig. 7A, LPRS was ranked from low to high to show the correlation between LPRS and clinicopathological features. There were significant differences between high- and low-LPRS subgroups in terms of age, survival status, histology, WHO grade, IDH status, 1p19q status, MGMTp status, TERT status, immunity subtypes and LLPS subtypes. The attribute changes of each patient were visualized by an alluvial diagram (Fig. 7B). We also compared the levels of LPRS between subgroups stratified by different clinicopathological features. Patients with the clinicopathological features of age ≥ 45 years, death, astrocytoma, WHO grade III, IDH wild type, 1p19q non-codeletion and MGMTp unmethylated presented significantly higher levels of LPRS, whereas no LPRS differences were observed between patients stratified by gender, KPS and TERT status (Additional file 6: Fig. S6A). In addition, we found that the high-immunity subtype was associated with a higher LPRS than low-immunity subtype, and LPRS was ranked in increasing order for LS3, LS2, LS4 and LS1 (Additional file 6: Fig. S6A).
To better characterize the LLPS-related prognostic signature, we tested the correlation between the cancer hallmarks and LPRS. A correlation heatmap revealed that LPRS was significantly positively correlated with many well-known hallmarks of cancer, including DNA repair and cell cycle-related hallmarks (Fig. 7C). As expected, further analyses demonstrated that LPRS was markedly positively linked with TMB, mutation counts, burden of copy number gain and loss at focal-level, and burden of copy number gain at arm-level (Fig. 7D). These data indicated to some extent that a higher LPRS represents a higher frequency of genomic alterations.
Given that LPRS was associated with different immunity subtypes, we took further insight into the detailed differences in TIME patterns as the LPRS changes. The correlation between LPRS and 29 immune signatures was illustrated by a correlation heatmap (Fig. 7E). Then, LPRS was significantly positively correlated with the immune, stromal, and ESTIMATE scores, but negatively correlated with tumor purity, which indicating that the infiltration levels of immune and stromal cells increase with the elevation of LPRS (Fig. 7F). Further correlation analyses were carried out between LPRS and the infiltration levels of 22 immune cells quantified by the CIBERSORT algorithm. It turned out that LPRS was positively correlated with memory resting CD4+ T cells, M1 macrophages and Tregs, and was negatively correlated with activated mast cells and monocytes (Additional file 6: Fig. S6B). In addition, LPRS was observed to be positively correlated with the expression levels of ten immune checkpoints (Additional file 6: Fig. S6C).
The role of LPRS in predicting the response to ICI therapy
ICI therapy represented by PD-1, PD-L1 and CTLA-4 inhibitors has undoubtedly achieved encouraging progress in the therapeutic landscape of cancer. However, the considerable heterogeneity in therapeutic response has long been a major challenge to improve survival outcomes for glioma patients. Hence, we focused on the role of LPRS in predicting the response to ICI therapy. In TCGA cohort, patients with higher LPRS showed lower level of TIDE and higher level of MSI score (Fig. 8A, B). Based on TIDE algorithm, the high-LPRS subgroup contained a higher proportion of responders to ICI therapy compared with low-LPRS subgroup (Fig. 8C). Besides, the LPRS of responders was significantly higher than that of non-responders (Fig. 8D). Above all, we speculated that high-LPRS patients could benefit more from ICI therapy than low-LPRS patients. To make our findings more convincing, we next investigated whether the LPRS could predict patients’ response to ICI therapy in two independent ICI therapy cohorts, namely IMvigor210 (anti-PD-L1 cohort) and GSE78220 (anti-PD-1 cohort). In both cohorts, there were significantly higher proportion of complete response (CR) or partial response (PR) in the high-LPRS subgroup (Fig. 8E, F). Patients with the outcome of CR or PR exhibited significantly higher level of LPRS than patients with the outcome of stable disease (SD) or progressive disease (PD) (Fig. 8G, H). An overall satisfactory accuracy of LPRS for predicting the response to ICI therapy was confirmed by the ROC curves (Fig. 8J, K). Altogether, our findings strongly indicated that the LPRS was associated with the response to ICI therapy, and has the potential to serve as a response indicator in clinical practice.
Validation of the Expression Levels of Selected LLPS-related hub genes
We selected four LLPS-related hub genes involved in LPRS for validation, namely FAM204A, SMU1, TNPO1 and TOP2A. We tested their transcript levels in cell lines, LGG tissues and non-tumor brain tissues. The qRT-PCR results showed that the mRNA expression levels of FAM204A and SMU1 in human glioma cell lines overall showed a downward trend compared with the HA1800, while the mRNA expression levels of TNPO1 and TOP2A showed an overall upward trend (Figure 9A). The qRT-PCR results of tissue samples are consistent with that of cell lines. Compared with non-tumor brain tissue, the transcription levels of FAM204A and SMU1 in LGG tissues decreased, while the transcription levels of NPO1 and TOP2A increased (Figure 9B). Further, the protein expression level of these four LLPS-related hub genes was detected by IHC staining and analyzed by calculating the IOD/area. Compared with non-tumor tissues, FAM204A was down-regulated, but TNPO1 and TOP2A were up-regulated in glioma tissues. There was no significant difference in the protein level of SMU1 between LGG and NBT (Figure 9C). Representative IHC staining images were shown in Figure 9D.