3.1 Totally 58 DE-CSRGs were acquired
A total of 1,186 DEGs were acquired between PD and control groups from GSE100054, including 784 up-regulated and 402 down-regulated genes (Fig. 1A-B). Then, the sample clustering analysis showed no outlier samples in GSE100054 dataset (Fig. 1C). Based on the scale-free topology criterion with R2 = 0.85, the soft-threshold was chosen as 12 (Fig. 1D). The adjacency matrix was converted into a TOM matrix, which was used to show the similarity between nodes by considering the weighted correlation. Next, eight modules were obtained through dynamic tree clipping (Fig. 1E), and the heat map found that the MEblue (Cor = 0.58, P = 0.01) and MEyellow (Cor = 0.55, P = 0.01) had significant positive correlation with PD. The two modules were selected as key modules, containing 2,139 key module genes (Fig. 1F). Finally, the 58 DE-CSRGs were obtained by taking the intersection of 1,186 DEGs, 2,139 key module genes and 414 CSRGs (Fig. 1G).
3.2 DE-CSRGs regulated PD through multiple biological pathways
Totally 58 DE-CSRGs were analyzed for functional enrichment. The GO showed that a total of 551 items were enriched, including 441 biological processes (BPs), 55 cellular components (CCs) and 55 molecular functions (MFs). In terms of BP, DE-CSRGs were significantly associated with blood coagulation, coagulation, hemostasis, wound healing and regulation of body fluid levels. In terms of CC, DE-CSRGs were significantly enriched in ficolin-1-rich granule, secretory granules lumen, tertiary granule, cytoplasmic vesicle lumen and vesicle lumen. In terms of MF, DE-CSRGs were significantly related to integrin binding, extracellular matrix binding, RAGE receptor binding, complement receptor activity and Toll-like receptor binding (Fig. 2A). The KEGG showed that 186 related pathways were obtained, of which 5 were most significant (P.adj < 0.05), including complement and coagulation cascades, neutrophil extracellular trap formation, malaria, platelet activation and tuberculosis (Fig. 2B).
3.3 CD93, TLR2, CTSS, PRKCD, and CR1L were causally associated with PD
After screening, 38 DE-CSRGs of the 58 DE-CSRGs had three or more SNPs, thus these 38 DE-CSRGs were incorporated into the subsequent MR analysis. Five algorithms were applied to perform univariate MR analysis of the 38 DE-CSRGs as exposure factors and PD as an outcome, respectively. The results showed that CD93 (P = 0.005, OR = 1.045), TLR2 (P = 0.022, OR = 1.178), CTSS (P = 0.010, OR = 0.970), PRKCD (P < 0.001, OR = 1.333), ITGAM (P < 0.001, OR = 1.479) and CR1L (P < 0.001, OR = 0.85) were significant causal relationship with PD. According to the OR value, CD93, TLR2, PRKCD, and ITGAM were risk factors for PD, whereas CTSS and CR1L were protective factors. Details about the MR analysis on significant causal exposure factors and PD were shown in Table 2. Scatter plots and forest plots further support our conclusions above, and funnel plots showed that MR analysis conform to Mendel's second law of random grouping (Supplementary Figs. 1–3).
Table 2
MR analysis on significant causal exposure factors and PD
exposure | outcome | gene | nSNP | pval | OR |
eqtl-a-ENSG00000125810 | Parkinson's disease|| id:ieu-b-7 | CD93 | 52 | 0.005213979 | 1.044739 |
eqtl-a-ENSG00000137462 | Parkinson's disease || id:ieu-b-7 | TLR2 | 4 | 0.021604242 | 1.178478 |
eqtl-a-ENSG00000163131 | Parkinson's disease || id:ieu-b-7 | CTSS | 49 | 0.010088317 | 0.970339 |
eqtl-a-ENSG00000163932 | Parkinson's disease || id:ieu-b-7 | PRKCD | 16 | 0.000000000531 | 1.332504 |
eqtl-a-ENSG00000169896 | Parkinson's disease || id:ieu-b-7 | ITGAM | 13 | 0.000184669 | 1.47872 |
eqtl-a-ENSG00000197721 | Parkinson's disease || id:ieu-b-7 | CR1L | 18 | 0.00000427 | 0.851668 |
OR value, 1 as the boundary, greater than 1 as risk factor, less than 1 as safety factor; Abbreviations: MR- Mendelian randomization; SNP - single nucleotide polymorphism |
Furthermore, the sensitivity analysis was performed on the MR analysis results. The Cochran's Q-test results showed that no heterogeneity was found in the MR analyses of CD93, TLR2, CTSS, PRKCD, and CR1L (P > 0.05), whereas there was heterogeneity in the MR analysis of ITGAM (P < 0.05) (Table 3). Meanwhile, the above five genes were all passed the pleiotropy test (P > 0.05), means that there was no horizontal pleiotropy (Table 4). In the LOO analysis, the effect of the remaining SNPs on the outcome didn’t change significantly by gradually eliminating each SNP, indicating that the results of MR analysis were reliable and stable (Supplementary Fig. 4). Therefore, CD93, TLR2, CTSS, PRKCD, and CR1L were selected as candidate genes for subsequent analyses.
Table 3
MR heterogeneity test and analysis
exposure | outcome | Symbol | Q | Q_df | Q_pval |
eqtl-a-ENSG00000125810 | Parkinson's disease || id:ieu-b-7 | CD93 | 41.6124 | 50 | 0.79479 |
eqtl-a-ENSG00000137462 | Parkinson's disease || id:ieu-b-7 | TLR2 | 1.358926 | 3 | 0.71519 |
eqtl-a-ENSG00000163131 | Parkinson's disease || id:ieu-b-7 | CTSS | 55.60518 | 45 | 0.133602 |
eqtl-a-ENSG00000163932 | Parkinson's disease || id:ieu-b-7 | PRKCD | 17.67404 | 14 | 0.222032 |
eqtl-a-ENSG00000169896 | Parkinson's disease || id:ieu-b-7 | ITGAM | 85.27118 | 12 | <0.001 |
eqtl-a-ENSG00000197721 | Parkinson's disease || id:ieu-b-7 | CR1L | 6.066947 | 15 | 0.978601 |
Q_pval value greater than 0.05 indicates that there is no heterogeneity among the samples |
Table 4
The horizontal pleiotropy test for five candidate exposure factors
exposure | outcome | Symbol | egger_intercept | se | p |
eqtl-a-ENSG00000125810 | Parkinson's disease || id:ieu-b-7 | CD93 | 0.004353 | 0.007369 | 0.557445 |
eqtl-a-ENSG00000137462 | Parkinson's disease || id:ieu-b-7 | TLR2 | 0.043354 | 0.041348 | 0.40442 |
eqtl-a-ENSG00000163131 | Parkinson's disease || id:ieu-b-7 | CTSS | -0.00091 | 0.0103 | 0.929632 |
eqtl-a-ENSG00000163932 | Parkinson's disease || id:ieu-b-7 | PRKCD | 0.022947 | 0.022314 | 0.322539 |
eqtl-a-ENSG00000197721 | Parkinson's disease || id:ieu-b-7 | CR1L | -0.00894 | 0.027131 | 0.746666 |
p>0.05 indicates that there was no horizontal pleiotropy. |
3.4 Diagnostic value of CD93, CTSS, PRKCD and TLR2
The expression of candidate genes was verified in GSE100054 and GSE165082 datasets. The boxplot showed that the the expression of CD93, CTSS, PRKCD and TLR2 was consistent in the two datasets and significantly different between PD and control groups (Figs. 3A-3B). Meanwhile, in the ROC curves, the AUC values of CD93, CTSS, PRKCD, and TLR2 were all greater than 0.7 with respect to two datasets (Figs. 3C-3D), showing that the four candidate genes had better predictive power in PD. Therefore, CD93, CTSS, PRKCD, and TLR2 were chosen as the hub genes for this study. Furthermore, based on these four hub genes, we constructed an ANN model in GSE100054 dataset to further diagnose PD (Fig. 3E), and the ROC curve results indicated that the AUC value of ANN model was 0.922, implying that the diagnostic effect of the ANN model was excellent (Fig. 3F).
Figure. 3 Evaluation of genes expression and diagnostic performance. A The box line diagram of the expression levels of candidate genes in GSE100054. B The box line diagram of the expression levels of candidate genes in GSE165082. C The ROC curves of the four candidate genes in GSE100054. D The ROC curves of the four candidate genes in GSE165082. E The ANN model of the four hub gens in GSE100054. F The ROC curve of the AUC value of ANN model.
3.5 Prediction of relevant pathways and functions of four hub genes
Chromosome localisation results showed that CTSS, PRKCD, TLR2 and CD93 were located in chromosomes 1, 3, 4 and 20, respectively (Fig. 4A). Simultaneously, the four hub genes were significantly positively correlated with each other (cor > 0.4, P < 0.05), among which there was the strongest positive correlation between PRKCD and CD93 (cor = 0.858) (Fig. 4B).
We explored the biological pathways enriched for four hub genes through GSEA. The results revealed that all four hub genes regulated a variety of immune responses, inflammatory factors, cytokines, and complement system-related pathways, including “signaling by interleukins”, “Toll-like receptor cascades”, “MHC class II antigen presentation”, “complement system”, “TYROBP causal network in microglia”, “IRAK-4 deficiency TLR2/4”, “cytokine-cytokine receptor interaction”, etc (Figs. 5A-5D). These analyses revealed these pathways were closely correlated with the pathogenesis of PD.
Furthermore, GeneMANIA results showed that the co-expression network and potential functions of four hub genes and their 20 reciprocal genes, and they were primarily linked to interleukin-6 production, pattern recognition receptor signaling pathway, regulation of tumor necrosis factor production, regulation of tumor necrosis factor superfamily cytokine production, and toll-like receptor signaling pathways (Fig. 5E)
3.6 CD93, CTSS, PRKCD and TLR2 were significantly correlated with immune cells
The enrichment scores of 28 immune infiltrating cells for all samples in GSE100054 were calculated, as shown in Fig. 6A, in which the enrichment score of myeloid-derived suppressor cell (MDSC) was the highest. Boxplots were drawn between PD and controls samples, which indicated significant differences (P < 0.05) in 17 immune cells, including activated dendritic cell, CD56bright natural killer cell and macrophage, etc (Fig. 6B). In addition, the heat map was drawn to analyze the correlation between differential immune cells, as shown that MDSC and macrophage had the highest correlation with a correlation coefficient of 0.911, and activated CD8 + T cell and activated dendritic cell were most negative correlation with a correlation coefficient of -0.712 (Fig. 6C). Finally, the correlation between four hub genes and 17 differential immune cells was explored, showing that all four hub genes presented significant positive correlations with regulatory T (Treg) cells, macrophage, and activated dendritic cells, as well as significant negative correlations with activated CD8 T cells (Figs. 6D-6G).
3.7 The hub gene was controlled via several ceRNAs and TFs in PD
Totally 5 TFs of regulatory hub genes were predicted based on ENCODE database, of which four TFs (NFKB1, HIF1A, SP1 and RELA) were regulators of TLR2, while only one TF (SPI1) was a regulator of CTSS, and TF-mRNA network was shown in Fig. 7A In the samples of GSE100054 dataset, HIF1A was had higher expression, whereas SPI1 had lower expression (Fig. 7B). Additionally, totally 16 potential miRNAs regulating hub genes were predicted through miRTarBase and TarBase databases, immediately followed by 309 upstream lncRNAs of miRNAs were collected through miRNet. Subsequently, the lncRNA-miRNA-mRNA network was constructed (Fig. 7C), containing 4 mRNAs, 16 miRNAs and 309 lncRNAs. The multiple relationship pairs were found in network, such as lncRNAs (CCDC18-AS1, IL6R-AS1, BLACAT1, PINK1-AS, etc.) could simultaneously regulate two hub genes (CTSS and TLR2) via both hsa-mir-106b-5p, lncRNAs (GAS5, MIR29B2CHG, etc.) could regulate CD93 via hsa-mir-29a-3p, and lncRNAs (LINC01128, TNFRSF14-AS1, etc.) could simultaneously regulate PRKCD via miRNAs (hsa-mir-15a-5p, hsa-mir-16-5p and hsa-mir-195-5p).
In the DrugBank database, totally 6 drugs were eventually searched, including Copper and Hyaluronic acid targeting CD93, Adapalene and Tuberculin purified protein derivative targeting TLR2, Fostamatinib targeting both CTSS and PRKCD, as well as Ingenol mebutate likewise targeting PRKCD (Fig. 7D). Furthermore, based on DGIdb database, it was found 35 drugs that could potentially affect the hub genens. Of these, 1 drug (Petesicatib), 4 drugs (DiaPep277, Tomaralimab, ChEMBL 1836411, and Resveratrol hexanoic acid) and 30 drugs (Enoxolone, Rottlerin, Fasudil, etc.) targeting CTSS, TLR2 and PRKCD were predicted, respectively, unfortunately, our analyses did not yield predictions for drugs targeting CRISPLD2 (Fig. 7E).
3.8 Experimental validation of hub genes expression through qRT-PCR
Ultimately, we validated the four hub genes by qRT-PCR, and the result revealed that the mRNA expression of CTSS, PRKCD and TLR2 was significantly elevated in the PD samples (P < 0.05) (Figs. 8A-8C), which was consistent with the results from public databases, implicating that CTSS, PRKCD and TLR2 had regulatory role in PD. However, CD93 expression was not significantly different between PD and control samples (P > 0.05) (Figs. 8D), probably because of the small sample size, and we will perform further experiments to verify it afterwards.