Identification of DEGs
The raw CEL files were downloaded from the GEO, and then normalized with the “affy” package. After eliminating batch effects in the expression data, we used the limma package to select DEGs. Using P < 0.01 and |logFC| ≥ 3 criteria, 357 DEGs were screened out from two GEO datasets, including 181 up-regulation DEGs and 176 down-regulation DEGs in HGSC compared to LGSC. The top 10 up-regulated and top 10 down-regulated DEGs were listed in Table 1. In addition, the heat map of DEGs was shown as the Fig. 1. The hierarchical clustering analysis revealed a distinct separation in these two different types of ovarian cancer.
Table 1
Top ten up-regulation and top ten down-regulation DEGs (high-grade serous ovarian cancer versus low-grade serous ovarian cancer)
Probe set | Gene symbol | Log fold change | P value |
Up-regulated | | | |
1565483_at | EGFR | 3.29 | 1.78E-12 |
214677_x_at | IGLC1 | 3.19 | 1.91E-07 |
218542_at | CEP55 | 3.10 | 1.61E-17 |
203560_at | GGH | 2.83 | 5.39E-13 |
201292_at | TOP2A | 2.77 | 2.86E-12 |
203764_at | DLGAP5 | 2.77 | 5.43E-17 |
219787_s_at | ECT2 | 2.74 | 9.80E-15 |
204533_at | CXCL10 | 2.71 | 1.48E-09 |
219148_at | PBK | 2.66 | 2.25E-13 |
242546_at | LINC01296 | 2.64 | 4.91E-11 |
Down-regulated | | | |
214218_s_at | XIST | -3.41 | 5.93E-11 |
229782_at | RMST | -3.39 | 3.47E-13 |
233249_at | LOC100507073 | -2.85 | 6.36E-13 |
204014_at | DUSP4 | -2.68 | 3.70E-08 |
229331_at | SPATA18 | -2.61 | 2.69E-07 |
205765_at | CYP3A5 | -2.58 | 3.42E-08 |
229245_at | PLEKHA6 | -2.58 | 4.60E-14 |
240065_at | FAM81B | -2.50 | 7.43E-06 |
225996_at 227188_at | LONRF2 EVA1C | -2.50 -2.40 | 1.14E-08 9.98E-09 |
Functional determination by GO terms and KEGG pathways
To interpret the function of the DEGs, the gene lists were uploaded to the online software DAVID. The GO terms including cell component (CC), biological processes (BP) and molecular function (MF) ontologies were shown in Fig. 2. For the CC ontology, we found that most DEGs significantly enriched in nucleus and cytoplasm items, such as nucleus, nucleoplasm and cytosol. Some genes were associated with organelles in cytoplasm, such as centrosome, spindle and midbody (Fig. 2A). For the BP ontology, the majority GO terms were about cell proliferation items, such as cell division, G1/S transition of mitotic cell cycle. In addition, the other GO terms is significantly related to regulation activities of organism, including positive regulation of cell proliferation, positive regulation of GTPase activity (Fig. 2B). In the MF ontology, the binding function consisted a large proportion of GO categories, which involved protein binding, protein kinase binding, ATP binding and microtubule binding (Fig. 2C). As shown in the Table 2, the KEGG pathway enrichment analysis found five significantly pathways, including the PI3K-Akt signaling pathway, pathways in cancer, p53 signaling pathway, cell cycle, microRNAs in cancer.
Table 2
KEGG pathway enrichment of DEGs
Term | Count | P value |
hsa04110: Cell cycle | 12 | 1.97E-06 |
hsa04115: p53 signaling pathway | 9 | 5.85E-06 |
hsa05200: Pathways in cancer | 14 | 0.0053 |
hsa04151: PI3K-Akt signaling pathway | 12 | 0.0133 |
hsa05206: MicroRNAs in cancer | 10 | 0.0257 |
Count refers to the number of genes significantly enriched in this term |
PPI network and hub genes
The protein-protein interaction network of DEGs was constructed through the STRING database. The 357 DEGs were uploaded to the STRING website to get PPI data. Then, the PPI data were analyzed by the Cytoscape software. The PPI which consisted of 167 nodes and 1794 edges shown in Fig. 3A. After the PPI network construction, the CytoHubba plug-in in Cytoscape was used to indentify hub genes. In PPI networks, 12 node protein, including TOP2A, CDK1, CCNB1, MAD2L1, KIF11, CCNB2, TTK, AURKA, RACGAP1, BUB1, RRM2, PBK had a strong association with other nodes (Degree ≥ 60). From the PPI network, a significant module including 51 nodes and 1206 edges was selected by the MCODE (Fig. 3B). The GO and KEGG enrichment analysis show these genes in the module were enriched in the mitotic cytokinesis, ATP binding and p53 signaling pathway (Table 3).
Table 3
GO and KEGG pathway enrichment analysis of the genes in module
Term | Description | Count | P value |
GO:0000281 | mitotic cytokinesis | 5 | 7.37E-07 |
GO:0032467 | positive regulation of cytokinesis | 4 | 5.82E-05 |
GO:0007018 | microtubule-based movement | 5 | 6.19E-05 |
GO:0035556 | intracellular signal transduction | 4 | 0.044449 |
GO:0030496 | midbody | 6 | 5.04E-06 |
GO:0005654 | nucleoplasm | 14 | 9.49E-05 |
GO:0005737 | cytoplasm | 19 | 4.15E-04 |
GO:0005634 | nucleus | 16 | 0.004541 |
GO:0016020 | membrane | 9 | 0.004243 |
GO:0005524 | ATP binding | 19 | 8.45E-11 |
GO:0016887 | ATPase activity | 5 | 1.27E-04 |
GO:0003682 | chromatin binding | 5 | 0.008074 |
KEGG:hsa04110 | Cell cycle | 10 | 6.71E-12 |
KEGG:hsa04115 | p53 signaling pathway | 5 | 2.13E-05 |
Count refers to the number of genes significantly enriched in this term |
Genetic Alterations of Twelve Hub-genes in Serious Ovarian Cancer Patients
The cBioPortal tool was used for the analysis of genetic alterations of twelve hub-genes from the TCGA PanCancer Atlas dataset. As a result, 5% TOP2A, 9% CDK1, 11% CCNB1, 6% MAD2L1, 5% KIF11, 5% CCNb2, 6% TTK, 14% AURKA, 5%ACGAP1, 4%BUB1, 10%RRM2 and 10% PBK were altered in ten types of genetic alterations, including infame mutation (unknown significance), missense mutation (unknown significance), splice mutation (unknown significance), truncating mutation (unknown significance), amplification, deep deletion, mRNA high, mRNA low, protein high and protein low in the queried TCGA serious ovarian cancer samples (Fig. 4A). The alteration frequency derived from mutations, copy-number alterations, mRNA expression data and protein expression data were shown in serous ovarian cancer (Fig. 4B). To explore whether these hub genes amplification had an influence on its mRNA and protein level, the results indicated that TTK and AURKA were amplified along with the significantly high mRNA and protein level from TCGA-OV cohort (Fig. 4C). The mutation types, number, and sites of TOP2A, CDK1, TTK and BUB1 genetic alterations were displayed in Fig. 4D.
The Expression Level of the Twelve Hub-genes in Ovarian Cancer Tissues with P53 Mutation
GO functional enrichment analysis showed that the DEGs was associated with the p53 signaling pathway. We conducted research and analysis on 199 ovarian cancer samples with P53 mutations and 19 ovarian cancer samples without P53 mutations. The results showed that the mRNA expression levels of AURKA (Fig. 5A), BUB1 (Fig. 5B), CCNB1 (Fig. 5C), CDK1 (Fig. 5D), MAD2L1 (Fig. 5E), PBK (Fig. 5F), TOP2A (Fig. 5G), and TTK (Fig. 5H) in P53 mutated ovarian cancer samples were higher than those in P53 non mutated ovarian cancer samples.
The Expression Level of the Twelve Hub-genes in Tumor Tissues
To better understand the differential expression, the CPTAC dataset was used to assess the twelve hub-genes mRNA expression level in large-scale mRNA data from the National Cancer Institute. As shown in Fig. 6A, the mRNA expression of TOP2A, CDK1, CCNB1, MAD2L1, KIF11, CCNB2, TTK, AURKA, RACGAP1, BUB1, RRM2, and PBK were significantly increased in tumor tissues. The GEPIA2 tool was also used to analyze the relationship between the twelve hub-genes expression and tumor pathological stage. Figure 6B showed stage-specific change of MAD2L1, RACGAP1, RRM2 and TTK in tumor tissues. On the other hand, the remaining hub genes were no clear association between the gene expression and patients’ stage.
The Protein Expression of Twelve Hub-genes in Serous Ovarian Cancer tissues vs. Non-serous Ovarian Cancer tissues
To assess the protein expression of twelve hub-genes in serous ovarian cancer tissues vs. non-serous ovarian cancer tissues, we performed immunohistochemical analysis. The expression levels of ten proteins, including TOP2A, CDK1, CCNB1, MAD2L1, KIF11, CCNB2, TTK, AURKA, RACGAP1 and PBK were distinctly higher in serous ovarian cancer tissues than non-serous ovarian cancer tissues (Fig. 7).
The Prognostic Value of Twelve Hub-genes in Patients with Serous Ovarian Cancer
The prognostic value of twelve hub-genes were assessed using an online tool of KM plotter (http://www.kmplot.com). The survival curves were calculated according to the gene expression levels. Among those hub-genes, our results showed that high expression of TOP2A (HR 1.32 95%CIs [1.13–1.55], P = 0.00063), CCNB1 (HR 1.24 95%CIs [1.05–1.47], P = 0.01), KIF11 (HR 1.23 95%CIs [1.04–1.45], P = 0.017), AURKA (HR 1.47 95%CIs [1.23–1.75], P = 2.2e-05), and BUB1 (HR 1.18 95%CIs [1.02–1.38], P = 0.03) were associated with worse overall survival (Fig. 8A-E). We also found increased expression of MAD2L1 (HR 0.79 95%CIs [0.62–1.01], P = 0.055) was not significantly associated with worse overall survival (Fig. 8F).
Correlation of Twelve Hub-Genes with Tumor Purity and Immune Cell Infiltration in Patients with Serous Ovarian Cancer
Since the functional annotation analysis revealed that the twelve hub-genes participated in the process of the immune response, next, the correlation between the expression of twelve hub-genes and immune cell infiltration in the TIMER database was further analyzed. Interestingly, high expression levels of twelve hub-genes were found to be associated with high immune cell infiltration in serous ovarian cancer. A positive correlation between TOP2A expression and the infiltration of macrophage (Cor = 0.12, p = 8.67e − 03) and purity (Cor = 0.123, p = 6.83e − 03) were observed, while the TOP2A expression was negatively associated with the infiltration of CD8 + T cells (Cor = -0.121, p = 7.90e − 03). There is no significant correlation between the expression level of TOP2A and the infiltration level of B cells, CD4 + T cells, neutrophil and dendritic cells (Fig. 9A). The change of KIF11 expression level is the same as that of TOP2A (Fig. 9E). As shown in Fig. 9B, the expression level of CDK1 is positively correlated with tumor purity (Cor = 0.152, p = 1.63e − 02), but not significantly correlated with the infiltration level of B cells, CD4 + T cells, CD8 + T cells, macrophage, neutrophil and dendritic cells. Figure 9C showed that the expression level of CCNB1 is positively correlated with the infiltration level of macrophage (Cor = 0.119, p = 8.80e − 03) and neutrophil (Cor = 0.125, p = 6.01e − 03), but not significantly correlated with the infiltration level of B cells, CD4 + T cells, CD8 + T cells, dendritic cells and tumor purity. The expression level of MAD2L1 is positively correlated with the infiltration level of macrophage (Cor = 0.18, p = 7.00e − 05), neutrophil (Cor = 0.238, p = 1.38e − 07) and dendritic cells (Cor = 0.17, p = 1.75e − 04), but not significantly correlated with the infiltration level of B cells, CD4 + T cells, CD8 + T cells and tumor purity (Fig. 9D).The change of CCNB2 and TTK expression level is the same as that of MAD2L1 (Fig. 9F, G). Figure 9H showed that the expression level of AURKA is positively correlated with the infiltration level of CD4 + T cells (Cor = 0.118, p = 9.90e − 03), macrophage (Cor = 0.225, p = 6.06e − 07) and neutrophil (Cor = 0.12, p = 8.49e − 03), but not significantly correlated with the infiltration level of B cells, CD8 + T cells, dendritic cells and tumor purity. The expression level of RACGAP1 and RRM2 are positively correlated with the infiltration level of CD4 + T cells, macrophage, neutrophil and dendritic cells, but not significantly correlated with the infiltration level of B cells and CD8 + T cells. In addition, the expression level of RACGAP is also positively correlated with tumor purity, while the expression level of RRM2 is not significantly correlated with tumor purity (Fig. 9I, K). Figure 9J showed that the expression level of BUB1 is positively correlated with the infiltration level of macrophage (Cor = 0.128, p = 4.96e − 03) and tumor purity (Cor = 0.09 p = 4.78e − 02), while the BUB1 expression level was negatively associated with the infiltration of CD8 + T cells (C or = -0.121, p = 7.90e − 03). but not significantly correlated with the infiltration level of B cells, CD4 + T cells, neutrophil and dendritic cells. The expression level of PBK is positively correlated with the infiltration level of macrophage (Cor = 0.131, p = 4.14e − 03), but not significantly correlated with the infiltration level of B cells, CD4 + T cells, CD8 + T cells, macrophage, dendritic cells and tumor purity (Fig. 9L).