Data preprocessing and DEG identification
With the advancement of high-throughput sequencing technologies such as RNA sequencing and microarrays in combination with bioinformatics analysis, many datasets have been generated, including lncRNA, miRNA, and mRNA expression profiles. Millions of genes have been detected and are widely used to predict potential biomarkers for various types of cancer [27]. For the identification of the hub genes involved in gynecological cancer, the GEO datasets GSE54388, GSE69428, and GSE36668 were utilized. This retrieved dataset included different samples, including control samples and ovarian carcinoma samples [28], and was subjected to statistical analysis via GEO2R. The statistical analysis yielded 16344 DEGs from the three designated studies by setting a cut-off p-value <0.05 (Fig. 2). All DEG IDs, gene log fold change (FC) values, and p values were saved in Microsoft Excel files(Table S1). The gene volcano plot shown in Fig. 2 A-C revealed red and green indicating increased and decreased gene expression, respectively, while black indicates genes whose expression did not change.
Robust rank aggregation (RRA)
Through the RRA method, 229 common DEGs were identified in the datasets for GYN compared with those for the tissue samples. As shown in Fig. 3, 135 genes among these DEGs were upregulated, while 94 genes were downregulated. The details of these genes is available in Table S2. The list included not only indispensable genes such as transporters, energy-associated genes, and transcription factors but also several key genes that are known to be associated with various types of cancers, such as decorin, mimecan, and the serine/threonine protein PIM kinase. Decorin is an extracellular matrix small leucine-rich proteoglycan protein. It has an impact on several types of cancer by targeting signalling molecules involved in angiogenesis, metastasis, survival, and cell growth [29]. Similarly, mimecan is a significant precursor lesion of colorectal cancer, which is among the most common cancers worldwide. PIM kinases, which belong to the serine/threonine kinase family, are overexpressed in several cancers, including prostate, breast, colon, endometrial, and gastric cancers [30, 31]. The identification of potential differentially expressed key genes associated with various types of cancer reveals the value of the dataset chosen.
Module analysis and PPI network construction
The PPI network built from the STRING database by utilizing the DEGs to assess interactions and in-depth study and results are shown in Fig. 4 [32]. Using the Cytoscape ClusterViz module and the MCODE algorithm, the robust DEGs were subjected to cluster analysis, and the TSV list was imported to the above software as output. The average local clustering coefficient was 0.563, reflecting a moderate tendency for nodes to cluster together. With the expected number of edges being 511, the observed count of 3014 edges significantly surpasses this, suggesting an exceptionally dense network. Furthermore, a p value of PPI enrichment less than 1.0e-16 indicated highly substantial PPI enrichment within the network. To reliably categorize and diagnose diseases using molecular data and provide insight into the mechanisms of disease pathogenesis, it is imperative to identify network biomarkers. The results of the PPI enrichment analysis revealed a persistent correlation between various genes and gene function.
Prediction and analysis of the hub genes
With a set of topological analysis algorithms that are part of the Cytoscape plug-in, CytoHubba, which is used for the identification of significant nodes in PPI networks, assigns a score to each node belonging to the PPI network and is considered a hub gene. The results revealed 10 hub genes, namely, CDK1, AURKA, BUB1B, CCNB1, TOP2A, KIF11, BUB1, CCNB2, CDCA8, and BIRC5, which are shown in Fig. 5. Cyclin-dependent kinases (CDKs), aurora kinase, BUB1B A, CCNB1, BUB1, CCNB2, CCNB2, and CDCA8 are serine/threonine kinases considered promising targets for cancer therapy. These proteins are essential for cell cycle progression, particularly when they form complexes with cyclins [33, 34]. TOP2A is a factor involved in DNA unlinking and not only plays a pivotal role in cell proliferation, DNA replication, transcription, and recombination but also plays a role in other biological processes, such as chromosome separation and concentration. TOP2A has been reported to play an oncogenic role in several tumor types [35]. Survivin is another name for baculoviral inhibitor of apoptosis repeat containing 5 (BIRC5) and is known to play a role by preventing caspase activation and adversely controlling programmed cell death or apoptosis. Due to these characteristics, BIRC5 overexpression promotes particular division and survival pathways linked to cancerous tumors [36]. The literature shows that these genes may help to overcome drug resistance and improve treatment outcomes, and understanding these genes can lead to the discovery of novel therapeutic targets and strategies.
Gene Drug interaction
Gene-drug interaction networks are generated using a tool such as NetworkAnalyst. It visualizes how specific genes interact with various drugs. The highlighted genes (TOP2A, AURKA, CDK1, and KIF11) are shown in yellow, indicating their central role in the network, while the drugs that interact with these genes are displayed in red. For instance, TOP2A interacts with a variety of chemotherapeutic agents like Etoposide, Doxorubicin, and Levofloxacin, all of which interfere with DNA replication in cancer cells [37]. AURKA is associated with drugs such as MLN8237 and Phosphorothioate, both used in cancer treatment due to their impact on cell division [38]. CDK1, another key player, interacts with inhibitors like Flavopiridol and AT7519, which are studied for their potential in cancer therapies targeting cell cycle regulation [39]. Last, KIF11 interacts with drugs including Monastrol and Dimethylenastron, which inhibit the kinesin family member involved in mitosis. These networks highlight critical drug-gene interactions, emphasizing their significance in developing targeted cancer therapies as shown in Fig. 6, and detailed functionalities are described in table 1.
Table 1. Drug–gene interaction and analysis
Gene
|
Drug
|
FDA
|
Description
|
Interaction Score
|
CDK1
|
CLOTRIMAZOLE
|
Approved
|
It is used to treat oral thrush, diaper rash, tinea versicolor, vaginal yeast infections, and various forms of ringworm, such as jock itch and athlete's foot.
|
0.4
|
AURA
|
PACLITAXEL
|
Approved
|
A chemotherapeutic drug such as Paclitaxel is used to treat esophageal cancer, lung cancer, ovarian cancer, Kaposi's sarcoma, breast cancer, cervical cancer, and pancreatic cancer. It has several marketed names including Taxol.
|
0.11
|
CCNB1
|
SELICICLIB
|
In trail
|
Different ovarian cancer cell lines can undergo cell cycle and apoptosis arrest when employed seliciclib. In vitro experiments has shown that seliciclib efficiently inhibited CDK activity in ovarian cancer cells, resulting in increased apoptotic cell death and decreased viability.
|
2.5
|
TOP2A
|
DOXORUBICIN HYDROCHLORIDE
|
Approved
|
The FDA has approved the prescription medication doxorubicin hydrochloride (liposomal), an antineoplastic, for the treatment of multiple myeloma, ovarian cancer, and Kaposi sarcoma associated with AIDS
|
0.03
|
KIF11
|
LITRONESIB
|
Approved
|
Trials examining the therapy of solid tumors, ovarian cancer, gastric cancer, prostate cancer, and acute leukemia have all involved intrathenesib.
|
6.56
|
BUB1
|
OMEPRAZOLE
|
Approved
|
Medicine used to treat Zollinger-Ellison syndrome, peptic ulcer disease, and gastroesophageal reflux disease.
|
0.17
|
BIRC5
|
ARSENIC TRIOXIDE
|
Approved
|
The IC50 values of arsenic trioxide (As2O3) and tetraarsenic oxide (As4O6) for each cell line showed a dose-dependent substantial inhibition of the cell proliferation of ovarian cancer cell lines.
|
0.06
|
Identification of differentially expressed microRNAs associated with GYN cancer
For the identification of miRNAs involved in gynecological cancer and their association with various genes, particularly with identified hub genes, the same GEO datasets, GSE54388, GSE69428, and GSE36668, were utilized. A total of 30 miRNAs were retrieved to be associated with GSE54388, GSE69428, and GSE36668 from ten hub genes, which were further categorized into downregulated and upregulated miRNAs based on the log Fc and p-value. The expression levels of these 30 upregulated and downregulated miRNAs are shown in Table 2 and in Tables S3 and S4. miRNAs with more than 80% target scores, such as hsa-miR-190a-3p, had a target score of 98, hsa-miR-670-3p had a target score of 97, and the minimum target score was hsa-miR-3922-5p 85. By binding to their corresponding sites, miRNAs regulate target genes and silence the posttranscriptional modification that follows this binding [40].
All the miRNAs were more or less associated with the hub genes CDK1, AURKA, BUB1B, CCNB1, TOP2A, KIF11, BUB1, CCNB2, CDCA8, and BIRC5. The most important element to be noted was serine/threonine protein kinase, which was associated with most hub genes, followed by those hub genes linked with miRNAs. In a previous study, bioinformatics analysis revealed that each miRNA can regulate several hundred gene targets and participate in various gene signalling pathways [41]. Consequently, miRNAs influence numerous biological functions, including apoptosis, differentiation, and cell proliferation [42]. Altered miRNA expression can significantly impact cellular functional activity. Genomic research on human cancer has shown variability in miRNA expression [43]. Extensive diagnostic and prognostic biological information is linked to miRNA expression. Thus, miRNA expression could serve as a predictor of cervical cancer prognosis, as imbalances in miRNA expression are common across all tumor types [44]. As a result, these expression characteristics may be useful for potential value in diagnosis and prognosis.
Interactions between different microRNAs (miRNAs, indicated by purple squares) and genes (indicated by red circles). The lines represent interactions or regulatory relationships. The orange circles indicate specific genes of interest or key nodes in the network. The density and connections suggest a complex regulatory network, possibly involved in a specific biological process or disease context. The figure shows a complex network of miRNA-gene interactions. Subnetwork 1 comprises 379 nodes, 420 edges, and 10 seed nodes. The nodes include miRNAs (purple squares) and genes (red circles), with interactions depicted by connecting lines, as shown in Fig. S1. The dataset features a detailed table of 30 microRNAs (miRNAs) and their corresponding target genes, showcasing critical parameters such as the target rank, target score, miRNA, gene symbol, and gene description. Each entry highlights a specific miRNA and its target, emphasizing their role in gene regulation.
The target rank and score indicate the efficacy of miRNA-gene interactions, while the gene symbol and description provide insights into the gene's function. This information is vital for understanding the regulatory networks involving miRNAs, which play crucial roles in gene expression and cellular processes. Consequently, miRNAs are an invaluable resource for researchers exploring the impacts of miRNAs on biological functions and disease mechanisms, as described in Table 2.
Table 2. The top 30 miRNAs associated with hub genes
Target Rank
|
Target Score
|
miRNA
|
Gene Symbol
|
Gene Description
|
1
|
96
|
hsa-miR-5011-5p
|
CDK1
|
cyclin-dependent kinase 1
|
2
|
95
|
hsa-miR-3913-3p
|
CDK1
|
cyclin-dependent kinase 1
|
3
|
94
|
hsa-miR-4760-3p
|
CDK1
|
cyclin-dependent kinase 1
|
4
|
97
|
hsa-miR-3941
|
AURKA
|
aurora kinase A
|
5
|
95
|
hsa-miR-6871-5p
|
AURKA
|
aurora kinase A
|
6
|
91
|
hsa-miR-1-5p
|
AURKA
|
aurora kinase A
|
7
|
87
|
hsa-miR-524-5p
|
BUB1B
|
BUB1 mitotic checkpoint serine/threonine kinase B
|
8
|
87
|
hsa-miR-520d-5p
|
BUB1B
|
BUB1 mitotic checkpoint serine/threonine kinase B
|
9
|
85
|
hsa-miR-3922-5p
|
BUB1B
|
BUB1 mitotic checkpoint serine/threonine kinase B
|
10
|
98
|
hsa-miR-548n
|
CCNB1
|
cyclin B1
|
11
|
96
|
hsa-miR-559
|
CCNB1
|
cyclin B1
|
12
|
95
|
hsa-miR-548ar-5p
|
CCNB1
|
cyclin B1
|
13
|
92
|
hsa-miR-8077
|
TOP2A
|
DNA topoisomerase II alpha
|
14
|
91
|
hsa-miR-4641
|
TOP2A
|
DNA topoisomerase II alpha
|
15
|
89
|
hsa-miR-4663
|
TOP2A
|
DNA topoisomerase II alpha
|
16
|
98
|
hsa-miR-190a-3p
|
KIF11
|
kinesin family member 11
|
17
|
97
|
hsa-miR-381-3p
|
KIF11
|
kinesin family member 11
|
18
|
97
|
hsa-miR-300
|
KIF11
|
kinesin family member 11
|
19
|
94
|
hsa-miR-5688
|
BUB1
|
BUB1 mitotic checkpoint serine/threonine kinase
|
20
|
93
|
hsa-miR-495-3p
|
BUB1
|
BUB1 mitotic checkpoint serine/threonine kinase
|
21
|
92
|
hsa-miR-653-5p
|
BUB1
|
BUB1 mitotic checkpoint serine/threonine kinase
|
22
|
97
|
hsa-miR-670-3p
|
CCNB2
|
cyclin B2
|
23
|
91
|
hsa-miR-4251
|
CCNB2
|
cyclin B2
|
24
|
72
|
hsa-miR-10525-3p
|
CCNB2
|
cyclin B2
|
25
|
92
|
hsa-miR-4518
|
CDCA8
|
cell division cycle associated 8
|
26
|
92
|
hsa-miR-1266-5p
|
CDCA8
|
cell division cycle associated with 8
|
27
|
91
|
hsa-miR-589-3p
|
CDCA8
|
cell division cycle associated with 8
|
28
|
93
|
hsa-miR-548t-3p
|
BIRC5
|
baculoviral IAP repeat containing 5
|
29
|
93
|
hsa-miR-548ap-3p
|
BIRC5
|
baculoviral IAP repeat containing 5
|
30
|
93
|
hsa-miR-548aa
|
BIRC5
|
baculoviral IAP repeat containing 5
|
miRNA and differential expression analysis
Using the dbDEMC, an integrated database a source of DEM in cancers collected by low-throughput or high-throughput sequencing technologies, identified top five up and top five downregulated genes. Conseqeunctly, using Cytoscape, a regulatory network of miRNAs and their target genes was built for both five up and downregulated miRNAs. The results revealed the top five upregulated miRNAs (hsa-miR-653-5p, hsa-miR-495-3p,hsa-miR-381-3p,hsa-miR-1266-5p,hsa-miR-589-3p) regulate approximately 229 common genes in the network, displaying interconnections between the genes and miRNAs. While the top five (hsa-miR-1-5p, hsa-miR-300, hsa-miR-3913-3p, hsa-miR-6871-5p, hsa-miR-3922-5p) downregulated miRNAs have no proper interconnection. These upregulated miRNAs show potential as biomarkers and therapeutic targets in gynecological cancers as shown in Table 3.
Studies revealed the emerging role of hsa-miR-653-5p, hsa-miR-495-3p, and hsa-miR-381-3p in various cancer diseases. Such as hsa-miR-653-5p has been connected to aggressive tumors and poor prognosis in human cancer. At the same time, it has an inhibitory effect on some other types of cancer, such as lung cancers, renal, liver, breast, and cervical [45] which is contrary to our study where upregulation of miR-653 showed aggressiveness in cervical cancer. The hsa-miR-495-3p is known to have inhibitory effects in the progression of various types of cancer such as colorectal cancer [46] while we noted that it is upregulated. In the case of miR-381-3p it is known to inhibit breast cancer progression [47] while in our study it is upregulated. Overall we noted that these miRNAs which are associated with differentially expressed hub genes seem to be potential diagnostic markers and therapeutic studies.
Table 3. Expression level of ten shortlisted miRNA
miRNA ID
|
Source ID
|
Cancer Type
|
Cancer Subtype
|
Design
|
LogFC
|
Expression Status
|
Expression ID
|
hsa-miR-653-5p
|
GSE106817
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
0.76
|
Up
|
EXP00527
|
hsa-miR-495-3p
|
GSE106817
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
0.6
|
Up
|
EXP00528
|
hsa-miR-381-3p
|
GSE106817
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
1.05
|
Up
|
EXP00528
|
hsa-miR-1266-5p
|
GSE31801
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
0.05
|
Up
|
EXP00327
|
hsa-miR-589-3p
|
GSE113486
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
1.38
|
Up
|
EXP00537
|
hsa-miR-1-5p
|
DRP001085
|
breast cancer
|
bladder cancer
|
cancer vs normal
|
0
|
Down
|
EXP00706
|
hsa-miR-300
|
GSE5244
|
uterus cancer
|
uterus cancer
|
cancer vs normal
|
-0.46
|
Down
|
EXP00030
|
hsa-miR-3913-3p
|
GSE106817
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
-0.91
|
Down
|
EXP00528
|
hsa-miR-6871-5p
|
GSE106817
|
ovarian cancer
|
ovarian cancer
|
Blood
|
-1.85
|
Down
|
EXP00528
|
hsa-miR-3922-5p
|
GSE113486
|
ovarian cancer
|
GYN/ovarian cancer
|
Blood
|
-1.49
|
Down
|
EXP00537
|
Pathway enrichment and gene ontology analysis of miRNAs and hub genes
The miRNAs (upregulated and downregulated) were analysed for pathway analysis as well as their interaction with previously identified hub genes (CDK1, AURKA, BUB1B, CCNB1, TOP2A, KIF11, BUB1, CCNB2, CDCA8, and BIRC5). Pathway enrichment analysis revealed the enrichment analysis results (Fig. S2 A and B). Identifying miRNAs that can alter the expression of hub genes, which are essential for controlling various types of cancer, particularly genealogical cancer, is critical. The DIANA-miRPath program helped us identify miRNAs that affect hub genes. Several of them have been previously mentioned in the literature as controlling the emergence of different forms of cancer.
The functional enrichment of the top upregulated and downregulated DEMs. The heatmap shows the enrichment of biological processes for five upregulated miRNAs, with high enrichment (red) observed in processes such as "Fc-gamma receptor signalling pathway" for hsa-miR-653-5p, "positive regulation of transcription, DNA-templated" for hsa-miR-381-3p, and "Toll-like receptor signalling pathway" for hsa-miR-495-3p. The log (p-value) indicates the significance of enrichment, with more negative values (darker red) representing greater enrichment (Fig. S3 A). The enrichment of the cellular components "organelle" in hsa-miR-653-5p, "protein complex" in hsa-miR-589-3p, and "nucleoplasm" in hsa-miR-381-3p is shown in Fig. S3 (B). High enrichment (red) of molecular functions was observed for "nucleic acid binding transcription factor activity" in hsa-miR-653-5p, "protein binding transcription factor activity" in hsa-miR-381-3p, and "RNA binding" in hsa-miR-495-3p (Fig. S3 C).
The biological processes for downregulated miRNAs with the highest enrichment (most significant) were related to transcription, response to stress, signalling pathways, and catabolic processes, indicating strong associations of these miRNAs with these functions (Fig. S4 A). The cellular components with the highest enrichment (most significant) were organelles, protein complexes, and nucleoplasm, suggesting strong associations of these miRNAs with these cellular structures (Fig. S4 B). The molecular functions with the greatest enrichment (most significant) included nucleic acid binding transcription factor activity, protein binding transcription factor activity, and ion binding, highlighting the strong associations of these miRNAs with these molecular activities (Fig. S4 C).
Network of miRNA and their target genes
Network of microRNAs (miRNAs) and their target genes visualized using Cytoscape. In the network, yellow nodes represent miRNAs, while red nodes represent the target genes regulated by these miRNAs. Edges (lines) connecting the nodes indicate regulatory relationships between miRNAs and their target genes. This visualization helps in understanding the complex regulatory mechanisms of miRNAs and identifying potential key miRNAs that play crucial roles in gene regulation as shown in Fig. 7. The exact relationship between the hub gene and miRNA showed that the four genes CDK1, AURKA, CCNB1, BUB1 and BIRC5 target more miRNAs than the other hub genes, BUB1B, TOP2A, KIF11, CCNB2 and CDCAB. Among these, the highest interaction was related to BIRC5 (Grade=80), and the lowest interaction was related to BUB1B (grade=4). Additionally, hsa-miR-653-5p, hsa-miR-495-3p,hsa-miR-381-3p, hsa-miR-1266-5p, and hsa-miR-589-3p were the top five interactive miRNAs, respectively, that targeted the most hub genes. Recently, various studies specified that miR-653 is intricate in many biological processes such as cancer by differentially expressing in various types of cancer [48, 49]. The miR-495-3p reported to inhibit the colorectal cancer progression by HMGB1 downregulation [50].
A suffiecnt data of literature has demonstrated the correlation between dysregulated miR-589-3p and a variety of diseases inclduing cancer [51]. While the hsa-miR-1266-5p is not well reported in litertaure but on various databses, it is know tomplay role pericardium cancer and pericardial mesothelioma. So overall, the role of these miRNA in antioncogenic and prooncogenic functions, in various cancer types reveal that n combination of hub genes these miRNAs could be a potential targets for the diagnosis and treatment of gynlogcical cancer. While miR-495-3p which is not well studied in cancer and seems to be a novel miRNA identified in our transcriptomics data analysis and may become a potential therapeutic target for gynological cancer.
Survival analysis of upregulated miRNAs
Kaplan‒Meier survival curves for patients with high and low expression levels of hsa-miR-653-5p across different cancer stages (I-IV). The p values indicate significant differences in survival probabilities between high (p = 0.0093) and low (p = 0.0018) expression groups. Kaplan‒Meier survival curves for patients with high and low expression levels of hsa-miR-495-3p across different cancer stages (I-IV). The p values indicate significant differences in survival probabilities between high (p = 0.00016) and low (p = 0.2) expression groups. Kaplan‒Meier survival curves for patients with high and low expression levels of hsa-miR-495-3p across different cancer stages (I-IV). The p values indicate significant differences in survival probabilities between high (p = 0.0011) and low (p =0.007) expression groups. Kaplan‒Meier survival curves for patients with high and low expression levels of hsa-miR-495-3p across different cancer stages (I-IV). The p values indicate significant differences in survival probabilities between high (p = 0.025) and low (p =0.0011) expression groups. Kaplan‒Meier survival curves for patients with high and low expression levels of hsa-miR-495-3p across different cancer stages (I-IV). The p values indicate significant differences in survival probabilities between high (p = 0.015) and low (p < 0.0001) expression groups as shown in Fig 8.
To confirm that indeed all four miRNAs as well as hub genes are essential for classification into good or poor prognoses, we performed the survival analysis and results showed that statistically overall survival analysis were associated with good survival rate, confirming that all four miRNAs could be potential biomarkers.
TF-miRNA Coregulatory Network
The transcription factor (TF)-miRNA coregulatory network highlights the complex interactions between transcription factors (shown in red circles) and microRNAs (miRNAs, shown in blue squares and green diamonds). Key nodes such as TP53, CDK1, and AURKA are central regulatory hubs. The network illustrates how TFs and miRNAs coregulate target genes, emphasizing their coordinated role in gene expression regulation. This kind of network is crucial for understanding the regulatory mechanisms involved in various biological processes and diseases, as shown in Fig. 9.