Screening of Differentially Expressed mRNAs and miRNAs
We extracted a total of 7892 mRNAs and 856 miRNAs from the expression profile of urinary bladder urothelial carcinoma of TCGA. Through R language software, we used the edge R package to set the P value to less than 0.05 and log2FC > 1 as differentially expressed mRNA and miRNA. A total of 1111 differentially expressed genes (including 545 up-regulated genes and 566 down-regulated genes) and 473 differentially-expressed miRNAs were screened. (including 380 up-regulated genes and 93 down-regulated genes)). Visualized heatmap and volcano map were drawn by R software R package (Fig. 1)
We randomly divided the expression profile of the entire group with miRNA mature bodies (N = 408) into two groups, the train group (N = 204) and the test group (N = 204). The results showed that in the train group, we found 37 miRNAs related to the overall survival rate of bladder urothelial carcinoma patients based on the screening conditions (p < 0.05). For the reliability of the model, we improved the screening conditions (P < 0.01, log2FC > 1.5) and further screened 7 miRNAs by univariate and multivariate Cox regression analysis. We selected these seven miRNAs for further analysis (Table 2). The Kaplan-Meier method indicated that seven miRNAs include hsa-miR-33b-5p, hsa-miR-33a-5p, hsa-miR-590-3p, hsa-miR-576-5p, hsa-miR-200b-3p, hsa-miR-186-3p and hsa-miR-3934-5p, which are related to the overall survival of patients (p value < 0.05; Fig. 2).
Table 2
Univariate and multivariate Cox regression of differentially expressed miRNAs
miRNA | Univariate Cox regression | Multivariate Cox regression |
HR(95%CI) | P | HR(95%CI) | P |
hsa-miR-33b-5p` | 1.246(1.045ཞ1.486) | 0.014 | 1.279(0.997ཞ1.641) | 0.053 |
hsa-miR-33a-5p | 1.216(1.005ཞ1.473) | 0.045 | 1.196(0.934ཞ1.531) | 0.156 |
hsa-miR-590-3p` | 0.699(0.526ཞ0.928) | 0.013 | 0.614(0.442ཞ0.853) | 0.004 |
hsa-miR-576-5p` | 0.556(0.413ཞ0.749) | 0.000 | 0.644(0.477ཞ0.869) | 0.004 |
hsa-miR-200b-3p` | 0.876(0.785ཞ0.976) | 0.017 | 0.775(0.691ཞ0.870) | 0.000 |
hsa-miR-186-3p` | 1.440(1.088ཞ1.906) | 0.011 | 1.384(1.036ཞ1.850) | 0.028 |
hsa-miR-3934-5p` | 0.729(0.587ཞ0.905) | 0.004 | 0.812(0.643ཞ1.025) | 0.080 |
These 7 miRNAs make up the prognostic model, including hsa-miR-33b-5p, hsa-miR-33a-5p, hsa-miR-590-3p, hsa-miR-576-5p, hsa-miR-200b-3p, hsa-miR-186-3p and hsa -miR-3934-5p. Riskscore = (0.246 * hsa-miR-33b-5p` expression level) + (0.179 * hsa-miR-33a-5p` expression level) + (0.440 * hsa-miR-590-3p` expression level) + (0.641 * `hsa-miR-576-5p` expression level) + (0.254 * hsa-miR-200b-3p` expression level) + ( 0.325 * `hsa-miR-186-3p` expression level) + (0.208 * hsa-miR-3934-5p expression level). By calculating the risk score of each patient, the patients were divided into high-risk group and low-risk group based on the median value.
Roc Curve And Km Survival Analysis
Based on the model, we used the R language programming survival package and survivalROC package to draw relevant graphs. From the figure, we can observe that the area under curve(AUC) of the train group, test group, and all sample groups were 0.830, 0.722, and 0.770 respectively, all of which were greater than 0.7, indicating that this prognostic model has sensitivity and specificity in predicting the survival of patients with bladder cancer. (Fig. 3D-F). At the same time, (Fig. 3A-C) it can be seen that there was a significant difference between the high and low risk groups in the train group, the test group, and all sample groups (P < 0.05), and the survival rate decreased as the risk value increased. The train group showed that the 5-year overall survival rates for the low- and high-risk groups were 32.0% and 70.8%, respectively. The test team demonstrated that the 5-year survival rates for the overall high-risk and low-risk groups were 34.3% and 62.7%, respectively. The entire group showed that the 5-year overall survival rate of the high-risk and low-risk groups were 36% and 56.9%. We could observe that (Fig. 3H-I) it was the visualization results of the sample survival status of the train group, test group and all sample groups. From the figure, we could find that as the risk value increases, the number of death also increases. This evidence suggests that the model was sensitive and specific in predicting the survival of bladder cancer.
Independence Of Clinicopathological Factors
Univariate Cox regression analysis showed that a model constructed from 7 miRNAs was significantly related to overall patient survival (hazard ratio HR = 1.592, confidence interval 95% CI = 1.365–1.856, p = 3.19E-09; Table 3). Multivariate Cox regression analysis indicated that the comprehensive consideration of other commonly-seen clinical factors (HR = 1.605, 95% CI = 1.357–1.898, p = 3.28E-08), and other clinical factors including clinical stage, T stage, lymph node status, and distant metastasis, which still has nothing to do with overall survival rate. In addition, the patient's gender and age make it possible to become a prognostic indicator of bladder cancer in the future. At the same time, the patient's age also serves as an independent prognostic factor (HR = 1.030, 95% CI = 1.002–1.059, p = 0.037). The risk score of the model and other clinical characteristics ROC curves indicate that the risk score (0.698), clinical stage (0.637), T stage (0.623), lymph node status (0.631), and distant metastasis (0.523). These results indicate that this model still has good predictive ability (Fig. 4). In this regard, we can observe that risk score can be adopted as an independent prognostic factor for predicting the survival of bladder cancer patients.
Table 3
Univariate and multivariate Cox regression of clinical features
Clinical features | Univariate Cox regression | Multivariate Cox regression |
HR(95%CI) | P | HR(95%CI) | P |
Age (continuous variable) | 1.030(1.002ཞ1.059) | 0.037 | 1.035(1.005ཞ1.066) | 0.024 |
Gender (male vs female) | 0.618(0.353ཞ1.081) | 0.091 | 0.551(0.311ཞ0.975) | 0.041 |
Clinical stage (I + II vs III + IV)) | 1.742(1.216ཞ2.496) | 0.002 | 1.166(0.573ཞ2.371) | 0.672 |
T (T1 + 2 vs T3 + 4) N (N0 vs N1 + 2) | 1.697(1.146ཞ2.513) 2.162(0.778ཞ6.011) | 0.008 0.139 | 1.260(0.715ཞ2.219) 1.687(0.534ཞ5.328) | 0.424 0.373 |
M (M0 vs M1) | 1.541(1.169ཞ2.031) | 0.002 | 1.195(0.732ཞ1.952) | 0.467 |
Seven-miRNA signature | 1.592(1.365ཞ1.856) | < 0.001 | 1.605(1.357ཞ1.898) | < 0.001 |
Prediction And Visualization Of 7 Mirna Target Genes
The seven miRNAs we selected were used to predict target genes from three databases (miRDB, miRTarBase, and Targetscan). We used the R software VennDiagram package to draw a visualized Wayne diagram to find reliable targets. (Fig. 5) In order to improve the reliability of bioinformatics, we used targets that were supported by all 3 databases, which could be used as targets for miRNAs. From: 4 we can observe that we have screened out 7 miRNA including hsa-miR-33b-5p, hsa-miR-33a-5p, hsa-miR-590-3p, hsa-miR-576-5p, hsa-miR-200b-3p, hsa-miR-186-3p, and hsa-miR-3934-5p,together with 17, 33, 169, 31, 74, 49 of hsa, and 22 overlapping bases. Seven miRNAs were predicted to have a total of 395 target genes.
Construction Of Mirna Regulatory Network
We can see from the regulatory network diagram that in order to explore whether these miRNA target genes may be involved in the progression of bladder urothelial carcinoma, 1111 differentially expressed genes obtained above (among which 545 genes were up-regulated and 566 genes were down-regulated) and 473 differentially expressed miRNAs (380 differentially expressed miRNAs up-regulated and 93 down-regulated) were used for analysis.7 up-regulated miRNAs (hsa-miR-33b-5p, hsa-miR-33a-5p, hsa-miR- 590-3p, hsa-miR-576-5p, hsa-miR-200b-3p, hsa-miR-186-3p and hsa-miR-3934-5p) and 247 down-regulated mRNAs. (The regulatory relationship between MiRNA and its target genes was shown in Fig. 6.)
Functional Enrichment Analysis of Related Target Genes in Bladder Urothelial Carcinoma
The top 15 terms in GO results are: Biological Processes (BP), Cellular Components (CC), and Molecular Function (MF) were demonstrated in dot plots (Fig. 7A-C). Among these three categories, BP analysis mainly includes cell junction, adhesion, migration, protease phosphorylation, regulation of enzyme activity and smooth muscle cell proliferation and differentiation. CC analysis mainly includes adhesion plaques, actin, extracellular matrix, fibrotic proteins, etc. MF analysis mainly includes metal ion transmembrane transporter activity, transcription activator activity, DNA binding, and ion channel binding. The result of the KEGG pathway of target genes associated with bladder urothelial carcinoma was 18 (Table 4), of which counts > 10 were mainly enriched in the MAPK signaling pathway, cGMP-PKG signaling pathway, the pi3k signaling pathway, the TGF-β signaling pathway, the interaction of miRNAs in tumors, and the regulation of inflammatory cell interaction. In addition, in order to provide a complex relationship between the target gene and the relative KEGG pathway, we adopted the R language software package to make a visual network diagram of the pathways (Fig. 7).
Table 4
KEGG pathways of target genes associated Bca
ID | Description | p.adjust | qvalue | | Count | geneID |
hsa04510 | Focal adhesion | 1.02E-05 | 8.02E-06 | | 14 | RAP1A/TLN1/LAMC1/VCL/ACTN1/ROCK2/PARVA/FLNC/ROCK1/MYLK/CAV1/CCND2/THBS1/PPP1CB |
hsa05205 | Proteoglycans in cancer | 0.000337935 | 0.000264643 | | 12 | MYC/ROCK2/PRKACB/DCN/FLNC/SDC2/MMP2/ROCK1/CAV1/THBS1/ PPP1CB/CDKN1A |
hsa04611 | Platelet activation | 0.00077537 | 0.000607205 | | 9 | RAP1A/PTGS1/TLN1/ROCK2/GNAQ/PRKACB/ROCK1/MYLK/PPP1CB |
hsa04022 | cGMP-PKG signaling pathway | 0.006120072 | 0.004792734 | | 9 | PLN/SRF/ROCK2/MEF2A/GNAQ/MEF2D/ROCK1/MYLK/ PPP1CB |
hsa05206 | MicroRNAs in cancer | 0.077452232 | 0.060654176 | | 9 | MYC/CYP1B1/SERPINB5/MCL1/ROCK1/BCL2L2/CCND2/ THBS1/ CDKN1A |
hsa04921 | Oxytocin signaling pathway | 0.014623413 | 0.011451847 | | 8 | ROCK2/GNAQ/PRKACB/ROCK1/MYLK/PPP1CB/RCAN1/ CDKN1A |
hsa04218 | Cellular senescence | 0.016462393 | 0.012891984 | | 8 | HIPK3/FOXO1/MYC/GADD45B/ZFP36L1/CCND2/PPP1CB/ CDKN1A |
hsa04810 | Regulation of actin cytoskeleton | 0.047814395 | 0.037444275 | | 8 | GSN/VCL/ACTN1/ROCK2/ENAH/ROCK1/MYLK/PPP1CB |
hsa05163 | Human cytomegalovirus infection | 0.06200175 | 0.048554637 | | 8 | MYC/CX3CL1/ROCK2/PTGER4/GNAQ/PRKACB/ROCK1/ CDKN1A |
hsa04010 | MAPK signaling pathway | 0.13149261 | 0.102974126 | | 8 | RAP1A/MYC/DUSP3/SRF/PRKACB/GADD45B/FLNC/DUSP1 |
hsa05165 | Human papillomavirus infection | 0.176682825 | 0.138363361 | | 8 | FOXO1/LAMC1/PTGER4/PRKACB/PPP2R5A/CCND2/THBS1/CDKN1A |
hsa04151 | PI3K-Akt signaling pathway | 0.195202228 | 0.152866225 | | 8 | LAMC1/MYC/PPP2R5A/SGK1/MCL1/CCND2/THBS1/CDKN1A |
hsa04068 | FoxO signaling pathway | 0.017708049 | 0.013867478 | | 7 | FOXO1/FBXO32/GADD45B/SGK1/SOD2/CCND2/CDKN1A |
hsa04270 | Vascular smooth muscle contraction | 0.017708049 | 0.013867478 | | 7 | CALD1/ROCK2/GNAQ/PRKACB/ROCK1/MYLK/PPP1CB |
hsa05203 | Viral carcinogenesis | 0.077452232 | 0.060654176 | | 7 | GSN/SRF/ACTN1/PRKACB/IL6ST/CCND2/CDKN1A |
hsa05131 | Shigellosis | 0.13149261 | 0.102974126 | | 7 | FOXO1/TLN1/VCL/ACTN1/ROCK2/BCL10/ROCK1 |
hsa04350 | TGF-beta signaling pathway | 0.017708049 | 0.013867478 | | 6 | SMAD7/MYC/ID4/DCN/ROCK1/THBS1 |
hsa04670 | Leukocyte transendothelial migration | 0.030279325 | 0.023712261 | | 6 | RAP1A/VCL/ACTN1/ROCK2/MMP2/ROCK1 |
hsa04261 | Adrenergic signaling in cardiomyocytes | 0.077452232 | 0.060654176 | | 6 | PLN/ACTC1/GNAQ/PRKACB/PPP2R5A/PPP1CB |
hsa04390 | Hippo signaling pathway | 0.077452232 | 0.060654176 | | 6 | SMAD7/MYC/TEAD1/WWTR1/CCND2/PPP1CB |
hsa04934 | Cushing syndrome | 0.077452232 | 0.060654176 | | 6 | RAP1A/GNAQ/PRKACB/PBX1/RASD1/CDKN1A |
hsa04530 | Tight junction | 0.102825347 | 0.080524299 | | 6 | RAP1A/ACTN1/AMOTL2/ROCK2/PRKACB/ROCK1 |
hsa05202 | Transcriptional misregulation in cancer | 0.13149261 | 0.102974126 | | 6 | FOXO1/MYC/GADD45B/PBX1/CCND2/CDKN1A |
hsa04062 | Chemokine signaling pathway | 0.13149261 | 0.102974126 | | 6 | RAP1A/CX3CL1/ROCK2/CXCL13/PRKACB/ROCK1 |
hsa05132 | Salmonella infection | 0.176682825 | 0.138363361 | | 6 | SNX9/MYC/RHOB/ROCK2/FYCO1/FLNC |
hsa04024 | cAMP signaling pathway | 0.176682825 | 0.138363361 | | 6 | RAP1A/PLN/ROCK2/PRKACB/ROCK1/PPP1CB |
hsa05166 | Human T-cell leukemia virus 1 infection | 0.178922574 | 0.140117348 | | 6 | TLN1/MYC/SRF/PRKACB/CCND2/CDKN1A |
hsa04115 | p53 signaling pathway | 0.025719742 | 0.020141573 | | 5 | GADD45B/SERPINB5/CCND2/THBS1/CDKN1A |
Construction Of Protein Interaction Network
We used the plug-in cytoHubba calculation in the cytoscape software to sort the network nodes in descending order based on the degree value, select the top ten genes as the central genes, and construct the central network map. The results indicate that 136 of the 247 target genes were filtered to the target genes for the PPI network complex, the top ten genes based on the degree value were (MYC, VCL, MMP2, ITPKB, FOXO1, SMAD7, CDKN1A, SRF, CAV1, KLF4) .(The specific genetic information were shown in Fig. 8 and Table 5)
Table 5
Identification of hub genes by cytoHubba
Node- name | MCC | DMNC | MNC | Degree | EPC | Bottle Neck | Ec Centricity | Closeness | Radiality | Betweenness | Stress | Clustering Coefficient |
MYC | 471 | 0.26055 | 25 | 30 | 58.693 | 22 | 0.18963 | 73.2 | 6.67437 | 4910.49147 | 15504 | 0.14253 |
VCL | 223 | 0.21623 | 26 | 27 | 58.181 | 28 | 0.15802 | 68.45 | 6.48026 | 3446.04512 | 12240 | 0.1567 |
MMP2 | 151 | 0.32307 | 16 | 19 | 57.795 | 11 | 0.18963 | 64.2 | 6.41307 | 1972.27219 | 7728 | 0.21053 |
ITPKB | 224 | 0.3491 | 14 | 16 | 56.642 | 29 | 0.18963 | 63.2 | 6.42053 | 1311.8855 | 5756 | 0.25833 |
FOXO1 | 296 | 0.40723 | 11 | 15 | 55.274 | 9 | 0.23704 | 59.16667 | 6.24882 | 1676.85358 | 6838 | 0.22857 |
SMAD7 | 84 | 0.3791 | 10 | 14 | 56.425 | 12 | 0.18963 | 60.28333 | 6.30108 | 1504.76416 | 6392 | 0.21978 |
CDKN1A | 348 | 0.42441 | 12 | 14 | 55.379 | 6 | 0.23704 | 59.91667 | 6.29361 | 1257.4955 | 4962 | 0.31868 |
SRF | 80 | 0.35915 | 10 | 12 | 55.853 | 5 | 0.18963 | 60.03333 | 6.33094 | 1118.09772 | 4770 | 0.27273 |
CAV1 | 53 | 0.358 | 9 | 12 | 54.393 | 7 | 0.15802 | 59.28333 | 6.27122 | 1227.38966 | 4312 | 0.22727 |
KLF4 | 180 | 0.39905 | 10 | 12 | 53.532 | 4 | 0.18963 | 56.28333 | 6.15176 | 648.29204 | 2142 | 0.30303 |
Survival Analysis
We mapped the K-M survival curve based on 247 target genes by using the R language package, and we obtained 40 genes related to survival prognosis (ACCTC1, AMOTL2, CALD1, CAV1, CLIC4, CPE, CSRP1, CXCL13, DCN, DKK3, DPYSL2, DUSP3, FAM168B, FILIP1L, FLNC, FSTL1, GSN, IL33, KLHL21, LAMC1, LIX1L, LRP1, MEF2A, MFAP4, MSRB3, MYC, MYLK, NFIC, PTGER4, RASD1, SBDS, SGCB, SLC39A14, SYNM, SYNP02, TIMP2, TMPRSS11E, VCL, WWTR1, TRIB1). (Fig. 9) From the graph of these genes, we can find that obvious differences occur in the high and low expression groups (p < 0.05) were of statistical significance.