Differential expression analysis
According to the RefSeq ID in the downloaded data, a total of 15,885 mRNAs, 1,241 lncRNAs and 1,046 miRNAs were annotated. Under the threshold of FDR < 0.05 and |log2FC| > 1, 29 downregulated and 590 upregulated mRNAs; 14 downregulated and 199 upregulated lncRNAs; 54 downregulated and 29 upregulated miRNAs were identified to be differentially expressed in the HCV-HCC tissues compared with those of the normal controls. Volcano plots were used to display the distribution of the DELs-DEGs and DEMs (Fig. 1A). The heat map showed the expression of these DELs, DEGs and DEMs can clearly distinguish HCV-HCC from normal tissues (Fig. 1B).
Identification Of Lncrna Signature With Prognostic Value
Univariate Cox regression analysis identified 214 differentially expressed RNAs that were significantly related with OS in the training set of HCV-HCC patients, including 154 DEGs, 20 DELs and 40 DEMs. These 20 candidate DELs were then entered into multivariate Cox regression, 6 of which (SLC16A1-AS1, ZFPM2-AS1, JARID2-AS1, LINC01426, USP3-AS1 and LYPLAL1-AS1) were shown to be independent predictive biomarkers. Thus, they may be potential signature lncRNAs that can optimally predict the OS of patients with HCV-HCC (Table 2). Subsequently, a risk score system of this 6-lncRNA signature was established for each patient according to the following formula: risk score = (2.06022) × ExpSLC16A1−AS1 + (-4.60301) × ExpZFPM2−AS1 + (-4.24416) × ExpJARID2−AS1 + (0.73659) × ExpLINC01426 + (1.93958) × ExpUSP3−AS1 + (-3.73102) × ExpLYPLAL1−AS1.
Table 2
Prognostic RNAs significantly associated with OS
| ID | logFC | FDR | β | HR | 95%CI | P-value |
lncRNA | SLC16A1-AS1 | 1.65 | 2.23E-10 | 2.06 | 7.85 | 1.65–10.39 | 4.85E-03 |
| ZFPM2-AS1 | 3.74 | 1.03E-04 | -4.60 | 0.10 | 0.02–0.62 | 1.42E-02 |
| JARID2-AS1 | 1.09 | 2.44E-02 | -4.24 | 0.14 | 0.03–0.65 | 1.44E-02 |
| LINC01426 | 3.07 | 4.50E-07 | 0.74 | 2.09 | 1.39–4.65 | 3.55E-02 |
| USP3-AS1 | 1.19 | 1.74E-12 | 1.94 | 6.96 | 1.77–12.88 | 4.21E-02 |
| LYPLAL1-AS1 | 1.56 | 1.67E-02 | -3.73 | 0.24 | 0.03–0.90 | 4.73E-02 |
mRNA | AURKA | 2.01 | 4.21E-18 | | 1.56 | | 2.70E-02 |
| CCNB1 | 2.24 | 5.11E-15 | | 1.51 | | 7.00E-03 |
| MSH2 | 1.25 | 5.11E-15 | | 1.76 | | 4.80E-02 |
| SFN | 2.14 | 4.47E-09 | | 1.39 | | 1.05 E-03 |
| TSSK6 | 1.66 | 9.39E-11 | | 0.37 | | 2.50E-02 |
| TMPRSS9 | 1.03 | 1.02E-08 | | 0.53 | | 1.80E-02 |
miRNA | hsa-miR-383 | -2.18 | 2.48E-12 | | 0.84 | | 4.10E-02 |
Multivariate analysis results for lncRNAs and univariate analysis results for miRNA and mRNAs. FDR, False discovery rate; HR, hazard ratio; CI, confidence interval. |
Using the median risk score as the cut-off value, the patients were assigned into a high-risk group (n = 26) and a low-risk group (n = 25). The Kaplan-Meier survival curve and log-rank test showed that OS was significantly worse in HCV-HCC patients of the high risk group than that in patients of the low risk group in the training (hazard ratios [HR] = 3.798, 95% confidence intervals [CI] = 1.545–9.332) (Fig. 2A), testing (HR = 2.688, 95% CI = 1.134–6.373) (Fig. 2B) and entire set (HR = 3.206, 95% CI = 1.723–5.966) (Fig. 2C). The AUC of the ROC curve was respectively 0.95, 0.885, and 0.907 for the training, testing and entire set (Fig. 2D). These findings suggested a stronger prediction potential of this 6-lncRNA prognostic model.
Independence Of 6-lncrna Signature For Survival Prediction
Univariate analysis results indicated that neoplasm histologic grade (G1/G2/G3/-), pathologic T (T1/T2/T3/T4/-), pathologic stage (stage I/II/III/IV/-) and risk score status were significant factors associated with prognosis. However, only pathologic stage (stage I/II/III/IV/-) and risk score status were still significant in the multivariate Cox regression analysis (Table 3). These findings further validated this 6-miRNA signature may be an independent prognostic factor. Also, the stratification analysis showed this 6-miRNA signature can further divide the patients in each stage into high and low risk groups which possessed significantly different OS ratio (Fig. 3), indicating our molecular risk score had higher predictive power for OS than the clinically used stage classification system.
Table 3
Univariate and multivariate analysis of lncRNA signature and survival in different patient sets
Variables | Univariate analysis | Multivariate analysis |
HR | 95%CI | P-value | HR | 95%CI | P-value |
Training set (N = 51) | | | | | | |
Age (years, mean ± SD) | 0.975 | 0.947–1.005 | 1.03E-01 | - | - | - |
Gender (male/female) | 1.086 | 0.425–2.772 | 8.64E-01 | - | - | - |
Neoplasm histologic grade (G1/G2/G3/-) | 0.965 | 0.481–1.940 | 9.22E-01 | - | - | - |
Pathologic M (M0/M1/-) | - | - | - | - | - | - |
Pathologic N (N0/N1/-) | 2.867 | 0.366–22.43 | 2.94E-01 | - | - | - |
Pathologic T (T1/T2/T3/T4/-) | 2.232 | 1.214–4.105 | 6.76E-03 | 0.516 | 0.175–1.523 | 2.31E-01 |
Pathologic stage (stage I/II/III/IV/-) | 2.542 | 1.333–4.846 | 4.61E-03 | 4.269 | 1.274–14.308 | 1.87E-02 |
Tumor recurrence (Yes/No/-) | 0.749 | 0.289–1.948 | 5.53E-01 | - | - | - |
Risk score status (high/low) | 3.798 | 1.545–9.332 | 1.86E-03 | 3.583 | 1.349–9.509 | 1.04E-02 |
Testing set (N = 51) | | | | | | |
Age(years, mean ± SD) | 1.005 | 0.976–1.036 | 7.34E-01 | - | - | - |
Gender (male/female) | 1.686 | 0.624–4.557 | 2.98E-01 | - | - | - |
Neoplasm histologic grade (G1/G2/G3/-) | 1.801 | 1.014–3.195 | 4.04E-02 | 1.21 | 0.659–2.221 | 5.38E-01 |
Pathologic M (M0/M1/-) | - | - | - | - | - | - |
Pathologic N (N0/N1/-) | - | - | - | - | - | - |
Pathologic T (T1/T2/T3/T4/-) | 2.065 | 1.308–3.258 | 1.12E-03 | 2.361 | 0.509–10.938 | 2.72E-01 |
Pathologic stage (stage I/II/III/IV/-) | 2.994 | 1.511–5.932 | 5.79E-04 | 2.579 | 1.279–5.196 | 8.05E-03 |
Tumor recurrence (Yes/No/-) | 1.784 | 0.736–4.324 | 1.94E-01 | - | - | - |
Risk score status (high/low) | 2.688 | 1.134–6.373 | 1.96E-02 | 2.939 | 1.119–7.721 | 2.87E-02 |
Entire set (N = 102) | | | | | | |
Age (years, mean ± SD) | 0.991 | 0.971–1.012 | 4.12E-01 | - | - | - |
Gender (male/female) | 1.278 | 0.715–2.283 | 4.07E-01 | - | - | - |
Neoplasm histologic grade G1/G2/G3/-) | 1.416 | 0.915–2.193 | 1.18E-01 | - | - | - |
Pathologic M (M0/M1/-) | - | - | - | - | - | - |
Pathologic N (N0/N1/-) | 2.548 | 0.342–19.01 | 3.44E-01 | - | - | - |
Pathologic T (T1/T2/T3/T4/-) | 2.124 | 1.486–3.035 | 1.85E-05 | 1.115 | 0.319–3.897 | 8.64E-01 |
Pathologic stage (stage I/II/III/IV/-) | 2.471 | 1.720–4.365 | 4.66E-06 | 2.507 | 1.600–3.930 | 6.11E-05 |
Tumor recurrence (Yes/No/-) | 1.175 | 0.615–2.247 | 6.25E-01 | - | - | - |
Risk score status (high/low) | 3.206 | 1.723–5.966 | 1.06E-04 | 3.326 | 1.683–6.574 | 5.47E-04 |
CI, confidence interval; HR, hazard ratio; TNM, tumor, node, metastasis. Bold indicates significance at p < 0.05. |
Functional Characteristics Of Prognostic Signature
In total, 10 downregulated prognosis-related DEMs were predicted to interact with 4 upregulated signature DELs (SLC16A1-AS1, USP3-AS1, LINC01426 and ZFPM2-AS1) in the DIANA-LncBase database; while 37 upregulated prognostic DEGs were predicted to interact with 6 downregulated DEMs (hsa-miR-133b, hsa-miR-33b, hsa-miR-383, hsa-miR-424, hsa-miR-153 and hsa-miR-511) by starBase database. These interaction pairs were integrated to construct an lncRNA-related ceRNA network (Fig. 4).
Under the threshold of PCC > 0.6, 5 upregulated signature DELs (SLC16A1-AS1, USP3-AS1, LINC01426, ZFPM2-AS1 and LYPLAL1-AS1) were discovered to be co-expressed with 133 prognostic DEGs, which were used to construct the co-expression network (Fig. 5).
The DEGs in these two networks were uploaded into the Enrichr database to predict their underlying functions. The results showed 18 GO biological process terms and 10 KEGG pathways were enriched (Table 4; Fig. 6). Four (MSH2, AURKA, SFN, CCNB1) of 14 DEGs (MAST2, SLC2A1, UBE2C, MSH2, ROBO1, DBF4, AURKA, EFNA3, CDCA8, SFN, TOP2A, ZWINT, RACGAP1 and CCNB1) that can be commonly regulated by lncRNAs in the co-expression and ceRNA networks were enriched into both GO terms and KEGG pathways, such as mitotic sister chromatid segregation (GO:0000070) (CCNB1), anaphase-promoting complex-dependent catabolic process (GO:0031145) (CCNB1, AURKA), mitotic spindle organization (GO:0007052) (CCNB1, AURKA), regulation of mitotic cell cycle phase transition (GO:1901990) (CCNB1, AURKA), DNA metabolic process (GO:0006259) (MSH2), cellular response to DNA damage stimulus (GO:0006974) (SFN), Cell cycle (CCNB1, SFN), Oocyte meiosis (CCNB1, AURKA), p53 signaling pathway (CCNB1, SFN) and Mismatch repair (MSH2). Accordingly, these four DEGs related ceRNA (USP3-AS1-hsa-miR-383-SFN) and co-expression (LINC01426/ SLC16A1-AS1-AURKA/SFN/CCNB1) relationship pairs may be especially important. In addition, ZFPM2-AS1, LYPLAL1-AS1 and JARID2-AS1 may be involved in HCC by co-expression with TSSK6, although the PCC of ZFPM2-AS1 and LYPLAL1-AS1 was < 0.6 and was not included in the co-expression network. TSSK6 was enriched in phosphorylation pathway. More important, the above crucial relationship pairs were selected because, as expected, the prognosis and expression trend between all these lncRNAs and mRNAs were consistent, but contrast between lncRNAs and miRNAs (Table 2).
Table 4
Function enrichment for genes regulated by lncRNAs
Categories | Term | FDR | Genes |
GO BP | sister chromatid segregation (GO:0000819) | 1.47E-05 | TOP2A; SPAG5; KIFC1; CDCA8; KNSTRN; ZWINT |
| mitotic sister chromatid segregation (GO:0000070) | 1.47E-05 | CCNB1; PSRC1; SPAG5; KIFC1; CDCA8; NCAPD2; KNSTRN; ZWINT |
| anaphase-promoting complex-dependent catabolic process (GO:0031145) | 1.30E-04 | CDC20; CCNB1; PTTG1; UBE2C; UBE2S; CDK1; AURKA |
| regulation of ubiquitin protein ligase activity (GO:1904666) | 4.94E-04 | CDC20; CCNB1; UBE2C; UBE2S; CDK1 |
| mitotic spindle organization (GO:0007052) | 1.02E-03 | TPX2; CCNB1; KIFC1; STMN1; BIRC5; AURKA |
| DNA replication (GO:0006260) | 1.12E-02 | RECQL4; DBF4; POLD1; CDK1; MCM6; RHNO1 |
| positive regulation of ubiquitin protein ligase activity (GO:1904668) | 1.12E-02 | CDC20; CCNB1; UBE2C; UBE2S; CDK1 |
| regulation of mitotic cell cycle phase transition (GO:1901990) | 1.12E-02 | CDC20; TPX2; CCNB1; UBE2C; CEP131; CDK1; AURKA |
| DNA metabolic process (GO:0006259) | 1.12E-02 | TOP2A; RECQL4; DBF4; MSH2; POLD1; CDK1; MCM6; TK1; RHNO1 |
| cellular response to DNA damage stimulus (GO:0006974) | 1.29E-02 | TOP2A; RECQL4; MSH2; POLD1; CDK1; SFN; IKBKE; ZBTB40; RHNO1 |
| G1/S transition of mitotic cell cycle (GO:0000082) | 2.35E-02 | CDKN2C; DBF4; RRM2; MCM6; CDKN3 |
| positive regulation of cell cycle process (GO:0090068) | 2.35E-02 | CCNB1; SPAG5; RACGAP1; AURKA; RHNO1 |
| regulation of mitotic cell cycle (GO:0007346) | 2.38E-02 | TPX2; CCNB1; PSRC1; TACC3; CDK1; BIRC5 |
| peptidyl-lysine modification (GO:0018205) | 2.66E-02 | TOP2A; CDCA8; BIRC5; P3H4; LOXL2 |
| phosphorylation (GO:0016310) | 2.66E-02 | PKN3; MAST2; ADM2; LIMK1; CDK1; TSSK6; BIRC5; IKBKE; AURKA |
| G2/M transition of mitotic cell cycle (GO:0000086) | 3.42E-02 | CCNB2; CCNB1; CEP131; CDK1; AURKA |
| cell cycle G2/M phase transition (GO:0044839) | 3.45E-02 | CCNB2;CCNB1;CEP131;CDK1;AURKA |
| regulation of G2/M transition of mitotic cell cycle (GO:0010389) | 4.95E-02 | TPX2; CCNB1; CEP131; CDK1; AURKA |
KEGG pathway | Cell cycle | 1.18E-07 | CDC20; CCNB2; CCNB1; CDKN2C; DBF4; PTTG1; CDK1; SFN; MCM6 |
| Oocyte meiosis | 1.76E-05 | CDC20; CCNB2; CCNB1; PTTG1; CDK1; CPEB3; AURKA |
| p53 signaling pathway | 1.06E-04 | CCNB2; CCNB1; RRM2; CDK1; SFN |
| Drug metabolism | 5.43E-03 | RRM2; UGT2B11; NAT2; TK1 |
| Central carbon metabolism in cancer | 8.74E-03 | G6PD; SLC2A1; SLC16A3 |
| Mismatch repair | 9.70E-03 | MSH2; POLD1 |
| Human T-cell leukemia virus 1 infection | 1.43E-02 | CDC20; CCNB2; CDKN2C; PTTG1; SLC2A1 |
| Pentose and glucuronate interconversions | 2.05E-02 | AKR1B10; UGT2B11 |
| DNA replication | 2.29E-02 | POLD1; MCM6 |
| Axon guidance | 3.06E-02 | EFNA3; LIMK1; PLXNA1; ROBO1 |
FDR, false discovery rate; GO, Gene ontology; BP, biological process; KEGG, Kyoto Encyclopedia of Genes and Genomes. |