3.1 Clinical characteristics
A set of data from 418 patients were downloaded from the TCGA database with corresponding patient demographic and clinical characteristics data including age, gender, histological grade, stage, T/N/M classification and survival status of liver cancer (Table 1). Paired tumor, adjacent non-tumor liver tissues from a cohort of 316 HBV-related HCC patients were downloaded from the current Clinical Proteomic Tumor Analysis Consortium (CPTAC) project (28).
Table 1. Clinical characteristics of the liver cancer patients.
Characteristics
|
|
Number of patients (%)
|
Age
|
|
|
|
<55
|
120(28.71)
|
|
≥55
|
256(61.24)
|
|
Not available
|
42(10.05)
|
Gender
|
|
|
|
Female
|
146(34.93)
|
|
Male
|
254(60.77)
|
|
Not available
|
18(4.31)
|
Histological grade
|
|
|
|
G1
|
55(13.16)
|
|
G2
|
180(43.06)
|
|
G3
|
124(29.67)
|
|
G4
|
13(3.11)
|
|
Not available
|
46(11)
|
Stage
|
|
|
|
I
|
194(46.41)
|
|
II
|
98(23.44)
|
|
III
|
90(21.53)
|
|
IV
|
12(2.87)
|
|
Not available
|
24(5.74)
|
T Classification
|
|
|
|
T1
|
204(48.8)
|
|
T2
|
107(25.6)
|
|
T3
|
90(21.53)
|
|
T4
|
14(3.35)
|
|
TX
|
1(0.24)
|
|
Not available
|
2(0.48)
|
M Classification
|
|
|
|
M0
|
303(72.49)
|
|
M1
|
8(1.91)
|
|
MX
|
107(25.6)
|
N Classification
|
|
|
|
N0
|
290(69.38)
|
|
N1
|
8(1.91)
|
|
NX
|
119(28.47)
|
|
Not available
|
1(0.24)
|
Survival status
|
|
|
|
Death
|
147(35.17)
|
|
Survival
|
271(64.83)
|
Not available data and TX data were not used for subsequent analysis
3.2 SNRPA1 is highly expressed in liver cancer
SNRPA1 mRNA expression level was significantly up − regulated in tumor tissues compared with normal tissues (p = 1.411e − 27, Figure. 1A) in liver cancer. A paired comparison between normal and liver cancer tissue from the same patients also showed a significant up − regulation (p = 4.08e − 16, Figure. 1B). SNRPA1 expression level showed a positive correlation with survival status (p = 0.035, Figure. 1C). Furthermore, significant differences were observed in SNRPA1 expression based on histological grade and T classification (Fig. 1D − 1F).
3.3 SNRPA1 is an independent risk factor for evaluation of the survival rate in liver cancer patients
Univariate and multivariate Cox analyses indicated that the SNRPA1 mRNA expression (hazard ratio [HR] = 1.08, 95% confidence interval [CI]: 1.02–1.14, p = 0.005, Table 2) could be a useful biomarker for liver cancer prognosis.
Table 2. Univariate analysis and multivariate analyses of the correlation between SNRPA1 expression and clinical parameters.
|
Univariate analysis
|
Multivariate analysis
|
Parameters
|
HR
|
95%CI
|
P-value
|
HR
|
95%CI
|
P-value
|
age
|
1.01
|
1–1.03
|
0.177
|
|
|
|
gender
|
0.82
|
0.56–1.21
|
0.317
|
|
|
|
grade
|
1.12
|
0.87–1.45
|
0.382
|
|
|
|
stage
|
1.67
|
1.36–2.06
|
0.000
|
1.19
|
0.51–2.8
|
0.680
|
T classification
|
1.65
|
1.36–2.01
|
0.000
|
1.41
|
0.63–3.18
|
0.402
|
SNRPA1
|
1.08
|
1.03–1.14
|
0.002
|
1.08
|
1.02–1.14
|
0.005
|
P-values in Bold indicate p < 0.05. HR: hazard ratio; CI: confidence interval.
3.4 SNRPA1 gene set enrichment analysis in liver cancer
To identify the potential mechanisms of SNRPA1 expression on liver cancer prognosis, we conducted the GSEA (GO and KEGG pathway enrichment analysis) between low and high SNRPA1 expression groups (Table 3). The GO and KEGG analyses results showed processes and pathways associated with AS. “RNA splicing”, “small nuclear ribonucleoprotein complex”, “RNA splicing via transesterification reactions”, “spliceosomal complex” and “mRNA processing” were enriched in GO analysis. “Spliceosome”, as well as some carcinogenesis and development associated pathways, like “DNA replication”, “base excision repair”, “RNA degradation” and “cell cycle” were enriched in KEGG analysis. These related results have been shown in Fig. 2. Our results suggested that SNRPA1 could be a useful biomarker and is related to other gene functions through alternative splicing.
Table 3
The top 10 enriched GO and KEGG pathways of high SNRPA1 expression groups
Name
|
NES
|
NOM p − val
|
FDR q − val
|
GO_POSITIVE_REGULATION_OF_DNA_BIOSYNTHETIC_PROCESS
|
2.151
|
0.000
|
0.124
|
GO_RNA_SPLICING
|
2.115
|
0.000
|
0.115
|
GO_SMALL_NUCLEAR_RIBONUCLEOPROTEIN_COMPLEX
|
2.106
|
0.000
|
0.090
|
GO_U1_SNRNP
|
2.105
|
0.000
|
0.069
|
GO_NEGATIVE_REGULATION_OF_NUCLEAR_DIVISION
|
2.088
|
0.000
|
0.075
|
GO_RNA_SPLICING_VIA_TRANSESTERIFICATION_REACTIONS
|
2.088
|
0.000
|
0.063
|
GO_SM_LIKE_PROTEIN_FAMILY_COMPLEX
|
2.081
|
0.000
|
0.065
|
GO_TELOMERASE_HOLOENZYME_COMPLEX
|
2.076
|
0.000
|
0.062
|
GO_SPLICEOSOMAL_COMPLEX
|
2.063
|
0.000
|
0.068
|
GO_MRNA_PROCESSING
|
2.053
|
0.000
|
0.072
|
KEGG_SPLICEOSOME
|
2.062
|
0.000
|
0.031
|
KEGG_DNA_REPLICATION
|
1.861
|
0.006
|
0.223
|
KEGG_HOMOLOGOUS_RECOMBINATION
|
1.847
|
0.004
|
0.170
|
KEGG_BASE_EXCISION_REPAIR
|
1.800
|
0.008
|
0.205
|
KEGG_RNA_DEGRADATION
|
1.790
|
0.000
|
0.179
|
KEGG_CELL_CYCLE
|
1.788
|
0.010
|
0.154
|
KEGG_PYRIMIDINE_METABOLISM
|
1.782
|
0.004
|
0.141
|
KEGG_OOCYTE_MEIOSIS
|
1.777
|
0.000
|
0.130
|
KEGG_PURINE_METABOLISM
|
1.706
|
0.000
|
0.214
|
KEGG_NUCLEOTIDE_EXCISION_REPAIR
|
1.700
|
0.018
|
0.201
|
NES: normalized enrichment score; NOM: nominal; FDR: false discovery rate; p-val: p value |
3.5 Alternative splicing profiles of liver cancer in TCGA
By analyzing AS events of 418 cases of liver cancer patients from TCGA, we found 2666 AAs in 1937 genes, 2331 ADs in 1663 genes, 6325 APs in 2566 genes, 8087 ATs in 3532 genes, 12327 ESs in 5331 genes, 137 MEs in 135 genes, and 2263 RIs in 1561 genes. Detailed information about the specific AS types of genes was visualized in the Upset plot (Fig. 3A).
3.6 Analysis of OS − related AS events with univariate Cox
AS events data was used to perform univariate analyses for OS. The results of the univariate Cox proportional hazards regression are shown in Supplementary Table S1. Inclusively, 222 AAs, 226 ADs, 618 APs, 891 ATs, 1272 ESs, 16 MEs, and 221 RIs were significantly altered (p < 0.05). The Upset plot of significant OS − related AS types was shown in Fig. 3B. Interestingly, SNRPA1 (SNRPA1–32758–ES), belonging to ES events, was significantly (p = 0.003) associated with OS in the univariate Cox model (hazard ratio [HR] = 155.76, 95% confidence interval [CI]: 5.26–4612.99). The PSI value of every patient has been shown in Supplementary Table S2.
3.7 ES events-related genes interaction networks construction
With access to RNA − seq data and corresponding clinical information of liver cancer patients, we identified 404 candidate SFs whose expression levels were significantly associated with OS related ASs events. Among them, we found that SNRPA1 (COR = 0.606, p = 7.18E − 34), as well as LSM8, PCBP4, LSM2, SNRPB2, NOSIP, SNRPG, SNRPE, SNRPF, SF3A2, PRPF31, EIF2S2, SNRPA, DDX39A, SNRPD1, THOC5, LSM7, ISY1, SRRT, SNU13, RALY, CCDC12, and SNRPB were related to SCP2 (SCP2 − 3045 − ES) (Table 4). We constructed networks between the prognosis associated ES events and survival associated SFs to identify the underlying interactions (Fig. 3C). A total of 48 SFs and 21 AS events were constructed. Ultimately, 69 nodes and 72 edges were established in the PPI networks, which included 14 down − regulated AS events and 7 up − regulated AS events. The SFs and AS events that correlated positively (COR > 0.6) and negatively (COR < − 0.6) were shown by red and blue edges, respectively. From the results, SCP2 could be regulated by 23 SFs.
Table 4
The correlation analysis of AS events and the expression of SFs for SCP2
SF
|
AS
|
COR
|
p − value
|
LSM8
|
SCP2 − 3045 − ES
|
0.64622
|
1.09E − 39
|
PCBP4
|
SCP2 − 3045 − ES
|
0.606735
|
5.80E − 34
|
LSM2
|
SCP2 − 3045 − ES
|
0.671344
|
8.47E − 44
|
SNRPB2
|
SCP2 − 3045 − ES
|
0.621021
|
6.10E − 36
|
NOSIP
|
SCP2 − 3045 − ES
|
0.62959
|
3.54E − 37
|
SNRPG
|
SCP2 − 3045 − ES
|
0.633561
|
9.18E − 38
|
SNRPE
|
SCP2 − 3045 − ES
|
0.600225
|
4.29E − 33
|
SNRPF
|
SCP2 − 3045 − ES
|
0.702603
|
1.64E − 49
|
SF3A2
|
SCP2 − 3045 − ES
|
0.626232
|
1.09E − 36
|
PRPF31
|
SCP2 − 3045 − ES
|
0.630417
|
2.68E − 37
|
SNRPA1
|
SCP2 − 3045 − ES
|
0.606052
|
7.18E − 34
|
EIF2S2
|
SCP2 − 3045 − ES
|
0.655976
|
3.07E − 41
|
SNRPA
|
SCP2 − 3045 − ES
|
0.692363
|
1.47E − 47
|
DDX39A
|
SCP2 − 3045 − ES
|
0.641328
|
6.19E − 39
|
SNRPD1
|
SCP2 − 3045 − ES
|
0.716852
|
2.28E − 52
|
THOC5
|
SCP2 − 3045 − ES
|
0.604718
|
1.08E − 33
|
LSM7
|
SCP2 − 3045 − ES
|
0.711111
|
3.39E − 51
|
ISY1
|
SCP2 − 3045 − ES
|
0.748097
|
2.66E − 59
|
SRRT
|
SCP2 − 3045 − ES
|
0.633006
|
1.11E − 37
|
SNU13
|
SCP2 − 3045 − ES
|
0.724853
|
4.71E − 54
|
RALY
|
SCP2 − 3045 − ES
|
0.666096
|
6.60E − 43
|
CCDC12
|
SCP2 − 3045 − ES
|
0.619867
|
8.89E − 36
|
SNRPB
|
SCP2 − 3045 − ES
|
0.734261
|
4.13E − 56
|
COR: correlation coefficient |
3.8 Relationships between different splices and cancer types
The different splices have been shown in Fig. 4A. When the normal and tumor tissues PSI values of liver cancer were compared, it was found SNRPA1 exon 6 skip (SNRPA1_exon_6) and SCP2 exon 12 skip (SCP2_exon_12) correlated with LICH (Fig. 4B). The tumor tissues were significantly different from normal tissues with Wilcoxon rank-sum tests (p < 0.001). Further analyses in BRCA, COAD, LUAD and STAD showed the PSI values of SNRPA1_exon_6 and SCP2_exon_12 were significantly increasing in tumor tissues (p < 0.001 for each cancer type, Fig. 4C − 4D).
3.9 PSI value of SNRPA1, SCP2 and clinical analysis
SNRPA1 and SCP2 PSI values showed the positive correlations with survival status (p = 3.022e − 04 and p = 2.932e − 03, respectively, Fig. 5A − B). Furthermore, the PSI values of SNRPA1 and SCP2 were significant different among subgroups in histological grade (p = 3.87e − 07 and p = 1.598e − 08), stage (p > 0.005 and p = 0.007), and T classification (p = 0.003 and p = 0.002) (Fig. 5C − H).
3.10 SCP2 is low expressed in liver cancer
On investigating the relationship between SNRPA1 and SCP2 at the gene expression level, we calculated the expression levels of SNRPA1 and SCP2 in tumor tissues and found that they were negatively correlated in clinical tumor samples (COR = − 0.417, p = 5.999e − 21; Fig. 6A). SCP2 mRNA expression level was down − regulated (p = 1.343e − 15; Fig. 6B) in liver cancer, which corresponded to the histological grade and stage of liver cancer (Fig. 6D − F). However, the SCP2 expression level had no significant correlation with survival status (p = 0.063; Fig. 6C) using the median level as a cut − off point.
3.11 Validation the SNRPA1 and SCP2 expression at protein level
For the CPTAC analyzed results, SNRPA1 protein expression level was significantly up − regulated in tumor tissues compared with normal tissues (p = 3.197e − 47, Figure. 7A) in liver cancer. SCP2 protein expression level was significantly down − regulated in tumor tissues compared with normal tissues (p = 1.083e − 23, Figure. 7B). These two protein expression levels had no significant correlation with survival status (p = 0.622, p = 0.352; Fig. 7C-D) using the median level as a cut − off point. Our clinical samples results were consistent with the CPTAC database (Fig. 7E-F).