3.1 Gene Expression analysis between healthy and primary breast cancer samples
Upon differential expression analysis between healthy and primary breast cancer samples, we found that genes like mTOR, SDHB, MYH, BARD1, APC, SPEN, ARID5B, RAD50, and PMS2, IKBKGP had significant differences in the expression between PBC tumour and healthy samples. The findings were widely anticipated, given deregulation of these genes is found not just in PBC, but also in many other malignancies such as colorectal, endometrial, and gastric cancers. Figure 4A shows a heatmap of genes differentially expressed between healthy and breast cancer samples across all 23 samples. Figure 4B shows the average differences in the expression patterns of these genes between tumour and healthy samples.
3.2 Differential expression analysis across immunohistology subtype
The subsequent analysis followed differential expression within distinct classes of immunohistological subtypes. First, we observed the expression levels of Estrogen receptor-α (ESR1), progesterone receptor (PGR) and HER2 in immunohistological classes (Figure 5). We found that Luminal B-type cancers exhibit higher levels of ESR1 expression than luminal A-or HER2 enriched tumours, whereas both luminal A- and B-type tumours exhibit higher levels of PGR than HER2 enriched tumours. Not surprisingly, HER2-enriched tumours showed significantly elevated ERBB2 (a receptor tyrosine kinase protein) transcript levels, compared to luminal A-type and luminal B-type tumours. In contrast to other classes within the immunohistological subtype, triple-negative breast cancer (TNBC) tumours showed decreased expression levels of the three proteins, ESR1, PGR, and HER2. In fact, a significant association of ESR1, and progesterone receptor (PGR) mutations and their respective expression have been reported in hormonotherapy-resistant breast cancers [25, 26].In addition, all the primary tumour samples regardless of immunohistological subtypes exhibited significant upregulation in the gonadotrophin-releasing hormone (GnRH) signaling pathways.
3.2 Differential expression analysis across wild p53-negative (p53-)and p53-positive (p53+) phenotypes
Since p53 is a known tumour suppressor implicated in a variety of cancers, differential gene expression analysis was performed between p53+ and p53- tumours samples (Figure 6a). The p53 interactome comprises the nearest neighbouring interacting partners from the literature, several of which are involved in cancer tumours metastasis and progression (Figure 6b). A heatmap representation of the FPKM values of certain candidate genes in all the samples (Figure 6c).
Differential expression analysis revealed significantly higher BReastCAncer 2 (BRCA2) expression in the p53+ than in p53- samples. Indeed, mutations in BRCA2 and its ortholog BRCA1 have been linked to the over expression of mutated p53 (p53+), correlating with a poor prognosis in breast cancers [27-29]. Further, we observed that TP53 inducible nuclear protein 2 (TP53INP2) which regulates EMT in cancer metastases [28], surprisingly had markedly lower expression in p53+ samples than in p53- samples. Due to the sampling method, the majority of p53+ samples were found to be CTC. Consequently, this would likely lead to a decrease in the expression of TP53INP2 in the p53+ samples. Further, we observed that the YWHAG gene (that stabilizes Snail thereby promoting EMT in breast cancer) which is a direct interacting partner of p53, had an increased YWHAG expression in p53+ cancers compared to p53- tumours [29-34]. The UBC genes, critical for preserving ubiquitin balance and homeostasis, have been implicated in various cancers, however, within our dataset, we did not detect a significant difference in UBC gene expression between p53+ and p53- samples. Moreover, the expression levels of TNF-receptor Associated Factor 1 (TRAF1), a common interacting partner of p53, showed no significant difference between the p53- and p53+ samples [35, 36]. Further, HSPA1A (aka. Hsp70) which encourages proper folding of the p53-DNA-binding domain[37], had lower expression in p53- samples than in their p53+ counterparts. Table 1 lists 20 transcripts that were significantly differentially expressed between the p53- and p53+ tumour samples.
Table 1. List of the top 20 transcripts that were significantly differentially expressed between the p53- and p53+ tumour samples.
GeneNames
|
geneIDs
|
fc
|
pval
|
qval
|
BTN2A1
|
MSTRG.139719
|
0.307241
|
5.50E-06
|
0.117846
|
ANKHD1
|
MSTRG.135585
|
0.334479
|
1.39E-05
|
0.149321
|
GRINA
|
MSTRG.166165
|
0.245895
|
0.000117
|
0.834394
|
SEPTIN6
|
MSTRG.180041
|
0.440208
|
0.000209
|
0.999784
|
<none>
|
MSTRG.60458
|
3.591075
|
0.001085
|
0.999784
|
GPANK1
|
MSTRG.140188
|
0.483044
|
0.002043
|
0.999784
|
RABGAP1
|
MSTRG.172656
|
0.828765
|
0.002138
|
0.999784
|
MTREX
|
MSTRG.129326
|
0.710346
|
0.002143
|
0.999784
|
<none>
|
MSTRG.110887
|
0.472226
|
0.002245
|
0.999784
|
<none>
|
MSTRG.62309
|
1.644774
|
0.002454
|
0.999784
|
<none>
|
MSTRG.12459
|
0.584573
|
0.0029
|
0.999784
|
ELOA
|
MSTRG.2120
|
0.484198
|
0.002984
|
0.999784
|
UFSP2
|
MSTRG.126214
|
0.723889
|
0.003036
|
0.999784
|
TRANK1
|
MSTRG.107059
|
2.022271
|
0.003318
|
0.999784
|
ITGA6
|
MSTRG.90910
|
0.308721
|
0.003367
|
0.999784
|
NR3C1
|
MSTRG.135883
|
0.6114
|
0.003547
|
0.999784
|
<none>
|
MSTRG.110878
|
0.53199
|
0.00375
|
0.999784
|
TM9SF4
|
MSTRG.97196
|
1.831184
|
0.003906
|
0.999784
|
NUCB1
|
MSTRG.79998
|
0.471756
|
0.004307
|
0.999784
|
RNF34
|
MSTRG.41634
|
0.5993
|
0.004368
|
0.999784
|
3.3 Differential expression analysis across EMT phenotypes by comparing localized tumours (CTC+) and metastatic tumours (CTC-)
Upondifferential expression across localized tumours (CTC+) and metastatic tumours (CTC), we found significant differential expression of certain genes, such as Cytokeratin 19 (KRT19), CDH1, EpCAM etc. In previous studies, KRT19 was reported to regulate epithelial-mesenchymal transition in breast cancer metastases however, its reduced expression of KRT19 was associated with poor prognosis [38]. In the present study, KRT19 expression was reduced in CTC+ tumours than in CTC- tumours. CDH1, which encodes E-cadherin, (a protein responsible for cell-to-cell attachment, previously implicated in hereditary lobular breast cancer (LBC)), and diffuse gastric carcinoma (DGC)) demonstrated significantly high expression in CTC+ tumours than in non-CTC+ tumours (Figure 6c) [39]. Likewise, the epithelial cell adhesion molecule (EpCAM), was also found to have elevated expression in CTC+ tumours, in fact, previous studies showed that increased EpCAM expression is associated with poor prognosis in breast cancer patients [40]. Further, conserved serine protease HTRA1 which is involved in the suppression of EMT in tumour metastases [41] showed lower expression of HTRA1 in CTC+ tumours than in CTC- tumours in the present study especially in the “BC12” and “BC20” samples(Figure 6c). In previous studies, Fibrillin-1 (FBN1) was shown to be implicated in TGF-β-induced breast cancer metastasis [42], however, for the present dataset, we did not observe a significant difference in the expression of FBN1 between CTC+ and CTC- tumours. Several breast cancer examples revealed increased expression of interleukin 2 receptor-γ (IL2RG)[39], in the present dataset however, only a single CTC+ sample (“BC14”) exhibited the highest expression of this receptor (Figure 6c). Likewise, c-Myc which is often overexpressed in breast cancer patients, sometimes in conjunction with BRCA1 suppression [40], in the samples from the present dataset, only “BC19”¾a CTC+ sample showed elevated levels of c-Myc. In further analysis, increased expression of Mitogen-activated protein kinases 3 (MAPK3) was observed in 2 CTC+ samples (“BC17” and “BC17”). However, the average expression of MAPK1 was significantly lower in CTC+ samples than in CTC- samples. In assertion to our findings, previous studies have also reported MAPK1&3 to have reduced expression in metastatic (CTC+) tumours[43].
3.4 Functional enrichment analysis and pathway analysis
Considering a q value <0.05, we identified 138 genes differentially expressed between PBC patients and healthy individuals. The transcript levels of these genes were cross-referenced with those in the Human Protein Atlas database to explore their associations with various cancers. Apart from breast cancer, most of the genes on the list exhibited implications in multiple cancer types, such as hepatocarcinoma, renal carcinoma, glioma, colorectal cancer, and lung cancer. From this pool, only 8 genes exclusively linked to breast cancer were selected, and their corresponding expression values (FPKM values) were extracted from the transcript list. The compiled list is provided in Table 2 and Figure 7. The list of significant genes thus obtained was used for functional enrichment studies utilizing a hypergeometric distribution. Functional enrichment analysis revealed that several pathways, including signalling pathways (such as RAS, RAP1, and MAPK) and cellular metabolic pathways (including glycolysis/gluconeogenesis, pyruvate metabolism, and the pentose phosphate pathway), exhibited significant prominence in tumour samples compared to healthy samples. (Figure 8).
Table 2. List of breast cancer-specific genes obtained upon differential expression comparing PBC samples to healthy controls.
Gene Names
|
geneIDS
|
fc
|
pval
|
qval
|
FGF10
|
MSTRG.128492
|
5.635777
|
1.47E-07
|
0.00077
|
ADIPOQ
|
MSTRG.115804
|
21.45618
|
2.40E-05
|
0.021864
|
LIPE
|
MSTRG.79410
|
7.523439
|
3.36E-05
|
0.024938
|
EDNRB
|
MSTRG.45346
|
3.549042
|
4.01E-05
|
0.024938
|
TRARG1
|
MSTRG.65217
|
11.74944
|
4.89E-05
|
0.026304
|
LEP
|
MSTRG.156541
|
9.036603
|
7.99E-05
|
0.031626
|
SLC9A3R1
|
MSTRG.70633
|
0.23404
|
0.00014
|
0.037534
|
CIDEA
|
MSTRG.72178
|
4.153306
|
0.000224
|
0.040077
|
We further conducted functional enrichment analysis across the EMT subtype, comparing CTC+ and CTC- samples. Analysis of differentially expressed genes (DEGs) revealed that SMAD1, TGFBR2, TWIST1, ZEB1, and SNAIL2/SLUG were significantly upregulated in CTC+ samples relative to CTC- samples. In fact, several of these genes have been shown to pathologically elevate epithelial-mesenchymal transition (EMT) for wide scale metastases (Figure 9a).In addition to the aforementioned upregulated genes, our study observed an increase in metastatic traits within tumours induced by viral oncogenes. CTC+ tumours displayed upregulation of various oncoviral induced pathways, including those associated with the Epstein-Barr Virus (EBV), hepatitis B, hepatitis C, and Kaposi-Sarcoma Associated Herpes Virus. Table 3 shows the list of differentially expressed genes between the CTC+ and CTC- groups. A bar plot of the functional enrichment and circular interactome analysis is provided in Figure 9b.
Table 3. Differentially expressed genes identified between CTC+ and CTC- samples.
Gene Names
|
Gene IDs
|
fc
|
pval
|
qval
|
JUN
|
MSTRG.4863
|
2.206242
|
0.000757
|
0.962418
|
SFT2D1
|
MSTRG.147328
|
0.42453
|
0.000787
|
0.962418
|
KRT5
|
MSTRG.36783
|
5.604195
|
0.001414
|
0.962418
|
ERH
|
MSTRG.51032
|
0.644329
|
0.001514
|
0.962418
|
ICAM3
|
MSTRG.76604
|
0.553084
|
0.002609
|
0.962418
|
KIT
|
MSTRG.119767
|
2.796855
|
0.002647
|
0.962418
|
DTX3L
|
MSTRG.111625
|
0.609323
|
0.003237
|
0.962418
|
CCDC8
|
MSTRG.79741
|
1.994054
|
0.003368
|
0.962418
|
IRX1
|
MSTRG.126571
|
2.359517
|
0.003429
|
0.962418
|
CD109
|
MSTRG.142410
|
1.73033
|
0.003874
|
0.962418
|
DSC3
|
MSTRG.72875
|
2.414973
|
0.004219
|
0.962418
|
ITIH5
|
MSTRG.16602
|
1.616171
|
0.004565
|
0.962418
|
SCRN2
|
MSTRG.68752
|
1.360452
|
0.00488
|
0.962418
|
RBM15
|
MSTRG.8354
|
0.555303
|
0.006089
|
0.962418
|
ALDH1A3
|
MSTRG.60095
|
1.670188
|
0.006102
|
0.962418
|
KRT17
|
MSTRG.68202
|
5.030419
|
0.006735
|
0.962418
|
FAM53B
|
MSTRG.24085
|
0.634872
|
0.00699
|
0.962418
|
PRNP
|
MSTRG.95627
|
1.601053
|
0.007342
|
0.962418
|
BECN1
|
MSTRG.68319
|
1.326989
|
0.007732
|
0.962418
|
TDP1
|
MSTRG.52593
|
0.531036
|
0.007836
|
0.962418
|
AASDH
|
MSTRG.119954
|
1.894606
|
0.008083
|
0.962418
|
TRIM29
|
MSTRG.31870
|
2.191798
|
0.008108
|
0.962418
|
TSPYL2
|
MSTRG.176994
|
1.393778
|
0.008511
|
0.962418
|
PITX1
|
MSTRG.135257
|
0.411268
|
0.009034
|
0.962418
|
MTCL1
|
MSTRG.72021
|
1.59995
|
0.009062
|
0.962418
|
MPHOSPH9
|
MSTRG.41732
|
0.688438
|
0.009918
|
0.962418
|