Identification of intestinal flora-related DEGs in CRC
According to the inclusion criteria, 518 genes were identified within 1110 published papers. To explore whether intestinal flora influences the pathogenesis of CRC by regulating the gene expression, we then identified the DEGs between CRC and normal samples from the TCGA database. Compared with the normal samples, there were 1665 and 2235 DEGs in colon adenocarcinoma (COAD) and rectal adenocarcinoma (READ), respectively. Moreover, there were 25 up-regulated genes (Fig. 1a) and 23 down-regulated genes (Fig. 1b) in common between the DEGs and intestinal flora-related genes. The 48 common genes listed in Table 1 were selected for further analysis.
Table 1 Genes in common between the DEGs and intestinal flora-related genes
Category
|
Gene symbol
|
Up-regulated
|
CDKN2A, TNFSF9, CXCL1, CSF2, SPP1, MMP3, MUC5AC, FUT1, NPC1L1, CLDN2, NOX4, GAST, GATA4, IGF2, IL23A, ALB, CXCL8, CXCL10, OSM, REG1A, HP, FGF19, IL22, DIO2, AGT
|
Down-regulated
|
DAO, PYY, ALPI, NPY, SST, ABCG2, FABP2, IL6R, CASR, CNR1, GPR15, CHGA, SLCO4C1, DNASE1L3, AQP8, MAOB, GPT, INSL5, MAPT, PRKCB, GCG, CD36, CFD
|
DEGs, differentially expressed genes.
Functional enrichment analyses
Using the DAVID tool, we identified 55 GO terms and 5 KEGG terms in which the 48 intestinal flora-related DEGs enriched significantly (P<0.05). The results showed the top 5 significant enrichment terms for biological processes, cellular component and molecular function, and 5 KEGG pathway terms (Fig. 2). As listed in Table 2, in the biological processes annotation, selected DEGs are mainly involved in cell-cell signaling, immune response, inflammatory response, digestion, and G-protein coupled receptor signaling pathway. In the cellular component annotation, DEGs are mainly involved in extracellular space, extracellular region, plasma membrane, extracellular exosome, and apical plasma membrane. As for molecular function, DEGs are mainly involved in hormone activity, growth factor activity, cytokine activity, receptor binding, and chemokine activity.
Table 2 The top five GO enrichment terms of the intestinal flora-related DEGs
Category
|
Term
|
Description
|
Count
|
P value
|
BP
|
GO:0007267
|
cell-cell signaling
|
7
|
7.21E-05
|
BP
|
GO:0006955
|
immune response
|
8
|
1.57E-04
|
BP
|
GO:0006954
|
inflammatory response
|
7
|
6.24E-04
|
BP
|
GO:0007586
|
digestion
|
4
|
7.25E-04
|
BP
|
GO:0007186
|
G-protein coupled receptor signaling pathway
|
10
|
7.52E-04
|
CC
|
GO:0005615
|
extracellular space
|
24
|
2.29E-14
|
CC
|
GO:0005576
|
extracellular region
|
24
|
9.97E-13
|
CC
|
GO:0005886
|
plasma membrane
|
17
|
4.90E-02
|
CC
|
GO:0070062
|
extracellular exosome
|
13
|
4.93E-02
|
CC
|
GO:0016324
|
apical plasma membrane
|
4
|
3.90E-02
|
MF
|
GO:0005179
|
hormone activity
|
7
|
1.99E-07
|
MF
|
GO:0008083
|
growth factor activity
|
7
|
4.85E-06
|
MF
|
GO:0005125
|
cytokine activity
|
6
|
1.13E-04
|
MF
|
GO:0005102
|
receptor binding
|
6
|
2.64E-03
|
MF
|
GO:0008009
|
chemokine activity
|
3
|
7.88E-03
|
GO, gene ontology; DEGs, differentially expressed genes; BP, biological process; CC, cellular components; MF, molecular function.
Of the 5 enriched pathways, the two most significant pathways were cytokine-cytokine receptor interaction (P=2.22E-05) and JAK-STAT signaling pathway (P=0.006). Additional highly enriched relevant pathways were rheumatoid arthritis, fat digestion and absorption, and TNF signaling pathway (Table 3).
Table 3 The KEGG pathway enrichment terms of the intestinal flora-related DEGs
Term
|
Description
|
Count
|
P value
|
hsa04060
|
Cytokine-cytokine receptor interaction
|
9
|
2.22E-05
|
hsa04630
|
JAK-STAT signaling pathway
|
5
|
5.97E-03
|
hsa05323
|
Rheumatoid arthritis
|
4
|
9.85E-03
|
hsa04975
|
Fat digestion and absorption
|
3
|
1.66E-02
|
hsa04668
|
TNF signaling pathway
|
4
|
1.67E-02
|
KEGG, Kyoto Encyclopedia of Genes and Genomes; DEGs, differentially expressed genes; JAK, Janus kinase; STAT, signal transducer and activator of transcription; TNF, tumor necrosis factor.
PPI network construction and survival analysis
From the KEGG pathway analysis, 13 genes were selected. With these 13 genes, the PPI network analysis was performed using STRING and then was visualized using Cytoscape v3.7.2 (Fig. 3a). There were 13 nodes and 35 edges in the PPI network (PPI enrichment P<1.0E-16). Moreover, the most significant cluster network (score = 7.143) was created using MCODE plug-in, consisting of 25 edges and 8 nodes/genes (CSF2, CXCL1, MMP3, CXCL10, IL23A, CXCL8, IL22, and IL6R) (Fig. 3b).
Further, we used the GEPIA to analyze the correlation of the 8 key genes with overall survival of CRC patients. The result showed that only chemokine (C-X-C motif) ligand 8 (CXCL8) was closely related to the overall survival of patients with CRC (Fig. 3c).
Drug‑gene interaction
The 8 potential key genes in the PPI network were selected for drug-gene interaction analysis. A list of 24 drugs meeting the requirements for CRC treatment was compiled, comprising only the antineoplastic drugs that had been approved by the FDA (Table 4). As listed in Table 4, the potential targets of these drugs include CSF2, CXCL8, IL6R, and CXCL10, and 79.2% (19/24) of the drugs target CSF2 and CXCL8. Several drugs have been applied either alone or in combination in clinical trials or treatments for CRC, such as cetuximab, oxaliplatin, bevacizumab, temozolomide, and interferon alfa-2b.
Table 4 Candidate drugs targeting key genes
Gene
|
Drug
|
Score*
|
Refs. (PMID)
|
CSF2
|
Interferon alfa-2b
|
2
|
10522033
|
CSF2
|
Mechlorethamine
|
2
|
10640980
|
CSF2
|
Streptozotocin
|
2
|
16342200
|
CSF2
|
Procarbazine
|
2
|
10640980
|
CSF2
|
Temozolomide
|
3
|
12610499, 16100942
|
CSF2
|
Mycophenolic acid
|
2
|
9822358
|
CSF2
|
Nordihydroguaiaretic acid
|
2
|
2453577
|
CSF2
|
Idarubicin
|
2
|
8915668
|
CSF2
|
Cytarabine
|
3
|
8819077, 8450676
|
CSF2
|
Vinblastine
|
2
|
10640980
|
CXCL8
|
Paclitaxel
|
2
|
9271387
|
CXCL8
|
Bevacizumab
|
1
|
-
|
CXCL8
|
Aspirin
|
2
|
12576442
|
CXCL8
|
Leflunomide
|
2
|
10902750
|
CXCL8
|
Tretinoin
|
2
|
8900181
|
CXCL8
|
Cetuximab
|
4
|
10614716, 15908664, 10037173
|
CXCL8
|
Medroxyprogesterone acetate
|
2
|
15914533
|
CXCL8
|
Cyclophosphamide
|
1
|
-
|
CXCL8
|
Verapamil
|
2
|
2686646
|
IL6R
|
Oprelvekin
|
1
|
-
|
IL6R
|
Tocilizumab
|
8
|
16899109
|
IL6R
|
Fluorouracil
|
2
|
8888499
|
IL6R
|
Thalidomide
|
2
|
12515619
|
CXCL10
|
Oxaliplatin
|
2
|
16101140
|
*The score is the combined number of database sources and PubMed references supporting a given interaction.