Correlation between heterogeneity of the intratumoral microbiome and clinical characteristics in colorectal cancer
We first evaluated the relationship between the microbiota composition in 92 tumor tissues based on 16S rRNA sequencing data and clinical characteristics (such as pathological stage, location, age, sex, etc.) of patients with CRC (Table 1). We noticed no statistically significant differences in α-diversity amongst individuals with all these clinical characteristics (Fig. S1). We then analyzed the β-diversity of the intratumoral microbiome in patients belonging to each clinical feature group and discovered statistically significant variations in the microbiota's β-diversity based on tumor stage (early vs. advanced stage, P = 0.049, Fig. 1A). When we considered tumor location (left colon, right colon, or rectum), the microbiota of rectal cancer and left colon cancer were similar (Fig. S2), whereas the β-diversity of left colon cancer was considerably different from that of right colon cancer (P = 0.04, Fig. 1A). There were no statistically significant differences in β-diversity among other clinical characteristics (Fig. S2), suggesting that the intratumoral microbiome differed between either tumor stage and between locations. We listed the top 10 most prevalent genera according to tumor stage and location, with Escherichia-Shigella and Fusobacterium being the most prevalent bacteria in tumors (Fig. 1B).
We then compared the microbiota of the paracancerous tissue to that found within the tumor itself to see whether there were any significant differences between the two. We discovered a significant difference in α- and β-diversity (P < 0.05, Fig. S3) between tumor and paracancerous tissues based on 16S rRNA sequence. Likewise, the microbiota in paracancerous tissues showed strikingly different levels of variety depending on the stage and location of the tumor (Fig. 1C). Intriguingly, Bacteroides and Escherichia-Shigella were the most common in paracancerous tissue (Fig. 1D). These findings suggest that tumor microbiome varies significantly across tumor stage (early vs. advanced) and location (left or right colon).
Table 1: Patient characteristics
Overall survival of colorectal cancer patients was associated with tumor stage and location-related microbiota
To further investigate how intratumoral microflora related to staging in CRC, we first identified the top 50 microflora that contributed to tumor staging in our sequencing data. We used two different test algorithms of random forest approach, Escherichia-Shigella, Parvimonas, and Dialister were identified as the three microorganisms that contributed the most to staging (measured by mean decrease accuracy), by using other test algorithm (measured by mean decrease Gini), we found that Escherichia-Shigella, Dialister, and Peptostreptococcus were the top three (Fig. 2A). Meanwhile, we used LEfSe to identify the specific bacteria linked to CRC stage (early or advanced). Several opportunistic pathogens, including f_Enterobacteriaceae, o_enterobacterale, g_Escherichia-Shigella, and s_Escherichia_coli_ATCC-25922, were all significantly overrepresented (all LDA scores (log10) > 4) in early staging cancer, whereas f_Veillonellaceae, g_Dialister, and s_Dialister were the most abundant microbiota in the advanced group (LDA scores (log10) > 4.0 (Fig. 2B). The Wilcoxon test indicated that g_Escherichia-Shigella was highly expressed in early CRC (Fig. 2C).
To determine if stage-related microflora were associated with the overall survival of CRC patients, we used the TCGA microflora database (TCMA) and assessed overall survival. The results indicate that, among all differentially expressed microflora, the intratumoral quantity of Porphyromonas, Lachnoclostridium, Aggregatibacter, and Hungatella was associated with poor prognosis, while high levels of Bacteroides were associated with good prognosis (Fig. 2D). These five bacteria were subsequently studied as a cluster, one whose expression, we discovered, was negatively correlated with overall survival time in CRC patients (Fig. 2E).
Next, we conducted an identical investigation of intratumoral microbiota at the left and right colon locations in our data. The random forest analysis found that Peptostreptococcus, Aeromonas, and Selenomonas were the three most important bacteria associated with tumor location (left or right colon, measured by mean decrease accuracy). Meanwhile, by using another algorithm of random forest (measured by mean decrease Gini), we found Peptostreptococcus, Selenomonas, and Bacteroides were three genera that most contributed to tumor location (Fig. 3A). LEfSe analysis revealed that s_Fusobacterium, g_Selenomonas, s_Selenomonas, and o_Peptostreptococcales-Tissierellales were the four most highly expressed bacterial species (LDA scores (log10) > 4) in the right colon cancer. In left colon cancer, g_Megamonas, s_Megamonas, o_Desulfovibrionales, c_Desulfovibrionia, and p_Desulfobacteriota were the most prevalent genera (LDA scores (log10) > 3.0) (Fig. 3B). Significantly higher amounts of Peptostreptococcus and Selenomonas were found in right colon cancer compared to left colon cancer (Fig. 3C). By comparing the expression data of the microflora aforementioned with patient survival information from TCMA, we determined that Blautia and, once again, Bacteroides were closely associated with good prognosis (Fig. 3D). Furthermore, this intratumoral bacterial profile (Bacteroides and Blautia) was likewise associated with better outcome in individuals with colon cancer (Fig. 3E). These findings suggest that there are distinct microflora linked with the stage and location of CRC, and that these microflora are strongly connected to the overall survival of patients.
Alterations in colorectal cancer pathways are associated with tumor stage and local microflora
To further evaluate the influence of stage- and location-associated intratumoral microflora on CRC-related pathways, we first performed a correlation study on the abundance of microflora related to the characteristic bacteria mentioned (Fig. 4A). We found that stage-related microflora clusters (Porphyromonas, Lachnoclostridium, Bacteroides, Aggregatibacter, and Hungatella) were associated with sulfur metabolism, styrene degradation, apoptosis, thiamine metabolism and cellular antigens, whereas location-related clusters (Bacteroides and Blautia) were associated with glycerolipid metabolism, nitrotoluene degradation and oxidative phosphorylation (Fig. 4B).
We further used TCMA database to analyze the relationship between intratumoral flora clusters and tumor genes. The tumor samples were divided into two groups according to the relative abundance of characteristically intratumoral flora clusters, then we looked for differentially expressed genes, which processed by GO analysis. GO analysis indicated that stage- and site-specific clusters were associated with the humoral immune response, immunoglobulin complexes, and mucosal immune response pathways (Fig. 4C), suggesting a connection between the microbiota and the immune response. GSVA analysis revealed that the stage-dependent cluster was positively correlated with the bile acid metabolism pathway, whereas the location-dependent cluster was negatively correlated there with, indicating that these two flora clusters may have opposite role on bile acid metabolism (Fig. 4D).
Specific microbiota are associated with the infiltration of immune cells in colorectal cancer
The intratumoral microflora may be related to the immune response of CRC patients(Gao et al. 2022). To further confirm the association between the infiltrated intratumoral microflora and the infiltrated tumor immune cells, IHC staining was performed on the tumor tissues of 92 patients, and the expressions of CD8, FOXP3, CD163, PD-1, and PD-L1 were measured in the tumor. Figure 5A depicts representative images of the IHC staining. Among the microflora contained in the stage- and location-related clusters, we discovered a negative correlation between Bacteroides and PD-1 expression, Blautia and tumor-associated macrophage infiltration, and Hungatella and CD8 + T cell infiltration (Fig. 5B). Figures 5C and D depict the relationship between distinctive microorganisms and their connection with immunohistochemical indices.
Patient samples were subsequently separated into two groups based on the level of stage- and site-specific microorganisms. Expression of PD-1 and FOXP3 were significantly reduced (P < 0.05) at low abundance of stage-related bacterial clusters, whereas expression of CD163, a cancer-related macrophage markers, was decreased at high expression of location-related bacterial clusters (Fig. 5E). These findings suggest a connection between intratumoral immune infiltration and the stage- and site-specific microflora that we identified.
Downstream pathway analysis indicates potential signal pathways in the particular microbiota contained in stage- and location-related clusters
Thus far, we had discovered that the levels of Bacteroides, Blautia, and Hungatella were not only directly connected to patient prognosis, but also independently related to the infiltrating immune cells in the tumor. Therefore, we employed the TCGA database to study potential signal pathways dependent on the expression of these bacteria. Volcanic map, GO analysis, and KEGG pathway analysis were respectively conducted by Bacteroides, Blautia, and Hungatella (Fig. 6A-C). Patient samples were divided into two groups based on the abundance of these three bacteria in TCMA. Within these two groups, we looked for differentially expressed genes, which we then processed by GO (Fig. 6A-C). We discovered that Bacteroides-associated, differentially expressed genes were primarily involved with humoral immunity. KEGG analysis revealed that high abundance of Bacteroides in tumor was positively correlated with glycosaminoglycan degradation, galactose metabolism, amino sugar and nucleotide sugar metabolism and TGF-β signal pathway, and negatively correlated with non-homologous end joining, maturity onset diabetes of the young and thyroid cancer. (Fig. 6A). The majority of the differentially expressed genes in tumor with high Blautia expression were involved in the humoral immune response and immunoglobulin. KEGG analysis revealed a positive correlation between the relative abundance of Blautia and RNA polymerase pathways, and a negative correlation with glycan biosynthesis and sphingolipid metabolism pathway (Fig. 6B). Genes characteristic of tumor with high Hungatella expression were functionally enriched in sphingolipid metabolism, glycan biosynthesis, and p53 signal pathway (Fig. 6C). GSEA analysis revealed that Bacteroides genes were correlated with negative regulation of macromolecule metabolic process, Blautia genes were correlated with cellular aromatic compound metabolic process, and Hungatella genes were correlated with negative regulation of myeloid cell differentiation (Fig. 6D). These findings indicate that Bacteroides, Blautia, and Hungatella were associated with specific signal pathways in CRC.