ScRNA-seq landscape along the consecutive gastric carcinogenesis associated with H. pylori
To explore the cellular heterogeneity and microenvironment in gastric tumorigenesis associated with H. pylori infection, we collected 18 gastric specimens representing the progressive histologic stages, i.e., 6 cases of GS, 6 cases of IM, and 6 cases of GC specimens (Figure 1A). Each stage of the lesion was divided into two groups based on H. pylori infection status. Detailed patient information, including age, sex, pathological information, H. pylori infection status, was provided in Table S1. Tissue samples were dissociated into single-cell suspensions and processed using the 10× Genomic Chromium workflow (Figure 1B).
After data quality control and filtering, a total of 91394 cells were used for further analysis (2563-7677 cells; median 5077 cells/patient) (Table S1). The resulting 26 cell clusters were annotated as 9 cell types by established marker genes(10, 15) (Figure 1C and Figure S1A). Non-immune cells included epithelial cells (PECAM, KRT19), ECs (PECAM1, CDH5), fibroblasts (COL1A1, PDGFRA), and smooth muscle cells (TAGLN, ACTA2); immune cells included T cells (CD3D, CD3G), B cells (CD79A), myeloid cells (CD68, CSF1R), plasma cells (MZB1, IGKC), and mast cells (TPSAB1, TPSB2) (Figure 1C; Figure S1B and Table S2). Epithelial cells expressed high levels of gastric mucosa-specific genes MUC5AC and TFF1 (16)(Figure S1C), as expected, indicating the accuracy of our data.
Gradually decreased proportion of epithelial cells, accompanied by increased percentages of myeloid cells during gastric carcinogenic cascade.
All of these cell types were present among patients, albeit in varying proportions (Figure S1D). When comparing the cellular compositions across the three progressively pathological stages (GS, IM and GC), we found that the GS samples were dominated by epithelial cells and T cells. Notably, the epithelial cells gradually decreased in proportion, accompanied by gradually increased myeloid cells as the disease progressed to cancer. Additionally, the ECs were greatly expanded in cellular proportion and significantly increased in GC tissues, compared to GS and IM tissues (Figure 1D). Furthermore, H. pylori-infected samples contained relatively lower proportion of epithelial cells compared to uninfected tissues, while the trend was opposite for ECs and myeloid cells (Figure 1E), in line with the RNA expression levels of the marker genes of these three cell types in bulk RNA-seq data in gastric tissues (Figure S1E). The top 10 candidate marker genes for each cell types were shown in Figure 1F. We further performed the IHC staining to validate the changes of endothelial, myeloid and epithelial cells during the cascade of gastric carcinogenesis. Consistently, upregulation of marker genes CD31 and CD68 for ECs and myeloid cells, respectively (Figure 1G), and downregulation of gastric epithelial cell-specific marker TFF1 were observed in GC tissues, compared to GS and IM specimens (Figure 1H). In addition, the increased expression levels of CD68, as well as the decreased levels of TFF1, were also supported by TCGA gastric carcinoma cohort (Figure S1F). The analysis of several public datasets from GEO database (GSE122401, GSE65801 and GSE13195) by comparing cancer tissues and paired normal adjacent tissues also showed consistent data (Figure S1G and H). These differences highlight the dynamic changes of cellular compositions in gastric tumorigenesis.
Distinct cell lineage and transcriptional patterns of epithelial cells during multistep gastric carcinogenesis
To explore lineage states and transcriptional changes in gastric tumorigenesis, we next performed unsupervised dimensionality reduction of epithelial cells. A total of 19 clusters emerged, including pit mucous cells, neck mucous cells, chief cells, parietal cells, endocrine cells, enteroendocrine cells, enterocytes, and tumor-like epithelial cells (tumor-like EPC) (Figure 2A and Figure S2A). Several common epithelial clusters were identified using known marker genes, such as MUC5AC and TFF1 (cluster 1,3 and 4) representing pit mucous cells, MUC6 representing neck mucous cells expressing (cluster 5 and 13), PGA3 and PGA4 representing chief cells (cluster 6), and ATP4A and ATP4B representing parietal cells (cluster 8 and 18)(17, 18). Endocrine cells (cluster 2, 9, 16, and 17) and enteroendocrine cells (cluster 11, 12, 14 and 19) were identified based on high expression of GAST and SST, CHGA and TPH1, respectively(19) (Figure 2B; Figure S2B). They exhibited clearly different states from other cell lineages. As expected, the classical epithelial cell populations, such as pit mucous cell, neck mucous cells, parietal cells, showed significantly decreased cell numbers across gastric carcinogenesis cascade (Figure S2C). Interestingly, epithelial cluster 10 expressing FABP1 and PHGR1, defined as enterocytes, were almost derived from the precancerous tissue of gastric intestinal metaplasia. Additionally, cluster 7 named tumor-like PEC, expressing cancer embryonic antigen CEACAM5, were enriched in GC tissues ((Figure 2C and Figure S2D).
Given that cancer cells often occur multiple mutations and chromosomal alterations, we further utilized copy number variants (CNV) analysis to classify epithelial cells as either non-malignant or malignant cells. As expected, tumor-like EPC displayed higher CNV levels compared to other cell types (Figure S2E). We also compared the CNV scores of epithelial cells across all samples, and found it to be significantly elevated in GC tissues, particularly in H. pylori -positive subgroups, compared with GS or IM tissues (Figure 2D). Due to the high proportion of tumor-like EPC cluster in GC samples, we performed multicolor IF staining and verified its significant abundance (EPCAM+/CEACAM5+) in gastric carcinoma tissues (Figure 2E). We noticed that KRT17 was extremely enriched in tumor-like EPC and in GC tissues (Figure S2B and F), suggesting its potential cancer-promoting role. IF and IHC staining consistently confirmed the epithelial cell-specific expression of KRT17 and its significant increase in GC tissues (Figure 2F), which were further confirmed by GC-related GEO datasets and TCGA cohort with primary tumor specimens and matched normal specimens (Figure S2G). We next aimed to explore the cellular origin and transitions of tumor-like EPC population. We surprisingly found that this cell subtype simultaneously expressed MUC5AC and TFF1, the classical pit mucous cell markers. We validated the presentence of MUC5AC in tumor-like EPC (MUC5AC+CEACAM5+) from GC samples by multicolor IF staining (Figure 2G). In line with these data, trajectory plot analysis showed that pit mucous cells were the origin of tumor-like EPC (Figure 2H). To characterize the functions and activation pathways of tumor-like EPC, we performed a gene set enrichment analysis (GSEA). Intriguingly, the gene set of tumor-like EPC was enriched in well-known pro-cancer signaling pathways, such as Notch, PI3K/AKT, and Wnt signaling pathways, as well as cell adhesion molecules, NOD-like receptor signaling pathway. Notably, we noticed that tumor-like EPC were highly expressed genes related to H. pylori infection (Figure 2I). Taken together, these data revealed the diversity and distinct cell states of epithelial cells in the cascade of gastric carcinogenesis. Remarkably, these findings further revealed the oncogenic transcriptomic features and biomarkers (such as KRT17), functional characteristics, and cell origin of tumor-like EPC subpopulation that were abundant in GC tissues, and finally identified gene sets related with H. pylori infection.
A distinctive epithelial cell population enterocyte from precancerous lesion gastrointestinal metaplasia
Given that the enterocyte cell populations were overwhelmingly enriched in gastrointestinal tissues, as determined by sample-to-sample breakdown of proportional cell cluster (Figure 2C), we aimed to characterize the transcriptional patterns and enriched pathways of this distinct cell cluster. The CDX family members CDX1 and CDX2, which have been previously reported to confer gastrointestinal metaplasia phenotypes(20), were mainly expressed in enterocyte cells (Figure S3A). This further confirmed the specificity of enterocyte subclusters in gastrointestinal metaplasia, the important precursor lesion of GC. Spasmolytic polypeptide-expressing metaplasia (SPEM) is a regenerative pathological state of gastric corpus mucosa in response to injury, associated with loss of chief and parietal cells, and hyperplasia of neck mucous cells(21). We observed that SPEM cells, defined by the co-expression of MUC6 and TFF2, seemed to be gradually decreased from GS to IM, and GC (Figure S3B). This is consistent with the notion that SPEM represents as a precursor lesion to metaplasia(17, 18). To assess the transcriptional features of enterocyte cells, we compared the DEGs of this subcluster with other epithelial cell subsets. The UMAP plot showed that these marker genes that regulate lipid metabolism (FABP1, APOC3 and APOC4), and specific genes for intestine (KRT20, ANPEP) were specifically overexpressed in enterocytes (Figure 3A). Based on the pseudotime results using Monocle 3 algorithm, these candidate marker genes were significantly upregulated in metaplasia lesion (Figure S3C). We further performed IF staining to delineate the existence of enterocytes expressing marker genes FABP1 and KRT20 (Figure 3B). As expected, FABP1 and KRT20 were remarkedly overexpressed in IM tissues, compared with GS or GC tissues by IHC staining (Figure 3C).
Gene Ontology (GO) enrichment analysis indicated that enterocytes expressed genes were involved in chylomicron formation and cholesterol homeostasis, suggesting the activity of energy metabolism in IM phase. Enriched Signaling pathways were involved in fat digestion and absorption, cholesterol metabolism and PPAR signaling pathways (Figure S3D). Afterwards, we applied SCENIC to infer TF-target regulatory network (regulons) of enterocytes. HNF4G, exhibited specific and high transcriptional prosperities in enterocytes (Figure 3D), which appeared to be consistent with the previous studies of the important role of HNF4G in regulation of fatty acid oxidation and intestinal stem cells renewal(22). Of note, HNF4G, together with its downstream 83 target genes, such as FABP1, FABP2, APOB, and VIL, were specifically expressed in enterocytes (Figure 3E and Table S3). IHC staining verified the increased levels of HNF4G in IM samples when compared with GS or GC samples (Figure 3F and G). In summary, these findings uncovered enterocytes as the distinct epithelial sublineages in premalignant intestinal metaplastic lesion and HNF4G as its specific transcriptional regulator.
Differentially expressed genes associated with H. pylori infection in epithelial cells
Given that chronic infection with H. pylori causes gastric mucosal injury and initiates neoplastic transformation, we aimed to identify the DEGs associated with H. pylori in epithelial cells. H. pylori preferred to colonize the pit cells of gastric epithelial mucous layer, as indicated by IF staining of MUC5AC and H. pylori (Figure 4A), consistent with previous studies(23). By comparing the cell number of epithelial subsets, endocrine cells expressing GAST and SST were reduced in H. pylori -positive samples. Conversely, as previously observed enrichment in GC, enterocyte and tumor-like EPC subclusters were increased in H. pylori-positive tissues. These data also implied the close association significant differences in the regulation of different cell types by H. pylori in gastric carcinogenesis (Figure 4B). Differential gene expression analysis showed H. pylori -positive epithelial cells had relatively lower expression levels of genes involved in gastric acid and digestive enzymes secretion (e.g., PGC, LIPF, and GAST), but higher levels of genes that specifically overexpressed in metaplasia lesions (PHGR1, FABP1, ANPEP, KRT20, PRAP1 and DMBT1) and extracellular matrix OLFM4 oncogene (Figure 4C; Figure S3E and Table S4). In particular, OLFM4, a stem cell marker that could regulate cell adhesion(24), was further validated to be overexpressed in GC tissues compared to GS and IM tissues by RT-PCR and IHC staining (Figure 4D and Figure S3F), as consistent with previous reports(25). Notably, H. pylori -positive GC specimens exhibited higher levels of OLFM4 than H. pylori-negative subjects (Figure 4E). Public datasets analysis also confirmed the upregulation of OLFM4 in human stomach carcinoma (Figure S3G). In accordance with scRNA-seq, ANPEP, PRAP1, FABP1 and KRT20 were significantly augmented in IM tissues. In particular, they were expressed at higher levels in H. pylori-positive IM individuals, although without statistically significant differences (Figure 4F-I), indicating their potential as diagnostic markers for H. pylori -associated IM lesion.
H. pylori infection induced intestinal metaplasia in INS-GAS mice and initiated enterocyte features.
To determine the effect of H. pylori infection on the characteristics of enterocytes, transgenic INS-GAS mice which overexpress human gastrin and can develop gastric metaplasia and dysplasia with H. pylori infection were infected with H. pylori PMSS1 strain for 4 months. H. pylori colonization was confirmed by Warthin-Starry silver staining (Figure S3H). As shown in Figure 5A, PAS and H&E staining showed that the gastric mucosa of infected INS-GAS mice developed intestinal metaplasia lesion. Also, H. pylori infection resulted in more severe gastric epithelial defects and gland atrophy(26) (Figure 5B). IHC staining was further performed to detect the expression of FABP1 and KRT20, which are the marker genes for enterocytes subtype. As expected, there were significant increases in the levels of FABP1 and KRT20 in gastric mucosa with H. pylori infection, compared to uninfected mice (Figure 5C and D). In addition, multicolor immunostaining confirmed that FABP1 and KRT20 expression were significantly upregulated in the gastric mucosa of H. pylori infected mice (Figure 5E). Since we showed that HNF4G as a specific transcription factor for IM precancerous lesion, we further examined the expression in the gastric mucosa of infected and uninfected mice. As expected, H. pylori infection was observed to remarkably induce HNF4G expression (Figure 5F). Taken together, these in vivo data indicated that H. pylori infection could induce a gastric enterocyte cell feature, such as the upregulation of FABP1 and KRT20, and enhancement of the transcription factor HNF4G, which may be involved in the development of precancerous lesion of IM.
The heterogeneity of ECs and ECs-tumor-like EPC subtypes interactome predictions
ECs play a crucial role in tumorigenesis by governing angiogenesis to provide nutrients and oxygen for cell proliferation (27). Given the gradual increase in ECs proportion in gastric carcinogenesis cascade, we next aimed to detailly investigate the transcriptomic heterogeneity of ECs and to identify tissue-specific subpopulations. We obtained 6266 high-quality ECs in which unsupervised clustering revealed 7 distinct subclusters (Figure S4A). We identified 4 cell subtypes belonging to traditional vascular beds, i.e., venous ECs (VCAM1, SELP, ACKR1), lymphatic ECs (LYVE1, PROX1), capillary ECs (PLPP3, RGCC, CA4) and arteries ECs (SEMA3G, GJA4, GJA5) (Figure 6A and Figure S4B). A cluster of subpopulations was designated as tumor-like ECs due to remarkably increase in GC tissues, which highly expressed marker genes known to be enriched in angiogenesis and extracellular matrix (ECM) remodeling (LGALS1, COL4A1, COL4A2 and ESM1), as well as canonical smooth muscle cells marker (TAGLN) (Figure 6B-D). Another population Endo_3 (cluster 3) exhibited significantly lower UMI counts compared to others, with highly expressed lncRNAs (e.g., HES1, MALAT1, AC020916.1), was not be further analyzed in depth (Figure S4C). Overall, we observed that all EC subsets were largely enriched in GC tissues, compared with those in GS and IM tissues. Both venous and capillary populations accounted for higher percentage of all ECs, whereas lymphatic population were rare (Figure 6B). However, these populations manifested no significance between H. pylori -positive and -negative patients, except for tumor-like ECs. Tumor-like ECs were more abundant in GC samples, especially in H. pylori-positive samples (Figure S4D and E). Additionally, enriched pathways of the highly expressed genes in tumor-like ECs were involved in cell adhesion and PI3K/Akt signaling pathway (Figure 6E and F). Of note, genes enriched in these two pathways were greatly upregulated in GC tissues, especially in H. pylori -positive subgroup, suggesting the role of H. pylori infection in promoting angiogenesis (Figure 6G). Since GC originated from the abnormal changes in epithelial cells(28), and tumor-like EPC was the specific epithelial-derived subset in GC, we speculated that ECs and tumor-like EPC cluster communicate closely. Using CellChat(29), we found that endothelial subtypes dominantly drive outgoing signaling, while tumor-like EPC was the target for incoming signaling (Figure 6H). In detail, the COLLAGEN and VEGF signaling pathways were highly activated between their communications (Figure S4F and G). COL4A1 and COL4A2 were the most highly expressed ligand by endothelial subclusters, especially by tumor-like ECs, and CD44 and SDC44 as their receptors on the tumor-like EPC. In turn, the pro-angiogenic VEGF signaling pathway was mainly driven by VEGFA ligand from tumor-like EPC to VEGFR receptors (FLT1, KDR) from ECs (Figure 6I and J). We also independently revealed that COL4A1 and COL4A2, the secreted factors of COLLAGEN signaling pathways, were expressed at high levels in gastric malignancies compared with paired normal tissues by reanalyzing bulk RNA-seq gastric carcinoma GEO datasets (GSE 122401, GSE65801, GSE13195) and TCGA database (Figure S5A and B). However, only some RNA-seq data from public databases showed that the receptors SDC4 and CD44 were upregulated in gastric carcinomas (Figure S5C and D). Altogether, these findings highlighted the molecular signatures of GC tissue-specific EC subsets, and further predicted ECs and tumor-like EPC interactions that may contribute to gastric carcinogenesis microenvironment.
Distinct fibroblast cell subtypes in cancerous and non-cancerous samples
Fibroblasts are highly organized stromal cells embedded in the ECM. Accumulating evidence indicated high diversity of fibroblasts, which exhibited phenotypes and functions associated with specific-cell lineage(30, 31). To explore fibroblasts cell lineage states, we defined 11 subsets using unsupervised clustering method. Cluster 11 were mixed cells and was disregarded of further analysis (Figure S6A). Surprisingly, compared to non-cancerous tissues, we observed that fibroblasts in gastric carcinoma tissues exhibited completely different lineage states, hence named cancer-associated fibroblasts (CAFs) (Figure 7A). In detail, ECM fibroblasts (ECM_Fib; expressing COL5A1 and POSTN) and myofibroblasts (ACTG2) were characterized, mainly derived from GS and IM tissues. Swann cells was named due to specifically overexpression of S100B and PLP1(32). There were three CAFs subpopulations, consisting of antigen-presenting CAFs (apCAFs; expressing HLA-DRA and HLA-DRB1), epithelial-mesenchymal transition CAFs (EMT_CAFs; expressing KRT8 and KRT19), and inflammatory CAFs (iCAFs; expressing FBLN1, IGFBP6 and CXCL1) (Figure 7B; Figure S6B and C)(33). To further explore the gene set signature of CAFs, we next performed a gene set enrichment analysis (GSEA) and differential gene expression analysis by comparing gene functions of all CAFs (iCAFs, apCAFs and EMT_CAFs) versus other fibroblast subsets (ECM_Fib and Myofibroblasts) (Figure 7C and D). Of note, CAFs were enriched for gene sets involved in immune and inflammation-related pathways (IGFBP4, CXCL8, CXCL2), complement cascades (C3, C7), and Wnt signaling pathway (SFRP2, SFRP4), but exhibited lower expression of gene sets involved in metabolism (Figure 7C and D; Table S5). These results suggested the crucial role of CAFs in shaping tumor immune microenvironment.
Given the higher fraction of iCAFs in GC tissues (Figure 7E), we explored the characteristics of this subpopulation We observed that iCAFs displayed high expression levels of complement components (C3 and C7), extracellular regulators of Wnt signaling (SFRP2 and SFRP4)(34). Also, iCAFs showed largely increased expression of PLA2G2A, like the previously reported immune-infiltrating PLA2G2A+CAFs in breast cancer(35) (Figure S6C). Dual-color IF staining validated the abundance of iCAFs (COL1A1+FBLN1+ or COL1A1+C3+) in GC tissues (Figure 7F). We next investigated the dynamic transition of fibroblast cells. In pseudotime analysis, we removed schwann cells, due to their completely different characteristics. As expected, this analysis indicated that ECM_Fib and myofibroblasts were at the beginning of the trajectory path, whereas all kinds of CAFs were at terminally differentiated states (Figure S6D). Of note, iCAFs appeared to arise from ECM_Fib cell clusters (Figure 7G). Moreover, according to the regulatory network using SCENIC analysis, NFATC1-driven regulons and transcriptional factor expression were inferred to be specific and enriched in iCAF subsets (Figure S6E and F).
Fibroblast-tumor-like EPC populations interactome prediction and identification of DEGs in fibroblast cells associated with H. pylori
Given the increased attention of fibroblasts in multicellular interaction in tumor microenvironment(36), we next predicted the interaction between fibroblasts and tumor-like EPC, to explore the mechanism through which microenvironment favors gastric tumorigenesis. Interestingly, CellChat analysis revealed that fibroblast subsets, especially iCAFs, dominantly drive outgoing signaling, which means secreting more ligands. The corresponding incoming receptors were expressed in tumor-like EPC (Figure 7H). In agreement with the analysis of ECs-tumor-like EPC interaction, the COLLAGEN signaling members, such as COL1A1 and COL1A2, as well as Fibronectin 1(FN1) signaling, were highly ranked ligands expressed by fibroblasts. Their corresponding receptors in tumor-like EPC were similar, such as CD44 and SDC4. In addition, the APP/CD74 and FGF7/FGFR2 also contributed to fibroblasts-tumor-like EPC interactions (Figure 7I).
Existing evidence indicates the critical role of H. pylori on gastric epithelial cells or macrophages, stromal cells response to H. pylori remains unclear. Given that the high heterogeneity of fibroblast cells in gastric malignant transformation, we analyzed the effect of H. pylori on fibroblast cell composition. Of note, H. pylori -positive gastric specimens showed increased percentage of CAFs, as indicated by apCAFs, EMT_CAFs and iCAFs, whereas relatively reduced ECM_Fibs (Figure 7J). Identification of DEGs between H. pylori-positive and -negative groups showed that several ligands and receptors for cellular signaling pathways were the top DEGs, such as FN1, C3, IL1B, CXCL8 and CD74 (Figure 7K and Table S6). They were also significantly overexpressed at bulk RNA levels in GC tissues, particularly in H. pylori -positive GC tissues (Figure S7A). We further queried bulk RNA-seq datasets from the TCGA and GEO databases, which also showed higher expression of FN1, CXCL8, C3, but not IL1B, in gastric cancerous tissues than that in paired adjacent normal tissues (Figure S7B-E). Consistently, we performed RT-PCR analysis in an expanded sample size to confirm the elevated mRNA levels of FN1 and CXCL8 in GC tissues, particularly those associated H. pylori infection (Figure 7L and M). Altogether, these findings suggest that H. pylori infection may drive the interaction between fibroblasts and epithelial cells in GC microenvironment via activation of FN1 signaling network.
Tissue-specific patterns of myeloid cells in H. pylori -related gastric carcinogenesis
To characterize the subsets of myeloid cells that showed markedly increase in GC, we identified five common cell subsets based on canonical cell markers, consisting of macrophage (CD68, CD163, MRC1), Monocyte (VCAN, LYZ), type 1 conventional dendritic cells (cDC1; expressing XCR1 and CLEC9A), type 2 cDC (cDC2; expressing CD1C and CLEC9A), and mature dendritic cells (mDC, expressing LAMP3) (Figure 8A and Figure S8A). A specific cluster of TAM was designated, based on highly expressed marker genes of macrophages and enriched in tumor tissues (Figure 8A and B). Cluster 4 were low quality cells with low UMI contents, and cluster 8 were mixed cells, both of which were disregarded of further analysis (Figure S8B). Of note, total myeloid cells number were much higher in GC tissues, especially H. pylori -infected GC tissues, than that in GS and IM tissues (Figure 8B and Figure S8C). Except for TAM, monocytes also showed a relatively high proportion in GC, whereas the cDC2 subtype tended to gradually decrease in gastric cancerous cascade (Figure S8D). The complement components (C1QA, C1QB and C1QC), and cholesterol metabolism genes (APOE and APOC1) were enriched in macrophages, as described previously(37). Monocytes highly expressed a set of neutrophil-associated genes (S100A8, S100A9), pro-inflammatory cytokine IL1B and inflammasome NLRP3 (Figure 8C). Dual-color IF staining further validated the presence of TAM subsets, as indicated by co-expression of CD68 and CXCL3. Likewise, monocytes were verified by combined immunostaining of CD68 and S100A9 (Figure 8D). GSVA analysis revealed enrichment of monocyte gene sets involved in metabolism pathways, such as fatty acid biosynthesis, bile secretion. The gene sets of TAMs were involved in IL-17, TNF pathway and cytokines interactions (Figure S8E). To clarify M1 and M2 polarization of TAMs, we calculated pro-and anti-inflammatory scores using related gene sets (Table S7). In agreement with previous studies(15), TAMs exhibited dominant M2-like phenotype, suggesting its immunosuppressive role in tumor microenvironment (Figure S8F). Pseudotime analysis using monocle 2 suggested that both TAM and monocyte subsets in neoplastic tissues were likely derived from macrophages in GS and IM tissues (Figure 8E).
The interactome network between myeloid and epithelial cells was further studied. Interestingly, the outgoing and incoming signaling patterns were obviously observed between myeloid cell subpopulations in tumor microenvironment, compared to myeloid-epithelial cells. Macrophages, TAMs, monocytes and cDC2 cells not only secreted cytokines, but also expressing corresponding receptors (Figure S8G). To identify myeloid cell-derived genes regulated by H. pylori, we compared the DEGs from myeloid cells between H. pylori -positive and H. pylori -negative tissues. Notably, myeloid cells from H. pylori -positive tissues displayed high expression of CXCL3, TIMP1, CD14 and CTSL genes (Figure 8F and Table S8). Further analysis of gastric carcinoma bulk RNA-seq datasets from GEO and TCGA database showed that CD14, CTSL and TIMP1 were significantly upregulated in GC tissues compared to the paired adjacent non-cancerous tissues (Figure 8G). Of note, H. pylori -positive patients showed even higher expression (Figure 8H and Figure S9A-C). Likewise, these results were further validated by qRT-PCR analysis with additional samples (Figure 8I).
T, B and plasma cell clustering and subtype analysis
We next applied unsupervised clustering of immune T and B cells to reveal their intrinsic substructures. A total of 14 subclusters of T cells were classified based on well-known markers CD4 and CD8 (Figure 9A). The first CD4 cluster CD4_IL7R highly expressed Naïve marker IL17R that is critical for T cells activation and development(38). The second cluster CD4_PDCD1, showing gradually reduced proportion in gastric carcinogenesis cascade, specifically expressed T follicular helper markers, such as TOX2 and PDCD1 (39, 40). The final cluster CD4_FOXP3 was well-known Treg cells with immunosuppressive effects based on the dominant CTLA4, FOXP3 expression(41) (Figure 9A-C). According to signature genes expression and top-ranked genes, CD8+T cells comprised 7 subtypes: CD8_GZMK and CD8_GNLY cells highly expressed cytotoxicity-associated genes (GZMK, GZMB). CD8_IFNG showed activated resident memory-related markers (CCL4, IFNG, CCL3)(42, 43); CD8_CAPG exhibited elevated expression of CAPG and novel immune checkpoint KLRC1(44). CD8_IL17A (IL17A, KLRB1) cells showed similar activation mode and cytokines secretion with TH17 cells(45). CD8_XCL1 cells showed enriched expression of TRDC, XCL1, XCL2, and CD160 genes. The CD8_MKI67 cluster was designated based on uniquely expression of MKI67(40, 46) (Figure 9A, B and D). In addition, three clusters were characterized as DNT (HSPA1A, HSPA1B), C14_KRT81 (KRT81) (47)and NK cells (NKG7).
We found that Treg cells represent higher proportion in GC samples than that in GS or IM sample (Figure 9E-F). When assessing the role of H. pylori on T cell heterogeneity, we observed a trend of increased CD8_IL17A subset in H. pylori -positive (versus H. pylori -negative) GS tissues, consistent with previous reported viewpoints regarding the role of IL17 cytokine in driving protective immunity against pathogen and promoting inflammatory response(45) (Figure 9F). Combined IF staining labeled CD4 and FOXP3 also revealed Treg cells to be enriched for GC, as previously reports where Treg cells are increased in several solid malignancies and associated with poor prognosis(48, 49) (Figure 9G). The trajectory analysis using monocle 3 indicated that CD4+T cells originated from CD4_IL7R cells and then differentiated into CD4_PDCD1 and CD4_Treg cells (Figure 9H), as previously described that the sustained expression of IL7R in the naïve states is critical for the maintenance and development of all T subpopulations(38). As for CD8+T cells, we observed a notable enrichment of the CD8_MKI67 subpopulation in GC tissues (Figure 9F). Multi-color IHC staining for CD3, CD8 and Ki67 further confirmed the presence of CD8_MKI67 cells in gastric carcinoma tissues (Figure 9I), supporting the previous findings indicating the proliferative function of T cells in tumors(42). Taken together, these findings suggested the potential functional role of Treg cells, and IL17A-expressing and cytotoxic characteristics CD8+T cells in H. pylori -associated gastric neoplasm.
The re-clustering of B and plasma cells revealed 6 populations, including active B (CD69), Naïve B (TCL1A, IL4R), Pre-pro B (CD7, IL7R), IgA plasma (IGHA1, IGHA2), IgG plasma (IGHG1, IGHG2), and IgM plasma cells (IGHM) (Figure S10A and B). Overall, B and plasma cells defined clearly separate clusters, with the higher abundance in GC samples (Figure S10C). The relative cellular proportion of plasma cells various across tissues. IgA+ plasma cells were highly prevalent in GS/IM, while IgG+ plasma subsets were enriched in GC samples (Figure S10D), in agreement with previous work(50). Only slight difference was existed between H. pylori -positive and negative gastric tissues (Figure S10E). Nevertheless, IgG+ plasma population appears to be more abundant in H. pylori -infected individuals (Figure S10F).
Multicellular communication network shapes H. pylori -driven carcinogenesis microenvironment
To understand the cellular signaling pathways driving gastric carcinogenesis cascade, we used CellChat to infer autocrine and paracrine signaling network among all cell types, including epithelial cells, stromal cells and immune cells. The analysis suggested that stromal cells were the dominant senders of signaling, while immune T and myeloid cells were the receivers (Figure 10A). ECs, as well as fibroblasts, also showed relatively strong interaction with epithelial cell subpopulations, pit mucous cells and tumor-like EPC (Figure 10B and Figure S11A). The COLLAGEN signaling component, which included the classic marker COL1A1, was mainly driven by fibroblasts and almost implicated in communication with all other cell types (Figure S11B). Another fibroblast-derived C3 signaling pathway uniquely recruited to myeloid cells through C3-C3AR1 ligand-receptor pair (Figure S11C and D). In line with the previous results, the VEGF signaling pathway was dominant in ECs and tumor-like EPC interaction, as well as ECs-myeloid cells interaction (Figure S11E). Additionally, we found that epithelial-derived pit mucous cells secreted IL1B and that recruited ILR2-expressing myeloid cells (Figure S11D and F). Importantly, the communication network of all cell subtypes inferred that iCAFs subsets with high expression in GC dominantly drive outgoing signaling (Figure 10C). Due to the significant increase of myeloid cells in GC and the immunomodulatory role of iCAFs, we further focused on exploring the interaction between iCAFs and myeloid cell subsets. The analysis suggested that iCAFs primarily secreted cytokines and were recruited to myeloid cells, particularly the TAM subpopulations (Figure 10D). The COL1A1, MIF, C3, and APP signaling regulatory networks were mainly responsible for the communication between iCAFs and myeloid cells (Figure 10E). Based on the exclusive enrichment of iCAFs and myeloid cells, these findings suggest their interaction as an important decisive factor for poor prognosis in GC.
We then compared the overall communication probability in H. pylori positive and negative samples. Notably, H. pylori -infected gastric tissues exhibited a significant increase in the number and strength of inferred intercellular communications (Figure 10F). Moreover, our ligand-receptor pairs analysis predicted several enriched pathways in H. pylori -positive specimens, such as SPP1, TNF, THY1 and complement pathways, while GAS and IL10 signaling networks were more abundant in H. pylori -negative specimens (Figure 10G). We subsequently focused on TNF pathways that has been implicated in inflammatory diseases(51). The analysis predicted TNF-TNFRSF1A (TNFR1) was found to be highly active in H. pylori -positive samples, suggesting the important role of TNF-TNFR1 pathway in the pathogenesis of H. pylori infection (Figure 10H and I). These results highlighted the putative importance of intercellular crosstalk in H. pylori -induced gastric neoplastic transformation.