Gastric cancer ranks as a major cancer worldwide, with the fifth highest incidence and the fourth highest mortality rate[16]. The classical theory proposed by Correa suggests that the formation of gastric cancer is a multi-stage and multi-step process[6]. Early gastric cancer, being confined to the mucosal layer or submucosal superficial layer, has a lower risk of lymph node metastasis and, thus, a higher cure rate[17]. However, early gastric cancer usually does not cause symptoms[18] and lacks specific diagnostic biomarkers[19], leading to most patients being diagnosed at an advanced stage of gastric cancer[20]. Currently, screening for early gastric cancer relies primarily on detailed gastroscopic examination under high-definition endoscopy and targeted biopsy[21]. The reduction in the incidence and mortality rates of gastric cancer in developed countries such as Japan and South Korea has proven the effectiveness of this screening method[22, 23]. However, for populous developing countries like China, the strategy of using high-definition endoscopy and biopsy for early gastric cancer screening faces challenges due to high medical costs and low population coverage[24]. Therefore, screening for biomarkers with specific diagnostic value has significant clinical implications.
In this study, we downloaded and analyzed GSE55696 mRNA-seq data, identifying 18,352 differentially expressed genes between the chronic gastritis group and the early gastric cancer (high-grade gastric mucosal intraepithelial neoplasia) group. By filtering these genes with a Log Fold Change of 1 and an Adj P value < 0.05, we ultimately selected 1,777 genes with significant differential expression, including 776 upregulated and 1,001 downregulated genes. Weighted correlation network analysis (WGCNA) is a systems biology method used to describe gene association patterns between different samples, enabling the identification of highly co-variated gene sets[25]. Through WGCNA, we identified 2,520 core genes related to early gastric cancer. We then intersected these 1,777 significantly differentially expressed genes with the 2,520 core genes identified by WGCNA, obtaining 755 genes strongly related to early gastric cancer. GO and KEGG analyses were employed to further perform functional clustering analysis and pathway analysis on the intersected genes. GO analysis indicated that the intersected genes are mainly involved in biological processes such as response to xenobiotic stimulus, hormone transport, hormone secretion, and peptide transport; primarily targeting the apical part of the cell and apical plasma membrane; and participating in molecular functions including oxidoreductase activity acting on the CH-OH group of donors, with NAD or NADP as acceptor, and monooxygenase activity. KEGG results showed that the intersected genes are mainly related to signaling pathways regulating the pluripotency of stem cells, hepatocellular carcinoma, and retinol metabolism.
Mendelian randomization analysis is an important method for verifying causal relationships. By conducting Mendelian randomization analysis on the ten most relevant core genes selected through the CytoHubba plugin, we discovered a genetic causal relationship between the core gene CLU and early gastric cancer. Clusterin protein, first identified and isolated in rat testes in 1979[26], has been found to be ubiquitously present in almost all bodily fluids and the intracellular matrix, performing a variety of biological functions[27]. Named CLU for its cell aggregation function in vitro[28], it has been found that humans have at least three isoforms of CLU[29]. Under certain stress conditions, immature Clusterin can be converted into a mature or nuclear form of about 55kDa and relocate to the nucleus, exerting pro-apoptotic effects through a caspase 3-dependent pathway[30]. The function of cytoplasmic Clusterin is more complex; under certain stress conditions, pre-secreted clusterin can bind with GRP87 (Bip) in the endoplasmic reticulum to increase its stability, then enter mitochondria to inhibit the formation of the Bax-Bak complex, thereby exerting anti-apoptotic effects[31]. Secreted clusterin (sCLU) has been proven to be an 80kDa glycoprotein[32], an important extracellular chaperone molecule. It plays protective roles in cells and tissues by clearing cell debris and misfolded proteins, inducing cell survival and proliferation pathways[33], and is implicated in the development of diseases such as Alzheimer's[34], atherosclerosis, and malignant tumors[35].
Previous studies have shown that Clusterin acts as either an oncogene or a tumor suppressor gene in different cancers, indicating that the CLU gene plays a dual role in tumorigenesis[36]. Reports suggest that overexpression of CLU in gastric cancer is associated with lymph node metastasis, tumor invasion, and high tumor stage[37]. However, literature on the value of blood-secreted Clusterin as a biomarker for gastric cancer has yielded contradictory results[38]. In our study, we found that CLU is underexpressed in early gastric cancer. We hypothesize that this underexpression indicates a reduction in its function to clear cancer-related proteins, leading to tumor development. However, Mendelian randomization analysis revealed a genetic causal relationship between CLU and gastric cancer, identifying it as a risk factor for gastric cancer. This finding, which seems contradictory to the underexpression of CLU in early gastric cancer, further highlights the multifunctionality of CLU. It prompts further investigation into whether CLU exhibits differential expression and functionality between early and advanced stages of gastric cancer, or whether different isoforms of CLU perform different functions at these stages. This will likely be a focus of our research team's future studies.
Regardless, this is the first study to explore the causal relationship between CLU levels and gastric cancer using GWAS data for CLU (exposure) and early gastric cancer (outcome) through bidirectional Mendelian randomization analysis. This analysis indicates a genetic causal relationship between CLU and gastric cancer, identifying CLU as a risk factor for the development of gastric cancer. Despite the significance of this finding for the diagnosis of early gastric cancer, the study has its limitations. Firstly, although we found CLU to be underexpressed in early gastric cancer with significant weight, the Mendelian randomization analysis was ultimately conducted using gastric cancer GWAS data due to the lack of early gastric cancer GWAS data. Secondly, this study only utilized bioinformatics analysis to examine hub genes and potential functions related to the onset of early gastric cancer. Further research will require validation of CLU's specific mechanisms through cellular and animal model functional experiments, as well as exploration of whether differences exist in the expression and function of CLU at different stages of gastric cancer.