Prediction and pathogenesis of gallstone disease based on clinical metabolomics

doi:10.21203/rs.3.rs-3965901/v1

Download PDF

Research Article

Prediction and pathogenesis of gallstone disease based on clinical metabolomics

https://doi.org/10.21203/rs.3.rs-3965901/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Gallstone is a common disease of biliary system at present. At present, our research on its pathogenesis is still at a single analysis stage. In this study, we collected peripheral serum samples from patients with gallstones and non-biliary diseases, obtained the difference of metabolites in the peripheral blood of both sides through omics technology, and established a clinical risk prediction model for gallstones based on the clinical information of patients. The weighted gene co-expression network analysis was applied to find the metabolite set with high correlation with the pathogenesis of gallstone, and the KEGG enrichment analysis was used to find the relevant enrichment pathway, so as to obtain the metabolic pathway related to the pathogenesis of gallstone. Among them, Pantothenate and CoA biosynthesis, Linoleic acid metabolism path, Citrate cycle (TCA cycle), Glyoxylate and dicarboxylate metabolism are screened that they set with high correlation with the pathogenesis of gallstone. We found in combination with other studies that these highly correlated pathways increase the incidence of gallstones by up-regulating cholesterol synthesis raw materials, reducing cholesterol breakdown, and affecting glucose and lipid metabolism. Therefore, blocking or inhibiting the related pathways or metabolites of GSD formation has guiding significance for the clinical prevention and treatment of this disease.

Gallstone disease(GSD) is a common biliary system disease in the world. They are mainly cholesterol stones or cholesterol-based mixed stones and pigmented stones. GSD tends to occur in adults, with an overall incidence rate of 10%~15%, which increases with age after 40 years old, and the incidence rate of women is higher than men. With the development and application of imaging and endoscopic cholecystectomy, the diagnosis and treatment of GSD have achieved relatively good results. However, our research on the pathogenesis and prevention of gallstones is not very clear. At present, the widely accepted biochemical research on bile suggests that bile is a three-variable solution: the concentration ratio of bile acid, lecithin and cholesterol in bile affects the solubility of cholesterol, Under normal circumstances, bile salts and lecithin in bile can disperse cholesterol to form soluble microcapsules, which are less likely to precipitate and form stones. When the secretion of cholesterol in bile increases or the concentration of bile acids and phospholipids decreases, bile cholesterol is more likely to become supersaturated[1]. Cholesterol in the human body comes from food and its own biosynthesis, among which the liver is the most important organ for cholesterol synthesis, and also the only organ for cholesterol metabolism[2].

The raw material for cholesterol biosynthesis is acetyl-Coenzyme A, which adopts the synergistic action of more than 20 enzymes, such as HMG-CoA reductase, to synthesize cholesterol in cells and thus play its biological function. The liver is the major site of cholesterol biosynthesis, delivering both endogenously synthesized and exogenously obtained cholesterol to the blood stream as very low-density lipoprotein (VLDL). Excess cholesterol combines with apolipoprotein A-I (apoA-I) and switches to high-density lipoprotein (HDL)[3].In addition, cholesterol can produce various oxysterols (Oxysterols: old tale, new twists) through enzymatic and non-enzymatic pathways, some of which are further metabolized to bile acids [4] in the liver and discharged into the gallbladder as a component of bile.

Furthermore, gallbladder secreted mucin into the bile increases the bile viscosity, and mucin-glycoprotein gel is one of the most important known nucleation factors[5]. Therefore, when nucleation factors such as mucin-glycoprotein gel in bile under the supersaturated state accumulate, cholesterol in bile will continue to precipitate over time, thus forming cholesterol gallstones.

This is our current research on the pathogenesis of gallstones which is more focused on the single analysis stage, studying the effects of single gene[6] metabolite, lifestyle etc. such as, enterohepatic circulation, metabolic syndrome[7], obesity, diet[8], hormones[9, 10] etc. While the overall research on the pathogenesis of gallstones is lacking.

Currently, there have been many omics studies on gallbladder diseases. Some studies have found that gallstone related genes are gradually transformed into gallbladder cancer-related genes through methylation and gradual changes in copy number through epigenome analysis of gallbladder stones and gallbladder cancer [11]; A research team used qPCR, immunohistochemistry and immunofluorescence techniques to analyze the Transcriptome data of duodenal biopsy samples from GSD patients and healthy people. Transcriptome technology was used to create a proximal small intestine model that plays a key role in the pathogenesis of GSD, and zinc and intestinal microbiota were found to play a significant role in the treatment of GSD[12].

Therefore, we collected clinical information and peripheral blood samples from GSD patients and NGSD patients, conducted non-targeted metabolomics testing on their peripheral blood samples. The clinical risk prediction model of GSD was constructed by combining its relevant clinical information and various indicators. Metabolic pathway closely related to the pathogenesis of GSD was investigated through the Weighted Correlation Network Analysis (WGCNA) algorithm. We can use these models to identify metabolites and metabolic pathways that are highly correlated with the onset of GSD. By intervening in these metabolites and metabolic pathways, we can better prevent and treat gallstone disease.

2.1 Study design

In this study, clinical information such as age, sex, BMI, history of alcohol and tobacco etc. and peripheral blood routine and biochemical indicators were collected based on medical records for GSD and non-GSD patients. The collected serum samples were tested to obtain the metabolites and its expression in the peripheral blood of the two groups. Based on the differences in clinical information and metabolite data between the two groups, we established a risk prediction model of GSD. Aiming at the differences in the composition of peripheral blood metabolites between the two groups of patients, we applied the WGCNA algorithm combined with clinical information, obtained the association map between the metabolite aggregation modules and clinical indicators, extracted the metabolites in the modules with high correlations with GSD onset for metabolic pathway KEGG enrichment analysis, so as to obtain the main metabolic pathway enriched in the modules with high incidence. Finally we analyzed the possible mechanisms of these metabolic pathways affecting the formation of GSD (Fig. 1).

2.2 Gallstone cases and clinical data

We collected basic information such as gender, age, height, weight, tobacco and alcohol history of patients admitted between August 1,2021 and December 31,2021. Clinical information such as blood routine, blood biochemistry and imaging indicators was also included. And patient preoperative or intraoperative peripheral serum samples were collected for subsequent non-targeted metabolomics testing.

2.3 Non-targeted metabolomics detection

Liquid chromatography-mass spectrometry (LC-MS) is an analytical technique involving the physical separation of target compounds (or analytes), followed by quality-based testing. It is physically separated by liquid chromatography according to the chemical composition or physical properties of the analyte in the sample solution, and then captured by the physical and chemical properties of the analyte (refractive index or absorbance, etc.). The magnetic field in the final region is separated according to its mass-charge ratio. After separation, the amount and type of ions can be collected and detected by various mass detectors[13].

We use the high resolution mass spectrometer (Q Exactive (Thermo Fisher Scientific, USA)) and the liquid chromatography-mass spectrometry (LC-MS) technology to collect the data of positive and negative ions from the peripheral blood serum samples of GSD patients and NGSD patients for non-targeted metabonomic detection[14], to explore the metabonomic composition and biological function of the samples. The expression amount of various metabolites in the peripheral serum of different patients was obtained, collected and used for subsequent data analysis and model making.

2.4 Data storage

We uploaded and stored the collected non targeted metabolomics data in the China National Center for Bioinformatics(CNCB)[15]. The link for storing and sharing the data is ”https://ngdc.cncb.ac.cn/omix/preview/PHnWmMyY”.

2.5 Construction of risk prediction model

The nomogram can simplify statistical prediction models to a single numerical estimate of event probabilities and tailor them to individual patient situations[16]. Currently, it is widely used in oncology and other medical aspects by integrating different information to predict the disease risk and prognosis[17]. We used nomogram to predict the impact of various clinical indicators and metabolites on the incidence of GSD in a single individual.

2.6 Pathway enrichment for the overall metabolomics data

Enrichment analysis typically involves analyzing a set of genes or metabolites at a functional node[18]. The principle is to test the distribution of the data or data set through the hypergeometric distribution (a discrete probability distribution). When the data set is distributed in a pathway, the data are considered to be enriched in the pathway[19].

We organized the non-targeted metabolomic detection data collected from the peripheral blood serum of the GSD and NGSD groups, annotated the metabolites extracted from the highly correlated modules using the human metabolome database (HMDB) and Kyoto Encyclopedia of Genes and Genomes (KEGG). Then we used the online tool "MetaboAnalyst" (https://www.metaboanalyst.ca/) to conduct metabolic pathway enrichment analysis and select pathways with high enrichment and significance for visualization processing. We aimed to observe the enrichment differences in metabolic pathways between the two groups of peripheral blood serum, in preparation for further detailed differential pathway analysis.

2.7 Principal component analysis

Principal component analysis (PCA) and latent structure orthogonal projection discriminant analysis (OPLS-DA) are statistical modeling tools. They are applied to distinguish the results of high-dimensional spectral measurements based on various instruments[20].

The principle of principal component analysis (PCA) is to try to recombine the original variables into a new set of unrelated comprehensive variables, and to extract the few sum variables to as much as possible reflect the information of the original variables according to the actual needs. It is widely used in demography, quantitative geography, molecular dynamics simulation, mathematical modeling, mathematical analysis and other disciplines. Currently PCA has been shown to exploit and demonstrate variability in omics data in omics studies[21].

We used SIMCA-P software for principal component analysis (PCA) and orthogonal partial least squares discriminant analysis (OPLS-DA) to perform metabolite difference analysis on metabolomic data obtained from non-targeted metabolomic tests. The differential metabolic data between the GSD group and the NGSD group were visualized using methods such as heatmaps and volcanic maps.

2.8 Biological network model

Weighted gene co-expression network analysis (WGCNA) is a systems biology method used to describe gene association patterns between different samples. It can be used to identify highly synergistic gene sets and identify alternate biomarker genes or therapeutic targets based on the cohesion of gene sets and the association between gene sets and phenotypes. Currently, WGCNA has been applied to many important studies, including genomics, metabolomics, proteomics and so on[22].

We plotted the WGCNA graph between metabolomics data and clinical information. The highly correlated co-expression metabolic data were divided into the same module. The different modules were distinguished by different color. We associated each module with the clinical phenotype to calculate the correlation coefficient between each module and the phenotype. We extracted metabolites from modules which is highly relevant to clinical phenotype for subsequent pathway enrichment analysis.

2.9 Pathway enrichment and pathway analysis

We annotated the metabolites extracted from the high correlation module based on human metabolome database (HMDB) and Kyoto Encyclopedia of Genes and Genomes (KEGG), conducted the metabolic pathway enrichment analysis through the online tool "MetaboAnalyst" (https://www.metaboanalyst.ca/)[23, 24]. We screened pathways with high enrichment significance and high correlation, and analyzed their physiological functions and mechanisms of action in GSD production through combining previous studies. In addition, we combined the pathway enrichment results of overall metabolomics data to screen and study the consistent enrichment pathways, in order to verify the impact of these pathways on the formation process of gallbladder stones.

3.1 Clinical information of patients

We retrospectively collected and summarized the information (Table 1) about the relevant metabolic information, tests and examination results during their hospitalization. Then the clinical data with significant differences were selected, and a summary table of relevant difference information (Table 2) was established. We constructed the subsequent GSD risk prediction model based on their metabolism-related tests and examination results.

Table 1

Clinical features of GSD cases.
Covariates	Characteristics
Covariates	GSD case	NGSD case
Sex(M/F)	23/41	42/34
Age(years)	50.9 ± 14.1	54.9 ± 15.2
BMI(kg/m2)	23.8 ± 2.8	22.8 ± 3.4
Smoke(Y/N)	9/55	14/62
Drink(Y/N)	3/61	10/66
HBsAg(pos/neg)	3/54	1/75
HCV-RNA(pos/neg)	0/57	0/76
TG	1.31(0.52–17.12)	1.19(0.46–6.5)
HDL-C	1.23(0.54–2.25)	1.21(0.55–2.21)
LDL-C	2.39(1.36–4.85)	2.31(1.00-5.46)
VLDL	0.70(0.21–1.32)	0.60(0.09–2.31)
FBG	5.36(3.38–13.57)	5.28(3.78–10.87)
SBP	123(90–168)	129.5(99–182)
DBP	76(56–105)	80.5(56–100)
ALT	21(6-454)	15(4-109)
AST	20(7-268)	18.5(9-108)
ALP	81(15–711)	77.5(39–257)
GGT	26(8-782)	19(6-231)
WBC	5.71(1.5-12.42)	6.15(3.67–18.2)
N%	59(6.78–84.2)	63.7(35–92)
PLT	219(67.3–365)	212(82–385)
NAFLD(No/light/severe)	33/23/8	61/15/0
liver·cirrhosis(Y/N)	1/63	1/75

Data in normal distribution was presented by mean ± SD, and data in nonnormal distribution was presented by median (IQR (interquartile range)). Abbreviations: BMI: body mass index; TG: triglyceride; HDL-C: High density liptein cholesterol; LDL-C: Low-Density Lipoprotein Cholesterol; VLDL: very low density lipoprotein; FBG: Fasting Blood Glucose; SBP: Systolic Blood Pressure; DBP: diastolic blood pressure; ALT: alanine aminotransferase; AST: aspartate aminotransferase; ALP: alkaline phosphatase; GGT: γ-glutamyl transpeptidase; WBC: white blood cell; N%: neutrophil; PLT: platelet; FLD: fatty liver disease.

We classified the collected clinical information, analyzed the difference of counting data (age, BMI, TG, etc.) through independent sample T-test. We used ratio comparison to statistically describe the two categorical variable (gender, smoking and alcohol history, fatty liver, etc.). Then We selected relevant indicators with a P value (OR value) less than 0.05 to establish a summary table of relevant difference information (Table 2).

Table 2

Summary of Differential Clinical Information
Covariates	Pr(> F)	P value
Age(years)	0.6865	0.0546
BMI(kg/m2)	0.4639	0.0361
SBP	0.698	0.0461
ALP	0.03515	0.0390
GGT	0.005381	0.0030
WBC	0.02455	0.0142
N%	0.02455	0.0004
NAFLD	**	0.0149
Sex(M/F)	**	0.0244

Compile a table for statistical analysis of clinical information with significant differences and their P-values. Pr (> F): The test P value in the analysis of homogeneity of variance. When it is greater than 0.05, the degree of dispersion of the data variance represented by it does not differ significantly, indicating homogeneity of variance; When it is less than 0.05, the degree of dispersion of the data variance represented by it differs greatly, indicating uneven variance.The mark “**” indicates that this type of data is a second categorical variable.The P-value is a parameter used to determine the difference in clinical information between the GSD and NGSD groups. When it is less than 0.05, there is a significant difference in this clinical indicator between the two groups.

3.2 The clinical risk prediction model

We applied the nomogram to predict the risk of gallstone disease in individuals, screened the clinical data of the patients through lasso regression analysis[25]. Furthermore, logistic regression was used for further screening. The clinical variables significantly related to the incidence rate of GSD in the lasso regression and logistic regression models were selected to be included in the risk prediction nomogram of GSD disease by using the R language pack “rms”. We reflected the risk degree of each clinical index in the process of GSD formation by assigning the score of each variable on the nomogram model (Fig. 2A,B,C). The AUC values of clinical nomogram were calculated respectively, and the ROC curve and calibration curve were made by using the R language pack "rms" and “ROCR” to verify the accuracy of the nomogram prediction model.

The AUC value of the prediction model is 0.812 (Fig. 2D,E), indicating that the model has strong predictive value. In this prediction model, the total risk points for most patients in this study ranged from 125 to 175. The indicators such as NAFLD severity, BMI size, and FBG have a higher risk points on the incidence of GSD.

3.3 Analysis of serum metabolomics data

We screened significantly different metabolites by using differential analysis of serum non-targeted metabolomics data results from GSD and NGSD patients. 245 differential metabolites were obtained by HMDB annotation. We conducted PCA analysis on these 245 metabolites (Fig. 3) and visualized them using volcanic maps.

Through principal component analysis, we found significant differences in the expression of these 245 metabolites in peripheral blood samples of GSD and NGSD patients. The PCA model (R2X1 = 0.243,R2X2 = 0.117) and OPLS-DA model (R2 = 0.12,Q2=-0.39) both showed significant separation between the two groups of samples(Fig. 3A,B,C).

3.4 The clinical risk prediction model for metabolomics data

Using the R language package "rms", we screened 245 annotated metabolites through lasso regression and Logistic regression models, and obtained 23 significantly related metabolite data. We visualized these metabolites by using heatmaps and pods (Fig. 4).Through the pod plot, we can visually see the difference in expression levels of various metabolites between the GSD group and the NGSD group(Fig. 4A). In the heatmap, 8 metabolites were highly enriched in NGSD group data and 15 metabolites were highly enriched in GSD group data (Fig. 4B).

We included these 23 metabolites in the risk prediction column chart for GSD diseases (Fig. 5). Generated a metabolite related GSD risk column chart(Fig. 5A,B,C).The AUC values of clinical nomogram were calculated. The ROC curve and calibration curve were made by using the R language pack "rms" and “ROCR” to verify the accuracy of the nomogram prediction model.

The AUROC value of the prediction model is 1 (Fig. 5D,E), indicating the model has strong predictive value. In this prediction model, the total risk points for most patients in this study ranged from 206 to 228. The indicators such as eplerenone (HMDB0014838), Citrusinine I (HMDB0030374) and oleamine (HMDB0002117) have a higher risk points on the incidence of GSD.

3.5 Pathway enrichment for the overall metabolomics data

We have compiled overall non-targeted metabolomics data for both groups of patients. We annotated the omics data of peripheral blood serum after non-targeted metabolomics testing using HMDB and KEGG, conducted the metabolic pathway enrichment analysis through the online tool "MetaboAnalyst" (https://www.metaboanalyst.ca/) to identify the pathway with significant differences and correlations (Fig. 6A,B).

In the pathway map, we found that the increased expression of Citrate cycle (TCA cycle)(P < 0.01) and Pantothenate and CoA biosynthesis pathways(P < 0.01) are significantly correlated with the incidence of GSD. The decreased expression of Glycerophospholipid metabolism pathway(P < 0.05) is significantly correlated with the incidence of GSD.

3.6 Weighted Metabolome Coexpression Network Analysis of GSD risk

We used R language software to draw WGCNA related module diagrams between 245 selected annotated metabolites with significant statistical differences and clinical data through the “WGCNA” language package. The co-expression status of the genes was analyzed by clustering, and 10 co-expression modules (Fig. 7A,B) were detected based on a predefined cut-off value of 0.75. We analyzed the topological overlap heatmap between each module, with light colored areas representing lower overlap. This analysis result indicates that the metabolites between each module are relatively independent(Fig. 7C). Modular trait plot was obtained through the correlation analysis of co-expression module and clinical phenotype. And we found the grey module (0.74 2e-24) and turquoise module (0.48 3e-09) with high correlation with GSD onset from the figure. In addition, the grey module also showed a significant correlation with the NAFLD (0.34 6e-05) (Fig. 7).

3.7 The module pathway enrichment analysis with high correlation

We selected gray modules (0.74 2e-24) and turquoise modules (0.48 3e-09) with high correlation with GSD pathogenesis, extracted the metabolites for KEGG enrichment analysis, and obtained the enrichment pathway map (Fig. 8, Fig. 9) in the high correlation modules.

In the metabolic pathway enrichment results of the gray module in the WGCNA model (Fig. 7) of GSD, the Pantothenate and CoA biosynthesis(P < 0.05) and Linoleic acid metabolism pathways showed a high degree of enrichment association with the incidence of GSD (Fig. 8A,B); In the Pantothenate and CoA biosynthesis pathway, Pantothenoylcysteine (C04079) in the peripheral blood of GSD patients shows an elevated state(Fig. 8C,E). In the Linoleic acid metabolism pathway, the levels of Linoleic acid (C01595) in the peripheral blood of GSD patients showed an upward trend(Fig. 8D,F).

The enrichment of turquoise module reflects the importance of Citrate cycle (TCA cycle)(P < 0.01) and Glyoxylate and dicarboxylate metabolism pathway(P < 0.01) (Fig. 9A,B). In the Citate cycle (TCA cycle), the levels of Pyruvic acid (C00022), cis Aconitiate (C00417), and 2-Oxoglutarate (C00026) in the peripheral blood of GSD patients are increasing(Fig. 9C,E). In the Glyoxylate and dicarboxylate metabololism pathway, the levels of Pyruvic acid (C00022), Acetate (C00417), and Tartrate semialdehyde (C01146) in the peripheral blood of GSD patients showed an upward trend(Fig. 9D,F).

4.1 The risk prediction model for clinical data

In our nomogram model of this clinical data, the risk of GSD was positively correlated with the degree of NAFLD and obesity, and it had a high risk score. In addition, FBG and TG also have certain risk scores in the onset of GSD. The result coincides with the related studies of Dietmar M Klass and Helen V Worthington etc.

In the study of Dietmar M Klass et al., the expression of transmembrane receptor beta3-adrenergic receptor (ADRB3) was increased in obese patients, and the high expression of ADRB 3 in the gallbladder can affect the contractile function of the gallbladder[26], thus affecting the excretion of cholesterol and other substances. At the same time, the elevated HMG-CoA reductase activity in obese patients leads to increased hepatic cholesterol production and impaired bile excretion in the gallbladder, which leads to cholelithiasis[5].

The inflammatory state caused by the elevation of FBG and TG phenotypes[2, 27] can affect the smoothness of the gallbladder endothelial cells, make the nucleation effect of the bile even more pronounced. The inflammatory reaction also can cause the contraction and relaxation of the gallbladder, leading to the secular stagnation of bile and the rapid formation of cholesterol crystals.

4.2 The clinical risk prediction model of metabolomics

Based on the nomogram model constructed by metabolomics data, we found that the low level of eplerenone (HMDB0014838) had a high risk score in the risk of GSD, while the high level of Citrusinine I (HMDB0030374) and oleamine (HMDB0002117) had a certain correlation with the incidence of gallstone (Fig. 5).

Eplerenone is a selective aldosterone receptor antagonist. It only acts on the glucocorticoid receptor, but not on the androgen and progesterone receptors. Through the antagonistic effect on the aldosterone receptor, it reduces the effect of aldosterone in the body, and achieves the effect of lowering blood pressure. At the same time, it has a significant effect on lowering blood pressure in overweight or obese hypertensive patients[28]. And multiple prospective studies have proved that the increase of aldosterone level is positively correlated with the development of insulin resistance[29] and metabolic syndrome[30]. Therefore, the aldosterone receptor antagonism of eplerenone can effectively limit the effects of aldosterone, weaken the development of insulin resistance and metabolic syndrome, and reduce the generation of pro-inflammatory state, thus attenuated the risk of GSD.

Citrusinine I is a new acridone alkaloid isolated from the root skin of citrus plants, which is mainly used in the antagonism of HSV-1, HSV-2 and herpesviruses[31]. Oleamide (cis-9-octadecylamide) is the prototype of long-chain primary fatty acid amide lipid messenger. Oleamide has certain effects on GABA receptor[32] and 5-HT receptor[33], and is related to its potential role in regulating sleep drive, depression, anxiety etc.

However, at present, there are few studies on Citrusinine I and Oleamide metabolism in GSD, so the mechanism of Citrusinine I and Oleamide in the serum of GSD patients is not clear. It will become the direction of our subsequent research and experiments.

4.3 Analysis of enrichment pathway of WGCNA model high correlation module.

Based on the metabolic pathway enrichment results (Fig. 8, Fig. 9) of WGCNA model of GSD, we collected previous studies of pathways with high correlation and enrichment and GSD risk.

4.3.1 Correlation study of gery module enrichment pathways

Among the gery modules enriched by the above metabolite pathway, the Pantothenate and CoA biosynthesis pathway has a high correlation with the occurrence of GSD (P < 0.01) (Fig. 8A). We found that the level of the intermediate molecule N-Pantothenoylcysteine (C04079) of this pathway has a significant increase in the peripheral blood of GSD patients (Fig. 8C). The enhanced metabolism of this pathway increases the level of CoA[34]. The acetyl coenzyme A (HMG-CoA) generated after acetylation is the raw material for cholesterol synthesis[3].

In addition, more metabolites in the gery module were enriched in the Linoleic acid metabolism pathway (Fig. 8A), and the Linoleic acid (C01595), the key metabolites in this pathway, showed a significant decrease in the peripheral blood levels of GSD patients (Fig. 8D). Linoleic acid is the human diet consumes the most unsaturated fatty acids. It is absorbed by the intestinal epithelial cells, into a member of the phospholipid (Linoleic Acid). The phospholipid has the effect of reducing blood cholesterol. Under normal circumstances of bile bile and lecithin, it can make cholesterol dispersed form soluble micromass and not easy to precipitate form stones. When the cholesterol secretion in bile increases or the concentration of bile acid and lecithin decreases, bile cholesterol becomes more susceptible to a supersaturated state to form stones[1].

4.3.2 Correlation study of turquoise module enrichment pathways

Among the turquoise modules enriched by the above metabolite pathway, the correlation between Glyoxylate and dicarboxylate metabolism pathway is high (P < 0.001)(Fig. 9A). The intermediate metabolites such as Pyruvate (C00022), cis-Asonate (C00417) and Tartronate semialdehyde (CO1146) are significantly increased in the peripheral blood serum of GSD patients. The enhanced metabolism of the Glyoxylate and dicarboxylate metabolism pathway will affect the metabolism of glucose and lipid, thus causing metabolic disorder and inflammatory reaction[35], aggravating the inflammatory changes of gallbladder endothelial cells, and finally increasing the risk of GSD.

What’s more, the Citrate cycle (TCA cycle) had a very significant correlation (P < 0.0001) and a certain enrichment (Fig. 9A) in our pathway enrichment of WGCNA model. We also found that Pyruvate (C00022), cis-Aconitate (C00417) and Oxoglutaric acid (C00026) were significantly increased in the peripheral serum of GSD patients. Based on its significant correlation and enrichment, we believed that it was significant to explore the mechanisms associated between TCA cycle and GSD pathogenesis in detail.

4.3.3 Study on the mechanism of TCA cycle and GSD pathogenesis

TCA cycle is the final common oxidation pathway of nutrients in the body, and it is the most important metabolic pathway for the energy supply of the body[36].TCA cycle derived from carbohydrates, fatty acids, amino acids and ketone bodies and produces NADH and FADH2 of the electron transport chain[37]. In this cycle, acetyl-CoA switches to CoA-SH, participating in the subsequent synthesis of acetyl-CoA. Therefore, the high expression of TCA cycle will promote the conversion between acetyl-CoA and CoA-SH, making acetyl-CoA and CoA-SH at a higher level. High levels of acetyl CoA can lead to an increase in histone acetylation levels, causing cells to enter a anabolic state. It can increase cholesterol synthesis and metabolism. The high expression of TCA cycle will also promote the conversion of sugars to lipids in the body, increase the lipid burden of the body, and affect the lipid structure in the body[38], finally aggravating the precipitation of cholesterol in bile (Fig. 10). Therefore, TCA cycle may affect the production of gallstones by accelerating cholesterol production and the conversion of sugars to lipids.

4.3.4 Correlations between gray module pathways and NAFLD

In the above WGCNA module trait diagram (Fig. 7D), there is some correlation between grey module and NAFLD (0.34 6e-05), so we discussed the correlation mechanism between each pathway and NAFLD based on the above pathway results.

According to the mechanistic function of the enriched pathways in the gery module, we can find that the high expression of Pantothenate and CoA biosynthesis pathway and the inhibition of Linoleic acid metabolism pathway will greatly increase the generation of lipids in patients, reduce the metabolism of lipids, and increase the serum lipids and visceral fat in patients, thus increasing the risk of NAFLD.

4.4 Conclusion

In conclusion, this metabolomics-based data analysis study shows that the risk of GSD has a high correlation with the disorder of lipid and glucose metabolism in vivo, among which the Citrate cycle (TCA cycle) pathway plays an extremely important role in this aspect. In addition, Pantothenate and CoA biosynthesis, Linoleic acid metabolism, Glyoxylate and dicarboxylate metabolism pathway and Eplerenone levels also have a great influence on the human lipid structure; Obesity, FBG and TG increases promote the formation of gallstones by affecting the human lipid structure level and inflammatory status.

4.5 Clinical significance and application of this study

In this study, by collecting clinical information of GSD and NGSD patients and conducting metabolomics analysis of peripheral serum, we made an ideal clinical risk prediction map (Fig. 2) and a metabolomics risk prediction map (Fig. 5), besides a biological network of GSD (Fig. 7) was established by WGCNA. It revealed the possible mechanism of elevated Eplerenone levels in the formation of gallstones, and analyzed the metabolomics data to find the metabolic pathways associated with the pathogenesis of GSD and their possible related metabolic mechanisms. The disclosure of these risk factors is instructive for the subsequent clinical prevention and treatment of GSD by inhibiting related pathways or metabolites.

4.6 Future trends and directions of related to GSD

This study mainly focused on the analysis of clinical indicators and serum metabolomics in GSD, without cell and animal experiments to verify whether the selected metabolites and pathways have a significant impact on the stone rate of gallstones. At present, we have few conclusions about the possible mechanism of Citrusinine I and Oleamide in the process of gallstone formation. Therefore, we should conduct experimental demonstration on various metabolic pathways and metabolites.

Our follow-up research should focus on applied omics technology to explore the pathogenesis of gallstones, combined with the previous research progress, gradually improve gallstone genomics, transcriptomic, metabolomics research, realize gallstones from single analysis to gene, transcription, metabolism aspects of integrity research.

The experimental methods and samples used in this article have been ethically approved and agreed upon.

All authors agree to publish this article.

The data used in the article has been publicly published in the China National Center for Bioinformatics（ https://ngdc.cncb.ac.cn/omix/preview/PHnWmMyY ）.

I declare that the authors have no competing interests as defined by BMC, or other interests that might be perceived to influence the results and/or discussion reported in this paper.

Funding details: This study is supported by Innovative Research Groups of National Natural Science Foundation of China (81721091), Major program of National Natural Science Foundation of China (91542205), National S&T Major Project (2017ZX10203205), Zhejiang International Science and Technology Cooperation Project (2016C04003), Zhejiang Provincial Natural Science Foundation of China (LY22H030008), Zhejiang medical S&T program (2021KY145, 2022KY752), and Startup fund for Advanced Talents from Zhejiang Shuren University.

Xiang Li and Zhengtao Liu Developed the main manuscript text, Xiang Li prepared figures . Shusen Zheng and Lei Geng provided funding and clinical samples, Xiaodan Yin collected data, and all authors reviewed the manuscript.

Author Contribution

Fremont-Rahl JJ, Ge Z, Umana C, Whary MT, Taylor NS, Muthupalani S, Carey MC, Fox JG, Maurer KJ: An analysis of the role of the indigenous microbiota in cholesterol gallstone pathogenesis. PloS one 2013, 8(7):e70657.
Itani M, Dubinsky TJ: Physical Chemistry of Bile: Detailed Pathogenesis of Cholelithiasis. Ultrasound quarterly 2017, 33(3):229-236.
Phillips MC: Molecular mechanisms of cellular cholesterol efflux. The Journal of biological chemistry 2014, 289(35):24020-24029.
Luo J, Yang H, Song BL: Mechanisms and regulation of cholesterol homeostasis. Nature reviews Molecular cell biology 2020, 21(4):225-245.
Reshetnyak VI: Concept of the pathogenesis and treatment of cholelithiasis. World journal of hepatology 2012, 4(2):18-34.
Wang HH, Afdhal NH, Gendler SJ, Wang DQ: Targeted disruption of the murine mucin gene 1 decreases susceptibility to cholesterol gallstone formation. Journal of lipid research 2004, 45(3):438-447.
Santana-Gálvez J, Cisneros-Zevallos L, Jacobo-Velázquez DA: Chlorogenic Acid: Recent Advances on Its Dual Role as a Food Additive and a Nutraceutical against Metabolic Syndrome. Molecules (Basel, Switzerland) 2017, 22(3).
Di Ciaula A, Garruti G, Frühbeck G, De Angelis M, de Bari O, Wang DQ, Lammert F, Portincasa P: The Role of Diet in the Pathogenesis of Cholesterol Gallstones. Current medicinal chemistry 2019, 26(19):3620-3638.
Brgelmann J, Ponce CB, Marcelain K, Roessler S, Goeppert B, Gallegos I, Colombo A, Sanhueza V, Morales E, Rivera MTJH: Epigenome‐wide analysis of methylation changes in the sequence of gallstone disease, dysplasia, and gallbladder cancer. 2020.
Cirillo DJ, Wallace RB, Rodabough RJ, Greenland P, LaCroix AZ, Limacher MC, Larson JC: Effect of estrogen therapy on gallbladder disease. Jama 2005, 293(3):330-339.
Brägelmann J, Barahona Ponce C, Marcelain K, Roessler S, Goeppert B, Gallegos I, Colombo A, Sanhueza V, Morales E, Rivera MT et al: Epigenome-Wide Analysis of Methylation Changes in the Sequence of Gallstone Disease, Dysplasia, and Gallbladder Cancer. Hepatology 2021, 73(6):2293-2310.
Riveras E, Azocar L, Moyano TC, Ocares M, Molina H, Romero D, Roa JC, Valbuena JR, Gutiérrez RA, Miquel JF: Transcriptomic profiles reveal differences in zinc metabolism, inflammation, and tight junction proteins in duodenum from cholesterol gallstone subjects. Scientific reports 2020, 10(1):7448.
Keifer DZ, Jarrold MF: Single-molecule mass spectrometry. Mass spectrometry reviews 2017, 36(6):715-733.
Seger C, Salzmann L: After another decade: LC-MS/MS became routine in clinical diagnostics. Clinical biochemistry 2020, 82:2-11.
Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023. Nucleic acids research 2023, 51(D1):D18-d28.
Iasonos A, Schrag D, Raj GV, Panageas KS: How to build and interpret a nomogram for cancer prognosis. Journal of clinical oncology : official journal of the American Society of Clinical Oncology 2008, 26(8):1364-1370.
Balachandran VP, Gonen M, Smith JJ, DeMatteo RP: Nomograms in oncology: more than meets the eye. The Lancet Oncology 2015, 16(4):e173-180.
Chicco D, Jurman G: A brief survey of tools for genomic regions enrichment analysis. Frontiers in bioinformatics 2022, 2:968327.
Mubeen S, Tom Kodamullil A, Hofmann-Apitius M, Domingo-Fernández D: On the influence of several factors on pathway enrichment analysis. Briefings in bioinformatics 2022, 23(3).
Worley B, Powers R: PCA as a practical indicator of OPLS-DA model reliability. Current Metabolomics 2016, 4(2):97-103.
Jolliffe IT, Cadima J: Principal component analysis: a review and recent developments. Philosophical transactions Series A, Mathematical, physical, and engineering sciences 2016, 374(2065):20150202.
Zhao W, Langfelder P, Fuller T, Dong J, Li A, Hovarth S: Weighted gene coexpression network analysis: state of the art. Journal of biopharmaceutical statistics 2010, 20(2):281-300.
Pang Z, Chong J, Zhou G, de Lima Morais DA, Chang L, Barrette M, Gauthier C, Jacques P, Li S, Xia J: MetaboAnalyst 5.0: narrowing the gap between raw spectra and functional insights. Nucleic acids research 2021, 49(W1):W388-w396.
Pang Z, Zhou G, Ewald J, Chang L, Hacariz O, Basu N, Xia J: Using MetaboAnalyst 5.0 for LC-HRMS spectra processing, multi-omics integration and covariate adjustment of global metabolomics data. Nature protocols 2022, 17(8):1735-1761.
Li Z, Sillanpää MJ: Overview of LASSO-related penalized regression methods for quantitative trait mapping and genomic selection. TAG Theoretical and applied genetics Theoretische und angewandte Genetik 2012, 125(3):419-435.
Klass DM, Lauer N, Hay B, Kratzer W, Fuchs M: Arg64 variant of the beta3-adrenergic receptor is associated with gallstone formation. The American journal of gastroenterology 2007, 102(11):2482-2487.
Worthington HV, Hunt LP, McCloy RF, Ubbink JB, Braganza JM: Dietary antioxidant lack, impaired hepatic glutathione reserve, and cholesterol gallstones. Clinica chimica acta; international journal of clinical chemistry 2004, 349(1-2):157-165.
Adachi H, Kakuma T, Kawaguchi M, Kumagai E, Fukumoto Y: Effects of eplerenone on blood pressure and glucose metabolism in Japanese hypertensives with overweight or obesity. Medicine 2019, 98(15):e14994.
Kumagai E, Adachi H, Jacobs DR, Jr., Hirai Y, Enomoto M, Fukami A, Otsuka M, Kumagae S, Nanjo Y, Yoshikawa K et al: Plasma aldosterone levels and development of insulin resistance: prospective study in a general population. Hypertension (Dallas, Tex : 1979) 2011, 58(6):1043-1048.
Ingelsson E, Pencina MJ, Tofler GH, Benjamin EJ, Lanier KJ, Jacques PF, Fox CS, Meigs JB, Levy D, Larson MG et al: Multimarker approach to evaluate the incidence of the metabolic syndrome and longitudinal changes in metabolic risk factors: the Framingham Offspring Study. Circulation 2007, 116(9):984-992.
Yamamoto N, Furukawa H, Ito Y, Yoshida S, Maeno K, Nishiyama Y: Anti-herpesvirus activity of citrusinine-I, a new acridone alkaloid, and related compounds. Antiviral research 1989, 12(1):21-36.
Laposky AD, Homanics GE, Basile A, Mendelson WB: Deletion of the GABA(A) receptor beta 3 subunit eliminates the hypnotic actions of oleamide in mice. Neuroreport 2001, 12(18):4143-4147.
Boger DL, Patterson JE, Jin Q: Structural requirements for 5-HT2A and 5-HT1A serotonin receptor potentiation by the biologically active lipid oleamide. Proceedings of the National Academy of Sciences of the United States of America 1998, 95(8):4102-4107.
Slyshenkov VS, Rakowska M, Wojtczak L: Protective effect of pantothenic acid and related compounds against permeabilization of Ehrlich ascites tumour cells by digitonin. Acta biochimica Polonica 1996, 43(2):407-410.
Song S: Can the glyoxylate pathway contribute to fat-induced hepatic insulin resistance? Medical hypotheses 2000, 54(5):739-747.
Akram M: Citric acid cycle and role of its intermediates in metabolism. Cell biochemistry and biophysics 2014, 68(3):475-478.
Martínez-Reyes I, Chandel NS: Mitochondrial TCA cycle metabolites control physiology and disease. Nature communications 2020, 11(1):102.
Lee JV, Carrer A, Shah S, Snyder NW, Wei S, Venneti S, Worth AJ, Yuan ZF, Lim HW, Liu S et al: Akt-dependent metabolic reprogramming regulates tumor cell histone acetylation. Cell metabolism 2014, 20(2):306-319.

No competing interests reported.

Download PDF

Editor assigned by journal
29 May, 2024
Editor invited by journal
03 Mar, 2024
Submission checks completed at journal
03 Mar, 2024
First submitted to journal
17 Feb, 2024

You are reading this latest preprint version

Prediction and pathogenesis of gallstone disease based on clinical metabolomics

Status:

Version 1

Abstract

Figures

1. Introduction

2. Methods

2.1 Study design

2.2 Gallstone cases and clinical data

2.3 Non-targeted metabolomics detection

2.4 Data storage

2.5 Construction of risk prediction model

2.6 Pathway enrichment for the overall metabolomics data

2.7 Principal component analysis

2.8 Biological network model

2.9 Pathway enrichment and pathway analysis

3. Results

3.1 Clinical information of patients

3.2 The clinical risk prediction model

3.3 Analysis of serum metabolomics data

3.4 The clinical risk prediction model for metabolomics data

3.5 Pathway enrichment for the overall metabolomics data

3.6 Weighted Metabolome Coexpression Network Analysis of GSD risk

3.7 The module pathway enrichment analysis with high correlation

4.Discussion

4.1 The risk prediction model for clinical data

4.2 The clinical risk prediction model of metabolomics

4.3 Analysis of enrichment pathway of WGCNA model high correlation module.

4.3.1 Correlation study of gery module enrichment pathways

4.3.2 Correlation study of turquoise module enrichment pathways

4.3.3 Study on the mechanism of TCA cycle and GSD pathogenesis

4.3.4 Correlations between gray module pathways and NAFLD

4.4 Conclusion

4.5 Clinical significance and application of this study

4.6 Future trends and directions of related to GSD

Declarations

Author Contribution

References

Additional Declarations

Status:

Version 1