Obesity is a significant cause of many diseases and is an important risk factor for human health. Reducing the incidence of obesity can help improve the incidence of metabolic diseases and other related diseases, such as tumors and cardiovascular diseases. At present, the molecular mechanisms underlying the pathophysiology of obesity are still unclear, and there are no complete treatment guidelines for clinical treatment. Therefore, it is crucial to explore the genes associated with obesity. The main genes known to be associated with obesity pathogenesis are C/EBPα, PPAR, SREBP, etc., regulating a wide range of downstream genes such as fatty acid synthesis/metabolism[12–14]. In addition, studies have shown that mitochondrial autophagy, inflammation, apoptosis and other related pathways also regulate obesity. Therefore, the biomarkers of obesity are complex and diverse, and need to be fully explored and explored. In this study, the preliminary analysis showed that there were several differential genes (DIFF) between healthy and obese population samples, and they were enriched to several signaling pathways, such as "Aminoacyl-tRNA biosynthesis", "Butanoate metabolism" were significantly activated; "p53 signaling pathway", "Renin-angiotensin system", "Staphylococcus spp. "Staphylococcus aureus infection" signaling pathways were significantly inhibited. The existing findings show that the p53 signaling pathway regulates lipid homeostasis, including lipid transport and storage, fatty acid and cholesterol biosynthesis, and fatty acid desaturation, etc[15]. The Renin-angiotensin system plays an important role in the development of obesity, and the local Renin-angiotensin system is activated in the obese state[16]. The Renin-angiotensin system is activated in the obese state, thereby regulating obesity and inflammation.
To further identify key genes and pathways in the pathogenesis of obesity, we performed WGCNA, identified gene modules that differed significantly between the two groups, and identified 45 major genes that play an important role in the development of obesity by screening the intersection of WGCNA and DIFF, which were analyzed by GO and KEGG. the results of GO analysis showed that the major genes were associated with KEGG was enriched for Signaling pathways regulating pluripotency of stem cells, Ovarian steroidogenesis, Insulin secretion and other pathways. Insulin resistance is closely related to obesity[17, 18]. Insulin sensitivity decreases with increasing body fat content, while insulin resistance increases, further leading to the development of obesity. It has also been shown that altered levels of adipokines (e.g. leptin, lipocalin, etc.) in the obese state affect steroidogenesis and synergize with insulin resistance and elevated inflammatory status to affect oocytes and ovaries[19]. In addition, stem cells are closely associated with adipogenesis[20]. There are multiple mechanisms of excessive fat accumulation in obese patients, of which overproduction of adipocytes is an important mechanism of influence. Adipose tissue contains a large number of pluripotent stem cells, called adipose-derived stem cells (ASCs), capable of developing into mature adipocytes. The enhanced adipogenesis in obese people may be due to the enhanced ability of ASCs in obese people to undergo adipogenesis and differentiation. In summary, this study found that intersecting genes may contribute to the development of obesity through various pathways such as cell differentiation and regulation of multiple endocrine factors and hormone levels.
Second, we analyzed 45 intersecting genes of WGCNA and DIFF by LASSO, RandomForest, and SVM-REF, and the results obtained from the three analyses were taken to show that XLOC_004699, RIMBP2, COX6B2, OR5T1, RXFP2, XLOC_003676, XLOC_ 013038, VAX1, Q07610, XLOC_011515, PTPN3 and 11 other genes were the major genes for obesity. Subsequently, the ROC curves of the 11 major genes were analyzed and their expression differences in healthy population samples and obese population samples were compared, and finally the enrichment analysis of the major genes was performed. The results showed that eight genes, including XLOC_003676, XLOC_011515, VAX1, Q07610, RIMBP2, OR5T1, XLOC_013038 and PTPN3, had decreased expression in the obese population; three genes, including XLOC_004699, COX6B2 and RXFP2, had increased expression in the obese population. The results of existing studies showed that COX6B2 was closely associated with pancreatic cancer development and metastasis, and was confirmed to be differentially expressed between groups in a study of colonic gene expression in obese rats in which flaxseed polysaccharide altered lipid metabolism and energy metabolism, indicating that it may have a role in affecting lipid metabolism[21]. The main functional enrichments involved in differentially expressed genes include PEROXISOME, BILE_ACID_METABOLISM, OXIDATIVE_PHOSPHORYLATION, and FATTY_ACID_METABOLISM. Peroxisomes are closely associated with obesity and play an important role in lipid metabolism, including fatty acid oxidation and plasma protein synthesis, and can interact with mitochondria, thereby regulating mitochondrial dynamics and lipid thermogenesis[22]. Its associated peroxisome proliferator-activated receptor is a key regulator of adipocyte differentiation and regulates insulin and adipokine production and secretion. Bile acids are important physiological agents for nutrient absorption and biliary secretion of lipids[23]. Bile acids regulate hepatic lipid metabolism through a variety of molecular signals to maintain metabolic homeostasis. A variety of factors, such as a high-fat diet, can alter bile acid metabolism, leading to metabolic disorders and obesity. Fatty acids play an important role in obesity and metabolic diseases[24], mainly through the regulation of malonyl coenzyme A synthesis by acetyl coenzyme A carboxylase 1 and 2 (ACC1 and ACC2) for fatty acid synthesis. The above findings show that obesity is regulated by multiple pathways and targets, and the main targets obtained by machine learning and their enriched molecular functions help us to conduct subsequent experimental studies.
In summary, the analysis of key genes and their enriched signaling pathways and molecular functions that lead to obesity by using WGCNA and various machine learning methods in this study is of guiding significance in exploring the pathogenesis of obesity and seeking new obesity treatment methods and drugs. However, this study still needs to be improved: firstly, a more rigorous analysis by expanding the sample size is needed to obtain more generalized results. Secondly, this study was not validated in animal experiments, and the results obtained from bioinformatics should be validated in experiments in the follow-up study. Finally, we still need to validate the bioinformatics results in the clinical setting.
This is the first study to use the WGCNA approach to construct co-expression networks combined with machine learning to explore obesity-related susceptibility modules and important genes. This study reveals that several modules play an important role in the etiology of obesity, and several genes such as RIMBP2, COX6B2, and RXFP2 are important influences in the development of obesity. This study helps to improve the genetic study of the etiology of obesity.
Data Availability statement
All data generated or analysed during this study are included in this published article [and its supplementary information files].