Discovery and Identification of Candidate Genes, Diagnostic Model and Drug Predictions for Schizophrenia and Crohn's Disease Through Integrated Bioinformatics Analysis and Machine Learning

doi:10.21203/rs.3.rs-2333064/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Abstract

Background

Both schizophrenia and Crohn's disease are associated with immunological and metabolic abnormalities. This study aims to identify diagnostic candidate genes for patients with schizophrenia and Crohn's disease, together with candidate medications.

Methodology:

Datasets for schizophrenia and Crohn's disease were retrieved from the Gene Expression Omnibus (GEO) database. Differentially expressed genes (DEGs) were identified using the Limma package, and module genes were identified using weighted gene co-expression network analysis (WGCNA). Functional enrichment analysis was conducted, a protein-protein interaction (PPI) network was constructed, and candidate immune-related central genes were screened by machine learning using least absolute shrinkage and selection operator (LASSO) regression and the random forest method. An artificial neural network was then established to verify these genes, and receiver operating characteristic (ROC) curves were plotted to assess their value for the diagnosis of schizophrenia. Finally, the Enrichr database was used to collect drugs related to the candidate genes.

Results

A total of 2681 DEGs were identified in schizophrenia, and 210 Crohn's disease-related genes were screened out. Of these, 35 genes were shared between the schizophrenia DEGs and the Crohn's disease-related genes. Finally, seven candidate genes were screened out using the PPI network and machine learning. ROC analysis suggested that these candidate genes have high diagnostic value. Valproic acid and other related drugs were collected from the Enrichr database.

Conclusion

A model of seven candidate genes (CAP1, INSIG1, MSMO1, PHLDA2, PSMB6, TBC1D2, UBA5) has high diagnostic value, and valproic acid and other drugs may become candidate drugs for patients with schizophrenia, providing evidence relevant to its pathogenesis and treatment.

Keywords: Schizophrenia, Crohn's Disease, drug prediction, diagnostic model, machine learning

1. Brief Introduction

Schizophrenia is a disease with variable phenotypic expression and a complex, poorly understood etiology in which major genetic factors interact with environmental factors [1]. Its core features are positive symptoms (hallucinations and delusions), negative symptoms (social withdrawal and impaired motivation) and cognitive impairment [2]. Every year, one in 10,000 adults develops schizophrenia [3]. The suicide rate is higher than in other diseases, which makes schizophrenia the 13th most common cause of suicide worldwide [4]. The life expectancy of patients with schizophrenia is reduced by 10–20 years [5].

The connection between intestinal dysfunction and inflammation on the one hand, and brain development and psychiatric disorders on the other, has become one of the most significant areas of scientific research in recent years [6]. Crohn's disease is a chronic inflammatory condition of the gastrointestinal tract that has a genetic predisposition and can produce lesions anywhere from the mouth to the anus [7]. Although immunity is now recognized as the main contributor to Crohn's disease, the primary contributor within its immune response pathway remains unidentified [8]. Studies have shown that psychiatric disorders and inflammatory bowel diseases are genetically related, with an increased incidence of psychiatric disorders in inflammatory bowel diseases, and the associated genes have been identified [9]. Whether these correlations have diagnostic significance or any impact on drug research has not been fully elucidated. Notable advances in microarray technology and bioinformatics have greatly aided the development of biomedicine. Public databases hold large amounts of high-throughput data, which help reveal possible disease pathogenesis and identify potential targets for drug design [10]. This study aims to fill these gaps and to support drug selection and clinical diagnosis.

2 Materials And Methods

2.1 Materials

From the GEO database (https://www.ncbi.nlm.nih.gov/geo/), the schizophrenia data set GSE92538 [11] was selected as the training group and GSE21935 as the test group; the data set GSE36807 was selected for Crohn's disease. Figure 1 shows the process flow, and the complete information dataset is given in Table 1.

2.2 Screening of differentially expressed genes

Limma (linear models for microarray data) (version 3.40.6) [12], a differential expression screening method for R based on generalized linear models, was used to identify DEGs between the control and comparison groups. Genes with |log2 fold change (FC)| > 1 and P value < 0.05 were considered DEGs, and Sangerbox was used to draw the volcano plots and heat maps of the DEGs of schizophrenia and Crohn's disease [13].
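The thresholding step can be illustrated with a minimal Python sketch. It assumes that per-gene log2 fold changes and P values have already been produced by the Limma fit; the function name `screen_degs`, the gene labels, and the numeric values are hypothetical and are not taken from the study.

```python
from typing import Dict, List, Tuple


def screen_degs(
    stats: Dict[str, Tuple[float, float]],
    lfc_cutoff: float = 1.0,
    p_cutoff: float = 0.05,
) -> Dict[str, List[str]]:
    """Split genes into up- and down-regulated DEGs.

    `stats` maps a gene label to (log2 fold change, P value).
    A gene is kept only if |log2FC| > lfc_cutoff and P < p_cutoff.
    """
    result: Dict[str, List[str]] = {"up": [], "down": []}
    for gene, (log2_fc, p_value) in stats.items():
        if abs(log2_fc) > lfc_cutoff and p_value < p_cutoff:
            result["up" if log2_fc > 0 else "down"].append(gene)
    return result


# Hypothetical values for illustration only.
example = {
    "geneA": (1.8, 0.001),  # passes both cutoffs, up-regulated
    "geneB": (-1.2, 0.03),  # passes both cutoffs, down-regulated
    "geneC": (2.5, 0.20),   # fails the P value cutoff
    "geneD": (0.4, 0.001),  # fails the fold-change cutoff
}
degs = screen_degs(example)
print(degs)
```

Both cutoffs are strict inequalities, matching the conditions stated above.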

2.3 Module gene selection and weighted gene co-expression network analysis

First, the median absolute deviation (MAD) of each gene was calculated from the expression profiles of the Crohn's disease dataset GSE36807 [14]. The 25% of genes with the smallest MAD were eliminated, outlier genes and samples were removed using the goodSamplesGenes function of the R package WGCNA, and a scale-free co-expression network was constructed with WGCNA. The sensitivity was set to 3, and modules with a distance smaller than 0.25 were merged, which yielded 30 co-expression modules. The grey module collects the genes that could not be assigned to any module.
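The MAD-based gene filter can be sketched in Python as follows. This is an illustration under stated assumptions rather than the WGCNA workflow itself: the MAD is computed without a scaling constant (which does not affect the ranking), and the helper names and expression values are hypothetical.

```python
from statistics import median
from typing import Dict, List


def mad(values: List[float]) -> float:
    """Unscaled median absolute deviation: median of |x - median(x)|."""
    centre = median(values)
    return median([abs(v - centre) for v in values])


def drop_low_mad(expr: Dict[str, List[float]], fraction: float = 0.25) -> List[str]:
    """Return the genes kept after removing the `fraction` with the smallest MAD."""
    ranked = sorted(expr, key=lambda gene: mad(expr[gene]))
    n_drop = int(len(ranked) * fraction)
    return ranked[n_drop:]


# Hypothetical expression values (one list of samples per gene).
expr = {
    "g1": [5.0, 5.0, 5.1, 5.0],  # nearly constant across samples
    "g2": [2.0, 4.0, 6.0, 8.0],
    "g3": [1.0, 1.5, 3.0, 9.0],
    "g4": [7.0, 7.2, 6.8, 7.1],
}
kept = drop_low_mad(expr)
print(kept)
```

Genes that barely vary across samples carry little co-expression signal, which is why the least variable quarter is removed before the network is built.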

2.4 Gene function enrichment analysis

Functional enrichment analysis of the module genes obtained above was performed to understand the main functions of the genes involved in Crohn's disease. The KEGG REST API (https://www.kegg.jp/kegg/rest/keggapi.html) was used to acquire the latest KEGG pathway gene annotation, which served as the background set. The genes were mapped to this background set, and the R package clusterProfiler (version 3.14.3) [16] was used for enrichment analysis. GSEA software (version 3.0) was downloaded from the GSEA website (http://software.broadinstitute.org/gsea/index.jsp) [17]. To assess the associated pathways and molecular mechanisms, the samples were separated into two groups, and the gene set collection c2.cp.kegg.v7.4.symbols.gmt was downloaded from the Molecular Signatures Database [18]. Based on the gene expression profiles and phenotypic grouping, the minimum gene set size was set to 5 and the maximum to 5000. P value < 0.05 and FDR < 0.1 were considered statistically significant.
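Over-representation analysis of this kind is commonly based on a hypergeometric test. The Python sketch below shows that calculation for a single pathway; it is an illustration under that assumption and not a reproduction of the clusterProfiler implementation. The function name and the example counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(
    n_background: int, n_pathway: int, n_query: int, n_overlap: int
) -> float:
    """P(X >= n_overlap) when n_query genes are drawn without replacement
    from n_background genes, of which n_pathway belong to the pathway."""
    upper = min(n_pathway, n_query)
    total = comb(n_background, n_query)
    tail = sum(
        comb(n_pathway, k) * comb(n_background - n_pathway, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Hypothetical counts: 20 background genes, 5 in the pathway,
# 4 query genes, 2 of which fall in the pathway.
p_value = hypergeom_enrichment_p(20, 5, 4, 2)
print(round(p_value, 4))
```

In practice the P values of all tested pathways would then be adjusted for multiple testing before the FDR cutoff is applied.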

2.5 Construction of protein-protein interaction (PPI) network

The PPI network was constructed using the STRING database [19] with the minimum required interaction score set to medium confidence (0.400). Cytoscape was used for visualization [20].
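The effect of the interaction-score cutoff can be sketched in Python. The sketch assumes that interactions are available as (protein, protein, combined score) triples with scores on a 0 to 1 scale; the helper names, protein labels, and scores are hypothetical.

```python
from typing import Iterable, List, Set, Tuple

Edge = Tuple[str, str, float]  # (protein A, protein B, combined score in [0, 1])


def filter_edges(edges: Iterable[Edge], min_score: float = 0.400) -> List[Edge]:
    """Keep interactions whose combined score reaches the cutoff."""
    return [edge for edge in edges if edge[2] >= min_score]


def connected_proteins(edges: Iterable[Edge]) -> Set[str]:
    """Proteins that take part in at least one retained interaction."""
    nodes: Set[str] = set()
    for a, b, _score in edges:
        nodes.update((a, b))
    return nodes


# Hypothetical interactions for illustration only.
raw_edges = [
    ("P1", "P2", 0.92),
    ("P2", "P3", 0.41),
    ("P3", "P4", 0.15),  # below medium confidence, dropped
]
kept_edges = filter_edges(raw_edges)
linked = connected_proteins(kept_edges)
print(sorted(linked))
```

Proteins that lose all their edges at the cutoff, such as P4 here, no longer appear in the network.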

2.6 Screening of schizophrenia and Crohn's disease-related characteristic genes by machine learning

First, the Crohn's disease module genes were intersected with the Crohn's disease DEGs to obtain the primary genes related to Crohn's disease, and these were then intersected with the schizophrenia DEGs to obtain the characteristic genes related to both diseases. The gene expression data, survival time, and survival status were integrated using the R packages glmnet [21] and randomForest [22], and regression analysis was carried out using the random forest and lasso-cox methods. To optimize the model, 10-fold cross-validation was performed. The R package neuralnet [23] was used to build an artificial neural network from the feature genes obtained above in order to build a high-precision diagnostic model. The R package pROC [24] was used for ROC analysis to obtain the AUC, and its ci function was used to evaluate the AUC and its confidence interval (CI) to give the final AUC results. Sangerbox was used for visualization, and the expression of the characteristic genes in the training set (GSE92538) and the test set (GSE21935) was examined.
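The AUC and an accompanying confidence interval can be sketched in Python as follows. This is a hedged illustration: the AUC is computed from pairwise comparisons of case and control scores, and a percentile bootstrap is swapped in for the interval, which is not necessarily the method applied by pROC in this study. The function names, scores, and labels are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Probability that a random case scores higher than a random control
    (ties count as one half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(
    scores: Sequence[float],
    labels: Sequence[int],
    n_boot: int = 2000,
    alpha: float = 0.05,
    seed: int = 0,
) -> Tuple[float, float]:
    """Percentile bootstrap interval for the AUC."""
    rng = random.Random(seed)
    indices = range(len(scores))
    stats: List[float] = []
    while len(stats) < n_boot:
        sample = [rng.choice(indices) for _ in indices]
        ys = [labels[i] for i in sample]
        if 0 < sum(ys) < len(ys):  # a resample needs both classes
            stats.append(auc([scores[i] for i in sample], ys))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Hypothetical model outputs (1 = case, 0 = control).
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]
point = auc(scores, labels)
low, high = bootstrap_ci(scores, labels)
print(point, low, high)
```

Reporting the interval with the lower bound first keeps it directly comparable with the point estimate.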

2.7 Drug prediction

Drugs targeting the central genes were retrieved from the Enrichr database [25], and the drugs related to the characteristic genes were recorded and visualized for further correlation analysis.

3 Results

3.1 Screening of DEGs between schizophrenia and Crohn's disease

Using the Limma package, a total of 2681 DEGs were identified in the schizophrenia data set (GSE92538), of which 1299 were up-regulated and 1382 were down-regulated (Fig. 2A, 2B). In the Crohn's disease data set (GSE36807), 3235 DEGs were identified, of which 1464 were up-regulated and 1771 were down-regulated (Fig. 2C, 2D).

3.2 Modular genes selection and weighted gene co-expression network analysis

A cluster dendrogram of the Crohn's disease and control samples was created using the soft threshold selected in this study (β = 8) (Fig. 3A). On this basis, 30 gene co-expression modules (GCMs) were constructed (Fig. 3B, 3C), and the association between Crohn's disease and each GCM was determined (Fig. 3D). The sienna3 module (359 genes) showed the strongest correlation with Crohn's disease (correlation coefficient = 0.65, p = 1.8e-3). The correlation between module membership and gene significance in the sienna3 module of schizophrenia was also calculated, and a considerable positive correlation was observed (r = 0.61) (Fig. 3E).
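The role of the soft threshold can be illustrated with a short Python sketch. It assumes an unsigned network, in which the adjacency between two genes is the absolute Pearson correlation raised to the power β; whether the study used an unsigned or signed network is not stated, and the function names and expression values are hypothetical.

```python
from math import sqrt
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def soft_threshold_adjacency(
    x: Sequence[float], y: Sequence[float], beta: int = 8
) -> float:
    """Unsigned adjacency: |cor(x, y)| ** beta."""
    return abs(pearson(x, y)) ** beta


# Hypothetical expression profiles across four samples.
gene_a = [1.0, 2.0, 3.0, 4.0]
gene_b = [2.0, 4.0, 6.0, 8.0]  # perfectly correlated with gene_a
gene_c = [1.0, 3.0, 2.0, 4.0]  # correlation of 0.8 with gene_a
strong = soft_threshold_adjacency(gene_a, gene_b)
weak = soft_threshold_adjacency(gene_a, gene_c)
print(strong, weak)
```

Raising the correlation to a power suppresses moderate correlations much more than strong ones, which pushes the network toward a scale-free topology.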

3.3 Functional enrichment analysis of Crohn's disease

Functional enrichment of the sienna3 module genes in Crohn's disease was analyzed. KEGG analysis revealed that these candidate genes (CGs) were mostly enriched in "meta pathways", "Proteasome", and other pathways (Fig. 4A). GO analysis revealed that, in terms of cell components (CC), the CGs were mainly found in the "vascular", "endomembrane system", "extracellular space", and "extracellular region" (Fig. 4B). The key biological processes (BP) of the CGs included the small molecule metabolic process and the small molecule biological process (Fig. 4C). In terms of molecular function (MF), the key items for the CGs were "catalytic activity" and "identity protein binding" (Fig. 4D). These analyses revealed that Crohn's disease was mainly related to metabolism and the immune system, which was similar to schizophrenia.

3.4 Construction and function enrichment analysis of protein-protein interaction network of intersection genes of schizophrenia and Crohn's disease

First, 210 major genes related to Crohn's disease were obtained by intersecting the Crohn's disease module genes with the Crohn's disease DEGs. These genes were then intersected with the schizophrenia DEGs in a Venn diagram, which yielded 35 shared genes (Fig. 5A). Functional enrichment analysis of these candidate genes (CGs) revealed that they were mainly enriched in "meta pathways" and "Rap1 signaling pathways" (Fig. 5B). GO analysis revealed that, in terms of CC, the CGs were mainly found in the "endomembrane system" and "organelle subset" (Fig. 5C). The key BPs of the CGs included "vascular mediated transport" and "small molecule metabolic process" (Fig. 5D). In terms of molecular function (MF), "cell adhesion molecule binding" and "cadherin binding" were the most important items (Fig. 5E). Comparison of these results revealed a major correlation between schizophrenia and Crohn's disease, both of which were related to metabolism and the immune system. The 35 candidate genes were analyzed in the STRING database, and 20 of them were found to be related (Fig. 5F).

3.5 Screening candidate genes through machine learning and construction of an artificial neural network

LASSO regression was used to screen the candidate genes and identified 22 potential biomarkers (Fig. 6A, 6B). Random forest (RF) regression analysis was also used to screen the candidate genes and yielded 17 potential biomarkers (Fig. 6C). The results of these two machine learning techniques were intersected with the 20 candidate genes in the PPI network, and 7 candidate genes (CAP1, INSIG1, MSMO1, PHLDA2, PSMB6, TBC1D2, UBA5) were finally obtained (Fig. 6D). These seven genes were used to construct a neural network, and the results showed that they could differentiate schizophrenia samples from control samples well (Fig. 6E). Expression profile analysis of the seven candidate genes indicated considerable differences between the schizophrenia and control groups (Fig. 6F).
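The cross-analysis of the three gene lists amounts to a set intersection, which can be sketched in Python as follows. The function name `consensus_genes` and the gene labels are hypothetical placeholders and do not correspond to the genes reported in this study.

```python
from typing import Iterable, List, Set


def consensus_genes(*gene_lists: Iterable[str]) -> List[str]:
    """Genes present in every input list, returned in sorted order."""
    sets: List[Set[str]] = [set(genes) for genes in gene_lists]
    if not sets:
        return []
    return sorted(set.intersection(*sets))


# Hypothetical selections from the three screening steps.
lasso_hits = ["g1", "g2", "g3", "g5"]
rf_hits = ["g2", "g3", "g4", "g5"]
ppi_hits = ["g1", "g3", "g5", "g6"]
shared = consensus_genes(lasso_hits, rf_hits, ppi_hits)
print(shared)
```

Requiring agreement among all three steps is a conservative choice: a gene selected by only one or two methods is discarded.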

3.6 Construction and verification of diagnostic model

A forest plot of the seven candidate genes was established (Fig. 7A), and the AUC and its 95% CI were calculated (AUC 0.84, 95% CI 0.78–0.90). ROC curves were plotted to assess specificity and sensitivity (Fig. 7B). To further verify the model, a forest plot of the candidate genes in the test group (GSE21935) was established, and the ROC curves were plotted (AUC 0.78, 95% CI 0.64–0.93) (Fig. 7C, 7D). The results in the test group showed that the model has some value for the diagnosis of schizophrenia. GSEA of the seven candidate genes showed that they are correlated with metabolism and immunity (Fig. 8A-F).

3.7 Drug prediction

The nine drugs with the highest combined scores related to the candidate genes (valproic acid, minocycline, tetrandrine, norcyclobenzaprine, loperamide, protriptyline, rescinnamine, maprotiline, and trifluoperazine) were selected from the Enrichr database (Table 2), and the intersection of the candidate genes and drugs was visualized for subsequent analysis (Fig. 9).

4 Discussion

Schwarz et al. reported that the numbers of lactic acid bacteria and bifidobacteria in the fecal microbiota of patients with first-episode psychosis were increased and were related to the severity of psychiatric symptoms [26]. Similar phenomena have been confirmed in patients with autism and depression [27, 28]. Inflammatory bowel disease (IBD) comprises Crohn's disease and chronic nonspecific ulcerative colitis (UC). There are numerous studies on the relationship between UC and schizophrenia [29], but few on the correlation between schizophrenia and Crohn's disease. This study examines the correlation between the two diseases and provides further options for the treatment of schizophrenia.

CAP1 (Cyclase Associated Actin Cytoskeleton Regulatory Protein 1) is a protein-coding gene. Diseases associated with CAP1 include Anteroseptal Myocardial Infarction and Luminal Breast Carcinoma A. Its related pathways include Response to increased platelet cytosolic Ca2+ and Innate Immune System, and its Gene Ontology (GO) annotations include actin binding [30]. Although CAP1 has not been widely studied in patients with schizophrenia, rat experiments have confirmed a significant correlation between this gene and schizophrenia [31].

TBC1D2, an important member of the TBC/RabGAP family, controls the interaction between Rac1 activation and Rab7 cycling [32]. TBC1D2 also controls how these two small GTPases function during particular cellular activities such as scattering, transport, and autophagy [33]. The findings of this study are consistent with those of Breen et al., who used methamphetamine-associated psychosis as a pharmacological and environmental model of schizophrenia and found eight high-scoring linked genes, including TBC1D2 [34].

Valproic acid, a drug for bipolar disorder [35], is a fatty acid with anticonvulsant and anti-manic properties. The mechanism of its therapeutic effect is still unclear; it may act by increasing γ-aminobutyric acid levels or by altering the properties of voltage-gated sodium channels [36]. In a mouse study, valproic acid prevented hyperactivity and deficits in prepulse inhibition and latent inhibition in Disc1-L100P mice [37], which demonstrates an effect in preventing schizophrenia-related symptoms.

Minocycline belongs to the tetracycline class of antibiotics. It treats bacterial infections by limiting the growth and spread of bacteria, and it treats acne by killing the bacteria that infect pores and by reducing a natural oily substance that causes acne [38]. In an overview of seven case reports on minocycline as an add-on treatment for persistent schizophrenia, Hovens found that it may improve negative symptoms and cognition in patients with schizophrenia [39].

This study has limitations. The findings were not confirmed with wet-lab experiments, and the analysis could not be combined with clinical information because the corresponding clinical research tools were lacking.

5 Conclusion

A model of seven candidate genes (CAP1, INSIG1, MSMO1, PHLDA2, PSMB6, TBC1D2, and UBA5) with high diagnostic value was identified. Valproic acid and other drugs may be candidate drugs for patients with schizophrenia, providing evidence relevant to the pathogenesis and drug treatment of schizophrenia.

Declarations

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and materials

The datasets generated and analysed during the current study are available in the GEO repository (https://www.ncbi.nlm.nih.gov/geo/).

Competing interests

The authors declare that they have no competing interests.

Funding

Not applicable

Authors' contributions

Yu Feng and Jing Shen wrote the main manuscript text, and all authors reviewed the manuscript.

Acknowledgements

In the vastness of space and immensity of time, it is my joy to spend a planet and an epoch with Maggie.

References

1. Jablensky A: The diagnostic concept of schizophrenia: its history, evolution, and future prospects. Dialogues Clin Neurosci 2010, 12(3):271-287.
2. Owen MJ, Sawa A, Mortensen PB: Schizophrenia. Lancet 2016, 388(10039):86-97.
3. Häfner H, an der Heiden W: Epidemiology of schizophrenia. Can J Psychiatry 1997, 42(2):139-151.
4. Lozano R, Naghavi M, Foreman K, Lim S, Shibuya K, Aboyans V, Abraham J, Adair T, Aggarwal R, Ahn SY et al: Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 2012, 380(9859):2095-2128.
5. Chesney E, Goodwin GM, Fazel S: Risks of all-cause and suicide mortality in mental disorders: a meta-review. World Psychiatry 2014, 13(2):153-160.
6. Heiss CN, Olofsson LE: The role of the gut microbiota in development, function and disorders of the central nervous system and the enteric nervous system. J Neuroendocrinol 2019, 31(5):e12684.
7. Veauthier B, Hornecker JR: Crohn's Disease: Diagnosis and Management. Am Fam Physician 2018, 98(11):661-669.
8. Li N, Shi RH: Updated review on immune factors in pathogenesis of Crohn's disease. World J Gastroenterol 2018, 24(1):15-22.
9. Uellendahl-Werth F, Maj C, Borisov O, Juzenas S, Wacker EM, Jørgensen IF, Steiert TA, Bej S, Krawitz P, Hoffmann P et al: Cross-tissue transcriptome-wide association studies identify susceptibility genes shared between schizophrenia and inflammatory bowel disease. Commun Biol 2022, 5(1):80.
10. Zhao X, Zhao Y, Jiang Y, Zhang Q: Deciphering the endometrial immune landscape of RIF during the window of implantation from cellular senescence by integrated bioinformatics analysis and machine learning. Front Immunol 2022, 13:952708.
11. Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M et al: NCBI GEO: archive for functional genomics data sets--update. Nucleic Acids Res 2012, 41(D1):D991-D995.
12. Sokhansanj BA, Fitch JP, Quong JN, Quong AA: Linear fuzzy gene network models obtained from microarray data by exhaustive search. BMC Bioinformatics 2004, 5:108.
13. Shen W, Song Z, Zhong X, Huang M, Shen D, Gao P, Qian X, Wang M, He X, Wang T et al: Sangerbox: A comprehensive, interaction-friendly clinical bioinformatics analysis platform. iMeta 2022, 1(3):e36.
14. Langfelder P, Horvath S: WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 2008, 9:559.
15. Carlson M: org.Hs.eg.db: Genome wide annotation for Human. R package version 3.15.0. 2022.
16. Yu G, Wang LG, Han Y, He QY: clusterProfiler: an R package for comparing biological themes among gene clusters. Omics 2012, 16(5):284-287.
17. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES et al: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102(43):15545-15550.
18. Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP: Molecular signatures database (MSigDB) 3.0. Bioinformatics 2011, 27(12):1739-1740.
19. Szklarczyk D, Gable AL, Nastou KC, Lyon D, Kirsch R, Pyysalo S, Doncheva NT, Legeay M, Fang T, Bork P et al: The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res 2021, 49(D1):D605-D612.
20. Doncheva NT, Morris JH, Gorodkin J, Jensen LJ: Cytoscape StringApp: Network Analysis and Visualization of Proteomics Data. J Proteome Res 2019, 18(2):623-632.
21. Zhang M, Zhu K, Pu H, Wang Z, Zhao H, Zhang J, Wang Y: An Immune-Related Signature Predicts Survival in Patients With Lung Adenocarcinoma. Front Oncol 2019, 9:1314.
22. Yasir M, Karim AM, Malik SK, Bajaffer AA, Azhar EI: Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models. Saudi J Biol Sci 2022, 29(5):3687-3693.
23. Beck MW: NeuralNetTools: Visualization and Analysis Tools for Neural Networks. J Stat Softw 2018, 85(11):1-20.
24. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez JC, Müller M: pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 2011, 12:77.
25. Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, Clark NR, Ma'ayan A: Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics 2013, 14:128.
26. Schwarz E, Maukonen J, Hyytiäinen T, Kieseppä T, Orešič M, Sabunciyan S, Mantere O, Saarela M, Yolken R, Suvisaari J: Analysis of microbiota in first episode psychosis identifies preliminary associations with symptom severity and treatment response. Schizophr Res 2018, 192:398-403.
27. Adams JB, Johansen LJ, Powell LD, Quig D, Rubin RA: Gastrointestinal flora and gastrointestinal status in children with autism--comparisons to typical children and correlation with autism severity. BMC Gastroenterol 2011, 11:22.
28. Jiang H, Ling Z, Zhang Y, Mao H, Ma Z, Yin Y, Wang W, Tang W, Tan Z, Shi J et al: Altered fecal microbiota composition in patients with major depressive disorder. Brain Behav Immun 2015, 48:186-194.
29. Ennaifer R, Rafrafi R, Mouelhi L, Houissa F, Bouzaidi S, Trabelsi S, El Hechmi Z, Najjar T: [Ulcerative colitis and schizophrenia: fortuitous association or etiopathogenic link?]. Tunis Med 2009, 87(8):531-533.
30. Safran M, Rosen N, Twik M, BarShir R, Iny Stein T, Dahary D, Fishilevich S, Lancet D: The GeneCards Suite. In: Practical Guide to Life Science Databases. 2022, pp 27-56.
31. Wong AH, Lipska BK, Likhodi O, Boffa E, Weinberger DR, Kennedy JL, Van Tol HH: Cortical gene expression in the neonatal ventral-hippocampal lesion rat model. Schizophr Res 2005, 77(2-3):261-270.
32. Tian J, Liang X, Wang D, Tian J, Liang H, Lei T, Yan Z, Wu D, Liu X, Liu S et al: TBC1D2 Promotes Ovarian Cancer Metastasis via Inducing E-Cadherin Degradation. Front Oncol 2022, 12:766077.
33. Carroll B, Mohd-Naim N, Maximiano F, Frasa MA, McCormack J, Finelli M, Thoresen SB, Perdios L, Daigaku R, Francis RE et al: The TBC/RabGAP Armus coordinates Rac1 and Rab7 functions during autophagy. Dev Cell 2013, 25(1):15-28.
34. Breen MS, Uhlmann A, Nday CM, Glatt SJ, Mitt M, Metsalpu A, Stein DJ, Illing N: Candidate gene networks and blood biomarkers of methamphetamine-associated psychosis: an integrative RNA-sequencing report. Transl Psychiatry 2016, 6(5):e802.
35. McIntyre RS, Berk M, Brietzke E, Goldstein BI, López-Jaramillo C, Kessing LV, Malhi GS, Nierenberg AA, Rosenblat JD, Majeed A et al: Bipolar disorders. Lancet 2020, 396(10265):1841-1856.
36. Davis AP, Wiegers TC, Johnson RJ, Sciaky D, Wiegers J, Mattingly CJ: Comparative Toxicogenomics Database (CTD): update 2023. Nucleic Acids Res 2022 Sep 28.
37. Lipina TV, Haque FN, McGirr A, Boutros PC, Berger T, Mak TW, Roder JC, Wong AH: Prophylactic valproic acid treatment prevents schizophrenia-related behaviour in Disc1-L100P mutant mice. PLoS One 2012, 7(12):e51562.
38. AHFS Patient Medication Information [Internet]. Bethesda (MD): American Society of Health-System Pharmacists, Inc.; c2019. Protriptyline; [updated 2020 Jun 24; reviewed 2018 Jul 5; cited 2020 Jul 1]; [about 5 p.]. Available from: https://medlineplus.gov/druginfo/meds/a604025.html
39. Hovens JE, Onderwater TA: [Minocycline for schizophrenia: a brief overview]. Tijdschr Psychiatr 2014, 56(6):402-406.
