AD was discovered in 1906, and for a hundred years, the question of how to treat the disease has plagued the scientific community. Although decades of research have identified features of the disease, such as the presence of amyloid plaques between neurons and the buildup of another toxic protein, tau protein, inside neurons, there are still a great deal of questions about what causes the onset of the disease and how best to treat it. The study of AD genes is important for understanding their pathogenesis and developing treatments. For example, by analyzing the epigenomic and gene expression changes that occur in AD, one research team identified cellular pathways that could be targets for new drugs [15–18]. Another study revealed new genetic loci for AD [19]. The discovery of these key genes and loci offers the possibility of a deeper understanding of AD pathogenesis and the development of new therapeutic strategies.
In this study, bioinformatics and machine learning methods were combined to jointly analyze three gene microarrays from the GEO database, from which differences in gene expression between AD patients and the normal control group were demonstrated, and 254 DEGs were selected. To understand the functions and pathways of these DEGs, GO analysis and KEGG enrichment analysis were performed, and the results of GO analysis showed that DEGs were highly enriched in functions and structures such as vesicular transport, vesicular membranes, and proton pumps/ATPases, which implied that these genes play important roles in membrane transport and energy metabolism processes in neuronal cells. This may be closely related to the pathophysiological process of AD, which is usually accompanied by abnormal metabolism of proteins and disturbances in vesicular transport within neurons. This finding is consistent with previous findings, such as those discussing the relationship between amyloid and defects in synaptic vesicle dynamics and neurotransmitter release, and proposing the treatment of Alzheimer's disease through interventions targeting synaptic vesicle dynamics [20]. KEGG enrichment analysis, on the other hand, showed the most significant enrichment of pathways related to neurodegenerative diseases, such as Alzheimer's disease, synaptic transmission, Parkinson's disease, and Huntington's disease, suggesting that AD may share certain pathological mechanisms with other neurological disorders and that these DEGs may play a key role in these shared mechanisms. GO analysis of intersecting genes (mitophagy-related DEGs) showed significant enrichment of intersecting genes in mitochondrial respiratory chain complexes and endosomes, suggesting that these genes may be involved in the regulation of mitochondrial function. Mitochondria are important organelles for cellular energy production, and their dysfunction leads to neuronal damage [21]. It has been reported in the relevant literature that mitochondrial dysfunction may be an important link in the early stages of Alzheimer's disease and may be involved in the development of AD through the generation of oxidative stress [22]. Another study indicated that mitochondrial autophagy maintains cellular homeostasis by degrading damaged mitochondria, which plays a key role in maintaining normal mitochondrial structure and function, cell survival, and neurological development and is closely related to the development of AD [23, 24]. These results indicate the important role of mitochondrial functional impairment and autophagy in the pathogenesis of AD, and the present study searched for DEGs associated with mitophagy in AD, which is of great significance for the development of diagnostic markers and drugs.
In addition, this study revealed the two main subtypes of AD samples in this study by consistency cluster analysis and analyzed the variability of the different sets of pathway genes among the subtypes. Among them, the IL-6 pathway gene expression levels were overall higher in Cluster 1 than in Cluster 2. This result may reflect a more severe inflammatory pathologic process in patients with Cluster 1, or it may be associated with a specific subtype or clinical presentation. It has been suggested that IL-6 controls the development of neurological disorders by binding to receptors that activate transcriptional activators in signaling, which can be used as a basis for the diagnosis of AD, as well as for the screening of patients at high risk for AD and the early detection of patients with mild and severe AD [25]. Similarly, the immune cell infiltration, immune activation, and mitochondrial oxidative phosphorylation of the two subtypes were significantly different, and these differential analyses provide directions for further research on the functional status of the different subtypes of cells and their regulatory mechanisms, which are conducive to the research and implementation of personalized therapy.
Machine learning was then used to screen seven key genes from 38 DEGs associated with mitophagy: FXR1, TUBB4B, TUBB, MAP4K4, IDH3G, RTN3, and ATP5F1B. Among them, FXR1 is a member of the Fragile X gene family, which, together with FXR2 and FMRP, forms the Fragile X gene family that is highly expressed in neurons and associated with schizophrenia, bipolar disorder, and emotion regulation. It has been shown that members of the fragile X gene family are able to regulate synaptic structure and function [26]. Another study indicated that adhesion molecules of synaptic cells can regulate Aβ production through interactions with key enzymes that promote Aβ formation, and Aβ-dependent synaptic adhesion affects synaptic function and integrity, suggesting that altered adhesion molecules at synapses play a key role in the disruption of the AD neuronal network [27]. Therefore, there may also be an association between FXR1 and AD, especially in terms of synaptic function and integrity, and FXR1 may be involved in regulating synaptic function in neuronal networks, in which synaptic adhesion molecules may play a key role. Both TUBB4B and TUBB are microtubule-encoding genes. Microtubules are cytoskeletal polymers of ⍺/β-tubulin heterodimers that are essential for a wide range of cellular processes. Pathogenic variants in microtubule-encoding genes (e.g., TUBB4B, which encodes a β-4B microtubule protein isoform) are responsible for multiple brain malformations [28]. A study [29] showed that microtubule proteins isolated from the brains of AD patients could polymerize in vitro to form structurally abnormal microtubules, suggesting that disturbances in the self-regulation of the microtubule system in human brain cells may lead to structural defects in microtubules and may result in the destabilization of the functioning of the preferential metabolic system within the cell, which contributes to the onset of AD. Tau is also a microtubule protein, and aberrant aggregation of tau proteins into neuroprogenitor fiber tangles is thought to be an important pathogenetic mechanism in AD [30]. Moreover, a study [31] reported direct evidence that overexpression of tau proteins leads to axonal transport disorders in neuronal cells and suggested that this is associated with disorders of microtubule proteins. TUBB4B and TUBB, as genes coding for microtubule proteins, are likely to be involved in the development of AD as well. The MAP4K4 gene encodes mitogen-activated protein kinase kinase kinase kinase kinase 4 protein (MAP4K4 protein). Endothelial MAP4K4 enhances endothelial permeability, thereby promoting inflammation and atherosclerosis progression [32]. In addition, recombinant MAP4K4 protein aggravates neurological damage, brain edema, and blood‒brain barrier damage [33]. Although there is no direct evidence for a clear relationship between MAP4K4 and AD, inflammation plays an important role in the pathogenesis and progression of AD, and MAP4K4, a gene encoding an inflammation-associated protein kinase, may be related to the inflammatory process in AD to some extent.
The IDH3G protein, encoded by the IDH3G gene and working in conjunction with IDH3A and IDH3B, forms the IDH3 complex (isocitrate dehydrogenase). Located within the mitochondria, this complex catalyzes the conversion of isocitrate to α-ketoglutarate, playing a vital role in intracellular energy metabolism and the regulation of oxidative stress. It serves as a core component of the tricarboxylic acid cycle and is closely associated with cellular metabolism and energy homeostasis [34, 35]. Meanwhile, disruptions in neuronal energy metabolism play a pivotal role in the onset and progression of AD [36]. Mitochondrial energy metabolism dysfunction is one of the main pathological features of AD, and mitochondrial dysfunction and energy metabolism abnormalities in the brains of AD patients precede Aβ deposition and Tau protein phosphorylation [37]. Abnormal mitochondria can be found in the brain tissue of several AD model animals and AD patients [38], and early on, they can exhibit dysfunction, such as ultrastructural changes, abnormal mitochondrial enzyme activity and oxidative stress [39], which ultimately leads to reduced ATP synthesis, free radical production and oxidative damage, leading to neuronal dysfunction. Due to the high energy demands of the nervous system and the important role of mitochondria in neuronal homeostasis, ATP synthase deficiency may lead to neurological impairment. Although ATP5F1B is known to be a core gene encoding the v β subunit of the ATP synthase complex, the exact mechanism by which these variants are pathogenic remains incompletely understood [40]. RTN3 belongs to the RTN family, is localized in the endoplasmic reticulum and is widely expressed in various tissues of the body, with the highest expression in brain tissue [41]. The functions of most of the family members are still unknown. Some scholars speculate that they are involved in some important functions related to the endoplasmic reticulum, such as the formation of vesicles, the packaging of secretory products, and the regulation of intracellular calcium levels. They also found that this gene is widely expressed in the nonglial cells of the central nervous system of the mouse, especially in the hippocampus, cerebral cortex, hypothalamus, and some neuronal nuclei [42]. Another study showed that RTN3 was sufficient to negatively regulate BACE1 activity by blocking the binding of BACE1 to amyloid precursor proteins [43], whereas BACE1-deficient mice were able to survive healthily and completely abrogate Aβ production and that increasing the expression of RNT3 markedly reduced amyloidogenic peptides, whereas inhibition of RTN3 increased amyloidogenic peptide production [44]. Although the mechanism of action of RTN3 in AD is still not well understood, these results suggest that RTN3 plays an important role in neurodegenerative diseases.
Finally, based on these 7 key genes, a nomogram model for AD risk prediction was developed. In the dataset of this study, the AUC of the seven pivotal genes was greater than 0.75, which has good diagnostic value. The AUC of the nomogram model even reached 0.877, which has good disease prediction accuracy. In summary, FXR1, TUBB4B, MAP4K4, IDH3G, TUBB, RTN3, ATP5F1B and the nomogram model of these 7 key genes have great potential in the diagnosis and treatment of AD.
In this study, a combination of bioinformatics and machine learning was used to screen key genes, and some preliminary valuable results were obtained. However, the study also has some shortcomings. First, although the three gene chips were combined in this study, the sample size was still small, with 143 AD samples and 82 control samples, and enlarging the sample size might enhance the reliability of the results. Second, due to the limitation of experimental conditions, the results obtained in this paper could not be further verified, and we expect other researchers to further analyze and verify these key genes and the transcription factors associated with them.