Alzheimer’s disease (AD) is a specific type of dementia associated with severe neurological deficits that affect cognitive, visual, sensory, and motor functions in people living with the disease1. With AD, neurodegeneration, a progressive loss of the structure or function of neurons, is inevitable, and there is currently no cure for reversing this process. However, clinical studies have shown that, with early diagnosis, treatment, and therapeutic intervention, the progression of neurodegeneration can be slowed. At present, a definitive diagnosis of AD remains a complex task because tests for the presence of amyloid plaques and phosphorylated tau, the true determinants of AD, can mainly be performed posthumously2. Other clinical practices depend on a multitude of evaluations, including clinical assessments, medical history reviews, cognitive assessments, and neuroimaging, and many years of study may be needed to reach a diagnostic decision3. Neuroimaging methods, such as positron emission tomography (PET) and particularly MRI, provide information on the extent of structural changes in the brain relevant to the pathological alterations characteristic of neurodegeneration4. MRI offers several broad viewpoints, namely axial, coronal, and sagittal, each carrying different levels of information for analysing brain neurodegeneration. Notably, the axial view reveals substantial atrophy of the cerebral cortex, leading to shrinkage of the outer layer of the cerebrum. This atrophy is accompanied by ventricle enlargement, reduced brain volume, and diminished gray matter5. In contrast, the coronal view highlights ventricle enlargement and emphasizes significant temporal lobe and cortical atrophy, providing a window into the widespread loss of neurons throughout the brain, accompanied by sulcus widening and gyrus thinning5. The sagittal plane provides the most visible information for AD diagnosis7.
Brain neurodegeneration is evident in the sagittal plane, particularly in the frontal lobe, cerebellum, occipital lobe, thalamus, and corpus callosum, where learning and memory, mental function, motor function, and sensory function7 can be significantly impacted.
Despite the diagnostic potential of MRI, sole reliance on it for early AD diagnosis faces numerous limitations9. For example, AD may elude visual detection, especially when numerous samples from MCI and AD patients are analysed, necessitating a reliable and comprehensive clinical evaluation methodology. Additionally, the interpretation of MRI scans varies among radiologists and clinicians, which can introduce inconsistency into diagnosis. Emerging machine learning techniques, however, offer promise for diagnosing AD, particularly in a timely manner, thereby paving the way for more effective AD management and intervention.
Over the past decade, deep learning algorithms, including both pretrained networks and tailored architectures, have been successfully adopted for AD modelling. Pretrained networks have long-standing relevance in AD diagnostic research. Bae et al. tailored a residual network-50 (ResNet50)10 for discriminating between MCI and AD patients and achieved an accuracy of 82.4%. The GoogLeNet, AlexNet, and ResNet-18 pretrained networks were exploited for classifying patients into cognitively normal, early mild cognitive impairment, mild cognitive impairment, and late mild cognitive impairment categories11. With accuracies of 96.39%, 94.08%, and 97.51% for GoogLeNet, AlexNet, and ResNet-18, respectively, ResNet-18 outperformed the other models. By integrating a 3D mobile inverted bottleneck convolution (MBConv) block into a 3D EfficientNet architecture12, accuracy, sensitivity, specificity, and AUC values of 86.67%, 75.00%, 90.91%, 97.16%, and 83.33% were reported for the sMCI and pMCI sets. In another work, the DenseNet-169 and ResNet-50 CNN architectures were exploited for early AD diagnosis13. DenseNet-169 exhibited superior accuracy, surpassing ResNet-50, with scores of 97.7% and 88.7%, respectively. The ResNet-18 pretrained network has also proven useful for AD classification14. With the use of the Mish activation function (MAF) to enhance the model's learning adaptability and a weighted cross-entropy loss function to ensure equitable consideration of the AD, MCI, and CN classes, the network achieved 88.3% accuracy on the preprocessed ADNI dataset.
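To make the components just mentioned concrete, the following is a minimal sketch of the Mish activation and a class-weighted cross-entropy term. The three-class weights and probabilities here are hypothetical illustrations, not the cited work's configuration.

```python
import math

def mish(x):
    # Mish activation: x * tanh(softplus(x)), where softplus(x) = ln(1 + e^x)
    return x * math.tanh(math.log1p(math.exp(x)))

def weighted_cross_entropy(probs, label, class_weights):
    # Weighted cross-entropy for one sample: scale the negative
    # log-likelihood of the true class by its class weight, so that
    # under-represented classes (e.g. AD vs. MCI vs. CN) contribute
    # more to the loss. Weights here are purely illustrative.
    return -class_weights[label] * math.log(probs[label])

# Hypothetical example: true class 0 with predicted probability 0.7
loss = weighted_cross_entropy([0.7, 0.2, 0.1], 0, [1.0, 2.0, 2.0])
```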
Tailored deep learning algorithms are now paving the way for AD diagnosis. Basaia et al.15 proposed a 3D CNN consisting of 2 convolutional blocks with 5 × 5 × 5 filters and 10 blocks with 3 × 3 × 3 filters, using strided convolutions in place of max-pooling for downsampling. Their work achieved 74.8%, 75.1%, and 75.3% accuracy, sensitivity, and specificity, respectively, on the ADNI stable MCI (s-MCI) and MCI conversion (c-MCI) sets, and 85.9%, 83.6%, and 88.3% accuracy, sensitivity, and specificity, respectively, on the AD and s-MCI sets. Another study proposed a CNN for AD diagnosis and stratification16. This research not only facilitated fast and accurate AD diagnosis but also offered classification of normal, MCI, and AD patients. Additionally, the authors addressed the challenging task of stratifying MCI into very mild dementia (VMD), mild dementia (MD), and moderate dementia (MoD) stages, akin to prodromal AD. Their shallow network16 achieved an overall testing accuracy of 99.68%, which was greater than that of pretrained networks such as DenseNet121, ResNet50, VGG-16, EfficientNetB7, and InceptionV3. Although the dataset used was the Open Access Series of Imaging Studies (OASIS) dataset, their work demonstrates the importance of custom-trained networks in AD diagnosis. A fine-tuned CNN classifier17 called AlzheimerNet was shown to be capable of classifying Alzheimer's disease into five stages; with data preprocessing and augmentation, the method achieved 98.67% accuracy using the RMSProp optimizer. Considering the different patient groups used for diagnosing AD, six independent binary classifications based on a deep belief network (DBN) were proposed18: healthy control (HC) vs. AD, HC vs. pMCI, HC vs. sMCI, pMCI vs. AD, sMCI vs. AD, and sMCI vs. pMCI.
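The strided-convolution downsampling used in place of max-pooling15 can be illustrated in one dimension. This is a toy sketch, not the cited 3D implementation: a stride of 2 filters the signal and halves its spatial resolution in a single operation, with output length floor((n - k) / stride) + 1.

```python
def conv1d_strided(signal, kernel, stride):
    # Valid (no-padding) 1D convolution with a stride: each step skips
    # `stride` positions, so the layer downsamples without pooling.
    k = len(kernel)
    out = []
    for start in range(0, len(signal) - k + 1, stride):
        window = signal[start:start + k]
        out.append(sum(s * w for s, w in zip(window, kernel)))
    return out

# A length-8 signal with a length-3 averaging-style kernel and stride 2
# yields floor((8 - 3) / 2) + 1 = 3 outputs.
downsampled = conv1d_strided([1, 1, 1, 1, 1, 1, 1, 1], [1, 1, 1], 2)
```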
The modifications to the DBN18 include dropout and zero-masking to overcome overfitting, a preprocessing algorithm, principal component analysis for dimensionality reduction, and a multitask feature selection approach. Using the ADNI dataset, accuracies ranging from 87.78% to 99.62% were observed. Hazarika et al.19 replaced the downsampling layer in the traditional LeNet architecture with a fusion of min-pooling and max-pooling layers to retain both minimum-valued and maximum-valued signals. Their model achieved an accuracy, precision, recall, and F1-score of 98%, 96%, 97%, and 98%, respectively, on the ADNI dataset. In another work, a VGG-TSwinformer architecture20 that combines a VGG-16 convolutional neural network and a transformer network was proposed and validated on the ADNI sMCI and pMCI cohorts, with an accuracy, sensitivity, specificity, and AUC of 77.2%, 79.97%, 71.59%, and 0.8153, respectively. Similarly, another work21 found a hybrid architecture useful: the authors combined AlexNet with LeNet, varying the filter sizes among 1 × 1, 3 × 3, and 5 × 5, and reported scores as high as 96%, 93%, 93%, and 96% for accuracy, precision, recall, and F1-score, respectively. Another notable architecture is the multiplane convolutional neural network (Mp-CNN), which simultaneously processes the three planes, axial, coronal, and sagittal, of 3D MRI22. The Mp-CNN comprises 14 layers with rectified linear unit (ReLU) activation and softmax for multiclass classification, and it outperforms traditional 2D CNNs in multiclass classification involving AD, MCI, and NC. The Swinformer has also been explored23 as a transformer-based CNN architecture for AD classification. It combines a CNN module for planar feature extraction with a transformer encoder module for capturing 3D semantic connections, and the authors argued that the Swinformer captures local features more accurately.
The pipeline included data preprocessing and augmentation strategies, such as random rotation and mirror reflection, and recorded an accuracy of 88.3%.
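The min-max pooling fusion19 mentioned above can be sketched with a toy one-dimensional example. Averaging each window's minimum and maximum is an assumed fusion rule chosen for illustration, not necessarily the authors' exact operation; the point is that both extreme-valued signals survive downsampling.

```python
def fused_min_max_pool(values, window):
    # Slide a non-overlapping window over the signal and retain both
    # the minimum- and maximum-valued responses from each window,
    # fusing them by averaging (illustrative fusion rule).
    fused = []
    for i in range(0, len(values) - window + 1, window):
        chunk = values[i:i + window]
        fused.append((min(chunk) + max(chunk)) / 2)
    return fused

# With window size 2, [1, 4, 2, 8] pools to [(1+4)/2, (2+8)/2].
pooled = fused_min_max_pool([1, 4, 2, 8], 2)
```

Plain max-pooling would discard the 1 and 2 entirely; the fused variant keeps a trace of both low and high responses in each region.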
While transfer learning with state-of-the-art pretrained models is clearly a promising technique for diagnosing AD, tailored deep learning algorithms outperform traditional methods and appear better at preserving the underlying structure of the data for diagnosing AD. However, none of these works have captured the structural dynamics of neurodegeneration in the brains of individuals with MCI or AD, which leaves room for additional work. Therefore, this paper seeks to bridge this research gap and makes the following contributions to AD research:
- We propose a machine learning framework that integrates the novel SNeurodCNN architecture for modelling the structural neurodegeneration of the brain's cerebral cortex and for the task of discriminating between MCI and AD.
- We investigate whether the varying viewpoints of the two planes of the sagittal axis, midsagittal and parasagittal, provide differing insights into structural neurodegeneration.
- We investigate the sensitivity of SNeurodCNN to brain neurodegeneration, which is relevant for identifying digital biomarkers (digi-biomarkers) for the regions of the brain where structural neurodegeneration is prevalent.