2.1 Data
This study uses secondary data taken from one of Google's open-source sites, Kaggle. This data is published by the Open Access Series of Imaging Studies (OASIS). OASIS is a freely distributed collection of neuroimaging datasets for scientific development that aims to make brain MRI data sets freely available to the scientific community with the hope of facilitating and driving future discoveries in basic and clinical neuroscience. OASIS is provided by the Washington University Alzheimer’s Disease Research Center, Dr. Randy Buckner at the Howard Hughes Medical Institute (HMMI) at Harvard University, the Neuroinformatics Research Group (NRG) at Washington University School of Medicine, and the Biomedical Informatics Research Network (BIRN).
There are two OASIS datasets, namely OASIS Cross-sectional and OASIS Longitudinal. This data has been used for data analysis, making anatomical references, and developing algorithms for segmenting brain regions (10). A cross-sectional dataset consisting of a cross-sectional collection of 416 subjects aged 18 to 96. All subjects were right-handed and included both men and women. 100 subjects over 60 years of age had been clinically diagnosed with very mild to moderate Alzheimer's disease. In addition, a reliability data set was included containing 20 independent subjects who were imaged on subsequent visits within 90 days of their initial session.
The longitudinal dataset consisted of a longitudinal collection of 150 subjects aged 60 to 96. Each subject was scanned at two or more visits, separated by at least one year for a total of 373 imaging sessions. All subjects were right-handed and included both men and women. 72 subjects were categorized as non-demented during the study. 64 of the included subjects were marked as dementia at their baseline visit and remained so for subsequent scans, including 51 with mild to moderate Alzheimer's disease. Another 14 subjects were categorized as having no dementia at their first visit and then characterized as dementia at the next visit. The dataset also contains additional important information including age, gender, chief complaint, systolic blood pressure, diastolic blood pressure, pulse, and axial head length and head width.
In this study, only the OASIS Longitudinal dataset is used for the training model process using the neural network. Table 1 describes the parameters to be used in the system diagnosis training process.
Table 1. Parameters
No
|
Parameters
|
Example
|
1
|
Name
|
John
|
2
|
Gender
|
Male/M
|
3
|
Age
|
73
|
4
|
Years of education
|
15
|
5
|
Socioeconomic status
|
3
|
6
|
MMSE
|
18
|
7
|
Category
|
Moderate
|
2.2 Preprocesing
The OASIS Longitudinal dataset consists of 373 rows of data of people with dementia and non-dementia which include data such as Subject ID, MRI ID, Group, Visit, MR Delay, M / F, Hand, Age, EDUC, SES. MMSE, CDR, eTIV, nWBV, and ASF. In this study, to create an early detection system for dementia, only a few data were used, such as age, M/F (Gender), EDUC (Years of education), SES (Socioeconomic Status), MMSE (Mini-Mental State Examination), and Group (Converted / Demented / Nondemented) because the data can be obtained easily without the need to go to the hospital so this can make it easier for users to test for early detection of dementia.
In the dataset, 19 rows are identified with missing values in the SES column and 2 rows with missing values in the MMSE column. To solve this problem we have 2 approaches. One is dropping rows with missing values and the other way is replacing missing values with the corresponding values, or known as 'Imputation'. Since the dataset contains only 373 data sets, we assume that imputation will help our model perform better. For the imputation we use mean values because SES and MMSE are discrete variables, so to fill in the null columns we calculate the mean value in that column and fill in the mean value in the null columns.
There are 3 classes in the dataset, namely Nondemented, Demented, and Converted. Converted is a nondemented class which after testing again the result is demented. To make it easier, for converted classes to be replaced with demented so that the dataset only has 2 classes. For the results of the MMSE test, points 24-30 are categorized as Normal, points 19-23 are categorized as Mild, points 10-18 are categorized as Moderate, and points 0-9 are categorized as Severe. The dataset is normalized using Standard Scaler as a feature scaling.
2.3 Artificial Neural Network (ANN)
An artificial neural network consists of processing units called neurons. Artificial neurons try to mimic the structure and behavior of natural neurons. A neuron consists of input or dendrite, and one output or synapses via axons. Neurons have a function that determines neuron activation. x1 ... xn is the input to the neuron. Bias is also added to neurons along with input. Usually, the bias value is initialized to 1. W0 ... Wn is the weight. Weight is the connection to the signal. Product weight and input provide signal strength. A neuron receives many inputs from different sources and has one output. There are various functions used for activation. One of the most commonly used activation functions is the sigmoid function, which is given by

The other functions used are Step Function, Linear Function, Ramp Function, Hyperbolic Tangent Function. The hyperbolic tangent (tanh) function is similar in shape to the sigmoid, but the limit is from -1 to +1, unlike the sigmoid which is from 0 to 1. The sum is the sum of the input weights multiplied by the weight between one layer and the next. The activation function used is the sigmoid function, which is a continuous approximation and can be distinguished from the step function (11). Such interconnections of individual neurons form a neural network.
ANN architecture consists of:
- input layer: Receives the input value
- hidden layer (s): A set of neurons between the input and output layers. There can be one or more layers
- output layer: Usually it has one neuron, and the output is between 0 and 1, that is, greater than 0 and less than 1. But multiple outputs can also be present (12).
Processing capability is stored in the connection strength between units, which is called weights (13). The input power depends on the weight value. Weight values can be positive, negative, or zero. A negative weight means the signal is reduced or inhibited. Zero weight means there is no relationship between the two neurons. Weights are adjusted to get the desired results. There is an algorithm to adjust the weight of the ANN to get the required output. This weight adjustment process is called learning or training (11).