ASD is an illness that normally occurs in early childhood, making it difficult for children to communicate and socialize [1, 2]. ASD, one of our most prevalent childhood disorders, cannot be treated today, and the definitive diagnosis process can take an extended duration. In addition, the prevalence of ASD is increasing each year [3, 4].
Despite an extensive range of signs of ASD [5], a complication that prolongs the diagnosis process is the high rate of comorbidity. The comorbidity problem in children with ASD means an extra disability like a vision problem or another health problem [6]. The lack of a known cure for autism, the long diagnosis and treatment process [7], and the high degree of comorbidity all indicate that more scientific work is needed on autism [8. There is an important need to study the influence of age and gender factors on ASD diagnosis and to evaluate the possibility that multiple classifications, including age and gender, may contribute to the rapid early diagnosis of ASD. Recent genetic works show that ASD occurs differently between males and females and between youths and adults [9]. Artificial intelligence and machine learning (ML) techniques [10, 11] such as DL provide fresh opportunities to discover biomarkers for diagnosis of ASD taking into account factors like age and gender that affect ASD, to shorten the diagnostic process of ASD, to avoid subjective opinions of different doctors and possibly reach a definitive diagnosis [12–14].
DL techniques have found extensive application in medical and neurological fields such as seizure detection [15], seizure prediction [16–18], epilepsy diagnosis and classification [19, 20], autism [21–23], optimization of neuroprosthetic vision [24], post-stroke rehabilitation with motor imagery [25], sentiment analysis [26], emotion recognition [27, 28], patient-specific quality assurance [29], classification of the intracranial electrocorticogram [30], brain-computer interface (BCI) for discriminating hand motion planning [31], and many other fields such as mobile robots [32], drone-based water rescue and surveillance [33], and structural health monitoring in recent years [34–36].
The design and effectiveness of a DL method for diagnosing ASD varies according to the data set. The data set can be numeric or two-dimensional graphical, or visual data. Numerical data can be behavioral [37, 38], eye-following [39], or fingerprint data [40–42], converted into numerical data by pre-processing. Optical data are brain structural magnetic resonance scanning images (sMRI) or brain functional magnetic resonance scanning images (fMRI). Using numerical or visual data to train an ML algorithm for ASD diagnosis is ordinarily possible by determining the distinguishing features or using an automated feature extraction technique [43–45]. These features may be structural gray matter (GM) values acquired from cortical thickness (CT) [46–48], GM density (GMd) values from voxel-based morphometry (VBM) [49], diffusion-weighted imaging (DWI) [fractional anisotropy (FA)] in white matter (WM)) microorganism changes [50], connectivity matrices [51], parameters from network analysis [52–54], and resting/duty state fMRI information [55, 56]. However, if a type of DL known as convolutional neural network (CNN) is utilized, direct classification is performed because feature extraction is done automatically. This is known as end-to-end deep learning [57]. For this reason, the CNN method is employed in this research as the most suitable method for rapid diagnosis of ASD.
In the study, the influence of a certain age range and gender on the diagnosis of ASD is examined by performing multiple classifications of ASD based on age and gender. A DL system has been introduced that can diagnose ASD for certain age ranges and gender. The advantages and differences of the current research compared to previously-reported research on ASD diagnosis, binary classification, and/or multiple classification works can be listed as follows. First, multiple classifications, including age and gender, were performed in this study, and to the best of the authors’ knowledge, this has never been done before. Second, compared to other works that employ a DA method, the number of image data in this study is huge and acquired from different brain regions. This is advantageous in terms of the generalizability of the models. Third, CNN was designed from scratch and utilized as a system element in this study. Thus, feature extraction is done automatically. Fourth, using a transfer learning (TL) method, today's popular pre-trained models were trained and tested with the same data set.
The following sections are organized as follows. In the next section, works on ASD classification using brain MRI images, which also considered other factors like age and gender, are discussed. The third section explains the techniques and materials utilized in the study. In the fourth section, metrics used to evaluate the performance of the study are presented. The fifth section reports the numerical experimental results acquired from the study. The paper ends with discussions and a conclusion.