2.1.1 Advanced Machine Learning Techniques to Assist Dyslexic Children for Easy readability: [1]
This research focuses on a review of software and hardware choices that can assist dyslexic children. Machine learning techniques used include K Means, K Nearest Neighbor, Adaptive clustering, LS Algorithm, Support Vector Machine, and Human Markov Model. To extract speech features and measure accuracy and performance improvement in children, machine learning methods are often used. As a result, the research will concentrate on children aged five to seven who have trouble reading Hindi words. Machine learning techniques will be employed in the design process to assist dyslexic children. The letters two and three are utilized. With Hindi words as input, the system is trained with a dynamic Time Wrapping method. Once you’ve programmed the words into the system, you’re ready to go. Words will be taught to a dyslexic child. A child will be given another chance to read if he or she pronounces a word wrong the first time. If the same event occurs three times, the system will repeat the word out, along with an image, to ensure that the child learns the term. The child’s word for another 20 minutes, the same session with different words will be repeated for the child. To recognize speech in this scenario, machine learning techniques will be used. If the system is functioning properly, the child said the word. For another 20 minutes, the same program will be performed with different words for the child. In this situation, machine learning techniques will be used to recognize speech. The Dynamic Time Wrapping technique yields 90 percent to 100 percent accuracy if the system is evaluated with the same user, and 30 percent accuracy if the system is tested with a new user.
2.1.2 Machine Learning based Learning Disability Detection using LMS: [2]
This research focuses on an E-learning system built with Moodle, an open-source Learning Management System (LMS) that enables tutors and students to work together more efficiently. This method recognizes couple of student profiles: students with and without Learning Disability (LD) by using keen courses constructed base on the topic of numerous features of an LD student (non-LD). Several elements of our informal testing approach for collecting learning aspects for Dyslexic students are also included in this paper. The first stage, data collection, is divided into two parts: the first group is the group having age between 8 to 10 years with a smaller number of constraints, and the other one is with age group of 11 to 13 years (Class of 6 to 8) with a greater number of parameters. On the computer, a speech-to-text (STT) conversion was done. Natural Language Processing was used to record the consumers’ audio responses (NLP). The Python programming language was used to analyse the responses. To detect whether or not a user has LD, machine learning (ML) is used (Dyslexia in this case). The dataset’s two classifications are LD and non-LD, and the binary classification is done using machine learning techniques like SVM (Support Vector Machine) and LR (Logistic Regression). The results are shown for both approaches, and a comparison of the datasets generated in another method for collecting parameters using NLP it shows that the dataset generated in the final technique is far better and accurate in case of accuracy level of comparisons. The LR method for ML outperforms SVM when it comes to detection based on the created dataset.
2.1.3 A survey paper on learning disability prediction using machine learning: [3]
This is a survey essay designed to learn about prior work on the subject and to find gaps that operational for various machine learning systems. A variety of machine learning algorithms are used to predict whether or not a child has a learning disability. It is easier to deal with such problems if a prognosis is given sooner rather than later. A comparison of existing machine learning algorithms for a particular data set can be used to see which one offers the best accurate prediction results. It is impossible to overestimate the importance of the pre-processing stage in preparing data for prediction. Several new strategies can be formalized to help forecast better outcomes.
2.1.4 Machine Learning and Dyslexia: Diagnostic and Classification System (DCS) for Kids with Learning Disabilities:[4]
In this study, they proposed an automated diagnosis and categorization method. The system was trained using pre- classified data from 857 school children’s spelling and reading scores. The twenty-fifth percentile was applied to the scores in order to categorize them. Dyslexia was diagnosed in children who scored below the twenty-fifth percentile, while non- dyslexic children scored above the twenty-fifth percentile. The diagnostic module is a prescreening tool that experts, trained users, and parents can use to detect dyslexia symptoms. The students are divided into two groups in the second module, classification: non-dyslexics and dyslexics with suspicion of dyslexia in spelling and reading. A research study implement is the third unit. In the final results it implies that 23 percent of children were at peril for dyslexia the LD in the training data and 20.7 percent in the testing data, with a 98 percent accuracy.
2.1.5 Diagnosis of Dyslexia using computation analysis: [5]
This study examines the use of a computer system to diagnose dyslexia, taking into account people’s difficulties in either writing, reading or in the speaking. As a result, a computer-based classifier can be built using dyslexia system of measurement approaches. As a result, the Gibson brain skills test will be used, which will consider the effects of working memory, auditory and filmic memory and thought, filmic and hearing sensitivities, inscription and motorized aids, math and time organization, performance, well-being, growth, and personality, and mental skill in people with LDs, particularly reading problems. Computational study with classifiers will be used to analyse the recommended dataset, which contains around eighty archives of children. This computing model was created by and used to aid in the uncovering of underlying issues that may interfere with learning to read and then write, as well as concerns that may cause difficulty with memorized understanding. This approach is utilized to help therapists and paternities know the problem and guide children down the right road to academical achievement.
2.2 Limitations of existing systems
The papers referred above have the following limitations: Behavioral elements of members during homogenous tests, such as writing, reading or working memory, are examined by psychologists in traditional dyslexia diagnosis procedures. Where as in ADHD suffering peoples have problems on visual attentions they are identified by their low scores on these exams. However, because symptoms differ between people, these procedures are generally time-consuming and useless for a wide group of people.
2.3 Problem statement
Identify and evaluate the most commonly practiced techniques used to classify the EEG data associated with learning disabilities, and to evaluate the accuracy for various machine learning algorithms and identify which model is well suited for the specified dataset.
2.3.1 Dyslexia: The primary step is to identify the procedure is to demeanor a user survey and get information from them. In traditional dyslexia diagnosis techniques, psychologists assess participants’ behavior throughout uniform exams, such as writing or writing skills along with, phonological awareness, and working memory. [
1,
5] Low results on these tests are used to identify dyslexics. However, because people’s symptoms vary, these methods are frequently time-intense and unsuccessful for a large group of people. As a result, academics are increasingly turning to machine learning techniques, which are less time intense and low-cost.
2.3.2 ADHD: ADHD patients may also have trouble focused on a solo task or sitting ideal for long time period. Anyone of any age can be affected by ADHD
EEG signal features can detect dyslexia, ADHD, dementia, sleeping problems, depression, and other brain illnesses. EEG headsets monitor brain activity by placing electrodes in an array along the user’s or research subject’s scalp.
2.4 PROPOSED SYSTEM
2.4.1 Data collection
Due to the pandemic situation, it was not anticipated for us to build our dataset using sampling of EEG signals of people woe for LDs. Instead, we search for an online benchmark data resource. The first dataset carries the information of dyslexic and non-dyslexic students which is available on Skit Learn. For dyslexia identification the dataset we have selected is unbiased dataset of 28 dyslexic and 21non dyslexic (25 male and 24 female). For the test purpose memory, vocabulary, speed, visual discrimination, audio discrimination feature is selected to discriminate between dyslexic and non-dyslectic persons. And the second dataset for ADHD and non-ADHD students was submitted by Ali Motie Nasrabadi, [14] it consists of unbiased data of 61 children with ADHD and 60 healthy controls (boys and girls, ages 7–12) on IEEE data port. For ADHD identification, on the basis of EEG recordings, children with ADHD and children without ADHD are classified. The data is broken down into four sections: ADHD part 1, ADHD part 2, Control part 1 and Control part 2. Each portion is made up of a number of mat files, each of which corresponds to a person’s EEG data. In the task, a set of pictures of cartoon characters was shown to the children and they were asked to count the characters. The number of characters in each copy was arbitrarily selected from 5 to 16 in numbers, and the magnitude of the copy/pictures was big sufficient to be simply observable and countable by children Thus, the duration of EEG recording throughout this cognitive visual task was dependent on the child’s performance (i.e., visual attention and response speed).
2.4.2 Pre-processing, feature extraction and feature selection
The dataset must be preprocessed and screened previously applying machine learning algorithms. This entails converting the data into a number or qualitative/textual format. To locate useful attributes and eliminate nulls, pre-processing is utilized. Following the initial processing step, the feature elimination/extraction technique, in which relevant features are recognized and a variety of standards is assigned. Here each frequency channel band is a feature and the threshold value of each wave is their respective frequency as stated.
We accomplish three types of component analysis on this dataset and the outcomes are specified later in this paper.
- Principal component analysis
- Independent component analysis
- Linear discriminant analysis
2.4.3 System training and classification
ML Algorithms: Random Forest Classifier, Decision Tree Classifier, Linear Regression, XGB Classifier, support vector machine (SVM), K-Nearest Neighbor (K-NN), and are among the algorithms employed after the component analysis.
2.4.4 Performance evaluation
In the performance evaluation, Python-based tools are employed. In this scenario, accuracy score is utilized to assess the performance of machine learning-based on detection of LDs systems.