This study proposes a method that can predict the cognitive level of subjects based on resting-state EEG and validates the feasibility of the method on a large sample data set, obtaining relatively good predictions. The evaluation model has easy input data collection advantages, low algorithm complexity, and high model interpretability. The method may contribute to the early screening of cognitive decline, the assessment of the effectiveness of therapeutic interventions for neurodegenerative diseases associated with cognitive decline, mainly Alzheimer's disease, the development of drugs for related diseases, and the evaluation of drug efficacy.
One question worth discussing is why many single types of features do not distinguish the three groups well simultaneously, while once they are combined in a specific form, satisfactory classification accuracy can be obtained. Schapire proves that if the target is weakly learnable, it must be possible to make it strongly learnable in some form based on the significant number theorem (Schapire, 1990). In practice, to achieve this goal, the errors of each weak classifier need to be independent of each other. That is, different weak classifiers can provide different information to achieve the error reduction of strong classifiers. The core purpose of this study's sample weight adjustment strategy using the Adaboost method is to ensure an even distribution of correctly classified samples, ultimately making the number of correctly classified samples more than the number of incorrectly classified samples. The results (Fig. 8) of the weak classifier for the predicted samples point out that the starting classification distribution of the samples in this problem is relatively uniform, making it easier to improve the accuracy during further iterations of the weights. In addition, the study also correlated the classification result labels of each weak classifier (Fig. 9). The results showed that each weak classifier did not perform the same, and there was no weak class of classifiers with similar information contributions that could be replaced or removed.
Specifically, in this study, the reason for the success of integrated learning is that the different features pinpoint different levels of variation and increase the amount of valid information at each iteration. The results show that the microstate sequence-related features perform relatively well in the differentiation of the three populations, which may be due to the high temporal resolution of the EEG that allows for the mining of fine-grained systemic transformations that are highly correlated with disease and abnormal activity (Khanna et al., 2015). Spatial network features performed best in distinguishing MCI from dementia but were average for distinguishing HC from MCI. This may be because the structural changes accompanying early neurological decline are not very significant, while the structural differences become increasingly significant as the system collapses (Elmore, 2007).
Another advantage of this work is that the data used contains three different acquisition devices (NeuroScan, Nuracle, BP). The combination and processing based on the feature level are not device dependent and do not require device consistency (Table 2 and Fig. 11), so a multi-device acquisition dataset has no impact on the model building of this work. This study also researches the distribution of the true and predicted values of the prediction sample by an amplifier to ensure the approach is pervasive and cross-device capable. The results showed that the distribution tests between the true and predicted values of the test samples from the three amplifiers did not show any difference (p > 0.1). Furthermore, the absolute prediction error for all three amplifiers is close to the absolute prediction error for the full sample (Total error: 4.63; NeuroScan error: 4.13; Neuracle error: 4.33; BP error: 5.23). The advantage of multiple devices, however, is the increased diversity of data, which also contributes to the generalizability and credibility of the model and establishes a sound basis for future algorithm applications (Cheng et al., 2012).
Table 2
Predictive distribution test for different amplifiers
Amplifiers | Mean absolute error | True Value Distribution Inspection | Predicted Value Distribution Inspection |
NeuroScan | 4.13 | NeuroScan & Nuracle p = 0.32 | NeuroScan & Nuracle p = 0.37 |
Nuracle | 4.33 | NeuroScan & Nuracle p = 0.40 | NeuroScan & Nuracle p = 0.31 |
BP | 5.23 | NeuroScan & Nuracle p = 0.21 | NeuroScan & Nuracle p = 0.20 |
Considering the strong correlation between cognitive decline and aging, the contribution of age in the model obtained by fitting based on cognitive decline is also worth discussing; after all, the study does not expect to obtain only one age predictor. For this reason, this study used the same Adaboost method to fit the age and predict the test sample's age. After obtaining the predicted age corresponding to the actual clinical labels of the subjects, it was found that the results of the model fitting seemed to have some age correlation. However, it can be seen from the figure that the results of the model fit do not all depend on age, as there are normal subjects with a high predicted age and demented subjects with low predicted age (Fig. 12).
The method has strong interpretability, which is crucial for clinical purposes. The results of the weak classifier show that the time domain features are relatively well classified, followed by the space domain features and again by the frequency domain features. This fully reflects the high temporal resolution of the EEG signal, which also indicates that the three categories of HC, MCI, and dementia, that is, the process of cognitive decline, differ more significantly in the transformation of neural activity patterns in the temporal dimension. This part of the results is also reflected in the paper. In particular, early cognitive decline, which is poorly identified subjectively, can be better detected by this model.
This study also describes the pattern of cognitive decline in three dimensions, spatiotemporal and frequency, to enhance the interpretability of the evaluation system. Combining the results of this study with previous work, the study concluded that cognitive decline follows four rules: strong and constant strong non-monotonic changes, decreased efficiency, and reduced stability.
The strong and constant strong means the entire neural system does not change uniformly, with strong connections and strong active areas surviving the degradation process better. These strong active areas or strong connections may characterize the person's essential activity capacity. For example, preserving parietal connections (Sun et al., 2021) may be related to central control and basic perception.
Another critical point is that cognitive decline is not a monotonic change, and there could be repair and compensation of the neural system, especially in the early stage(Cabeza et al., 2002). As can be seen from the results, cognitive decline occurs at an accelerated rate, with MCI to dementia changes far exceeding HC to MCI. There is even a tiny amount of additional connectivity and enhanced information from HC to MCI. This may also account for the mediocre performance of linear models for prediction in some studies (Wolz et al., 2011).
Decreased efficiency is also a primary phenomenon. Cognitive decline characterizes the degradation of the nervous system, with reduced activity and information implying reduced processing of external input (Petersen & Posner, 2012). This correlates with the slowness of response, attention deficit, and repetition of questions and answers exhibited by patients with cognitive decline.
Finally, cognitive decline will reduce stability. The normal brain is a stable dynamic equilibrium system, and cognitive decline disrupts this equilibrium (Buckner et al., 2009). Combined with the findings, the jumps in state and the diminished persistence indicate a system disruption. Some patients with cognitive decline exhibit emotional abnormalities, and communication difficulties may be associated with this.
In summary, the study in this paper also provides an idea of constructing a generic EEG activity space with large samples and labels, such that the mapping can characterize various neural activity changes. With the help of some techniques to increase the signal-to-noise ratio of EEG (Alyasseri et al., 2022; Sun et al., 2022), such a framework could extract information from resting-state scalp EEG more effectively. The cognitive decline mapping proposed in this study is only one part of a larger neural activity space. The construction of such a mapping system will be helpful for neurological function assessment, disease development evaluation, clinical target research, and drug effect testing.