The error rate, reaction time and spatial error in the three tasks correlate with MMSE and MoCA scores, respectively:
An intuitive way to examine the reliability of a novel technique is to make comparisons with well-established systems. Thus, we first seek to determine whether there is a correlation between the performance of saccades in the three tasks and MMSE and MoCA scores. Here, we focus our analysis on three saccadic parameters, i.e., the error rate, reaction time and spatial error, because they reflect the capability, efficiency and accuracy of a subject to perform the tasks. In MGS, there are two different delay intervals (1000 and 600 ms). The 600 ms delay is only used in 41 patients at the beginning of this experiment. Since there was no significant difference in error rate, reaction time or spatial error between the two groups of subjects who performed MGS with 1000 ms or 600 ms delays, their data are combined for further analysis.
The results of the correlation analysis between error rates and MMSE and MoCA scores in the three tasks are shown in Fig. 2. Notably, the error rates in all three tasks are negatively correlated with MMSE and MoCA scores, which indicates that the majority of subjects with higher MMSE and MoCA scores have a higher capability to perform the saccadic tasks, and vice versa. However, the correlation is much weaker in PS (MMSE: p < 0.01, r = -0.15, Fig. 2A; MoCA: p < 0.01, r = -0.11, Fig. 2B) than in AS (MMSE: p < 0.01, r = -0.44, Fig. 2C; MoCA, p < 0.01, r = -0.50, Fig. 2D) and MGS (MMSE, p < 0.01, r = -0.44, Fig. 2E; MoCA, p < 0.01, r = -0.49, Fig. 2F). Moreover, there are subjects with low MMSE and MoCA scores who have a low error rate in PS but not in AS or MGS. Such results are understandable because correctly performing PS requires the involvement of a very limited cognitive function, if at all. In contrast, correctly performing AS and MGS requires the involvement of more cognitive functions, e.g., inhibition, spatial calculation and working memory. Thus, the error rates in AS and MGS reflect the state of cognitive functions more veritably than in PS.
Moreover, the reaction times and spatial errors are also negatively correlated with both MMSE and MoCA scores. The results of the correlation analysis are presented in Figs. 3 and 4, respectively. The figure formats are the same as in Fig. 2. While reaction times weakly but significantly correlated with MMSE and MoCA scores in PS (MMSE: p < 0.01, r = -0.18, Fig. 3A; MoCA: p < 0.01, r = -0.26, Fig. 3B) and AS (MMSE: p < 0.01, r = -0.21, Fig. 3C; MoCA, p < 0.01, r = -0.24, Fig. 3D), the correlation did not reach a significant level in MGS (MMSE, p = 0.06, r = -0.01, Fig. 3E; MoCA, p = 0.15, r = 0.06, Fig. 3F). Considering the fact that the process of visuomotor transformation may have been completed during the delay interval in MGS, the reaction time in MGS is not proper to examine the efficiency of signal transformation. In addition, the standard deviation of reaction times in the three tasks are all negatively correlated with the MMSE and MoCA scores (Fig. 3G-L). Such results indicate that subjects with higher MMSE and MoCA scores generally have higher efficiency and less variation to transform the signal from sensory input to motor output.
The spatial errors in the three tasks show a weak but significant correlation with both the MMSE and MoCA scores. The results of the correlation analysis are shown in Fig. 4A (MMSE: p < 0.01, r = -0.10) and 4B (MoCA: p < 0.01, r = -0.09) for PS; in Fig. 4C (MMSE: p < 0.01, r = -0.14) and 4D (MoCA: p < 0.01, r = -0.16) for AS; and in Fig. 4E (MMSE: p < 0.01, r = -0.22) and 4F (MoCA: p < 0.01, r = -0.26) for MGS. In addition, the standard deviation of spatial errors in the three tasks all negatively correlated with the MMSE and MoCA scores (Fig. 4G-L). Such results indicate that subjects with higher MMSE and MoCA scores generally have higher spatial accuracy (i.e., lower spatial error) and less variation to perform saccadic tasks.
A group of subjects have high MMSE and MoCA scores but also a high error rate in three saccadic tasks:
Interestingly, the results of correlation analysis show that a group of subjects with high MMSE and MoCA scores (MMSE score ≥ 27, MoCA score ≥ 24) have very high error rates in the three saccadic tasks. To understand the reasons behind what may have caused such inconsistency, we separated the subjects with high MMSE and MoCA scores into two groups based on their error rates in each saccadic task. The classifying criterion is set as the mean error rate of these subjects minus one standard deviation. The red and blue rectangles in Fig. 5A, 5F and 5J show the separated two groups of subjects, respectively, i.e., group one with an error rate lower than the criterion and group two with an error rate higher than the criterion. Next, we investigate the reasons for the high error rate in subjects of group two by analyzing the error types of saccades in the three tasks. We found that the primary error in PS was the inaccurate saccadic endpoint related to the saccadic goal location, i.e., the spatial error (Fig. 5B). Two exemplified sessions of eye traces from a normal subject (upper panel in Fig. 5C) and a subject with a high error rate show that the spatial error in the later subject (lower panel in Fig. 5C, magenta traces) is larger than that of the former subject. This phenomenon is also true for the averaged spatial error analysis between the two groups of subjects (p < 0.01, rank sum test, Fig. 5D).
The primary error type of group two subjects in AS is the unsuppressed reflexive saccade toward the location of the visual cue (Fig. 5F). Two exemplified sessions of eye traces from a normal subject (upper panel in Fig. 5G) and a subject with a high error rate show that the unsuppressed reflexive saccade occurs more frequently in the later subject (lower panel in Fig. 5G, brown traces) than in the former subject. This phenomenon is also true for the averaged rate analysis of unsuppressed reflexive saccades between the two groups of subjects (p < 0.01, rank sum test, Fig. 5H).
Unlike PS and AS having a single primary error type, there are three major error types for group two subjects in MGS, including spatial error, missing error and error of unsuppressed reflexive saccade (Fig. 5J). Four examples of eye traces are shown in Fig. 5K. From top to bottom, each panel represents eye traces from a normal subject, a subject making a large proportion of spatial errors (magenta traces), a subject making the greatest number of missing errors (green traces) and a subject making a large proportion of unsuppressed reflexive saccades (brown traces). The average rates of the three error types are significantly higher in group two subjects than in group one subjects (p < 0.01, rank sum test, Fig. 5L).
The results of saccadic error type analysis from the data of three tasks indicate that (1) compared with MMSE and MoCA tests, the remarkable advantage of measuring saccades is the ability to test movement-related functions; (2) each of these three tasks preferentially tests different cognitive and saccade related functions. Thus, we assume that the accuracy of evaluating brain function by analyzing combined saccadic parameters in multiple saccadic tasks is higher than that of a single saccadic parameter in a single task.
The discrimination of different groups of subjects is better using the combination of saccadic parameters from multiple tasks than a single saccadic parameter in a single task:
To examine the assumption that the evaluation of brain function with the combination of saccadic parameters from multiple tasks is more accurate than a single saccadic parameter in a single task, we build a weighted multiple saccadic parameter model (see methods for details). Then, we compare the efficiency of the model with the two most sensitive parameters (error rates in AS and MGS) in discriminating different groups of subjects. While both error rates in AS and MGS can discriminate different groups of subjects other than ESHC and SMCI (Fig. 6A-B), the output of the model can discriminate all five groups of subjects (Fig. 6C). Moreover, an ROC curve analysis shows that the discrimination of patients with cognitive impairment (MCI and dementia) from EHC is better by using the output of the model (AUC = 0.961, 95% CI 0.920–0.985) than the error rate in AS (AUC = 0.918 95% CI 0.865–0.954) and MGS (AUC = 0.909 95% CI 0.855–0.948; pairwise comparison of ROC curves by z test: AS error rate vs. model score, p = 0.0061, MGS error rate vs. model score, p = 0.0058) (Fig. 6D). In addition, the discrimination of patients with cognitive impairment (MCI and dementia) from the remaining three groups of subjects (EHC, ESHC and SMCI) is better using the output of the model (AUC = 0.892, 95% CI 0.853–0.925) than the error rate in AS (AUC = 0.836 95% CI 0.790–0.875) and MGS (AUC = 0.839 95% CI 0.793–0.878; pairwise comparison of ROC curves by z test: AS error rate vs. model score, p = 0.0003, MGS error rate vs. model score, p = 0.0077) (Fig. 6E). These results support our previous assumption.
The diagnostic performance of saccade measurements is more accurate than the MMSE test:
We directly compared the diagnostic performance between the output of our model and the MMSE test by employing ROC curve analysis, to assess the likelihood of discriminating patients with cognitive impairment from control subjects. When discriminating patients with cognitive impairment (MCI and dementia) from EHC, our model showed better accuracy (model: AUC = 0.961, 95% CI 0.920–0.985, Youden index at the optimal cutoff point = 0.802, sensitivity = 81.63%, specificity = 98.59%; MMSE: AUC = 0.919, 95% CI 0.867–0.956, Youden index at the optimal cutoff point = 0.745, sensitivity = 74.49%, specificity = 100%). Pairwise comparison of ROC curves by z test between the model and MMSE resulted with p = 0.0680. (Fig. 7A). When discriminating patients with cognitive impairment (MCI and dementia group) with the remaining three groups (EHC, ESHC and SMCI), our model showed better accuracy (model: AUC = 0.892, 95% CI 0.853–0.925, Youden index at the optimal cutoff point = 0.664, sensitivity = 86.79%, specificity = 79.59%; MMSE: AUC = 0.865, 95% CI 0.853–0.925, Youden index at the optimal cutoff point = 0.603, sensitivity = 85.85%, specificity = 74.49%). However, pairwise comparison of ROC curves by z test between the model and MMSE resulted with p = 0.3017. (Fig. 7B).
Finally, there were 27 cognitive impairment patients with positive positron emission tomography (PET) beta-amyloid accumulation imaging test results. These 27 patients met the research diagnosis criteria of MCI due to AD according to the National Institute on Aging Alzheimer’s Association workgroups in 2011 (30). Among these 27 patients, our model resulted in a true positive rate of 92.6% (25 out of 27), while the MMSE resulted in a true positive rate of 85.2% (23 out of 27).
Taken together, the results mentioned above, to some extent, indicate that the diagnostic performance of measuring saccades is better than that of the MMSE test, although some of results do not reach statistical significance.