Participants
Patients with aMCI were recruited via a registry at the Affiliated ZhongDa Hospital of Southeast University [5, 17]. From this registry, 87 aMCI patients were included; all of them were Chinese Han and right-handed. All patients were recruited through a normal community health screening and newspaper advertisements, and they underwent a standardized clinical interview, a neuropsychological battery assessment, and multimodal brain MRI examinations. The detailed inclusion and exclusion criteria are described in our previous publications [4, 5]. The Research Ethics Committee of the Affiliated ZhongDa Hospital of Southeast University approved this study, and written informed consent was obtained from all participants.
In the present study, we selected the data of 50 aMCI patients who completed visits at two time points (T1 and T2) with 3 years between them. Notably, at follow-up (i.e., the T2 time point, 3 years later), both the clinical/behaviour assessment and parameters of MRI scanning were identical to those conducted at baseline (i.e., the T1 time point). All aMCI patients included in this study had no excessive motion artifacts (i.e., exceeding 3 mm in translational movement or 3° in rotational movement) during MRI scanning or incomplete image coverage. Table 1 presents the aMCI patients’ demographic information and neuropsychological performances covering the episodic memory domain.
Table 1
Demographic and clinical information.
| aMCI (N = 50) |
Baseline | 3-year Follow-up |
Age (years) | 68.0 ± 7.3 | 70.1 ± 7.3 |
Education (years) | 11.9 ± 3.4 | - |
Gender (male/female) | 30/20 | - |
MMSE | 27.08 ± 2.03 | 26.30 ± 3.22 |
MDRS-2 | 133.74 ± 5.76 | 125.40 ± 20.65 |
AVLT-immediate recall | 15.7 ± 3.5 | 14.4 ± 4.4 |
AVLT-recognition | 19.6 ± 2.4 | 18.8 ± 4.2 |
Episodic Memory | 6.53 ± 2.75 | 6.40 ± 3.35 |
AVLT-delayed recall | 2.78 ± 1.54 | 2.46 ± 2.38 |
LMT-delayed recall | 3.34 ± 2.26 | 2.60 ± 2.02 |
CFT-delayed recall | 13.47 ± 6.03 | 14.14 ± 6.64 |
Data are presented as the mean ± standard deviation. |
Abbreviations: aMCI, amnestic mild cognitive impairment; MMSE, mini-mental state examination; MDRS-2, mattis dementia rating scale-2; AVLT, auditory verbal learning test. LMT, logical memory test; CFT, Rey-Osterrieth complex figure test. |
Neuropsychological Examination
For all aMCI patients, we assessed their general cognitive function using the MMSE and Mattis Dementia Rating Scale-2 and administered a neuropsychological battery to evaluate their episodic memory performance. This battery consisted of the AVLT-DR, the logical memory test with a 20 min delayed recall (LMT-DR), and the Rey-Osterrieth complex figure test with a 20 min delayed recall (CFT-DR). As in previous studies [18], a composite episodic memory score was also calculated by averaging the AVLT-DR, LMT-DR and CFT-DR scores for all patients.
Data Acquisition
MRI images were acquired in a 3.0 T Siemens Verio scanner with a 12-channel head coil at the Affiliated ZhongDa Hospital of Southeast University. All patients underwent both the high-resolution T1-weighted and resting-state functional MRI scanning. Participants lay supine with the head snugly fixed by a belt and foam pads to minimize head movement. High-resolution T1-weighted axial images covering the whole brain were acquired using a 3D magnetization prepared rapid gradient echo (MPRAGE) sequence as follows: repetition time (TR) = 1900 ms; echo time (TE) = 2.48 ms; flip angle (FA) = 9°; acquired matrix = 256 × 256; field of view (FOV) = 250 mm × 250 mm; thickness = 1.0 mm; gap = 0 mm; and number of slices = 176. Resting-state functional images were obtained for eight minutes with gradient-recalled echo-planar imaging (GRE-EPI) sequence: TR = 2000 ms; TE = 25 ms; FA = 90°; acquisition matrix = 64 × 64; FOV = 240 mm × 240 mm; thickness = 4.0 mm; gap = 0 mm; and number of slices = 36. Prior to the scan, all patients were instructed to keep their eyes closed, stay awake, relax their minds, and move as little as possible during data acquisition.
Data Processing
Structural MRI Processing
The VBM8 (http://dbm.neuro.uni-jena.de/vbm/) toolbox was used to perform the analysis of brain structural imaging. In this process, all images were spatially normalized using combinations of affine linear transform and nonlinear registration to the standard Montreal Neurological Institute (MNI) template and segmented into GM, WM, and cerebrospinal fluid (CSF). Segmented GM images were modulated to compensate for the volumetric effects of expansion or shrinking employed in spatial normalization by multiplying the voxel intensity with the Jacobian determinants reflecting the parameters for fitting a voxel in native space to the corresponding voxel in template space. The modulated images were then smoothed with a 10-mm full width half maximum (FWHM) isotropic Gaussian kernel and resampled to 10 mm isotropic voxels. These procedures created a whole-brain voxel-based GM volume (GMV) map for each individual. Then, we calculated the average GMV of each region in the Automated Anatomical Labelling (AAL) atlas, which includes 90 prior cortical and subcortical regions in total. These 90 regional average GMV values were used as a feature vector to perform the following prediction analyses.
Functional MRI Processing
Data pre-processing. Resting-state functional data were pre-processed using SPM8 (http://www.fil.ion.ucl.ac.uk/spm/) and the DPARSF (http://www.restfmri.net/ forum/dparsf) toolboxes. The first ten functional volumes were discarded to minimize the effects of scanner stabilization and participant adaption. The remaining images were corrected for timing differences and motion effects. The individual structural images were segmented into GM, WM and CSF using a unified segmentation algorithm. Using the transformation parameter estimate during unified segmentation, the motion-corrected functional volumes were spatially normalized to MNI space and resampled to 3 mm isotropic voxels. Further pre-processing included linear detrending and temporal bandpass filtering (0.01–0.1 Hz), which were applied to reduce the effects of low-frequency drift and high-frequency physiological noise. We regressed out several spurious effects of nuisance covariates, including six head motion parameters, mean global signal, WM signal and CSF signal.
Whole-brain resting-state functional connectivity (FC) analyses. To compute resting-state FC, the AAL atlas was applied to parcellate the entire GM into 90 cortical and subcortical regions. For each subject, a regional mean time series was calculated by averaging the time series over all voxels within this region, and a total of 90 regional mean time series were therefore yielded. The resting-state FC between each pair of regions was computed by using the Pearson correlation coefficient. We obtained one symmetric correlation matrix (i.e., 90 × 90) for each aMCI patient. Then, Fisher’s z-transform was applied to improve the normality of the correlation coefficients. Finally, for each patient, we converted the FC matrix into a feature vector with 4,005 values.
Individualized Prediction of Current and Future Episodic Memory Performance by MRI-derived Features
Based on the MRI data at T1 time point, we applied multivariate RVR to predict both the current (i.e., T1 time point) and future (i.e., T2 time point, 3 years later) episodic memory performances (i.e., AVLT-DR and composite episodic memory scores) of unseen aMCI individuals. Particularly, both whole-brain GMV acquired from structural MRI and whole-brain FC acquired from functional MRI were used separately. We were interested in whether the structure and function performed equally well in the prediction of both current and future episodic memory performance.
RVR is performed in a probabilistic Bayesian learning framework and obtains sparse solutions of a multivariate regression model [15]. Under this framework, an explicit zero-mean Gaussian prior was applied to the model weights, and therefore, most weights were set to zero, resulting in only some samples, termed the ‘relevance vector’, being used to train the model. The maximum likelihood estimation was used to find the weights of these samples. The regression coefficients of all features were determined as the weighted sum of the feature vector of all “relevance vector” samples. This algorithm does not have an algorithm-specific free parameter and is computationally more efficient than other algorithms [19]. RVR has been widely used to predict age [20] and behaviours [21]. We used the codes from the PRoNTo toolbox (http://www.mlnl.cs.ucl.ac.uk/pronto/) to implement RVR. All codes about prediction are publicly available on GitHub (https://github.com/ZaixuCui/Pattern_ Regression_Clean).
Prediction Framework
To quantify prediction accuracy, we applied leave-one-out cross-validation (LOOCV) to estimate the out-of-sample generalizability of the models. Specifically, N-1 subjects (where N is the number of aMCI patients) were used as the training set, with the remaining individual used as the testing sample. During the training procedure, each feature was linearly scaled to a range of zero to one across the training set [19, 22], and then an RVR prediction model was constructed using this training set. During the testing procedure, each testing subject feature vector was scaled using the scaling parameter acquired during the training procedure. The training and testing procedures were repeated N times so that each patient was used once as the testing sample. The correlation coefficient r and mean absolute error (MAE) between the predicted and actual scores were used to quantify the prediction accuracy [19]. In the present study, we controlled for age, sex and years of education when calculating the correlation between predicted and actual scores.
Significance of Prediction Performance
Permutation tests were used to determine whether the coefficient r and MAE were significantly better than the results expected by chance. Specifically, the above prediction procedure was re-applied 1,000 times. For each time, we permuted the behavioural scores across the training samples without replacement. The P value of the mean correlation r was calculated by dividing the number of permutations that showed a higher value than the actual value for the real sample by the total number of permutations (i.e., 1,000). Similarly, the P value of the mean MAE was the portion of permutations that showed a lower value than the actual value for the real sample. To assess the specificity of the predictive models for current and future episodic memory performances in aMCI patients, we examined the correlation between the predicted AVLT-DR scores and their actual scores after adjusting for the effects of age, sex, and years of education.
Contributing Features and Corresponding Weights
To quantify the contribution of each feature to prediction, we constructed a new RVR model using all subjects. The absolute value of the RVR weight of each feature quantified its contribution to the model. A larger absolute value of weight indicated a greater contribution of the corresponding feature to the prediction in the context of all other features. For GMV predictions that included 90 features in total, the features were selected for visualization if the absolute value of their weight was in the top 10%. Given the large number of FC features (i.e., 4,005 features), we displayed the features with the highest absolute contribution weight in the top 1%. These thresholds, although arbitrary, eliminated noise components and enabled a better visualization of the most predictive features.
To better interpret our results, we used the standard 7-system template image provided by Yeo et al. [23] that was originally derived from a whole-brain clustering analysis, which yielded 7 large-scale functional networks. To define the priori network modules, network nodes obtained from the AAL atlas were each assigned to one of the 7 large-scale functional modules; subcortical nodes were assigned to an eighth subcortical module. Therefore, the primary modular partition defined by 90-node networks consisted of 8 brain networks: default mode, frontoparietal, ventral attention, dorsal attention, visual, sensorimotor, limbic, and subcortical systems.
Validation
We conducted two additional analyses to validate our results. First, a 10-fold cross-validation was applied to validate our prediction results. Specifically, all patients were divided into 10 subsets, of which nine were used as the training set, and the remaining one was used as the testing set. The training set was scaled and used to train an RVR prediction model, which was then used to predict the scores for the scaled testing data. The scaling of testing data used parameters acquired from training data. This procedure was repeated 10 times so that each subset was used as the testing set once. Finally, the correlation r and MAE between the actual and predicted scores were calculated across all patients. Since the full dataset was randomly divided into 10 subsets, performance might have depended on data division. Therefore, the 10-fold cross-validation was repeated 100 times, and the results were averaged to produce the final prediction performance. A permutation test was applied 1,000 times to test the significance of the prediction performance. Second, we further put T1 GMV and FC features into the RVR separately to predict the T1 and T2 composite episodic memory scores. Putatively, the predictive model would capture specific GMV and FC features for individualized predictions of current and future episodic performances in aMCI patients if the resultant correlation remained significant.