An ensemble learning framework based on comprehensive gray matter features for identification of mild cognitive impairment in leukoaraiosis

doi:10.21203/rs.3.rs-2234761/v1

Download PDF

Research Article

An ensemble learning framework based on comprehensive gray matter features for identification of mild cognitive impairment in leukoaraiosis

https://doi.org/10.21203/rs.3.rs-2234761/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

White matter hyperintensities (WMH), also known as leukoaraiosis (LA), is strongly associated with cognitive impairment and lead to an increased risk of dementia. The purpose of this study is to develop a model to effectively and objectively identify WMH patients with cognitive impairment (WMH-MCI). Firstly, the comprehensive multiple cortical morphological measurements were extracted from magnetic resonance imaging (MRI) to enrich the disease characterization information. Then, based on the general eXtreme Gradient Boosting classifier (XGBoost), we designed a data-level fusion resampling method (Fusion + XGBoost) and an algorithm-level focal loss improved XGBoost model (FL-XGBoost), respectively, to solve the imbalance learning problem of classifying WMH-MCI (minority class of 27 samples) and the WMH population without cognitive impairment (WMH-nCI, majority class of 70 samples). Moreover, an ensemble framework based on weighted soft-voting was developed to combine the two models to further improve the overall classification performance and stability of the model. Compared with the baseline XGBoost model trained on the original imbalance dataset (Bacc: 78.20%), both the Fusion + XGBoost model (Bacc: 80.53%) and the FL-XGBoost model (Bacc: 81.25%) could improve the identification accuracy of WMH-MCI, the improvements were 2.33% and 3.05%, respectively. The overall model accuracy with weighted ensemble learning achieved 84.80%, with high sensitivity (85.50%) and specificity (84.14%) at the same time, which was better than that of the single model and significantly improved than the baseline model. The developed model could accurately identify the cognitive impairment in the WMH population, which could assist early clinical diagnosis and timely decision-making.

White matter hyperintensities

Surface-based morphometry

XGBoost algorithm

Imbalance Learning

Ensemble learning

The White matter hyperintensities (WMH), also known as leukoaraiosis (LA), refer to changes in magnetic resonance imaging (MRI) abnormalities caused by degenerative changes in the white matter, which usually manifest as highlights on the T2 sequence or FLAIR sequence. WMH is the most common and earliest brain tissue change in chronic small vessel ischaemic disease (CSVD), and it increases with age[1]. The prevalence of WMH is up to 50–90% in the community elderly population of older adults [2]. WMH is associated with the decline of cognitive function and executive function. The increased WMH load confers a higher risk of vascular cognitive impairment [3–6]. However, at present, the early diagnosis of WMH cognitive impairment mainly relies on the evaluation of neuropsychological scales, which has some problems, such as strong subjectivity, inconsistent diagnostic criteria, and a complicated diagnostic process. And the lack of specific diagnostic basis for vascular related cognitive dysfunction, which would lead to clinical missed diagnosis and misdiagnosis, thereby increasing the risk of CSVD and even Alzheimer's disease (AD) [7, 8]. Therefore, an objective and reliable detection method is important for early identification of WMH-related cognitive impairment, thereby assisting early clinical intervention and treatment.

Traditional shallow statistical analyses frequently failed to capture the heterogeneity behind psychiatric phenotypes, particularly in experiments with small sample sizes. To overcome this limitation, machine learning (ML) algorithms have been widely applied to MRI image analysis of various neurological diseases [9, 10], which provided a promised tool for both analyzing these variables and observing inherent disease-related patterns. For example, using the ML algorithm called support vector machine (SVM) to extract abnormal brain structure information from whole brain MRI data can achieve excellent classification accuracy of AD, which can aid in early AD diagnosis [11, 12]. One study performed machine learning analyses based on altered diffusion tensor imaging (DTI) metrics between groups [13]. They applied random forest to generate models and achieved an 80.5% accuracy in diagnosing WMH-MCI from WMH populations. However, the diagnostic ability of gray matter atrophy for cognitive impairment in WMH remains unknown.

Improving classification accuracy is the goal of machine learning algorithms in the training process. Therefore, in imbalanced-learning, the classifier would easily tend to the majority class, which leads to misclassification and model untrustworthiness. To overcome the problem of unbalanced classification, the existing methods mainly include the sample resampling method based on data-level and a cost-sensitive learning method based on algorithm-level. The data-level method tries to reduce the level of imbalance by under-sampling majority samples or over-sampling augmentation minority samples[14, 15]. However, over-sampling may increase the probability of over-fitting, while under-sampling may cause poor fitting effects, especially for the small sample data. At the algorithm level, applying the idea of cost-sensitive learning, the misclassification cost loss of different categories could be integrated into the objective function of algorithm training by a weighting strategy, so that the classification algorithm itself would have a certain data tendency[16, 17]. Prior studies that have implemented weighted cross-entropy and focal loss functions on XGBoost and demonstrated the competitive performance of this method based on five imbalanced datasets[17]. For the clinical research, the patient data usually tend to be smaller than healthy control data. Therefore, how to effectively solve the small sample imbalance is the key technology in the process of model construction.

This study aimed to develop an objective and effective classification framework to automatically distinguish patients with cognitive impairment in WMH populations. To provide rich disease representation information for the model, we extracted various scale features that could characterize structural information of gray matter and fused them, which includes macroscopic gray matter volume and those fine-grained morphological measurements based on the cortical surface. Then, to overcome the problem of unbalanced classification of small samples in this study, we respectively discussed the sample resampling method based on data-level and the cost-sensitive learning method based on algorithm-level. Moreover, we implemented an ensemble learning strategy based on a performance weighted voting mechanism to combine the above two methods. In this way, the advantages of data-level resampling method and algorithm-level cost-sensitive learning method can be well complementary, so that to obtain a WMH-MCI diagnostic model with higher classification performance and better stability, which could provide a basis for clinical diagnosis.

2.1 Participants

122 patients with WMH were recruited from the collaborating hospital. The study was approved by the Ethics Committee and each participant gave written informed consent. The clinical diagnosis of WMH was assessed by two experienced radiologists who visually examined T2-weighted FLAIR images and scored WMH according to the Fazekas scale (range 0–6). The intraclass correlation coefficient of observer assessment was greater than 0.9. A standard clinical baseline assessment included demographic data, vascular risk factors (VRF), assessment of the mini mental state examination (MMSE) scales, and multi-modal MRI scans. The image acquisition parameters were given in the Supplementary S1. The inclusion criteria were as follows: patients with WMH imaging manifestations on MRI scan, aged 55–80 years. Exclusion criteria included lacunar infarction, hydrocephalus, brain tumor or space occupation, leukodystrophy caused by other reasons such as multiple sclerosis and so on, and no neurological or psychiatric disorders. Finally, a total of 97 subjects with WMH met the criteria and were enrolled in this study. According to the clinical diagnosis and cognitive ability assessment (MMSE༞26 or ≤ 26), we divided the WMH patients into two groups: (1) WMH-MCI group; (2) age and gender matched WMH patients without MCI (WMH-nCI) control group. The demographics and neuropsychological characteristics were showed in the Supplementary Table S1.

2.2 Comprehensive Gray matter features

The Computational Anatomy Toolbox (CAT 12.7-r1727, http://dbm.neuro.uni-jena.de/cat/ ) based on the statistical parametric mapping software (SPM 12, https://www.fil.ion.ucl.ac.uk/spm/) was used to preprocess MRI data, and obtained gray matter volumetric assessments and cortical surface reconstruction. The standardized MRI pre-processing pipeline was given in the Supplementary Table S2. Then, based on the fine-grained functional brain subregions defined by the Brainnetome atlas [18], the mean GM volume (GMV) values of 246 fine brain subregion regions were extracted from modulated GM maps. The reconstructed cortical surface was matched and aligned with the highly accurate, high-resolution Human Connectome Project Multimodal Parcellation (HCP-MMP) atlas [19], which defined 180 vertices for each hemisphere. Six vertex-based morphometric measurements that reflected the associated gray matter microstructural alterations of the cortical surface, including cortical thickness (CT), surface area (SA), cortical volume (CV), sulcal depth (SD), fractal dimension (FD) and gyrification index (GI), were extracted for each vertex. Finally, the above gray matter structural features at different scales were fused in series to form a feature matrix. And the ANOVA F-test and widely used L1 (Lasso- Least absolute shrinkage and selection operator) regularization [20] were conducted for feature selection to exclude redundant variables.

2.3 Imbalance learning

In this study, the ratio of WMH-MCI patients to WMH-nCI patients was close to 1:2.6, showing a sample imbalance. Therefore, we addressed this problem in two ways: 1) at the data-level, data resampling methods were used, and 2) at the algorithm-level, cost-sensitive learning based on focal loss function improved XGBoost (FL-XGBoost) was applied.

2.3.1 Data-level: Data Resampling

Three data resampling methods were investigated to resolve data imbalance: (1) the Borderline Smote algorithm, (2) the Cluster Centroids algorithm, and (3) the SMOTEENN algorithm. On this basis, the balanced data obtained by the three mentioned approaches were subsequently fused (referred to as the Fusion resampling method), so that the effective data augmentation for the small sample dataset was also achieved simultaneously. The implementation details were as follows.

The minority sample group (WMH-MCI) was denoted as${T_{\hbox{min} }}$, the majority sample group (WMH-nCI) was${T_{maj}}$. The original training sample size of two groups was$M,N$ respectively.

1. Borderline Smote over-sampling. The Borderline Smote algorithm focuses on those “Danger” minority samples (${x_i}$), which more than half of the K-nearest neighbor samples were majority samples, and then generates new samples with reasonable distribution by the nearest neighbor linear interpolation method (${x_{new}}={x_i}+\delta \bullet ({\tilde {x}_i} - {x_i}),\delta \in (0,1)$) [21]. It considered the distribution characteristics of original samples, and thus outperformed the traditional SMOTE algorithm. The balanced dataset was denoted as${T_1}=\{ {T^{\prime}_{\hbox{min} }}(N),{T_{maj}}(N)\}$.

2. Cluster Centroids down-sampling. The Cluster Centroids algorithm obtains M (the number of minority class samples) cluster centers of majority class samples by the k-means algorithm, and then the nearest neighbors of cluster centers are as new majority class samples to replace the original majority class samples [22]. The balanced dataset was denoted as${T_2}=\{ {T_{\hbox{min} }}(M),{T^{\prime}_{maj}}(M)\}$.

3. SMOTEENN hybrid-sampling. The SMOTEENN algorithm is a combination of the SMOTE and Edited Nearest Neighbor (ENN) algorithms [23]. The ENN algorithm searches for K nearest neighbor samples (K is generally 3) of majority samples, and eliminates samples where there were two or more samples inconsistent with its class. The ENN algorithm aims to clean the synthetic data samples that might be mislabeled by SMOTE, and thus decrease the false prediction rate. The balanced dataset was denoted as${T_3}=\{ {T^{\prime}_{\hbox{min} }}(P),{T^{\prime}_{maj}}(Q)\}$.

4. Fusion. The Borderline Smote and SMOTEENN algorithms could synthesize new minority class samples, while the Cluster Centroids algorithm could produce new majority class samples. This study fused the above balanced data obtained by three different data sampling methods, which not only solved the problem of sample imbalance but also achieved data augmentation to the small sample size. The fused balanced dataset was represented as the follow:

$\begin{gathered} {T_{new}}=\{ {T_1},{T_2},{T_3}\} \hfill \\ {\text{ }}=\{ {{T^{\prime}}_{\hbox{min} }}(N),{T_{maj}}(N),{T_{\hbox{min} }}(M),{{T^{\prime}}_{maj}}(M),{{T^{\prime}}_{\hbox{min} }}(P),{{T^{\prime}}_{maj}}(Q)\} \hfill \\ {\text{ }}=\{ {T_{{{\hbox{min} }_{new}}}}(N+M+P),{T_{ma{j_{new}}}}(N+M+Q)\} \hfill \\ \end{gathered}$

2.3.2 Algorithm-Level: FL-XGBoost

The XGBoost classifier has the advantage of a custom loss function. Generally, this model employs the standard binary cross-entropy as the loss function, which is a probability-based measure. However, this loss function usually assigns the same weight to all samples, and does not consider the cost of data imbalance and misclassification. The focal loss introduces adjustment parameters $\gamma$ on the basis of a bivariate cross-entropy loss function, which makes the model pay more attention to the samples that are difficult to classify and appropriately reduce the weight of the samples that are easy to classify in the iterative process [16]. The definition of focal loss is as follows:

$${L_f}= - \frac{1}{n}\sum\limits_{{i=1}}^{n} {(\alpha {y_i}{{(1 - {p_i})}^\gamma }\log ({p_i})+(1 - \alpha )(1 - {y_i}){p_i}^{\gamma }\log (1 - {p_i}))}$$

The $\alpha$ is the balanced coefficient to slightly balance the weight of negative and positive samples. Furthermore, as ${p_i}$is close to 1, indicating that the sample is easy to classify correctly. When the factor $\gamma$ ($\gamma$> 0) is applied, the loss would decrease. Thus, the main role of focal loss is equivalent to increase the weight of hard-separated samples in the loss function, making the loss function tend to hard-separated samples. We embedded the focal loss into XGBoost to get an improved XGBoost (FL-XGBoost) to improve the accuracy of imbalanced small sample learning.

2.4 Model development strategy

The model development flowchart was shown in Fig. 1. We first trained the general XGBoost classifier with the original imbalanced dataset to obtain a baseline result. Then, the data-level fusion resampling method and the algorithm-level FL-XGBoost were applied respectively to conduct the data balancing and classification model building. To further improve the classification accuracy and develop WMH-MCI diagnostic models, we adopted weighted soft-voting techniques to ensemble the above base classifier models. This ensemble strategy could adaptively assign weights to each model according to their prediction performance, which was a more reasonable method to set the ensemble output voting power value and thus could obtain better results than each single model.

For each model, hyper-parameter tuning was performed using grid search with internal 10-fold cross-validation of the training dataset. The F1 score, a comprehensive classification index of the model, was used as the selection basis for hyper-parameters to avoid model bias caused by unbalanced data. To be fair, the 10-fold cross validation repeated 10 times with stratified sampling was performed for each model. We considered WMH-MCI samples to be true positives (TP), and WMH-nCI samples to be true negatives (TN). The false positives (FP) was that WMH-nCI incorrectly predicting as WMH-MCI, while false negatives (FN) was that WMH-MCI incorrectly predicting as WMH-nCI. The mean balance accuracy (Bacc), sensitivity (Sens), specificity (Spec), G-mean index and F1 score of all folds were calculated to quantitatively evaluate the classification performance of the model.

$$Bacc=\frac{1}{2}(\frac{{TP}}{{TP+FP}}+\frac{{TN}}{{TN+FN}})$$

$$Sens=\frac{{TP}}{{TP+FN}}$$

$$Spec=\frac{{TN}}{{TN+FP}}$$

$$G - mean=\sqrt {Sens*Spec}$$

$${F_1}{\text{ }}score=\frac{{2TP}}{{2TP+FP+FN}}$$

Table 1 and Fig. 2 showed the classification results of three data resampling methods (the Borderline SMOTE, Cluster Centroids, and SMOTEENN algorithms) and the proposed Fusion method. To verify the universality of the proposed method, the above procedures were also performed by applying other four common classifiers, including logistic regression (LR), support vector classifier (SVC), multilayer perceptron (MLP) and AdaBoost. By the Fusion data resampling method, the classification accuracy of the LR, SVC, MLP, AdaBoost, and general XGBoost classifier was 80.57%, 80.06%, 80.31%, 79.58%, and 80.53%, respectively, which improved by 4.36%, 8.06%, 2.33%, 8.52%, and 2.33% than classification without data imbalance.

Table 1

The classification results based on data-level resampling methods
Classifier	Resampling Method	Bacc (%)	Sens (%)	Spec (%)	G-mean (%)	F1 (%)
LR	Original	76.21	60.00	92.43	70.01	63.43
	Borderline SMOTE	78.69	68.67	88.71	74.89	67.55
	Cluster Centroids	78.14	73.00	83.29	75.23	66.56
	SMOTEENN	74.00	85.00	63.00	70.86	61.46
	Fusion	80.27	72.00	88.14	78.00	70.89
	p value·	0.03^*	0.003^*	0.044^*	0.008^*	0.02^*
SVC	Original	72.00	51.00	93.00	61.68	55.34
	Borderline SMOTE	77.53	66.50	88.57	73.80	65.92
	Cluster Centroids	78.73	74.17	83.28	76.60	67.85
	SMOTEENN	74.87	83.17	66.57	71.85	62.54
	Fusion	80.06	71.83	88.29	77.21	69.64
	p value	< 0.001^**	< 0.001^**	0.003^*	< 0.001^**	< 0.001^**
MLP	Original	77.98	68.00	89.57	74.81	66.38
	Borderline SMOTE	78.78	76.33	83.43	76.15	67.71
	Cluster Centroids	79.88	81.83	74.14	76.78	68.66
	SMOTEENN	75.30	85.17	65.43	72.55	62.64
	Fusion	80.31	72.33	88.28	77.03	70.08
	p value	0.241	0.269	0.454	0.445	0.189
AdaBoost	Original	71.06	45.83	96.28	58.41	45.83
	Borderline SMOTE	77.50	69.00	86.00	73.93	65.64
	Cluster Centroids	76.68	71.50	81.86	73.94	64.58
	SMOTEENN	56.30	94.17	18.43	46.45	31.59
	Fusion	79.58	76.17	83.00	77.51	68.84
	p value	< 0.001^**	< 0.001^**	< 0.001^**	< 0.001^**	< 0.001^**
XGBoost	Original	78.20	66.83	89.57	73.23	66.40
	Borderline SMOTE	79.55	70.67	88.43	75.97	68.65
	Cluster Centroids	80.29	78.17	82.43	78.47	69.96
	SMOTEENN	74.93	84.00	65.86	72.27	62.48
	Fusion	80.53	73.17	88.00	77.82	70.33
	p value	0.263	0.118	0.374	0.137	0.238
Note: The results were the average of 10 repeated 10-fold cross-validation experiments. The p value was calculated by the t-test to statistically compare the performances of the model based on the fusion resampling method and the model based on the original dataset, and it was considered significant when p < 0.05.

The FL-XGBoost model was conducted and compared with the weight cross-entropy loss (WL-XGBoost), as shown in Table 2. It could be observed that the results of the two improved XGBoost algorithms based on embedded cost-sensitive functions were better than those of the general XGBoost algorithm. The Bacc value was improved by 1.82% (WL-XGBoost) and 3.05% (FL-XGBoost). The result of the FL-XGBoost was slightly higher than that of WL-XGBoost.

Table 2

The classification results based on algorithm -level cost-sensitive learning
Classifier	Bacc (%)	Sens (%)	Spec (%)	G-mean (%)	F1 (%)
XGBoost	78.20	66.83	89.57	73.23	66.40
WL-XGBoost	80.02	81.33	78.71	78.86	68.86
p value	0.368	< 0.001	< 0.001	0.043	0.418
FL-XGBoost	81.25	84.50	78.00	80.37	70.73
p value	0.135	< 0.001	< 0.001	0.01	0.156

By further implementing the ensemble learning strategy based on weighted soft-voting, the XGBoost model trained by fusion resampling data and the FL-XGBoost model were combined. As shown in Table 3, the model constructed by the ensemble algorithm achieved higher classification performance (Bacc: 84.80%; G-mean: 84.00%; F1 score: 76.85%) than the single imbalanced data resampling method and the method based on cost-sensitive learning. And compared with baseline results based on raw imbalanced data, the classification accuracy, G-mean index, and F₁ score of the Voting ensemble model showed statistically significant improvement (p ≤ 0.001), as shown in Fig. 3.

Table 3

The classification results based on the ensemble learning
Model	Bacc (%)	Sens (%)	Spec (%)	G-mean (%)	F1 (%)
XGBoost	78.20	66.83	89.57	73.23	66.40
Fusion + XGBoost	80.53	73.17	88.00	77.82	70.33
FL-XGBoost	81.25	84.50	78.00	80.37	70.73
Voting Ensemble	84.82	85.50	84.14	84.00	76.85

The class imbalance of the dataset may lead to model bias, which would not be ignored in the classification task. As shown in Table 1, when the model was trained with the original imbalanced data, the positive samples (WMH-MCI) were smaller than the negative samples (WMH-nCI), which made each model almost with high specificity but poor sensitivity. The prediction of the model was biased towards numerous WMH-nCI classes. For data-level resampling methods, the Borderline SMOTE over-sampling method could increase the minor positive sample data, thus improving the recognition ability of the model for WMH-MCI (Sens: 70.67%), but not significantly. The Cluster Centroids, an under-sampling method, could achieve a relative balance of the sensitivity (78.17%) and specificity (82.43%) of the model. But, this method took the cost of removing majority class samples, which might lead to the loss of sample spatial information. The model based on the SMOTEENN mixed sampling method showed a higher sensitivity (84.00%), but with a severe decrease in specificity (65.86%). This might be related to that SMOTEEN mixed sampling method first oversamples minority class and then removed majority categories, which would lead to the reduction of majority class. Therefore, to better solve the problem of small sample category imbalance, we fused the balanced data obtained by these three resampling methods. By this way, the final fused balance dataset would combine the advantages of three resampling approaches, as well as the sample space could be further augmented effectively. The experimental results showed that by the fusion method, the overall classification accuracy of the model could have a further improvement (Bacc: 80.53%; Sens: 73.17%; Spec: 88.00%). Although there was no statistically significant improvement in classification performance compared with imbalanced data or single data resampling methods, the fusion based method performed the best among each classifier model, which indicated that the fusion method was effective and feasible, as well as good universality.

At the algorithm level, cost-sensitive learning is introduced to modify the sample weights in the model to improve the accuracy of minority classes. This study proved that the embedded cost-sensitive XGBoost model could well solve the imbalance learning. Both the WL-XGBoost and FL-XGBoost generated a higher Bacc rate, G-mean and F1 score than those of the general XGBoost algorithm (as shown in the Table 3). The improved detection sensitivity of WMH-MCI indicated that the improved model paid more attention to the difficult few WMH-MCI samples. Moreover, the FL-XGBoost with the modulation coefficient could not only balanced the training weights of positive and negative samples, but also increased the misclassification cost of some hard-separated samples. Thus, the overall prediction ability enhancement of FL-XGBoost was superior to simple sample weighted WL-XGBoost.

Overall, the comparison of the experimental results showed that the classification performance of the FL-XGBoost model was better than that of the data-level Fusion + XGBoost. The former had the advantage of making the model pay more attention to a few positive and difficult samples without changing the original data distribution, and therefore, the sensitivity of the detection of WMH-MCI had been significantly improved (Sens: 84.50%). However, this method also reduced the recognition ability for the majority class (Spec: 78.00%). The model based on data fusion resampling (Fusion + XGBoost) could slightly improve the recognition accuracy of minority samples (Sens: 73.17%), while still retaining a high specificity for majority samples (Spec: 88.00%). It could be argued that these two models have different properties for unbalancing learning. Therefore, the ensemble learning strategy based on the performance-weighted soft-voting mechanism was further executed to fuse the two weak classifiers. The ensemble method effectively integrated the advantages of data-level model and algorithm-level model, and further eliminated the probability bias generated by single model prediction. By this method, the prediction ability of the overall model for positive and negative samples could get relatively balance (Sens: 85.50%; Spec: 84.14%), and the performance improvement in classification of both WMH-MCI and WMH-nCI was more significant (Bacc: 84.80%; G-mean: 84.00%; F1 score: 76.85%). Therefore, we believed that the strategy proposed in this study had potential application to solve the imbalanced classification problem of small samples.

Taken together, an ensemble learning framework based on comprehensive gray matter features was developed to identify MCI patients in WMH populations. The comprehensive investigation of macroscopic GMV and fine-grained morphological measurements for the cortical surface provided enrich disease characterization information for the model. In addition, the proposed ensemble framework to combine the data-level fusion resampling method and algorithm-level FL-XGBoost for the first time could well solve the sample imbalance problem in the classification and improved the sensitivity of the model for MCI detection. Our approach would provide a useful reference for solving similar imbalanced small-sample learning problems. However, it would be more efficient and feasible to design an integrated framework that could adaptively select imbalance learning methods according to the characteristics of the sample dataset.

Ethical Approval

This study has been reviewed approval and by the Ethics Committee of Shanghai Fifth People’s Hospital, Fudan University, and informed consent was obtained from all patients.

Competing interests

No conflict of interest exits in the submission of this manuscript, and manuscript is approved by all authors for publication. I would like to declare on behalf of my co-authors that the work described was original research that has not been published previously, and not under consideration for publication elsewhere, in whole or in part. All the authors listed have approved the manuscript that is enclosed.

Authors' contributions

Yifeng Yang: Conceptualization, Methodology, Software, Writing-Original draft preparation. Ying Hu: Data curation, Validation, Software. Yang Chen: Formal analysis, Investigation. Weidong Gu: Supervision, Writing-checking. Shengdong Nie: Supervision, Writing- Reviewing and Editing. All authors reviewed the manuscript.

Funding

This study has received funding by grant from the National Natural Science Foundation of China (Grant No. 82271286), the Key Program of National Natural Science Foundation of China (Grant No.81830052); the Science and Technology Innovation Action Plan of Shanghai (Grant No.18441900500) and the Natural Science Foundation of Shanghai (Grant 20ZR1438300).

Wardlaw JM, Smith C, Dichgans M. Mechanisms of sporadic cerebral small vessel disease: insights from neuroimaging (vol 12, pg 483, 2013). Lancet Neurol 2013;12(6):532–532.
Moran C, Phan TG, Srikanth VK. Cerebral small vessel disease: a review of clinical, radiological, and histopathological phenotypes. Int J Stroke 2012;7(1):36–46.
Debette S, Markus HS. The clinical importance of white matter hyperintensities on brain magnetic resonance imaging: systematic review and meta-analysis. Brit Med J 2010;341.
Mortamais M, Artero S, Ritchie K. Cerebral white matter hyperintensities in the prediction of cognitive decline and incident dementia. Int Rev Psychiatr 2013;25(6):686–698.
Lee HK, Lee YM, Park JM, Lee BD, Moon ES, Chung YI. Amnestic multiple cognitive domains impairment and periventricular white matter hyperintensities are independently predictive factors progression to dementia in mild cognitive impairment. Int J Geriatr Psych 2014;29(5):526–532.
Kynast J, Lampe L, Luck T, et al. White matter hyperintensities associated with small vessel disease impair social cognition beside attention and memory. J Cerebr Blood F Met 2018;38(6):996–1009.
Bos D, Wolters FJ, Darweesh SKL, et al. Cerebral small vessel disease and the risk of dementia: A systematic review and meta-analysis of population-based evidence. Alzheimers Dement 2018;14(11):1482–1492.
Lee S, Viqar F, Zimmerman ME, et al. White Matter Hyperintensities Are a Core Feature of Alzheimer's Disease: Evidence from the Dominantly Inherited Alzheimer Network. Ann Neurol 2016;79(6):929–939.
Iniesta R, Stahl D, McGuffin P. Machine learning, statistical learning and the future of biological research in psychiatry. Psychol Med 2016;46(12):2455–2465.
Lemm S, Blankertz B, Dickhaus T, Muller KR. Introduction to machine learning for brain imaging. Neuroimage 2011;56(2):387–399.
Magnin B, Mesrob L, Kinkingnehun S, et al. Support vector machine-based classification of Alzheimer's disease from whole-brain anatomical MRI. Neuroradiology 2009;51(2):73–83.
Morra JH, Tu ZW, Apostolova LG, Green AE, Toga AW, Thompson PM. Comparison of AdaBoost and Support Vector Machines for Detecting Alzheimer's Disease Through Automated Hippocampal Segmentation. Ieee T Med Imaging 2010;29(1):30–43.
Chen HF, Huang LL, Li HY, et al. Microstructural disruption of the right inferior fronto-occipital and inferior longitudinal fasciculus contributes to WMH-related cognitive impairment. Cns Neurosci Ther 2020;26(5):576–588.
Buda M, Maki A, Mazurowski MA. A systematic study of the class imbalance problem in convolutional neural networks. Neural Networks 2018;106:249–259.
Lam LHT, Do DT, Diep DTN, et al. Molecular subtype classification of low-grade gliomas using magnetic resonance imaging-based radiomics and machine learning. Nmr Biomed 2022;35(11).
Lin TY, Goyal P, Girshick R, He KM, Dollar P. Focal Loss for Dense Object Detection. Ieee T Pattern Anal 2020;42(2):318–327.
Wang C, Deng CY, Wang SZ. Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost. Pattern Recogn Lett 2020;136:190–197.
Fan LZ, Li H, Zhuo JJ, et al. The Human Brainnetome Atlas: A New Brain Atlas Based on Connectional Architecture. Cereb Cortex 2016;26(8):3508–3526.
Glasser MF, Coalson TS, Robinson EC, et al. A multi-modal parcellation of human cerebral cortex. Nature 2016;536(7615):171.
Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 2008;9(3):432–441.
Han H, Wang WY, Mao BH. Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. Lect Notes Comput Sc 2005;3644:878–887.
Lin WC, Tsai CF, Hu YH, Jhang JS. Clustering-based undersampling in class-imbalanced data. Inform Sciences 2017;409:17–26.
Batista GE, Prati RC, Monard MCJASen. A study of the behavior of several methods for balancing machine learning training data. 2004;6(1):20–29.

No competing interests reported.

SupplementaryMaterial.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

An ensemble learning framework based on comprehensive gray matter features for identification of mild cognitive impairment in leukoaraiosis

Status:

Version 1

Abstract

Figures

1. Introduction

2. Materials And Methods

2.1 Participants

2.2 Comprehensive Gray matter features

2.3 Imbalance learning

2.3.1 Data-level: Data Resampling

2.3.2 Algorithm-Level: FL-XGBoost

2.4 Model development strategy

3. Results

4. Discussion

5. Conclusion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1