Brain volume has clinical implications beyond the individual characteristics of ethnicity or sex. Brain volume is closely related to aging and can reflect cognitive function or brain activity(Kiraly et al. 2016; Qing and Gong 2016). In dementia risk analysis, brain volume has been used as an indicator of cognitive reserve (van Loenhoud et al. 2018). Due to the correlation of lower brain volume with cognitive decline (Vibha et al. 2018), brain atrophy enables suspicion of dementia. Furthermore, it is hypothesized that a decrease in brain volume indicates neuropathological progression in AD (Reiter et al. 2017). Accurate longitudinal measurements of brain volume can reveal the status of a degenerating brain. In this study, we measured regional brain volume using an automated program that we developed for quick and accurate measurements and to effectively discriminate ADD and MCI from NC using medial temporal volume normalization with cerebellar volume.
It has been demonstrated that AD can be classified into four atrophy subtypes of medial temporal dominant, parieto-occipital dominant, diffuse cortical, and mild atrophy (Ten Kate et al. 2018). The hippocampal sparing variant in AD, which displays parieto-occipital dominant or mild atrophy, has been suspected to have limitation in assessment of hippocampal atrophy (Murray et al. 2011). However, hippocampal sparing AD also demonstrates medial temporal atrophy including that of the hippocampus, though its volume loss can be less than those of other subtypes (Ten Kate et al. 2018). In the present study, medial temporal volume and SAVR showed high accuracy to discriminate ADD from NC. However ,they exhibited lower performance to distinguish between MCI and NC because hippocampal sparing variants are more frequent in MCI than in AD (Ten Kate et al. 2018). However, measurement of medial temporal volume decline increased the accuracy of diagnosis of MCI patients in our study.
Medial temporal lobe atrophy and hippocampal atrophy have been reported as diagnostic markers specific to ADD (Hodson 2018). The results of this study using QBraVo correspond well with those observations. The medial temporal V:TIV had the highest accuracy for diagnosing ADD, with a sensitivity of 64.4% and specificity of 83.9%. It can be hypothesized that QBraVo can not only measure accurate brain volume, but also precisely conduct regional discrimination. The previous automated programs had low accuracy of measurement in the hippocampus (Guenette et al. 2018). Measurement of medial temporal volume was effectively substituted for hippocampal volume in the current study.
In this study, medial temporal lobe was the most degenerative region relative to total brain atrophy in ADD and MCI. On the contrary, the least degenerative region in ADD and MCI was the cerebellum, which showed AD pathology in the final stages in an earlier autopsy study (Braak and Braak 1991). Progression of deposition of β-amyloid occurs from the neocortex and hippocampus in the early phase to the cerebellum in the late phase. Despite this, the cerebellum might be involved in cognition and presented reduced activation on functional imaging (Jacobs et al. 2018). A previous study has reported cerebellar GM volume to be lower in MCI than in NC (Möller et al. 2013). Further evaluations of cerebellar volume change are needed to prove the role of the cerebellum in AD.
In the validation of QBraVo, it took about 5 to 6 minutes to obtain results after input of preprocessed MR images. The runtimes of QBraVo were considerably shorter than those of the other methods. Manual volumetry commonly takes several days with a skilled analyst. FreeSurfer requires approximately 20 to 40 hours to process both hemispheres. About 8 hours is required for volume measurements using Inbrain (MIDAS Information Technology Corporation, Seongnam, Republic of Korea), which has been recently commercialized. We demonstrated that QBraVo is faster than previously used methods for volume measurements. Furthermore, QBraVo showed excellent reproducibility and relatively high accuracy that were confirmed by a significant correlation with the results of manual volumetry in the present study. This suggests QBraVo as an easy and rapid tool to measure brain volume compared to the traditional volumetric methods.
Compared with manual volumetry, the ICC was higher for TBV than for TCV or TIV. This is because the analysis of QBraVo is based on T1W images, which are limited in differentiating surface CSF from skull bone. This is similar to the results of an SPM validation in an earlier study (Heinen et al. 2016). Overestimation by SPM is caused by probabilistic segmentation that can include tissues outside the subarachnoid space in CSF (Nordenskjöld et al. 2013). Because the regional mask was created with strict limitations on the sub-arachnoid space, QBraVo showed improved reliability not only in TBV, but also in TCV compared to SPM and FreeSurfer. Although volume measurements in regions including surface CSF have relatively lower accuracy, QBraVo showed excellent ICCs in TCV and TIV.
In the present study, although subjects with ADD or MCI were older than the NCs, other clinical characteristics did not differ among them. Though old age can affect brain volume decrements in ADD and MCI, the effects of aging were not statistically significant by multiple regression analysis.
In the analysis of regional brain volume, discrimination performance by V:TIV was generally better than that by raw regional volume, which implies that brain volume normalized by TIV could adjust for head size variation among subjects. However, raw regional volume showed better performance than V:TIV in some regions for distinguishing between ADD and MCI, possibly because brain volume in MCI was larger than in ADD.
The V:TBV or V:TCV did not show higher performance than V:TIV for discriminating among the groups. However, performances of V:TBV/V:TCV in more or less degenerated regions were similar to that of V:TIV. In MCI and ADD, the more degenerated regions were medial temporal, anterior temporal lobe, and ventricle, and the less degenerated regions were posterior medial frontal lobe and cerebellum.
Ventricular volume and ratios also showed good performance to distinguish ADD and MCI. It has been documented that WM changes are associated with ventricular enlargement in AD (Coutu et al. 2016). Researchers have theorized that subcortical WM changes occur relatively quickly in AD. However, further investigations are needed to verify this hypothesis.
The SAVR of the most degenerated regions showed larger AUC and higher diagnostic accuracy than did V:TIV for differentiating among the groups. In comparison of ROC curves, medial temporal SAVR was inferior to MMSE for ADD vs. NC but was not inferior to MMSE for MCI vs. NC. The difference of performance between orbital frontal SAVR and MMSE was not significant for ADD vs. MCI. Accordingly, it was posited that SAVR can clarify the difference of regional atrophy between groups. Especially, SAVR was effective for discrimination of ADD and MCI from NC, which suggests that normalization of regional brain volume by cerebellar volume is more sensitive than normalization by TIV in AD spectrum neurodegenerative cognitive disorders. In an earlier study, hippocampal volume to neocortical volume ratio was a predictor of cognition and subtype in AD (Risacher et al. 2017). Volume comparisons with the cerebellum have not been used previously. The combined model using MMSE and medial temporal SAVR had higher performance for discriminating MCI from NC than did MMSE alone, which suggests that SAVR is an advantageous method to assess AD in the early phase.
Quantitative information about brain volume can improve diagnostic capabilities in a clinic. However, AD diagnosis using volumetric data alone shows decreased accuracy. Volumetric data should be added to visual assessment by radiologists, clinical information, and a neurocognitive evaluation for better results.
Although the brain volume measurements using the automated analytic program were precise in previous validations, those results could have been affected by the imaging analytic program and the MR sequence parameters (Haller et al. 2016). To overcome these biases, strict standardization should be applied to all analysis processes. Analysis in specific regions, which can be conducted with insufficient precision, are necessary to be corrected in a regular range.
The limitations of our study are as follows. First, the number of enrolled subjects was relatively small to obtain high statistical power; therefore, the results of this study should be carefully interpreted. Second, the enrolled patients with ADD could be heterogeneous because their diagnoses were clinical without pathologic confirmation. though some subjects with ADD or MCI were confirmed with F18-florbetaben PET. Third, while subjects with a brain lesion were excluded, vascular damage was not evaluated in this study. Vascular lesions can affect not only brain volume, but also signal intensity on MR. Finally, we did not measure volume of GM, WM, or hippocampus, which can provide more specific information for AD. Further studies on these topics are needed.