Harmonization of dopamine transporter SPECT imaging improves segregation between patients with Parkinson’s disease and healthy elderlies in multicentre cohort studies

doi:10.21203/rs.3.rs-2237619/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Purpose

Dopamine transporter single-photon emission computed tomography (DAT-SPECT) is an indispensable method for investigating Parkinson’s disease (PD). However, several confounding factors must be considered when it is used in a multicentre study. We aimed to assess the impact of harmonizing multisite data on the differentiation between patients with PD and healthy elderly individuals in a multicentre cohort study.

Methods

We acquired the specific binding ratios (SBRs) of DAT-SPECT in 72 healthy elderly controls (HCs) and 81 patients with PD (PDs). We assessed the effects of the following correction methods for SBR: age and sex correction, correction for scanner differences by phantom scanning (phantom correction), a standardized operation for SBR computation (operation standardization), and a data-driven statistical method. We investigated the changes in the SBR and in the area under the receiver operating characteristic curve (ROC-AUC) as a measure of PD diagnostic accuracy.

Results

Without correction, the SBR yielded fair discrimination of PDs and HCs (Hedges’ g = 2.82 and ROC-AUC = 0.926). Age-sex correction had only a small effect (g = 2.76 and ROC-AUC = 0.936). Of the multisite harmonization methods, the combination of phantom and operation correction displayed the largest changes (g = 4.32, ROC-AUC = 0.992), followed by data-driven correction (g = 3.99, ROC-AUC = 0.987).

Conclusions

Our findings demonstrated the usefulness of the multisite harmonization of DAT-SPECT in a multicentre cohort. Prospective correction with phantom scanning and operation standardization was ideal for the robustness and interpretability of the corrected values. The data-driven correction was another powerful method; however, the corrected value requires cautious interpretation.

Dopamine transporter

SPECT

[123I]FP-CIT

Parkinson’s disease

multisite harmonization

Dopamine transporter (DAT) single-photon emission computed tomography (SPECT) imaging with ¹²³I-labeled N-(3-fluoropropyl)-2beta-carbomethoxy-3beta-(4-iodophenyl) nortropane ([¹²³I]FP-CIT) is a clinically indispensable method to confirm the loss of dopaminergic neuron terminals. DAT-SPECT is widely used for a supportive diagnosis of parkinsonian syndromes, including Parkinson’s disease (PD) and dementia with Lewy bodies, in both clinical and research fields [1–3].

Using DAT-SPECT, a specific-to-nondisplaceable binding ratio (SBR) is computed as a semiquantitative value representing the radionuclide accumulation of [¹²³I]FP-CIT. Factors such as age [1, 4–7], sex [1, 4–6, 8], and differences in the study sites and SPECT scanners [1] confound the SBR measurements, thereby suggesting that their correction may improve the diagnostic accuracy. Nevertheless, the loss of dopaminergic neurons in PD exerts a tremendous effect on DAT-SPECT results [9–11], which may argue against the need for a specific correction. Therefore, whether correcting these confounding factors improves the consistency between clinical diagnosis and SBR in multicentre SPECT studies remains unclear.

A few methods may contribute to the correction of DAT-SPECT. A model-based approach corrects for the factors that affect DAT-SPECT results, such as age, sex, and site/scanners [1, 4]. It requires phantom scanning and is laborious. An alternative, data-driven method is ComBat (“Combatting batch effects when combining batches of gene expression microarray data”) [12]. ComBat corrects for systematic differences in the mean and variance of data across different facilities and has previously been applied to other multisite data, including genomic and magnetic resonance imaging (MRI) data [12, 13]. The ComBat method is retrospectively applicable to old datasets; however, its effectiveness has not been assessed in DAT multicentre studies. An effective data-driven correction method will improve the compatibility between clinical diagnosis and DAT-SPECT in multicentre studies.

To evaluate the effectiveness of the two correction methods, we examined DAT-SPECT data acquired at multiple clinical centres from healthy elderly control participants (HCs) and patients with PD (PDs). We then assessed how the correction of confounding factors influenced the agreement of SBR findings with the clinical diagnosis of PD. First, we aimed to determine the influence of each confounding factor, including the age, sex, and difference in sites (scanners and operations), on diagnostic consistency. Then, we aimed to compare the SBR between the prospective correction and the ComBat correction. We hypothesized that the prospectively corrected SBR for all confounders would display the highest consistency with clinical diagnosis, followed by other correction methods.

Participants

Eighty-five patients with PD (PDs) and 84 healthy controls (HCs) were recruited from the Parkinson’s and Alzheimer’s disease Dimensional Neuroimaging Initiative (PADNI) study (https://padni.org/), a neuroimaging cohort study comprising patients with dementia or PD and healthy elderly individuals. We set the number of HCs to exceed the minimum sample size required for the semiquantitative analysis of the striatum in a previous report [14]. Written informed consent was obtained from all participants at the following four participating facilities: National Center of Neurology and Psychiatry (NCNP), Kyoto University (KU), Kyoto Prefectural University of Medicine (KPUM), and Fukushima Medical University (FMU) (Table 1).

Table 1

The participants’ demographic information in each centre
Center	HCs (n)	PDs (n)	Age, HCs (years)	Age, PDs (years)	Age, total (years)	Sex, HCs (male/female)	Sex, PDs (male/female)	Sex, total (male/female)
NCNP	46	19	68.5 (8.67)	70.4 (7.96)	69.0 (8.45)	26/20	15/4	41/24
KPUM	6	8	68.3 (8.38)	76.5 (5.31)	73.0 (7.73)	3/3	3/5	6/8
KU	20	42	67.0 (8.72)	68.3 (9.52)	67.9 (9.27)	13/7	18/24	31/31
FMU	0	12	n/a	66.5 (5.43)	66.5 (5.43)	n/a	7/5	7/5

Data shown are the mean (standard deviation: SD).

NCNP, National Center of Neurology and Psychiatry; KPUM, Kyoto Prefectural University of Medicine; KU, Kyoto University; FMU, Fukushima Medical University; HCs, healthy subjects; PDs, patients with Parkinson’s disease

Fukushima Medical University did not provide DAT data from healthy elderly individuals. The cognitive functions of all participants were sufficiently preserved to provide informed consent. The inclusion criteria were as follows: age ≥50 years, having a study partner who could report on the participant’s activities of daily living (ADL), and no use of medications affecting dopamine uptake. The exclusion criteria were as follows: neurological and psychiatric disorders other than PD (e.g., cerebral infarction, major depressive disorder), allergy to alcohol or iodine, and concurrent plans to participate in clinical trials.

PDs were consistent with clinically established Parkinson’s disease or clinically probable Parkinson’s disease in the International Parkinson and Movement Disorder Society (MDS) PD criteria [15]. HCs were free of any psychiatric or neurological disorders, maintaining completely independent ADL, and without significant abnormalities in the neurological and psychological tests required for PADNI (refer to the “Clinical and Neuropsychological Assessments” section). The participants had no apparent structural abnormalities, which may have affected their cognitive or motor function, on T1-weighted and T2-weighted brain structural MRIs [16].

Sixteen participants, including 12 HCs and four PDs, were excluded at the screening visit. Six, one, and five HCs were excluded for latent brain lesions on MRI (four had cerebral infarction and two had enlarged ventricles), difficulty in cooperating with their study partners, and abnormalities in neuropsychological tests, respectively. For PDs, one patient each had severe white matter hyperintensities and difficulty in cooperating with the study partner, and two patients had withdrawn their consent for personal reasons. After applying the exclusion criteria, 72 HCs and 81 PDs were included in this study.

Imaging data acquisition and processing

We acquired SPECT-computed-tomography (SPECT-CT) scans using five imaging devices in the four institutes (Table 2).

Table 2

The single photon emission computed tomography (SPECT) scanner information in each centre
Center	Interval between infusion and scan (hours)	Dose of intravenous [¹²³I]FP-CIT (MBq)	SPECT scanner
NCNP	3.0	158 to 186	Siemens Symbia T6 + LMEGP, GE Discovery NM/CT 670 pro + ELEGP
KPUM	3.0	158	GE Discovery NM/CT670QSP + ELEGP
KU	3.0	167	GE NMCT 870DR + ELEGP
FMU	3.0	167	Toshiba GCA-9300R + FANHR

X-ray-CT attenuation correction without scatter correction was selected as the reconstruction method. Following reconstruction, the data were quantified by the physician of each facility using the Southampton method [17]. Subsequently, a technician/physician at each institute calculated the uncorrected SBR, using their routine to produce the striatal count of the specific binding divided by the non-specific binding count in the reference region [1].

DAT-SPECT data processing to compute the SBR was performed using DaTView (Nihon Medi-physics). Before the operation standardization, the setting of the striatal volume of interest (VOI) and background differed across the institutes (Iso-Contour Threshold at ~ 30% at NCNP, Iso-Contour Threshold at ~ 40% at KPUM, Iso-Contour Threshold at ~ 50% at KU, and 16-mm Contour Extraction at FMU).

We calculated the age-sex-corrected data using the uncorrected data and an existing DAT-SPECT database based on healthy people [1]. Following the DaTView software, which is based on an existing report [1], we used 57 years, the mean age of the DAT-SPECT database, as the reference age and an annual change of -0.063 in SBR for the age correction. For sex correction, 0.461 was added to the SBR of the women. This procedure converted each SBR to the value expected if the participant were 57 years old.
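The age and sex correction described above can be sketched as a short function. This is a minimal illustration and not the DaTView implementation: the function name is hypothetical, and the linear form and the sign of the age term are assumptions, chosen so that removing an age-related decline of 0.063 per year raises the SBR of participants older than 57 years, which agrees with the increase in mean SBR reported after correction.

```python
REFERENCE_AGE = 57.0        # mean age of the reference database (from the text)
ANNUAL_SBR_CHANGE = -0.063  # change in SBR per year of age (from the text)
FEMALE_OFFSET = 0.461       # value added to the SBR of women (from the text)


def age_sex_corrected_sbr(sbr, age_years, is_female):
    """Project an SBR onto the reference age (assumed linear age effect)."""
    # Undo the age-related change accumulated since the reference age.
    age_term = -ANNUAL_SBR_CHANGE * (age_years - REFERENCE_AGE)
    # Apply the sex offset to women only, as stated in the text.
    sex_term = FEMALE_OFFSET if is_female else 0.0
    return sbr + age_term + sex_term


# Hypothetical participant: a 67-year-old man with an uncorrected SBR of 5.0.
example = age_sex_corrected_sbr(5.0, 67, False)
```

Under these assumptions, a participant aged exactly 57 years keeps the measured SBR, apart from the sex offset.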

The data for phantom correction were acquired by SPECT imaging of an anthropomorphic striatal phantom (NMP Business Support Co., Ltd., Hyogo, Japan), filled with ¹²³I solution, with the identical SPECT scanner in each facility [1, 18]. The original SBRs of each participant were calibrated using the linear equation that converts the measured SBRs of the phantom in each facility to the standardized SBRs [1], thus yielding phantom-corrected SBRs. The phantom-corrected SBRs were calculated in the four facilities. The reconstructed DAT images were sent from each facility to the central facility (NCNP). The central facility assessed whether the phantom correction was appropriately performed and re-applied the procedure to the reconstructed data if necessary. Two researchers from the principal organization computed the SBRs after standardizing the operations related to the setting of the striatal VOI and background with a standardized method (the Iso-Contour Threshold was adjusted between 30% and 60% to accurately capture the subject-to-subject size variation of the striatal VOI). Eventually, they generated three types of corrected SBR (phantom-corrected SBR, operation-standardized SBR, and phantom-operation-corrected SBR). The operation-standardized SBR and phantom-operation-corrected SBR referred to the mean SBR calculated by these researchers.
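The phantom calibration step can be illustrated with a small sketch. The text specifies only that a linear equation maps the measured phantom SBRs to the standardized SBRs; fitting that line by ordinary least squares is an assumption here, and the function names and phantom readings below are hypothetical values used purely for illustration.

```python
def fit_linear_calibration(measured, reference):
    """Least-squares line mapping measured phantom SBRs to reference SBRs."""
    n = len(measured)
    mean_x = sum(measured) / n
    mean_y = sum(reference) / n
    sxx = sum((x - mean_x) ** 2 for x in measured)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(measured, reference))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def apply_calibration(sbr, slope, intercept):
    """Convert a participant's SBR with the scanner-specific line."""
    return slope * sbr + intercept


# Hypothetical phantom readings for one scanner (not data from the study).
measured_phantom = [1.0, 2.0, 3.0, 4.0]
reference_phantom = [1.5, 3.0, 4.5, 6.0]
slope, intercept = fit_linear_calibration(measured_phantom, reference_phantom)
corrected = apply_calibration(2.5, slope, intercept)
```

One line is fitted per scanner, so a facility operating two scanners would need two sets of coefficients.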

Moreover, we examined whether a modelling method of post hoc correction for the mean and variance between the facilities could achieve data harmonization. For this purpose, we used the ComBat harmonization method for inter-facility correction [12]. We added the age, sex, and clinical diagnosis as covariates in the ComBat correction to preserve these non-site-specific sources of variability.
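The idea of correcting the mean and variance between facilities can be shown with a deliberately simplified sketch. This is not ComBat itself: ComBat additionally estimates and preserves covariate effects (here age, sex, and diagnosis) and shrinks the site parameters with empirical Bayes, both of which are omitted below. The function name and the input values are hypothetical.

```python
from statistics import mean, pstdev


def location_scale_harmonize(values, sites):
    """Align every site's mean and SD to pooled targets (no covariates, no shrinkage)."""
    grand_mean = mean(values)
    by_site = {}
    for value, site in zip(values, sites):
        by_site.setdefault(site, []).append(value)
    site_mean = {s: mean(v) for s, v in by_site.items()}
    # Each site needs at least two distinct values, otherwise its SD is zero.
    site_sd = {s: pstdev(v) for s, v in by_site.items()}
    # Pooled within-site SD: spread of the values around their own site mean.
    residuals = [v - site_mean[s] for v, s in zip(values, sites)]
    pooled_sd = pstdev(residuals)
    return [
        grand_mean + pooled_sd * (v - site_mean[s]) / site_sd[s]
        for v, s in zip(values, sites)
    ]


# Hypothetical SBRs from two sites with different offsets and spreads.
sbr = [1.0, 3.0, 11.0, 15.0]
site = ["A", "A", "B", "B"]
harmonized = location_scale_harmonize(sbr, site)
```

Because this sketch ignores covariates, it would also erase genuine differences between sites that arise from a different mix of patients and controls, which is the reason for including diagnosis, age, and sex as covariates in a ComBat model.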

Clinical and neuropsychological assessments

All participants consistently underwent standardized clinical and neuropsychological assessments across the groups. The cut-off scores for defining HCs were as follows: (i) the Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) [15] part III score ≤ 7, excluding tremor, (ii) the mini-mental state examination [19] score ≥ 28, (iii) clinical dementia rating (CDR) [20] score of 0, and (iv) the Japanese edition of rapid eye movement sleep behaviour disorder screening questionnaire [21] score ≤ 5. All participants underwent the Japanese version of the Montreal Cognitive Assessment [22], frontal assessment battery [23], trail making test (TMT) [24], and the Japanese version of the odor stick identification test (OSIT-J) [25] to comprehensively investigate their cognitive and physical conditions.

Statistical analyses

The present study assessed the significance of two major corrections, namely age-sex correction and site-effect harmonization. According to the MDS PD criteria, each participant was labelled either HC or PD. For the two major corrections, we examined the changes in the SBR difference between HCs and PDs using Hedges’ g. In addition, the accuracy of classifying HCs and PDs with a corrected and uncorrected SBR was compared using receiver operating characteristic (ROC) analysis. The area under the ROC curve (ROC-AUC) was used as an index of diagnostic accuracy.
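The two indices can be illustrated with a short, dependency-free sketch. Hedges’ g is computed here from summary statistics using the common small-sample correction factor 1 - 3/(4(n1 + n2) - 9), and ROC-AUC is computed through its equivalence with the Mann-Whitney statistic. Both function names are hypothetical, and the choice of correction factor is an assumption because the text does not state which variant was used. With the uncorrected means, SDs, and group sizes reported in this study, the function returns approximately 2.82.

```python
from math import sqrt


def hedges_g(mean1, sd1, n1, mean2, sd2, n2):
    """Bias-corrected standardized mean difference from summary statistics."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    cohens_d = (mean1 - mean2) / sqrt(pooled_var)
    correction = 1.0 - 3.0 / (4.0 * (n1 + n2) - 9.0)
    return correction * cohens_d


def roc_auc(higher_group, lower_group):
    """Probability that a value from higher_group exceeds one from lower_group.

    Ties count as one half; this equals the area under the ROC curve.
    """
    wins = 0.0
    for h in higher_group:
        for low in lower_group:
            if h > low:
                wins += 1.0
            elif h == low:
                wins += 0.5
    return wins / (len(higher_group) * len(lower_group))


# Uncorrected SBR summary statistics reported in this study (HCs, then PDs).
g_uncorrected = hedges_g(5.32, 1.57, 72, 1.04, 1.45, 81)

# Hypothetical individual SBRs, for illustration only.
auc_example = roc_auc([5.1, 6.0, 4.2], [1.2, 2.5, 4.5])
```

Because a lower SBR indicates PD, the HC values are passed as the higher group; the function `roc_auc_score` in scikit-learn computes the same quantity from labels and scores.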

Subsequently, we assessed the effects of two factors related to site-effect harmonization. In the site-effect analysis, we used the age-sex-corrected data, which displayed higher accuracy than the uncorrected data in the ROC analysis (see Results). The two factors of the harmonization comprised normalization with phantom scans and operation standardization across different scanners/institutes. In the ROC analysis, we compared the accuracy with SBR across the following four conditions: no harmonization, operation standardization only, phantom correction only, and both phantom correction and operation standardization (phantom and operation correction). Finally, we performed a two-way analysis of variance (ANOVA), including the two factors (phantom correction and operation standardization) as within-subject variables.

These analyses were performed with Python scikit-learn 0.24.2 (https://scikit-learn.org/) and statsmodels 0.12.1 (https://www.statsmodels.org). The results were visualized using matplotlib 3.1.1 (https://matplotlib.org/) and seaborn 0.9.0 (https://seaborn.pydata.org/). The behavioural data and SBR for each correction were plotted on “Raincloud plots” [26] using ptitprince 0.2.5 (https://github.com/pog87/PtitPrince).

The detailed clinical and neuropsychological assessments supported the diagnostic label of HCs and PDs. The MDS-UPDRS part III score was ≥ 8 (excluding tremors) in all PDs, whereas it was < 8 in all HCs. None of the HCs had an apparent cognitive decline or movement disorder (Table 3).

Table 3

The results of clinical evaluations
		Healthy subjects	Parkinson’s disease
MDS-UPDRS	Part Ⅰ	2.17 (2.61)	10.72 (4.98)
	Part Ⅱ	0.40 (1.21)	12.53 (8.57)
	Part Ⅲ	1.29 (2.26)	29.77 (18.40)
	Part Ⅳ	n/a*¹	3.13 (5.59)
MMSE		29.16 (1.69)	27.43 (3.075)
CDR Sum of boxes		0.08 (0.20)	0.98 (1.89)
Global CDR		0 (0)	0.22 (0.35)
MoCA-J*²		26.2 (2.45)	24.9 (3.81)
RBDSQ-J		2.03 (1.97)	4.77 (2.91)
FAB		16.7 (1.17)	14.9 (3.15)
TMT-A		40.5 (14.1)	70.0 (61.7)
TMT-B		73.0 (42.1)	109.1 (69.2)
OSIT-J		9.23 (2.67)	4.75 (2.86)

Data shown are the mean (standard deviation: SD).

*¹ Healthy subjects did not undergo MDS-UPDRS part Ⅳ because they were free of parkinsonism and anti-parkinsonian drugs.

*² MoCA-J scores are adjusted for years of education.

MDS-UPDRS, Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale; MMSE, Mini Mental State Examination; CDR, Clinical Dementia Rating; MoCA-J, Japanese version of Montreal Cognitive Assessment; RBDSQ-J, Japanese edition of RBD screening questionnaire; FAB, Frontal Assessment Battery; TMT-A/B, Trail Making Test A/B; OSIT-J, Odor Stick Identification Test for Japanese

Significant differences were noted in the MDS-UPDRS scores across the facilities if a modelling method of post hoc correction was used (Supplementary Table 1). Following the correction for age and sex, the SBR changed from 5.32 ± 1.57 (mean ± SD) to 6.13 ± 1.54 in HCs and from 1.04 ± 1.45 to 2.03 ± 1.41 in PDs (Fig. 1A). This procedure did not substantially alter the effect size of the SBR difference between PDs and HCs (Hedges’ g = 2.82 before and 2.76 after the age and sex correction).

In the ROC analysis, the diagnostic accuracy of DAT-SPECT in the multicentre data was marginally better with the age- and sex-corrected SBR (ROC-AUC = 0.936) than with the uncorrected SBR (ROC-AUC = 0.926) (Fig. 1B). Because of the slightly better discrimination accuracy, we used the age- and sex-corrected SBR in the following analyses.

Subsequently, we assessed the effects of multisite harmonization, with phantom correction and standardized operations as the factors. The mean SBR was 5.78 ± 1.33 (SD) and 1.63 ± 1.23 following the standardized operations only, 6.32 ± 1.12 and 2.29 ± 1.00 following the phantom correction only, and 6.52 ± 1.06 and 2.40 ± 0.99 following both phantom and operation corrections in HCs and PDs, respectively (Fig. 2). A two-way ANOVA revealed significant effects of both phantom correction (F [1.00, 140.76] = 215.60, p < 0.001, Greenhouse-Geisser corrected) and standardized operation (F [1.00, 138.56] = 7.98, p = 0.005, Greenhouse-Geisser corrected); moreover, it revealed an interaction between the phantom correction and standardized operation (F [1.00, 150.65] = 19.20, p < 0.001, Greenhouse-Geisser corrected).

The ROC-AUC score was 0.964, 0.971, and 0.992 following the operation standardization, phantom correction, and both phantom and operation correction, respectively (Fig. 3).

Following the multisite harmonization with ComBat, the mean SBR was 5.24 ± 0.89 (SD) and 2.01 ± 0.73 for HCs and PDs, respectively. The ROC-AUC score following the ComBat correction was 0.987, almost equivalent to the phantom- and operation-corrected data (Fig. 4).

Despite reasonably segregated PDs and HCs in each facility before harmonization, a substantial difference was noted in the mean SBR across the facilities in both HCs (F [2, 13.65] = 47.14, p < 0.001 by Brown-Forsythe corrected one-way ANOVA) and PDs (F [3, 35.69] = 52.19, p < 0.001) (Fig. 5, Supplementary Table 2). On combining data from all facilities, the data were not normally distributed, and the segregation between PDs and HCs was modest. The phantom-operation harmonization procedure removed the site difference in HCs (F [2, 32.83] = 0.55, p = 0.58) but not fully in PDs (F [3, 18.86] = 7.11, p = 0.005). The site difference was not evident in either HCs or PDs after ComBat correction (Supplementary Table 2). After harmonization, the segregation between PDs and HCs was more apparent, with improved distribution in each group. The effect size of the mean SBR difference between HCs and PDs increased from Hedges’ g = 2.76 following age and sex correction to Hedges’ g = 4.32 following phantom and operation correction and Hedges’ g = 3.99 following ComBat correction.

The present study demonstrated that the harmonization procedure to remove site-specific effects improved the accuracy of detecting the loss of dopaminergic terminals in the striatum in multicentre DAT-SPECT data.

DAT-SPECT is an established method for the differential diagnosis of PD and related disorders [27]. Moreover, the uncorrected DAT data displayed an acceptable level of accuracy for differentiating PDs and HCs, in reference to the MDS PD criteria as the gold standard. Without harmonization, an ROC-AUC of 0.926 in a multicentre study appeared sufficiently high to support the clinical diagnosis of PD. Together with other clinical assessments, including the MDS-UPDRS part III score and olfactometry, DAT-SPECT in clinical practice appeared sufficiently accurate even with uncorrected data.

Several studies [1, 4–8, 28, 29] have demonstrated the effects of age, sex, and scanner differences on SBR using linear regression models; thus, these effects are plausible. However, a previous study using the Parkinson Progression Marker Initiative dataset reported that SBR data without correction for sex or age might be acceptable for differentiating between PDs and HCs in a multicentre DAT study [9]. This claim reflected that the effect size of the dopaminergic terminal loss in PD was substantially larger than that of age and sex. Therefore, the confounding effects, other than the disease, were negligible in differentiating PDs from HCs. This interpretation appears reasonable because the loss of dopaminergic terminals progresses before the clinical onset of PD [30, 31], and a substantial reduction of SBR is already present at the onset, even in mild cases [32]. Indeed, the present study confirmed that the correction for age and sex did not substantially alter the ability of SBR to differentiate between PDs and age- and sex-matched HCs [9]. The correction for age and sex is not necessary when appropriate control data are available.

By contrast, corrections for age and sex may be required to compare individual data with those in the database [1]. They may also be necessary when the mean difference in the SBR across the groups is marginal, for example, when SBR is applied to differentiate prodromal PD from healthy elderly individuals or from PD. Future studies should address this possibility. Because age, sex, and inter-site differences all affect SBRs, whether to correct for these effects depends on the purpose of the study.

The present study demonstrated that correction for differences in the facilities improved the diagnostic accuracy of dopaminergic denervation. The correction for facility differences comprised two factors as follows: (i) the correction across scanners by applying a linear transformation equation based on data from a phantom filled with ¹²³I solution for each SPECT scanner (phantom correction) and (ii) the standardization of human operation to set up the VOI in the software computing SBR (operation standardization). Both factors exerted significant effects on the SBR. Moreover, we identified a substantial interaction between the phantom correction and operation standardization, thus suggesting that the effects differed across the facilities (i.e., some sites employed a procedure similar to the standard procedure, whereas some did not). The application of phantom correction and operation standardization achieved a high agreement with the clinical diagnosis of PD. Together, the two corrections increased the effect size of the SBR difference between PDs and HCs (Hedges’ g from 2.76 to 4.32) and raised the ROC-AUC by approximately 0.06 (from 0.936 to 0.992).

The ROC-AUC improvement may not appear monumental. However, this level of difference could exert tremendous effects in large-scale studies, such as a randomized controlled trial for disease-modifying therapy. In clinical trials involving thousands of participants, a 6% difference in diagnostic accuracy could result in over a hundred misdiagnoses [33–36]. A cohort based on an accurate diagnostic test should yield a more specific outcome of the intervention and save enormous time and financial costs in these large-scale studies. Therefore, we strongly recommend phantom correction and operation standardization to reduce false findings from DAT-SPECT when managing a large-scale multicentre SPECT study. The benefit of the correction may be considerably greater in clinical trials in prodromal PD, in which the differences in SBRs from HCs are marginal.

The phantom-operation correction removed the site-effects in HCs; however, the correction only reduced the site-effects in PD (Supplementary Table 2). As DAT-SPECT reflects the severity of PD, this finding can be attributed to the difference in the severity of PD across the sites (Supplementary Table 1) [37]. Hence, the phantom-operation correction likely removed the technical differences across sites only, leaving the difference in the participants’ factors unaffected. This is favourable when we consider analyses using inter-individual differences after the harmonization.

Two HCs had SBRs in the range categorized as PD, and three PDs were categorized as HCs from the SBR cut-off, even with extensive corrections. We assessed the detailed clinical background of these participants. Neither of the two HCs (false positives) displayed increased MDS-UPDRS III scores or general cognitive decline (CDR = 0). However, one of the HCs with reduced SBR finished TMT-B in 82 s, approaching the cut-off value, which raised a suspicion of latent cognitive decline. The remaining HC had an OSIT-J score of 5 points, thus indicating mild olfactory impairment. These participants are being followed up in the PADNI cohort to monitor the development of parkinsonism or cognitive decline. The three PDs (false negatives) displayed MDS-UPDRS part III scores (excluding tremors) of 8, 15, and 14, respectively, thereby indicating relatively mild motor symptoms. The PADNI will also follow up these participants to observe possible progression of parkinsonism and a decrease in SBR.

The correction with ComBat improved the diagnostic accuracy to a level comparable to the full model-based correction. ComBat harmonization is principally used in genomic and MRI studies as a simple and robust method for correcting measurement bias across facilities. ComBat correction is a powerful method that can replace laborious procedures, such as phantom scanning, at each facility. In addition, it appears useful when phantom scans cannot be performed, for example, for already completed research projects. Moreover, it should be effective when analysing a public neuroimaging dataset [38]. However, although ComBat correction is promising, it appears to have a limitation. The ComBat-corrected SBR had a lower mean than the SBRs from the remaining correction methods. This result was probably attributable to the low mean uncorrected SBR in a facility with numerous participants. Specifically, the mean and variance of a single facility can significantly affect the corrected data, thus compromising its generalizability to third parties or in meta-analyses. Therefore, we recommend model-based corrections for age, sex, and site-effect whenever possible.

A limitation of the present study was that PD diagnosis depended on the clinical symptoms, levodopa responsiveness, and olfactory tests only, without additional tests to exclude atypical parkinsonism, for example, ¹²³I-metaiodobenzylguanidine SPECT.

In conclusion, we compared the correction methods for evaluating dopaminergic terminal loss in multisite DAT-SPECT data. The prospective phantom and operation correction improved the diagnostic accuracy and effect size, even though one institute lacked data from HCs. A multisite database with a completely standardized SBR will enable reliable large-scale multisite research, thus overcoming the study-wise limitation at each facility. Furthermore, the ComBat correction reasonably improved the diagnostic accuracy of PD. The ComBat correction is applicable when phantom scanning is unavailable, for example, to compare data with publicly available data sets.

Funding: This study was partially supported by the Japan Agency for Medical Research and Development (AMED, www.amed.go.jp) (18dm0207070s0001, 18dm0307003h0001) and the Japan Society for the Promotion of Science (JSPS, www.jsps.go.jp) KAKENHI (19H05726).

Competing interests: The authors have no relevant financial or non-financial interests to disclose.

Authors’ contributions: NW: study concept and design, data acquisition, data analysis, interpretation of results, and writing and revising the draft. HT: data acquisition. MA: data acquisition. NS: data acquisition. T Murai: data acquisition. T Mizuno: data acquisition. T Matsuoka: data acquisition. RY: data acquisition. HY: data acquisition. HM: data acquisition, the interpretation of the results. TH: study concept and design, data acquisition, the interpretation of the results, revising the draft. All authors have read and approved the final manuscript.

Data availability: The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request. The data are not publicly available due to ethical restrictions.

Code availability: Not applicable.

Ethical approval: The present study is registered in the University Hospital Medical Information Network Clinical Trials Registry (UMIN000036297), a public clinical trial registry that meets the International Committee of Medical Journal Editors criteria. Furthermore, the research protocols were approved by the institutional review board of the National Center of Neurology and Psychiatry, Japan, with the identification number A2018-086. This study was conducted according to the tenets of the Declaration of Helsinki.

Consent to participate: All participants provided written informed consent to participate.

Consent to publish: The patients provided written informed consent for the publication of the images in Figures 1 and 2 showing anonymized SPECT imaging results for individual subjects.

Acknowledgments: The members of the Parkinson’s and Alzheimer’s disease Dimensional Neuroimaging Initiative are listed in the Supplementary Information. We thank the Editage group (https://www.editage.jp/) for editing a draft of this manuscript.

Matsuda H, Murata M, Mukai Y, et al. Japanese multicenter database of healthy controls for [123I]FP-CIT SPECT. Eur J Nucl Med Mol Imaging. 2018;45:1405–16. https://doi.org/10.1007/s00259-018-3976-5.
Iwabuchi Y, Kameyama M, Matsusaka Y, et al. A diagnostic strategy for Parkinsonian syndromes using quantitative indices of DAT-SPECT and MIBG scintigraphy: an investigation using the classification and regression tree analysis. Eur J Nucl Med Mol Imaging. 2021;48:1833–41. https://doi.org/10.1007/s00259-020-05168-0.
Shimizu S, Hirao K, Kanetaka H, et al. Utility of the combination of DAT SPECT and MIBG myocardial scintigraphy in differentiating dementia with Lewy bodies from Alzheimer’s disease. Eur J Nucl Med Mol Imaging. 2016;43:184–92. https://doi.org/10.1007/s00259-015-3146-y.
Honkanen EA, Noponen T, Hirvilammi R, et al. Sex correction improves the accuracy of clinical dopamine transporter imaging. EJNMMI Res. 2021;11:1–9. https://doi.org/10.1186/s13550-021-00825-3.
Eusebio A, Azulay JP, Ceccaldi M, et al. Voxel-based analysis of whole-brain effects of age and gender on dopamine transporter SPECT imaging in healthy subjects. Eur J Nucl Med Mol Imaging. 2012;39:1778–83. https://doi.org/10.1007/s00259-012-2207-8.
Varrone A, Dickson JC, Tossici-Bolt L, et al. European multicentre database of healthy controls for [123I]FP-CIT SPECT (ENC-DAT): Age-related effects, gender differences and evaluation of different methods of analysis. Eur J Nucl Med Mol Imaging. 2013;40:213–27. https://doi.org/10.1007/s00259-012-2276-8.
Werner RA, Lapa C, Sheikhbahaei S, et al. Impact of aging on semiquantitative uptake parameters in normal rated clinical baseline [123I]Ioflupane single photon emission computed tomography/computed tomography. Nucl Med Commun. 2019;40:1001–4. https://doi.org/10.1097/MNM.0000000000001061.
Nobili F, Naseri M, De Carli F, et al. Automatic semi-quantification of [123I]FP-CIT SPECT scans in healthy volunteers using BasGan version 2: Results from the ENC-DAT database. Eur J Nucl Med Mol Imaging. 2013;40:565–73. https://doi.org/10.1007/s00259-012-2304-8.
Schmitz-Steinkrüger H, Lange C, Apostolova I, et al. Impact of age and sex correction on the diagnostic performance of dopamine transporter SPECT. Eur J Nucl Med Mol Imaging. 2021;48:1445–59. https://doi.org/10.1007/s00259-020-05085-2.
Nam SB, Kim K, Kim BS, et al. The Effect of Obesity on the Availabilities of Dopamine and Serotonin Transporters. Sci Rep. 2018;8:1–6. https://doi.org/10.1038/s41598-018-22814-8.
Pak K, Kim K, Lee MJ, et al. Correlation between the availability of dopamine transporter and olfactory function in healthy subjects. Eur Radiol. 2018;28:1756–60. https://doi.org/10.1007/s00330-017-5147-7.
Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27. https://doi.org/10.1093/biostatistics/kxj037.
Maikusa N, Zhu Y, Uematsu A, et al. Comparison of traveling-subject and ComBat harmonization methods for assessing structural brain characteristics. Hum Brain Mapp. 2021;42:5278–87. https://doi.org/10.1002/hbm.25615.
Schmitz-Steinkrüger H, Lange C, Apostolova I, et al. Impact of the size of the normal database on the performance of the specific binding ratio in dopamine transporter SPECT. EJNMMI Phys. 2020;7:1–6. https://doi.org/10.1186/s40658-020-00304-z.
Postuma RB, Berg D, Stern M, et al. MDS clinical diagnostic criteria for Parkinson’s disease. Mov Disord. 2015;30:1591–601. https://doi.org/10.1002/mds.26424.
Koike S, Tanaka SC, Okada T, et al. Brain/MINDS beyond human brain MRI project: A protocol for multi-level harmonization across brain disorders throughout the lifespan. NeuroImage Clin. 2021;30:102600. https://doi.org/10.1016/j.nicl.2021.102600.
Tossici-Bolt L, Hoffmann SMA, Kemp PM, et al. Quantification of [123I]FP-CIT SPECT brain images: an accurate technique for measurement of the specific binding ratio. Eur J Nucl Med Mol Imaging. 2006;33:1491–9. https://doi.org/10.1007/s00259-006-0155-x.
Tossici-Bolt L, Dickson JC, Sera T, et al. Calibration of gamma camera systems for a multicentre European ¹²³I-FP-CIT SPECT normal database. Eur J Nucl Med Mol Imaging. 2011;38:1529–40. https://doi.org/10.1007/s00259-011-1801-5.
Folstein MF, Folstein SE, McHugh PR. ‘Mini-mental state’. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–98. https://doi.org/10.1016/0022-3956(75)90026-6.
Morris JC. The Clinical Dementia Rating (CDR): current version and scoring rules. Neurology. 1993;43:2412–4. https://doi.org/10.1212/wnl.43.11.2412-a.
Miyamoto T, Miyamoto M, Iwanami M, et al. The REM sleep behavior disorder screening questionnaire: validation study of a Japanese version. Sleep Med. 2009;10:1151–4. https://doi.org/10.1016/j.sleep.2009.05.007.
Fujiwara Y, Suzuki H, Yasunaga M, et al. Brief screening tool for mild cognitive impairment in older Japanese: validation of the Japanese version of the Montreal Cognitive Assessment. Geriatr Gerontol Int. 2010;10:225–32. https://doi.org/10.1111/j.1447-0594.2010.00585.x.
Dubois B, Slachevsky A, Litvan I, et al. The FAB: a frontal assessment battery at bedside. Neurology. 2000;55:1621–6. https://doi.org/10.1212/WNL.55.11.1621.
Tombaugh TN. Trail Making Test A and B: normative data stratified by age and education. Arch Clin Neuropsychol. 2004;19:203–14. https://doi.org/10.1016/S0887-6177(03)00039-8.
Kobayashi M, Nishida K, Nakamura S, et al. Suitability of the odor stick identification test for the Japanese in patients suffering from olfactory disturbance. Acta Otolaryngol Suppl. 2004;:74–9. https://doi.org/10.1080/03655230410017715.
Allen M, Poggiali D, Whitaker K, et al. Raincloud plots: A multi-platform tool for robust data visualization [version 1; peer review: 2 approved]. Wellcome Open Res. 2019;4:1–46. https://doi.org/10.12688/wellcomeopenres.15191.1.
Suwijn SR, van Boheemen CJM, de Haan RJ, et al. The diagnostic accuracy of dopamine transporter SPECT imaging to detect nigrostriatal cell loss in patients with Parkinson’s disease or clinically uncertain parkinsonism: A systematic review. EJNMMI Res. 2015;5:1–8. https://doi.org/10.1186/s13550-015-0087-1.
Tossici-Bolt L, Dickson JC, Sera T, et al. [123I]FP-CIT ENC-DAT normal database: the impact of the reconstruction and quantification methods. EJNMMI Phys. 2017;4:1–6. https://doi.org/10.1186/s40658-017-0175-6.
Buchert R, Kluge A, Tossici-Bolt L, et al. Reduction in camera-specific variability in [123I]FP-CIT SPECT outcome measures by image reconstruction optimized for multisite settings: impact on age-dependence of the specific binding ratio in the ENC-DAT database of healthy controls. Eur J Nucl Med Mol Imaging. 2016;43:1323–36. https://doi.org/10.1007/s00259-016-3309-5.
Iranzo A, Santamaría J, Valldeoriola F, et al. Dopamine transporter imaging deficit predicts early transition to synucleinopathy in idiopathic rapid eye movement sleep behavior disorder. Ann Neurol. 2017;82:419–28. https://doi.org/10.1002/ana.25026.
Hustad E, Aasly JO. Clinical and Imaging Markers of Prodromal Parkinson’s Disease. Front Neurol. 2020;11:1–11. https://doi.org/10.3389/fneur.2020.00395.
Fearnley JM, Lees AJ. Ageing and Parkinson’s disease: substantia nigra regional selectivity. Brain. 1991;114:2283–301. https://doi.org/10.1093/brain/114.5.2283.
Stocchi F, Rascol O, Hauser RA, et al. Randomized trial of preladenant, given as monotherapy, in patients with early Parkinson disease. Neurology. 2017;88:2198–206. https://doi.org/10.1212/WNL.0000000000004003.
Lang AE, Espay AJ. Disease Modification in Parkinson’s Disease: Current Approaches, Challenges, and Future Considerations. Mov Disord. 2018;33:660–77. https://doi.org/10.1002/mds.27360.
LeWitt PA, Aradi SD, Hauser RA, et al. The challenge of developing adenosine A2A antagonists for Parkinson disease: Istradefylline, preladenant, and tozadenant. Parkinsonism Relat Disord. 2020;80:54–63. https://doi.org/10.1016/j.parkreldis.2020.10.027.
Vijiaratnam N, Simuni T, Bandmann O, et al. Progress towards therapies for disease modification in Parkinson’s disease. Lancet Neurol. 2021;20:559–72. https://doi.org/10.1016/S1474-4422(21)00061-2.
Ceravolo R, Frosini D, Poletti M, et al. Mild affective symptoms in de novo Parkinson’s disease patients: Relationship with dopaminergic dysfunction. Eur J Neurol. 2013;20:480–5. https://doi.org/10.1111/j.1468-1331.2012.03878.x.
Marek K, Jennings D, Lasch S, et al. The Parkinson Progression Marker Initiative (PPMI). Prog Neurobiol. 2011;95:629–35. https://doi.org/10.1016/j.pneurobio.2011.09.005.

SupplementaryInformation.docx
