Of the total 498 NPC patients, 298 were excluded due to less than a 2-year follow-up (148), no pretreatment (99) or an abnormal pretreatment thyroid function test (37), a preexisting thyroid disease (13), or a previous neck radiation (1). The remaining 200 patients (144 males and 56 females) with a mean age of 48.7 years (with a range 18 to 83 years) were analyzed. The RHT group tended to have younger ages, earlier T stages (T1-T2), more advance N stages (N2-3), and higher pretreatment TSH. The patient and treatment characteristics are shown in Table 1.
Table 1
|
|
All
(N = 200)
|
No RHT
(N = 74)
|
RHT
(N = 126)
|
Age (year)
|
Mean (SD)
|
48.7 (11.5)
|
51 (11.7)
|
47.4 (11.2)
|
Gender
|
|
|
|
|
- Male
|
N (%)
|
144 (72)
|
52 (70.3)
|
92 (73)
|
- Female
|
N (%)
|
56 (28)
|
22 (29.7)
|
34 (27)
|
T stage
- T1-2
- T3-4
|
N (%)
N (%)
|
130 (65)
70 (35)
|
41 (55.4)
33 (44.6)
|
89 (70.6)
37 (29.4)
|
N stage
- N0-1
- N2-3
|
N (%)
N (%)
|
64 (32)
136 (68)
|
30 (40.5)
44 (59.5)
|
34 (27)
92 (73)
|
Group Stage
- Stage I
- Stage II
- Stage III
- Stage IV
|
N (%)
N (%)
N (%)
N (%)
|
3 (1.5)
33 (16.5)
107 (53.5)
57 (28.5)
|
0 (0)
12 (16.2)
37 (50)
25 (33.8)
|
3 (2.4)
21 (16.6)
70 (55.6)
32 (25.4)
|
WHO type
|
|
|
|
|
- 1 : Keratinizing
- 2 : Non-keratinizing
|
N (%)
|
2 (1)
|
1 (1.4)
|
1 (0.8)
|
2A: Differentiated
|
N (%)
|
32 (16)
|
12 (16.2)
|
20 (15.9)
|
2B: Undifferentiated
|
N (%)
|
165 (82.5)
|
60 (81.0)
|
105 (83.3)
|
- Other
|
N (%)
|
1 (0.5)
|
1 (1.4)
|
0 (0)
|
RT Technique
- IMRT
- VMAT
- IMRT + VMAT
|
N (%)
N (%)
N (%)
|
36 (18)
159 (79.5)
5 (2.5)
|
12 (16.2)
59 (79.7)
3 (4.1)
|
24 (19)
100 (79.4)
2 (1.6)
|
Pre-treatment TFT
|
Mean (SD)
Mean (SD)
|
1.3 (0.2)
1.7 (0.9)
|
1.3 (0.2)
1.4 (0.9)
|
1.3 (0.2)
1.9 (0.8)
|
- FT4 (ng/dl)
- TSH (µU/ml)
|
Post-treatment TFT (worst during 2-year follow-up)
|
- FT4 (ng/dl)
|
Mean (SD)
|
1 (0.2)
|
1 (0.2)
|
1 (0.2)
|
- TSH (µU/ml)
|
Mean (SD)
|
9.5 (19.4)
|
2.2 (1)
|
13.9 (23.4)
|
Thyroid gland
- Volume (cm3)
|
Mean (SD)
|
14.8 (7.3)
|
15.1 (6.1)
|
14.6 (7.9)
|
- gEUD (Gy)
|
Mean (SD)
|
47.2 (6.9)
|
45.5 (6.5)
|
49.2 (6.7)
|
- Dmean (Gy)
|
Mean (SD)
|
52.9 (5.3)
|
51 (5.3)
|
54 (5.1)
|
- Dmin (Gy)
|
Mean (SD)
|
34.4 (5.9)
|
32.6 (6.1)
|
35.4 (5.5)
|
- Dmax (Gy)
|
Mean (SD)
|
66.3 (6.8)
|
64.1 (6.1)
|
67.6 (6.8)
|
- V35 (%)
|
Mean (SD)
|
96.8 (7.2)
|
94.6 (10)
|
98.2 (4.6)
|
- V40 (%)
|
Mean (SD)
|
91.5 (12)
|
87.8 (15)
|
93.7 (9.2)
|
- V45 (%)
|
Mean (SD)
|
82.3 (16.7)
|
77.2 (19.2)
|
85.3 (14.4)
|
- V50 (%)
|
Mean (SD)
|
67.9 (22)
|
61.9 (23.6)
|
71.4 (20.3)
|
- V55 (%)
|
Mean (SD)
|
44.2 (27)
|
37.1 (25.9)
|
48.3 (26.9)
|
- V60 (%)
|
Mean (SD)
|
16.3 (20.2)
|
10.3 (15.3)
|
19.8 (21.9)
|
- VS45 (cm3)
|
Mean (SD)
|
2.7 (2.8)
|
3.4 (2.9)
|
2.3 (2.6)
|
- VS50 (cm3)
|
Mean (SD)
|
4.8 (3.9)
|
5.7 (3.8)
|
4.3 (3.7)
|
- VS55 (cm3)
|
Mean (SD)
|
8.3 (5.4)
|
9.4 (4.8)
|
7.7 (5.6)
|
- VS60 (cm3)
|
Mean (SD)
|
12.4 (6.5)
|
13.5 (5.5)
|
11.8 (6.9)
|
Pituitary gland
|
|
|
|
|
- Pmean (Gy)
- Pmin (Gy)
- Pmax (Gy)
|
Mean (SD)
Mean (SD)
Mean (SD)
|
53.6 (16.7)
45.3 (19.5)
61.3 (14.4)
|
55.5 (17.8)
46.7 (20.8)
62.8 (14.9)
|
52.4 (15.9)
44.5 (18.7)
60.5 (14.1)
|
Chemotherapy
|
N (%)
|
194 (97)
|
72 (97.3)
|
122 (96.8)
|
Abbreviations : RHT = Radiation-induced hypothyroidism; WHO = World Health Organization; RT = Radiotherapy; IMRT = Intensity-modulated radiotherapy; VMAT = Volumetric Modulated Arc Therapy; TFT = Thyroid function test; TSH = Thyroid-stimulating hormone; FT4 = Free thyroxine; Dmin = Minimal dose of the thyroid (Gy); Dmean = Mean dose of the thyroid (Gy); Dmax = Maximum dose of the thyroid (Gy); Pmin = Minimal dose of the pituitary (Gy); Pmean = Mean dose of the pituitary (Gy); Pmax = Maximum dose of the pituitary (Gy); gEUD = generalized equivalent uniform dose (Gy); Vx = Volume of thyroid gland receiving x Gy or more (%); VSx = absolute thyroid volume spared from x Gy or more (cm3)
With a median follow-up time of 29 months (with a range 5-120 months), RHT occurred in 126/200 patients (63%) at a median latency period of 18 months (5–97 months). The 1, 2 and 3-year cumulative incidences of RHT were 17.5%, 38.5% and 50.5%, respectively. The most common type of RHT was subclinical hypothyroidism (108/126, 85.7%), followed by primary hypothyroidism (11/126, 8.7%), and central hypothyroidism (7/126, 5.6%). There were patients with subclinical hypothyroidism who progressed to primary hypothyroidism (15/108, 13.9%) and central hypothyroidism (1/108, 0.9%) after a longer follow-up period. Thyroid hormone supplementation was prescribed to 77/126 patients (61.1%). Generally, thyroid hormone replacement therapy was prescribed when TSH level ≥ 10 µU/mL or FT4 ≤ 0.8 ng/dL. The median time from complete radiation to thyroid hormone supplementation was 26 months (with a range 6-108 months).
The mean dose, Dmin, Dmax, and Vx of the thyroid gland in RHT group were significantly higher than non-RHT group, while the VS45, VS50, and VS55 were significantly lower. The maximum correctly classified cutoff for the mean thyroid dose to differentiate between patients with no RHT and RHT group was 45Gy. The incidence of RHT was 2% and 61% in the thyroid Dmean of ≤ 45Gy and > 45Gy subgroups, respectively. The gEUD was also a significant predictor for RHT. Compared to the mean thyroid dose, all patients had a lower gEUD with a mean absolute difference of 5Gy (with a range range 0–8Gy).
Other dosimetric parameters, such as thyroid volume and pituitary exposure dose, were not found to be significant in this study. The mean thyroid volume was 14.8 cm3 (15.1 vs 14.6 cm3, with a non-significant P-value = 0.64) and the mean pituitary dose was 53.6 Gy (55.5 vs 52.4 Gy, with a non-significant P-value = 0.22). In subgroup analysis, patients with central RHT received a significantly higher pituitary dose that those without central RHT (a median of Pmean 72.3 Gy vs 57.4Gy with a significant P-value = 0.009, and a median of Pmax 74.7Gy vs 63.5Gy with a significant P-value = 0.019) as shown in Supplement 3.
NTCP model development
In the univariate logistic regression analyses, age, T and N stage, pre-treatment TSH, Dmean, Dmin, Dmax, V35, V40, V45, V50, V55, V60, VS45, VS50, and VS55 of the thyroid gland were significantly associated with RHT at any time point. In the multivariate logistic regression analysis, age, pre-treatment TSH ,and Dmean were the only significant parameters, as shown in Table 2.
Table 2
Univariate and multivariate analyses for radiation-induced hypothyroidism.
Variables
|
Reference
|
Univariate
|
Multivariate*
|
Odds Ratio
|
95%CI
|
P-value
|
Odds
Ratio
|
Regression coefficient
|
95%CI
|
P-value
|
Age (year)
|
|
0.97
|
0.95-1.00
|
0.035
|
0.97
|
-0.03
|
-0.06 to -0.002
|
0.034
|
Gender
|
Male
|
0.87
|
0.46–1.65
|
0.676
|
|
|
|
|
T stage
|
T1-2
|
0.52
|
0.28–0.94
|
0.030
|
|
|
|
|
N stage
|
N0-1
|
1.84
|
1.0-3.39
|
0.048
|
|
|
|
|
Pre-treatment FT4
|
|
1.61
|
0.37–7.06
|
0.525
|
|
|
|
|
Pre-treatment TSH
|
|
2.44
|
1.61–3.71
|
< 0.001
|
2.49
|
0.91
|
0.48–1.33
|
< 0.001
|
Thyroid volume (cm3)
|
0.99
|
0.95–1.03
|
0.643
|
|
|
|
|
gEUD (Gy)
|
|
1.09
|
1.04–1.14
|
< 0.001
|
|
|
|
|
Dmean (Gy)
|
|
1.12
|
1.05–1.18
|
< 0.001
|
1.12
|
0.12
|
0.05–0.19
|
0.001
|
Dmin (Gy)
|
|
1.09
|
1.03–1.14
|
0.001
|
|
|
|
|
Dmax (Gy)
|
|
1.09
|
1.04–1.14
|
0.001
|
|
|
|
|
V35 (%)
|
|
1.07
|
1.03–1.12
|
0.002
|
|
|
|
|
V40 (%)
|
|
1.04
|
1.02–1.07
|
0.002
|
|
|
|
|
V45 (%)
|
|
1.03
|
1.01–1.05
|
0.001
|
|
|
|
|
V50 (%)
|
|
1.02
|
1.01–1.03
|
0.004
|
|
|
|
|
V55 (%)
|
|
1.02
|
1.01–1.03
|
0.005
|
|
|
|
|
V60 (%)
|
|
1.03
|
1.01–1.05
|
0.002
|
|
|
|
|
VS45 (cm3)
|
|
0.87
|
0.78–0.96
|
0.009
|
|
|
|
|
VS50 (cm3)
|
|
0.92
|
0.85–0.98
|
0.022
|
|
|
|
|
VS55 (cm3)
|
|
0.94
|
0.89–0.99
|
0.032
|
|
|
|
|
VS60 (cm3)
|
|
0.96
|
0.92-1.00
|
0.078
|
|
|
|
|
Pmean (Gy)
|
|
0.99
|
0.97–1.01
|
0.219
|
|
|
|
|
Pmin (Gy)
|
|
0.99
|
0.98–1.01
|
0.456
|
|
|
|
|
Pmax (Gy)
|
|
0.99
|
0.97–1.01
|
0.282
|
|
|
|
|
Chemotherapy
|
Yes
|
0.85
|
0.15–4.74
|
0.850
|
|
|
|
|
Constant
|
|
|
|
|
|
-5.62
|
-9.33 to -1.901
|
0.003
|
Abbreviations: CI = confidence interval; TSH = Thyroid-stimulating hormone; Dmean = Mean dose of the thyroid (Gy); Pmean = Mean dose of the pituitary (Gy); Vx = Volume of thyroid gland receiving x Gy or more (%); VSx = absolute thyroid volume spared from x Gy or more (cm3).
* The multivariate results were obtained using a requirement of a P-value < 0.10 from the univariate and backward stepwise logistic regression analyses to select the final model.
The multivariable NTCP model for RHT is given by the following equation:
with S(x) = -5.62 + (-0.03×age(yrs)) + (0.91×pre-treatment TSH (µU/ml)) + (0.12×Dmean thyroid (Gy)).
It was noted that the negative coefficient for age (-0.03) indicated that patients with an older age had a reduced risk of RHT. However, the positive coefficients for pre-treatment TSH (0.91) and Dmean (0.12) indicated that these two variables were positively associated with a risk of RHT. Pearson correlation tests showed no significant correlations between age and Dmean, but there were high correlations between the dose-volume variables of thyroid gland as shown in Supplement 4.
T and N stage were significant predictors of RHT in univariate analysis, but not in multivariate analysis. The supraclavicular lymph node involvement was not significantly different between the non-RHT and RHT groups (6.8% vs 14.3% with a non-significant P-value = 0.107). Additionally, there was no difference of the presence of PTV 70Gy in the supraclavicular region between the non-RHT and RHT groups (21.6% vs 32.5% with a non-significant P-value = 0.099).
Model performance and validation
Our NTCP model for predicting RHT had a good performance with an AUROC of 0.749 (95% CI 0.677–0.821) (Fig. 1). The Hosmer-Lemeshow goodness-of-fit test was used to assess the performance of our NTCP model. There was a good degree of fit with a non-significant P-value = 0.837. Supplement 5 shows the calibration plot of expected and observed event rates. Internal validation was performed with a 10-fold cross-validation and resulted in a mean AUROC of 0.764 (95% confidence interval (CI) 0.634–0.795). The model performance on external validation dataset (N = 25) was excellent with an AUROC of 0.812. We compared our model to two previously reported NTCP models for predicting RHT in NPC patients, Luo[12] and Shen[16], and revealed the low AUROC in both training and external validation dataset (Table 3 and Fig. 2).
Table 3
Model performance and validation.
|
N
|
Model
|
AUROC from original dataset
|
AUROC from our training dataset
|
AUROC from external validation dataset
|
Our NTCP model
|
200
|
βthyroid mean dose = 0.12
βAge = -0.03
βTSH pre RT = 0.91
Constant = -5.62
|
0.749
|
0.749
|
0.812
|
Luo et al.[12]
|
174
|
βPmax= -0.026
βthyroid V50 = 0.05
βfemale = 1.28
βCMT = 2.902
Constant = -2.695
|
0.793
|
0.606
|
0.571
|
Shen et al.[16]
|
69
|
βthyroid mean dose = 0.093
βthyroid Volume = -0.188
Constant = -1.385
|
NR
|
0.619
|
0.487
|
Abbreviations: AUROC = area under the curve of receiver operating characteristic (ROC) curves; NR=not reported; RT=radiotherapy; CMT=chemotherapy