Characteristics of Participants
The characteristics of participants in this study were shown in Table 1. Of the 4978 participants, there were 905 patients with CKD. Participants with CKD were significantly older (61.07 vs 45.34, p < 0.01). Men were more likely to developing CKD. Participants with higher education levels, high family income reduced the risk of having CKD. Participants with history of smoking status had a higher proportion of having CKD compared to no-smoker. CKD participants have higher BMI being 29.87(95% CI, 29.30-30.43). On the other hand, participants with impaired glucose tolerance and diabetes had significantly higher risk of CKD. Participants with CKD had lower eGFR being 72.48(95% CI, 70.03-74.93) compared to the normal. Serum zinc were lower in CKD participants compared to others (12.29 vs 12.57, P = 0.028), on the contrary, serum copper was higher in participants with CKD (19.41 vs 18.49, P <0.001). There were no significant differences in race/ethnicity and serum selenium.
Logistic Regression Models
The levels of serum zinc, copper and copper/zinc were categorized Q1-4 by quartiles, in which Q 1 set as referent. Weighted logistic regression model was constructed to explore the associations between serum zinc, copper levels and copper/zinc ratios and the risk of CKD (table 2). In model 1 with adjusted by age and gender, serum copper and zinc level of Q4 was significantly associated with CKD (Table 3). The risk of developing CKD was decreased gradually, with increasing serum zinc levels or decreasing serum copper levels, and this trend was statistically significant (p for trend equal to 0.033 and less than 0.001, respectively). The model 2 was adjusted by age, gender, ethnicity, education status, average household income, BMI, diabetic status and smoke status. In the model 2, serum zinc level of Q2-4 was significantly associated with CKD and the odds ratio (OR) equal to 0.75(95%CI, 0.58-0.96), 0.74(95%CI, 0.56-0.99), 0.63(95%CI, 0.43-0.91) respectively. On the other hand, Q4 level of serum copper was significantly associated with CKD with OR being 1.62(95% CI, 1.02-2.59). Consistent with model 1, the trend of risk in model 2 was statistically significant (p for trend equal to 0.023 and less than 0.001).
For copper/zinc ratio, participants with lower ratio level had lower risk of having CKD in those models. When the copper/zinc level decreased to 1.20-1.46 level in Model 2, the risk of developing CKD decreased to 0.59 (95% CI, 0.43-0.80, p=0.002), and when the copper/zinc level decreased to less than 1.2, the risk of participants had CKD was reduced to 0.57 (95%CI, 0.39-0.84, p=0.006). The forest plot showed the weighted logistic regression (shown in figure 1), in which age, ethnicity, diabetes and smoking status were the risk factors and the higher education level and lower copper/zinc ratio were protective risks.
Dose-Response Relationship between serum copper/zinc ratio and risk of CKD
The restricted cubic spline RCS were used to analyze the relationship between copper/zinc ratio and the risk of CKD (shown in figure 2). Adjusted by age, gender, ethnicity, education status, average household income, BMI, diabetic status and smoke status, the RCS model showed a negative liner correlation between serum copper/zinc ratio and CKD risk (p <0.001 and the value of nonlinear equal to 0.06). In addition, according to the prediction of this model, when the ratio was lower than 1.49, the odds ratio of CKD begins to be lower than 1 and shows a rapidly increasing trend. On the contrary, when the ratio was higher than 1.49 and lower than 4.14, OR was gradually decreasing trend.
Machine Learning Using the XGBoost Algorithm Model
In the phase of model-development and validation, the XGBoost model was applied to determine the relative importance of variables associated with the CKD. Variables included age, education status, average household income, BMI, diabetic status and smoke status. Categorical variables were converted into ordinal factors firstly, and then converted into ordinal numeric variables into the model. According to the results of each variables’ contribution by the XGBoost model, (shown in Figure 3), besides the glomerular filtration rate, the Copper/Zinc ratio, was the most relative variable in our study.