Aim, design and setting of the study
To investigate the association between PA trajectories and longitudinal changes in renal function, we performed the present study with a prospective cohort. Data used in this study were obtained from the CHARLS, a nationally representative longitudinal survey conducted in Chinese population aged 45 years and above[8]. The baseline survey was launched in 2011 and then the participants were followed up biennially. Since serum creatinine and cystatin C levels were measured in 2011 and 2015, the present study used data from 2011 to 2015.
Study population
Participants were excluded if they were lack of information on age or gender in 2011, missing information on PA in 2011 or 2015, or missing information on renal function including serum creatinine and cystatin C in 2011 or 2015, or under 60 years old. A total number of 407 participants were selected. The flow chart showing the selection of study population were presented in Fig. 1.
Physical activity assessment
The weekly volume of PA was reflected by the physical activity score (PAS). The variable “PAS” was constructed according to the International Physical Activity Questionnaire[9]. Specifically, variables used to calculate PAS in the “health status and functioning” section of CHARLS codebooks in 2011 and 2015 included da052_1_(days do vigorous activities at least 10 minutes continuously), da052_2_(days do moderate physical effort at least 10 minutes continuously), da052_3_(days walking at least 10 minutes continuously), da053_1_(2 hours everyday do vigorous activities at least 10 minutes continuously), da053_2_(2 hours everyday do moderate physical effort at least 10 minutes continuously), da053_3_(2 hours everyday walking at least 10 minutes continuously), da054_1_(minutes everyday do vigorous activities at least 10 minutes continuously), da054_2_(minutes everyday do moderate physical effort at least 10 minutes continuously), da054_3_(minutes everyday walking at least 10 minutes continuously), da055_1_(4 hours everyday do vigorous activities at least 10 minutes continuously), da055_2_(4 hours everyday do moderate physical effort at least 10 minutes continuously) and da055_3_(4 hours everyday walking at least 10 minutes continuously). Based on the responses of participants, the daily PA duration index was defined as follows: the duration of PA less than 30 minutes in one day was indexed as 1; the duration ranging from 30 minutes to 2 hours in one day was indexed as 2; the duration ranging from 2 hours to 4 hours in one day was indexed as 3; the duration more than 4 hours in one day was indexed as 4[10]. Then, the weekly PA duration index can be figured out by multiplying the daily PA duration index for each kind of activity and the number of days. Finally, the variable “PAS” was generated using metabolic equivalent multipliers as follows: PAS = 8.0 × weekly vigorous PA duration index + 4.0 × weekly moderate PA duration index + 3.3 × weekly walking duration index[10].
Evaluation of changes in renal function
In 2011 and 2015, the CHARLS collected venous blood samples to measure renal function including serum creatinine and cystatin C levels. We used the new equation without race developed in 2021 by the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) to calculate estimated glomerular filtration rate based on serum creatinine and cystatin C (eGFRcr-cys)[11]. The follow-up years were calculated based on the time span between interview dates in 2015 and 2011. The annualised rate of changes in renal function over the four years from 2011 to 2015 (slope of eGFRcr-cys) equal to the quotient of difference between eGFRcr-cys in 2015 (eGFRcr-cys2015) and eGFRcr-cys in 2011 (eGFRcr-cys2011, also called baseline eGFRcr-cys) divided by follow-up years. The formula of changes in renal function is as follows: slope of eGFRcr-cys = (eGFRcr-cys2015 - eGFRcr-cys2011)/follow-up years.
Covariates
In this study, the following covariates were included: age, gender (male, female), area (urban, rural), education level (less than lower secondary, upper secondary & vocational training, tertiary), marital status (married, unmarried/divorced), current smoking (no, yes), drinking in last year (no, yes), body mass index (BMI) categorization (underweight: less than 18.5, normal weight: from 18.5 to 23.9, overweight: from 23 to 24.9, obesity: greater than 25), hypertension (no, yes), diabetes mellitus (no, yes), heart disease (no, yes), dyslipidemia (no, yes), liver disease (no, yes), kidney disease (no, yes), baseline eGFRcr-cys, baseline uric acid, baseline hypersensitive C-reactive protein (hsCRP), and Baseline triglyceride-glucose (TyG) index. The TyG index was calculated as ln[fasting triglyceride (mg/dL) × fasting plasma glucose (mg/dL)/2][12]. For missing of each categorical covariate, we set those missing values as another level of the categorical covariate. All continuous covariates were complete.
Statistical analysis
First, trajectories of PAS were identified by latent class trajectory modeling (LCTM), a method to identify heterogeneous subgroups of population in terms of longitudinal exposures or outcomes[13]. By this method, we can determine the optimal number of trajectory classes with the lowest Bayesian Information Criterion (BIC)[14]. The mean of posterior probabilities in each class less than 70% indicated a poor fit[14].
Second, to describe characteristics of study population classified by trajectories of PAS, normally distributed continuous variables were presented as mean ± standard deviation; non-normally distributed continuous variables were presented as median (1st quartile, 3rd quartile); categorical variables were presented as n (%). Student’s t-tests were used to estimate differences in normally distributed continuous variables among trajectories of PAS; Kruskal-Wallis tests were used to estimate differences in non-normally distributed continuous variables among trajectories of PAS; Chi-square tests were used to estimate differences in categorical variables among trajectories of PAS.
Third, to investigate the association between trajectories of PAS and the slope of eGFRcr-cys, univariate and multivariate logistic regression models were used. Three models were constructed: crude model with no adjusted covariates; model 1 with adjusted age and gender; model 2 with adjusted age, drinking in last year, diabetes mellitus, heart disease, kidney disease, baseline eGFRcr-cys and baseline uric acid. In model 2, variates selected for adjustment were determined by backward stepwise selection[15].
Fourth, subgroup analysis was performed as a sensitivity test to evaluate the robustness of our results. We tested potential interactions between trajectories of PAS and the slope of eGFRcr-cys in different subgroups of participants. P for interaction was estimated by the likelihood ratio test. The variate “trajectories of PAS” was treated as a continuous variable. Subgroups were stratified by drinking in last year (no, yes), diabetes mellitus (no, yes), heart disease (no, yes), kidney disease (no, yes), baseline eGFRcr-cys (< 60 mL/min/1.73m2, ≥ 60 mL/min/1.73m2), and baseline uric acid (< 6 mg/dL, ≥ 6 mg/dL).
Fifth, serial mediation analysis was conducted to reveal significant mediating pathways from trajectories of PAS to the slope of eGFRcr-cys. A direct effect is the association between predictor variable (X) and outcome variable (Y) after controlling for all mediators, whereas an indirect effect is the association between X and Y via particular mediators[16]. Trajectories of PAS was entered as X. The slope of eGFRcr-cys was entered as Y. Baseline TyG index was entered as mediator 1(M1). Baseline eGFRcr-cys was entered as mediator 2(M2). A 1000 sample bootstrapping technique was used to estimate the indirect effects.
All analyses and graphics programs were performed with R-4.2.2 (https://www.R-project.org, The R Foundation). A p value less than 0.05 (two-sided) was considered statistically significant.