Data Resource
The sample used in this study was drawn from ten waves (1989–2015) of the China Health and Nutrition Survey (CHNS). The CHNS is an international collaborative project between the National Institute for Nutrition and Health (NINH) at the Chinese Center for Disease Control and Prevention (CCDC) and the Carolina Population Center (CPC) at the University of North Carolina Chapel Hill. CHNS was designed to focus on investigating Chinese residents’ health and nutrition status, and how the status was affected by the social and economic transformation of Chinese society. Hence it contains abundant information on individuals’ demographic background, health status, as well as some other household-level information, etc.
Based on a multi-stage stratified random cluster process, the survey draws 7,200 households with over 30,000 individuals (both citizens and villagers) coming from 15 provinces and municipal cities, covered a large part of China’s territory, and thus eligible for representing Chinese residents. We restricted our sample to individuals who are at least 16 years old and with a full-scale of information on health status as well as some other covariates. Also, we excluded families with zero income or less as these observations could barely contribute to our study. Finally, we obtained 78,235 observations in total and a ten-wave balanced panel with 121 observations from each period.
Dependent variables
Based on the CHNS survey, the key dependent variables are mainly indexes measuring individuals’ health conditions, including body mass index (BMI), waist-hip rate (WHR), self-reported health, ADL, as well as indicators on diabetes and high blood pressure. According to Li H and Mellor [30, 31], all the dependent variables were transformed into binary variables which equal to 1 indicate a good health condition and 0 otherwise.
BMI is an international measurement for obesity and health, the standard value ranges between 18.5 and 23.9, and a higher value indicates more serious obesity and unhealthy status. In this study, we use a dummy variable that equals 1 for the standard values (18.5–23.9) and 0 otherwise, to reflect information on BMI. Compared with BMI, WHR is available for expressing the distribution of body fat, which can also be used as a prediction for elderly’s mortality [32] as well as females’ fertility and pressure [33]. WHR is also represented by a binary variable that takes 1 if WHR less than or equal to 0.8 for women and 0.9 for men and takes 0 otherwise. Self-reported health status (SRHS) includes five different categories (“very good”, “good”, “fair”, “bad”, “very bad”) in total, here we also use a dummy variable that equals 1 if the interviewer reports a good or very good health condition and equals to 0 otherwise. High blood pressure (HBP) and diabetes are indicators reflecting if the person has been diagnosed with related diseases. Activities of daily living (ADL) are a series of basic activities necessary for independent living at home or in the community [34, 35], which are mainly consisted of personal hygiene, dressing, eating, maintaining continence, and mobility. Hence, we applied a binary variable (ADL) which indicates if the respondents can finish all of these basic activities.
Independent variables
The key independent variables in this study are county-level Gini-index and Theil-index which account for income inequality. In China, a county is a basic unit for the fiscal system as well as responsible for public health and healthcare utilization [29]. Thus, these two variables should be suitable for measuring districts’ income inequality.
Individual-level control variables include income (proxied by family aggregate income divided on family size), length of education, age, gender, ethnicity (ethnic minority or not), marital status (has a spouse or not), occupation, affiliation, Hukou (China’s household registration system indicates the residents registered in rural or urban areas), and health insurances (have insurance or not) [36]. Moreover, we also controlled for some household-level covariates such as family size, access to tap water, and indoor toilet.
Table 1
Descriptive statistics of health, income, inequality and other variables
Variables
|
Obs
|
Mean
|
Std.dev
|
Min
|
Max
|
Health Variables
|
|
|
|
|
|
WHR(normal = 1)
|
78235
|
.458
|
.499
|
0
|
1
|
BMI(normal = 1)
|
78235
|
.598
|
.49
|
0
|
1
|
SRHS(very good/good = 1)
|
46785
|
.704
|
.457
|
0
|
1
|
ADL(No difficulty = 1)
|
7950
|
.775
|
.417
|
0
|
1
|
Blood(normal = 1)
|
60692
|
.917
|
.276
|
0
|
1
|
Diabetes(No = 1)
|
46503
|
.978
|
.145
|
0
|
1
|
Health Behaviors
|
|
|
|
|
|
Smoking rate
|
65635
|
.933
|
.060
|
.825
|
.998
|
Drinking rate
|
65635
|
.373
|
.025
|
.311
|
.424
|
Income and Income inequality indicators
|
|
|
|
|
|
Country income per capita(log)
|
78235
|
8.88
|
.996
|
1.193
|
13.94
|
Family income per capita(log)
|
78235
|
8.825
|
1.298
|
.361
|
15.326
|
Gini
|
78235
|
.463
|
.035
|
.323
|
.544
|
Theil
|
78235
|
.427
|
.091
|
.208
|
.691
|
P10/P50
|
78235
|
.212
|
.073
|
.087
|
.545
|
P90/P10
|
78235
|
13.096
|
4.248
|
3.699
|
26.168
|
P90/P50
|
78235
|
2.524
|
.265
|
1.937
|
3.051
|
Other Variables
|
|
|
|
|
|
Age
|
78235
|
45.103
|
15.384
|
16
|
100
|
Marriage(Yes = 1)
|
78235
|
.831
|
.375
|
0
|
1
|
Gender(Male = 1)
|
78235
|
.512
|
.5
|
0
|
1
|
Education
|
78235
|
7.578
|
4.244
|
0
|
18
|
Insurance(Yes = 1)
|
78235
|
.889
|
.314
|
0
|
1
|
Occupation
|
78235
|
3.822
|
1.509
|
1
|
6
|
Register(Rural = 1)
|
78235
|
.654
|
.476
|
0
|
1
|
Tap water(Yes = 1)
|
78235
|
.74
|
.439
|
0
|
1
|
Indoor toilet(Yes = 1)
|
78235
|
.437
|
.496
|
0
|
1
|
Family size
|
78235
|
3.924
|
1.585
|
1
|
15
|
Number of Medical Institutions
|
78235
|
1.277
|
.548
|
1
|
8
|
Distance(Minutes)
|
78235
|
14.537
|
23.246
|
0
|
3600
|
Empirical Model
As our outcome variables are all binary, this study applied a probit model. Except for the key independent variables, we also added the interaction term between income and Gini-index (Theil-index), and the squared income and Gini-index (Theil-index) are also included as covariates in the model, as we presented in Eq. (1) below:
Y ict = β1Ginict(Theilct)+β2Ginict*Incomeict + γXict + µict
Where subscript i, c, and t denote that the individual i from county c was interviewed in period t. The dependent variable consisted of several dummies indicate the interviewers were in a good health condition or not. The first independent variable Gini-index (Theil-index) reflects the degree of income inequality within a county, and the correlated estimates β1 is the key of our interest, which depicts the marginal effect of income inequality on individuals’ health conditions. Through interacting between Gini-index (Theil-index) and income, the second independent variable could help us figure out the heterogeneous effect of income inequality on the health of different income levels. X represents a series of covariates, including income, length of education, age, gender, ethnicity, marital status, occupation, affiliation, Hukou, health insurances, family size, access to tap water, and indoor toilet, as well as squared income and Gini-index (Theil-index). The error term was represented by µ.