Study design
Data collection was performed in three stages. In March 2020 we collected data from 5 686 adults in China who completed an online survey. We used a two-by-two factorial design contrasting between (1) medical and non-medical workers, and (2) Wuhan and non-Wuhan residents. Our sample includes 1 136 medical workers many of whom participated in the front-line anti-pandemic activities.iii As we discuss below, the survey elicited the respondents’ willingness for a loved one to choose a medical profession. We use this as a measure for medical occupation preferences.
Our approach utilizes the fact that the COVID-19 pandemic and lockdown policy made Wuhan distinct from other areas of China. By March 12, 2020,iv the total number of confirmed cases in Wuhan was 49 986, accounting for 61.9% of the total number of confirmed cases in China.v The total number of deaths reached 3 869, accounting for 83.5% of 4 632 reported deaths. Of the 3 000 medical workers in China who had been infected with COVID-19, more than 85% were medical workers in Wuhan. Wuhan was the only city in China whose medical system was being overwhelmed by confirmed cases. Among our medical worker sample from Wuhan, 85.3% reported that their direct family members or friends had been infected by the virus, which is much higher than the 52.5% of the non-medical sample from Wuhan and the 27.3% of the medical sample from other regions of China.
In May 2020 we collected data from 1 198 adults in the UK who completed an online survey. The survey contained questions mirroring those in the Chinese survey, thus, allowing a robustness check of our findings. By the 20th May the UK had experienced around 35 000 deaths due to Covid.vi This is orders of magnitude more than in Wuhan or China. And extreme stresses on workers in the National Health Service (NHS) were apparent during the pandemic [30]. Morale in the NHS was, however, already low after a decade of real budget cuts [31]. Moreover, the lockdown was associated with strong public support for medical professionals with, for instance, a weekly “clap for carers” [32]. We expected, therefore, that the direct impact of the pandemic on medical occupation preference would be less pronounced in the UK compared to Wuhan. This allows a robustness test for the underlying role of social and risk preferences in medical occupation choice.
In December 2020 we conducted a field experiment with 428 first and second-year students in Wuhan. Second-year students enrolled before the COVID-19 pandemic began while first-year students enrolled after the pandemic. We collect data from both medical and non-medical students. This allows us to conduct a difference-in-difference analysis examining the causal effect from the exogenous shock of the pandemic on the social-preferences and risk-preferences of medical students. Again, this allows a robustness test for the underlying role of social and risk preferences in medical occupation choice.
Ethical approval for the data collected was provided by Wuhan University (China Survey and field experiment) and De Montfort university (UK Study).
Participants
To recruit respondents for our main survey we followed a “social network snowball” (SNS) sampling method. SNS sampling is an appropriate tool for researching "hard to reach" populations that are difficult to study through conventional survey methods [33-36], and it has been found to have no detectable impact on associations among variables [37]. We, therefore, used this approach to collect data from individuals with COVID-19 experiences, especially medical workers.
We shared the survey in multiple private WeChat groups and asked existing survey participants to share the survey among their own colleagues and acquaintances. To ensure that no individual could respond to the survey more than once, we screened responses based on their IP address and WeChat account number. To collect medical worker samples, we submitted the questionnaire to target hospital WeChat groups in Wuhan and other cities. The medical workers' samples included at least six out of the top ten hospitals with the highest number of infected medical workers in China.vii Survey respondents were based in 32 of China's provincial regions (as shown in Figure A1 in the Supplementary Material). More than one in ten percent of respondents were from Wuhan, which is the provincial capital of Hubei Province. A descriptive summary of the survey sample is shown in Table 1.
Participants for the UK survey were recruited using Prolific. The sample was chosen to be representative of the UK population in terms of sex, age and ethnicity. Of the 1 198 participants, 75 (7.5%) were working in the healthcare sector, 28 of them involved in the front line treatment and care of people during the pandemic, and 184 (18.4%) had immediate family working in the health sector.
A total of 428 students from one of the top universities in Wuhan participated in the field experiment. The experiment is composed of four sub-samples varied by the year of study (Freshman n = 210 vs. Sophomore n = 218) and school of admission (non-medico n = 196 vs. medico n = 232).
Table 1 Descriptive Summary
|
Wuhan
|
Non-Wuhan
|
|
|
Med
|
Nonmed
|
Med
|
Nonmed
|
Total
|
Preference for medical occupation (pooled across married and unmarried)
|
|
|
|
|
|
WTM
|
2.5(1.5)
|
3.6(1.3)
|
3.9(1.5)
|
4.1(1.2)
|
4.0 (1.3)
|
Preference
|
0.7(0.7)
|
1.0(0.7)
|
1.2(0.8)
|
1.2(0.7)
|
1.2(0.7)
|
Panel a. Selected demographic variables
|
|
|
|
|
|
Age (proxy)
|
39.5(10.3)
|
35.4(10.8)
|
33.6(10.3)
|
33.4(9.8)
|
33.8(10.0)
|
Male (%)
|
32.2(46.8)
|
39.0(48.8)
|
40.0(49.0)
|
40.4(49.1)
|
39.9(49.0)
|
High education, college or above (%)
|
97.2(16.7)
|
93.9(24.1)
|
84.4(36.3)
|
78.0(41.4)
|
80.8(39.4)
|
High income, 10w-30w or above (%)
|
56.4(49.7)
|
53.2(50.0)
|
34.7(47.6)
|
30.5(46.1)
|
33.7(47.3)
|
Married (%)
|
84.3(36.4)
|
61.5(48.7)
|
71.5(45.2)
|
64.1(48.0)
|
65.9(47.4)
|
No. of children (married)
|
1.1(0.5)
|
1.0(0.6)
|
1.3(0.6)
|
1.2(0.6)
|
1.2(0.6)
|
Panel b. Risk/social preference
|
|
|
|
|
|
Risk taking
|
2.8(0.6)
|
3.0(0.6)
|
3.6(0.9)
|
3.3(0.8)
|
3.3(0.8)
|
Prosocial
|
2.9(0.6)
|
3.0(0.6)
|
3.6(0.9)
|
3.3(0.8)
|
3.3(0.8)
|
Prefer son (%)
|
33.2(47.2)
|
41.2(49.3)
|
37.7(48.5)
|
38.4(48.6)
|
38.3(48.6)
|
Panel c. Influences from COVID-19, 5-extremely positive, 1-extremely negative
|
|
|
|
|
|
Family heath
|
3.1(1.3)
|
2.9(1.1)
|
3.7(1.2)
|
3.5(1.2)
|
3.5(1.2)
|
Occupational development
|
3.3(1.3)
|
2.7(1.2)
|
3.7(1.3)
|
3.4(1.2)
|
3.4(1.3)
|
Financial status
|
2.8(1.1)
|
2.6(1.1)
|
3.5(1.3)
|
3.3(1.2)
|
3.2(1.2)
|
Mental health
|
3.1(1.1)
|
2.9(1.1)
|
3.8(1.2)
|
3.5(1.1)
|
3.5(1.2)
|
Panel d. COVID-19 experiences
|
|
|
|
|
|
Participated (frontline anti-pandemic activities, %)
|
73.9(44.0)
|
1.6(12.6)
|
44.8(49.8)
|
5.1(21.9)
|
13.8(34.5)
|
Participated (volunteer work, %)
|
56.4(49.7)
|
31.0(46.3)
|
73.9(43.9)
|
37.7(48.5)
|
43.8(49.6)
|
Family member who is a medical worker (%)
|
55.0(49.9)
|
27.5(44.7)
|
68.9(46.3)
|
29.6(45.7)
|
36.8(48.2)
|
Direct family member was infected (%)
|
10.4(30.6)
|
4.5(20.9)
|
10.5(30.7)
|
1.0(9.9)
|
3.1(17.4)
|
Friend was infected (%)
|
74.9(43.5)
|
48.1(50.0)
|
16.8(37.4)
|
4.5(20.8)
|
12.0(32.5)
|
Sample Size
|
211
|
374
|
925
|
4 176
|
5 686
|
Notes. Expected value or proportion (in terms of %) for key variables. Standard deviations were reported in the last column of the table in the parenthesis. 1 For post-pandemic expectations, responses of “no preference” were removed from the analysis. The proportion is a comparison between “more” and “less”. For a full analysis of the responses, see Table A3.
Procedures
We administered the main survey online between March 12 and March 20, 2020. Each survey respondent received a payment of 2 yuan for their completion of the survey. In addition, respondents were informed that by answering the survey they could help the research about the pandemic and medical industry. Respondents typically took approximately 5-15 minutes to complete the survey, and we excluded from the sample any respondents who took less than four minutes to complete the survey. This left a total of 5 686 complete survey responses. The survey included 66 questions in total (all survey questions are available in the Supplementary Material). Certain sections of the survey contained jump logic, so respondents only needed to answer a subset of the survey questions.
We constructed two measures of medical occupation preference. First is “individual’s willingness to let their children (or partner if the respondent was unmarried) choose a medical occupation”. This was measured on a 5-point Likert scale from 1 “Not willing at all” to 5 “Extremely willing” and will be denoted WTM. (See Q21 for married and Q31 for unmarried respondents, Supplementary Material). Second, we asked “preference for children's (or partner's if the respondent was unmarried) occupation after the COVID-19 outbreak”. This will be denoted Preference and is a direct measure of change in medical occupation intention under the influence of the COVID-19 pandemic. This was measured on a scale of 2 being “More willing to let my children/partner choose a medical job,” 1 being “No change” and 0 being "Less willing to let my children/partner choose a medical job". (See Q25 for married and Q35 for unmarried in the Supplementary Material.)
Both WTM and Preference are a direct measure of the respondents medical occupational preferences for a loved one. We view this as a proxy for the participants preferences for own occupation. We also note that family influences are crucial for the development of young people's occupational intentions [38-41]. Similar to those in other countries, parents in China play a crucial role in all aspects of their children's lives, including critical decision-making points such as choosing a career [42, 43]. Moreover, there is evidence of family ties in the medical profession. For instance, Dal Bo et al. (2009) reports that almost 14% of doctors in the US have fathers who were doctors [44], while George and Ponattu (2018) shows that 10.5% of doctors in India have fathers who were doctors [45]. We believe our two measures of occupation preference are, thus, a robust way of modelling medical occupation preferences.
In the survey we also asked participants their self-evaluated influence from the pandemic along 4 aspects: health, occupation/work, financial status, relationships and mental health (see Q56-Q60 in the Supplementary Material).viii The answers ranged from extremely negatively (1) to neutral (3) and extremely positively (5). These were aggregated to provide a self-reported measure of the impact of the COVID-19 pandemic. Individual risk preference is measured by 13 hypothetical questions adopted from the domain-specific risk-taking scale (DoSpeRT) [46], which is widely used in psychology and economic studies.ix Social preference is measured by 4 additional questions about willingness to engage in other-regrading behavior, such as donate blood, donate money or give up a seat on the bus/metro (see Q41, Q42, Q53, Q55 in the Supplementary Material).
The UK survey was conducted on the 19-20th May with participants recruited using Prolific. Participants were paid £2.50 for completing the survey which took around 5-15 minutes to complete. The survey had a total of 61 questions (all survey questions are available in the Supplementary Material). The UK survey explored a range of issues (other than medical occupation preference) but contained questions that mirrored those in the main China survey about willingness to let children (or partner) choose a medical profession. It also contained similar measures of self-reported impact, risk-taking and social preference.
The field experiment in Wuhan was conducted in December 2020. To elicit individual pro-sociality participants were given an initial endowment of 50 Chinese Yuan and were asked if they wanted to donate part of the endowment to a charity program that supports Amyotrophic Lateral Sclerosis (ALS) patients. This was a real incentivized choice with money going either to the respondent or charity. Risk-preference was elicited using the same 13-hypothetical questions from the DoSpeRT task as with the medical occupational survey described above. Experimental instructions can be seen in the Supplementary Material.
Statistical analysis
We analyze our measures of occupation preference (WTM and Preference) against individual's experience during the pandemic. Our main approach is to assume that there were no systematic differences in medical occupation preference between Wuhan and non-Wuhan residents before the lockdown. If, therefore, we observe differences in medical occupation preference in our sample this is likely due to the COVID-19 pandemic. Given the greater negative impact of COVID-19 in Wuhan we expect Wuhan residents, particularly health care workers, to be more negatively impacted. It can be seen from Table 1 that residents in Wuhan exhibit significantly lower medical occupation preference, especially among medical workers. This is consistent with the negative impact of the pandemic lowering preference for medical occupation. We will refer to as the “Wuhan medical worker effect”. We tested the robustness of this effect by running ordered probit regressions with WTM (ranging from 1-5) and Preference (ranging from 0-2) as the dependent variable, controlling for demographic variables (panel a.) and risk and social attitudes (panel b.).
We perform two pieces of analysis to check the robustness of our findings against the assumption that there were no systematic differences in medical occupation preferences before the lockdown. First, we added additional controls for provincial medical conditions, e.g. medical system capacity and medical worker compensation, in the ordered probit regression analysis. Second, we analyzed the relationship between WTM and the self-reported measure of the impact of the COVID-19 pandemic.x In analysing this relationship there is a potential endogeneity problem caused by self-selection, because individuals who have lower medical occupation preference may be more likely to be negatively influenced by the pandemic. We, therefore, used awareness of medical-related news as an instrumental variable for influence from COVID-19.xi In justifying this approach we found awareness of medical news (either positive or negative) is not a weak instrument for self-evaluated influence from COVID-19 (F-statistic equals 25.39). Second, awareness of medical-related news during COVID-19, such as “More than 3 000 medical workers were infected” may have temporary effects on perceived influence from the pandemic, but could not directly affect medical occupation preference, which shall be formed during long-term evaluation and perception of the attractiveness of medical career. In other words, the instrument affects medical occupation preference only through perceived influences from COVID-19.
A key objective of our project is to explore the channels through which the experience of the pandemic and lockdown could affect medical occupation preference. One hypothesis is that a medical occupation is seen as a risky choice, and the negative experience of the lockdown in Wuhan, and possible burnout among medical workers, made Wuhan residents less risk tolerant. A second hypothesis is that the negative experience of the lockdown could have made Wuhan residents less pro-social which, in turn, would make them less likely to have a preference for a medical occupation. We performed the Sobel-Goodman mediation test to explore the mediation pathway from the pandemic to WTM [29]. In the regression analysis we use robust standard errors and control for demographic variables.
The UK survey data is analyzed using ordered probit regressions with WTM and Preference as the dependent variable. The key independent variables are risk-taking and pro-sociality. We also control for a range of variables including self-reported influence of COVID-19 and whether the respondent is a medical worker or has direct family members who are medical workers.
The field experiment data is analyzed using the non-parametric Mann-Whitney-Wilcoxon test. We compare the amount donated to charity across the four subsamples, freshman or sophomore and medico or non-medico student. We perform a similar analysis for risk-preferences. This allows us to test whether freshman students, i.e. those who enrolled after the pandemic, differ from sophomore students, who enrolled before the pandemic.
iii Front-line anti-pandemic activities are defined as activities directly related to pandemic prevention and the treatment of COVID-19 pneumonia and activities in which a person had direct contact with confirmed or suspected cases.
iv If not separately stated, all case numbers and death numbers are as of this day, which was the time to start collecting the survey.
v The city with the second-highest number of confirmed cases had only 3 518 patients.
vii Wuhan medical samples were from but not limited to the Zhongnan Hospital of Wuhan University, the Renmin Hospital of Wuhan University, Tongji Hospital, the Hubei Provincial Hospital of TCM, the Maternal and Child Hospital of Hubei Province, Wuhan Central Hospital, Wuhan Third Hospital, Wuhan Fourth Hospital, and the clinics at Wuhan University and the China University of Geoscience.
viii We also planned to collect data on relationships with their family, but due to technical issues the data were invalid and were thus removed from the analysis.
ix We selected 13 out of 30 original questions that were most applicable to and suitable for respondents in China.
x We focus on WTM but the results also apply to Preference.
xi Specifically, the instrumental variable was derived from the Q15 and Q16 in the survey: “Looking back at the development of the pandemic, which of the following events have had the most psychological impact (motivation or discouragement) on you?”. Among the 15 motivating options, two positive events had the most influence on medical workers, “Medical teams volunteered to assist in Hubei province” and “Medical workers received national cognition award”; on the other hand, among the 15 discouraging options, two negative events/news were mostly correlated with the medical industry, “The death of Dr. Li Wenliang” and “More than 3 000 medical and nursing staff members were infected”.