Social contact patterns relevant for infectious disease transmission in Cambodia

DOI: https://doi.org/10.21203/rs.3.rs-1601240/v1

Abstract

Social mixing patterns are key determinants of infectious disease transmission. Mathematical models parameterised with empirical data from contact pattern surveys have played an important role in understanding epidemic dynamics and informing control strategies, including for SARS-CoV-2. However, there is a paucity of data on social mixing patterns in many settings. We conducted a community-based survey in Cambodia to characterise mixing patterns and generate setting-specific contact matrices according to age and urban/rural populations. Data were collected using a diary-based approach from 2,004 participants, selected by stratified random sampling. Contact patterns were highly age-assortative, with clear intergenerational mixing between household members. Both home and school were high-intensity contact settings, with 27.7% of contacts occurring at home with non-household members. Social mixing patterns differed between rural and urban residents; rural participants tended to have more intergenerational mixing, and a higher number of contacts outside of home, work or school. Participants had low spatial mobility, with 88% of contacts occurring within 1km of the participants’ homes. These data broaden the evidence-base on social mixing patterns in low and middle-income countries and Southeast Asia, and highlight within-country heterogeneities which may be important to consider when modelling the dynamics of pathogens transmitted via close contact.

Introduction

Infectious diseases spread via close contact exert a huge burden on health systems, local and global economies and societies. Mathematical models of infectious diseases have been used extensively to elucidate transmission dynamics, evaluate control strategies, and inform policy 1. Central to these models are transmission parameters, including the effective contact rate between individuals or groups describing population mixing. Models of directly-transmitted pathogens commonly assume that transmission rates are proportional to the rate of contact between individuals, and often explicitly incorporate age-specific data on social mixing patterns 2,3. Models allowing for heterogeneous contact rates between age groups have described age-specific infection rates for pathogens including influenza, pertussis and mumps 35. In addition to the number of contacts, the intensity, duration, and location of contacts can also influence person-to-person transmission dynamics 6.

Consideration of social mixing patterns is crucial for models aiming to simulate the impact of non-pharmaceutical interventions (NPIs) such as school and workplace closures, elderly shielding, and other social distancing measures 7. This has been brought into sharp focus recently, as governments have implemented unprecedented measures as part of the SARS-CoV-2 pandemic response, often with guidance from models incorporating empirical data on social mixing patterns 812.

Although patterns of mixing are critical determinants of infectious disease transmission, there is a lack of data on social mixing patterns in many countries 2,7. The landmark POLYMOD project by Mossong et al., (2008) surveyed, for the first time, contemporary mixing patterns in eight European countries. Using a contact diary approach, the study collected information on approximately 100,000 contacts of 7,290 participants, including their age, duration, nature (physical/non-physical), and setting 13. Data from this study continues to be used in many transmission modelling studies. Since then, empirical contact data has been collected in a number of other geographical regions 1418.

However, a 2019 systematic review of social contact surveys found that 80% of studies were conducted in High Income Countries and, despite being the most populous continent, covered only five Asian countries 2. Furthermore, only three studies included in the review examined differences between (peri)-urban and rural populations 16,19,20. Demographic and social structures, socio-cultural norms, population mobility and occupations vary substantially between (and even within) countries, and there is a need for surveys describing heterogenous contact patterns in diverse regions 2,21. To address geographical gaps in empirical data, ‘synthetic’ contact matrices have been projected for other countries using available demographic data until such data are available 7.

South-east Asia is recognised as hotspot for emergence and spread of infectious diseases, including those with pandemic potential. This is driven by factors such as increasing population sizes and mobility, coupled with environmental and land-use change including urbanisation, deforestation and intensification of livestock systems 22. To contribute to the evidence-base on social mixing patterns in different contexts, here we present an in-depth analysis of data from a community-based contact survey in Cambodia. The study aimed to characterise mixing patterns and spatial mobility, generate setting-specific contact matrices between age-groups, and assess how these patterns may vary between urban and rural populations.

Results

Description of the sampled participants

Contact diary data were obtained from 2,004 study participants in May 2012, with equal target ratios achieved for each gender, age-group, and urban/rural locations (Table S1). The median age of surveyed participants was 24 years (range: 4 days to 93 years), and 85.2% of diaries were completed on a weekday (Mon-Fri). Among participants aged between 15 and 64 years, employment rates were lower than those reported in the Cambodian Socioeconomic Survey of the same year for urban areas (64.0% vs. 86.1%) but comparable for rural areas (77.8% vs. 79.6%) (National Institute of Statistics, 2013).

Number and duration of contacts

After combining contacts reported in the contact diary with ‘supplementary’ contacts (i.e. additional contacts which were not individually reported the diary – see Methods), a total of 59,597 contacts were reported across all study participants. After adjusting for employment rates and age distribution of the Cambodian population and oversampling of week days vs weekends (Table S2) this gave an estimated mean of 31.2 contacts per person day (se = 0.73). In comparison, a mean of 16.3 (se = 0.34) contacts per day were estimated when using diary contacts alone. Participant age, occupation, urban/rural residence, and day of the week the diary was completed, were all significantly associated with the number of contacts made (ANOVA p < 0.001, Table 1). Meanwhile, age, occupation, urban/rural designation, household size, and the number of rooms were significantly associated with total contact hours. The number and duration of contacts did not differ significantly between male and female participants.

The relative number of contacts (i.e. “relative degree”, adopting social network analysis terminology) was highest between the ages of 15 and 34 years and lowest among the youngest (under 5s) and oldest (≥ 65s) age groups. Total contact duration peaked in ages 5–14, and then decreased with advancing age. Rural participants reported, on average, 1.23 times more contacts (95% CI 1.17–1.30), and 1.12 times (95% CI 1.07–1.17) more contact hours, compared with urban participants. Fewer contacts were reported by people completing their contact diaries on weekends; however, there was no significant difference in cumulative contact duration suggesting that on these days, people have fewer, but longer duration contacts.

There was no difference in the number of contacts made by housewives or retirees compared to the reference group of children at home. All other occupational groups however, had significantly greater relative degree, with shop workers/traders, and “other” (unspecified) occupational groups having approximately twice the number of contacts than children at home. Despite this wide variability, there was little difference in the total contact hours made by different occupational groups. Total contact hours were highest in pre-school children.

Table 1

Relative difference in degree (number of contacts), and total contact hours according to multivariable negative binomial regression; Significance: ***P < 0.001; **P < 0.01; *P < 0.05.

   

N (%)

Relative degree

(95% CI)

ANOVA

Relative hours contact

(95% CI)

ANOVA

 

(Intercept)

 

18.77 (15.96–22.11) ***

 

19.01 (16.54–21.88) ***

 

Urban/rural

Urban

1008 (50)

REF

***

REF

***

Rural

1008 (50)

1.23 (1.17–1.3) ***

1.12 (1.07–1.17) ***

Day of the week

Weekday

1717 (85.2)

REF

***

REF

0.059

Weekend

299 (14.8)

0.81 (0.74–0.89) ***

1 (0.92–1.08)

Sex

Male

1006 (49.9)

REF

0.91

REF

0.48

Female

1007 (50)

1 (0.93–1.06)

0.98 (0.92–1.03)

Household size

Per unit increase

-

1 (0.98–1.02)

0.27

1.1 (1.08–1.12) ***

***

Number of rooms

1

1150 (57)

REF

0.29

REF

***

2

472 (23.4)

0.98 (0.9–1.06)

0.91 (0.85–0.98) **

3

212 (10.5)

1.02 (0.91–1.14)

0.85 (0.77–0.94) ***

4

111 (5.5)

0.93 (0.8–1.08)

0.83 (0.73–0.95) **

≥ 5

69 (3.4)

1.05 (0.88–1.27)

0.81 (0.7–0.96) *

Age group

0–4

231 (11.5)

REF

***

REF

***

5–14

440 (21.9)

1.04 (0.87–1.25)

1.21 (1.04–1.42) *

15–24

355 (17.6)

1.35 (1.1–1.66) **

1.14 (0.95–1.37)

25–34

278 (13.8)

1.35 (1.07–1.71) *

1.02 (0.83–1.26)

35–44

224 (11.1)

1.32 (1.04–1.68) *

0.86 (0.7–1.07)

45–54

137 (6.8)

1.17 (0.91–1.5)

0.88 (0.71–1.11)

55–64

174 (8.6)

1.11 (0.87–1.42)

0.75 (0.61–0.94) **

≥ 65

177 (8.8)

0.77 (0.6–0.99) *

0.69 (0.55–0.86) ***

Occupation

Child at home

286 (14.2)

REF

***

REF

***

Pre-school child

75 (3.7)

1.51 (1.22–1.87) ***

1.27 (1.05–1.53) *

School/college student

466 (23.1)

1.47 (1.24–1.75) ***

1.06 (0.91–1.23)

Professional/office worker

84 (4.2)

1.64 (1.27–2.13) ***

1.07 (0.85–1.34)

Shop worker/trader

165 (8.2)

1.98 (1.57–2.49) ***

0.87 (0.7–1.06)

Manual labourer, non-agri

131 (6.5)

1.29 (1.02–1.64) *

0.84 (0.68–1.04)

Agriculture and fishing

459 (22.8)

1.29 (1.04–1.59) *

1.05 (0.87–1.27)

Housewife

134 (6.6)

1.15 (0.9–1.46)

0.97 (0.78–1.2)

Retired

54 (2.7)

1.34 (1-1.8)

0.95 (0.73–1.24)

Unemployed

95 (4.7)

1.4 (1.09–1.8) **

0.94 (0.75–1.17)

Others

64 (3.2)

2 (1.54–2.61) ***

1.08 (0.85–1.37)

 

Age-specific mixing

Heat maps of age-specific contact matrices, based on diary-reported contacts, are shown in Fig. 1 (see Supplementary Information for numeric matrices). Age-assortative mixing (i.e., mixing between people of similar ages) was apparent in both urban and rural populations, as shown by the central diagonal lines in the matrices which include contacts across all settings (Fig. 1A). This was most pronounced in school-aged children (5–14 years). Most intergenerational mixing occurred at home between household members (Fig. 1B), with the second set of parallel lines representing contacts between generations (e.g. parent-child mixing). Contrastingly, contacts made at home with non-household members did not exhibit a clear intergenerational mixing structure (Fig. 1C). Consistent with the (relatively young) demographic structure of the Cambodian population, older participants (aged between around 35 and 64 years) reported more contacts on average with younger people (aged 5–34), than was reported in the other direction.

Relatively few contacts were reported at work, particularly in rural areas (Fig. 1D). The expected strong age-assortative mixing in school was clear from the age of 5 to 19 years of age in rural locations, and from 5 to 24 years in urban locations. Few contacts were reported on transport or during leisure time, so these data were combined with the contacts reported in ‘other’ (unspecified) settings. In this combined matrix (representing all contacts outside of home, school or work), age-assortative mixing was observed, although there was also substantial mixing across age groups, particularly in rural areas (Fig. 1F).

Contact matrices which included only contacts that were physical or over 15 minutes in duration (which accounted for 81.2% of all diary-reported contacts) exhibited the same patterns of age-specific mixing (Figures S1 A-F).

Setting of contacts

The majority of contacts occurred at home (56.7% of contacts) or in ‘other’ settings (that is, other than home, work, or school) (28.5%) (Fig. 2A-D). Of note, over a quarter of all contacts (27.7%) occurred at home with non-household members. Relatively few contacts were reported to have occurred at work (2.5%), on transport (0.3%), and during leisure time (0.2%). Children and young adults (aged 5–19 years) made approximately 20% of their contacts at school/college, while those aged 20–24 made around 10% of their contacts in this setting. The proportion of contacts made at home was highest for pre-school aged-children (0–4 years), reached its lowest point at school age (5–19), and had a general trend to increase with age thereafter (Fig. 2A).

The number of contacts made in each setting was significantly associated with participant age (\({\chi }^{2}\) = 4318.9, df = 65, p < 0.001), sex (\({\chi }^{2}\)= 433, df = 5, p < 0.001), urban/rural (\({\chi }^{2}\)squared = 2628.4, df = 5, p < 0.001), and day of the week (\({\chi }^{2}\)= 70.8, df = 5, p < 0.001). Rural participants recorded a greater proportion of their contacts in ‘other’ settings compared with urban participants (37% vs 16%), and a greater proportion of their home contacts with non-household members (53% vs 39% of home contacts; Fig. 2B). Though males and females reported no difference in quantity or cumulative duration of contacts (Table 1), the setting where these contacts took place differed, with males reporting more contacts at work and in ‘other’ settings compared to females who made more contacts at home with non-household members (Fig. 2C).

Intensity of contacts

Measures of contact intensity were in general agreement: as contact duration increased, so too did the likelihood that a contact was physical in nature (Figure S2A; χ2 test for trend: χ2 = 5036.9, df = 1, p < 0.001). More regular contacts were also more likely to be physical; among contacts with whom participants reported meeting for the first time or with frequencies of up to ‘once or twice per week’, approximately one quarter were physical, compared with 60% of contacts that were made daily or almost daily (Figure S2B; χ2 = 1529.3, df = 1, p < 0.001).

Due to the close relationship between duration and physical nature of contacts, similar patterns were observed for both of these intensity measures (Fig. 3). There was a general trend for decreasing contact intensity with increasing age, both with regards to the proportion of contacts that were physical (Fig. 3A), and the duration of those contacts (Fig. 3B), consistent with the regression analysis (Table 1).

The intensity of contacts also varied by setting, with contacts made at home with household members or at school/college the most intense. Contacts recorded in multiple settings were also high intensity owing to the fact that the majority (91%) of these contacts occurred at home (in addition to other settings). While rural participants reported more contacts (Table 1), the contacts of urban participants tended to be of longer duration and were more likely to be physical (Fig. 3C and 3F).

Comparison of contact reporting methods

Over half (59%) of the participants reported having supplementary contacts (i.e. additional contacts that were not individually reported in their contact diary). Degree distribution was highly right-skewed for both reporting types, but particularly so for supplementary contacts (Figure S3).

The age distributions of diary-reported contacts vs supplementary contacts differed (\({\chi }^{2}\) = 1563.1, df = 3, p < 0.001) with supplementary contacts more likely to be children (0–4 years) and seniors (65 + years) (Figure S4). It was uncertain if this represented a true reporting bias, or was an artefact arising from the question format: namely, the use of fewer (and unequal) age-groupings to summarise the ages of supplementary contacts. Nonetheless, when comparing age-contact matrices based on diary contacts and supplementary contacts, using the age bands of the latter, similar patterns were observed (Figure S4B).

The proportion of physical vs non-physical contacts also differed between reporting methods (\({\chi }^{2}\) = 2962, df = 1, p < 0.001), with diary-reported contacts twice as likely to be physical compared with supplementary contacts (Figure S4C).

Distance of contacts and participant mobility

The majority of diary contacts (88%) occurred within 1km from participants’ homes and only 2% occurred more than 9km away from their homes. Based on background questions regarding usual travel frequencies, participants tended to remain fairly localised: 68% and 45% of rural and urban participants, respectively, reported having never left their province, while 98% of rural and 89% of urban residents reported never leaving the country. The oldest and youngest individuals tended to be the least mobile, and rural participants were less mobile than urban participants (Fig. 4).

Discussion

Here we present a large-scale, prospective social contact patterns study, the first of its kind to be conducted in Cambodia. We estimated that (at least prior to the Covid-19 pandemic) the average Cambodian person had a mean of 31.2 social contacts per day, although this figure falls to 16.3 when counting only the contacts that were individually reported in the contact diary. The average number of self-reported diary contacts are similar to the Mossong et al., (2008) European study (13.4 contacts per person per day) and a study in a rural coastal population of Kenya (17.7 contacts per person per day 16, but higher than other studies in Hong Kong 18, UK 17 and Zimbabwe 23 which reported an average of 7 to 8.1 contacts per person per day. Similar to our study, the study in Zimbabwe also asked participants to list ‘casual’ additional contacts with 64% of people reporting fewer than additional 10 contacts per person per day 23. Such heterogeneities highlight the need for the collection of context-specific mixing patterns data, although the definition of contact used in the various studies and method of recording contacts will also play a role 18.

Consistent with studies in other settings, contacts in Cambodia were highly age-assortative; this was particularly evident in school-aged children but was also observed in other age-groups, such as young adults 13,16,23. Tertiary diagonals, representing inter-generational mixing, were also evident in our contact matrices, and indeed appeared more prominent compared with data from European countries (Mossong et al., 2008). This likely reflects the higher prevalence of multi-generational households in Cambodia, further supported by our data showing that these contacts tended to be reported at home with household members, particularly in rural areas. This has important implications for respiratory infections such as influenza and SARS-CoV-2, where younger individuals may have higher exposure rates due to their relatively high contact rates, and elderly individuals may be more susceptible severe disease 7.

Home and school are well documented as high-intensity contact settings important for infectious disease transmission 24,25. Interventions aiming to reduce contact rates of vulnerable individuals (such as elderly shielding) may be more challenging to implement in LMICs given the household structures. Model simulations indicate that reducing contact rates among younger individuals in LMICs through school closures can have secondary protective effects on older household members 26. Conversely, concerns have also been raised surrounding potential adverse consequences of such measures, for example if children continue to mix with peers outside of school whilst increasing contact rates with their grandparents, or if there is health care worker absenteeism due to childcare responsibilities 27,28.

Few, if any, previously published contact surveys have differentiated between household and non-household contacts when describing contacts taking place at home (at least in their reported results). Notably, over a quarter of all diary contacts (27.7%) reported in our study occurred at home with non-household members. Given that reduction in rates of inter-household mixing has been a key component in the SARS-CoV-2 pandemic response, this may highlight a need for future contact surveys and modelling studies to consider heterogeneities in contact rates between household vs. non-household members at home. Contacts made in settings outside of home, work or school (defined as “other” settings), also accounted for a substantial proportion of contacts, particularly in rural areas, lending support for the role of NPIs such as public space closures during periods of elevated transmission risk.

A previous study by Prem et al., (2017) combined contact data from the European POLYMOD survey with socio-demographic data with to project social contact matrices for 152 countries, including Cambodia. The overall patterns in our empirical contact matrices were broadly comparable to those in the synthetic, country-level matrices generated by Prem et al., (2017), although we observed a lower relative contribution of contacts in work settings than predicted. Moreover, we observed marked differences between rural and urban areas in terms of the number, intensity and settings of contacts. The higher number of contacts observed in rural areas, particularly in “other” settings, may initially seem counter-intuitive, given higher population densities in urban areas. However, this might be explained by socio-cultural factors such as close-knit rural communities, particularly when considering our survey’s definition of a contact as a two-way conversation. Such factors may also be reflected in higher observed rates of contact made across age groups with visitors at home. Meanwhile, the contacts of urban participants tended to be of higher intensity in terms of both duration and the proportion of contacts which were physical.

Relatively few previous studies have investigated differences in contact patterns between urban and rural populations. However, our findings are consistent with a study in Kenya, which reported higher contact rates in rural vs peri-urban participants 16, while studies in Zimbabwe 29 and China 20 did not observe any significant differences. Recognizing that early outbreaks of SARS-CoV-2 were mostly observed in cities with subsequent spread to rural areas, Prem et al., (2021) recently updated their synthetic contact matrices to include custom contact matrices for rural and urban settings.

The data analysed in our study were collected over 12 days in May 2012, therefore temporal changes are not captured and changes in social mixing patterns may have occurred since that time, particularly following the emergence of the SARS-CoV-2 pandemic. However, given the lack of data from this region, and LMICs in general 2, such data are still highly valuable. The POLYMOD data collected in 2005/2006, and “synthetic” contact matrices based on these data, continue to be widely used in infectious disease transmission models, including those developed for SARS-CoV-2 6,9,30,31. Statistical approaches can be used to adjust for changing population demographics, and our results can also be used to inform and compare with future surveys to identify temporal trends.

Selection bias may have occurred as we observed lower employment rates in our survey participants than has been reported in country census data as a whole. This is likely due to the recruitment via household visits, and future studies recruiting in a similar way may seek to avoid this during collection by having sampling quotas for this variable. We sought to correct for this in our analyses by using sampling weights for employment status (along with other factors such as weekday vs weekend). Such weighting is useful, but care should be taken to avoid raking with too many variables with many levels, and ensuring sufficient data points with each cross-tabulation of these selected variables. This should help to avoid extreme sampling weights and minimise design effects 32.

Our study used paper-based contact diaries for self-reporting of contacts; other studies have used online surveys, retrospective interviews, or made inferences regarding contacts through wearable devices 33. Leung et al., (2017) found that those with more years of education and higher income levels were more likely to choose online questionnaires, as opposed to paper based, when both were offered in a study in Hong Kong. They also found that participants using paper questionnaires reported a significantly greater number and duration of contact than those using online questionnaires 18. Cambodia is classified as an LMIC and, according to the 2010 Demographic Health Survey (DHS), only 19% of rural residents had electricity (compared to 90% of urban residents), necessitating the use of household visits and paper-based questionnaires to reduce selection bias.

Prospective surveys are preferred as they allow participants to remember more details of the contacts 2,34. The use of an interviewer to retrospectively review and/or complete the contact diaries with participants through prompts enhanced completeness, and also inclusion, for example of illiterate participants (estimated at 30% of females in rural areas in 2012) 35. However, while participants were asked to report all contacts within the diary, it is worth noting that less than half of our study participants believed their diaries to be complete when asked about this at the end of the interview. (Participants were not given prior notice that this question would be asked.) On the other hand, it seems likely that many of the ‘supplementary’ (non-diary) contacts reported would have been more casual in nature, and thus of lower epidemiological relevance. Indeed, a larger proportion of these supplementary contacts were non-physical contacts compared to contact-diary reported contacts. Furthermore, the age distributions of supplementary contacts were reasonably similar to those of the diary contacts (to the extent that these could be compared), suggesting that truncation of the latter would not have biased the age-contact matrices to any substantial degree.

The contact matrices (Table S2) and other data presented here can be used to inform mathematical models of disease transmission in Cambodia. Indeed, the results from this survey were recently used to parameterise an age-structured Susceptible-Exposed-Infectious-Removed (SEIR) model of SARS-CoV-2, to conduct a scenario analysis of NPI strategies on transmission and healthcare resource capacities at country and province-level 36, under work conducted for the WHO Western-Pacific Regional Office Covid-19 modelling consortium. More broadly, the study also contributes towards addressing knowledge gaps on social-mixing patterns and population mobility in LMIC and Southeast Asian contexts, and provides empirical evidence on within-country variation between urban and rural areas.

Methods

Study design and Population

Data were collected through a cross-sectional survey in May 2012, and employed a multi-stage, quota sampling strategy. In the first stage, three main study areas were purposefully selected to enhance geographic representativeness, whilst keeping the geographic scope logistically feasible. These study areas were: i) Phnom Penh municipality (the capital) and surrounding Kandal province, located in south-central Cambodia, which informally comprise the Phnom Penh greater metropolitan area; ii) Kampot province (south-west Cambodia); and iii) Kratie province (east Cambodia) (Fig. 5). Within each province, one rural and one urban district (as designated by national census data) were randomly selected. Three communes were then randomly selected from each study district.

The total target sample size was 2,016 people, aiming for equal numbers of rural and urban residents. The sampling strategy was further stratified to target equal ratios of study participants by study district, gender (male and female) and age-group (using the age bands: 0–5, 6–12, 13–19, 20–29, 30–39, 40–59, 60+), giving a sampling quota of 72 participants within each demographic (age-gender-urban/rural) stratum. This sampling strategy, along with the data collection tools described below, were designed based on those developed by the consortium of another project (Social Mixing for Influenza-Like Illness in Asia; PI: John Edmunds) involving contact surveys in other Asian countries.

Study participants were recruited through door-to-door household visits. Households were selected by picking a random direction (determined by spinning a pen) from the approximate centre of each commune and visiting each successive house in that direction. When someone was at home and consented to participate, a single household member was recruited, providing the consenting person was within a demographic stratum for the which the quota had not been reached.

Data collection

Upon recruitment, a background questionnaire was administered to collect data on demographic and occupational variables, household structure and usual travel frequencies outside of their administrative divisions. Social contact data were collected following similar methods used by Mossong et al. (2008). Participants were prospectively allocated a 24-hour period in which they were instructed to remember all contacts they made beginning at 3 a.m. on the morning of the assigned day, where a ‘contact’ was defined as either “a two-way conversation in the physical presence of another person” or “physical skin-to-skin contact”. Participants were provided with a contact diary and encouraged to record every contact they made throughout their allocated day. An interviewer also led them through its completion the following day using a semi-structured series of prompts to aid recall.

At both visits (i.e. at recruitment and during the follow-up visit after the diary day), it was emphasised to study participants that they should try to capture all contacts within the contact diary. However, it was also acknowledged that some participants may have a large number of contacts, and/or “casual” contacts, which might result in underreporting of contacts in the contact diary. Therefore, after reviewing the diary with the participant, the interviewer also asked the participant whether they thought any contacts had been omitted in the diary. If so, participants were then asked to estimate the number of additional physical or non-physical contacts (using the same definition described above) in each of four age bands representing children (0–4 and 5–15 years), adults (16–64 years), and seniors (aged 65+). These additional contacts are henceforth referred to as ‘supplementary contacts’, as opposed to ‘diary-reported contacts’.

Ethical Approval

 This study was approved by the London School of Hygiene and Tropical Medicine (LSHTM) Institutional Review Board (ref: 6129), and by the Cambodia Ministry of Health National Ethics Committee for Health Research (ref: 188). All methods were performed in accordance with the relevant guidelines and regulations. Written informed consent was obtained from all study participants, or from a parent/guardian if the participant was under 18 years of age.

Data analysis

Number and duration of contacts

All analyses were conducted using R. Sampling weights generated by raking were applied to adjust the distribution of selected variables towards population levels (Table S1; supplementary) 37. The following variables were used for weighting: urban/rural, age group, weekday/weekend, and employment status based on three levels: non-working age, employed, and unemployed, as defined by the Cambodian Socioeconomic Survey 2013 (CSES; National Institute of Statistics, 2013).

We estimated the effects of covariates of interest on the total number of contacts (a participant’s “degree”) and the total contact hours using multivariable negative binomial regression. Total contact hours were estimated by summing the contact durations across all contacts reported in the diary. Since duration of contact was reported in one of five ranges (< 5 minutes, 5-14 minutes, 15-59 minutes, 1-4 hours, or >4 hours), the mid-point of each range was used, right truncated at 4 hours 18,38. Analysis of total contact hours was restricted to diary-reported contacts, as contact duration was not collected for supplementary contacts.

Age-specific mixing

Age-structured mixing matrices were generated using five-year age bands from 0 to 64 years, and a single age band for participants aged 65 and over. Different matrices (showing the mean number of contacts an individual in age group i made with an individual in age group j) were generated for urban and rural participants in each of five settings: 1) at home with household members, 2) at home with non-household members, 3) at work, 4) at school or college, 5) on transport, during leisure time, or in other undefined settings. To observe the age-specific mixing patterns of high-intensity contacts, another set of matrices were generated for contacts that were either physical in nature, or ≥15 mins in duration. Supplementary contacts were not included in the mixing matrices because the settings, and precise ages, were not collected for these contacts.

Comparison of reporting methods

To assess for potential bias in the reporting of contacts, comparisons were made between the age distributions, and intensity (physical vs non-physical) of contacts reported in the contact diary vs those reported as supplementary contacts. Comparisons were made using Pearson’s chi squared tests with Rao-Scott corrections to account for sampling weights 40.

Declarations

Acknowledgements

We thank enumerators from the Prek Leap National Institute of Agriculture and the Royal University of Agriculture, Cambodia, and the provincial and district veterinary officers, commune and village leaders in our study areas of Cambodia, for their invaluable support with data collection. We also acknowledge contributions of the “Social Mixing Patterns for Influenza-Like Illness in Asia” (SMILI) Project Consortium (PI John Edmunds), from which the sampling scheme and survey tool were adapted for this study. We would also like to thank Prof. Richard Coker and Dr. Arata Hidano for useful advice and discussions during data collection and data analysis, respectively. Finally, we would like to thank all study participants in Cambodia.

Funding

Data collection for this study was funded by Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) under the project “Developing evidence-based investment options in support of pandemic preparedness policy in Cambodia”. Data analyses and write-up was funded in part by the U.S. Department of Defense, Defense Threat Reduction Agency (DTRA), as part of the project “Understanding zoonotic and pandemic risk from swine production and exchange systems in Cambodia” (HDTRA11810051). The content of the information does not necessarily reflect the position or the policy of the federal government, and no official endorsement should be inferred. W.T.M.L is funded by a Bloomsbury PhD studentship.

Author Contributions

J.W.R., A.M. and K.B. and T.D. contributed to design of the study. K.B., C.T., P.S., C.P., S.S., T.D., J.W.R supervised and conducted the data collection. W.T.M.L. and A.M. analyzed the data. H.R.H., W.T.M.L., A.M., J.W.R interpreted the results and drafted the manuscript. All authors read, commented on, and approved the final manuscript.

Data availability statement

Social Contact Matrices are included in the Supplementary Information. Contact data used to generate these matrices are available from the corresponding author on reasonable request.

References

  1. Grassly, N. C. & Fraser, C. Mathematical models of infectious disease transmission. Nat. Rev. Microbiol. 6, 477–487 (2008).
  2. Hoang, T. et al. A Systematic Review of Social Contact Surveys to Inform Transmission Models of Close-contact Infections. Epidemiol. Camb. Mass 30, 723–736 (2019).
  3. Wallinga, J., Teunis, P. & Kretzschmar, M. Using Data on Social Contacts to Estimate Age-specific Transmission Parameters for Respiratory-spread Infectious Agents. Am. J. Epidemiol. 164, 936–944 (2006).
  4. Eames, K. T. D., Tilston, N. L., Brooks-Pollock, E. & Edmunds, W. J. Measured Dynamic Social Contact Patterns Explain the Spread of H1N1v Influenza. PLOS Comput. Biol. 8, e1002425 (2012).
  5. Kretzschmar, M., Teunis, P. F. M. & Pebody, R. G. Incidence and Reproduction Numbers of Pertussis: Estimates from Serological and Social Contact Data in Five European Countries. PLOS Med. 7, e1000291 (2010).
  6. Prem, K. et al. Projecting contact matrices in 177 geographical regions: An update and comparison with empirical data for the COVID-19 era. PLOS Comput. Biol. 17, e1009098 (2021).
  7. Prem, K., Cook, A. R. & Jit, M. Projecting social contact matrices in 152 countries using contact surveys and demographic data. PLOS Comput. Biol. 13, e1005697 (2017).
  8. Brooks-Pollock, E. et al. High COVID-19 transmission potential associated with re-opening universities can be mitigated with layered interventions. Nat. Commun. 12, 5017 (2021).
  9. Davies, N. G. et al. Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study. Lancet Public Health (2020) doi:10.1016/S2468-2667(20)30133-X.
  10. Head, J. R. et al. School closures reduced social mixing of children during COVID-19 with implications for transmission risk and school reopening policies. J. R. Soc. Interface 18, 20200970 (2021).
  11. Prem, K. et al. The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study. Lancet Public Health (2020).
  12. Zhang, J. et al. Changes in contact patterns shape the dynamics of the COVID-19 outbreak in China. Science 368, 1481 (2020).
  13. Mossong, J. et al. Social Contacts and Mixing Patterns Relevant to the Spread of Infectious Diseases. PLOS Med. 5, e74 (2008).
  14. Fu, Y., Wang, D.-W. & Chuang, J.-H. Representative Contact Diaries for Modeling the Spread of Infectious Diseases in Taiwan. PLOS ONE 7, e45113 (2012).
  15. Horby, P. et al. Social Contact Patterns in Vietnam and Implications for the Control of Infectious Diseases. PLoS ONE 6, (2011).
  16. Kiti, M. C. et al. Quantifying Age-Related Rates of Social Contact Using Diaries in a Rural Coastal Population of Kenya. PLoS ONE 9, (2014).
  17. Klepac, P. et al. Contacts in context: large-scale setting-specific social mixing matrices from the BBC Pandemic project. http://medrxiv.org/lookup/doi/10.1101/2020.02.16.20023754 (2020) doi:10.1101/2020.02.16.20023754.
  18. Leung, K., Jit, M., Lau, E. H. Y. & Wu, J. T. Social contact patterns relevant to the spread of respiratory infectious diseases in Hong Kong. Sci. Rep. 7, 7974 (2017).
  19. Melegaro, A. et al. Social contact structures and time use patterns in the manicaland province of Zimbabwe. PLoS ONE 12, (2017).
  20. Read, J. M. et al. Social mixing patterns in rural and urban areas of southern China. Proc. R. Soc. B Biol. Sci. 281, 20140268 (2014).
  21. Horby, P. et al. The Epidemiology of Interpandemic and Pandemic Influenza in Vietnam, 2007–2010: The Ha Nam Household Cohort Study I. Am. J. Epidemiol. 175, 1062–1074 (2012).
  22. Coker, R. J., Hunter, B. M., Rudge, J. W., Liverani, M. & Hanvoravongchai, P. Emerging infectious diseases in southeast Asia: regional challenges to control. The Lancet 377, 599–609 (2011).
  23. le Polain de Waroux, O. et al. Characteristics of human encounters and social mixing patterns relevant to infectious diseases spread by close contact: a survey in Southwest Uganda. BMC Infect. Dis. 18, 172 (2018).
  24. Chao, D. L., Halloran, M. E. & Longini, I. M. School opening dates predict pandemic influenza A (H1N1) outbreaks in the United States. J. Infect. Dis. 202, 877–880 (2010).
  25. Riley, S. Large-Scale Spatial-Transmission Models of Infectious Disease. Science 316, 1298–1301 (2007).
  26. Alon, T., Kim, M., Lagakos, D. & VanVuren, M. How should policy responses to the COVID-19 pandemic differ in the developing world? (2020).
  27. Bayham, J. & Fenichel, E. P. Impact of school closures for COVID-19 on the US health-care workforce and net mortality: a modelling study. Lancet Public Health 5, e271–e278 (2020).
  28. Brooks, S. K. et al. The impact of unplanned school closure on children’s social contact: rapid evidence review. Euro Surveill. Bull. Eur. Sur Mal. Transm. Eur. Commun. Dis. Bull. 25, 2000188 (2020).
  29. Melegaro, A., Jit, M., Gay, N., Zagheni, E. & Edmunds, W. J. What types of contacts are important for the spread of infections? Using contact survey data to explore European mixing patterns. Epidemics 3, 143–151 (2011).
  30. Davies, N. G. et al. Age-dependent effects in the transmission and control of COVID-19 epidemics. Nat. Med. 26, 1205–1211 (2020).
  31. van Zandvoort, K. et al. Response strategies for COVID-19 epidemics in African settings: a mathematical modelling study. BMC Med. 18, 324 (2020).
  32. Battaglia, M. P., Hoaglin, D. C. & Frankel, M. R. Practical Considerations in Raking Survey Data. Surv. Pract. 2, 2953 (2009).
  33. Read, J. M., Edmunds, W. J., Riley, S., Lessler, J. & Cummings, D. A. T. Close encounters of the infectious kind: methods to measure social mixing behaviour. Epidemiol. Infect. 140, 2117–2130 (2012).
  34. Mikolajczyk, R., Akmatov, M., Rastin, S. & Kretzschmar, M. Social contacts of school children and the transmission of respiratory-spread pathogens. Epidemiol. Infect. 136, 813–822 (2008).
  35. CSES. Cambodian Socioeconomic Health Survey. Cambodia Socio Economic Survey Tables (nis.gov.kh) (2012).
  36. Hidano, A. et al. Modelling the impact of non-pharmaceutical interventions on COVID-19 transmission and healthcare demands in Cambodia: a scenario analysis. Technical Report for WHO-WPRO. (2020).
  37. Pasek, J. & Pasek, M. J. Package ‘anesrake’. Compr. R Arch. Netw. (2018).
  38. Fu, Y., Wang, D.-W. & Chuang, J.-H. Representative Contact Diaries for Modeling the Spread of Infectious Diseases in Taiwan. PLOS ONE 7, e45113 (2012).
  39. Funk, S. Socialmixr: social mixing matrices for infectious disease modelling. R package version 0.1.6., (2020).
  40. Rao, J. N. K. & Scott, A. J. On Chi-Squared Tests for Multiway Contingency Tables with Cell Proportions Estimated from Survey Data. Ann. Stat. 12, 46–60 (1984).