As with many large health surveys across Europe, the PIENTER studies face decreasing survey response rates through time (3–5, 22). With an all-time low response to PIENTER3, concerns surrounding the influence of non-response biases on future estimates are at the forefront. However, low response rates do not necessarily imply high levels of non-response bias, and the overall influence of non-response bias on survey-derived estimates varies by research question (13, 23). Therefore, documenting the differences between participants and non-participants is crucial.
Generalisability
We found that the age and gender structure of the P3 sample did not closely mirror that of the Dutch population, but this was to be expected given the study design. Whilst post-hoc weighting for variables such as age and gender can easily be applied to estimates, adjusting for other factors such as geographical location, urbanisation level, educational level, and health status could prove more difficult. There may well be non-response biases within the weighting classes that are under-represented to begin with, due to the influence of topic saliency, among other factors (11, 14, 23). Consequently, even those in our sample classified as having lower education and poorer health, for example, may not represent these subgroups well at a population level.
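The post-hoc weighting referred to above can be illustrated with a minimal post-stratification sketch. The cell counts below are invented for illustration (they are not the P3 or Dutch population figures): each respondent in an age-gender cell receives the ratio of that cell's population share to its sample share, so under-represented cells are up-weighted.

```python
# Minimal post-stratification sketch with hypothetical counts
# (not the actual P3 sample or Dutch population figures).
population = {("18-34", "M"): 1.8e6, ("18-34", "F"): 1.7e6,
              ("35-64", "M"): 3.5e6, ("35-64", "F"): 3.6e6}
sample = {("18-34", "M"): 120, ("18-34", "F"): 210,
          ("35-64", "M"): 300, ("35-64", "F"): 370}

pop_total = sum(population.values())
samp_total = sum(sample.values())

# Weight = (population share of the cell) / (sample share of the cell).
weights = {cell: (population[cell] / pop_total) / (sample[cell] / samp_total)
           for cell in sample}

for cell, w in sorted(weights.items()):
    print(cell, round(w, 2))
```

Note that a weight above 1 simply up-weights every respondent in the cell equally; it cannot correct for non-response bias operating within the cell, which is the limitation discussed above.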
Survey Response
The response rates seen in P3 are in line with the age- and gender-stratified response behaviours seen in previous PIENTER studies, and in other large national health surveys (3–5, 22, 24). Gender imbalances in health survey response are common, and are posited to be mediated by gender-related values interacting with decision making (25, 26). Despite some research indicating that men may be more likely to respond to a survey when offered higher incentives, the larger remuneration offered in P3 has not obviously influenced the gender distribution of the sample (5, 26, 27). Overall, efforts to increase the numbers of men in the working age ranges in P3 seem to have been largely unsuccessful (5).
Large response differences between the genders in the working age range are commonly seen in other health surveys. In P3 this could largely reflect a perceived burden on time, as this was the most cited reason for non-participation. Further, these differences could be amplified in the Netherlands. In 2017, 75% of women aged 20–64 were reported to be working part-time (< 28 hours a week), compared to 22% of men (28). This is more than double the EU28 average for women (31.4%) in this age category (28).
In the non-western migrant (NWM) oversample, the overall response was much lower but varied similarly by age and gender. However, the comparatively high response of Dutch-speaking migrants from Suriname, the Antilles and Aruba (SAN) could indicate a language penetration issue for the survey. The initial invitation and information leaflet were sent in Dutch. There was a single sentence on the second page of the invitation in English, Turkish and Arabic indicating that the letter and information were available in other languages online or on request. These additional steps to access the survey may well reduce individual likelihood of engagement (29). However, as response was similar between SAN and Other NWMs (non-Dutch-speaking), the lower response in those with Turkish or Moroccan backgrounds may reflect additional barriers beyond language. This could relate to variable cultural values surrounding health, research and community engagement, or awareness of and/or trust in the RIVM specifically (30, 31).
Comparing Response Types
Although random forests were unable to accurately distinguish ANRs from FPs, this does not necessarily indicate a lack of non-response bias. A large meta-analysis of 539 studies demonstrated that prevalence estimates from participants and non-participants often showed large differences that were not strikingly evident when comparing the groups' demographic characteristics (11).
When predicting NRQs from FPs, self-reported health was the strongest predictor, even when coded missingness was excluded in a form of complete-case analysis. FPs most frequently reported very good health, whilst the majority of NRQs, after excluding missing values, reported good health, but not the higher level seen in FPs. Combined with the large difference in distribution between these health categories in the available data, the high proportion of missing values seen in the NRQs could indicate an unwillingness to divulge poor health status, and thus the presence of healthy responder bias, a well-documented phenomenon in voluntary-participation health studies (32).
Using the “Continuum of Resistance” theory, which stipulates that non-responders are furthest from full responders on a continuum ranging from “will never respond” to “will always respond”, we may assume that NRQs act as a reasonable proxy for ANRs. Extending this assumption, we may expect that ANRs, on average, report poorer health status. Considerable differences in health status between responders and non-responders to health surveys have been documented previously (24, 33, 34). As non-response adjustments for demographic factors alone may not sufficiently reduce estimate biases, this could have considerable impacts on health-related and prevalence estimates from the P3 sample, depending on the topic (33).
In differentiating FPs from QOs, the most important predictors related to geographical location, urbanisation and age. This probably reflects perceived available time and survey mode preference, as QOs were younger and contained both a larger proportion of men and of those living in areas of very high urbanisation, established predictors of non-response (35). A study of survey mode preference in the Netherlands found that those in younger age classes preferred app-based approaches, and that men were more responsive to face-to-face and registration-linkage survey methods (36).
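The variable-importance approach used here can be sketched as follows. This is an illustrative toy, not the P3 analysis: the synthetic data encode an assumed pattern (younger, urban men being more likely QOs) purely to show how a random forest's impurity-based importances surface such predictors.

```python
# Illustrative sketch of random-forest variable importance on synthetic
# data; feature names and effect sizes are invented, not the P3 data.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 2000
age = rng.integers(18, 80, n)
urbanisation = rng.integers(1, 6, n)   # 1 = very high ... 5 = rural
gender = rng.integers(0, 2, n)         # 0 = woman, 1 = man

# Assumed signal: younger, more urban men are more likely to be QOs.
logit = -0.04 * (age - 45) - 0.5 * (urbanisation - 3) + 0.4 * gender
p_qo = 1 / (1 + np.exp(-logit))
y = rng.random(n) < p_qo               # True = QO, False = FP

X = np.column_stack([age, urbanisation, gender])
rf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Impurity-based importances sum to 1 across the features.
for name, imp in zip(["age", "urbanisation", "gender"], rf.feature_importances_):
    print(f"{name}: {imp:.2f}")
```

One caveat worth noting when reading such plots: impurity-based importances tend to favour continuous variables (such as age) over binary ones (such as gender), which is a property of the method rather than of the data.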
Overall Impact
The primary aim of all three PIENTER studies is to assess the population's seroprevalence of infectious diseases, and levels of protection against vaccine-preventable diseases (VPDs). For NIP vaccines in the Netherlands, uptake is almost universally high (37). It is therefore unlikely that any differences between participants and non-participants would have a large impact on estimates of the seroprevalence of VPDs. However, this may not be the case when vaccine uptake or disease exposure is less universal, as estimates may become biased where coverage or exposure varies by under-represented subgroup. The uptake of the HPV vaccine, for example, varies substantially by migration background and socioeconomic status (38). The vaccine was rolled out in 2009 and subsequently included in the NIP during 2010, with uptake reaching a maximum of 63% in 2021 (37, 38).
Participation Trends in PIENTER Through the Decades
PIENTER participant characteristics were not highlighted by random forests as important in differentiating the studies from each other. In fact, the strongest predictors of study origin were age and NIP participation: a simple proxy for the cohort effect, as more of the population becomes eligible for NIP participation, at a younger age, through the years. Based on this we could posit that PIENTER participants are largely similar “types” of people, and thus that estimates from the three studies can be compared across time. Combined with the decreasing response rates seen in the PIENTER studies, this may indicate that it is the interaction between participant characteristics, the social and physical environment, and the survey itself that produces a survey participant, and that this interaction has weakened through time (35).
Although RF was not able to distinguish participants by PIENTER year, we did capture evidence of falling confidence in the NIP among full participants. However, the variable importance plots showed that “opinions on vaccination have changed” was of lower but similar importance to “educational level”. It is possible that this apparent falling confidence in the NIP is partly a product of the over-representation of the highly educated in the P3 sample, as high educational levels have been correlated with vaccine hesitancy in Dutch populations in both earlier and recent studies (39, 40).
Limitations
As with all survey research, we faced limitations regarding missing data and data quality. The non-response questionnaire, conducted as a telephone follow-up, yielded a large proportion of item-missing data. This should be considered alongside our interpretations.
Secondly, our dataset was unbalanced with regard to the response-type outcome, particularly so in the QO class, as indicated by our skewed confusion matrices with low pmcs. To check that our conclusions were not distorted by this, we ran analyses on random subsets of the data containing more balanced proportions of the two possible outcomes, and found that the rankings of variable importance remained stable.
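The balanced-subset check can be sketched as follows: draw equal numbers of records per outcome class without replacement, refit on the subset, and compare variable-importance rankings with the full-data fit. The class labels and counts below are illustrative, not the real P3 figures.

```python
# Sketch of the balance check: draw a subset with equal class counts,
# then refit on it. Labels and counts here are hypothetical.
import random

random.seed(1)
records = [{"type": "FP"}] * 900 + [{"type": "QO"}] * 100  # unbalanced

def balanced_subset(records, k):
    """Draw k records per class, without replacement."""
    by_class = {}
    for r in records:
        by_class.setdefault(r["type"], []).append(r)
    subset = []
    for rows in by_class.values():
        subset.extend(random.sample(rows, k))
    return subset

sub = balanced_subset(records, 100)
counts = {c: sum(r["type"] == c for r in sub) for c in ("FP", "QO")}
print(counts)  # each class equally represented in the subset
```

Repeating the draw several times and checking that the importance rankings agree across repetitions guards against conclusions driven by any single subsample.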
Future considerations
Adjustments for non-response can only go so far, and it has been shown that a balanced survey response yields less biased estimates than post-hoc adjustments alone (23). After all, post-hoc weights are frequently based on limited available data, cannot improve overall precision, and do not address non-response biases within weighting classes.
As survey response is likely to continue to decline, future PIENTER studies may consider alternative methods, such as targeted mixed-method survey designs, to improve overall response (41–43). Further, making questionnaires available in different formats and in multiple languages would reduce barriers to participation and address survey mode preferences across different subgroups (9, 30).