Study design and participants
In Bangladesh, there are more than 13,000 community clinics that deliver primary health care services across the country including in remote and hard-to-reach areas. We conducted a two-arm, single-blind, cluster randomized trial with parallel assignment in forty community clinics in the rural Kishorganj district of Bangladesh, located approximately 100km from Dhaka city. Each community clinic has three health staff: a Community Health Care Provider (CHCP) who works full time in the clinic and a Health Assistant (HA) and a Family Welfare Assistant (FWA) who work half-time in the clinic and half-time in the community. CHCPs and HAs have Masters’ degrees and most FWAs have completed high school. Community clinic was the unit of randomization to reduce contamination between the groups as the intervention was integrated into clinic services and was implemented by existing clinic staff. We selected two rural subdistricts in Kishorganj with a total of seventy-four community clinics. An independent statistician randomly selected twenty clinics from each subdistrict (n=40 clinics) to participate in this study. No clinics refused to participate.
Inclusion criteria for children were: weight for age (WAZ) < -1.5 SD, singleton birth, no obvious disability, no known chronic disease (e.g. epilepsy), not hospitalised or requiring ongoing monitoring for acute malnutrition and parental consent. We conducted a house-to-house survey around each clinic and all children aged 5-23 months, living within a thirty-minute walking distance from the clinic, were screened for inclusion. We limited the sample to mothers and children living within a 30-mintue walk from the clinic based on prior piloting that demonstrated poor attendance among mothers living farther away (14). Based on the pilot study, we also found that a maximum of twenty-four children could be reached through each clinic. Children were weighed using standard methods and those with weights for age < -1.5 SD of WHO standards (15) and meeting all other inclusion criteria were invited to participate in the study. We recruited up to twenty-four children in each clinic. In clinics with more than twenty-four eligible children, a simple random sample of twenty-four children was selected. Written informed consent of mothers was collected at enrollment. Ethical approval was given by the institutional review board of the International Centre for Diarrhoeal Diseases Research, Bangladesh (icddr,b).
Randomisation
The forty community clinics were stratified by subdistrict and then randomly assigned 1:1 to intervention or control by an independent statistician, using a computer-generated randomisation sequence. All clinics and mother/child dyads were recruited prior to randomisation. Baseline measurements were conducted after randomisation. Data collectors were masked to group allocation at baseline and endline.
Intervention
Mothers and children attending clinics allocated to the intervention group were invited to fortnightly parenting sessions for one year. The parenting sessions were facilitated by the health workers in the clinic: CHCPs conducted 1-2 sessions per week, HAs and FWAs conducted one session per week. There was an average of four mother/child dyads in each group, with the group size constrained by the available space within the clinic. The Group Reach-Up and Learn curriculum was used in the parenting sessions. This curriculum was adapted from the Jamaican Reach-Up home visiting programme (14). The health workers were trained and supervised by the research team. See Box 1 for further details of the intervention. Mothers and children in control clinics were not invited to parenting sessions, but they used the clinic as usual for health care. We recorded attendance at parenting sessions.
Measurements
Outcome measurements included child development, behaviour, and nutritional status and mothers’ parenting knowledge and depressive symptoms, and stimulation in the home. All outcomes were measured at baseline (from Sept-Dec 2015) and after one year of intervention (from Oct-Dec 2016) and have been used previously in Bangladesh (9,11,16,17). Children were tested in the presence of the mother either in a private room at the community clinic or an alternative location in the community.
Primary Outcomes
The primary outcomes were child development, behaviour, and nutritional status. Children’s development was measured using the Bayley Scales of Infant and Toddler Development (18). We used three composite scores: 1) cognition, 2) language (combined score of the expressive and receptive language scales), and 3) motor (combined score of the fine and gross motor scales). Child behaviour was rated during the test using four Wolke’s behaviour rating scales: approach to examiner, emotional tone, cooperativeness, and vocalisations (19). Approach was rated during the first 10 minutes of the test; the remaining three scales were based on the child’s behaviour throughout the test. Behaviours were rated on a 8-point scale with higher scores representing more of the characteristic. Child weight and length/height were measured by the testers after the Bayley test using WHO standard methods (20). The z scores of weight-for-age, weight-for-height, and height-for-age were calculated using WHO anthroplus (15). Children were tested at baseline and endline by one of eight testers. All testers had a Masters’ degree in Psychology or related field. Testers received one month training and they were masked to group allocation.
Secondary Outcomes
The secondary outcomes were mothers’ parenting knowledge, stimulation provided in the home and mothers’ depressive symptoms. Parenting knowledge was measured using a specially designed instrument consisting of 20 questions. Stimulation in the home was measured using an extended version of the Family Care Indicators (FCI) (21). The FCI has been previously validated in Bangladesh (17). The FCI consisted of 24 questions including questions on the availability of play materials and the extent to which the mother and other adults in the home engaged the child in play activities. Maternal depressive symptoms were measured using six questions that are included in the FCI, taken from the Center for Epidemiological Studies Depression Scale (22). All interviews with mothers were interviewer-administered and conducted after child measurements were completed.
Quality Control of Measurements
Before the study assessments began, interobserver reliabilities were measured between each tester and the trainer on 8-16 tests per tester. Inter-observer reliabilities were acceptable for all measures: intraclass correlation coefficients (ICC) >0.98 on Bayley composite scores, range of ICC=0.62-1.00 on behaviour ratings, and ICC>0.95 on anthropometric measures. Interobserver reliabilities were conducted on approximately 10% of all Bayley tests during the study and reliabilities were ICC>0.95 for all Bayley composite scores and ICC=0.67-0.99 for behaviour ratings.
All maternal questionnaires had good internal consistency at baseline (Cronbach’s a mean 0.82, range: 0.68-0.89) and endline (Cronbach’s a mean 0.84, range: 0.79-0.88) (webtable 1). The Bayley Scales scores at baseline and endline were significantly correlated with height-for-age (r=0.18-0.30), weight-for-age (r=0.21-0.28) and with maternal education (r=0.10-0.19) and paternal education (r=0.12-0.24), indicating good discriminant validity (webtable 2).
Statistical analysis
The primary outcomes of the study were child development (3 scores: cognitive, language and motor development), child behaviour (4 scores: approach, emotional tone, cooperativeness, vocalisations) and child nutritional status (3 scores: weight-for-age, weight-for-height, height-for-age). To calculate the sample size, we used a significance level of 0.005 (instead of 0.05) to account for ten primary outcomes and we assumed an intracluster correlation coefficient of 0.05 (5). With an average of 21 mother/child dyads per clinic (378 mother/child dyads), and allowing for a loss of two clinics per group, (giving 18 clinics in each group), we had 80% power to detect an effect of 0.38 SD on the primary outcomes.
All analyses were prespecified. For each outcome, we fitted a multi-level random effects model that accounted for clustering at the clinic level. We adjusted for child age and sex, the relevant baseline score and tester/interviewer. Study group was entered as a binary variable. For child development and behaviour outcomes, as children were tested either in the community clinic or in an alternative location in the community, we also entered place of test and an interaction term of place of test x group as fixed effects. Data completeness was excellent (>98%) for child outcomes. At endline, we had incomplete data for maternal outcomes (91% for parenting knowledge and home stimulation, 90% for maternal depression). We used multiple imputation, assuming data was missing at random, to account for missing data. Baseline sociodemographic variables and baseline scores of all child and maternal outcomes were included in the imputation model. We generated 20 datasets and ran a full multi-level random effects model using the whole dataset and to correct for overfitting, we implemented a bootstrap (200 samples) for each imputed dataset. The final models were obtained by fitting a multi-level model with all the above factors, and estimates were combined using Rubin’s rules (23). To allow for comparability across outcomes, effect sizes were calculated by using an internal standardization of the whole sample at baseline and endline separately. We used intention-to-treat analyses for all outcomes and we controlled for multiple primary outcomes using Holm step-down procedure. All analyses were carried out using Stata version 15. The trial registration number is NCT02208531.