Study sample
The study sample was drawn from the “Epidémiologie et Conditions de vie sous le COVID-19 » (EpiCoV) cohort, a national random population-based survey which followed 134,391 participants selected from the national administrative and tax register through four rounds of data collection from Spring 2020 to Autumn 2022. Details of the sampling methods are available in [26]. Data collection was achieved through computer-assisted web interviews (CAWI) or computer-assisted telephone interviews (CATI) and covered a wide range of health characteristics, including mental health, healthcare and socio-demographic characteristics.
For this study, the sample was a subgroup of the original cohort composed of participants with at least one child aged 3 to 17 years at the time of the third data-collection round (N = 21,406 in Spring 2021), and who had filled in the SDQ questionnaire for one randomly selected child (along with the questionnaires on their own mental health). Since it was a complete data study (Supplementary Fig. 1), our final sample consisted of 20,765 participants. A subset of this sample (N = 14,339) participated in the fourth round (Autumn 2022), which made it possible to investigate the test-retest reliability of the questionnaire.
Measures
Strengths and Difficulties Questionnaire: The parental version was used to evaluate the mental health of the children and adolescents. This questionnaire consists of 25 items (Supplementary Table 1), to which the respondent answers on a three-point scale: "not true" (0), "somewhat true" (1) or "very true" (2). The factor structure of the SDQ has been the subject of several studies with contrasted results. Three dominant models have been identified [5, 27](Fig. 2): the original 5-factor model; the 3-factor model, which combines items from the ES and PRP dimensions to form an "Internalizing Problems" factor and items from the H/I and CD dimensions to form an "Externalizing Problems" factor; and the 2nd order model, which places the internalizing and externalizing factors on a second level. Factor scores can be calculated by summing the scores of the associated items. Confirmatory Factor Analysis (CFA) studies have shown contrasting factorial validity for the 3-factor models [18, 19, 22, 27–31] and the 2nd order model [18, 19, 22, 27–29]. The 5-factor structure generally presents the best fit [19, 20, 22, 27, 29, 31, 32], but there are also several large studies that have failed to obtain fit indices above acceptable thresholds [28, 30, 33]. Regarding the French version more specifically, it should be noted that the parent version has never undergone a factorial evaluation in France.
Parental Mental Health: The PHQ-9 scale [34] was used to assess various symptoms associated with a characterized depressive episode (loss of pleasure, sadness, sleep or appetite disturbances, low self-esteem, etc.). The total score (items sum) is used to measure the intensity of depressive symptoms: no depressive symptoms (0–4), mild depressive symptoms (5–9) and moderate/severe depressive symptoms (10–27). Similarly, the GAD-7 scale [35] was used to screen for anxiety symptoms, as it assesses various aspects of anxiety (nervousness, worry, irritability, difficulty relaxing, etc.). Participants are categorized as having no anxiety symptoms (0–4), mild anxiety symptoms (5–9) or moderate/severe anxiety symptoms (≥ 10) on the basis of the total score (items sum). Both questionnaires use a Likert frequency scale ranging from 0 ("not at all”) to 3 ("nearly every day”) and assess symptoms over the past two weeks. The factorial structure and internal consistency of the PHQ-9 and GAD-7 were checked in our sample (Supplementary Table 2). Psychiatric history was assessed using the following question: "During your life, has a doctor ever told you that you have a psychiatric, psychological or addiction disorder? Yes or No”.
Some of the socio-demographic characteristics (child's age, child's gender, parental gender, parental educational level) collected in the cohort were used to describe the survey sample and to conduct the invariance analysis. In this analysis, the characteristics were integrated in the form of categorical variables (with the categories presented in Table 1)
Analytical approach
The factor structure of the French version of the parental SDQ has classically been studied using CFA, and later using a more recent latent variable approach, Exploratory Structural Equation Modelling (ESEM), as several authors have argued that this new approach could be better suited to the assessment of psychological constructs [36, 37]. Indeed, ESEM [37–39] integrates the possibility of cross-loading into a CFA framework, in the same way as in exploratory factor analysis (EFA). Similar to EFA, ESEM offers several rotation options. The target rotation was used, as it enables an a-priori specification matrix to be constructed, which maximizes the loading of the targeted items for each factor, and minimizes all others. Marsh et al. [37] state that with this rotation, ESEM can provide a robust and flexible confirmatory approach to investigate the structural validity of a questionnaire. Some studies [21, 24] have already applied ESEM to the SDQ.
Because the SDQ items are ordinal variables, the models were estimated with the Weighted Least Squares with adjusted Mean and Variance estimator (WLSMV), using a polychoric correlation matrix with probit regression. The three different SDQ factor structures described above (Fig. 2) were fitted using CFA and ESEM. To note, for ESEM, the 2nd order model required an alternative approach, namely ESEM-within-CFA (EwC), which involves reintegrating the estimated coefficients from an ESEM as starting values within a CFA. The following fit thresholds were used: fit was considered acceptable if the Comparative Fit Index (CFI) and the Tucker-Lewis Index (TLI) were ≥ 0.90, the Root Mean Square Error of Approximation (RMSEA) and the Standardized Root Mean square Residual (SRMR) were ≤ 0.080, and good if CFI and TLI were ≥ 0.95, RMSEA and SRMR were ≤ 0.060 [40]. Factor loadings were also examined to ensure that each manifest variable was adequately explained by its expected latent factor. Target factor loadings were expected to have values greater than 0.50 with CFA [41] or 0.35 with ESEM [39].
Once the best-performing model was selected, invariance measurement was studied across parental mental health (depressive and anxiety symptoms, psychiatric history) and socio-demographic characteristics (child's age, child's and parental gender, parental educational level). In line with the literature [16], three levels of invariance were tested sequentially: configural invariance (a multi-group factor model is examined without constraints), metric invariance (factor loadings are constrained to be equal across groups), and scalar invariance (additional equality constraints on the item response category thresholds across groups). The invariance is established when no meaningful variation of the fit is observed when constraints are imposed at each level. Lack of invariance was thus indicated when CFI or TLI decreased by more than 0.010 or RMSEA increased by more than 0.015 or SRMR increased by 0.030 (for metric invariance) or 0.010 (for scalar invariance) [42]. A stratified invariance analyses across the three mental health characteristics was also performed separately according to the gender of the parent who filled in the SDQ.
Finally, internal consistency reliability was evaluated using McDonald's ω, computed using the factor loadings and item residuals of the models. To note, there is no specific method to incorporate cross-loadings in the calculation of ω for ESEM models [39]; the current recommendation is to calculate ω by considering only the parameters of the "target" items. For test-retest reliability, longitudinal ESEM and CFA models were run to investigate the correlation of a latent dimension between measurements in round 3 and round 4. The two types of reliability are both considered acceptable if they are greater than 0.70 [43, 44].
Data cleaning and descriptive analyses were performed using R Software (version 4.2.3), while all psychometric analyses were conducted using Mplus (version 8.8).