Part 1: Estimating the probability of Covid-19 associated hospitalisation in the first wave from national data
Within a South-London cohort admitted to hospital during the first wave of Covid-19 (Fig. 2), examination of the relationship between ethnicity and Covid-19 associated mortality suggests a 42% (13%, 61%) reduction in risk of death in those of Black ethnicity (p = 0.008). This finding greatly differs from other studies examining a similar timeframe using less restricted cohorts suggesting likely bias within this analysis17,20,26. Accounting for covariates including medical history, age, sex and IMD does not account for the unexpected finding (hazard ratio (HR) = 0.63 (0.39, 1.00), p = 0.049).
Instead, use of IPW derived from external sources allowed the comparison of Black vs. White ethnicity to be revaluated resulting in a non-significant increase in risk of death in those who were Black (HR = 1.06 (0.56, 2.00), p = 0.851). This hazard ratio now follows a similar trend to that seen in other studies17,20,26, albeit without statistical significance suggesting the use of IPW has corrected for potential sources of selection bias in this cohort.
Like the relationship between Black ethnicity and Covid-19 associated death, relationships of other ethnicities and survival were modified by the use of IPW to adjust for collider bias.
The effect of Asian ethnicity compared to White ethnicity on survival demonstrated some effect of confounding when comparing the unadjusted and adjusted models (green circle and red square, Fig. 2) with the estimated effects moving towards a more extreme hazard ratio after accounting for covariates (adjusted model: HR = 1.73 (0.99, 3.01), p = 0.052 vs. unadjusted model: HR = 1.36 (0.85, 2.15), p = 0.197). The use of IPW to account for selection biases, however, strengthens the estimated effect of Asian ethnicity (vs. White) on survival even further (HR = 2.43 (1.05–5.62), p = 0.038) allowing the findings to match the existing literature more closely13,17,20,26,27.
The effect of Mixed/Other and Unknown ethnicities demonstrated less extreme changes in hazard ratios after adjusting for covariates followed by IPW to correct for collider bias (Fig. 2). However, this also matches existing studies which focus on an effect of Black and Asian ethnicity on Covid-19 outcomes in the first UK wave of the pandemic12,17,20,26.
Part 2: Sensitivity analysis examining the effect of changes in estimated probabilities to match the local population
Applying these adjusted IPWs as part of the weighted survival analysis to examine the effect of misspecification in the relationship between Covid-19 associated hospitalisation and ethnicity/Covid-19 associated mortality found that adjustments to \({\widehat{\gamma }}_{0}\) (Fig. 3A), \({\widehat{\gamma }}_{1}\) (Fig. 3B), and \({\widehat{{\lambda }}}_{{a}}\) (Fig. 3C) had minimal effect on the estimates obtained from the analysis using IPW demonstrated in Fig. 2.
Reducing the probability of hospitalisation in those who died specifically in Black, Asian and Mixed/Other ethnicities (\({\widehat{{\lambda }}}_{{b}}\)) had a more dramatic effect on the estimates for the effect of each ethnic group on mortality (Fig. 3D). The estimated hazard ratios for all three ethnicities increased and there was more uncertainty about these estimates. The extent of the heightened hazard and the increased uncertainty was systematically enhanced as the degree of misspecification increased from 50–200%. Adjustments made to \({\widehat{{\lambda }}}_{{b}}\) had little effect on the estimated effect of Unknown ethnicity on death (Fig. 3D) – as expected because the probability of hospitalisation in those who died in this group was not adjusted.
Interestingly, the adjustment for misspecification in \({\widehat{{\lambda }}}_{{b}}\) allowed the estimated effect of Black ethnicity on mortality to reflect the situation reported in other studies17,20,27 and the media during wave 1 of Covid-19 –an increased risk of death compared to those who are White. This differs from the results for the cohort from before applying IPWs to account for selection bias (Fig. 2) and highlights that in hospitalised cohorts, such as this one, there is a clear risk of inducing bias in the form of collider bias and other selection biases.
Part 3: Addition of covariates into the model equation
The addition of data from the second wave of Covid-19 altered the results obtained from the initial analysis (Fig. 2) demonstrating how IPW in addition to adjusting for covariates improves estimation. All estimates now have more precision reflecting the larger cohort analysed (Fig. 4). Most point estimates have shifted to indicate a smaller/neutral effect of ethnicity on mortality with some exceptions. All point estimates fell within the 95% confidence intervals estimated from the wave 1 only cohort (Fig. 2).
This analysis continued to suggest a reduced risk of death in those of Black ethnicity (compared to White) in the unadjusted analysis (HR = 0.70 (0.53, 0.92), p = 0.012). This was corrected by adjustment for covariates (HR = 0.91 (0.65, 1.27), p = 0.575) and tended towards a non-significant increase in mortality (HR = 1.29 (0.87, 1.93), p = 0.211) after the use of IPW to account for selection bias.
Asian ethnicity had a clear increased risk of mortality after adjustment for covariates (HR = 1.94 (1.28, 2.93), p = 0.002). Use of IPW did little to alter this finding demonstrating continued increased risk of mortality in those of Asian ethnicity (HR = 2.06 (1.15, 3.67), p = 0.014) as shown by other studies13,20,27.
Inclusion of wave 2 in this analysis suggested a reduced risk of mortality in those with Mixed/Other ethnicity that was less evident in the initial analysis, although this is not significant after adjusting for covariates and using IPW to correct for collider bias (HR = 0.79 (0.39, 1.64), p = 0.533). Meanwhile inclusion of wave 2 in this analysis suggested a lack of difference in mortality between those with Unknown ethnicity and those with known White ethnicity (unadjusted: HR = 0.82 (0.61, 1.11), p = 0.198; adjusted for covariates and IPW: HR = 0.96 (0.64, 1.44), p = 0.838) matching results seen in other studies20.
Sensitivity analysis of the extended analysis
The sensitivity analysis examining misspecification in the model parameters for the logistic model describing probability of hospitalisation while including wave as a covariate (Eq. 2) showed a similar pattern as the sensitivity analysis of the initial model.
Adjustments to \({\widehat{\gamma }}_{0}\) (Fig. 5A), \({\widehat{\gamma }}_{1}\) (Fig. 5B), \({\widehat{\gamma }}_{2}\) (Fig. 5E), and \({\widehat{\gamma }}_{3}\) (Fig. 5F) had minimal effect on the obtained estimates for the effect of ethnicity on mortality. Adjustments to \({\widehat{{\lambda }}}_{{a}}\) (Fig. 5C), \({\widehat{{\lambda }}}_{{c}}\) (Fig. 5G), and \({\widehat{{\lambda }}}_{{d}}\) (Fig. 5H) had small but notable effects on these estimates. Accounting for a large (200%) misspecification of \({\widehat{{\lambda }}}_{{a}}\) (related to the probability of hospitalisation in surviving individuals of minority ethnicities) was sufficient to indicate a significant increase in the risk of mortality in Black individuals compared to White, while adjustments to \({\widehat{{\lambda }}}_{{c}}\) and \({\widehat{{\lambda }}}_{{d}}\) (which related to ethnicity-specific effects on the risk of hospitalisation in wave 2) cause a reduction in the estimated hazard ratio comparing Asian and White ethnicity.
As before the only model parameter which had a substantial effect on the obtained results once misspecification was applied is \({\widehat{{\lambda }}}_{{b}}\) (Fig. 5D) which matches the pattern described above (Fig. 3D). Namely reducing the probability of hospitalisation in those who died specifically in Black, Asian and Mixed/Other ethnicities resulted in increases in the estimated hazard ratios for the relationship between mortality and these ethnic groups (compared to White ethnicity).