Data sources
We pooled individual-level data from two double-blind, placebo-controlled trials of mepolizumab for severe asthma: the Dose Ranging Efficacy And safety with Mepolizumab (DREAM, 2009/11–2011/12), and the Mepolizumab as Adjunctive Therapy in Patients with Severe Asthma (MENSA, 2012/10–2014/01) studies(25, 26).
During the 52-week DREAM study, 621 patients were randomly assigned (in a 1:1:1:1 ratio) to receive one of the three doses of intravenous mepolizumab (75mg, 250mg, 750mg) or matched placebo every 4 weeks. During the 38-week MENSA study, 576 patients were randomized (in a 1:1:1 ratio) to receive placebo, a 75mg intravenous dose, or a 100mg subcutaneous dose of mepolizumab, every 4 weeks. Both studies had similar inclusion criteria related to clinical diagnosis, eosinophilic asthma, and lung function, with details provided in Appendix S1. In addition, both studies required patients to have a confirmed history of 2 or more asthma exacerbations requiring treatment with oral corticosteroids (OCS) in the preceding 12 months.
Study design and sample
For consistency, this study only included patients who received placebo or intravenous 75mg doses of mepolizumab (the dose shared between the two trials). Patients were followed from the date of study enrolment until the end of study period or loss to follow-up. There were no missing data for the current analysis.
Endpoint
The primary endpoint was the absolute reduction in the rate of severe exacerbations with mepolizumab 75mg compared to placebo in the first 365 days of follow-up. In line with the American Thoracic Society (ATS) / European Respiratory Society (ERS) Task Force definition(27), a severe exacerbation was defined as a worsening of asthma which required OCS for at least 3 days, hospitalization, or emergency department (ED) visit requiring systemic corticosteroids. The secondary endpoint was reduction in the 5-item Asthma Control Questionnaire (ACQ5) score with mepolizumab 75mg compared to placebo over the first 365 days of follow-up(28). The ACQ5 score integrated five questions on asthma symptom control into a 0 (excellent control) to 6 (extremely poor control) scale.
Covariates of interest
Consistent with a previous analysis on the heterogeneity in the burden of exacerbations in eosinophilic asthma(29), this analysis focused on 15 commonly recorded patient characteristics, including age, sex (male or female), ethnicity (Hispanic or other), body mass index (BMI), history of smoking (yes or no), duration of asthma, pre-bronchodilator value of Forced Expiratory Volume in 1 second (FEV1), FEV1 bronchodilator responsiveness (in %), the ratio of post-bronchodilator FEV1/Forced Vital Capacity (FVC), a history of nasal polyps, BEC, serum IgE level, long-term OCS use, ACQ5 score, and number of previous severe exacerbations, all measured at baseline or the 12 months preceding study enrolment. Of note, fractional exhaled nitric oxide (FeNO) was not included because it was measured only in DREAM but not in MENSA.
Statistical analysis
All statistical analyses were performed in R (version 3.6.0, 2020). P-values were considered significant at two-tailed 0.05 level. Descriptive statistics were compared using Pearson Chi-square test for categorical variables and Kruskal–Wallis test for continuous variables.
To construct the treatment benefit prediction model for the primary endpoint (exacerbation rate reduction), we fitted a generalized linear model (GLM) with Poisson distribution and logarithmic link function, with the number of severe exacerbations during follow-up as the dependent variable and the logarithm of total follow-up days as the offset. The covariates and their first-order interaction with treatment were the independent variables. To prevent overfitting, we applied Lasso regression to estimate model parameters, which shrank the coefficients of non-essential covariates to zero(30). We used 10-fold cross validation to identify the optimal shrinkage factor that minimized the mean squared error of off-sample predictions. For the secondary endpoint (reduction in ACQ5 score), we fitted a Lasso GLM model with normal distribution and logarithmic link function, with the last follow-up ACQ5 score as the dependent variable. Variable specification, selection, and model fitting steps were the same as for the primary endpoint, except the total days of follow-up was included as an independent variable (assuming a log-linear effect for time since randomization on the ACQ5 score).
We predicted treatment response for each individual, based on the fitted GLM models, as the difference in the predicted rate of exacerbation or the ACQ5 score between treatment and no treatment(31).
To evaluate predictive performance for both endpoints, we assessed model calibration based on calibration plots and the root mean squared error, with details provided in Appendix S2. Because this study aimed to quantify the predictive capacity of patient characteristics, rather than deriving a final prediction tool, external validation was not assessed.
The Lasso-selected predictors were ranked based on their importance in predicting treatment response. Importance was quantified as the absolute value of magnitudes of coefficients in a linear regression model with predicted treatment response as the dependent variable and standardized predictors as independent variables.
Assessment of the heterogeneity of treatment benefit
A successful individualized prediction algorithm should be able to identify the subgroup of individuals who benefit the most from treatment, or those who benefit the least(24). As such, measures of concentration of a variable within a population, in particular the Lorenz curve and the closely related Gini index, can be relevant to this context(32). The more concave the Lorenz curve, the higher the capacity of covariates in concentrating treatment benefit. The Gini index is a summary statistic of inequality, which is 0 with perfect equality and with higher values indicating higher levels of inequality(32).
If salient patient characteristics are capable of distinguishing individuals who will benefit the most from treatment, then prioritizing treatment for subgroups of patients with higher predicted benefit should be significantly more efficient than treating patients indifferently (thus informing treatment rule-in decisions). Similarly, a powerful treatment benefit prediction algorithm should be able to identify patients who will not benefit from therapy (thus informing treatment rule-out decisions). To explore the capacity of covariates to inform treatment rule-in or rule-out, we assessed how the average treatment benefit would change if mepolizumab were given to quintiles of patients with higher (or lower) predicted benefits. The observed treatment benefit for the primary endpoint was defined as the average difference in the annualized rates of severe exacerbation between the placebo and treatment arms; the observed treatment benefit for the secondary endpoint was defined as the average reduction in ACQ5 in the last follow-up visit between the treatment and placebo arms.