Study population
NHANES is conducted biennially by the Centers for Disease Control and Prevention’s National Center for Health Statistics (NCHS) since 1999, which is a cross-sectional,
complex multistage, nationally representative survey that assesses population health and nutritional status that is representative of the non-institutionalized U.S. civilian population. The questionnaire data, physical examination data, and biospecimens from participants were collected. The NCHS Research Ethics Review Board reviewed and
approved the study, and informed written consent was obtained from all participants before they took part in the study. This study included children aged 3–12 years who had completed all urinary mVOCs measurements. Then we excluded participants with missing covariate data. Finally, a total of 1559 children were included in the analysis (Fig. 1).
Figure 1. Flowchart for participants selection.
NHANES, the National Health and Examination Survey; mVOCs, metabolite of volatile organic compounds.
Laboratory methods
In this study, five urinary mVOCs (2MHA [2-Methylhippuric acid], SBMA [N-Acetyl-S-(benzyl)-L-cysteine], CEMA [N-Acetyl-S-(2-carboxyethyl)-L-cysteine], 3PHMA [N-Acetyl-S-(3-hydroxypropyl)-L-cysteine], HPMMA [N-Acetyl-S-(3-hydroxypropyl-1-methyl)-L-cysteine]) were quantified in children aged 3–12 years participating in the NHANES 2011–2018 survey cycle (n = 1559). We list the relative parent compounds of the VOCs detected in this study and their official abbreviations (Table S1). Five mVOCs concentrations were quantified in urine samples using ultra-performance liquid chromatography coupled with electrospray tandem mass spectrometry (UPLC-ESI/MSMS)[12]. For mVOCs with analytical results below the lower limit of detection (LLOD), an imputed fill value (LLOD/√2) was replaced. Then, we excluded urinary mVOCs that had a low detection rate (≤ 50%) considering the representativeness of the data and the robustness of the results. Details of the analytical method and quality assurance/quality control procedures used are available on the NHANES website[15].
Outcome and covariables
Asthma diagnosis was determined by the medical conditions data file (MCQ) (question: “has a doctor or other health professional ever told you that study participant has asthma?”). We considered adjusting for variables associated with childhood urinary mVOCs concentrations and asthma diagnosis or believed to confound this relationship. Age (in years), gender (male or female), race/ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic (includes Mexican American and Other Hispanic), and Other (non-Hispanic Asian, and other race including multiracial), and income (ratio of family income to poverty (0–5)) were extracted from the demographic variables and sample weights data file (DEMO). Cutoffs for poverty were defined by the NHANES survey variable “INDFMPIR,” calculated as a ratio of individual/family income to poverty guidelines determined by the Department of Health and Human Services with a range from 0–5. Values from < 1.36 were considered “poor”, 1.36–1.86 were considered “nearly poor”, and values of ≥ 1.86 were considered “not poor” for classification.
Other variables considered included glomerular filtration rate, diet (fish consumption), history of wheezing, history of eczema, history of respiratory infections, history of breastfeeding, parental marital status, and parental medical history, which were not available in all study participants in this age group.
Statistical analyses
All data analyses were performed using Stata 17.0 and R version 4.2.1. We used published subsample weights (WTSA2YR) designed for a one-third subset of the entire survey for all effects and variance estimates to produce estimates representative of the U.S. population, as recommended by NHANES. Logarithmic transformation was performed for the 5 urine mVOCs to ensure a normal distribution, and then they were standardized for follow-up analyses. Then, we calculated geometric means (GM) for all five mVOCs and by age group (3–6 and 7–12 years), gender, race/ethnicity (Hispanic, non-Hispanic 100, non-Hispanic black, and other), and income (poor, nearly poor, not poor). We also calculated weighted Pearson correlation coefficients for all urinary mVOCs log10 concentrations.
In the unadjusted model, we estimated the odds ratio of ln-transformed urinary mVOCs concentrations per standard deviation (SD) increase for asthma by logistic regression. In the adjusted model, taking into account urinary dilution, we adjusted for child age (continuous), gender (binary), race/ethnicity (categorical), poverty to income ratio (continuous), and ln-transformed serum cotinine concentration (continuous) and estimated the odds ratio of ln-transformed creatinine-corrected mVOCs concentrations in current asthma per standard deviation (SD) increase in urine. We used Wald tests to detect differences in model coefficients for the mVOCs variable between different levels of modifiers. We also performed stratified regression analyses by age group, gender, and race/ethnicity. Given that mVOCs were measured in children with asthma after asthma diagnosis, we performed additional sensitivity analyses. We tested whether the difference between current age and age at diagnosis of asthma predicts urinary mVOCs concentrations in patients with asthma by Pearson correlation. Restricted cubic spline (RCS) was further used to identify potential nonlinearities. Four knots were set at the 5th, 35th, 65th, and 95th percentiles in the models, and the reference value was set to the median of the ln-transformed mVOCs concentration.