Study Population
Data were from Waves 1-4 (2013-2018) of the Population Assessment of Tobacco and Health (PATH) Study for young adults aged 18-24 years old and followed up annually. The PATH Study is a nationally representative and longitudinal cohort of US youth and adults to understand factors associated with changes in tobacco use and related health outcomes [23]. Self-administered audio computer-assisted self-interview questionnaires were used to collect tobacco-use information and associated health behaviors among adults. The retention rates for Wave 2, Wave 3 and Wave 4 among Wave 1 adult respondents were 82.8%, 78.0% and 72.9%, respectively. The detailed procedures of the PATH Study can be found elsewhere [21,23].
Measures
Outcome
Relative harm perception was assessed prospectively across Waves 1-4. Participants were first asked, "Have you ever seen or heard of an e-cigarette before this study?" at Wave 1. Only those who responded "Yes" were followed up with the question, "Is using e-cigarettes less harmful, about the same, or more harmful than smoking cigarettes?" Literature shows that perceiving lower levels of e-cigarette harm was one of the main drivers for e-cigarette initiation [24], so we dichotomized the responses as less harmful vs. same or more harmful (reference group). This method was also guided by the previous research [21, 25, 26], which provides a simplified model for the analysis and interpretation of results.
Time-invariant and time-variant predictors
The selection of variables was guided by the Host, Agent, Vector, Environment (HAVE) conceptual model of the PATH study [23] as well as a review of previous literature [16, 17-21, 26, 27]. According to the HAVE model, the interactions among these four domains impact behavioral and health outcomes. In our study, host factors refer to individuals who are tobacco users or at risk of being tobacco users (demographics, tobacco-related attitudes, current cigarette/e-cigarette use, ever other tobacco/drug/alcohol use, and mental/physical health). Agent factors are the tobacco products' design, formulation, packaging, and promotions (media use and health warning exposure). Environmental factors include current policies, social, cultural, and geographic influences (smoke-free home rules and environmental/peer influence). The vector facilitates interaction between host, agent, and environments. The variables were further categorized into time-invariant and time-variant predictors.
Time-invariant variables included sex and race/ethnicity. Race/ethnicity was coded as non-Hispanic White, non-Hispanic Black, Hispanic, and non-Hispanic Other.
The rest of the predictors were measured as time-variant variables. Specifically, education was categorized into four levels: less than high school, high school, some college or college, and advanced degree. The assessment of environmental/peer influence used the question, "Does anyone who lives with you use any form of tobacco product?" Smoke-free home rules were measured by asking if participants were allowed to smoke a combustible/non-combustible tobacco product inside their homes. To assess tobacco-related attitudes, young adult participants were asked, "Thinking about the people who are important to you, how would you describe their opinion on using tobacco?". The answers were recoded as positive/neutral or negative. Current cigarette/e-cigarette use was measured by asking if participants had used them at least once during the past month. Ever other tobacco use was evaluated by asking, "Have you ever used cigars, smokeless tobacco, traditional cigars, filtered cigars, pipe tobacco, snus pouches, dissolvable tobacco, bids, kreteks, hookah or cigarillos?" Ever drug use was assessed by asking if they had ever used marijuana, prescription drugs, cocaine or crack, stimulants, heroin, inhalants, solvents, and hallucinogens. Ever alcohol use was assessed by asking if they had ever used alcohol. Mental/physical health was evaluated by asking, "In general, would you say your mental/physical health is excellent, very good, good, fair, and poor?" Health warning exposure was assessed by asking if they had ever noticed the health warning messages on packages of e-cigarettes and cigarettes.
For media use, participants were asked whether they had ever signed up for email alerts about tobacco products, read articles online about tobacco products, or watched a video online about tobacco products in the past 6 months. The poverty level was guided by the Department of Health and Human Services poverty guidelines and coded as below/at or above poverty levels. Since the questions of these two variables were only asked in Wave 1, for the analysis, we treated media use and poverty level as time-invariant predictors.
Data analysis
The analysis plan was conducted in four steps, similar to a previous study [21]. First, descriptive statistics for sample characteristics were reported as numbers and percentages for Waves 1-4. Second, four waves were combined by the PERSONID, and the missing outcome at Wave 1 was excluded. Third, except for race, sex, poverty level, and media use, we coded the other variables as time-variant predictors, meaning that these variables could vary across the time (wave) metric and were not constrained to be equal across all four waves. Fourth, the time-variant and time-invariant predictors were examined using multilevel modeling as it is the most widely used analytic approach for intensive longitudinal data with time-variant and time-invariant covariates [28]. We applied a time-varying effect model (TVEM) to our study by using the macro %TVEM (available from http://methodology.psu.edu) [28]. The macro %TVEM used all data for each individual across four waves and automatically excluded the missing time-specific observations in the analysis [29]. We reported adjusted odds ratios (ORs) (95% confidence intervals; CIs), and the statistical significance was set at α=0.05 for 2-sided tests for all analyses. Data analyses were conducted using SAS version 9.4 statistical software.