2.1 The Stochastic Model for Infectious Process of the Imported COVID-19 Case
Let X(t) denote a random variable for the state at time t realized by four states defined within state space, Ω = {uninfected (state 1), pre-symptomatic phase (PSP) (state 2), symptomatic phase (SP) (state 3), and asymptomatic phase (ASP) (state 4)}. Given time t, PSP and ASP belong to symptom-free state but the distinction between PSP and ASP is that the former would progress to SP (non-persistent asymptomatic) whereas the latter would remain free of symptom after infection (persistent asymptomatic). Figure 1shows the entire infectious process from exposure to infectives, through ASP or PSP until SP for the imported COVID-19 cases accompanied with time to exposure, the arrival time, and the duration of quarantine and isolation of those suspected infectives.
There are several assumptions made for the proposed four-state stochastic process that are supposed to be biologically plausible.
-
The proposed model is progressive from uninfected to SP.
-
ASP would not surface to SP and can only be detected through RT-PCR test.
Note that there is a fraction of patients infected with SARS-CoV-2 will not present any clinical symptoms. We therefore applied the idea of competing risk model to PSP and ASP for those infected. Infected subjects, when going through the pathway of PSP, would have a finite infectiousness time from PSP to SP.
Following the random process of X(t), Fig. 1 also sketches an illustration of natural course of infection for n passengers from time staying abroad until the end of quarantine and isolation. They consisted of r imported COVID-19 cases and (n-r) susceptible individuals. As shown in Fig. 1, suppose the ith enrolled individual had been in close contact with three types of COVID-19 infectives (including symptomatic, pre-symptomatic, and possible asymptomatic cases) (Ferretti et al. 2020) while staying abroad since calendar time t1 (departure time). Upon the arrival time (t2), this individual may have four possible outcomes as defined within Ω from the date of exposure to the date of arrival governed by the infectious course of SARS-CoV-2 infection.
We are very interested in estimating three parameters deriving from the above-mentioned four-state stochastic process. These include the incidence rate of pre-symptomatic and asymptomatic CPOVID-19 and the hazard from pre-symptomatic to symptomatic phase. The parameter of the latter will be converted to the median pre-symptomatic transmission time (MPTT). The proportion of asymptomatic cases will be also computed by using these parameters.
2.2 Empirical Data
To model the infectious process in connection with the four-state disease model of COVID-19, we targeted the imported cases of COVID-19 among inbound passengers around the world flown into Taiwan between March 2020 and Jan 2022. We divided the study period into seven epochs to cover different periods of emerging SARS-CoV-2 variants including the D614G, Variant of Concern (VOC) Alpha, and Delta, and the recent VOC Omicron. The seven epochs were named according to the corresponding dominant SARS-CoV-2 variants as follows.
-
D614G-1: March-June 2020
-
D614G-2: July-September 2020
-
Alpha-1: October-December 2020
-
Alpha-2: January-May 2021
-
Delta-1: June-August 2021
-
Delta-2: September-November 2021
-
Omicron-1: December 2021-January 2022
It should be noted that we excluded domestic cases because they may contain unknown origin of contact history that may preclude us from estimating the relevant parameters governing the natural history of COVID-19. For each confirmed case, we retrieved data from a repository summarizing the information on imported cases reported by the Central Epidemic Command Center (CECC) in Taiwan (TCDC, 2020). In addition to personal attributes including age and sex, the time stamped on the date of arrival and departure from foreign countries, date of arrival at Taiwan, date of the occurrence of clinical symptoms, and date of RT-PCR test performed were abstracted from the CECC press. It should be noted that as the majority of inbound passengers are Taiwan residents who had been abroad and flown back after business the period between the date of departure and the date of arrival can be exploited for estimating three parameters of interest as below. We can also assume the date of departure would stay in the uninfected state because they would be requested for negative RT-PCR test negative before departure. Therefore, although the exactly date of exposure is unknown it can be assumed the date of exposure must lie between the date of departure and the date of arrival. This forms the basis for building up the likelihood functions as below for estimating pre-symptomatic incidence and the median time of pre-symptomatic transmission. Following the guideline of Taiwan CECC, for subject with suspected symptoms of COVID-19 including fever, cough, short of breath, fatigue, myalgia, diarrhea, and anomaly in smell and test, they would be tested for SARS-CoV-2 infection with RT-PCR twice within 24 hours. For inbound passengers without symptoms, it is mandatory for these subjects to be isolated for quarantine for 14 days. The quarantined subjects will be tested for SARS-CoV-2 upon the occurrence of suspected symptoms during their 14-day quarantine (Taiwan Centers for Disease Control 2021). Information on the origin of country/region for the inbound cases were also collected. Supported by the empirical data of timeline through COVID-19 symptom development, we estimated the rates of progression from pre-symptomatic phase to symptomatic phase. Data on the number of passengers arrived at Taiwan with the information on the departure countries were retrieved from the open data source of Ministry of Transportation and Communications, Taiwan and Ministry of the Interior National Immigration Agency, Taiwan.
2.3 Transition probabilities of state transition
Let I(t) and i(t) represent cumulative distribution function (CDF) and probability density function (pdf) of being infected and staying in the PSP, namely the transition from state 1 to state 2 at time t. The two corresponding functions for the progression from PSP to SP (transition from state 2 to state 3) are denoted by F(t) and f(t). Let A(t) and a(t) denote both CDF and pdf for the occurrence of asymptomatic COVID-19.
The transition probabilities from uninfected to pre-symptomatic, symptomatic, and asymptomatic, and of staying in uninfected in the time interval of \(\left({t}_{1},{t}_{2}\right)\) are derived with the following stochastic integration formula.
$${P}_{12}\left({t}_{1},{t}_{2}\right)={\int }_{{t}_{1}}^{{t}_{2}}i\left(s-{t}_{1}\right)\times \left[1-F\left({t}_{2}-s\right)\right]\times \left[1-A\left(s-{t}_{1}\right)\right]ds$$
1
Equation (1) represents the transition probability from state 1 (uninfected) to state 2 (PSP) between t1 and t2. The first and second element inside the integral of Eq. (1) indicate the process for an infected individual who entered into the PSP at calendar time s since time t1 (the date of departure) as shown in Fig. 1 but has not developed into SP yet until time t2. The third element is not allowed to enter into the ASP once become pre-symptomatic cases according to our model specification.
$${P}_{13}\left({t}_{1},{t}_{2}\right)={\int }_{{t}_{1}}^{{t}_{2}}i\left(s-{t}_{1}\right)\times \left[1-A\left(s-{t}_{1}\right)\right]\times F\left({t}_{2}-s\right)ds$$
2
Equation (2) is the transition probability for an individual who had been exposed at time t1 and entered into PSP at time s (first element) and turn into symptomatic case at time t2 (second element). The third element asserts that it would not be possible to go the pathway of being ASP from t1 to t2
$${P}_{14}\left({t}_{1},{t}_{2}\right)={\int }_{{t}_{1}}^{{t}_{2}}a\left(s-{t}_{1}\right)\times \left[1-I\left(s-{t}_{1}\right)\right]ds$$
3
Equation (3) is the transition probability for an individual who become asymptomatic at time s (first element) and would not follow the pathway of progression from PSP to SP (second element).
The complementary probability for an individual staying uninfected state would be derived as follows.
$${P}_{11}\left({t}_{1},{t}_{2}\right)=1-{P}_{12}\left({t}_{1},{t}_{2}\right)-{P}_{13}\left({t}_{1},{t}_{2}\right)-{P}_{14}\left({t}_{1},{t}_{2}\right)$$
4
The most important information here is related to the probability of progression from PSP to SP in the time interval between arrival (t2) and the end of quarantine (t3),
$${P}_{23}\left({t}_{2},{t}_{3}\right)={\int }_{t2}^{{t}_{3}}f\left(r\right)dr$$
5
, which is equivalent to \(F\left({t}_{3}-{t}_{2}\right)\).
Equation (5) is to provide the transition probability for a pre-symptomatic individual who develops symptoms at instantaneous time r during the time interval between t1 and t2.
2.4 Statistical Distribution of i(t) and f(t)
There are various statistical distributions that can be used for depicting each pdf of i(t), f(t), and a(t) as indicated above. The simplest form follows the exponential distribution for each one of them parameterized by λ1-λ3, which represent pre-symptomatic incidence rate, the progression rate from PSP to SP, and asymptomatic incidence rate in the language of epidemiology. For pre-symptomatic and asymptomatic rate, it would be very reasonable to use two Poisson distributions, an alternative expression of exponential distribution in terms of counts during time interval, for capturing the two parameters of each exponential distribution, λ1 and λ3, given the assumption of rare disease among a large susceptible population.
For the progression rate from PSP to SP in relation to f(t), various forms can be adapted. According to the distribution of viral load from PSP to SP from literature (Sethuraman et al. 2020), we reckon that the log-logistic form seems more appropriate than others like the Weibull distribution form because it is more likely to have a non-monotonic hazard function to describe the progression from pre-symptomatic to symptomatic phase with time. With log-logistic form, it is possible to have a hazard function which increases with time when disease progresses and turns to decrease beyond a time point when a patient starts to recover (Collett 2003). The hazard function of log-logistic form is expressed as
\(h\left(t\right)=\frac{{e}^{\theta }\kappa {t}^{\kappa -1}}{1+{e}^{\theta }{t}^{\kappa }}\) \(0\le \text{t}, {\kappa }>0\) (6)
The cumulative risk of developing symptomatic phase from t1 to \({t}_{2}\) can be written as follows,
$$F\left({t}_{1},{t}_{2}\right)=1-{\left(1+{e}^{\theta }\bullet {\left({t}_{2}-{t}_{1}\right)}^{\kappa }\right)}^{-1}$$
7
Despite this postulate, we attempt a serious of survival functions, including exponential and Weibull distribution, to compare the resulting distribution of emerging symptomatic cases by time with that of log-logistic form. Recall that the hazard from pre-symptomatic to symptomatic phase will be converted to the median pre-symptomatic transmission time (MPTT).
2.5 Parameter estimation with Bayesian Markov Chain Monte Carlo method underpinning
We used the Bayesian Markov Chain Monte Carlo (MCMC) method to estimate the incidence of pre-symptomatic and asymptomatic COVID-19 and the transition from PSP to SP with time following various distributions, including exponential, Weibull, and log-logistic distribution. For each model, the initial 5000 burn-in samples were discarded. Every 20th sample of the following 100,000 iterations was retained and comprised the posterior distribution of parameters of interests. The posterior mean and 95% credible interval of equal tails were reported for all estimates.