A Big Picture
Throughout the year, there were 1,337,774 ITGs and 1,542,050 distinct ITG riders identified in the network. Daily ITG riders account for 10–16% of all daily riders. About half of all ITGs consist of two adult riders, and the proportion climbs to over 60% when we particularly examine ITGs with different home stations. We observe 23.5% of all ITGs include one adult and one child (Fig. 1a). In particular, we define riders’ “home” stations as those riders start their first metro trip of a day most frequently within a year. It could be found that 74% of all ITGs include two riders with the same “home” stations. These ITGs have an average of 10.96 ITG trips throughout the year, which is more than that of other ITGs with 8.24 trips on average. Considering “family” members are more likely to live together and have higher contact frequency, the ITGs including riders with the same home stations could at least partially reflect the share of “family” or “neighbors” in the dataset.
Fig 1b presents monthly ITG riders. Hong Kong suffered from four surges of locally confirmed COVID-19 caseloads during 2020 since the first such case emerged on Jan 23 (HK Gov., 2022). As expected, the numbers of both ITGs and ITG riders were significantly suppressed after COVID-19 hit the city. Specifically, the quantity of ITGs decreased from 313,441 in January to 129,858 in February. The size of ITG riders in February dropped to only 44% of that in January. In terms of persistent group riders (PGRs), who remain travelling as ITGs, 85,809 PGRs, 45% of all ITG riders in January, retained their intentional group travel in February after the local pandemic outbreak. Despite the presence of COVID-19, 9,461 PGRs remained in ITG for all the twelve months in 2020.
Destination Distribution Of Itg Riders
To explore how ITG riders’ destinations (metro stations) were impacted by the pandemic and how related potential transmission risks by destination changed across different surges, we compared the spatial distribution of those destinations before and after the outbreak of the pandemic based on a percolation analysis.
Fig 2 presents the spatial patterns of the top metro stations in terms of the most incoming ITG trips across different periods. Not surprisingly, before the local pandemic outbroke, the ITG riders centered around Kowloon South and Hong Kong Island North, where there is the highest concentration of local leisure, commercial, catering, and business facilities (Fig 2b). Specifically, stations serving Causeway Bay (Station 28), Tsim Sha Tsui (Station 3), and Mong Kok (Station 6) stood out among all the local metro stations.
After the local pandemic outbroke, only PGRs continued travelling as a group. The top metro stations that these riders visited varied notably from those for ITG riders pre the pandemic. First, Tesung Kwan O (Station 50) and its neighboring stations became one of the top 3 destinations. Second, Mong Kok was no longer within the top list. Third, Causeway Bay was still one of the most popular destinations whereas other stations such as Tai Wai emerged in the list of these destinations across the four surges of locally confirmed cases. Throughout the four surges, Tesung Kwan O was a destination that PGRs continued visiting. Given that on top of and around the station and its neighboring stations, it is one of the latest “Rail + Property” new towns of Hong Kong, one can say that the corresponding new town was quite self-contained for PGRs before and amid COVID-19, i.e., local facilities could satisfy most need of ITG riders (Cervero and Murakami, 2009) and residents have more opportunities for joint household activities when working at home and saving commuting time (Tao and He, 2021). Most if not all riders/residents in this new town only need a short metro ride to have their needs met, which exemplifies both the well-discussed neighborhood planning (Silver, 1985) and the emerging 15-minute city concept (Kissfazekas, 2022; Leung, 2021).
A Social Interactive Network Among Metro Riders During The Pandemic
Regarding ITG as a proxy for social interactions among metro riders, a metro “social network” (ITG network) is constructed. This network is facilitated by the presence and usage of the local metro system. The network’s nodes are individual ITG riders, and the links exist when any two ITG riders intentionally travel as a group.
Against the backdrop of the COVID-19 pandemic, we examine whether and how the network properties changed significantly amid the pandemic surges. Overall, the daily quantity of all riders, ITG riders, and ITGs decreased after the local pandemic outbroke, hit rock bottom in the first and second surge, and then rebounded gradually in the third and fourth surge (Fig. 3a). Particularly, Fig. 3b shows the average quantity of ITG riders by hour of a weekend day pre-pandemic and in each surge. In addition to an obvious decrease, the hourly ITG riders were fewer in the first and second surge than those in later two surges. During the surges, the pre-pandemic peak hour (5 pm) was flattened, which indicates some trips might shift from peak to off-peak hours. Despite the variations in quantity, we found that the average lag between any two ITG trips of all the ITG riders fluctuated amid the pandemic, with 82 hours during the pre-pandemic time and then 92,138,237 and 167 hours in the following four surges, respectively. It seems that the lag increased right after the local pandemic outbroke while decreased when people became aware that there could be multiple surges of locally confirmed cases.
To explore the variation among different periods of each surge, Fig. 3c depicts the COVID-19 case surges and their respective quiet (the time pre-COVID-19 or when case increase remained low), climbing (when daily confirmed cases were increasing), and declining periods (when daily confirmed cases were decreasing) in Hong Kong in 2020. The time windows of the periods were defined by referring to both the dates when local anti-pandemic measures eased or tightened (Supplementary Table S1), and the algorithm-based detection of optimal changepoints in numbers of daily locally confirmed cases (Killick et al., 2012) (Supplementary Fig. S3). All ITGs were allocated to the three different periods of each surge. Concerning ITGs across the three periods, we proposed and tested the first hypothesis:
Hypothesis 1
The size and interconnections in the ITG network varied significantly across the quiet, climbing, and declining periods of each pandemic surge.
Figure 3d shows the variation in the ratio of ITG riders to all the riders as the time approaches a surge’s zenith, when the number of daily locally confirmed cases reached the highest of the given surge. The ratio of ITG riders to all the riders in each surge fluctuated. Except for the fourth surge, a “smiling” curve in variation could be found for the first, second, and third surges. In the fourth surge, one half of the “smiling” curve could also be seen despite incomplete data. This means that there was a negative correlation between the share of ITG riders and the daily confirmed cases. Quantity of ITG riders decreased disproportionately strongly. Interestingly, even though there were a much larger number of locally confirmed cases in the third and fourth surges, the share of the daily ITG riders in these two surges were no smaller and the slope of the corresponding “smiling” curves were even flatter than their counterparts in the first and second surges. This implies that ITG riders became less sensitive to the locally confirmed cases as the fight against COVID-19 had become a long-lasting task (Petherick et al., 2021).
Student’s t-tests were performed to examine the variation in the ITG network’s interconnections, measured by the degree (k) of ITG riders, and size, measured by the average number (n) of ITG trips of each Metro rider per weekend day among the different periods of each surge. The degree (k) equals the number of other riders that a given ITG rider has ITG trips with. One hypothesis for the analysis is that k and n during the quiet period of a given surge would be lower than that within the surge.
Table 1
t-test Results for the ITG network
|
Average Degree for the ITG Riders (k)
|
Average ITG trips per rider (n)
|
Surge
|
\({k}_{s}\) < \({k}_{q}\)
|
\({k}_{d}\) > \({k}_{c}\)
|
\({k}_{d}\) < \({k}_{c}\)
|
\({n}_{s}\) < \({n}_{q}\)
|
\({n}_{d}\) > \({n}_{c}\)
|
\({n}_{d}\) < \({n}_{c}\)
|
1st Surge
|
44.54
|
-
|
2.46***
|
10.81
|
-
|
37.17***
|
2nd Surge
|
14.17***
|
16.87***
|
-
|
11.83***
|
10.25***
|
-
|
3rd Surge
|
13.69***
|
9.68***
|
-
|
24.93***
|
-
|
6.89***
|
4th Surge
|
3.72
|
11.03***
|
-
|
6.71***
|
-
|
9.06***
|
t-statistic is reported. ***p < 0.01, **0.01 < p < 0.05, *0.05 < p < 0.1. s denotes data during the given surge which consists of both the climbing and declining periods; q denotes data in the quiet period before the given surge; c denotes data during the climbing period and d denotes data during the declining period. Data is normalized into daily average.
|
Table 1 presents the t-test results, which were statistically significant for k or n models for the last three surges. This indicates that the pandemic situations reduced the ITG networks’ interconnectedness and size. Another hypothesis is that the two parameters would differ between two periods within a surge. Results of most n models show statistically significant lower values in declining periods compared with the climbing periods. This complies with the previous study that responses from Metro travel behaviors would lag behind the variation in COVID-19 transmission (Zhang et al., 2021). However, k of the last three surges is statistically significantly lower during the climbing periods than the declining periods, which implies lower interconnectedness of the network while newly confirmed cases are growing. Some analysis results incompatible with the above explanations might be impacted by implementation of local anti-pandemic measures during certain periods. For example, closure of cross-border check points occurred within the climbing period of the first surge, and bans on gatherings and international travel were implemented during the climbing periods of the second surge.
A Scale-free Itg Network
Anti-pandemic measures and people’s fear of infection could reduce physical contacts in the city. Subsequently, the ITG network could collapse. However, most social interactive networks are found to be scale-free in distribution (Barabâsi et al., 2002) and scale-free networks are usually robust when faced with malfunctional nodes and exogenous attacks (Albert et al., 2000; Dou et al., 2010). That is, a scale-free network hierarchically includes dominant nodes with a large degree followed by less-dominant nodes. In the network, nodes with a small degree account for vast majority of the network. Therefore, the overall structure of the network usually would not be affected if we randomly remove a small fraction of nodes. We consider that this would also be true for the ITG network amid the pandemic. Such properties would increase metro travel anxiety of many people and notably reduce outgoings of many metro riders. Hence, the following two hypotheses are proposed:
Hypothesis 2
The structure of the ITG network remains robust amid the pandemic.
The topology of the network remains relatively robust as a whole despite the pandemic. Figure 4a compares the complementary cumulative distribution functions (CCDF) of ITGs network before the pandemic outbroke with those in different COVID-19 surges. The ITG networks perform statistically significant in power-law distribution spanning all periods with p-value of KS-tests smaller than 0.01. The α decreased after the pandemic outbroke and then fluctuated across surges around 3 to 5. This indicates that riders with a higher degree account for a larger proportion of ITG riders of a local surge compared to the pre-pandemic time.
Hypothesis 3
The ITG network increases in time and follows a scale-free distribution.
Shown in Fig. 4b, interconnections of the network can be measured by two indicators: average degree (k) and the relative size of the largest cluster (S). Despite the k mentioned above, the S is the ratio between the total number of riders in the largest cluster and that in the whole network. Both k and S increased in an approximately linear manner. The growth of k has three contributors. One, new ITG riders made ITG trips together and joined the existing ITG network. Two, these new riders travelled with existing ITG riders so new ITGs trips emerged. Three, new ITG trips emerged among existing ITG riders. A cluster of the ITG network represents a subset of ITG riders who became connected because of the presence of ITG trips. The ITG network of metro riders is embedded in the large social interactive network in the city. A larger S indicates that an increasingly dominant cluster in the ITG network and possibly in the large social interactive network had emerged.
As for the network structure, we used the whole year’s ITG dataset to examine distribution of frequency and degree of the ITG riders. As shown in Fig. 4c, the trip frequencies of most riders in the network typically increases as the degree increases. For all ITG riders, their trip frequencies’ mean and maximum are 28 and 1,132, respectively. Figure 4d presents the degree distribution in the logarithmic binning format. The distribution of all the ITG riders and that of non-commuting ITG riders remain similar. The average degree of all the ITG riders is 1.73 and the max value is 52, lower than 150, the suggested cognitive limit to the number of people with whom one could maintain stable social relationships (Dunbar, 1993). In particular, we found that the probability P(x) that a rider is connected with at least x other riders in the ITG network decays following a power law, which can be expressed as P(x) ~ \({x}^{-\alpha }\), where α = 5.67. The Kolmogorov-Smirnov test (KS-test) was performed to examine whether the network’s degree distribution statistically follows the power-law distribution. The test produces a p-value smaller than 0.01. This result demonstrates that the ITG network indeed resembles other social networks with a scale-free distribution in degree.
Who Maintains The Itg Network?
According to what we know about the structural robustness of scale-free networks (Albert et al., 2000), a probable explanation for the ITG network’s relative stability amid the pandemic is the continued presence of PGRs, who remained active in ITG travel despite of the surges in locally confirmed cases.
Table 2
t-Test Analysis for Persistent Group Riders (PGRs) in the ITG network
|
Average Degree for PGRs (k)
|
Average ITG trips for PGRs (n)
|
Surge
|
\({k}_{s}\) < \({k}_{q}\)
|
\({k}_{np}\) < \({k}_{p}\)
|
\({n}_{s}\) < \({n}_{q}\)
|
\({n}_{np}\) < \({n}_{p}\)
|
1st Surge
|
19.82***
|
130.54***
|
7.90
|
52.18***
|
2nd Surge
|
67.70***
|
115.46***
|
52.17***
|
80.02***
|
3rd Surge
|
122.69***
|
142.99***
|
97.44***
|
60.09***
|
4th Surge
|
116.11***
|
141.15***
|
79.67***
|
20.69***
|
t-statistic is reported. ***p < 0.01, **0.01 < p < 0.05, *0.05 < p < 0.1. s denotes data during the given surge which consists of both the climbing and declining periods; q denotes data in the quiet period before the given surge; p denotes data on PGRs occurring before the local pandemic outbroke and then remaining in the network within all four surges; np denotes data on non-PGRs within the given surge.
|
Table 2 presents the t-test results regarding the degree and number of the PGRs and non-PGRs. The results corroborate the hypothesis that the degree and number of PGRs would be significantly higher than those of non-PGRs in any given surge (p < 0.01). Not surprisingly, even the PGRs were impacted by the surge of the locally confirmed cases–their k and n in the four climbing periods were mostly smaller than those in the corresponding quiet periods. To further explore the role of PGRs in maintaining the ITG network’s robustness, Hypothesis 4 was proposed.
Hypothesis 4
PGRs are essential vertices those influenced the properties of the ITG network.
To test this hypothesis, we conduct an experiment to compare how removal of PGRs, who remain active in group travel amid the pandemic, and the same number of riders randomly drawn from the ITG dataset would affect the interconnectedness in the network. The interconnectedness can be measured by k and S. We validated that this is the case in Hong Kong. As for the experimental failures for measurement on k, we call removing PGRs who remain in the ITG network up to a certain period “Failure 1” and removing the same amount of randomly drawn riders from the network “Failure 2”. S indicates the largest number of riders that are potentially affected by an infected rider in the ITG network. Taking the “social network” of the local metro riders in the whole year as the subject, we separately remove the same fraction (number) of different subsets of riders for 50 times in total and up to 19% of the network size (the proportion of the largest cluster in the whole-year network.), to investigate how that would influence S. Our subsets of ITG riders include those that were randomly selected throughout the year (Subset 1) and PGRs presented in at least one (Subset 2), two (Subset 3), three (Subset 4), and four (Subset 5) surges throughout the year. Removing the different subsets allows us to simulate different failures of the network and to see related consequences.
Figure 5a shows k becomes significantly lower under the “Failure 1”, while the k’s for “Failure 2” are almost the same as those without removing any riders from the ITG network in the same period. Similarly, if we assume it is highly likely for ITGs and related interactions to transmit viruses when one ITG rider has been infected, S would indicate the largest number of riders potentially affected by an infected rider in the ITG network. Figure 5b shows that S apparently remains more stable when randomly removing ITG riders (Subset 1), compared with removing PGRs (Subset 2–5). For the PGRs in different subsets, the more frequently they presented in the four surges, the more influential they were in S. Removing those PGRs who presented in all the four surges (Subset 5) resulted in the smallest S for failures. These indicate that the PGRs played a more influential role in interconnections of the network than wandering riders who entered or left during the given period.