Evolving Norms: Social Media Data Analysis on Parks and Greenspaces Perception Changes Before and After COVID-19 Pandemic

DOI: https://doi.org/10.21203/rs.3.rs-1499119/v1

Abstract

This study provides a novel approach to understand human perception changes in their experiences of and interactions with public greenspaces during the early months of COVID-19. Using social media data and machine learning techniques, the study delivers new understandings of how people began to feel differently about their experiences compared to pre-COVID times. The study illuminates a renewed appreciation of nature as well as an emerging but prominent pattern of emotional and spiritual experiences expressed through a social media platform. Given that most studies have almost exclusively examined whether park use increased or decreased during the pandemic, this research provides meaningful implications beyond the simple extensional visit pattern. It highlights the preeminent roles parks and greenspaces play during the pandemic and guides a new direction in future park development practice to support more natural elements and nature-oriented experiences from which emotional and spiritual well-being outcomes can be drawn.

1. Introduction

The world has faced the greatest global health crisis in the 21st century. As of March 2022, more than six million lives worldwide had been lost to the novel coronavirus (COVID-19), causing severe health problems. The consequences have touched every sector of society including economic and social instability with unprecedented changes in lives in a short time period, and these consequences are still continuing. Since a state of emergency was first declared in March 2020, the pandemic has recast our understanding of daily life spanning from the individual and household levels to institutional and societal levels. The primary administrative orders and public health guidance (e.g., stay-at-home and social distancing) have caused physical and psychological distress, fear, and depression (Fitzpatrick et al., 2020; Şimşir et al., 2021; Bäuerle et al, 2021). As a result, many indoor civic spaces including indoor recreation facilities were unavailable to the public in the early pandemic and public outdoor spaces were some of the only spaces for people to have much-needed respite and peace from an overwhelmed sense of anxiety and affliction.

Since late 2020, a growing number of studies and reports have documented a new surge in the use of parks and greenspaces in response to the pandemic. The data have been primarily gathered from data monitoring programs and social surveys targeting outdoor recreation managers or park/trail users. For instance, the Connecticut Trail Census (2020) found a substantial spike in trail counts during the early months of the pandemic through a counter sensing system, with a general increase in the use of trails across the state. There was also a notable increase in the number of first-time rural trails users (Brown, 2021) who might have sought safer spaces that may not have been present in their urban residences. Similar patterns have been documented in other parts of the country. For example, weekly bicycle and pedestrian use on local trails in Minnesota rose during this period compared to weather-adjusted averages (day-of-week, daily high temperature, and daily rainfall) to the extent that some popular trails were closed due to overcrowding.

Along with the new trend of a sudden increase in local rural trail use, urban greenspaces in densely populated cities such as New York City remained popular destinations and were even more important for mental and physical health than before the pandemic (McPhearson et al., 2020). As such, the COVID-19 pandemic has elevated the latitude of parks and everyday greenspaces from a personal recreational option to an essential amenity regardless of the location. In addition, the presence, accessibility, proximity, and safety of parks and greenspaces (as a general term and also with regards to physical distancing practices) have become central subjects related to equity and justice issues in park research and discourse (Larson et al., 2021; Rigolon et al., 2021; Gianfredi et al., 2021). Although recent literature has focused on the changing pattern of park visitation during the pandemic, it is less understood whether and how human perceptions have changed in response to people’s experiences of and interaction with parks and greenspaces. This study aims to understand changes in people’s cognitive experiences of parks and greenspaces at the beginning of the COVID-19 pandemic through a social media lens.

2. Study Settings

2.1 Spatial Setting

The study area includes three adjoining northeastern U.S. states: New York, New Jersey, and Connecticut. The tri-state area experienced hot spots during the first wave of the coronavirus epidemic and then was a safe-to-travel region only in the autumn of 2020 before the second wave hit in the winter. The three states are closely connected with significant floating populations moving across state boundaries for jobs and regional economy, centered largely on New York City. New York City had an unprecedented and rapid explosion of COVID-19 hospitalizations, becoming the global epicenter of the outbreak in the early pandemic period (our study period), and had a significant impact on high caseloads in the neighboring counties in New Jersey and Connecticut (Schulman and Bhanot, 2021).

2.2 Temporal Setting

Tweeter data were collected for the two-month period immediately after the World Health Organization (WHO) declared that COVID-19 was a global pandemic. The specific time span for data collection ranged from March 13, 2020 through May 12, 2020. For comparison, the exact time period was applied to four preceding years (2016, 2017, 2018, and 2019) to obtain pre-COVID tweeter data.

2.3 Conceptual Setting

Target spaces of interest included public parks, forests, trails, and other types of greenspaces, designed or natural. Social media users were engaged with these spaces as they remained open during the early pandemic months. Since the foci of this study were on physical green space features, as a natural form or for passive recreational purposes, we excluded spaces used mainly for outdoor social gathering (e.g., plazas, amphitheaters) from our study. Likewise, beaches were not included as access to these locations were restricted per administrative orders from the three states during the time span of the analysis.

3. Methods

3.1 Data Collection

This study analyzed Twitter data from a convenience sample of residents in the states of New York, New Jersey, and Connecticut. Twitter premium APIs were utilized along with pre-defined keywords to extract Twitter posts (tweets) relevant to park and greenspace experiences. The keywords were in three discrete categories: 1) place type, 2) user experience, and 3) geography. For example, the place type keywords included “park,” “trail,” and “nature.” The user experience keywords were those illustrating recreational activities typically observed in public greenspaces. Lastly, the geography keywords were the names of the three states in full and abbreviated forms, which set the geographic locations of the tweets to be retained. We scrapped tweets generated over a two-month window (March 13- May 12) in each year from 2016 to 2019, and then in 2020 for a comparison to the previous years. Through this process, a total of 73,315 tweets were initially retained.

3.2. Pre-processing

Prior to analysis, the scrapped tweets were pre-processed as follows. First, tweets that were nonsensible or too short (i.e., < 60 characters) were excluded. Second, people’s names were identified by cross-referencing them to the 151,671 common surnames found in the U.S. 2002 Census and they were removed from the tweets. Next, the NLTK library in Python 3.8 was used to filter out stop words such as pronouns, prepositions, and postpositions (Bird et al., 2009). For topic modeling, a TF-IDF weighting method was employed to identify words that frequently appeared exclusively within a topic. Although the word “park” appeared frequently, this word was not considered in the analysis, because it appeared across multiple topics and thus non-distinctive to a particular topic. Table 1 shows the number of tweets for each state and each year of interest after the pre-processing stage. The total of 41,861 tweets were ultimately accounted for analysis.

Table 1

The number of valid tweets included for data analysis for two-month window (March 13- May 12) in each year from 2016 to 2020

Year

State

Total

NY

NJ

CT

2016

3,679

778

477

4,934

2017

3,350

825

446

4,621

2018

5,264

1,250

686

7,200

2019

7,224

1,894

926

10,044

2020

11,670

2,442

950

15,062

Total

31,187

7,189

3,485

41,861

3.3. Data Analysis

Data analysis proceeded in four stages to examine changes in the public cognition (beliefs, opinions, attitudes) on parks and greenspaces after the COVID-19. First, Latent Dirichlet Allocation (LDA; Blei et al., 2003), a topic modeling technique used to identify classes (groups) of words that represent key topics in a set of documents (e.g., tweets in our case), was performed using topic models (Grün & Hornik, 2011) and ldatuning (Nikita & Chaney, 2020) packages in R. In the initial stage of LDA, the optimal number of topics were determined (Zhao et al., 2015) by inspecting various metrics. Figure 2 shows the metrics of Griffiths (2004), Cao and Juan (2009), and Arun (2010), all of which represent the method to arrive at the optimal number of topics. The optimal point was converged into where the Griffith’s value was maximized (approaching 1) and the Cao and Juan’s and Arun’s values were stabilized.

The optimal number of 50 was confirmed by the five-fold cross-validation with perplexity (Zhao et al., 2015; Ellis, 2017). In other words, the validation method used five different machine learning models to find the best fit for each candidate number of topics. A lower value of perplexity indicates better generalization performance of p, a predictive distribution over the words that were learned from a training dataset. As shown in Fig. 3, the perplexity value decreased and became the lowest (i.e., best fit) with 50 topics. After the number 50 was reached, the perplexity value increased indicating over-fitting.

Sentiment analysis was then conducted using the TextBlob library in Python to extract the writers’ emotional states from the tweets. Specifically, for each tweet, polarity and subjectivity were calculated. Polarity ranges from − 1 (extremely negative mood) to + 1 (extremely positive mood), and the tweets were categorized as positive (polarity ≥ 0.30), neutral (–0.30 < polarity < 0.30), or negative (polarity ≤ − 0.30). Subjectivity goes between 0 to 1. Tweets with a subjectivity value < 0.50 were classified as objective; otherwise as subjective.

We then conducted longitudinal cluster analysis, an unsupervised machine learning technique, on the 50 topics identified from the earlier LDA stage. This analysis classified the topics into a few clusters that shared unique joint trajectories over the five-year period. Specifically, the changes in three factors (frequency of the topic, average polarity of the tweets, and average subjectivity) over time were calculated for each topic. Then, the topics that showed a similar pattern of changes (trajectories) were grouped together using the kml3d package in R (Genolini, 2013b; 2015b). Lastly, using SAS 9.4, a chi-square test and t-test were performed to compare the number and proportion of tweets as well as the polarity and subjectivity of the tweets before and after the pandemic, respectively.

4. Results

4.1 Latent Dirichlet Allocation (Topic Modeling)

Of the 50 aggregated topics statistically grouped using LDA, qualitative cross-reference evaluation confirmed that only 20 topics were deemed interpretable and relevant to the study; the remaining 30 topics and data (words in these topics) were excluded from further analyses. Among the 20 topics, two topics and corresponding words therein (italicized in Table 2) were overlapped with each other and thus merged into a single topic (Topic 4) leaving 19 topics for analysis. Table 2 shows the final selected 19 topics and the words that are most representative of these topics. The words under each topic were identified through machine-aided techniques based on the frequency and uniqueness (i.e., the same words appearing across the topics were excluded from the final selection). The topics were then labeled in a way that best characterizes the words as follows: negative feelings (Topic 1), mask-wearing (Topic 2), enjoying nice weather (Topic 3), hiking trail (Topic 4), baseball (Topic 5), breathe fresh air (Topic 6), love and thank nature and God (Topic 7), children play outside (Topic 8), positive feeling (Topic 9), staying at home (Topic 10), social distancing (Topic 11), enjoy and celebrate a beautiful day (Topic 12), water bodies (Topic 13), pets (dogs) (Topic 14), love watching and hearing nature (Topic 15), running (Topic 16), health (Topic 17), driving/parking cars (Topic 18), and biking (Topic 19).

Table 2

Important words for the 19 final topics

Topic 1

Topic 2

Topic 3

Topic 4

Topic 5

Topic 6

Topic 7

Topic 8

Topic 9

Topic 10

really

shit

lol

ass

fucking

fuck

crazy

many

people

mask

without

wearing

face

virus

nature

weather

nice

spring

enjoy

cold

warm

trail

hiking

hike

trails

Appalachian

miles

rail

run

home

hit

game

ball

inning

baseball

get

air

fresh

need

nature

outside

breath

nature

life

love

thank

God

world

true

kids

school

play

outside

children

young

playground

like

feel

love

better

much

outside

feeling

people

stay

home

live

safe

staying

inside

Topic 11

Topic 12

Topic 4

Topic 13

Topic 14

Topic 15

Topic 16

Topic 17

Topic 18

Topic 19

social

distancing

space

people

distance

safe

socially

day

beautiful

happy

enjoy

celebrate

perfect

gorgeous

trail

hiking

national

mountain

shoes

boots

trip

beach

lake

island

preserve

falls

pond

creek

dog

cute

leash

pet

animals

wildlife

owner

nature

trees

birds

love

flowers

watching

sounds

run

mile

race

runners

track

road

marathon

health

help

outdoors

mental

stress

physical

energy

car

right

lot

parking

drive

street

waiting

biking

ride

lane

protected

sidewalk

safety

cycling

The longitudinal cluster analysis that investigated the trajectories of the 19 topics relative to

their frequency, polarity, and subjectivity over the five years (2016–2020) resulted in four unique trajectories (clusters). Cluster A (36.8%) included Topic 3 (enjoying nice weather), Topic 4 (hiking trail), Topic 6 (breathe fresh air), Topic 7 (love and thank nature and God), Topic 9 (positive feeling), Topic 13 (water bodies), and Topic 15 (love watching and hearing nature). The polarity and subjectivity of this cluster remained the same or slightly increased over the years, but the frequency increased considerably after the pandemic. Cluster B (36.8%) was characterized as having low and consistent frequency, polarity, and subjectivity. This cluster included Topic 5 (baseball), Topic 8 (children play outside), Topic 14 (pets/dogs), Topic 16 (running), Topic 17 (health), Topic 18 (driving/parking cars), and Topic 19 (biking). Cluster C (21.1%) included Topic 1 (negative feelings), Topic 2 (mask-wearing), Topic 10 (staying at home), and Topic 11 (social distancing). This cluster demonstrated a huge increase in frequency and a slight drop in polarity after the pandemic. However, the subjectivity has not changed significantly over time. Finally, Cluster D (5.26%) included only one topic (Topic 12: enjoy and celebrate a beautiful day), and this cluster showed the highest polarity and subjectivity compared to other clusters, which had significantly declined after the pandemic—i.e., more negative and objective tweets compared to the previous years (see Fig. 4).

Note that approximately 70% of the Cluster C tweets were generated after the COVID-19 pandemic. Excluding this cluster, the number and proportion of tweets in other clusters were further compared before and after the pandemic. Table 3 shows the number of tweets in Clusters A, B, and D from 2016 to 2020 for the three states (NY, NJ, CT) combined and individually. Although there was a shortfall in 2017, overall, the number of tweets in these clusters more than doubled from 2016 (1,974 tweets) to 2020 (4,287 tweets). The results of the chi-square test indicated that the proportions of tweets across different clusters (i.e., relative size) also significantly changed over the years (χ2(3) = 2063.31, p < .0001). More specifically, the proportion of Cluster A tweets had continued to decrease before the pandemic (from 64–51%), but in 2020 it sharply increased (to 61%) returning to the 2016 level. In contrast, the proportion of Cluster B tweets had gradually increased before the pandemic (from 30–42%) and then declined to 32% in 2020. Cluster D had remained the same proportion (6%) throughout the period. As Fig. 5 illustrates, the increase in the number of tweets was greatest in New York (222% increase), followed by New Jersey (220%) and Connecticut (176%). In particular, the increase in the last two years (i.e., right before and right after the pandemic) was remarkable in New York (148%) followed by Connecticut (130%) and New Jersey (121%). The proportions of tweets across Clusters A, B, and D and the changes over time were similar among the three states.

Table 3

The number of tweets from 2016 to 2020, by state and cluster

  

2016

2017

2018

2019

2020

Total

Cluster A

1,271 (64%)

911 (55%)

1,404 (53%)

1,559 (51%)

2,631 (61%)

 

Cluster B

593 (30%)

637 (39%)

1,104 (42%)

1,285 (42%)

1,393 (32%)

 

Cluster D

148 (6%)

163 (6%)

220 (6%)

273 (6%)

732 (6%)

NY

Cluster A

935 (64%)

664 (55%)

985 (51%)

1107(51%)

1978(61%)

 

Cluster B

449 (31%)

472 (39%)

827 (43%)

938 (43%)

1064 (33%)

 

Cluster D

70 (5%)

69 (6%)

103 (5%)

131 (6%)

188 (6%)

NJ

Cluster A

192 (60%)

152 (55%)

255 (54%)

311 (53%)

428 (60%)

 

Cluster B

97 (30%)

102 (37%)

179 (38%)

235 (40%)

233 (33%)

 

Cluster D

33 (10%)

23 (8%)

36 (8%)

40 (7%)

48 (7%)

CT

Cluster A

144 (73%)

95 (57%)

164 (60%)

141 (53%)

225 (65%)

 

Cluster B

47 (24%)

63 (38%)

98 (36%)

112 (42%)

96 (28%)

 

Cluster D

7 (4%)

9 (5%)

13 (5%)

15 (6%)

27 (8%)

 

Total

1,974

1,649

2,660

3,030

4,287

4.2 Semantic Analysis

The polarity and subjectivity of the tweets are presented in Table 4. A paired-samples t-test was conducted to examine the changes in the polarity and subjectivity of the tweets before and after the pandemic (2016–2019 vs. 2020). Overall, the polarity significantly decreased (t = 8.29, p < .001) but it was still in the “neutral” range. Interestingly, unlike other clusters, the tweets in Cluster B showed an increased polarity, albeit not statistically significant. The subjectivity significantly moved in the direction of “subjective” (t = − 5.49, p < .001), which was marked in Clusters A and B but not the other two clusters. All three states showed an increase in subjectivity, but it was significant only in New Jersey (Table 4).

Table 4

Descriptive statistics and t-test results on polarity and subjectivity

 

Polarity

 

Subjectivity

 

Before

After

t

 

Before

After

t

Cluster A

.18 (.27)

.16 (.24)

2.41*

 

.41 (.30)

.44 (.23)

–3.54***

Cluster B

.10 (.05)

.11 (.23)

–1.47

 

.36 (.00)

.41 (.24)

–6.79***

Cluster C

.09 (.27)

.07 (.23)

3.24**

 

.42 (.28)

.41 (.22)

1.27

Cluster D

.39 (.29)

.34 (.25)

2.69**

 

.58 (.29)

.56 (.24)

0.75

New York

.17 (.29)

.17 (.24)

0.21

 

.41 (.29)

.43 (.24)

–1.42

New Jersey

.14 (.27)

.12 (.24)

6.14***

 

.40 (.29)

.43 (.23)

–5.91***

Connecticut

.17 (.28)

.11 (.24)

6.27***

 

.42 (.29)

.42 (.24)

–0.37

*p < .05, **p < .01, ***p < .001.

5. Discussion And Conclusion

“Tweeting” more about nature and spiritual experiences during the COVID-19

The results indicate that the pandemic has transformed how people perceive parks and greenspaces in the eastern tri-state region during the initial surge of the pandemic. The model-driven topic classification method and longitudinal clustering outcomes turned out to be legitimate to understand the domains of human interests related to parks and greenspace experiences. Unsurprisingly, the topics associated with COVID-19 safety measures (e.g., mask wearing, staying at home, and social distancing) showed a significant increase in tweet data to the extent that is clustered into a discrete category (Cluster C). It is also understandable that traditional park activities (e.g., ball-playing, biking/running, dog walking) were still prominent topics on social media, as represented in Cluster B. The fact that there were no dramatic changes in people’s traditional park use patterns between pre- and post-COVID tweets reflects that people still may have found value in what they used to do in parks and greenspaces, and these recreational activities might have been perceived as safe enough when practiced alone or with immediate family.

Interestingly, people’s interests were noticeably shifted to natural components after the pandemic. This change is likely tied to the positive sentimental attachment to parks and greenspaces, as identified in Clusters A and D. For example, “feeling fresh air” appeared to be a critical value in people’s experiences during the first shock of the pandemic. In fact, some passive experiences such as bird watching or listening to birds and water sounds were not very common in pre-COVID tweets but became important experiences as the pandemic ramped up. More importantly, emotional and spiritual expressions elicited from people’s interaction with nature (e.g., loving and thanking nature and God) appeared to be an emerging yet strong theme in the post-COVID tweets in each state and also across the study region. This finding infers that people became more attentive to the natural features and conditions than before the pandemic, began to link nature exposure to sentimental and spiritual well-being, and reappreciate the natural and health values of public greenspaces compared to their typical recreational and aesthetic aspects. Although this study cannot claim a direct correlation between the motivation of spiritual well-being and park visitation (See Irvine et al., 2013 for relevant research), the benefits of emotional and spiritual well-being, anticipated or unanticipated, likely impacted the positive park experiences and outcomes. This finding suggests that the benefits of public greenspaces can be expanded to a dimension of spirituality where people experience self-reflection, introspection, meditation, and other inner feelings (Heintzman, 2002). These feelings are known to be benefits of distant wilderness areas, but they have not often been associated with public greenspaces at the local level.

The results might have been influenced by the general increase in the number of tweets pertaining to parks and greenspaces in 2020, which, in itself, is inspiring as it denotes emergent public interest in parks and greenspaces at the personal communication level. Nevertheless, people’s renewed appreciation of nature and immediate linkage to emotional and metaphysical realms illuminate the preeminent roles that parks and greenspaces play during the pandemic. The overall mood for the entire set of tweet data, however, moved toward a more negative tone, which is due presumably to the severity of the pandemic rather than being related to park visits and experiences.

Advantages and Disadvantages of using Social Media Data

As social media data has become increasingly abundant, the data are being utilized as a primary data source in many human-environments interaction research to investigate broader patterns of human interest and experiences. The greatest advantage of using social media data is that it can minimize subject response bias because respondents are not aware of the intent of the study. In other words, the gathered data are a collection of spontaneous responses from test subjects, which can later be utilized as input for analysis. However, in a typical social survey, respondents may have a prior perception of the survey intent as they are informed by and cognizant of the goals of the research before participating in the survey. In addition, survey instruments can only take responses at a specific point in time and may not reflect the actual perception trends over a longer timeframe. Another strength of the social media approach is the ability to obtain a comprehensive (literally all) set of data that fulfills certain conditions that the research defines (in our case, all Twitter posts that contain preset keywords) during a desired period of time. In contrast, it is often a challenge for social surveys to secure enough samples. A known caveat of the social media approach, however, is that the sample data are limited to users of social media, and do not capture potential respondents who are not on social media platforms. In addition, provided that tweets are voluntary expressions of personal feelings and instant thoughts, the language of the Twitter posts are only assumed to represent social media users’ individual cognition and inner ideas about their experiences. Nonetheless, it serves as a valid approach to address time-sensitive phenomena such as the pandemic from a vast amount of data that are relatively readily available and have been formed in a voluntary and instantaneous manner.

Under unique circumstances like COVID-19, data collection is especially bounded with time and sampling. As random sampling is often not viable, researchers tend to rely on convenience samples for easy and timely access to target subjects. In fact, many recent studies have capitalized on this type of method by adding COVID-19 related questions to existing surveys or by creating a quick questionnaire to sample a population that is easy to contact or reach, causing a generalization issue.

Summary and Future Research

This study attempted to understand people’s perceptional changes in their interactions with parks and greenspaces during the early months of COVID-19 using social media data combined with a machine-learning approach. We categorized key words from tweet data into 19 topics to understand what new parameters people began to appreciate in parks and greenspaces compared to pre-COVID times. The results demonstrate that people were more inclined to appreciate natural experiences in these spaces and tended to link them to their emotional and spiritual domains, as reflected in these very common themes across the three states in this study. Albeit geographically fractional, the research revealed that there is a clear difference between the early COVID-impacted timeframe versus non-impacted years in that people perceived nature in public greenspaces as a substantial emotional and spiritual lift. Most studies have almost exclusively examined whether park use had increased or decreased during the pandemic, but this research focused more on whether and how park user experiences were felt differently beyond the simple extensional visit pattern. The study demonstrates a new pattern of more nature-oriented activities in public greenspaces and how natural components have been important in people’s emotional and spiritual experiences during the COVID-19.

The implications of this research include the emergent and adaptive roles of public greenspaces

in response to changing human needs in disruptive phenomena. The study suggests a deliberate incorporation of biophysical characteristics of wilderness settings (e.g., elevations, water, naturalness, unique scenery and landscapes) into our public park systems to facilitate the emotional and spiritual dimensions of our experiences (e.g., feeling of unity with nature and unity with oneself) in addition to typical park experiences (Chiesura, 2004). We also argue that we need to position nature as a vital element in contemporary public park designs and development decisions.

One limitation of this study is that we relied largely on an indirect method to understand people’s nuanced feelings, senses, ideas, and perceptions expressed through a social media platform. Therefore, the results only represent the general pattern of public perceptions rather than direct reflection from individuals. Given that we continue to struggle with another surge in COVID-19 cases driven by the highly transmissible Delta and Omicron variants, we are likely to experience continued pandemic concerns, a new wave of restrictions on public gathering, and entry into a “with-COVID” era. Future research needs to understand the genuine desire of the public for parks and greenspaces as an essential and equitable resource in current and future health crises. The research agenda should fill the gaps between park provision and people’s new motivation to be safe in outdoor spaces during the pandemic. We plan to expand our analysis to the entire country in an elongated timeframe to trace the extended trends over time.

Declarations

Data availability

Scientific Reports requires a Data Availability Statement to be included in all submitted manuscripts (at the end of the main text, before the References section); see 'Availability of materials and data' section for more information.


Author Contributions 

SP wrote the main manuscript text and has contributed to the overall design of the work, literature reviews, and data interpretation. SK has contributed to the conception of the work, data collection, and data analysis.  JL has contributed to statistical analysis. BH prepared figures 1-3. All authors reviewed the manuscript.

 

Competing interests

The authors declare no competing interests.

References

  1. Bäuerle, A. et al. Increased generalized anxiety, depression and distress during the COVID-19 pandemic: a cross-sectional study in Germany, J. Public Health. 42, 4, 672–678 (2020).
  2. Bird, S., Klein, E. & Loper, E. Natural language processing with Python. (O'Reilly, 2009).
  3. Blei, D. M., Ng, A. Y. & Jordan, M. I. Latent Dirichlet Allocation. J. Mach Learn Res, 3, 993–1022 (2003).
  4. Brown, L. E. et al. Exploring the Implications of Increased Rural Trail Use During the COVID-19 Pandemic on Health, Planning, Equity and Inclusivity. Choices. Quarter 3 (2021).
  5. Connecticut Trail Census. COVID-19 Trail Impact Report. (2020).
  6. Ellis, P. Cross-validation of topic modelling. http://freerangestats.info/blog/2017/01/05/topic-model-cv (2017).
  7. Fitzpatrick, K. M., Drawve, G. & Harris, C. Facing new fears during the COVID-19 pandemic: The State of America’s mental health, J. Anxiety Disord. 75 (2020).
  8. Genolini, C., Alacoque, X., Sentenac, M. & Arnaud, C. kml and kml3d:R Packages to Cluster Longitudinal Data. J. Stat. Softw. 65 (2015).
  9. Genolini, C. kml3d: K-Means for Joint Longitudinal Data. R package version 2.3, URL http://CRAN.R-project.org/package=kml3d (2015)
  10. Genolini, C., Pingault, J., Driss, T., Côté, S., Tremblay, R., Vitaro, F., Arnaud, C. & Falissard, B. KmL3D: A non-parametric algorithm for clustering joint trajectories. Comput Methods Programs Biomed. 109, 104–111 (2013).
  11. Gianfredi, V., Buffoli, M., Rebecchi, A., Croci, R., Oradini-Alacreu, A., Stirparo, G., Marino, A. Association between Urban Greenspace and Health: A Systematic Review of Literature. Int. J. Environ. Res. Public Health. 18, 5137 (2021).
  12. Grün, B & Hornik, K.. Topicmodels: An R Package for Fitting Topic Models. J. Stat. Softw. 40, 1–30 (2011)
  13. Java, A., Song, X., Finin, T. & Tseng, B. L. Why we twitter. Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, 56–65 (2007).
  14. Larson, L. R. et al. Urban Park Use During the COVID-19 Pandemic: Are Socially Vulnerable Communities Disproportionately Impacted? Frontiers in Sustainable Cities. 29 (2021).
  15. Lopez, B., Kennedy, C. & McPhearson, T. Parks are Critical Urban Infrastructure: Perception and Use of Urban Green Spaces in NYC During COVID-19. Preprint at https://www.preprints.org/manuscript/202008.0620/v1 (2020).
  16. Oftedal, A. Trail use has surged in response to the COVID-19 pandemic. Parks and Trails Council of Minnesota. https://www.parksandtrails.org/2020/04/27/trail-use-covid19/ (2020).
  17. Ovadia, S. Exploring the Potential of Twitter as a Research Tool. Behav. Soc. Sci. Libr. 28, 202–205 (2009).
  18. Python Software Foundation. Python Language Reference, version 3.8. http://www.python.org.
  19. Rigolon, A., Browning, M. H., McAnirlin, O. & Yoon, H. V. Green space and health equity: a systematic review on the potential of green space to reduce health disparities. Int. J. Environ. Res. Public Health, 18, 2563 (2021).
  20. RStudio: Integrated Development for R. http://www.rstudio.com (2020).
  21. Schulman, A. & Bhanot, G. Migration of households from New York City and the Second Peak in Covid-19 cases in New Jersey, Connecticut and New York Counties. Preprint at https://www.medrxiv.org/content/10.1101/2021.03.29.21254583v1.full (2021).
  22. Shmueli, G. & Bruce, P., Yahav, I., Patel, N. R. & Lichtendahl, K. C. Jr. Data Mining for Business Analytics: Concepts, Techniques, and Applications in R (Wiley, 2017).
  23. Şimşir, Z., Koç, H., Seki, T. & Griffiths, M. D. The relationship between fear of COVID-19 and mental health problems: A meta-analysis, Death Studies. 46 (2022).
  24. Zhao, D. & Rosson, M. B. How and why people Twitter. Proceedings of the ACM 2009 International Conference on Supporting Group Work. 243–252 (2009).
  25. Zhao, W. et al. A heuristic approach to determine an appropriate number of topics in topic modeling. BMC Bioinform. 16 (2015).