Earliest Detection to Date of SARS-CoV-2 in Florida: Identification Together With Influenza Virus on the Main Entry Door of a University Building, February 2020

doi:10.21203/rs.3.rs-87486/v1

Download PDF

Research

Earliest Detection to Date of SARS-CoV-2 in Florida: Identification Together With Influenza Virus on the Main Entry Door of a University Building, February 2020

https://doi.org/10.21203/rs.3.rs-87486/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background

Questions persist about patterns of initial dissemination of SARS-CoV-2 in the United States in early 2020.

Methods

In February and March, 2020, environmental surface swab samples were collected from the handle of the main entry door of a major university building in Florida, as part of a pilot surveillance project screening for influenza. Samples were taken at the end of regular classroom hours, between the dates of February 1-5 and February 19-March 4, 2020.

Results

Influenza H1N1pdm09 was isolated from the door handle on four of the 19 days sampled. Both SARS-CoV-2 and influenza virus were detected in the sample collected on February 21, 2020. Based on sequence analysis, the Florida SARS-CoV-2 strain (designated UF-11) was identical to strains being identified in Washington state during the same time period, while the earliest similar sequences were sampled in China/Hubei between Dec 30^th 2019 and Jan 5^th 2020. The first human case of COVID-19 was not officially reported in Florida until March 1^st. In an analysis of sequences from COVID-19 patients in this region of Florida, there was only limited evidence of subsequent dissemination of the UF-11 strain. Identical or highly similar strains, possibly related through a common transmission chain, were detected with increasing frequency in Washington state between end of February and beginning of March.

Conclusions

Our data provide further documentation of the rapid early spread of SARS-CoV-2, and underscore the likelihood that closely related strains were cryptically circulating in multiple U.S. communities before the first “official” cases were recognized.

Epidemiology

Infectious Diseases

SARS-CoV-2

Influenza

environmental isolation

COVID-19 can be traced to an initial cluster of novel human pneumonia cases occurring in Wuhan City, China, in December, 2019, with the earliest date of symptom onset reported to be December 1, 2020. The World Health Organization (WHO) was officially notified about the infection on January 3, 2020, with the first sequence released on January 11 [1,2]. The first case officially reported in the United States was from the state of Washington, occurring in a person who arrived in Seattle from Wuhan on January 15, and become ill 4 days later. The first U.S. cases assumed to be due to community transmission occurred in Santa Clara County, California, in early February [3]. Rapid spread of the virus across the United States was documented by additional case reports from Illinois, Arizona, Massachusetts, Wisconsin, and Texas, with a total of 16 cases reported to CDC through February 20 [4]. We report here the identification of SARS-CoV-2 from an environmental surface in Florida on February 21, with sequence analysis showing identity to strains originating in Washington state.

Surface swabs specimens – As part of a pilot study screening high-touch surfaces for respiratory viruses, swabs were used to sample 25 cm² areas of the outside handle of the main entrance door of a joint teaching and office building housing the Colleges of Public Health and Health Professions, Nursing, and Pharmacy at a major Florida university. Over 300 persons were estimated to pass through the entrance during a normal school week (Monday through Friday). Samples were obtained from 1 to 5 February and from 19 February to 4 March, 2020; the dates chosen were arbitrary. Because the door handle was cleaned early each morning, swab samplings were performed after most classroom sessions, typically between 6 – 7 PM, to allow for fresh daily accumulation of hand-deposited microorganisms.

As previously reported by our group [5], flocked nylon swabs pre-moistened with phosphate-buffered saline were used for surface samplings, after which they were immersed into 1.0 mL universal transport medium (UTM) (COPAN Diagnostics, Inc., Murrieta, CA, USA). Swab samples were immediately transported to a BSL2-plus laboratory in a nearby building, material on the swab was extruded into the UTM, and the collection tube frozen at -80°C pending molecular and virology analyses. For molecular tests, RNA was purified by using a QIAamp Viral RNA Mini Kit (Qiagen, Valencia, CA, USA). Influenza A and B virus genomic RNAs were detected by RT-PCR directed at the HA and NA genes [6]. The primers and probes for the CDC 2019-Novel Coronavirus (2019-nCoV) rtRT-PCR test and an in-house (UF) test [7](Table 1) were synthesized by and purchased from Integrated DNA Technologies (Coralville, Iowa, USA). For both UF primer sets, the level of detection using synthesized oligonucleotide target sequences was approximately 5 genome equivalents with at least 95% detection probability per 25 µl PCR test. Neither UF N nor RdRp primer sets detect SARS or MERS CoV genomic RNA, or human RNA sequences. They do not detect human coronavirus OC43, NL63, or 229E genomic RNAs at approximately 1 x 10⁵ genome equivalents per 25 µl PCR test, and did not detect corresponding synthesized HKU1 oligonucleotide N and RdRp sequences. The sensitivity of the CDC, UF and the SARS CoV-2 rtRT-PCR test developed by Zhu et al. [8] are similar (data not shown).

In-house developed Madin-Darby canine kidney (MDCK) cells that over-express α2-6-sialylglycan receptors [9] were to isolate influenza viruses. As previously described [7], the African green monkey kidney cell line Vero E6, obtained from the American type culture collection (catalog no. ATCC CRL-1586), was used for attempts to isolate SARS-CoV-2.

For influenza virus, after about 50% of the killed cells had detached from the growing surface, virus genomic RNAs (vRNAs) were purified from virions in the cell growth media. The vRNAs served as templates to construct a cDNA library using an NEBNext Ultra RNA Library Prep Kit (New England BioLabs® Inc.) followed by sequencing on an Illumina MiSeq sequencer using a version 3 chemistry 600 cycle kit.The complete genome sequence of SARS-CoV-2 in the environmental sample (designated as UF-11) was determined using a genome walking strategy [10]. Briefly, cDNA was produced using AccuScript high-fidelity reverse transcriptase (Agilent Technologies, Santa Clara, CA) and sequence-specific primers based on SARS-CoV-2 WIV04 (GenBank MN996528.1). The resulting cDNA was amplified by PCR with Q5 polymerase (New England BioLabs) and non-overlapping gene-specific primers. The 5′ and 3′ ends of the SARS-CoV-2 genome were determined using a Rapid Amplification of cDNA Ends (RACE) kit (Life Technologies, Inc., Carlsbad, CA, USA), and the resulting sequences were assembled with Sequencher DNA sequence analysis software version 2.1 (Gene Codes, Ann Arbor, MI, USA).

Phylogenetic analyses

SARS-CoV-2 full or nearly-full genome (>29,000 bp) sequences, with a collection date prior or equal to March 6^th 2020, were downloaded from GISAID on August 18^th 2020. Genomes were subsequently filtered according the following exclusion criteria: 1) sequences with more than 150 uncertain nucleotides due to missing data and/or poor sequence quality; 2) sequences missing sampling date, and 3) sequences missing sampling location. After filtering, 2,439 genomes, including 17 new UF isolates (UF1-UF17), were retained and aligned using MAFFT [11]. Sequences identical or highly similar to UF11 were identified by BLAST. We found 75 identical sequences with a length of 29,596 bp (99+% of UF11 length), no insertion/deletion, nor nucleotide mismatches. We also found 360 similar genomes, defined as genomes with total of nucleotide mismatches < 6 (in coding regions, each long gap in multiples of three, if present, was also treated as a single mutational event). The threshold for highly similar genomes (0 < nucleotide difference < 6) was chosen by calculating the 95% confidence interval of the number of total mutations expected to accumulate, between January and March 2020, among UF-11 and other genomes potentially belonging to the same transmission chain. The mutational process was assumed to be Poisson distributed, with a mean evolutionary rate of 2.4 10^-4 nucleotide substitutions per year, independently calculated using a data set of 11,262 full genome sequences available in GISAID on April 25^th 2020 [12].

The 2,439 aligned genomes were ranked by similarity to UF-11 by calculating pair-wise Jukes-Cantor (JC) distances. Genomes identical to UF-11 were removed them from the set and the remaining ones were randomly subsampled using the following constraints: 1) final dataset should include min 250 and max 300 sequences; 2) all the UF isolates should be included, and 3) the median genetic diversity of the subsample should be the same as the median of the full data set. The subsampled dataset, representative of the overall diversity of the full data set, included 289 sequences and was used to infer a maximum likelihood tree, with the best fitting nucleotide substitution model and 1,000 bootstrap replicates with the IQTREE software [13]. The presence of sufficient tree-like signal in the subsampled data set was assessed by Likelihood mapping [14] also implemented in IQTREE. Tree branches were scaled in nucleotide substitutions per site since an accurate molecular clock could not be calibrated, given the lack of temporal signal in the phylogeny inferred from the sub-sampled sequences (root to tip distance versus sampling time correlation coefficient < 0.1).

Influenza virus H1N1pdm09 RNA was detected by RT-PCR in samples collected for three consecutive days in February (19 – 21 Feb.) and one day in March (2 March). Tests for H1N1 pdm09 HA and NA genes were positive, whereas no Influenza H3N2 or B-Yamagata or B-Victoria sequences were amplified. Viable virus was isolated from each of the four H1N1pdm09-positive specimens. In a retrospective analysis, since SARS-CoV-2 was not part of the planned pilot study, remaining RNA purified from the samples was tested for the presence of SARS-CoV-2 RNA. One sample, from 21 Feb 2020, was positive (Table 2) using both N- and RdRp primers.

Sequencing of the Influenza H1N1 RNA revealed that the four Influenza viruses belonged to hemagglutinin (HA) clade 6B.1A1. The sequences of the H1N1 viruses detected in this study were identical to a strain known to be in circulation in the USA; all eight genes of A/ENV/GNVL/03/2020 were deposited in GenBank (accession numbers identified in Table 2).

Unlike the influenza viruses, we were unable to isolate SARS-CoV-2 in cell cultures. The amount of virus in the original sample was expected to be low and insufficient for our NGS approach. Therefore, Sanger sequencing was used to obtain the virus’ genome sequence (GenBank Accession no. MT476384.1), and revealed that the virus belonged to clade S, an early genetic lineage of the virus which retains a D at aa 614; this strain was designated as UF-11.

According to the ML tree (Figure 1), the various UF strains cluster within different, well supported clades related to other sequences from the USA (mostly from Washington state) and Europe (Belgium, Denmark, France, Germany, Greece, Iceland, Italy, Portugal, Spain, UK), indicating multiple, separate introductions of the virus into this region of Florida between February and April 2020. The phylogenetic analysis included sequence data from 17 isolates from our institution (UF-1 to UF-17); among these, only UF-1 was closely related to UF-11 (Figure 1). UF-1, sampled on March 10^th 2020, displays only one mutation (one nt mismatch) compared to UF11. Interestingly, UF-1 was isolated from the first COVID-19 case at our institution, who had been transported over 100 miles from South Georgia for care at UFHealth in Gainesville; the patient had no recent travel history, including a history of travel to Gainesville. Unfortunately, while the sequence data set overall displayed sufficient signal for tree inference (Figure 1), the concomitant presence of phylogenetic noise (31.5%) resulted in poor resolution of branching patterns between and within major clades, making it impossible to establish exact routes of introductions from specific European countries or USA states to Florida.

The temporal distribution analysis of SARS-CoV-2 full genome sequences, identical or highly similar to UF-11, thus representing sequences potentially linked through the same transmission chain, was more revealing (Figure 2). UF-11 was sampled on Feb 21^th 2020, at the same time of sampling of two identical sequences in Washington state (Figure 2A). The Washington/UF-11 genome continued to expand clonally in Washington, likely through a series of closely related transmissions, with occasional spillovers in Utah, Vermont and North Carolina, as shown by the increase in the number of identical genomes isolated from other patients between Feb 22^nd and Mar 6^th. The temporal distribution analysis of highly similar genomes, likely linked through a direct transmission chain (see Methods), also shows that the earliest sequences were sampled in China/Hubei between Dec 30^th 2019 and Jan 5^th and 2020 (Figure 2B), thus indicating a direct link between Washington/UF-11 and the strains circulating right after the emergence of the first known outbreak in China. The following two weeks, similar strains were sampled in Thailand (Jan 8^th, 13^th and 15^th) and finally in Washington state (Jan 19^th). Interestingly, between Jan 21^st and Feb 21^st 2020, the frequency of the strains increased and then began to decline in Asia, while, by Feb 21^st it started to increase in the USA, matching the results of the identical genomes temporal distribution analysis (Figure 2A). Again, occasional spillovers were observed, in Canada and other USA states.

Overall, the results are compatible with a scenario of an early introduction (early/mid-Jan 2020) in Washington state of the Washington/UF11 strain from Asia (likely China/Hubei), followed by a dissemination in Asia and USA, and the subsequent introduction to Florida in early/mid-Feb 2020 from Washington state. Notably, UF-11 did not appear to have spread successfully in Florida, since only two other Florida genomes available in GISAID, sampled on Feb 28^th and March 5^th 2020, respectively (Figure 2b), were found to be highly similar to UF-11 (although this could have been the result of under-sampling).

Our data are consistent with rapid dissemination nationally of the Washington state SARS-CoV-2 strain. Finding the strain on the handle of the main entry door to a major university building was unexpected, but highlights the ease with which this (and other respiratory viruses) can contaminate high-touch surfaces. It is not implausible that the handle was contaminated by an asymptomatic or mildly symptomatic individual, who initially acquired the infection in Seattle (or, possibly, in China) and then traveled to the University. The isolation of virtually identical influenza strains on four different days from this same door handle provides further evidence of the validity of the methodology; identification of these strains was not unexpected, as influenza strains within this clade were known to be present in the community at the time the sampling was done.

Interestingly, we did not see evidence that the SARS-CoV-2 strain identified was the basis for subsequent emergence and spread of COVID-19 in our region; this may reflect low viral numbers on the handle, and/or a decreased likelihood of SARS-CoV-2 transmission from surfaces and fomites. Instead, our data are consistent with multiple local introductions of SARS-CoV-2 from different countries/regions, as reflected in the divergence of sequences seen among our UF strains. Keeping in mind that the door handle isolation occurred less than 8 weeks after the first official report of the virus from China (and less than 12 weeks after the first reported case in Wuhan), the speed with which the virus moved is impressive. This is further underscored by the isolation 18 days later of a similar, but not identical, strain from our first clinical patient, whose home was over 100 miles away. In today’s world of rapid, global transportation, these findings underscore the risk of rapid, cryptic community introduction and transmission of emerging pathogens, well before cases begin to be identified.

Ethics Approval: no human subjects involved in study; environmental screening only.

Availability of Data: genetic sequences deposited in GenBank; GenBank numbers included in Table 2.

Competing interests: authors declare that they have no completing interests.

Funding: Work was funded in part by an internal grant from the Clinical and Translational Science Institute, University of Florida.

Authors Contributions: Conceived project, performed relevant studies, and assisted with preparation of the manuscript: JL, MS, JGM; performed relevant studies, and assisted with preparation of manuscript: KS, TW, TS-A, JCL, SJ, MT, SM,MA, CS, ME

World Health Organization. https://www.who.int/news-room/detail/29-06-2020-covidtimeline. Accessed 8/28/2020.
Liu Y-C, Kuo R-L, Shih S-R. COVID-19: The first documented coronavirus pandemic in history. Biomedical Journal, https://doi.org/10/1016/j/bj.2020.04.007.
CDC COVID-19 response team. Evidence for limited early spread of COVID-19 within the United States, January-February 2020. Morbid Mortal Weekly Rep. 2020;69:680-684.
https://en.wikipedia.org/wiki/Timeline_of_the_COVID-19_pandemic_in_February_2020#1_February. Accessed 8/28/2020.
Memish ZA, Almasri M, AssirriA, Al-Shangiti AM, Gray GC, Lednikcy JA, Yezli S. Environmental sampling for respiratory pathogens in Jedah airport during the 2013 Hajj season. Am J Infect Control 2014;42:1266-1269.
Lednicky JA, Lauzardo M, Fan H, Jutla AS, Tilly TB, Gangwar M, Usmani M, Shankar SN, Mohamed K, Eiguren-Fernandez A, Stephenson CJ, Alam MM, Elbadry MA, Loeb JC, Subramaniam K, Waltzek TB, Cherabuddi K, Morris JG, Wu C-Y. Viable SARS-CoV-2 in the air of a hospital room with COVID-19 patients. medR_xiv doi: https://doi.org/10.1101/2020.08.03.20167395.
Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, et al.; China Novel Coronavirus Investigating and Research Team. A Novel Coronavirus from Patients with Pneumonia in China, 2019. N Engl J Med. 2020 Feb;382(8):727–33.
Iovine NM, Morris JG, Fredenburg K, Rand KH, Alnuaimat H, Lipori G, Brew J, Lednicky J. Severity of influenza A(H1N1) illness and emergence of D225G variant, 2013-14 influenza season, Florida, USA. Emerg Infect Dis 2015;21:664-667.
Lednicky JA, Shankar SN, Elbadry MA, Gibson JC, Alam Md M, Stephenson CJ, Eiguren-Fernandez A, Morris JG, Mavian CN, Salemi M, Clugston JR, Wu C-Y. Collection of SARS-CoV-2 virus from the air of a clinic within a university student health care center and analyses of the viral genomic sequence. Aerosol and Air Quality Res. 2020 doi.org/10.4209/aaqr.2020.05.0202.
Nakamura, Yamada, Tomii, Katoh. Parallelization of MAFFT for large-scale multiple sequence alignments. Bioinformatics 34, 2490–2492, 2018.
Rife Magalis, Ramirez-Mata, Zhukova, Mavian, Marini, Lemoine, Prosperi, Gascuel, Salemi. Differing impacts of global and regional responses onSARS-CoV-2 transmission cluster dynamics. Submitted.
Nguyen, Schmidt, von Haeseler, Minh. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum likelihood phylogenies. Biol. Evol. 32, 268-274, 2015.
Strimmer, von Haeseler. Likelihood-mapping: A simple method to visualize phylogenetic content of a sequence alignment. Proc Natl Acad Sci U S A. 94(13), 6815–6819, 1997.

Table 1. Primers and probes for rtRT-PCR analyses.

Test	Primer/probe name	Description	Oligonucleotide sequence (5’ to 3’)	Label
CDC	2019-nCoV_N1-F	N1 Forward Primer	5’-GACCCCAAAATCAGCGAAAT-3’	None
	2019-nCoV_N1-R	N1 Reverse Primer	5’-TCTGGTTACTGCCAGTTGAATCTG-3’	None
	2019-nCoV_N1-P	N1 Probe	5’-FAM-ACCCCGCATTACGTTTGGTGGACC-BHQ1-3’	FAM, BHQ1
	2019-nCoV_N2-F	N2 Forward Primer	5’-TTACAAACATTGGCCGCAAA-3’	None
	2019-nCoV_N2-R	N2 Reverse Primer	5’-GCGCGACATTCCGAAGAA-3’	None
	2019-nCoV_N2-P	N2 Probe	5’-FAM-ACAATTTGCCCCCAGCGCTTCAG-BHQ1-3’	FAM, BHQ1
UF	Led-N-F	N Forward Primer	5’ – GGGAGCAGAGGCGGCAGTCAAG - 3’	None
	Led-N-R	N Reverse Primer	5’ – CATCACCGCCATTGCCAGCCATTC – 3’	None
	Led-N-Probe	N Probe	5’ FAM -CCTCATCACGTAGTCGCAACAGTTC- BHQ1-3’	FAM, BHQ1
	Led-RdRp-F	RdRp Forward Primer	5’– GGTGGAACCTCATCAGGAGATGC-3’	None
	Led-RdRp-R	RdRp Reverse Primer	5’– CCATCAGTAGATAAAAGTGCATTAAC– 3’	None
	Led-RdRp-Probe	RdRp Probe	5’ FAM–CTGCTTATGCTAATAGTGTTTTTAAC-BHQ1–3’	FAM, BHQ1

TaqMan® probes are 5'-end labeled with the reporter molecule 6-carboxyfluorescein (FAM) and with quencher Black Hole Quencher 1 (BHQ-1) at the 3'- end.

Table 2. Detection of influenza and SARS-CoV-2 in environmental swab sample.

Sampling day (year 2020)	RT-PCR Detection of influenza or SARS-CoV-2	Virus isolation	Virus designation	GenBank accession #
1 -5 Feb.	-	-	-	-
19 Feb.	H1N1pdm09	+	A/environment/FL/ENV1/2020(H1N1)	(Not deposited)
20 Feb.	H1N1pdm09	+	A/environment/FL/ENV2/2020(H1N1)	(Not deposited)
21 Feb.	H1N1pdm09	+	A/environment/FL/ENV3/2020(H1N1)	MT474139.1 to MT474146.1
21 Feb.	SARS-CoV-2	-	SARS-CoV-2/ENV/USA/UF-11/2020	MT476384.1
22 -29 Feb.	-	-	-	-
1 Mar.	-	-	-	-
2 Mar.	H1N1pdm09	+	A/environment/FL/ENV4/2020(H1N1)	(Not deposited)
3-4 Mar.	-	-	-	-

Download PDF

Version 1

posted

You are reading this latest preprint version

Earliest Detection to Date of SARS-CoV-2 in Florida: Identification Together With Influenza Virus on the Main Entry Door of a University Building, February 2020

Status:

Version 1

Abstract

Figures

Introduction

Methods

Results

Discussion

Declarations

References

Tables

Status:

Version 1