Metagenomic Analysis of a Throat Swab Sample Collected in China on A Patient Infected With Varicella Zoster Virus

doi:10.21203/rs.3.rs-182926/v1

Research Article

This work is licensed under a CC BY 4.0 License

Journal publication: published 06 Jul, 2021 in Scientific Reports

Varicella Zoster Virus (VZV) is endemic worldwide, causing varicella in children and zoster upon reactivation in adults. This study concerned a metagenomic analysis of a throat swab sample collected in China from a young patient suffering from Systemic Lupus Erythematosus (SLE) and diagnosed with varicella. The complete genome sequence of a VZV strain of clade 2 has been generated. Clade 2 strains are the most prevalent in Asian countries. A comparison of 223 VZV genomes identified 77 clade-specific markers, 20 of them specific to clade 2. The metagenomic analysis also identified sequences covering most of the genome of the bacterium Schaalia odontolytica, also known as Actinomyces odontolyticus. VZV infection and bacterial infection in the context of SLE are further discussed. Even though the patient presented only mild symptoms, this study is a reminder that vaccination against VZV is critical to avoid severe complications like bacterial superinfection or even death in the case of immunodeficiency.

Epigenetics & Genomics

Biotechnology and Bioengineering

Infectious Diseases

Immunology

Keywords: VZV, SLE, metagenomic, bacteria

Varicella Zoster Virus (VZV) belongs to the alphaherpesviruses and is also known as human herpesvirus 3 (HHV3) ¹. The VZV genome is a double-stranded DNA of around 125 kb. It consists of two regions, large and short (L and S), each consisting of a unique sequence (U) and repeat sequences (R). Each U region is flanked by repeat sequences, called terminal (TR) and internal (IR). In addition to the genome structure TR_L-U_L-IR_L-IR_S-U_S-TR_S, the genome contains 6 repeat regions called IR1, 2, 3, 4a, 5 and 4b. IR4 is located in IR_S and is inversely duplicated in TR_S. The genome encodes 73 open reading frames (ORFs), 3 of them being inversely duplicated in TR_S. Sequence analysis identified 7 distinct clades (1, 2, 3, 4, 5, 6, 9). A putative clade VIII has been reported once ².

VZV infection usually causes varicella (chickenpox) in children and zoster upon reactivation in adults. In addition to varicella and zoster, VZV infection has been associated with severe complications like bacterial superinfection, pneumonia, hepatitis, nephritis, encephalitis and even death ¹.

A live attenuated VZV vaccine has been developed based on the clade 2 Oka strain. Many countries have now introduced the varicella vaccine into their universal routine vaccination programs and administer 2 doses of vaccine to control and prevent chickenpox ³. In China, the varicella vaccine became available in 1998 and is currently available in the private sector.

This study concerned a metagenomic analysis of a throat swab sample collected in China from a young patient suffering from lupus and diagnosed with varicella. In addition to generating the complete genome sequence of the VZV strain contained in the throat swab sample, the metagenomic analysis also identified bacterial sequences present in the sample. Finally, the association between VZV and lupus is discussed.

Ethical statement

The second session of the Ethics Review Committee of the National Institute for Viral Disease Control and Prevention (IVDC) at the Chinese Center for Disease Control and Prevention (CDC) determined that the present study followed the working regulations of the ethics review committee of the IVDC and therefore approved the study. The legal guardians of the patient involved in this study provided written informed consent to have data/samples from her medical records used in research. All methods were performed in accordance with the relevant guidelines and regulations.

Clinical sample

A throat swab sample was collected from a 10-year-old girl diagnosed with varicella at Children's Hospital of Chongqing Medical University. The date of onset was estimated as December 21st, 2018. The patient developed a rash all over her body after contact with her sister, who had chickenpox. She began to exhibit macular papules that gradually turned into a herpetic rash with itching, oral pain and no salivation. During the course of the disease, there was no fever, spasm, disturbance of consciousness, headache, vomiting, unstable walking, skin/mucous membrane bleeding, abdominal pain or diarrhea. At the age of 9, the patient was diagnosed with Systemic Lupus Erythematosus (SLE) and Lupus Nephritis (LN) and was treated with glucocorticoids and immunosuppressants. The sample was collected in viral transport medium and stored at -80 °C until use.

Next Generation Sequencing (NGS)

Total DNA was extracted using the QIAamp DNA Mini Kit (QIAGEN, Germany) and then fragmented using the S220 ultrasonicator (Covaris, USA). A sequencing library was generated with the KAPA HyperPlus kit (Roche, Switzerland) and sequenced with the NovaSeq 6000 system (Illumina, USA). The paired-end reads have been deposited in the NCBI Sequence Read Archive under the accession number PRJNA681411.

NGS analysis pipeline

Sequence quality was assessed using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Sequences were cleaned up with Trimmomatic ⁴ (http://www.usadellab.org/cms/index.php?page=trimmomatic). Reads originating from the human genome (GRCh38) were depleted using the aligner Burrows-Wheeler Alignment tool (BWA) and Sequence Alignment/Map (SAM) tools 1.9 ⁵,⁶. The remaining sequences were processed in 2 ways: (1) assembly with SPAdes ⁷ (http://bioinf.spbau.ru/spades); (2) mapping against the VZV reference strain Dumas (NC_001348) using BWA and SAMtools 1.9 ⁵,⁶. The VZV-related sequences were then de novo assembled using Sequencher 5.0 (Genecodes Corp., Ann Arbor, MI, USA). The sequencing strategy as well as the number of reads at each step of the analysis pipeline are shown in Fig. S1. The full-length genome sequence was annotated in Artemis 16.0.0 ⁸ based on the Dumas reference genome (NC_001348) and submitted to GenBank (MW316406). The viral strain was named VZV/Chongqing.CHN/2018/V[2]. For convenience, however, the sample is referred to in the manuscript as SD14.
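The steps of this pipeline can be sketched as a list of shell commands. This is only an illustrative sketch: the manuscript does not report the exact command lines, so the choice of `bwa mem`, the Trimmomatic thresholds, the `trimmomatic` wrapper name, the output file names and the function `build_pipeline_commands` itself are all assumptions. The function only builds command strings and runs nothing; both reference FASTA files are assumed to have been indexed with `bwa index` beforehand.

```python
from typing import List


def build_pipeline_commands(r1: str, r2: str, human_ref: str, vzv_ref: str,
                            outdir: str = "out") -> List[str]:
    """Return one shell command string per pipeline step (nothing is executed).

    All parameters and file names are hypothetical; adapt them to the
    local installation before use.
    """
    trim1, trim2 = f"{outdir}/trim_1.fq.gz", f"{outdir}/trim_2.fq.gz"
    clean1, clean2 = f"{outdir}/nonhuman_1.fq", f"{outdir}/nonhuman_2.fq"
    return [
        # 1. Quality assessment of the raw paired-end reads.
        f"fastqc {r1} {r2} -o {outdir}",
        # 2. Trimming (thresholds are assumptions, not from the manuscript).
        f"trimmomatic PE {r1} {r2} {trim1} {outdir}/unpaired_1.fq.gz "
        f"{trim2} {outdir}/unpaired_2.fq.gz SLIDINGWINDOW:4:20 MINLEN:36",
        # 3. Human depletion: map to GRCh38 and keep pairs in which both
        #    mates are unmapped (-f 12), dropping secondary alignments (-F 256).
        f"bwa mem {human_ref} {trim1} {trim2} | "
        f"samtools view -b -f 12 -F 256 - > {outdir}/nonhuman.bam",
        f"samtools fastq -1 {clean1} -2 {clean2} {outdir}/nonhuman.bam",
        # 4a. De novo assembly of the non-human reads.
        f"spades.py -1 {clean1} -2 {clean2} -o {outdir}/spades",
        # 4b. Mapping to the Dumas reference, keeping mapped reads only (-F 4).
        f"bwa mem {vzv_ref} {clean1} {clean2} | "
        f"samtools view -b -F 4 - > {outdir}/vzv_mapped.bam",
    ]


if __name__ == "__main__":
    for cmd in build_pipeline_commands("SD14_1.fq.gz", "SD14_2.fq.gz",
                                       "GRCh38.fa", "NC_001348.fa"):
        print(cmd)
```

Keeping the commands as data rather than executing them makes it easy to review each step and to record the read count after every stage, as in Fig. S1.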

VZV sequence analysis

Two hundred thirty-two VZV genomes were downloaded from GenBank. Identical or incomplete sequences were discarded. The remaining 222 genome sequences were compared with the de novo SD14 sequence (Table S1). As previously reported, the genome sequence was analyzed based on 8 regions (A to H), excluding the repeats TR_L, IR1, IR2, IR3 and TR_S ²,⁹. Sequences were aligned using MAFFT 7.311 ¹⁰ (http://maf.cbrc.jp/alignment/sofware/). Alignments were analyzed using BioEdit 7.0.4.1 ¹¹ (http://www.mbio.ncsu.edu/bioedit/bioedit.html). Single nucleotide polymorphisms (SNPs) found in each region were concatenated. Phylogenetic trees were generated with MEGA 6 ¹². Neighbor joining (NJ) trees were generated with the maximum composite likelihood nucleotide substitution model ¹². The phylogenetic inference was tested using the bootstrap method with 1000 replicates ¹³. Bootstrap values greater than 70% were indicated. Phylogenetic trees were also generated using the maximum likelihood (ML) method in MEGA ¹⁴. SNPs found only in the SD14 genome were analyzed in Protein Variation Effect Analyzer (PROVEAN, http://provean.jcvi.org/seq_submit.php) in order to check whether these mutations had any effect on protein function ¹⁵.
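The SNP concatenation step can be illustrated with a minimal sketch. It assumes the alignment is already available as a dictionary of equal-length strings; the function name `concatenate_snps`, the toy sequences and the decision to skip columns containing gaps or ambiguous bases are assumptions made for illustration, not the procedure used in BioEdit.

```python
from typing import Dict, List, Tuple


def concatenate_snps(alignment: Dict[str, str]) -> Tuple[List[int], Dict[str, str]]:
    """Find variable alignment columns and concatenate them per sequence.

    Returns the 0-based indices of the SNP columns and, for each sequence,
    the string made of its bases at those columns.
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    if any(len(seq) != length for seq in alignment.values()):
        raise ValueError("sequences must be aligned to the same length")

    snp_columns = []
    for i in range(length):
        column = {alignment[name][i].upper() for name in names}
        # Keep columns with at least two different unambiguous bases;
        # columns with gaps or ambiguity codes are skipped (assumption).
        if column <= set("ACGT") and len(column) > 1:
            snp_columns.append(i)

    concatenated = {
        name: "".join(alignment[name][i].upper() for i in snp_columns)
        for name in names
    }
    return snp_columns, concatenated


if __name__ == "__main__":
    # Toy alignment; the sequences are invented for the example.
    toy = {"Dumas": "ACGTACGT", "SD14": "ACGTACGA", "pOka": "ACCTACGA"}
    print(concatenate_snps(toy))
```

The concatenated strings are what a tree-building program would take as input in place of the full-length alignment.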

Data availability

The paired-end reads have been deposited in the NCBI Sequence Read Archive under the accession number PRJNA681411. The full-length genome sequence of SD14 strain VZV/Chongqing.CHN/2018/V[2] was submitted to GenBank (MW316406).

Close to 230 million sequencing reads were analyzed (Fig. S1). VZV-related sequences (11,275 sequences mapped on the VZV reference genome sequence strain Dumas (NC_001348)) were assembled in Sequencher. The de novo assembly generated 4 contigs. Gap sizes were estimated based on the Dumas strain and ranged from 11 to 18 nucleotides. The SD14 full genome was estimated at 125,184 nt long with 46.13% G+C content. This is in the same range as other HHV3 genomes. For example, the Dumas strain was estimated at 124,884 nt and 46.02% G+C ¹⁶. As expected, the SD14 genome encoded all 73 known ORFs, with ORF62, 63 and 64 being duplicated in reverse direction in the TR_S. A comparison with 222 genomes identified 12 SNPs found only in the SD14 genomic sequence (Table 1). All 12 positions were located within ORFs but only 3 nucleotide substitutions were non-synonymous changes: L135R in ORF6, K98T in ORF37 and N47S in ORF54. None of these 3 non-synonymous substitutions was predicted to have an effect on protein function based on PROVEAN analysis.
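The distinction between synonymous and non-synonymous substitutions can be made concrete with a short sketch based on the standard genetic code. The function name `classify_substitution` is hypothetical, and the codons used in the example are invented: they merely encode the same amino acid changes as those reported above, since the actual codons are not given in the text.

```python
from itertools import product
from typing import Tuple

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_substitution(ref_codon: str, alt_codon: str) -> Tuple[str, str, bool]:
    """Return (reference amino acid, variant amino acid, is_synonymous)."""
    ref_aa = CODON_TABLE[ref_codon.upper()]
    alt_aa = CODON_TABLE[alt_codon.upper()]
    return ref_aa, alt_aa, ref_aa == alt_aa


if __name__ == "__main__":
    # Hypothetical codons chosen only to reproduce the amino acid changes
    # L->R, K->T and N->S, plus one synonymous change for contrast.
    for ref, alt in [("CTG", "CGG"), ("AAA", "ACA"), ("AAT", "AGT"), ("CTG", "CTA")]:
        print(ref, alt, classify_substitution(ref, alt))
```

Only the non-synonymous changes need to be passed on to a tool such as PROVEAN, because synonymous changes leave the protein sequence unchanged.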

The comparative genomic analysis of 223 genomes identified 2880 SNPs (Table S2). Phylogenetic trees based on the 2880 concatenated SNPs identified in the 223 genomes were generated (Fig. 1, Fig. S2). Both NJ and ML trees showed that the SD14 strain belonged to clade 2 and could therefore be formally named VZV/Chongqing.CHN/2018/V[2]. The SNP analysis identified 77 positions that were conserved within a clade and could therefore be considered as clade markers (Fig. 2, Tables S2-3). The number of markers was highly variable depending on the clade, from 20 in clade 2 to only 1 in clade 3. Furthermore, the distribution of the markers throughout the genome was not random. For example, 2 markers were identified within the 70-80 kb region of the genome whereas 15 markers were identified within the 90-100 kb genomic region. Finally, 34 of the 70 ORFs featured at least one clade-specific marker. Relative to ORF size, ORF60, with 2 markers among 480 amino acids (AA), had the highest marker density, whereas ORF31, with 2 markers among 2796 AA, had the lowest. Regarding the 20 clade 2 specific markers (in purple in Fig. 2), 18 were located within ORFs and 6 were non-synonymous changes: C1159R in ORF28, T136P in ORF31, P374F in ORF33, E128D in ORF54, H69P in ORF57 and A107T in ORF60 (Tables S2-3). The vaccine strains related to the clade 2 Oka strain featured an additional marker (g911191t) (Tables S2-3). SD14 did not feature this vaccine marker and was therefore not related to vaccine strains, as was also shown in the phylogenetic trees (Fig. 1, Fig. S2). Phylogenetic trees of the 8 genomic regions were generated in order to identify any major recombination event (Fig. S3). Whereas some recombination events were identified among VZV genomes from clades 3, 6 and 9 (identified with * in Fig. S3), no evidence of a major recombination event was identified for the SD14 genome, as SD14 sequences were always found within the clade 2 cluster.
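One way to read the notion of a clade marker is sketched below: a SNP column is called a marker for a clade when every member of that clade shares a single allele and that allele is absent from all other clades. This definition, the function name `clade_markers` and the toy data are assumptions made for illustration; the manuscript does not spell out the exact criterion used.

```python
from typing import Dict, List


def clade_markers(snps: Dict[str, str], clades: Dict[str, str]) -> Dict[str, List[int]]:
    """Map each clade to the 0-based SNP columns that are specific to it.

    snps   : strain name -> concatenated SNP string (all of equal length)
    clades : strain name -> clade label
    """
    n_columns = len(next(iter(snps.values())))
    markers: Dict[str, List[int]] = {c: [] for c in sorted(set(clades.values()))}
    for i in range(n_columns):
        for clade in markers:
            inside = {seq[i] for name, seq in snps.items() if clades[name] == clade}
            outside = {seq[i] for name, seq in snps.items() if clades[name] != clade}
            # Conserved within the clade and never seen outside of it.
            if len(inside) == 1 and not (inside & outside):
                markers[clade].append(i)
    return markers


if __name__ == "__main__":
    # Invented toy data: five strains, three SNP columns, three clades.
    toy_snps = {"s1": "ATG", "s2": "ATG", "s3": "CTG", "s4": "CTA", "s5": "ACG"}
    toy_clades = {"s1": "1", "s2": "1", "s3": "2", "s4": "2", "s5": "3"}
    print(clade_markers(toy_snps, toy_clades))
```

Under this definition a clade represented by a single genome gets a marker wherever that genome carries a private allele, so markers for poorly sampled clades should be interpreted with care.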

Close to 230 million sequencing reads were depleted of human sequences (genome GRCh38) (Fig. S1). The remaining sequencing reads (~33 million, 14.4% of the sequencing data) were assembled using SPAdes and 159,894 contigs were generated (Table S4). The size of the contigs was highly variable, from 78 nt to 0.5 Mb, with an average of 731 nt. A blastn search among the 100 largest contigs is summarized in Table 2. The bacterium Schaalia odontolytica was the most prevalent hit, with 15 hits and a cumulative contig size of 2.2 Mb.
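The contig statistics and the per-organism totals reported above can be computed with a few lines of code. The sketch below assumes that contig lengths and the top blastn hit of each contig have already been extracted; the function names and the toy values are hypothetical and do not reproduce Table 2 or Table S4.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def contig_length_stats(lengths: List[int]) -> Tuple[int, int, float]:
    """Return (shortest, longest, mean) contig length in nucleotides."""
    return min(lengths), max(lengths), sum(lengths) / len(lengths)


def summarize_hits(hits: Iterable[Tuple[int, str]]) -> List[Tuple[str, int, int]]:
    """Aggregate (contig length, top-hit organism) pairs per organism.

    Returns (organism, number of contigs, cumulative contig size) tuples,
    sorted by decreasing cumulative size.
    """
    totals: Dict[str, List[int]] = defaultdict(lambda: [0, 0])
    for length, organism in hits:
        totals[organism][0] += 1
        totals[organism][1] += length
    rows = [(org, count, size) for org, (count, size) in totals.items()]
    return sorted(rows, key=lambda row: row[2], reverse=True)


if __name__ == "__main__":
    # Invented toy values, for illustration only.
    print(contig_length_stats([78, 500, 1615]))
    print(summarize_hits([
        (500, "Schaalia odontolytica"),
        (300, "Schaalia odontolytica"),
        (400, "other organism"),
    ]))
```

Dividing the cumulative contig size of an organism by the size of its reference genome gives a rough upper bound on the fraction of that genome represented in the assembly, since overlapping contigs are counted more than once.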

This report concerned the metagenomic analysis of a throat swab sample collected in China from a young VZV patient. The phylogenetic analysis showed that the VZV strain in this sample belonged to clade 2. Seven clades have been identified for VZV; one additional clade (VIII) is putative, as only one strain has been reported so far (reviewed in ²). Clades 1 and 3 are mainly observed in Europe and the Americas whereas clades 4 and 5 are frequently seen in people of African origin ¹⁷,¹⁸. Clade 2 has been the dominant clade in Asian countries like Korea, Japan and China. Clades 6 and 9 have been reported recently and there is not enough data to conclusively assign a geographic region to these clades.

The number of complete VZV genome sequences has dramatically increased recently, reaching 232 as of December 2020. Comparative genomic analysis showed that the VZV genome is very stable. The current analysis compared 223 genomes and identified 2880 SNPs, representing 2.3% of the genome. Several studies reported recombination events among VZV ⁹,¹⁹. The present study did not identify any obvious recombination event within the SD14 strain. Despite recombination events among VZV genomes, the present study identified positions that were conserved within clades, suggesting that the genomic regions containing these positions were not involved in recombination.

The present study concerned a throat swab sample. VZV samples are generally collected from rash vesicles. Among the 222 genome sequences analyzed in the present study, 137 (62%) were derived from vesicle fluid or skin lesions. Unfortunately, the sample information was not available for 64 sequences (29%). None of the analyzed sequences was from a throat swab sample. In addition to VZV-related sequences, the present metagenomic analysis identified multiple bacterial sequences. The most prevalent was from Schaalia odontolytica. The genome of Schaalia odontolytica has been estimated at 2.3 Mb (GB ID NZ_CP040006). The present study reported a cumulative contig size of 2.2 Mb, suggesting that most of the bacterial genome can be detected in the sequencing data. This is consistent with, but does not by itself demonstrate, an oral bacterial infection in the patient. Schaalia odontolytica is also known as Actinomyces odontolyticus ²⁰. A. odontolyticus was first isolated in 1958 from persons with advanced dental caries ²¹. In 2003, Tang et al. analyzed root canal infections from 28 Chinese patients and detected A. odontolyticus 16S ribosomal DNA in 30% of the cases ²². Actinomyces are not generally detected in healthy patients ²³. Even though A. odontolyticus infection is relatively common in patients with dental issues, it has been linked to serious diseases, for example neonatal sepsis ²⁴ or actinomycosis in a pediatric patient ²⁵, as well as in immunosuppressed patients ²⁶. VZV infection can lead to superinfection of the rash by Staphylococcus aureus and Streptococcus pyogenes ²⁷. To our knowledge, a possible association between VZV and A. odontolyticus has not been investigated.

The patient featured in this study suffered from SLE and LN. Multiple reports of severe VZV infections in LN patients can be found in the literature. A recent report described a disseminated VZV infection likely to have caused the death of an LN patient ²⁸. A matched cohort study showed that patients with SLE presented an increased risk of disease flares if they were infected with VZV ²⁹. To our knowledge, the young patient showed only mild symptoms limited to skin rash. It is possible that the young age (10) of the patient might be the reason why she suffered a mild disease despite her immune deficiency. Most of the reports of an effect of VZV on SLE/LN patients concerned older patients with herpes zoster ²⁹,³⁰. The patient was diagnosed with SLE and LN at the age of 9 and was treated orally with 5 mg glucocorticoid daily. After suffering from varicella, she was prescribed proprietary Chinese medicine (Siji Antiviral Oral Liquid), Lysine Inosite and Vitamin B12 Oral Solution, and was treated with topical acyclovir. However, as the treatment was not efficacious, hormonal treatment was replaced by Piperacillin-Tazobactam. In China, patients with SLE are often treated with immunosuppressants and glucocorticoids. As a consequence, these patients are easily affected by external factors and are physically weak. To our knowledge, an association between varicella disease and lupus has not yet been reported.

In summary, this study concerned the metagenomic analysis of a throat swab sample collected in China from a young VZV patient suffering from SLE and LN. The VZV strain identified belonged to clade 2, the clade prevalent in Asian countries. A comparison of 223 VZV genomes identified 77 clade-specific markers, 20 of which were specific to clade 2. The metagenomic analysis identified sequences covering most of the genome of the bacterium Schaalia odontolytica, also known as A. odontolyticus, which has been linked to tooth decay as well as severe complications, especially in immunocompromised patients. Even though the patient presented only mild symptoms, this study is a reminder that vaccination against VZV is critical to avoid severe complications like bacterial superinfection or even death in the case of immunodeficiency.

Acknowledgment

The authors are thankful for the samples and relevant clinical information provided by Children's Hospital of Chongqing Medical University. This study was supported by the Key Technologies R&D Program of the National Ministry of Science (2018ZX10713002).

Author Contributions

W.X., H.X., S.X., H.W., J.W. and R.H. designed the experiments and provided technical consultation and support; H.W., H.X., R.H., W.X. and S.X. planned the workflow and supervised the project. H.G. and P.R. performed the experiments, analyzed data and wrote the manuscript. All authors have read and approved the final version of the manuscript.

Competing interests

The authors declare no competing interests.

1. Arvin, A. & Gilden, D. in Fields Virology Vol. 2 (eds DM Knipe & PM Howley) Ch. 63, 2015–2057 (Lippincott, Williams & Wilkins, 2013).
2. Jensen, N. J. et al. Revisiting the genotyping scheme for varicella-zoster viruses based on whole-genome comparisons. J Gen Virol. 98, 1434–1438. doi:10.1099/jgv.0.000772 (2017).
3. Wutzler, P. et al. Varicella vaccination - the global experience. Expert Rev Vaccines. 16, 833–843. doi:10.1080/14760584.2017.1343669 (2017).
4. Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30, 2114–2120. doi:10.1093/bioinformatics/btu170 (2014).
5. Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 25, 1754–1760. doi:10.1093/bioinformatics/btp324 (2009).
6. Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 25, 2078–2079. doi:10.1093/bioinformatics/btp352 (2009).
7. Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 19, 455–477. doi:10.1089/cmb.2012.0021 (2012).
8. Rutherford, K. et al. Artemis: sequence visualization and annotation. Bioinformatics. 16, 944–945. doi:10.1093/bioinformatics/16.10.944 (2000).
9. Zell, R. et al. Sequencing of 21 varicella-zoster virus genomes reveals two novel genotypes and evidence of recombination. J Virol. 86, 1608–1622. doi:10.1128/JVI.06233-11 (2012).
10. Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 30, 772–780. doi:10.1093/molbev/mst010 (2013).
11. Hall, T. A. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl. Acids. Symp. Ser., 95–98 (1999).
12. Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Mol Biol Evol. 30, 2725–2729. doi:10.1093/molbev/mst197 (2013).
13. Felsenstein, J. Confidence Limits on Phylogenies: An Approach Using the Bootstrap. Evolution. 39, 783–791. doi:10.1111/j.1558-5646.1985.tb00420.x (1985).
14. Tamura, K. et al. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 28, 2731–2739. doi:10.1093/molbev/msr121 (2011).
15. Choi, Y., Sims, G. E., Murphy, S., Miller, J. R. & Chan, A. P. Predicting the functional effect of amino acid substitutions and indels. PLoS One 7, e46688. doi:10.1371/journal.pone.0046688 (2012).
16. Davison, A. J. & Scott, J. E. The complete DNA sequence of varicella-zoster virus. J Gen Virol. 67, 1759–1816. doi:10.1099/0022-1317-67-9-1759 (1986).
17. Pontremoli, C., Forni, D., Clerici, M., Cagliani, R. & Sironi, M. Possible European Origin of Circulating Varicella Zoster Virus Strains. J Infect Dis. 221, 1286–1294. doi:10.1093/infdis/jiz227 (2020).
18. Schmidt-Chanasit, J. & Sauerbrei, A. Evolution and world-wide distribution of varicella-zoster virus clades. Infect Genet Evol. 11, 1–10. doi:10.1016/j.meegid.2010.08.014 (2011).
19. Norberg, P. et al. Recombination of Globally Circulating Varicella-Zoster Virus. J Virol. 89, 7133–7146. doi:10.1128/JVI.00437-15 (2015).
20. Nouioui, I. et al. Genome-Based Taxonomic Classification of the Phylum Actinobacteria. Front Microbiol. 9, 2007. doi:10.3389/fmicb.2018.02007 (2018).
21. Batty, I. Actinomyces odontolyticus, a new species of actinomycete regularly isolated from deep carious dentine. J Pathol Bacteriol. 75, 455–459. doi:10.1002/path.1700750225 (1958).
22. Tang, G. et al. Direct detection of Actinomyces spp. from infected root canals in a Chinese population: a study using PCR-based, oligonucleotide-DNA hybridization technique. J Dent. 31, 559–568. doi:10.1016/s0300-5712(03)00112-x (2003).
23. Qin, T. et al. Super-dominant pathobiontic bacteria in the nasopharyngeal microbiota as causative agents of secondary bacterial infection in influenza patients. Emerg Microbes Infect. 9, 605–615. doi:10.1080/22221751.2020.1737578 (2020).
24. Rueda, M. S., Hefter, Y., Stone, B., Hahn, A. & Jantausch, B. A Premature Infant With Neonatal Actinomyces odontolyticus Sepsis. J Pediatric Infect Dis Soc. (2020).
25. Cho, J. J. & Shupak, R. P. Cervicofacial actinomycosis of the mandible in a paediatric patient. BMJ Case Rep. 13, e233681. doi:10.1136/bcr-2019-233681 (2020).
26. Cone, L. A., Leung, M. M. & Hirschberg, J. Actinomyces odontolyticus bacteremia. Emerg Infect Dis. 9, 1629–1632. doi:10.3201/eid0912.020646 (2003).
27. Ziebold, C., von Kries, R., Lang, R., Weigl, J. & Schmitt, H. J. Severe complications of varicella in previously healthy children in Germany: a 1-year survey. Pediatrics. 108, E79 (2001).
28. Vassia, V. et al. Unusual presentation of fatal disseminated varicella zoster virus infection in a patient with lupus nephritis: a case report. BMC Infect Dis. 20, 538. doi:10.1186/s12879-020-05254-6 (2020).
29. Sun, F. et al. Varicella zoster virus infections increase the risk of disease flares in patients with SLE: a matched cohort study. Lupus Sci Med. 6, e000339. doi:10.1136/lupus-2019-000339 (2019).
30. Chen, S. Y. et al. Incidence of herpes zoster in patients with altered immune function. Infection. 42, 325–334. doi:10.1007/s15010-013-0550-8 (2014).

Due to technical limitations, Tables 1 and 2 are only available as a download in the Supplemental Files section.

Editorial decision: Major revision
11 Mar, 2021
Reviews received at journal
10 Mar, 2021
Reviews received at journal
18 Feb, 2021
Reviewers agreed at journal
12 Feb, 2021
Reviewers invited by journal
12 Feb, 2021
Editor assigned by journal
12 Feb, 2021
Editor invited by journal
09 Feb, 2021
Submission checks completed at journal
09 Feb, 2021
First submitted to journal
29 Jan, 2021
