Cell culture and viruses
HEK 293T, primary HFF and Vero cells were maintained in Dulbecco’s modified Eagle’s medium (DMEM, Thermo Fisher Scientific) supplemented with 10% (vol/vol) heat-inactivated fetal bovine serum (FBS, Thermo Fisher Scientific), 2 mM GlutaMAX (Thermo Fisher Scientific), 1 mM HEPES (Thermo Fisher Scientific) and 1% gentamycin (vol/vol) or penicillin-streptomycin (vol/vol). HAP1 were cultured in Iscove’s Modified Dulbecco’s Medium (IMDM,Gibco) supplemented with 10% (vol/vol) fetal bovine serum (FBS, Anprotec) and 1% (vol/vol) penicillin-streptomycin (Sigma-Aldrich). iSLK harbouring recombinant (r) KSHV.219 were cultured in DMEM supplemented with 10% (vol/vol) FBS, 2 mM GlutaMAX (Thermo Fisher Scientific), 10 mM HEPES buffer (Thermo Fisher Scientific) and 1% penicillin-streptomycin (vol/vol) , 1 µg/ml Puromycin and 250 µg/ml C418 . All cells were kept under standard culture conditions and monthly checked for mycoplasma contamination (MycoAlert Kit, Lonza).
Lytic replication of KSHV in iSLK.219 was induced by adding 1 µg/ml doxycycline. Infection experiments were performed with HSV-1 strain KOS, HSV-1 GFP (F-strain, provided by B. Kauffer, Berlin), HSV-2-GFP (strain MS, kindly provided by Beate Sodeik, Hannover) or HCMV (strain Ad-169). HSV-1 deltaICP0 (ICP4-YFP) and HSV-1 delta ICP27 were provided by R. Everett (University of Glasgow). HSV-1 T-VEC del ICP34.5/ICP47 IMLYGIC®, Talimogene laherparepvec) was provided by L. Heinzerling (Friedrich-Alexander-Universität Erlangen-Nürnberg). HSV-1 expressing ICP4-YFP and VP26-RFP is a new recombinant virus generated in the background of strain 17 by recombining HSV-1 expressing ICP4-YFP (obtained from Matthew D. Weitzman) and HSV-1 expressing VP26-RFP (obtained from Oren Kobiler) and plaque purifying for 5 cycles.
Reagents, plasmids and transfections
Transfections were performed with GenJet (SignaGen Laboratories) or Lipofectamin 2000 (Thermo Fisher Scientific) according to the manufacturer’s protocol. Plasmids are listed in Table 3.
Western Blotting
Cells were lysed in RIPA HS buffer (10 mM Tris-HCl pH 8.0, 1 mM EDTA, 500 mM NaCl, 1% Triton X-100 (vol/vol), 0,1% SDS (vol/vol), 0,1% deoxycholic acid (DOC) with Aprotinin and Leupeptin, MG-132 and sodium metavanadate (Sigma-Aldrich). The cell pellet was centrifuged at 4°C, 14.000 rpm for 30 min. Samples were diluted with Laemmli-SDS sample buffer and heated for 5 min at 95°C. Antibodies used are listed in Table 4.
qRT-PCR
RNA was extracted using the Direct-zol RNA Miniprep Plus kit from Zymo Research according to manufacturer’s instructions. Reverse transcription was performed using the Super Script IV Kit (Thermo Fisher Scientific) according to manufacturer’s instructions. qRT-PCR was carried out using TaqMan™ Universal PCR Master Mix I (Applied Biosystems, Thermo Fisher Scientific) 0,1 µg as template on a 7500 Fast Real-Time PCR machine. Or reverse transcription and qRT-PCR were conducted in one step using 0.1 µg RNA as template with the Luna Universal Probe One-Step RT-qPCR kit (New England Biolabs) following manufacturer’s protocols. Primers/probes are listed in Table 1. Expression levels for each gene were obtained by normalizing values to HPRT1 or VTRNA and fold induction was calculated using the comparative CT method (ΔΔCT method).
CRISPR and sgRNAs
All sgRNAs used in this study were previously described and are listed in Table 2. sgRNAs were cloned into LentiCRISPRv2 plasmid gifted from F. Zhang (Addgene plasmid #52961) and verified by sequencing. Lentiviruses were packaged with pMD2.G (Addgene plasmid #12259) and psPAX2 (Addgene plasmid #12260) (both gifted from D. Trono) into HEK 293T. HEK 293T were seeded into 12 well plates and lentiviral supernatants added at 70-80% confluency. Plates were centrifuged at 1200 rpm for 2 min after centrifugation culture medium was added and cells incubated over night at 37°C. The medium was changed to normal culture medium the next day and selection with 2 µg/ml puromycin in normal culture medium started on day 3.
DUX4 protein purification
The DUX4 protein was purified using the Intein Mediated Purification with an Affinity Chitin-binding Tag (IMPACT) system (NEW ENGLAND BioLabs INC). The coding sequence of DUX4 (Addgene: Plasmid #21156) was subcloned into the pTYB12 vector using EcoR1 and Sap1 restriction sites, with an intein-CBD tag added to the N-terminus of DUX4. Protein expression was induced by adding 0.4 mM of IPTG to ER2566 cells at an OD600 = 0.5 overnight at 18°C. Bacterial pellets were then resuspended in Lysis Buffer (20 mM Na-HEPES, 500 mM NaCl, 1 mM EDTA, 0.1% Triton X-100 and protease inhibitors (cOmplete™ Proteasehemmer-Cocktail)) and lysed using a French Press. The lysates were centrifuged at 15000 g at 4°C for 30 min, and the clarified lysate was slowly loaded onto the chitin column for purification using the ӒKTA pure™ chromatography system. The beads were then washed with 50 bed volumes of Column Buffer (20 mM Na-HEPES, 500 mM NaCl, 1 mM EDTA, and 0.1% Triton X-100) before protein cleavage with washing buffer containing 50 mM DTT. Finally, the eluate was further purified with Size Exclusion Chromatography using a Superdex 200 Increase 10/300 GL column (Cytiva).
Next generation Chip-Seq
For CHIPmentation primary HFF cells were seeded in T175 flasks and infected with HSV-1 KOS (MOI 10) for different time points. CHIPmentation was conducted as previously described44. Cells were sonicated using the Bioruptor (Diagenode) for 30 cycles. For the immunoprecipitation protein G Dynabeads (Thermo Fisher Scientific) were used. Samples were incubated with either 2.5 µg Anti-DUX4 (E5-5) (Abcam) or Normal Rabbit IgG (Cell Signaling Technology) as control. Samples were purified using AMpureXP beads (Beckman Coulter) according to manufacturer’s description. Libraries were sequenced on HiSeq 4000 System (Illumina)
Next generation RNA-Seq
HEK 293T wildtype cells and HEK 293T CRISPR/Cas knockout cells were seeded in T25 flasks. Cells infected for different time points with HSV-1 KOS (MOI 10). HAP1 and HAP1-DUX4 ko cells were Cells were lysed in Trizol (Life Technologies by Thermo Fisher Scientific), and total RNA was isolated using the RNA clean and concentrator kit (Zymo Research), according to the manufacturer's instructions. Sequencing libraries were prepared using the NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (NEB) with 9 cycles PCR amplification, and sequenced on a HiSeq 4000 device with 1x50 cycles. For quantification of viral gene expression alignments were done using hisat249 on the HSV-1 genome (strain 17, genbank accession no. NC_001806) and readcounts per gene quantified using quasR51.
CUT&Tag
Primary HFF cells were infected with HSV-1 GFP (MOI 1) in PBS supplemented with 0.1% Glucose and 1% FCS. the remaining virus was washed away with a low pH buffer (40mM Citric acid, 10mM KCl, 135mM NaCl, pH3) after one hour at 37°C. CUT&Tag was performed according to the manufacturing protocol (CUT&Tag-IT Assay Kit, Active Motif) using 2,5 µg Dux4 E5-5 (Abcam) or 2,5 µg rabbit IgG (Cell Signalling Technology). In short, cells were bound to concanavalin A-coated magnetic beads and permeabilized for subsequent incubation with primary and secondary antibodies. Specific cutting and addition of adapters was mediated by a protein A-Tn5 transposase fusion protein. Libraries were sequenced on HiSeq 4000 System (Illumina)
EMSA
Viral DNA of HSV-1 GFP and HSV-1 KOS was isolated using the Quick-DNA MiniPrep kit from Zymo Research. Two short regions containing either the DUX4 binding motif or no DUX4 binding motif were amplified by PCR: about 600bp long fragments were amplified from HSV-1 genomes using the primers for Binding site 1 (Dux4 motif) BS1-fwd:gtgtaccactgctgtcg, BS1-rev:gtctgatcatgccccatacc, and Binding site 2 (No Dux4 motif) BS2-fwd:cgtgaaccaaagacgagggc, and BS2-rev:ccacgttgagaagctcgtcg. The EMSA was performed as described by Lee et al. 45 using 50ng of amplified viral DNA, which was incubated 30 min at room temperature with 1µg purified DUX4 protein and 2µL sperm DNA (D7656, Sigma-Aldrich).
Flow cytometry
HAP1 wt and ko cells were simultaneously seeded and infected with HSV-1 GFP or HSV-2 GFP at a MOI of 0.1. At day 1, 2, 3 and 4 post infection, cells were detached by scraping, fixed in 1% PFA for 20 min and resuspended in FACS buffer (PBS supplemented with 2% FCS and 0.5mM EDTA). GFP expression was measured with BD LSRFortessa and the data was analyzed with FlowJo.
Predicted DUX4 binding sites in different herpesviral genomes
The following viral genomes were downloaded at NCBI: HSV-1 (GU734771.1), HSV-2 (Z86099.2), HCMV (NC_006273), EBV (NC_007605) and KSHV (OK358814). The viral genomes were analysed for DUX4 binding sites with FIMO46 using the default parameters and the DUX4 binding motif obtained with CUT&Tag.
Bioinformatic analysis
ChIP-seq data processing was done using the PiGx-ChIP-seq pipeline (https://doi.org/10.1093/gigascience/giy123). In short, adapters and low quality bases were trimmed from reads using Trim-galore. The reads were mapped on the hg19 version of the human genome, combined with HSV-1 genome, using Bowtie2 with k = 1 parameter. bigWig tracks were created by extending reads to 200, collapsing them into pileups, and normalizing to reads per million. Peak calling was done with MACS2 (https://github.com/taoliu/MACS) using the default parameters. Motif discovery was done using MEME 47 with the default parameters, on the top 100 peaks (sorted by q value), in a region of +/- 50bp around the peak center. Peak annotation was done using the hg 19 ENSEMBL GTF file, downloaded on 17.03.2017. from the ENSEMBL database 48. Peaks were annotated based on the following hierarchy of functional categories: tss -> exon -> intron -> intergenic (eg. if a peak overlapped multiple categories, it was annotated by the class that is highest in the hierarchy). Peaks were overlapped with the hg19 Repeatmasker repeat annotation, downloaded from the UCSC database on 03.02.2015.
For CUT&Tag analysis, the data was mapped to the hg19 version of the human genome with the viral genome using Bowtie2 with k = 1 parameter. bigWig tracks were created by extending reads to 200, collapsing them into pileups, and normalizing to reads per million. Peak calling was executed by MACS2 (https://github.com/taoliu/MACS) using the default parameters. The obtained peaks were filtered for being present in two consecutive time points and not present in the 2h time point. Motif discovery was done with MEME using the default parameters on all peaks of the human genome in a region +/-50bp around the peak center.
The bulk RNA-seq raw data of Human Adenovirus 5 infected cells (PRJEB57806), FSHD patients (GSE153301), and 8-cell-stage embryo cells (GSE36552), as well as the single-cell RNA-seq data of nasopharyngeal carcinoma patients (GSE162025), were downloaded from the NCBI database. The bulk RNA-seq data were aligned to the human genome (hg38) using STAR and then normalized using DESeq2. The R package VennDiagram was used to generate a Venn diagram showing the overlapping genes between the different samples. To visualize the DUX4 target genes expression patterns, the R package pheatmap was used to create a heatmap. For the single-cell RNA-seq data analysis of nasopharyngeal carcinoma patients, the STARsolo pipeline was used to align sequencing reads to the human reference genome hg38 and to generate feature-barcode matrices. The gene expression matrices for all PBMC and tumour cells were combined and converted to a Seurat object using the R package Seurat. The gene expression matrices were then log-normalized and linearly regressed using the NormalizeData and ScaleData function of the Seurat package. Finally, the scVirusScan pipeline was employed to identify viruses present in all PBMC and tumour cells.
Transposon Quantification
Transposon expression was quantified using TeTranscripts 49 version 2.2.0, using the hg19 version of the human genome and a custom formatted transposon GTF file, made available by the Hammell laboratory from the following link: http://labshare.cshl.edu/shares/mhammelllab/www-data/TEtranscripts/TE_GTF/. Before visualization, the read counts per TF repName repeatmasker category were multiplied by the following size factors 10000/(number of repeats), and 1000/(average repeat length). The transposon expression was visualized relative to the uninfected wild type sample, using the ComplexHeatmap package (27207943).
Comparison of HSV-1 gene expression with the embryonic expression profile
Expression profiles for HSV-1 and DUX4 ectopically expressed, 293T cells were taken from the following publication 6. The embryonic expression profiles were downloaded from the ARCHS4 database. The data originated from the following repository GSE44183. Data was visualized using the ComplexHeatmap function.
Comparison of repeat expression during HSV-1 infection and DUX4 overexpression.
Expression profiles obtained from HSV-1 and DUX4 ectopically expressed, 293T cells were taken from the following publication 6. DUX4 binding data extracted from the supplementary data from the following publication 14. RNA - seq data was mapped using STAR, and the repeats were quantified by counting the number of, uniquely mapping, spliced reads overlapping with the each transposon category. Expression was visualized using ComplexHeatmap. Prior to the visualization the reads were normalized to uninfected samples.
Statistical analysis
P values were calculated using an unpaired Student’s test. P <0.05 was considered statistically significant.