The POLV and JMJ14 proteins regulate stress-related genes during TuMV infection
We previously found that A. thaliana genotypes with mutations in various RdDM and histone modification genes had altered TuMV infectivity when compared to WT plants [23]. Here, a mutant associated with RdDM, nrpe1, coding for the largest subunit of PolV (polv mutant henceforth), and a mutant associated with histone modification, jmj14, coding for an H3K4m3 demethylase, were chosen for further analysis. In an effort to identify the genes directly affected by the mutations that lead to the previously observed viral tolerance, the transcriptomes of TuMV-infected WT, polv, and jmj14 mutants, as well as their respective mock-inoculated controls, were obtained. Non-inoculated central rosette leaves were collected at 4 days post-inoculation (dpi), when no symptoms were observed, and at 7 dpi, just after symptoms appeared.
Because both the POLV and JMJ14 proteins act as repressors of gene expression via DNA methylation and histone modification, respectively, their direct targets are expected to be expressed at a higher level in mutants than in WT plants. As a result, we concentrated the analysis on the induced set of differentially expressed genes (DEGs). The induced genes at 4 and/or 7 dpi were combined and classified into four groups: WT responsive (infected WT vs mock WT), mutant responsive (infected mutant vs mock mutant), mutant-enhanced mock (mock mutant vs mock WT), and mutant-enhanced infected (infected mutant vs infected WT) (Fig. S1a and Table S1). The two categories labeled as "enhanced" contain genes that are more likely to be expressed in mutants compared to WT plants. This was done to enrich genes directly targeted by the two epigenetics proteins.
The induced genes in WT plants were enriched in several gene ontology (GO) categories related to metabolism, abiotic and biotic stress responses, with salicylic acid (SA) response being the most significant (Fig. S2). The majority of the mutant-induced genes were also responsive in WT plants (Fig. S1a). In the polv mutant, a total of 6,157 induced genes were either enhanced in mock or infected tissues, and 5,545 in the jmj14 mutant (Fig. S1a and Table S1). The majority of the enriched GO categories were unique to each mutant (Fig. S2). These findings suggest that the POLV and JMJ14 proteins may directly or indirectly regulate several stress response pathways during TuMV infection, with little overlap.
Identification of candidate genes that are directly regulated by the POLV and JMJ14 proteins during infection
The augmented expression of genes in infected epigenetic mutants compared to WT infected plants is a common behavior observed in epigenetically regulated immunity genes [36]. As a result, we concentrated on the mutant-enhanced infected samples in an attempt to identify candidate genes that are directly targeted by the POLV and JMJ14 proteins (Fig. 1a). The number of DEGs in this category observed for both polv and jmj14 mutants was higher at 4 than at 7 dpi (Fisher’s exact test: P < 0.0001), indicating that gene expression reprogramming related to these epigenetic pathways was more pronounced at early stages of infection on non-inoculated leaves than at fully established stages (Fig. 1b and Fig. S3). At 4 dpi, the number of induced and repressed DEGs distributed evenly across both mutant genotypes (Fisher’s exact test: P = 0.2289). However, at 7 dpi the number of induced DEGs was significantly enriched in jmj14 compared to polv (Fisher’s exact test: P < 0.0001).
POLV and JMJ14 are epigenetic proteins that can repress genes directly by promoting DNA methylation and H3K4m3 removal, respectively, or indirectly through the effects of their direct targets in various signaling cascades. Direct POLV targets are expected to have a higher chromatin occupancy of RdDM proteins and H3K9m2 marks compared to indirect targets. In turn, JMJ14 direct targets may have higher JMJ14 occupancy than indirect ones. It is expected that the targets of this histone demethylase will also be H3K4 hypermethylated in jmj14 mutants compared to WT plants. Available chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq) experiments for the RdDM-related proteins RNA Polymerase IV (POLIV), POLV, DEFECTIVE IN RNA-DIRECTED DNA METHYLATION 1 (DRD1), DEFECTIVE IN MERISTEM SILENCING 3 (DMS3), RNA-DIRECTED DNA METHYLATION 1 (RDM1) and H3K9m2 marks were obtained from the NCBI GEO database (Table S2), and processed peaks overlapped with induced polv-enhanced genes in infected plants, including a region 1 kilobases (kb) upstream of the transcription start sites (TSS) (Fig. 1a). From a total of 4,556 genes induced genes at 4 and/or 7 dpi in the infected polv-enhanced group (Fig. S1a), 894 had significant occupancy of RdDM proteins or H3K9m2 marks, indicating that they could be potential direct targets of the POLV protein during infection (Fig. 1c and Table S3).
Genome occupancy data for the JMJ14 protein and H3K4m2/m3 marks in jmj14 mutants and WT plants were also obtained and analyzed from published studies (Table S2). Genomic ranges of all induced genes at 4 and/or 7 dpi enhanced in jmj14-infected samples (plus 1 kb upstream from the TSS) were overlapped with JMJ14 ChIP-seq peaks and regions found to gain H3K4m3 marks in jmj14 mutants. From a total of 3,233 genes in the jmj14-enhanced infected group (Fig. S1a), 1,230 were found to overlap with JMJ14 or H3K4 hypermethylation peaks (Fig. 1c and Table S3). The overlap between direct candidates was minimal, with only 49 genes shared by both groups (Fig. 1c and Table S3).
To validate the dataset of genes regulated by the two epigenetic proteins, profiles of different features were computed around the TSS (5 kb upstream and 1 kb downstream) of each one using ChIP-seq and whole genome bisulfite sequencing (WGBS-seq) data obtained from public databases (Table S2 and Fig. 2). As expected, direct POLV targets had a higher percentage of TEs and TE-related H3K9m2 and methylcytosine (mC) marks around their TSS than other genes (Fig. 2). In contrast, the set of JMJ14-regulated genes had lower levels of these same features around their TSS than the control set, indicating that genes found to be augmented in infected polv mutants but not the jmj14 ones are probably controlled by TE-related mechanisms. The set of JMJ14-regulated genes had two main peaks of hypermethylated H3K4m2/m3 marks in jmj14 mutants close to 5 kb upstream and 500 bp downstream of the TSS (Fig. 2). These peaks coincided with JMJ14 protein enrichment regions in WT plants (Fig. 2). This means that in mutant jmj14 plants, the regions of JMJ14 chromatin binding coincide with an average increase in H3K4 hypermethylation for this specific dataset. The set of POLV-regulated or non-regulated genes, on the other hand, showed no gain of H3K4 activation marks in jmj14 mutants in these same positions (Fig. 2). JMJ14 binding was increased around 2 kb downstream of the TSS in both POLV- and JMJ14-regulated genes, but this was not associated with increases in H3K4 marks in jmj14 mutants for the POLV dataset, possibly indicating that other H3K4 demethylases could be redundantly removing these marks in these regions in the mutant (Fig. 2). The profiles showed therefore that each dataset contained the expected marks for their respective pathways, indicating that they are likely enriched in true direct targets of these epigenetics proteins. These results collectively indicate that approximately 20% and 40% of all induced genes at 4 and/or 7 dpi, respectively, are candidates for direct regulation by POLV and JMJ14 during infection.
The majority of the POLV- and JMJ14-regulated candidate genes are protein-coding and dynamically regulated during infection
The vast majority of the candidate genes found to be directly regulated by POLV and JMJ14 were protein-coding genes, but non-coding RNAs and TEs were also observed (Fig. 3a). Interestingly the distribution of candidate genes among the categories listed in Fig. 3a were significantly different between POLV and JMJ14 (χ2 = 49.485, 7 d.f., P < 0.0001), with an enrichment of protein-coding RNAs in the case of JMJ14 and of TEs in the case of POLV. The major GO categories enriched in POLV-regulated genes were related to biotic and abiotic stress responses. Although some abiotic-related GOs were also enriched in JMJ14-regulated genes, the majority were involved in metabolism regulation (Fig. 3b). Both datasets of direct candidates contain Mapman annotation bins related to transcriptional factors (TF), phytohormones, and immune regulation (Fig. 3c-e). Several members of the TF families NAC and WRKY were downregulated at 4 dpi in polv-infected samples, but upregulated at 7 dpi when compared to the level of expression in infected WT plants (Fig. 3c). However, higher expression in mutant infected plants than WT infected at 4 dpi and similar levels in both conditions at 7 dpi, was generally observed for various genes associated with the phytohormone classes abscisic acid (ABA), ethylene, jasmonic acid (JA), SA and stress-related genes, indicating that epigenetic regulation is especially important for early expression of defense genes in non-inoculated leaves (Fig. 3D-e).
Six POLV-regulated genes and six JMJ14-regulated genes were chosen for reverse transcription quantitative polymerase chain reaction (RT-qPCR) validation based on function and fold change differences in mutant infected vs WT infected (Fig. 4). All tested genes confirmed enhanced expression in infected mutants at 4 and/or 7 dpi when compared to infected WT plants (Fig. 4). With the exception of the ABA-receptor gene PYRABACTIN RESISTANCE 1-LIKE 5 (PYL5), all of the other tested genes showed augmented expression in mock mutant plants at either 4 or 7 dpi, indicating that they were leaky expressed in the mutants prior to infection when compared to WT uninfected plants (Fig. 4). When comparing mutant infected plants to mock mutant plants, genes coding for the RESISTANCE TO POWDERY MILDEW 8 (RPW8)-domain containing protein (RPW8 hereafter) and the calcium channel-related CYCLIC NUCLEOTIDE GATED CHANNEL 19 (CNGC19), both predicted as JMJ14 targets, an increased expression in mutant infected plants was observed at both time-points (Fig. 4). Although non-infected plants have similar levels of PYL5 gene expression, jmj14 infected plants have a much stronger induction of this gene than WT infected plants (Fig. 4). These data collectively indicate that several genes involved in various aspects of immune responses are dynamically regulated during infection.
Virus-responsive genes may be controlled by distinct epigenetic modules
To increase confidence that the selected panel of genes are indeed regulated by epigenetic mechanisms, their sequences were analyzed in the Plant Chromatin State Database (PCSD), where a series of 36 epigenetic states that are commonly found correlated in the A. thaliana genome were identified based on the integration of publicly available epigenomic data [37] (Fig. S4). Three of the six POLV-regulated selected genes had TE-related epigenetic states associated with their promoter sequences, including the NAC DOMAIN CONTAINING PROTEIN 3 (NAC003) transcription factor, the ALTERNATIVE SPLICING COMPETITOR (ASCO) long non-coding RNA (lncRNA), and the NLR gene CONSTITUTIVE SHADE-AVOIDANCE 1 (CSA1) (Fig. S4). The CSA1 gene’s promoter also contained bivalent marks, including repression H3K27m3, activation H3K4m2, and the histone variant H2A.Z (Fig. S4). The promoter of the defense-related POLV-regulated gene LATE UPREGULATED IN RESPONSE TO HYALOPERONOSPORA PARASITICA-ONE-LIKE (LURP1-LIKE) was also found to have bivalent and H3K27m3 repression states (Fig. S4). None of the JMJ14 selected genes were linked to epigenetic states related to TE regulation, providing more evidence that they are not regulated by RdDM-related mechanisms (Fig. S4). With the exception of PYL5 and gibberellin-associated RGA-LIKE PROTEIN 3 (RGL3) genes, where only non-H3K4m-related accessible DNA states were found, the promoter sequences of the remaining four JMJ14 selected genes, the defense-related PHLOEM PROTEIN 2 A5 (PP2-A5), CNGC19, RPW8, and the chaperone CELL DIVISION CYCLE 48B (CDC48B) displayed both bivalent and activation marks (Fig. S4).
The presence of different kinds of marks in the promoter regions of the selected genes suggested that additional epigenetic modules may possibly be involved in their regulation. To directly test this hypothesis, the expression of the POLV-controlled genes CSA1 and RLP43, as well as the JMJ14-controlled genes CNGC19, RPW8, PYL5, and RGL3, was examined in six additional epigenetic mutants. Plants defective in the methyltransferase genes ARABIDOPSIS TRITHORAX 1 (ATX1), SET DOMAIN GROUP 8 (SDG8), CURLY LEAF (CLF), and KRYPTONITE (KYP), associated with H3K4, H3K36, H3K27 and K3K9 methylation marks, respectively, the H3K27 demethylase gene RELATIVE OF EARLY FLOWERING 6 (REF6) and the histone deacetylase gene HISTONE DEACETYLASE 19 (HDA19) were used in this assay. All of the selected pathways have previously been linked to the regulation of defense genes [38]. Among the examined mutant genotypes, expression analysis indicated that the CLF and HDA19 proteins may play the most significant role in the regulation of the selected genes. Genes RLP43, CNGC19, RPW8, and PYL5 were induced in hda19 mutants relative to WT plants (Fig. 5). This is consistent with HDA19 protein's predicted restrictive action in its targets due to the removal of activation acetylation marks. In the clf mutant background, CSA1, CNGC19, RPW8, and PYL5 were also dysregulated, suggesting that H3K27m marks may also be necessary for their regulation (Fig. 5). Contrary to expectations, CSA1 expression was lower in clf mutants than in WT plants, suggesting that this gene may be repressed via alternative mechanisms in the absence of CLF (Fig. 5). The expression of all six examined genes was not significantly affected in the lines sdg8, ref6 and kyp when compared to WT plants (Fig. 5). This could imply that the selected virus responsive genes are not targeted by these epigenetic proteins, or that their functions are performed by other family members in their absence.
Overall, the selected genes’ overlap with epigenetic proteins or marks (Fig. 1 and Table S3), their altered expression in epigenetic mutants (Fig. 4 and Fig. 5), and the presence of bivalent, activation, and repression epigenetic marks in their promoters (Fig. S4) all increase the likelihood that they —and the majority of the other identified candidates- are indeed regulated by epigenetic pathways during virus infection.
A subset of candidate genes regulated by POLV and JMJ14 are also regulated in WT plants submitted to repeated virus infection
It is known that some stress genes in plants are epigenetically regulated in response to recurrent stress situations. We wondered if some of the POLV- and JMJ14-regulated genes could have a biological role in WT plants under sequential viral stress. Plants were inoculated with a stimulating virus, and leaves were removed before virus spread to non-inoculated tissues (Fig. 6a and Fig. S5). After three days of recovery, a second challenging virus was inoculated, and symptoms were recorded for 21 days (Fig. 6a). Plants that had not been inoculated or had been mock-inoculated served as controls. TuMV was used as the second challenging virus in all cases. To determine if phylogenetic distance to TuMV influenced the results, tobacco mosaic virus (TMV; Tobamovirus, Virgaviridae), tobacco rattle virus (TRV; Tobravirus, Virgaviridae), and TuMV itself were used as stimulating viruses in separate experiments.
The level of TuMV symptom severity in plants was the same whether they were mock inoculated or left untouched in the stimulation step (Fig. 6b). Plants stimulated with any of the three viruses were more tolerant to subsequent TuMV infection than unstimulated plants (Fig. 6b), despite the high variability among the six full factor experimental replicates performed. Unstimulated plants developed stronger symptoms faster, as defined by the presence of clear yellowing in all rosette leaves (Fig. 6c). Interestingly, stimulation with TuMV made them more tolerant to later TuMV challenge than stimulation with TRV, indicating that the phylogenetic proximity between the stimulating and the challenging stresses was relevant for the outcome of the induced resistance phenotype. In some cases, the differences in TuMV tolerance between stimulated and unstimulated plants became visually prominent in late infection time-points (Fig. 6c).
To compare the dynamics of gene expression patterns and maximize the chances of finding virus memory genes, an experimental replicate with high TuMV tolerance due to TuMV stimulation was chosen for further investigation. Plants that had been stimulated with TuMV prior to challenge showed a significantly lower virus load four days after the challenge (Fig. 7a). This difference in virus load between conditions was even greater at 8 dpi, indicating that TuMV infection progressed more slowly in stimulated plants and correlating with previously obtained symptomatology data (Fig. 6b-c and Fig. 7a). Indeed, TuMV viral titers in stimulated plants at 8 dpi were comparable to those in unstimulated samples at 4 dpi, indicating that the two conditions were in different stages of infection at 8 dpi. Four unstimulated and TuMV-stimulated individual plants were chosen for transcriptome analysis at 8 dpi. As controls, three biological replicates of plants that were mock-inoculated in both the stimulation and challenging steps (mock plants) were included. This time point was chosen due to the greater differences in virus load between stimulated and unstimulated samples, as both of which had very low virus titers at 4 dpi. In the principal component analysis (PCA), both unstimulated and stimulated samples showed clear separation when compared to mock plants with no virus (Fig. S6a-b). We concentrated on a direct comparison between unstimulated and stimulated plants to find memory genes. When both conditions were compared, a general clear separation in PCA was observed, indicating that stimulation conditions influenced later infection expression patterns. However, one outlier was observed among the four biological replicates of unstimulated samples and another among the stimulated samples (Fig. S6c). The outliers were therefore removed, and the differential expression analysis was carried out with the remaining three biological replicates (Table S4).
The major unique GO categories in the set of induced genes were related to development and metabolism when the low virus titer TuMV-stimulated plants were used as treatments and the high virus titer unstimulated plants were used as controls (Fig. 7b). Induced genes were also found to be overrepresented in three categories related to heterochromatin assembly and RdDM-related processes (Fig. 7b). The majority of stress-related GO categories were only found in the set of repressed DEGs (Fig. 7b), reflecting the significantly lower virus load in stimulated samples at 8 dpi (Fig. 7 and Table S4).
Among the 5,261 DEGs between stimulated and unstimulated plants, 232 also appeared on the list of genes predicted to be directly controlled by POLV, 368 on the list of genes predicted to be JMJ14-regulated, and 21 on both lists (Fig. 8a and Table S5). Although there were no GO enrichment categories among these shared datasets, genes related to response to stimulus were present in all of them (Fig. 8b). Genes involved in organism interaction and immune responses were also found in the WT and JMJ14 groups. Six genes that had previously undergone RT-qPCR testing in mutant samples (Fig. 4) and that also appeared on the list of regulated genes in stimulated wild type plants (Table S5) were chosen for additional expression quantification analysis (Fig. 8c). At 4 dpi, a non-significant tendency to induction in stimulated plants was observed for all tested genes, including the defense-related LURP1-LIKE and PP2-A5, the PTI receptor RECEPTOR LIKE PROTEIN 43 (RLP43), the defense-related RPW8, the chaperone CDC48B, and the ABA-receptor PYL5 (Fig. 8c). Only samples collected at 8 dpi, which were the same ones used for the transcriptome analysis, showed significant differences, though. In four genes (LURP1-LIKE, RLP43, RPW8, and CDC48B), unstimulated samples showed a significant induction when compared to mock or stimulated samples, but there were no differences between stimulated and mock samples (Fig. 8c). This is consistent with the general repression of gene expression observed in stimulated samples when compared to unstimulated samples (Fig. 7b and Fig. 8c). Induction of the PYL5 gene was observed in stimulated samples versus unstimulated samples, despite the fact that the former has significantly lower virus titer than the latter (Fig. 7a and Fig. 8c). This suggests that some of the identified POLV- and JMJ14-regulated genes during infection are dynamically regulated in WT plants exposed to repeated virus stress, with some of them being poisoned to be expressed at earlier stages of the infection.