CUT&RUN reveals changes in CTCF binding profiles upon Wnt signaling activation
In our previous efforts to chart the genome-wide binding profile of β-catenin, we noticed enrichment for CTCF binding motifs within the β-catenin peak regions 29. This finding drove us to hypothesize that Wnt signaling could use CTCF-mediated chromatin organization to regulate target gene expression. To investigate this, we performed CUT&RUN in HEK293T cells to map CTCF genome-wide binding behavior under two conditions: either upon chemical inhibition of PORCN by LGK (Wnt-OFF) or of GSK3 by the small molecule CHIR99021 (Wnt-ON) (N = 3 per condition) (Fig. 1A). After DNA fragment isolation, we performed size selection to enrich for fragments < 120 bp, which represent the footprint of the transcription factor binding directly to DNA 30, and identified regions of significant read enrichment using the CUT&RUN peak calling software SEACR 31. The concordance of the CTCF replicates in each condition was high (> 80%), consistent with the known stability of CTCF binding. We employed a simple majority approach (2 of 3 datasets) to call peaks: ~20,000 peaks were identified in each condition, corroborating existing literature 20. Of these, over 17,000 (ca. 85%) were present in both Wnt-OFF and Wnt-ON conditions (Fig. 1B). Within the shared peaks, de novo motif search with MEME-ChIP 32 revealed motifs matching CTCF with an E-value of 4.5 e− 4129 (Fig. 1D, top), validating the precision of our datasets.
We then turned our attention to the CTCF binding events occurring in Wnt-OFF and Wnt-ON. Based on this initial peak calling we identified 2,573 and 3,020 unique peaks, respectively (Fig. 1B). However, most of these peaks presented residual signal in the other group: several regions displayed higher but not exclusive signal to Wnt-OFF or Wnt-ON, indicating that most of these peaks could result from differential binding affinity across the two conditions, rather than de novo binding sites in either of them. Notably, the Wnt-ON dataset showed higher average signal intensity in its unique regions (Fig. 1C), suggesting that Wnt signaling induction generates increased affinity of CTCF to a specific set of genomic regions. To identify Wnt-OFF and Wnt-ON exclusive de novo peaks, we rendered our selection more stringent by first excluding peaks that were called in a single dataset of the opposite condition. This resulted in 747 peaks unique to Wnt-OFF and 960 in Wnt-ON. Next, we performed k-means clustering (k = 4 clusters) to group these unique peaks based on difference in signal (Fig. 1D). This revealed 1 cluster for each condition with the highest average fold-change (FC) between the two conditions. Cluster 1 in Wnt-OFF with 235 peaks had an average FC > 2 over Wnt-ON, while Cluster 4 in Wnt-ON with 331 peaks had an average of FC > 5 over Wnt-OFF (Fig. 1D). Visual inspection of these peaks confirmed that the nearby shared peaks are comparable between the Wnt-OFF and Wnt-ON, while our approach isolated instances where the unique peaks are convincingly different in their signal profiles (Fig. 1E).
β-catenin dependent CTCF rearrangements under Wnt (RUW)
Our chosen method of Wnt signaling induction via stimulation with CHIR99021 (CHIR) is known to activate the pathway via inhibition of GSK3 and thus stabilization of β-catenin; however as GSK3 inhibition also stabilizes other proteins and is involved in other pathways 33,34, we decided to perform CTCF CUT&RUN in HEK293T cells lacking β-catenin (Δβ-catenin, from Doumpas et al. 2019) to discriminate β-catenin dependent and independent events (Fig. 2A).
We focused on our previously identified Cluster 1 (Wnt-OFF only) and Cluster 4 (Wnt-ON only) sets of peaks, and tested whether they were detected in the absence of β-catenin. Considering Cluster 1, we reasoned that Wnt-OFF CTCF peaks should also be present in Δβ-catenin cells in Wnt-OFF condition, where β-catenin, by definition, does not play a role. Peaks that did not meet this criterion were considered false positives and removed. This left us with 100 peaks, present in Wnt-OFF, but absent in Wnt-ON. Notably, none of these peaks were found in Δβ-catenin cells in Wnt-ON, indicating that the loss of CTCF binding upon CHIR treatment was independent of β-catenin (Fig. 2B, Supp. Table 1), and could be due to other pathways. When considering Cluster 4, we reasoned that if a peak was a true Wnt-ON only phenomenon, it should not be present in Wnt-OFF Δβ-catenin: those that were present were also considered false positives and removed. This left 145 CTCF sites that are gained upon CHIR treatment. Of these, none were present in Wnt-ON in Δβ-catenin cells, indicating that their appearance was dependent on β-catenin (Fig. 2B, Supp. Table 2), and thereby a true Wnt-induced event. We termed these 145 sites as CTCF Rearrangements Under Wnt (RUW).
Signal profile and intensity plots, which measure signal enrichment in regions whether a peak is called or not, confirmed the patterns indicated by the peak calling. CTCF binding sites lost under CHIR stimulation showed decreased CTCF occupancy regardless of β-catenin presence, whereas gained sites only showed increased signal in Wnt-ON when β-catenin was present (Fig. 2C, 2D). When normalized and visualized via the Integrative Genome Viewer (IGV, Robinson et al. 2023), nearby CTCF sites are comparable between cell lines and conditions while the lost or gained show differences in signal (Fig. 2E). Next, we performed motif analysis on the lost and gained CTCF binding sites. Lost sites, which are β-catenin-independent, showed enrichment for one de novo motif (E 1.5e− 8), which could be matched with significant p-values to many known motifs, including those of the transcription factors KLF5, SP9, and CTCFL, and showed central enrichment within the peak regions (p 5e− 5) (Fig. 2F). RUW gained sites had two significantly enriched motifs: one matching CTCF (E 1.3e− 38) which was centrally enriched (p 1.2e− 11), and one matching TCF/LEF (E 2.7e− 22) which was most enriched slightly offset (-40 bp) of peak center (p 0.06) (Fig. 2G). The finding that TCF/LEF motifs frequently lied adjacent to those of CTCF in a close but not overlapping manner indicated that both factors could simultaneously occupy the same region and strengthened the evidence of these CTCF RUWs being Wnt-driven occurrences.
RUW sites overlap with characterized Wnt responsive regions
We set out to explore the characteristics of Wnt-driven, β-catenin-dependent CTCF RUWs, given their potential relevance in the transduction of Wnt-target genes. To this aim, we first annotated them based on their genomic positions. The majority of RUWs were located in introns, followed by intergenic and promoter regions (Fig. 3A, top). Next, we explored RUW regions based on their chromatin characteristics by measuring how they were marked by the histone modifications H3K4me3, H3K4me1, and H3K27ac with CUT&RUN LoV-U (Fig. 3A, center). As expected, H3K4me3 signal was mostly found in promoter RUWs, which increased in signal upon Wnt induction, indicating that these promoters become more active. H3K4me1 enrichment within RUWs was generally low, though the signal within promoters seemed to decrease upon Wnt signaling activation. As H3K4me1 is typically depleted from active promoters, we considered it consistent with the increase in H3K4me3 36. Many RUWs were decorated with H3K27ac, a marker of open chromatin 37: these increased upon Wnt/β-catenin induction (Fig. 3A). We recently did an extensive time course study where we mapped the chromatin state via ATAC-seq upon Wnt/β-catenin signaling 27, mapping open chromatin 38. We cross-referenced the RUW sites with their chromatin accessibility data, comparing the number of RUWs called as ATAC-seq peaks and the ATAC-seq signal within RUW peaks (Fig. 3B, left). We saw that while most promoters were already accessible and did not change their signal profiles after 24 hr of Wnt induction, some intron and intergenic RUWs showed gained chromatin accessibility (Fig. 3B, left). In combination, these data suggested that the RUW sites possess functional activity.
CTCF and β-catenin come into physical proximity upon Wnt activation
A comparison of β-catenin, LEF1 and CTCF genome-wide binding profiles revealed – as the motifs indicated – that RUW sites could also be bound by components of the Wnt/β-catenin nuclear complex: > 25% of them were called as β-catenin peaks, and > 50% as LEF1 peaks (Fig. 3B, right). The strongest signal for β-catenin and LEF1 was seen in intron and intergenic RUWs (Fig. 3B, right). RUW sites that were found to exhibit changed chromatin accessibility, β-catenin binding, and LEF1 binding included the notable Wnt targets AXIN2, DKK1 and NKD1, as well as previously unreported direct targets such as DSG4 (Fig. 3C). The overlapping genomic signal between CTCF and β-catenin/LEF1 suggested a physical interplay between the Wnt transcriptional complex and CTCF. However, while CUT&RUN identifies regions bound by these different factors, it could not distinguish if they ever co-occupy the same locus at the same time. To test this, we performed the highly sensitive proximity ligation assay, which uses microscopy to detect a signal emerging when two proteins of interest are in close physical proximity (within 40 nm) 39. Indeed, we could see that β-catenin and CTCF were detected as proximal, and only upon Wnt signaling induction (p < 0.0001, Fig. 3D).
RUW associated genes include differentially expressed classical Wnt target genes
To further investigate the potential impact of RUWs, we used GREAT 40 to assign the RUW peaks to 254 genes (Fig. 4A, Supp. Table 3). STRING mapping revealed a significant degree of interaction within the network (PPI 2.33e− 15) and Wnt pathway components were overrepresented (FDR 0.0082), making up the center most interconnected cluster (Fig. 4A). Gene ontology analysis also revealed enrichment for the Wnt pathway, and terms related to Wnt signaling such as limb development or cell migration (Fig. 4B). To explore the potential effect of RUWs on gene expression, we overlapped the genes associated to the RUW peaks with gene expression data from HEK293T in Wnt-ON vs. Wnt-OFF (DEG, Log2FC > 0.6, adj. p < 0.05; Doumpas and colleagues 29). Of the 254 RUW genes, 20 were DEGs (3.1-fold greater than expected by chance: hypergeometric test, p 9.09e− 6), consisting of both up- and down-regulated genes (9 up, 11 down), including many of the targets with the highest log2FC such as AXIN2, DKK1 and NKD1 (Fig. 4C). To test whether the expression of RUW associated genes was dependent on β-catenin and/or TCF/LEF, we analyzed the DEGs upon Wnt induction in Δβ-catenin and Δ4TCF cells 29. Interestingly, while most of the upregulated targets were dependent on both β-catenin and TCF/LEF, the downregulated targets seemed to be mostly independent (Fig. 4D), suggesting that GSK3-inhibition-dependent downregulation might occur via different mechanisms.
Wnt signaling activation leads to larger-scale CTCF-mediated genomic reorganization
CTCF is a known regulator of the 3D genome structure, both in formation of TADs and smaller scale loops between enhancers and promoters 18. We set out to determine whether RUWs were associated with changes in the CTCF-mediated 3D structure by performing HiChIP for CTCF in Wnt-OFF and Wnt-ON conditions (N = 2 per condition) and then comparing loops exclusive to Wnt-ON with RUW peak regions (Fig. 5A). HiChIP genome-wide analysis provided a relatively low number of CTCF mediated loops that were common between the Wnt-OFF and Wnt-ON conditions. When all detected interactions were considered, the ones that were shared consisted of ~ 20% of loops in each condition, while, at high stringency, only ~ 1.5% of loops were shared (differential loops with FitHiChIP, FDR 0.01) (Fig. 5B). The difference between conditions could be partially due to the dynamicity of the CTCF looping events leading to a low concordance across replicates 41, a similar phenomenon of high CTCF ChIP overlap paralleled by a lower CTCF HiChIP concordance between cell types has been reported previously 42. However, the broad difference between Wnt-OFF and Wnt-ON is in line with the previously reported large-scale genomic rearrangements upon modulation of GSK3 activity43. Supporting the latter explanation, we have also noticed that the shared loops, unaltered by GSK3 inhibition, were significantly shorter in length compared to the loops unique to either condition (Supp. Figure 1A), though the loop annotations were similar in distribution across conditions (Supp. Figure 1B).
We focused our attention on those changes that coincide with our previously stringently identified RUWs, where a CTCF repositioning occurs in Wnt-ON and is dependent on β-catenin. Of the 145 gained RUWs, 99 were connected to other genomic regions via a total of 172 CTCF-mediated loops that are also specific to the Wnt-ON condition (Fig. 5C, Supp. Table 4). The RUWs involved in looping were mostly found within introns and intergenic regions, likely representing enhancers. These RUWs were often an anchor of multiple loops, which were most likely to have the other anchor in intergenic and intron regions, though we identified some instances of enhancer-promoter looping (Fig. 5D). Gene ontology analysis performed on the set of RUW loops revealed that these are likely involved in the coordination of biological processes, including epithelial morphogenesis, neurogenesis and embryogenesis (Fig. 5E), that are all broadly consistent with most Wnt/β-catenin signaling output in vivo 3.
The RUWs connected to other regions via chromatin loops present only in Wnt-ON included those found within the AXIN2 and DKK1 loci (Fig. 5F). The HiChIP data revealed, as we hypothesized, that these RUWs were associated with the chromatin looping towards another CTCF site closer to the promoter of the gene (Fig. 5F). Yet, we also identified instances where many new loops were tethered to RUW sites upon Wnt signaling stimulation, likely helping to connect hubs of TCF/LEF and β-catenin bound regions (Fig. 5F, MSX2 and WNT16).
Perturbation of CTCF binding sites within RUWs affects Wnt target gene upregulation
To test whether CTCF RUWs are functionally implicated in gene expression, we sought to selectively disrupt their CTCF binding motifs. We selected the RUWs annotated to AXIN2 and DKK1, as they are two of the most differentially expressed genes in our analysis and quasi-universal Wnt/β-catenin targets. The AXIN2 RUW was located within the characterized enhancer region for the gene 44, while the DKK1 RUW was located in a putative enhancer region ~ 100 kb away. We identified the exact position of the CTCF and TCF/LEF binding site(s) within these RUW regions and, as indicated by the earlier motif analysis, found that the two motifs and corresponding CUT&RUN peak summits were slightly offset. This allowed us to design CRISPR sgRNAs which would lead Cas9 to only disrupt the core of the CTCF binding motif (Fig. 6A). We transfected cells with Cas9 and either the targeting or a scrambled sgRNA control, and then stimulated to obtain Wnt-OFF and ON conditions. To avoid clonal effects, we measured gene expression in populations (N = 6 independent wells per condition) and could see that the sgRNA targeting the AXIN2 RUW led to a significant decrease in upregulation of AXIN2 (p 0.0028) but not DKK1 nor NKD1 (Fig. 6B, top panel). Similarly, the sgRNA against the DKK1 RUW decreased upregulation of only DKK1 (p 0.02) (Fig. 6B, bottom panel). We confirmed disruption of CTCF binding sites in the populations: the detected mutations included indels and sequence alterations which invariably affected CTCF but not TCF/LEF consensus sequences (Fig. 6C). These results indicate that CTCF binding to RUW regions contributes to the upregulation of each associated Wnt target gene.