Biallelic missense and nonsense THOC6 variants are the genetic basis of TIDS
While initially detected in Hutterite populations11, a growing number of pathogenic biallelic THOC6 variants are being discovered across the globe in individuals of diverse ancestry8-10,12-18,41. As part of an ongoing effort to determine the genetic etiologies of syndromic ID, we discovered nine THOC6 variants by exome-based genetic testing (Figure 1A). We confirmed six recurrent variants at W100, G190, V234, and G275 amino acids and identified three novel alleles (p.Q47*, p.E188K, and p.R247Q). The clinical phenotype associated with THOC6 variants affirm penetrance of the core clinical features of TIDS, namely global developmental delay, moderate to severe ID and facial dysmorphisms (Figure 1B). Variable expressivity of cardiac and renal malformations, structural brain abnormalities with and without seizures, urogenital defects, recurrent infections, and feeding complications were also noted, clinical features that highlight the multiorgan involvement of this developmental syndrome (Figure 1C). Detailed clinical summaries for all individuals are provided in Table S1 and S2.
The novel THOC6 variants are representative of previously described nonsense and missense variants that contribute equally to the severity of TIDS phenotypes. We describe a nonsense THOC6 c.139C>T, (p.Q47*) variant in exon 2 of proband 1, a missense c.740G>A, (p.R247Q) variant in exon 11 of proband 5, and a missense c.562G>A, (p.E188K) variant in proband 6 (Figures 1A and 1C). THOC6 is comprised of seven WD40 repeat domains (Figure 1D) that form a β-propeller structure when folded. These novel variants, like other clinically relevant THOC6 variants, map to the WD40 repeats that comprise the beta strand structural regions of THOC6 (Figure 1D). The p.Q47* nonsense variant represents a cluster of three THOC6 variants in the first WD40 repeat that exhibit a consistent genotype-phenotype correlation for both recessive nonsense and missense variants. The same trend is observed for the novel missense variants that are localized to subsequent THOC6 WD40 domains, which support a loss-of-function (LOF) pathogenic mechanism for both THOC6 variant types.
A LOF mechanism is also implicated by the clinical consistency observed between biallelic inheritance of pathogenic THOC6 haplotypes and biallelic inheritance of a single haplotype variant alone. Biallelic inheritance of a triple-variant haplotype (TVH), THOC6 c.[298T>A;700G>C;824G>A], (p.[W100R;V234L;G275D]) has been reported in seven individuals with clinical features of TIDS9,13,17,42. The TVH segregates as a founder haplotype in individuals of European ancestry42. In comparison, a homozygous THOC6 c.824G>A; p.G275D variant was identified in siblings with classic TIDS in family 4 who are of South Asian ancestry. This finding provides additional evidence for the pathogenicity of the TVH THOC6 c.824G>A; p.G275D variant but does not negate the predicted pathogenicity of the corresponding W100R or V234L TVH variants. Pathogenicity of p.W100R or p.V234L are supported by variants detected in WD40 repeats 2 and 4 (Figure 1D). Comparing biallelic inheritance of TVH and THOC6 c.824G>A; p.G275D suggests a single THOC6 variant in both alleles are sufficient to comprehensively disrupt THOC6, a baseline deficiency not exacerbated by accumulation of additional LOF variants.
THOC6 variants show mRNA stability with differential effect on protein abundance
To investigate the genetic mechanism of THOC6, the impact of on mRNA nonsense mediated decay (NMD) and protein expression on pathogenicity was tested in embryonic stem cells (ESCs) and iPSCs, collectively referred to as human pluripotent stem cells (hPSCs). hPSCs were reprogrammed from two individuals with TIDS (6:IV:1, THOC6E188K/E188K and 7:V:2, THOC6W100*/W100*) and their respective unaffected heterozygous parent (6:IV:2, THOC6E188K/+ and 7:V:1, THOC6 W100*/+) (Figure 1E), preserving the shared genetic background between affected and unaffected conditions. Consistent with a LOF mechanism, reduction in protein expression due to mRNA nonsense mediated decay was predicted. However, THOC6 mRNA transcripts remain relatively stable between genotypes, as assessed by Actinomycin D treatment where transcription is inhibited, compared to the unstable mRNA, FOS, that is quickly degraded (Figure 1F) 43. This finding is consistent regardless of THOC6 variant, with the c.299G>A, (p.W100*) and c.562G>A, (p.E188K) variants exhibiting similar decay rates as wildtype transcripts (Figure S1C and S1D). This finding could reflect defective NMD from failure of THOC6-affected mRNA to be exported to the cytoplasm. Nevertheless, the impact on protein expression is divergent between nonsense and missense variants. Significant reductions in THOC6 abundance were detected in THOC6W100*/W100* and THOC6E188K/E188K iPSCs relative to the THOC6+/+ control, with the most remarkable reduction for THOC6W100*/W100*. Significant abundance differences were also noted for heterozygous unaffected iPSCs relative to wildtype controls (Figure 1G). Full-length THOC6 in THOC6W100*/W100* samples represent a minority readthrough product. Increased frequency of this rare event was promoted by treatment with 30 mM Ataluren, which extends translation by skipping premature termination codons, leading to an increase in THOC6 detectable by Western blot in THOC6W100*/W100* iPSCs (Figure 1G). No truncated product was observed by Western blotting in THOC6W100*/W100* and THOC6W100*/+ iPSCs, suggesting THOC6 reduction is due to rapid degradation of an unstable, truncated protein. Stable expression from missense THOC6 alleles suggests variant THOC6 is likely functionally inactive.
THOC6 variants interfere with TREX functions
Based on the solved crystal structure25, LOF THOC6 variants are predicted to impair TREX tetramer formation. The WD40 repeat domains of THOC6 form beta strands predicted to provide the structural interface for TREX core tetramer formation. Pathogenic THOC6 variants disrupt residues conserved in mammals, though several are not conserved across other metazoan species, mirroring TREX composition variability across species and the evolving function for THOC6 in TREX (Figure 2A). Evaluation of the TREX crystal structure indicates that pathogenic THOC6 variants are positioned at the TREX core tetrameric interface, where THOC5-THOC7 interaction is responsible for dimerization and THOC6-THOC5 interaction tetramerizes the complex (Figure 2B, 2C, and 2D). The allelic series of THOC6 LOF variants implicate a pathogenic mechanism where LOF missense variants are predicted to perturb THOC6 β-propeller folding and/or interactions with THOC5 and THOC7 that are required for TREX tetramer assembly, and nonsense variants would produce a similar outcome due to low protein abundance25. Likewise, THOC6 variants do not alter the protein abundance of other THO/TREX members (Figure 2E). Consistent with normal abundance of functional THO protein, subcellular localizations at nuclear speckle domain were observed by immunohistochemistry (Figures S1E and S1F). Conversely, ALYREF association with the THO subcomplex is diminished in THOC6E188K/E188K and THOC6W100*/W100* as assessed by co-immunoprecipitation with THOC5 and THOC6 in patient-derived iPSC lines (Figure 2F). These findings suggest a THOC6-dependent association of ALYREF to THO, with implications for the affinity of other adaptors due to the potential disruption of TREX tetramer formation.
Thoc6 is required for mouse embryogenesis
To investigate Thoc6 pathogenic mechanisms in mammalian neural development in vivo, we used CRISPR/Cas9 genome editing to introduce an insertion variant in mouse Thoc6 exon 1 that resulted in a premature termination codon predicted to ablate Thoc6 expression (p.P6Lfs*8, herein referred to as Thoc6fs) (Figure 3A). Thoc6+/fs male and female mice do not display phenotypic abnormalities, but their intercrosses yielded no homozygous offspring. Analysis at selected embryonic days (E) of gestation confirmed Thoc6fs/fs littermates die in utero. Differences in embryonic morphology between wildtype (WT) and Thoc6fs/fs embryos were noted starting at E7.5, the earliest day of analysis. By E9.5, Thoc6fs/fs embryos were smaller with delayed development; however, the difference in the developing neocortex was particularly pronounced (Figure 3B). No body turning differences were observed. Consistent with embryonic lethality, THOC6 was undetectable in E8.5 Thoc6fs/fs mouse embryos relative to control littermates (Figure 3C). In E9.5 Thoc6fs/fs embryos, a developmental timepoint with high Thoc6 expression, Thoc6 is detectable by Western blot at greatly diminished levels (Figure 3C). Embryonic lethality was confirmed by E11.5, indicating one functional allele of Thoc6 is essential for mouse embryonic development (Figures 3C and 3D).
Forebrain tissues of E8.5-E10.5 Thoc6 littermates were characterized to identify neurodevelopmental changes in Thoc6fs/fs pups. Immunohistochemistry of the telencephalic vesicles revealed a consistently thinner neuroepithelium (PAX6) in Thoc6fs/fs compared to Thoc6+/+ littermate controls (Figures 3E,3F, S2B, and S2D). While E9.5 neuroepithelium showed a relative increase in mitotically active cells (PH3), widespread apoptosis (Cleaved Caspase-3 (C.CASP3)) was noted (Figures 3E and 3F). These findings are consistent with proliferative defects noted in Thoc6fs/fs tissue, suggesting THOC6 is important for corticogenesis.
Global mRNA export is not altered in THOC6 models of human in vitro neural development
Given the prominent link between the THO subcomplex and RNA export in current literature, we first sought to investigate the impact of THOC6 variants on RNA nuclear export functions in human neural progenitor cells (hNPCs) differentiated from hPSCs (Figure 4A). NPCs are the embryonic cell population frequently implicated in the developmental mechanism of primary microcephaly, a TIDS clinical feature. Defects in mRNA export are typically observed as differential accumulation of polyadenylated (polyA+) mRNA in the nucleus, enriched at nuclear speckle domains44. Standard oligo-dT fluorescent in situ hybridization (FISH) was performed on hNPCs to visualize polyA+ mRNA signal in nuclear and cytoplasmic cellular fractions (Figures S3A, S3B, and S3C). Comparison of the nuclear-to-cytoplasmic (N/C) polyA+ signal intensity ratios across genotypes indicates a slight reduction in THOC6 affected samples, suggesting a trend towards nuclear reduction in affected hNPCs relative to THOC6+/+ and heterozygous unaffected control hNPCs (Figure S3C). The significance of this modest export finding is evident when the N/C polyA+ signal intensity ratios are compared to THOC6+/+ hNPCs treated with wheat germ agglutinin (WGA), a potent inhibitor of all nuclear pore transport and the positive control for export changes45 (Figure S3A), Relative to all THOC6 genotypes, WGA treated hNPCs have a significantly higher N/C ratio (Figure S3C) attributed to strong polyA+ mRNA accumulation in the nucleus (Figure S3B). Although bulk mRNA export is largely unaltered in THOC6-affected hNPCs, a THOC6 dependent tetramer TREX export cannot be ruled out for specific polyA+ or in mRNP processing functions upstream of export—TREX functions that have not been explored in hNPCs.
THOC6 depletion reveals TREX function in pre-mRNA splicing in hNPCs
Proper mRNP processing, including mRNA splicing, is required for TREX-dependent RNA nuclear export, linking these steps in mRNA biogenesis. Co-transcriptional recruitment of TREX to the 5’ end of maturing mRNPs coupled with described TREX associations with splicing factors46 led us to investigate features of mRNA processing for vulnerability to loss of THOC6. To capture RNA processing differences caused by loss of THOC6 function, we performed RNA-sequencing (RNAseq) on ribosomal (r)RNA-depleted RNA extracted from wildtype and heterozygous unaffected and homozygous affected hNPCs (Table S4). Principal components analysis (PCA) of RNAseq data demonstrates reproducibility across, and distinct transcriptomic differences between the affected hNPC replicates compared to the unaffected control hNPC replicates (Figure S4A). Genotype driven differential expression and splicing changes were assessed. To investigate a THOC6-dependent role for TREX in splicing, comparative splicing analysis was carried out using the rMATS pipeline on biallelic THOC6E188K/E188K and THOC6W100*/W100* samples versus heterozygous controls (Table S5) 47. A combined total of 3,796 significant alternative splicing (AS) events were detected in affected hNPCs, representing the major AS types: skipped/cassette exon (SE), alternative 5’ splice site (A5SS), alternative 3’ splice site (A3SS), retained intron (RI), and mutually exclusive exon (MXE). The most overrepresented AS events observed in affected cells were SE (56%, 2136 of 3796) and RIs (21%, 784 of 3796). The high frequency of RIs is notable and unique relative to splicing defects identified in LOF models of other splicing factors, as well as for normal splicing patterns in neural development (Figure 4B)48-53. AS events in affected hNPCs show comparable inclusion and exclusion of AS junctions, with a slight trend towards inclusion due in part to the high frequency of RIs (Figure 4B). SE and RI splicing events occur by distinct molecular mechanisms, mediated by EJC pathways. Detection of defects in both splicing categories suggests the THO tetramer serves as a molecular platform for coordinating complex splicing events, as opposed to regulation of a specific subset of splicing events controlled by association with and function of individual RNA splicing factors. In agreement with this finding, we did not find consistent motif enrichment for specific RNA-binding proteins at AS junctions. These findings implicate a novel role for THOC6-dependent TREX splicing in mRNP processing in hNPCs.
Since AS motif enrichment analysis of THOC6-affected and control hNPCs transcriptomic data did not reveal trans-regulatory elements responsible for the differential splicing patterns, it was posited that cis-elements may underlie these differences. A maximum entropy model that assesses short sequence motif distributions was used to test the strength of the donor (5’) and acceptor (3’) of AS events54. A general trend towards weaker splice sites were detected at differential SE, RI, and A3SS events in affected cells (unpaired two-tailed t test, Figure 4C). The SE, RI, and A3SS events were enriched in genes with a disproportionately high number of isoforms that show dependence on weak, alternative/cryptic splice sites to facilitate isoform diversity (Figure 4D)55. RI events in affected hNPCs also had weaker splice sites compared to controls, suggesting that THOC6 deficiency induces mis-splicing at weak splice sites. In addition, the AS SE, RI, and A3SS events in affected THOC6 hNPCs impacted exons/introns that are significantly longer than nonsignificant events (Figure 4E). Likewise, the length of introns retained in RI events were significantly longer, with a 1.4-fold increase in length quantified for significant RI events (P=<0.0001, unpaired two-tailed t test, Figure 4E). Lastly, no positional bias was observed for AS events (Figure S4D and S4F). To validate our bioinformatic analysis, AS inclusion trends in select, top, shared events were validated by qRT-PCR, demonstrating a high correlation (unpaired two-tailed t test, Figure 4F). Together, the detected RNA processing signature across diverse SE and RI events at weak splices sites suggest impaired splicing fidelity from loss of THOC6.
To investigate the role of RNA misprocessing in ID pathology, we intersected our AS events with the genes that are known to cause syndromic ID, deposited in the SysID database (SysIDdb). 152 genes with significant AS events included or excluded in >10% of transcripts in hNPCS were detected in nonsense and missense affected genotypes (Figure 4G). 185 AS genes in THOC6W100*/W100* and 105 AS genes in THOC6E188K/E188K hNPCs are known genes causative for syndromic ID represented in the SysIDdb (Figure 4G). Aberrantly spliced ID genes were identified in both THOC6 affected genotypes, consistent with a role for THOC6 in ID. 37 ID genes (1.3% of SysIDdb) are AS in both affected genotypes, identifying genes for shared mechanisms that may preferentially contribute to TIDS pathology. To identify biological mechanisms implicated by THOC6-dependent AS, biological pathway enrichment analysis was performed on mis-spliced genes in affected cells. Genes with differential splicing were significantly enriched for functions in RNA splicing, cell projection organization, membrane trafficking, organelle organization, mitosis cell cycle, and DNA damage response (Figure 4H). RNA processing is tightly controlled by feedback loops (e.g., auto-repression by poison exons or intron retention), which would explain how effects on cis elements may lead to changes in trans factors (i.e., AS events in splicing regulatory factors).
THOC6-dependent mRNP processing is required for hNPC proliferation and differentiation
Alternative mRNA processing events, such as SE and RI in mRNA splicing, can impact gene expression through different mechanisms; isoform ratio differences versus inclusion of premature termination codons that initiate NMD. To investigate the impact of AS events on expression, we performed differential gene expression analysis on THOC6E188K/E188K versus THOC6W100*/+, and THOC6W100*/W100* versus THOC6W100*/+ dyads (Figure S5B). Among the 336 differentially expressed genes (DEGs) in THOC6E188K/E188K hNPCs, only 13 had splicing defects (p = 5.3x10-3, Fisher’s exact test) compared to 46 mis-spliced DEGs (of 661 DEGs, p = 4.2x10-7, Fisher’s exact test) in THOC6W100*/W100* hNPCs, indicating a subtle effect of mis-splicing on expression (Figure 5A). Notably, there are nearly double the number of AS genes (ASGs) (435 E188K; 741 W100*) and DEGs (336 E188K; 661 W100*) in THOC6W100*/W100* hNPCs, suggesting that the nonsense genotype has a greater impact on splicing and gene expression (Figure 5A). Relevant for TIDS pathology, 20% (68 of 336; p = 1.5x10-5, Fisher’s exact test) of THOC6E188K/E188K DEGs and 18% (118 of 661; p = 9.7x10-6, Fisher’s exact test) of THOC6W100*/W100* DEGs are syndromic ID genes, which conveys important information pertinent for understanding the pathogenic mechanisms of TIDS (Figure 5A).
Retained intron AS events are often subject to NMD in protein-coding genes (PCGs), while intron inclusion in lncRNAs alters nuclear export and conformation. Given the high number of RI events detected in THOC6 affected hNPCs, we tested the correlation between differential expression and RI events. For this analysis, gene-level mRNA abundance fold-change in affected hNPCs was correlated to the change in intron inclusion within transcripts from PCGs or non-coding RNA loci, referred to as percentage spliced-in (DPSI). We observed a trend in both THOC6-affected genotypes that lower gene expression correlates with greater intron inclusion (slope, p = 0.0045 for THOC6W100*/W100*and p = 0.0002 for THOC6E188K/E188K, simple linear regression) (Figure 5B). Conversely, the quadrant representing differential intron exclusion and elevated expression was prominently represented by lncRNAs, where intron exclusion is implicated in impaired lncRNA function. The three significantly dysregulated ASGs represented in both affected genotypes were MEG3, PAX6, and POSTN. Consistent with the observed trend between RI events and expression, analysis of all DEGs revealed that PCGs make up the largest portion of DEGs (with a greater portion, 96.45% affected in downregulated genes), while the portion of lncRNAs is highest in upregulated genes (6.29%; compared to 1.98% of non-significant genes), reflecting the molecular differences between these distinct mRNA subtypes.
Additional mRNA characteristics that may account for a portion of the observed differential expression are gene length and isoform number. In affected hNPCs, significantly more transcripts from long genes with on average of less than 10 annotated isoforms were identified compared to non-significant genes (Figures 5D and 5E). The trend towards DEGs with fewer transcript isoforms in affected hNPCs suggest alternatively spliced transcripts are more stable in affected cells (Figure 5C). These findings again reflect a requirement for the larger THOC6-dependent TREX tetramer complex function in facilitating mRNP processing of long mRNAs with high expression in brain.
To identify biological pathways predicted to contribute to THOC6 neuropathology, DAVID analysis was performed to identify biological categories defined by DEGs inTHOC6 affected hNPCs (Figure 5D). Downregulated genes are enriched in integrin cell adhesion, extracellular matrix interactions, PI3K-AKT signaling, and TGF-b signaling pathways, which are critical for brain development (Figure 5F). PI3K-AKT/mTOR signaling regulates cortical NPC proliferation, differentiation, and apoptosis56,57. Over 30 genes attributed to the PI3K-AKT/mTOR signaling pathway were downregulated in affected cells, accounting for the significant enrichment (p = <1 x 10-13). HAPLN1, MYC, BMPR1B, DCN, FBN1, INHBA, ID4, THBS1, TGFB2, DEGs enriched in the TGF-b signaling pathway (p = <0.001), have direct implications for TGF-b signaling in neural induction, differentiation, and NPC fate specification in TIDS developmental mechanisms58,59. Complementary pathways enriched with upregulated DEGs implicate multipotency (OCT4, PAX6), proliferation, neuron differentiation and WNT signaling pathways (p = <1 x 10-6, p = <0.001, p = <0.001, and p = <0.001, respectively). WNT signaling is known to promote NPC self-renewal expansion during corticogenesis60,61. Shared dysregulation of mTOR, TGF-b, and WNT signaling, coupled with upregulation of multipotency factors in affected genotypes, suggests defects in hNPC multipotency and neural differentiation underlie TIDS pathogenesis.
To identify transcription factor networks dysregulated in THOC6-affected hNPCs, transcription factor motif enrichment analyses was performed. Significant enrichment of MEF2, LHX3, and SRF target genes was observed in heterozygous controls compared to both affected genotypes (Figure S5D). Using a second analysis tool, ChEA3, differential expression of SOX, FEZF, FOX, and GLI target genes, and downregulation of HEYL, TWIST, FOX, MEOX2, PRRX2, and MKX target genes were enriched in affected hNPCs (Figure S5E). These transcription factor networks are important for neuronal differentiation and fate specification62-64, concordant with GSEA findings. Together, these results suggest that gene expression programs that modulate timing of the switch from neural proliferation to differentiation are altered in TIDS.
To refine specific candidate genes implicated in shared TIDS neuropathology, DEGs between affected THOC6 genotypes were intersected. 12 genes were upregulated and 117 were downregulated in affected hNPCs, with notable lncRNAs represented. Significant enrichment was detected in Integrin 1 pathway and extracellular matrix protein interaction networks (Figure S5C). Using mRNA obtained from three additional replicate differentiations of hNPCs per genotype, significant upregulation of MEG3, MEG8, ESRG, and NEAT1 lncRNAs was confirmed by qRT-PCR (Figures 5G). RNA FISH confirmed increased expression of MEG3 in affected hNPCs compared to controls, with elevated signal observed in both nuclear and cytoplasmic fractions (Figure 5H). Upregulation of functional lncRNAs NEAT1 and MEG3 has been linked to activation of WNT activation and suppression of TGF-b signaling, respectively65,66. Concordant with these findings, the protein level of WNT and TGF-b signaling components in THOC6-affected hNPCs exhibit a corresponding differential up- and down-expression relative to controls. Specifically, WNT signaling components WNT7A and TP53 showed increased protein expression, with higher abundance detected in affected hNPCs (Figure 5I). TGF-b pathway proteins HAPLN1 and TGFB2 also showed reduced protein expression in affected hNPCs together with high CEMIP and DKK2 (Figure 5I). We propose that loss of THOC6 leads to lncRNA-mediated dysregulation of key developmental signaling pathways which has implications for the balance of proliferation and differentiation during neural development.
Apoptotic upregulation and retained intron enrichment in Thoc6fs/fs E9.5 mouse forebrain
To investigate the conserved THOC6-dependent TREX functions that account for divergent phenotypic outcomes between mammalian models, mRNP processing was assessed in E9.5 mouse brain using complementary RNAseq experiments to those performed in hNPCs (Figures 6A and S6A). Three biological replicates were analyzed per genotype (Thoc6+/+, Thoc6fs/+, Thoc6fs/fs). Fewer significant AS events (FDR <0.05) were detected in Thoc6fs/fs E9.5 brain than in affected hNPCs, but the pattern of AS events was recapitulated, with the majority of AS events categorized as SE (45%) and RI (26%) (Figure 6B). Greater than 40 PSI was quantified in the Thoc6fs/fs transcriptome and retained and excluded intron events in Cenpt, Admts6, and Fam214b were validated (Figures 6C and S6B). Maximum entropy model analysis of splice junctions revealed significantly weaker 3’ splice site strengths for SE events and weaker 5’ splice sites associated with RI events in the Thoc6fs/fs mouse model. While this signature of splice site weakness is more modest in mouse than in human THOC6 models, these findings suggest a conserved role of THOC6-dependent TREX tetramer in coordinating mRNA processing that precedes TREX export functions (Figures 6D and S6C).
Notably, biological pathway and network enrichment analysis of AS genes identified mRNA processing, pre-miRNA processing, de-adenylation of mRNA, central nervous system development, forebrain development, multicellular growth, response to oxidative stress, cytoskeletal organization, and neuron projection (Figure S6D) — several of the biological categories associated with hNPCs ASGs. These shared findings suggest selective conservation of mRNP processing mechanisms by THOC6 in mouse and human forebrain.
To assess the correlation between THOC6 mRNP processing defects and expression, differential expression analysis of Thoc6fs/fs forebrain mRNA sequencing data was performed compared to Thoc6+/+ controls. Of note, Thoc6 mRNA is downregulated (two-fold, p=<0.0001), consistent with NMD (Figures 6E and S6F). In this model, 5x more genes were upregulated (144 genes) than downregulated (27 genes). Nevertheless, downregulated genes may covey important pathology. First, downregulated genes functionally converge on neurogenesis, proliferation, and differentiation pathways (Figure 6F). Upregulated genes are implicated in the hypoxic response, HIF-1 signaling pathway, and glycolysis—biological categories indicative of increased apoptosis in affected cells (Figure 6F). To investigate if altered transcription factor networks contribute to pathway dysregulation, we performed GSEA transcription factor motif enrichment analysis. HIF1, NRSF, SMAD3, and STAT3 target genes were enriched in Thoc6fs/fs E9.5 forebrain (Figure S6G). HIF-1 and STAT3 can induce apoptosis in response to hypoxia67,68, which is consistent the observed elevation of apoptosis in Thoc6fs/fs E9.5 neuroepithelium (Figures 3E and 3F). SMAD3 signaling is activated by TGF-b to promote cortical differentiation 58, suggesting shared disruption of TGF-b signaling in both human and mouse model systems.
DEGs shared between mouse and human model systems are consistent with conserved TIDS molecular pathology (Figure 6G). More Thoc6fs/fs DEGs overlapped with THOC6W100*/W100* (23 genes) than THOC6E188K/E188K (9 genes) samples, and include genes involved in neurogenesis, hypoxic response, and synapse regulation. Validation of Ier3, Islr2, Wnt7a, Kcnt2, Anax2, and Vegfa DEGs shared across affected models were confirmed by qRT-PCR in three additional E9.5 forebrain biological replicates for Thoc6+/+, Thoc6fs/+, and Thoc6fs/fs samples (Figures 6E and S6H). Overlapping affected human and mouse molecular mechanisms suggest shared pathology. However, the extent of upregulation of genes in response to increased apoptosis is exacerbated in mouse, highlighting species-specific phenotypic differences due to loss of THOC6.
Delayed differentiation and elevated apoptosis in THOC6-affected forebrain organoids
THOC6 pathogenesis in human cortical development was investigated using dorsal forebrain-fated organoids, neural differentiated from iPSC lines (Figure 7A). Forebrain organoids recapitulate the cellular heterogeneity and developmental dynamics of early corticogenesis69. Within each organoid, several neural rosette (NR) structures develop stochastically to recapitulate features of in vivo ventricular zone development, including hNPC proliferation and differentiation to cortical neuron fates (Figure 7A). NR morphology was evaluated in cortical organoids at 28 days of neural differentiation (ND) from three independent differentiations per genotype. To minimize the effect of inter-cell line NR variability, the following analyses focus on heterozygous unaffected and homozygous affected comparisons. To characterize the NR proliferative niche, the maximum thickness of the NR neuroepithelial center was measured as defined by N-cadherin immunostaining and pseudostratified NR cytoarchitecture by Hoescht staining (Figures 7B, 7C, S7A, and S7B). THOC6-affected organoids show significantly thinner pseudostratified neuroepithelium (p = <0.0001, two-tailed t test), concordant with reduced NR size composed of fewer cells (p = <0.0001, two-tailed t test) (Figures 7B, 7C, S7A, and S7B). Given the stark upregulation of apoptosis observed in Thoc6fs/fs E9.5 mouse forebrain, we investigated the contribution of apoptosis as a mechanism of reduced NR size in the THOC6-affected organoids (Figures 7B, 7D, S7A, and S7B). A significantly higher proportion of affected NR cells expressed the apoptotic marker C.CASP3, providing evidence for shared mechanism of altered corticogenesis (p = <0.0001, two-tailed t test) (Figure 7D).
To assess alterations in the timing of differentiation in affected NRs, we performed EdU-pulse labeling at day 21ND for 24 hours to label mitotically active cells, followed by organoid immunohistochemistry analysis at day 28ND (Figures 7E, 7F, S7C, and S7D). To assess the balance of multipotency and differentiation EdU, KI67, and DCX co-labeled cells per NR were quantified. A significant increase in cells co-stained with the proliferation marker KI67 and EdU per affected NR were detected at day 28ND (p = <0.0001, two-tailed t test), indicating affected NPCs remain mitotically active longer than control NPCs (Figure 7F). This finding paired with elevated mRNA and protein expression of OCT4 in affected hNPCs (data not shown) supports retention of multipotency model. Consistent with this finding, we observed a significant reduction in the fraction of EdU cells co-labeled with the migrating neuron marker doublecortin (DCX) in affected NRs (p = <0.0001, two-tailed t test) (Figure 7F). Paired with the prolonged proliferation dynamics, this suggests a disruption to the differentiation timeline in affected organoids.
To investigate effects of reduced NR growth on organoid size, we measured whole organoid cross section areas weekly from day 21 to day 42. Compared to the steady size increase of THOC6+/+ organoids, affected organoids showed a slower growth rate (E188K/E188K: p = 0.0122; W100*/W100*: p = 0.0362) (Figure 7G). Together, our findings implicate a pathogenic mechanism of delayed differentiation due to reduced NPC proliferative capacity and elevated apoptosis with subsequent cortical growth impairment in affected organoids.