So far, three balanced translocations and one microdeletion with hypogonadism or KS involving 12q24 have been reported 2–5. In one case reported in 1983, all three brothers of Vietnamese Chinese origin were revealed to have severe primary hypogonadism with 46,XY,t(1;12)(p32;q24) 4. In the second case, an apparently balanced translocation t(7;12)(q22;q24) in a male with KS and intellectual disability was published in 1990 2. Another case reported a balanced chromosomal translocation between the distal q arms of chromosome 4 and 12, t(4;12)(q25;q24.2), in a Turkish male patient in 1994, who showed IHH and a lack of secondary sexual characteristics yet had a normal sense of smell 3. After the three aforementioned balanced translocations with hypogonadism were published, a de novo interstitial deletion del(12)(q24.3q24.33) was described in a male with ambiguous genitalia and development delay in 1999 5.
We postulated that all four reported chromosomal rearrangements involving the 12q24 region could be explained by the haploinsufficiency of a gene responsible for KS or IHH. The positional cloning of balanced translocation patients was successful to identify KS genes at the translocation breakpoints 10,11. To investigate if the translocation might contribute to the phenotypes in the male Subject 1, we mapped and sequenced both translocation breakpoints to clone genes that may be involved in KS seen in this individual with t(7;12)(q22;q24) 13.
Physical mapping of the translocation breakpoint by FISH and Southern blot hybridization led to the identification of five genes at both breakpoints. ZNF804B mapped 37 kb distal from the breakpoint is the closest gene to the breakpoint at 7q21.139 and STEAP4 mapped 416 kb proximal to the breakpoint. The breakpoint at 12q23.1 directly disrupts the non-coding RNA RMST, whereas NEDD1 and PAFAH1B2P2 map closest to the breakpoint on the proximal and distal sides, respectively. Based on the molecular analysis results, the locations of the cytogenetic bands on both chromosomes 7 and 12 of the apparently balanced chromosome translocation have been precisely revised as t(7;12)(q21.13;q23.1) 9,13,14. We assumed that the causative gene for KS is located on chromosome 12 due to two previously reported chromosomal rearrangements displaying overlapping phenotypes of KS and screened for mutations in three-selected candidate genes (RMST, NEDD1, PAFAH1B2P2) based on their proximity to the 12q23.1 breakpoint. RMST is disrupted in intron 2, and mutation screening of this gene was performed in 48 KS subjects negative to ANOS1 and FGFR1. We identified a heterozygous nucleotide change in RMST (heterozygous change of C/C to C/T at the 214th nucleotide in exon 10 of RMST NR_152618.1) in a KS male alongside his mother with anosmia. The nucleotide change turned out to be a polymorphism, as two healthy sisters of the patient shared the same nucleotide change. Additionally, the two patients in this family were found to have a mutation of FGFR1 (c.821G > A, p.E274G) 26, suggesting this gene as the cause of the phenotype in this family. PAFAH1B2P2 and NEDD1, mapping 248 kb downstream and 513 kb upstream from the breakpoint, respectively, were also screened in the same collection of 48 KS patients we recruited, with no evidence of mutations.
Two additional genes were mapped in the breakpoint region at 7q21.13. ZNF804B, which has an unknown function, is located 37 kb downstream from the breakpoint; hence, it is the closest gene to the breakpoint 9. ZNF804B is a member of the zinc finger protein family, which is composed of four exons. ZNF804B has not previously been described as a disease-related gene with substantial evidence. There is limited information in the literature regarding its full biological functions, except it was found to be one of the candidate genes associated with autism spectrum disorder and neurodevelopmental disorders 27,28. This suggests that ZNF804B might be dysregulated by position effect and could be the cause of intellectual disability of Subject 1. In addition, STEAP4 at 7q21.12 is located 416 kb upstream from the breakpoint. Functioning as metalloreductase, STEAP4 resides in the Golgi apparatus and may be involved in adipocyte development and metabolism. Mutation screening of both genes in the same cohort of 48 patients we recruited did not detect any putative mutations, though polymorphisms were identified. Additionally, we screened for mutations in ANOS1 and FGFR1 in the t(7;12) translocation subject. No potential disease-causing mutations were found (data not shown). We cannot exclude the possibility of mutation of other causative gene(s) for KS in this subject.
Six percent of carriers of balanced translocations manifest abnormal phenotypes 29 due to the disruption of the genes at the breakpoints or dysregulation (i.e., position effect) resulting in reduced expression of the genes by the separation from its cis regulatory elements 30. Nonetheless, 40% of the patients with apparent balanced translocations were reported to have at least one deletion at one of the breakpoints or genomic regions elsewhere, suggesting that deletions might be common in apparently balanced chromosome rearrangements 16.
As we did not find any mutations in five candidate KS genes at or near both breakpoints from the apparent balanced translocation, we performed aCGH analysis, which remarkably unmasked a cryptic 4.7 Mb submicroscopic microdeletion at 12p11.21-12p11.23. This cryptic microdeletion was not detected in previous studies of the breakpoints of Subject 1 (DGAP032) on the molecular level 9,13,14.
There has been a report of chromosomal variation including CNVs in lymphoblastoid cell lines (LCLs). In their study, Shirley et al found more CNVs per sample on average for LCLs than for PBMCs, but the differences were statistically not significant. Furthermore, chromosomal regions associated with a significantly different number of CNVs did not include chromosome 12 31. Thus, it is unlikely that del(12)(p11.21p11.23) is a culture induced artifact, although we cannot exclude this possibility entirely. Unfortunately, the de novo status of this microdeletion could not be determined due to the unavailability of parental samples.
In a patient with two different concomitant genomic rearrangements such as an unbalanced translocation and a simultaneous translocation-unrelated microdeletion or microduplication, the culprit gene was found to be located at one translocation genomic breakpoint sometimes albeit rare. For instance, in a female subject affected with autism and ID with t(14;21)(q21.1;p11.2)dn and 2.6 Mb of microdeletion comprising 15 genes at 2q31.1, the causative gene LRFN5 (Leucine-Rich Repeat and Fibronectin Type III Domain-Containing Protein 5, MIM 612811) was found dysregulated at the 14q21.1 translocation breakpoint 32. In majority of cases, however, the disease gene is within CNV as exemplified in two positional ID candidate genes- VAMP8 (Vesicle-Associated Membrane Protein 8, MIM 603177) and RNF181 (Ring Finger Protein 181, MIM 612490) - identified in a cryptic 390 kb duplication region in a subject with unbalanced t(8;10)(p23.3;q23.2) 21.
The result of aCGH in Subject 1 raised a new possibility that the heterozygous 4.7 Mb interstitial deletion containing 29 genes at 12p11.21-12p11.23 (chr12: 27,003,224 − 31,687,824 /hg 38) (Fig. 1A and Fig. 2A) instead of the genomic breakpoints of the reciprocal translocation might harbor a KS gene in this subject. This hypothesis is corroborated by DECIPHER case 284660 (not listed in Fig. 2A), with a 7.09 Mb heterozygous deletion (chr12:22,444,774 − 29,533,886 [hg38]) exhibiting cryptorchidism and mild global developmental delay (https://decipher.sanger.ac.uk/). This microdeletion overlaps a 2.53 Mb genomic region with our subject with KS (chr12:27,003,224 − 29,533,886 [hg38]).
In addition to KS, Subject 1 with an unbalanced chromosome translocation presented with ID 2. Although the genes causing IHH associated with ID have been recently reported 33,34, more than one genes contributing to the comprehensive phenotype are alternative explanations in contiguous gene deletion syndromes represented by Potocki-Shaffer-Syndrome 7 or a deletion on chromosome X causing KS coupled with ID 35.
This microdeletion encompassing 29 genes is likely to harbor the disease genes involved in the common set of neurodevelopmental phenotypes shared with our seven studied CNV patients and eight unpublished CNV cases from the DECIPHER database (Fig. 2A, Tables 1 and 3).
Among 15 heterozygous CNVs at 12p11.2, duplications account for ten cases, whereas deletions represent the remaining five cases (Fig. 2A, Tables 1 and 3). At least one neurodevelopmental phenotype was seen in each case. Some of the CNVs were inherited from one parent with an unknown phenotype, whereas the inheritance of the remainders is unknown (Tables 1 and 3). This suggests that some genes in the CNVs located at 12p11.2 may have incomplete penetrance or epigenetic imprinting if a carrier parent is asymptomatic. We interpret the pathogenicity of deletion and duplication cases similarly since they both exhibit the similar neurological condition. This is because any CNV might interfere with the strict stochiometric control of genes on a protein expression level 36.
Based on in silico comparative genomic analysis at 12p11.21-12p11.23, we suggest one putative KS candidate gene, TSPAN11 (Tetraspanin 11). One missense variant c.203G > A (NM_001080509.3) leading to an amino acid substitution Glycine to Aspartic acid at position 68 was reported in a KS patient 37. This variant shows high deleterious CADD score of 25.8 (HG38).
Furthermore, seven intellectual disability candidate genes, TM7SF3, STK38L, ARNTL2, ERGIC2, TMTC1, DENND5B and ETFBKMT have been identified (Table 2).
Putative position effect of TM7SF3 and the inclusion of STK38L and ARNTL2 in CNVs at 12p11.23 along with their sporadic variants reported in neurodevelopmental disorder (NDD) patients will likely explain their candidacy. A de novo missense variant in TM7SF3 (Transmembrane 7 Superfamily Member 3, MIM 605181) was identified in a patient with a neurodevelopmental disorder 28,38. One missense variant in HNRNPL (Heterogeneous Nuclear Riboprotein L, MIM 603083), an interacting protein of TM7SF339, was described in an ID patient 40.
STK38L (Serine/Threonine Kinase 38 Like, aka NDR2, Nuclear Dbf2 Related Kinase 2, MIM 615836) regulates the morphology and division of neuronal cells 41–43, and integrin-dependent dendritic and axonal growth in mouse hippocampal neurons 44. Consequently, Stk38l KO mice exhibit arbor-specific alterations of dendritic complexity in the hippocampus 44. De novo nonsense and missense variants in STK38L have been identified in individuals with autism spectrum disorder 45 and schizophrenia 46, respectively. Genes mutated in schizophrenia are also mutated in autism and ID 46.
The third candidate gene of ID at 12p11.23 is ARNTL2 (Aryl Hydrocarbon Receptor Nuclear Translocator-Like Protein 2, MIM 614517), and its missense and nonsense variants were reported in patients with autism 47, and in developmental and epileptic encephalopathy 48, respectively. Proteins that physically interact with one another frequently participate in the same biological activity, and mutations in these genes may result in similar clinical features. Among the interactors of ARNTL2, CTTNBP2 (Cortactin Binding Protein 2, MIM 609772) 49 with 26 de novo genetic variants were identified in probands with autism/development delay 50. Another interactor UBE3A (Ubiquitin-Protein Ligase E3A, MIM 601623) 51 is a well-known Angelman syndrome gene 52 and its two frameshift variants were also reported in autistic individuals 53,54. Eight variants in PER2 (Period Circadian Regulator 2, MIM 603426), another interactor 55, were described in individuals with autism 45,56,57.
Two ID candidate genes, ERGIC2 and TMTC1, were found at 12p11.22. One frameshift variant in ERGIC2 (Endoplasmic Reticulum-Golgi Intermediate Compartment Protein 2, MIM 612236) has been reported in an individual with ASD 58. ERGIC2 physically interacts with SLC39A8 (Solute Carrier Family 39, Member 8, MIM 608732) 55,59 and CUX1 (Cut-Like Homeobox, 116896) 60, which are associated with autosomal recessive syndromic intellectual disability 61 and non-syndromic intellectual disability/development delay 62, respectively. As interactors of ERGIC2, two catalytic subunits of Rab GTPase activating proteins RAB3GAP1 (RAB3 GTPase-Activating Protein, Catalytic Subunit, 602536) 60 and RAB3GAP2 (RAB3 GTPase-Activating Protein, Noncatalytic Subunit, MIM 609275) 60 cause autosomal recessive Warburg Micro syndrome including developmental abnormality of the central nervous system, when mutated homozygously 63–65.
In addition, a de novo missense variant in TMTC1 (TransMembrane and Tetratricopeptide repeat Containing 1, MIM 615855) was found in a child with neurodevelopmental disorder 28. The Tetratricopeptide repeat (TPR) structural motif contained in this gene is reported to present as a functional domain in NAA1566, OGT67–69, TANC270, and TTC2571, all of which are associated with autism and intellectual disability. TMTC1 interacts with BCOR (BCL6 Corepressor, 300485) 72 and VIRMA (Vir like M6A Methyltransferase Associated, MIM 616447) (aka KIAA1429) 73. The mutations of the former cause Lenz microphthalmia, an X-linked syndromic intellectual disability 74; however, the variants of the latter were found in individuals with ASD 45, developmental delay 28, schizophrenia 45,46, and Tourette syndrome 75.
At 12p11.21, we identified two additional ID candidate genes. One ID candidate gene is DENND5B (DENN Domain Containing 5B, MIM 617279), a guanine nucleotide exchange factor (GEF) mediating the activation of small GTPases. With many downstream targets, they function as molecular switches in intracellular signaling pathways. There are some known GEFs involved in NDDs. Apart from IQSEC2 associated with X-linked intellectual disability, many variants in GEFs HERC127,76, TRIO77,78, ARHGEF979,80, and ARHGEF1081 have been reported in patients with intellectual disability, epilepsy, and/or autism. VAV3 (VAV Guanine Nucleotide Exchange Factor 3, MIM 605541) identified as an NDD candidate gene at 1p13.3, because of the KO mouse phenotype, its genomic position, and reported variants, is a GEF 82. Variants in RAB11A (RAS-Associated Protein, MIM 605570) 83 and GRB10 (Growth Factor Receptor-Bound Protein 10, MIM 601523) 84, interactors of DENND5B, are found in individuals with developmental and epileptic encephalopathies 48,85.
The second ID candidate gene, ETFBKMT (electron transfer flavoprotein subunit beta lysine methyltransferase, MIM 615256) known as METTL20 (Methyltransferase like 20), is a lysine methyltransferase. Some lysine methyltransferases such as KMT2C (Lysine Methyltransferase 2C, MIM 606833, aka MLL3), SETD1B (SET Domain Containing 1B, MIM 611055, aka KMT2G, Lysine-specific Methyltransferase 2G) 23,86, EHMT1 (Euchromatin Histone Lysine Methyltransferase 1, MIM 607001) 87,88, and KMT5B (Lysine Methyltransferase 5B, MIM 610881) 89 are well known to be associated with neurodevelopmental disorders. On the protein level, ETFBKMT interacts with TUBB2A (Tubulin, Beta-2A, MIM 615101) 90, TUBB4A (Tubulin, Beta-4A, MIM 602662 ) 90, DARS2 (Aspartyl-tRNA Synthetase 2, MIM 610956) 55, and GLS (Glutaminase, MIM 138280) 91, which are associated with NDDs. TUBB2A is associated with seizures, ID and development delay 92, while TUBB4A mutations cause leukoencephalopathy hypomyelination with atrophy of the basal ganglia and cerebellum 93. DARS2 is genetically linked to leukoencephalopathy with brain stem and spinal cord involvement 94,95. Trinucleotide expansion in GLS causes development delay, ataxia, and cerebellar atrophy 96.
DDX11 (DEAD/H-Box Helicase 11, MIM 601150) associated with autosomal recessive Warsaw Breakage syndrome with intellectual disability 97 was excluded due to its bi-allelic inheritance pattern.
Interestingly, four genes, INTS13, PPFIBP1, REP15, and FAR2, are likely candidates for KS coupled with ID at 12p11.2 (Fig. 2A).
INTS13 (Integrator Complex Subunit 13, MIM 615079) also known as ASUN (Asunder, Spermatogenesis Regulator) is mapped 65 kb distal from the 12p11.23 telomeric breakpoint of the 4.7 microdeletion. Although it is not directly encompassed in the 4.7 Mb deletion at 12p11.21-12p11.23, it might be dysregulated by a positional effect 30 in the KS phenotype seen in this patient with an unbalanced chromosome translocation 2. Three variants in this gene are reported in NDD patients (Table 2). INTS13 is also a critical regulator of spermatogenesis in Drosophila melanogaster. The study showed that knockout of this gene in Drosophila caused a defect in spermatogenesis, showing spermatocyte arrest during prophase of meiosis I 98. Another study revealed that germline expression of mouse Asun rescued sterility and dynein mislocalization in Asun mutant flies 99
PPFIBP1 (PPFIA Binding Protein 1, MIM 603141) has been found to interact with TACR3 (Tachykinin Receptor 3, MIM 162332) 55, a gene mutated in patients with hypogonadotropic hypogonadism 100. Among the interacting proteins of PPFIBP1, YWHAG (Tyrosine 3-Monooxygenase/Tryptophan 5-Monooxygenase Activation Protein, Gamma Isoform, MIM 605356) 55, KRAS (KRAS Protooncogene, GTPase, MIM 190070) 60, NRAS (NRAS Protooncogene, GTPase, MIM 164790) and HRAS (HRAS Protooncogene, GTPase, MIM 190020) 101, CUL3 102, and SNAP29 (Synaptosomal-Associated Protein, 29-KD, MIM 604202) 55 suggest a neurodevelopmental role of PPFIBP1. Five missense variants in YWHAG were reported in patients with developmental and epileptic encephalopathy 103, and KRAS104 and NRAS105 are associated with Noonan syndrome, whereas HRAS is involved in Costello syndrome 106. Apart from distinct facial dysmorphism, both syndromes share a neurodevelopmental phenotype. CUL3 mutations cause neurodevelopmental 77 and autism spectrum disorder 50. SNAP29 is genetically associated with Cednik syndrome including neuropathy 107, and schizophrenia 108.
The second candidate gene REP15 (RAB15 Effector Protein, MIM 610848) interacts with SLC4A2 (Solute Carrier Family 4, Member 2, MIM 109280) 55. Histopathologic analysis of Slc4a2 KO mice revealed an interruption in spermiogenesis leading to infertility 109. Moreover, REP15 interacts effectively with TLK2 (Tousled-Like Kinase 2, MIM 608439) 55, which is associated with neurodevelopmental delay 110, autism spectrum disorder 58 and schizophrenia 111.
FAR2 (Fatty Acyl-CoA Reductase 2, MIM 616156), the third candidate gene for KS coupled with ID, physically interacts with the zona pellucida glycoprotein 2 (ZP2, MIM 182888) 55, variants of which were found in females with infertility 112–116. FAR2 interacts with ATP2B2 (ATPase, Ca (2+)-Transporting, Plasma membrane, 2, MIM 108733) 55,59, variants of which are found in patients with autism spectrum disorder 58,117. KCNA2 (Potassium Channel, Voltage-Gated, Shaker-Related Subfamily, Member 2, MIM 176262) interacting with FAR2 55, is associated with epileptic encephalopathy 118 and epilepsy 119–122. Another FAR2 interacting protein, CUL3, is associated with neurodevelopmental disorders 77 and autism spectrum disorder 50.
Collectively, INTS13, PPFIBP1, REP15, and FAR2 are good candidate genes for KS combined with NDDs at 12p11.22-12p11.23.
PTHLH (parathyroid hormone like hormone MIM 168470) is associated with brachydactyly 123 and explains this phenotype in the subject DCP308811. Bi-allelic variants of IPO8 (importin 8 MIM 605600) are linked to cardiovascular defects, skeletal anomalies, and immune dysregulation 124. We excluded both genes due to the phenotype unrelated to KS and ID.
We propose that, despite not being encompassed by small CNVs we used in comparative genomic mapping, the expression levels of the two ID candidate genes DENND5B, and ETFBKMT could be altered due to position effect 125,126, providing a likely explanation for NDD phenotypes such as dystonia, global developmental delay, growth delay, motor delay, etc., observed in one DECIPHER proband DCP288321 (Fig. 2A and Table 3).
We also confirmed the high expression of our candidate genes in five different human tissues (i.e., brain, fetal brain, muscle, ovary, and testis) relevant to the phenotype of KS and NDD, to substantiate their pathogenicity (Fig. 4B).
During the composition of this manuscript, genome sequencing was performed in this same patient to map the translocation breakpoint, with the conclusion that deletion of RMST was implicated as a cause of KS through loss of function by the erroneous assumption that this chromosome translocation is balanced 14. RMST physically interacts with SOX2 127, a transcription factor known to regulate neural fate, and aids in the binding of SOX2 to the promoter of target genes important in neurogenesis 127. SOX2 (SRY-box transcription factor 2, MIM 184429) is a known disease gene for hypogonadotropic hypogonadism and combined pituitary hormone deficiency 1. RMST has also been associated with rhabdomyosarcoma and melanoma 128.
This ostensible pathogenicity of RMST in KS remains to be seen, because this subject has an unbalanced translocation accompanied by an additional 4.7 Mb microdeletion that we identified. Moreover, we did not find any mutations of this gene in the 48 KS patients we recruited. Our case underscores the necessity and significance of aCGH or sequencing analysis in individuals with disease-associated apparently balanced translocations to rule out cryptic microdeletions. At the same time, this study highlights the benefit of the integrated usage of karyotype analysis, aCGH and sequencing for an informed approach to phenotypic assessment.
In summary, we found that an apparently balanced translocation t(7;12)(q22;q24) 2,13,14 is actually unbalanced and the 4. 7 Mb cryptic deletion at 12p11.21-12p11.23 we identified is likely to explain the phenotype of KS and ID in the subject carrying these two unrelated chromosomal rearrangements. In silico comparative genomic mapping with additional 14 CNVs in this genomic region identified one potential KS candidate gene (TSPAN11), seven candidate genes for neurodevelopmental disorder (TM7SF3, STK38L, ARNTL2, ERGIC2, TMTC1, DENND5B, and ETFBKMT) and four candidate genes for KS with ID (INTS13, REP15, PPFIBP1, and FAR2). The candidacy of these genes was further supported by the high-level expression pattern in the relevant human tissues. We propose that some dosage-sensitive genes, either increased or decreased, in this genomic region might contribute to the sexual and/or cognitive impairment in the patients with KS and/or ID. Among our candidate genes, the probabilities of dosage sensitivity, such as pHaplo and pTriplo of at least three genes, are high (STK38L: pHaplo/pTriplo 0.77/0.94, DENND5B: 0.96/0.96, PPFIBP1: 0.8/0.7) 129, suggesting that both its increase or decrease can result in a related deleterious phenotype. RT-qPCR or western blot of candidate genes encompassed in both a deletion and duplication will help generate substantiating evidence for this hypothesis.
It is well known that heterogeneous neurodevelopmental phenotypes are caused by mutations in the same gene130. Given that our seven CNV subjects show diverse neurodevelopmental phenotypes including intellectual disability, autism, and epilepsy, our candidate genes at 12p11.21-12p11.23 will offer an opportunity to identify NDD disease genes from NGS databases containing a myriad of autosomal dominant or de novo VUSs.