The Sudanese population is characterized by a complex genetic structure and high consanguinity rates (6, 10). The increased homozygosity in our cohort was reflected by the predominance of mono-allelic recessive diseases (73%) and the detection of three established/possible founder variants. Two of these founder variants were in ADAT3 and PRUNE1 genes as we reported previously (19, 27). The third possible founder variant, NM_024306.4(FA2H):c.674T > C (p.Leu225Pro), was detected in two unrelated families, F61 and F68, that descended from different tribes in Kordofan province, western Sudan. Nevertheless, we also identified autosomal dominant and X-linked (hemizygous) conditions in several families. Most of our families originated from the central parts of Sudan. This can be attributed either to differences in the accessibility to the health system and our collaborating clinics or genuine differences in the frequency of genetic diseases between central Sudan populations and other Sudanese populations. We favor the first explanation as other consanguinity-linked genetic diseases, such as sickle cell anemia, are common in non-central parts of the country (10).
All age groups were represented in our cohort, particularly those < 18 years, indicating the degree of care provided to this age group by their families. On the other hand, we have patients with childhood-onset diseases who were first examined after their forties (after decades of disease duration, > 40 years in two patients), epitomizing the long-term odysseys of patients with genetic diseases and underlining the importance of genetic diagnosis for patients and families satisfaction. Also, the percentages of males and females in our cohort were approximately equal, signifying the absence of gender-based inequalities in the accessibility of care and minimizing the contribution of X-linked dominant inheritance to SCDs in our cohort.
Previously, we screened 25 Sudanese families with HSP for mutations in 68 known HSP genes using NGS targeted gene panel (28). We reached a genetic diagnosis in 28% of these cases (28), a diagnosis rate very similar to Portuguese (29) and European (12) patients. This last study, (ref. 12), showed that combining the HSP panel with subsequent WES increased the diagnosis rate up to 50% when focusing on OMIM disease-related genes. WES used to further identify novel genes was shown to give a diagnostic yield of up to 75% (30). In the current study, by using multiple genetic approaches, we identified disease-causing variants in known SCDs genes in 63–73% of the studied families (our overall diagnostic success rate if we consider our previous cohort (ref 11) is 52–59% (31–35/59 families)). Furthermore, extending the analysis to novel genes, we identified variants in novel candidate genes in seven out of the ten remaining families, potentially raising our diagnostic success rate ceiling to 92% instead of 73% (one of those seven novel causative genes has been reported (31) and the others are under validation). According to the results of our two studies, most of the major autosomal recessive SCDs genes are present in Sudan (SACS, SPG11, FXN) and some of the major dominant ones as well (e.g., SCA3), but there is no single major gene causing SCDs in Sudan. This might result from the position of Sudan in east Africa, at the frontiers between North Africa, the Middle East, and sub-Saharan Africa.
WES outweighs NGS targeted gene panel in discovering new SCDs genes (4). However, based on our experience with the Sudanese population, and the experience of others, exome sequencing also significantly outweighs NGS-targeted gene panels in diagnosing known SCDs phenotypes, particularly in complex phenotypes (32). Furthermore, WES enables the extension of phenotypes previously associated with mutations in certain genes in contrast to conservative NGS-targeted gene panels that target only the phenotype of interest. For instance, we extended the phenotypes associated with mutations in CCDC82 and CCDC88C in the current Sudanese cohort by using WES. CCDC82 was reported previously to cause an intellectual disability syndrome (33, 34). We expanded the CCDC82-linked phenotype to include spastic paraplegia (19). Later, another report of a patient of Pakistani origin confirmed that spasticity is part of the CCDC82-linked syndrome (35). Similarly, we expanded the presentation of mono-allelic mutations in CCDC88C to include early-onset pure spastic paraplegia (36). Before, mono-allelic gain-of-function CCDC88C mutations were only associated with spinocerebellar ataxia SCA40 (37). In this report we also potentially extended the phenotype of DMXL2-linked disorders to include complex HSP.
In our opinion, the higher diagnostic success rate of WES overrides its technical difficulties when compared to NGS targeted gene panel upon studying diseases with overlapping phenotypes like SCDs, particularly when considering the increasing technical feasibility of WES (38). However, WES is less efficient for rearrangement detection than panels of genes, usually optimized for such discovery, as discussed (ref, 12). An issue in SCDs is the detection of nucleotide repeat expansions that require independent specific techniques but there are improvements of some algorithm for such quest in WES data and in genome sequencing (39).
In conclusion, up-to-now, SCDs in Sudan are caused by multiple genes; none of them significantly predominate over the others. The use of multiple genetic approaches that included WES enhanced the diagnosis of known SCDs phenotypes and the potential discovery of new SCDs genes.