We demonstrate that the DoBSeq approach to mass genetic screening for rare disease-causing variants is a reliable and cost-effective method, which is likely to be highly scalable and hence applicable to population screening. The high performance and ability to directly pinpoint carriers of genetic variants are a product of the repeated sequencing of the same individual in two separate batches.To eliminate interpretation biases in the present analysis, we purposefully excluded variants that were internally classified as pathogenic (examples in Figure 2).
Surprisingly, our data showed that false positive LoF/P variants with passable quality parameters were widespread when the sequencing results from stand-alone batches were assessed in isolation (Figure 1A, 1B, 1E & 1F). However, by filtering variants to those seen in a combination of one column and one row batch, i.e., appearing twice in two different batches, could we distinguish between true and false positive variants. This suggests that single batched sequencing (where carriers cannot be pinpointed) would require copious individual resequencing, often in the pursuit of ultimately false positive variants, rendering the approach untenable. Moreover, considering the high number of RDs in the general population, even the pursuit of true positive variants could become costly if dozens or even hundreds of genes were screened simultaneously and each finding required resequencing of each individual in the singleton batch.
Currently, well over 400 gene-disease pairs are considered highly actionable in childhood with respect to age-of-onset and/or timing of intervention, and at least an additional 25 are highly actionable in adulthood15. Each of these conditions may be considered viable candidates for population screening, yet, primarily due to cost restrictions, few are routinely screened for in any healthcare system and no screening currently uses genomics up-front7,16.
A recent simulation model evaluated the cost-effectiveness of universal screening for a panel including 11 pediatric CPS genes17, which fully overlapped with the 113-gene panel used in our study. Because RDs, such as pediatric CPSs, by definition have very low prevalence, the number-needed-to-diagnose (NND) in an unselected cohort is high. For individual rare conditions, such as WT1-related disorders, the price per diagnosis (PPD) at the estimated cost of doing a single genetic test likely exceeds $10 million (Figure 3C). Of course, by using panels, several conditions can be screened for simultaneously, lowering the NND for any one condition. For the 11 pediatric CPS genes, Yeh et al.17 conclude that at a price per sample (PPS) of $55, tumor surveillance strategies are cost-prohibitive ($244,860 per life-year gained), yet, at a PPS of $20, such strategies approach liberal cost-effectiveness (<$100,000 per life-year gained) (Figure 3D). Practically, the PPD must be added to the isolated cost of the tumor surveillance strategy, making it increasingly likely that any given treatment, which is cost-effective in isolation, will remain cost-effective as PPD is reduced. According to the models developed by Yeh et al., the tumor surveillance strategies and treatments available for the 11 conditions they studied will be even conservatively cost-effective (<$50,000 per life-year gained) if PPS drops below $8, which we estimate is possible when DoBSeq is run at scale (Figure 3D).
DoBSeq has the potential to advance the rate of presymptomatic diagnosis, and the cost-effectiveness of doing so, for hundreds of actionable RDs caused by rare genetic variants. This is by no means limited to DNA samples from neonatal blood spots. Several studies have investigated the cost-effectiveness of population screening for adult CPSs18–22. One such study22, investigating the impact of PPS on universal adult screening for BRCA1/2 and MMR genes, found it to be conservatively cost-effective even at PPS exceeding $1000. Lowering PPS to those estimated for DoBSeq running at scale could lead to prices per life-year gained that approach cost-saving (Supplemental Figure 10 & 11). Most adult CPSs are believed to be undiagnosed, and therefore population based cancer screening stands to improve with genetically informed precision prevention21.
The promising aspects of DoBSeq must be viewed in light of some limitations of the method and of this study. If multiple individuals in the same DoBSeq matrix carry exactly the same variant it may not be possible to pinpoint the carriers directly. In this event, however, the number of possible carriers will be limited to a small group amenable for individual (re)sequencing. For instance, if two individuals carried the same variant, (re)sequencing of four individuals would be required to identify the two true carriers. Still, genetic heterogeneity, meaning that a specific disease may be caused by a myriad of distinct genetic defects, generally makes multiple carriers in that same matrix unlikely. This was also found in our study, e.g. for TP53 where four distinct variants all caused the same condition (Li-Fraumeni Syndrome) which is associated with a high risk of childhood cancers23 (Figure 2). Another possible limitation was that prior to batching, our method harmonized DNA concentration across all samples, which added a minor cost for each sample. At scale this may impact overall cost-effectiveness estimates and we did not test whether removing this step impacted the performance of DoBSeq. Lastly, structural variants, known to cause around 10% of RD24, were not investigated as part of the present study.
Lately, combinatorial pooling strategies, like DoBSeq, have received increased attention as a theoretical alternative to the extensive and costly population screening for the SARS-Cov-2 virus25. While the methods appear similar, a crucial difference is that, a disease may be caused by a myriad of distinct genetic variants, whereas SARS-Cov-2 tests, by and large, are positive in the same way.
Finally, we would be remiss not to mention that the implementation of mass genetic screening of healthy neonates, children, and/or adults, precipitates critical ethical considerations26. Ethics is not a focus of our current study, however, it is important to note that our method detects rare variants only, thus limiting the amount of personal data obtained per individual. Moreover, if our method is developed further, the low price per sample may extend accessibility in low- and middle-income countries, making it more equitable than individual NGS based methods.
In conclusion, the DoBSeq method reliably detects rare pathogenic germline variants in populations at single nucleotide resolution. The approach is highly scalable and may easily be incorporated into existing screening infrastructures based on the collection of blood spot samples. Consequently, DoBSeq may pave the way for cost-effective, at-scale, NGS-based population screening, and studies of large, unselected cohorts using DoBSeq are warranted.