MUTE-Seq: An Ultrasensitive Method for Detecting Low-Frequency Mutations in ctDNA with Engineered Advanced-Fidelity FnCas9

doi:10.21203/rs.3.rs-3313031/v1


Biological Sciences - Article



This work is licensed under a CC BY 4.0 License

Version 1



In this study, we present the development of the Mutation tagging by CRISPR-based Ultra-precise Targeted Elimination in Sequencing (MUTE-Seq) method. We engineered a highly precise advanced-fidelity FnCas9 variant, named FnCas9-AF2, to effectively discriminate single-base mismatches at all positions of the single guide RNA (sgRNA) target sequences. FnCas9-AF2 exhibited significantly lower off-target effects compared to existing high-fidelity CRISPR-Cas9 variants. We developed MUTE-Seq by applying FnCas9-AF2 for enrichment of mutant DNA without sequence limitations via the exclusive cleavage. The depletion of perfectly matched wild-type DNA targets offered a sensitive detection method for low-frequency cancer-associated mutant alleles. MUTE-Seq enabled sensitive monitoring of minimal residual disease (MRD) from the bone marrow of patients with Acute Myeloid Leukemia (AML). Furthermore, MUTE-Seq was applied in a multiplexed manner on cell-free DNA (cfDNA) from patients diagnosed with non-small cell lung cancer (NSCLC). Multiplexed MUTE-Seq (mMUTE-Seq) resulted in a significant improvement in the sensitivity of simultaneous mutant detection and was also effective for stage I NSCLC patients with extremely low levels of circulating tumor DNA (ctDNA). We anticipate that the FnCas9-AF2-based MUTE-Seq method could offer a valuable clinical tool to facilitate improved molecular diagnosis, prognosis evaluation, and treatment planning for cancers in various stages.

Biological sciences/Biotechnology/Molecular engineering/Protein design

Biological sciences/Biotechnology/Sequencing/Next-generation sequencing

Cell-free DNA (cfDNA) comprises highly fragmented DNA molecules that are released into the circulatory system from cells^1-3. Circulating tumor DNA (ctDNA) specifically refers to the cfDNA that originates from tumor cells^4,5. During the early stages of cancer, ctDNA in the blood can be exceedingly minute (as low as 0.01% of total cfDNA), posing a significant challenge for detection^6-9. Contemporary diagnostic techniques utilizing Sanger sequencing or Next-Generation Sequencing (NGS) struggle with sensitivity, often failing to detect minuscule ctDNA levels^9,10. Since low levels of ctDNA are common for many early-stage cancers and minimal residual disease (MRD), it has been difficult to obtain reliable detection results for such cases^9-14.

CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-based ctDNA enrichment methods have shown promising enhancement of detection sensitivity^15-19. These methods deplete the abundant major alleles before NGS, thereby enriching the minor alleles. Highly efficient CRISPR-based sequence depletion methods include CARM (Cas9 Assisted Removal of Mitochondrial DNA)¹⁵, DASH (Depletion of Abundant Sequences by Hybridization)¹⁶, MAD-DASH (miRNA and Adaptor Dimer-DASH)¹⁷, and CUT-PCR (CRISPR-mediated, Ultrasensitive detection of Target DNA using PCR)¹⁸. By depleting major allelic DNA, these methods enable accurate detection of minor allelic DNA with fewer NGS reads, which suggests considerable potential in molecular diagnostics.

However, while these methods were highly effective at selectively enriching sequences of interest in NGS libraries, the applicability of current CRISPR-based mutant detection methods was limited by specific sequence requirements^15-18. Previous studies demonstrated that depletion required either substantial sequence differences between the alleles or a mutation positioned within the protospacer adjacent motif (PAM) site. A primary reason is that the specificities of current CRISPR systems have been shown to be insufficient to effectively distinguish single- or double-base mismatches in the target DNA sequence^20-22. These inaccuracies in CRISPR cleavage constituted a significant barrier, both in vivo and in vitro, for medical and industrial applications that require exact base-pair discrimination^18,19,23,24. This sequence limitation has been a major hindrance to applying CRISPR-based enrichment to the diagnostic detection of rare mutant alleles in cfDNA. For Streptococcus pyogenes Cas9 (SpCas9), for example, accurate single-base discrimination is limited to mutations in which the 5’-NGG-3’ sequence of the PAM is changed to 5’-NHH-3’ (H: A, T, or C).

Several studies reported enhanced CRISPR systems with improved specificity, such as eSpCas9 and SpCas9-HF^20,22. However, while these engineered variants were considerably more accurate than wild-type SpCas9 (SpCas9-WT), significant cleavage still occurred sporadically at off-targets with single-base-pair mismatches. To overcome this precision issue, we sought to develop a CRISPR system capable of effectively distinguishing a single-base mismatch across the entire sgRNA target sequence. To this end, we developed a Francisella novicida Cas9 (FnCas9) with advanced fidelity (FnCas9-AF2) that could efficiently discriminate mutations at all positions of an sgRNA target sequence. In vitro cleavage and genome-wide analyses demonstrated that FnCas9-AF2 had undetectable off-target activity.

Next, we applied FnCas9-AF2 to develop a CRISPR-based sequence depletion approach for detecting low-frequency cancer-associated mutations, which we named MUTE-Seq (Mutation tagging by CRISPR-based Ultra-precise Targeted Elimination in Sequencing). We observed that MUTE-Seq markedly increased the detection sensitivity for low-frequency mutant alleles via wild-type allele depletion in biopsy samples from both acute myeloid leukemia (AML) and non-small cell lung cancer (NSCLC) patients. MUTE-Seq significantly increased the detection rates of low-frequency NRAS mutations in AML patients monitored for minimal residual disease (MRD). The increase in detected mutant allele frequencies was apparent both in Sanger sequencing chromatograms and in NGS. Subsequently, we conducted simultaneous depletion of wild-type alleles in cfDNA using multiple sgRNAs with FnCas9-AF2, revealing that the system could be used in a multiplexed manner. Multiplexed MUTE-Seq (mMUTE-Seq) significantly enhanced the concordance of detected EGFR mutations between tissue and blood samples from NSCLC patients. Notably, mMUTE-Seq enabled effective detection of mutations that were present at very low frequencies in stage I NSCLC patients. The findings suggest that the mMUTE-Seq method has considerable potential for developing diagnostic panels aimed at detecting multiple low-frequency ctDNA mutations.

Comparison of Single-Base Mismatch Discrimination between SpCas9, SpCas9 Variants, and FnCas9

We sought to examine the mismatch tolerance of SpCas9, high-fidelity SpCas9 variants, and FnCas9. As a part of the study, 20 single guide RNAs (sgRNAs) were prepared with single-base mismatches, each at a different position of the KRAS target sequence. These were then tested in in vitro cleavage assays using SpCas9-WT, SpCas9-HF1, SpCas9-HF4, eSpCas9(1.0), eSpCas9(1.1), and FnCas9-WT^20,22.

SpCas9-WT induced significant DNA cleavage not only with perfectly matched sgRNA (sgRNA T), but also with sgRNAs that contain single-base mismatches in all 20 positions. SpCas9-HF1, SpCas9-HF4, eSpCas9(1.0), and eSpCas9(1.1), the engineered variants of SpCas9 designed for higher precision, also exhibited a noticeable cleavage for most mismatched sgRNAs. However, FnCas9 presented a tendency for lower rates of in vitro cleavage with mismatched sgRNAs. Together, the in vitro cleavage assays suggested that FnCas9 could potentially discern single-base mismatches more efficiently than SpCas9 and its high-fidelity variants (Fig. 1a and Extended Data Fig. 1).

Engineering FnCas9 for Enhanced Sensitivity to DNA Mismatches

Following the initial studies, the aim was to engineer the FnCas9 protein to produce optimized FnCas9 variants with enhanced sensitivity to mismatches. The hypothesis was that by weakening the interactions between positively charged amino acids and the nucleotides implicated in the conformational shift during Cas9-sgRNA-DNA binding, it might be possible to destabilize R-loop formation in the context of mismatched DNA. This instability could in turn curtail nuclease activity directed towards mismatched DNA sequences^20,22,25,26. To examine this, 49 recombinant FnCas9 proteins were prepared, each containing a single amino acid substitution to alanine at a position expected to interact with the phosphate backbone of the target DNA (Extended Data Fig. 2a). Subsequently, in vitro cleavage assays were carried out to test the ability of these 49 FnCas9 single-substitution variants to cleave the target sequence with sgRNAs, each bearing a single-base mismatch at a different position. The assays revealed that certain amino acid substitutions in FnCas9 led to a significant reduction of cleavage with the single-base mismatched sgRNAs, while preserving on-target cleavage activity (Extended Data Fig. 2b).

To identify the most precise FnCas9 variant, specificity scores were calculated based on each variant's ability to discriminate single-base mismatches (Extended Data Fig. 2c). Eleven variants demonstrated specificity scores above 60%: six with modified residues in the REC lobe (R455, R785, K721, K789, R919, and R1241) that interacted with the phosphate backbone of the target DNA strand, and five with altered residues in the NUC lobe (R939, K941, K1189, R1226, and K1228) that interacted with the phosphate backbone of the non-target DNA strand (Fig. 1b). Of these 11 variants, the K1189A and R1241A mutants presented the highest specificity scores, so further optimization of FnCas9 was based on these mutations (Fig. 1c and Extended Data Fig. 2c).

Multiple Amino Acid Substitutions Enhance Specificity of FnCas9 Across Diverse Target Sites and Mismatch Conditions

We next asked whether the specificities of the FnCas9 variants could be further enhanced. We observed that the single-substitution variants showed residual off-target cleavage at the NRAS target site. To improve the specificity, we conducted combinatorial alanine substitutions of FnCas9 at the identified amino acid positions and evaluated the variants for specificity at both the KRAS and NRAS target sites. As a result, two FnCas9 variants (FnCas9-K1189A/R1241A [termed FnCas9-AF1] and FnCas9-R785A/K1189A/R1241A [termed FnCas9-AF2]) demonstrated undetectable off-target cleavage while preserving on-target activity at both target sites (Fig. 2a, b).

The specificity of these variants was further assessed using 60 sgRNAs containing all possible single-base mismatches at all 20 positions within the NRAS target sequence. FnCas9-AF1 exhibited superior specificity to the target sequence in comparison to the K1189A single substitution. FnCas9-AF2 showed significantly reduced cleavage rates for all mismatched base positions and base mismatch types (Fig. 2b and Extended Data Fig. 3).
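The 60-sgRNA panel follows from simple enumeration: 20 positions times 3 alternative bases. The sketch below illustrates that enumeration only. The NRAS spacer is not given in this text, so the example input is the 20-nt target quoted in the Digenome-seq methods, and the position numbering (1 = 5′ end) and the function name are assumptions made for illustration.

```python
from typing import List, Tuple

BASES = "ACGT"

def single_mismatch_spacers(target: str) -> List[Tuple[int, str, str]]:
    """Return (position, substituted_base, spacer) for every single-base change.

    Positions are 1-based from the 5' end (an assumption of this sketch).
    """
    target = target.upper()
    variants = []
    for i, ref in enumerate(target):
        for alt in BASES:
            if alt != ref:
                # Replace exactly one base and keep the rest of the spacer intact
                variants.append((i + 1, alt, target[:i] + alt + target[i + 1:]))
    return variants

# Example input only: the 20-nt target sequence quoted in the Digenome-seq methods
EXAMPLE_TARGET = "TTGGACATACTGGATACAGC"
variants = single_mismatch_spacers(EXAMPLE_TARGET)
print(len(variants), "mismatched spacers")
```

Each returned spacer differs from the input at exactly one position, so the list can be used directly as a design sheet for a mismatch-tolerance panel.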

Next, we asked whether the ability of FnCas9-AF2 to distinguish base mismatches can be generalized. To this end, we conducted in vitro cleavage assays using FnCas9-AF2 and sgRNAs with single-base mismatches at all 20 positions within KRAS and EGFR target sequences (Extended Data Fig. 4). The specificities of FnCas9-AF2 with the KRAS and EGFR sgRNAs were significantly higher than those of FnCas9-WT, suggesting that its sensitivity to single-base mismatches is applicable to other target sequences.

Comparative Specificity Assessment between FnCas9-AF2 and SpCas9 Variants

We evaluated the precision of FnCas9-AF2 on targets comprising single nucleotide variants (SNVs) or an indel: EGFR c.2573 T>G, EGFR c.2369 C>T, EGFR c.2389T>A, KRAS c.35G>A, MET c.3028+1 G>A, and EGFR c.2236_2250 del. For each site, we designed sgRNAs targeting the wild-type sequence and carried out in vitro cleavage on both the wild-type and mutant DNAs. The wild-type DNA was completely cleaved by FnCas9-AF2 and was observed as two DNA fragments on the gel. In contrast, FnCas9-AF2 did not produce detectable cleavage of the mutant DNA for either the indel or the single nucleotide variants (Fig. 3a). We then performed a quantitative analysis of in vitro cleavage efficiency using FnCas9-AF2 and SpCas9 variants on both wild-type and mutant targets, including the 5 SNVs and 1 indel. Compared with the SpCas9 variants, FnCas9-AF2 showed an extremely low cleavage rate on mutant DNA (0.48% on average) while preserving highly efficient cleavage of wild-type DNA (97.11% on average) (Fig. 3b and Extended Data Fig. 5).

Genome-wide Analysis of Off-target Effects in FnCas9 and SpCas9 Variants

We then sought to determine whether the higher mismatch sensitivity of FnCas9-AF2 is associated with reduced off-target effects across the whole genome. In Digenome-seq analyses²⁷, SpCas9-WT exhibited 654 potential off-target sites. In contrast, FnCas9-WT had only 77 potential off-target cleavage sites, indicating that FnCas9 is intrinsically more specific. The high-fidelity SpCas9 variants showed fewer off-target cleavage sites than SpCas9-WT, with eSpCas9(1.1) registering 37 sites and SpCas9-HF4 having 13. Remarkably, FnCas9-AF1 presented only one potential off-target cleavage site, while FnCas9-AF2 showed even higher precision, with no detectable off-target sites (Fig. 3d, e). Therefore, FnCas9-AF2 was selected for further analysis and application.

Precision Enrichment of Mutant Alleles in AML Patients Using MUTE-Seq: An FnCas9-AF2-Based Wild-type Allele Depletion Technique

Utilizing FnCas9-AF2, we introduce MUTE-Seq, a novel technique that overcomes the sequence limitations of minor-allele enrichment through highly precise elimination of the major-allele counterparts. To evaluate its capacity, we conducted MUTE-Seq on human genomic DNA (gDNA) samples obtained from the bone marrow of eight AML patients who were monitored for MRD. Our primary aim was to identify three specific NRAS mutations: G12D, G12C, and G13D. To accomplish this, we designed an sgRNA to target the NRAS locus, encompassing G12 and G13. The samples were divided into two groups: the MUTE-Seq group, which underwent in vitro cleavage with FnCas9-AF2 to eliminate wild-type DNA, and the control group, which remained untreated. Subsequently, we analyzed the variant allele frequencies (VAFs) in both the MUTE-Seq and control groups using Sanger sequencing (Fig. 4a).

In the control group, the VAFs of NRAS mutations in DNA samples from patients 1 through 6 ranged from 3% to 16.9%, whereas samples from patients 7 and 8 exhibited no detectable NRAS mutations (Fig. 4b). Chromatograms of the control group with detectable mutant alleles displayed relatively lower peaks for mutant bases than for wild-type bases. In contrast, the MUTE-Seq group exhibited distinct double peaks at the positions of mutant bases in the chromatograms (Fig. 4c). Moreover, VAFs were significantly higher in the MUTE-Seq group (Fig. 4d). Notably, samples with lower VAFs in the control group displayed a more pronounced increase in VAFs following MUTE-Seq enrichment. For instance, the P1 sample with a low initial VAF showed an 8.1-fold increase post-enrichment, whereas P2, with a relatively higher initial VAF, exhibited a 3.3-fold enrichment.

Furthermore, a substantial level of agreement was observed between the VAFs obtained from the control group and MUTE-Seq group (Fig. 4e). Additionally, we employed NGS to measure the VAF of both control and MUTE-Seq groups, finding relatively similar detected VAFs (R²=0.958) between Sanger and NGS in both groups (Fig. 4f). Notably, for samples 7 and 8, both the control and MUTE-Seq groups exhibited VAFs below the detectable limit in both Sanger sequencing and NGS.

Next, we aimed to validate the detection limit of MUTE-Seq below 2.5% VAF, which is hard to discern with Sanger sequencing. For this test, we prepared blended gDNA samples by mixing NRAS wild-type and G13D-mutant gDNA at VAFs from 0.25% to 2.5%. Quantification of NRAS G13D allele frequencies showed that the MUTE-Seq group consistently exhibited elevated VAFs compared to controls: rising to 11.1% from 0.25%, 16.4% from 0.5%, 31.8% from 1.25%, and 44.6% from 2.5% (Fig. 4g, 4h). The increased mutant ratios after MUTE-Seq were high enough for definitive detection by Sanger sequencing. Importantly, the correlation between pre- and post-enrichment VAFs was well maintained, with an R²=0.977 (Fig. 4h). We also observed larger fold increases for samples with lower initial VAFs, similar to the previous analyses on samples from AML patients. The results suggested that MUTE-Seq could surpass the detection limit of Sanger sequencing for identifying low-prevalence mutations in cancer samples.
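The fold enrichment behind these numbers can be checked with a few lines of arithmetic. The sketch below uses only the four pre- and post-enrichment VAFs quoted above. The second function is not from the study: it assumes a simplified model in which a fraction d of wild-type molecules is removed and mutant molecules are untouched, so that post = m / (m + (1 - m)(1 - d)), and it ignores PCR and sequencing bias.

```python
# (initial VAF %, VAF % after MUTE-Seq) for the blended NRAS G13D samples, as quoted in the text
REPORTED = [(0.25, 11.1), (0.5, 16.4), (1.25, 31.8), (2.5, 44.6)]

def fold_enrichment(pre_pct: float, post_pct: float) -> float:
    """Ratio of post-enrichment to pre-enrichment VAF."""
    return post_pct / pre_pct

def implied_wt_depletion(pre_pct: float, post_pct: float) -> float:
    """Fraction of wild-type molecules removed under the simplified model
    post = m / (m + (1 - m) * (1 - d)). This model is an assumption of the sketch."""
    m, p = pre_pct / 100.0, post_pct / 100.0
    return 1.0 - (m * (1.0 - p)) / (p * (1.0 - m))

for pre, post in REPORTED:
    print(f"{pre:>5}% -> {post:>5}%  "
          f"fold = {fold_enrichment(pre, post):5.1f}  "
          f"implied WT depletion = {implied_wt_depletion(pre, post):.3f}")
```

Under this model the fold enrichment falls as the starting VAF rises even when the depletion fraction stays roughly constant, which is consistent with the trend described for both the blended and the patient samples.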

Multiplexed MUTE-Seq (mMUTE-Seq) Approach for Simultaneous Enrichment of Low-Frequency Mutations in cfDNA from Non-Small Cell Lung Cancer (NSCLC) Patients

We asked whether MUTE-Seq could simultaneously enrich multiple mutant alleles in cfDNA in a multiplexed manner. We conducted quantitative analyses of multiplexed MUTE-Seq (mMUTE-Seq) by applying 10 sgRNAs simultaneously to reference materials containing NSCLC-associated mutations at VAFs of 1%, 0.1%, and 0% (Fig. 5a). We found that the VAFs of mutations present in the reference materials (EGFR exon 19 deletion, EGFR L858R, EGFR T790M, and KRAS G12D) were significantly increased by mMUTE-Seq (Fig. 5a). Notably, the 0% samples showed undetectable VAFs in both the unenriched control (Deep-Seq) and the mMUTE-Seq groups, suggesting that the VAF increase produced by mMUTE-Seq was specific. We further analyzed the average enrichment efficiencies of all detected VAFs and observed 34.2- and 66.2-fold increases in the 1% and 0.1% initial-VAF samples, respectively (Fig. 5b). In contrast, multiplexed mutant enrichment using SpCas9-WT, eSpCas9(1.1), SpCas9-HF4, and FnCas9-WT showed only a slight increase in detected VAFs, and the mutations in the 0.1% initial-VAF sample remained undetectable (Extended Data Fig. 6).

To evaluate the sensitivity and specificity of mMUTE-Seq, we conducted statistical analyses on variants detected in reference materials at the 1% and 0.1% VAFs. We first determined the cut-offs for VAFs, calculated as three times the interquartile range above the third quartile (Q3), to minimize the influence of outliers. The cut-offs for Deep-Seq and mMUTE-Seq were determined to be 0.21% and 0.55%, respectively. Subsequent analyses showed a notable difference in sensitivity between the two methods (Fig. 5c). Specifically, mMUTE-Seq achieved a sensitivity of 1, significantly outperforming Deep-Seq, which had a sensitivity of 0.65. However, both methods demonstrated comparable specificity, each scoring 0.95.
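The cut-off rule (Q3 plus three times the interquartile range) and the resulting sensitivity and specificity can be sketched as follows. This is an illustration only: the text does not state which set of VAFs the quartiles were computed from or which quartile convention was used, so the background values, the observed calls, and the "inclusive" quartile method below are all hypothetical choices.

```python
from statistics import quantiles
from typing import Sequence, Tuple

def outlier_cutoff(values: Sequence[float], k: float = 3.0) -> float:
    """Cut-off = Q3 + k * IQR (k = 3 as described in the text).
    The 'inclusive' quartile convention is an assumption of this sketch."""
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    return q3 + k * (q3 - q1)

def sensitivity_specificity(calls: Sequence[bool], truth: Sequence[bool]) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = sum(c and t for c, t in zip(calls, truth))
    fn = sum((not c) and t for c, t in zip(calls, truth))
    tn = sum((not c) and (not t) for c, t in zip(calls, truth))
    fp = sum(c and (not t) for c, t in zip(calls, truth))
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical background VAFs (%) used only to demonstrate the rule
background = [0.01, 0.02, 0.03, 0.04, 0.05]
cutoff = outlier_cutoff(background)

# Hypothetical observed VAFs (%) and whether each variant is truly present
observed = [0.50, 0.08, 0.30, 0.02]
truth = [True, True, True, False]
calls = [v > cutoff for v in observed]
sens, spec = sensitivity_specificity(calls, truth)
print(f"cut-off = {cutoff:.2f}%, sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

A wide multiplier such as k = 3 keeps the threshold well above routine background noise, at the cost of missing true variants whose VAF sits just above the background distribution.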

Next, we asked if mMUTE-Seq could be applied to identify low-frequency ctDNAs in cfDNAs from the plasma of cancer patients. To test the clinical utility of mMUTE-Seq, we compared the VAFs from mMUTE-Seq and Deep-Seq of 10 NSCLC patients who were diagnosed positive for EGFR mutation. Importantly, when using the mMUTE-Seq, we observed an average 11.81-fold increase in the detected VAFs of mutations residing within the regions targeted by the sgRNAs (Fig 5d).

To evaluate the performance of mMUTE-Seq in liquid biopsy-based genotyping of cancer, we compared mutation profiles derived from both tissue and cfDNA in NSCLC patients. We found notable correlation between the samples in key mutations commonly observed in NSCLC, including EGFR exon 19 deletion, EGFR L858R, and EGFR T790M (Fig. 5e). mMUTE-Seq consistently identified tissue-specific mutations within cfDNA across all 10 patients, highlighting its utility for liquid biopsy-based detection of cancer mutations. Only two cases exhibited discordant mutational profiles: one case had a mutation uniquely identified in the tissue, while the other had a mutation exclusively detected in the cfDNA. In contrast, the Deep-Seq method exhibited lower concordance between tissue and cfDNA (Extended Data Fig. 7). Concordant detection was observed in only 6 out of the 10 pairs. Consequently, mMUTE-Seq achieved a significantly higher sensitivity of 0.91, compared to the 0.55 sensitivity of Deep-Seq. However, both methods displayed comparable specificity, each registering at 0.95 (Fig. 5f and Extended Data Fig. 8).

Previous studies have shown that detection of ctDNA tends to be less sensitive in early stages of cancer compared to later stages^4,5,28,29. We asked if mMUTE-Seq could facilitate sensitive detection of stage I cancer in cfDNA. To this end, we generated receiver operating characteristic (ROC) curves of the above patient data for stages I to IV together, and for stage I separately (Fig. 5g). The area under the ROC curve (AUC) for all stages was 0.96 for the mMUTE-Seq group, which was significantly higher than the AUC of 0.72 in the Deep-Seq group. Notably, for stage I only, the AUCs of the mMUTE-Seq and the Deep-Seq groups were 1.0 and 0.70, respectively. The results suggested that mMUTE-Seq offers a sensitive method for detecting mutations in the early stages of cancer.
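For readers less familiar with the metric, the AUC equals the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one, with ties counted as one half. The sketch below computes it that way on hypothetical scores; the data and the function name are illustrative and are not taken from the study, which does not describe its ROC software here.

```python
from typing import Sequence

def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC via pairwise comparison of positive and negative scores
    (equivalent to the Mann-Whitney U statistic divided by n_pos * n_neg)."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    # Count a win when the positive outranks the negative; a tie counts as half
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical detected VAFs (%) for mutation-positive (1) and mutation-negative (0) cases
scores = [0.9, 0.8, 0.35, 0.4, 0.1, 0.05]
labels = [1, 1, 1, 0, 0, 0]
print(f"AUC = {roc_auc(scores, labels):.3f}")
```

An AUC of 1.0, as reported for the stage I subset, corresponds to every positive case scoring above every negative case.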

As CRISPR systems could be widely applied in precision medicine, the need for more accurate CRISPR systems has been high^11,12,18,21,30. Accordingly, previous studies reported the development of high-fidelity CRISPR-Cas variants^20,22. Nonetheless, even with these engineered Cas variants, it has been difficult to effectively discriminate off-targets with a single-base mismatch positioned within the target sequence of the sgRNA. In this study, we addressed this issue and used rational design to engineer a highly accurate FnCas9-AF2 variant that effectively discriminates single-base mutations at all 20 positions of the DNA that are complementary to the sgRNA. We found that the precision of the engineered FnCas9-AF2 exceeds that of eSpCas9 and SpCas9-HF variants. In vitro cleavage assays and Digenome-seq analyses showed that FnCas9-AF2 induces DNA cleavage exclusively at targets with perfect base matches, ensuring single-base precision and effective discrimination against off-targets with a single-base mismatch.

Since FnCas9-AF2 efficiently distinguished base mismatches in the target sequence of the sgRNA, we anticipated that it could be leveraged with flexible target selection for detecting low-frequency mutations. As we anticipated, the FnCas9-AF2-based MUTE-Seq facilitated sensitive MRD monitoring of AML patients. The results suggested that by integrating MUTE-Seq methods with Sanger sequencing, we could surpass the detection limit of Sanger sequencing for identifying low-prevalence mutations in cancer samples. Moreover, MUTE-Seq could be employed in a multiplexed manner, suggesting that this method could pave the way for the development of a cancer detection panel for the simultaneous detection of multiple cancer-associated mutations in a single liquid biopsy analysis. We found that mMUTE-Seq could enable multiplexed detection of low-frequency mutations in cfDNA from blood of NSCLC patients. Previous studies on mutant detection methods showed that it is more difficult to detect cancer related mutations in blood samples of patients at early stage^28,31,32. However, we found that mMUTE-Seq facilitated sensitive detection of EGFR mutations in cfDNA of stage I NSCLC patients as well as later stages.

Over time, extensive efforts have been made to enhance the sensitivity of ctDNA detection^33-36. Strategies employing NGS aimed to overcome the detection barrier by ultra-deep sequencing. However, this approach not only increased costs but also appeared inefficient when the VAFs of ctDNA mutations dropped below the intrinsic error rates of NGS (0.1–1%)^11,12. To overcome this sensitivity issue, enhanced technologies such as CAPP-Seq³⁷, IDES³⁸, Safe-seq³⁹, and PhasED-seq⁴⁰ were developed. Although these techniques vary in their details, they commonly employ unique molecular identifier (UMI) barcoding strategies that detect scant amounts of ctDNA through barcoded ultra-deep sequencing to overcome the limitations of NGS errors^9,37-41. In contrast to these methods, MUTE-Seq requires neither ultra-deep sequencing nor UMIs, as it uses ultra-precise CRISPR cleavage in the initial steps to reduce the wild-type alleles in the samples. By suppressing this wild-type background, MUTE-Seq enables sensitive detection of mutant alleles at a moderate sequencing depth, because the enriched VAFs rise above the NGS error rate. Furthermore, MUTE-Seq could potentially be integrated with UMI-based techniques for additional gains in sensitivity.

Together, the results suggested that the accurate FnCas9-AF2-based MUTE-Seq provides a multiplexable method with unprecedented sensitivity for detecting low-frequency cancer-associated mutations.

1. Protein engineering (structural analysis) and cloning

The protein structure of FnCas9 (PDB ID 5B2O) was analyzed using PyMOL (Schrödinger, New York, NY, USA), and the FnCas9 component residues within hydrogen bonding distance of DNA were marked with spheres. These residues were changed to alanine using the QuikChange II Site-Directed Mutagenesis Kit (Agilent, Santa Clara, CA, USA). Briefly, FnCas9-WT (cloned in a pET28-a vector) was used as a template to amplify FnCas9 variants using primers containing alanine point mutations. The FnCas9 variants were cloned according to the manufacturer’s instructions, with a His×6 tag at the N-terminus of each recombinant FnCas9.

2. Protein purification

The pET vectors containing the FnCas9 variants under the T7 promoter were transformed into BL21-DE competent cells (Novagen, San Diego, CA, USA) according to the manufacturer’s instructions. The transformed cells were cultured in Luria–Bertani medium (Duchefa, Haarlem, Netherlands) at 37°C. When the OD₆₀₀ (optical density at 600 nm) of the medium reached 0.5–0.7, the cells were treated with IPTG (Beams Biotechnology, Seongnam, Korea). Cells were harvested after overnight incubation at 18°C and lysed in LYSIS buffer (50 mM NaH₂PO₄, 300 mM NaCl, 10 mM imidazole, 1 mg/mL lysozyme, 1 mM PMSF, 1 mM DTT, pH 8) using an ultrasonicator. The lysate was centrifuged at 15,000 rpm to remove cell debris. The clear supernatant containing the FnCas9 variant protein was treated with Ni-NTA beads (Qiagen, Hilden, Germany), which were then washed with WASH buffer (50 mM NaH₂PO₄, 300 mM NaCl, 20 mM imidazole, pH 8). Proteins were eluted with ELUTION buffer (50 mM NaH₂PO₄, 300 mM NaCl, 250 mM imidazole, pH 8), which was then exchanged with STORAGE buffer (50 mM HEPES, 200 mM NaCl, 20% glycerol, 1 mM DTT, pH 7.5) using an Amicon centrifuge filter (100 kDa; Merck, Kenilworth, NJ, USA).

3. In vitro transcription of sgRNA

Using an in vitro transcription method as previously described⁴², sgRNAs with single-base mutations were designed and synthesized for SpCas9 and FnCas9. Briefly, sgRNAs were transcribed by T7 RNA polymerase in a reaction mixture consisting of 40 mM Tris-HCl (pH 7.9), 6 mM MgCl₂, 10 mM DTT, 10 mM NaCl, 2 mM spermidine, NTPs, and an RNase inhibitor. The reaction mixture was incubated at 37°C for 8 h, and the sgRNAs were purified using PCR purification kits (GeneAll, Seoul, Korea) and quantified using a NanoDrop spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA).

4. In vitro DNA cleavage assay

A 3-kb template including EGFR, KRAS, NRAS, and MET gene sequences (Extended Data Table 1) was cleaved with Cas9 proteins and sgRNAs (Extended Data Table 3). EGFR, KRAS, NRAS, and MET target sites were synthesized using IDT oligosynthesis platforms (Integrated DNA Technologies, Coralville, IA, USA) and cloned into a p3 vector, and the 3-kb target DNA sequence was amplified from the vector by PCR, using two pairs of primers and Q5 DNA polymerase (New England Biolabs, Ipswich, MA, USA). The reactions were cleaned up using a PCR clean-up kit (GeneAll, Seoul, Korea). The target DNA (100 ng) was incubated with 250 ng guide RNA and 500 ng Cas9 variant in CutSmart buffer (New England Biolabs) (100 mM potassium acetate, 20 mM Tris-acetate, 10 mM magnesium acetate, 100 µg/mL BSA, pH 7.9) for 1 h at 37°C. The nuclease-cleaved DNA fragments were run on a 1.5% agarose gel with TBE and stained with Midori Green (NIPPON GENETICS EUROPE, Duren, Germany).

5. Digenome-seq

Digenome-seq was carried out as described previously.²⁷ Briefly, 8 μg genomic DNA (gDNA) was extracted from HEK293T using a DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany), then digested with 40 μg Cas9 and 10 μg sgRNA (target sequence: 5′-TTGGACATACTGGATACAGC-3′) in 400 μL of 1× CutSmart buffer (New England Biolabs) at 37°C for 16 h. Digested gDNA was isolated using a DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) and then fragmented to a size of 500–600 bp using an M220 ultrasonicator (Covaris, Woburn, MA, USA). The NGS library for whole-genome sequencing was prepared with a TruSeq Nano kit (Illumina, San Diego, CA, USA) and sequenced by NovaSeq (Illumina). The double strand break (DSB) score was measured and analyzed using the Digenome Sequencing analysis tool at CRISPR RGEN Tools (http://www.rgenome.net/), with a 2-bp overhang in the case of FnCas9. The loci with DSB scores over 1 were sorted, and their places in the human genome (hg38) were plotted in a Manhattan plot.
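The final filtering step (keep loci with a DSB score above 1, then order them along the genome for plotting) can be sketched as below. The `Site` record, the example loci, and the choice to sort by genomic position are hypothetical; the actual scores come from the Digenome Sequencing analysis tool, whose output format is not described here.

```python
from typing import List, NamedTuple

class Site(NamedTuple):
    chrom: str        # e.g. "chr1" (hg38 naming assumed)
    pos: int          # 1-based coordinate
    dsb_score: float  # score reported by the analysis tool

def chrom_rank(chrom: str) -> int:
    """Order chromosomes numerically, then X, Y, M; unknown names go last."""
    name = chrom[3:] if chrom.startswith("chr") else chrom
    if name.isdigit():
        return int(name)
    return {"X": 23, "Y": 24, "M": 25, "MT": 25}.get(name, 99)

def candidate_sites(sites: List[Site], threshold: float = 1.0) -> List[Site]:
    """Keep loci with DSB score strictly above the threshold, in genome order."""
    kept = [s for s in sites if s.dsb_score > threshold]
    return sorted(kept, key=lambda s: (chrom_rank(s.chrom), s.pos))

# Hypothetical loci for illustration only
example = [
    Site("chr2", 500, 0.4),
    Site("chr10", 100, 2.5),
    Site("chr2", 900, 7.1),
    Site("chrX", 50, 1.0),
    Site("chr1", 300, 12.0),
]
for site in candidate_sites(example):
    print(site.chrom, site.pos, site.dsb_score)
```

Sorting by chromosome rank and position gives the x-axis order a Manhattan plot expects, with the DSB score on the y-axis.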

6. Bone marrow sampling and gDNA extraction from AML patients

Patients with AML were included in the study, which received approval from the institutional review board of St Mary’s Seoul Hospital (IRB No. KC23SISI0242). Bone marrow samples were collected from AML patients who were suspected of having MRD. The samples were collected in BD Vacutainer EDTA tubes (Becton Dickinson, Franklin Lakes, NJ, USA), and gDNA was extracted using DNeasy Blood & Tissue Kits (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. The concentration and purity of the gDNA were analyzed using the software associated with the Agilent TapeStation System.

7. MUTE-Seq using FnCas9-AF2

gDNA blends were prepared by mixing gDNA from HEK-293T cells (wild-type DNA) with patient DNA (mutant DNA) to VAFs of 0.25, 0.5, 1.25, and 2.5%. The gDNA blend (100 ng) or the clinical samples were digested with 500 ng (2.5 pmol/10 μL) FnCas9-AF2 and 200 ng (8.4 pmol/10 μL) sgRNA (Extended Data Table 3) in 10 μL of 1× Remov RXN buffer (GeneCker, Seoul, Korea) at 45°C for 1 h. The reaction was terminated by adding 10× STOP buffer (GeneCker, Seoul, Korea). For NRAS mutation enrichment for Sanger sequencing, the 50-μL enrichment PCR reaction contained 2 μL digested product, 5 μL primer mix (Extended Data Table 2), 25 μL 2× Master Mix (Sungenetics, Daejeon, Korea), and 18 μL nuclease-free water. The reaction was performed under the following conditions: 98°C for 3 minutes, followed by 42 cycles of 98°C for 10 seconds, 55°C for 40 seconds, and 72°C for 30 seconds. The tubes were incubated at 72°C for another 5 minutes before storing at 4°C. The PCR product was purified with a PCR clean-up kit (GeneAll, Seoul, Korea) and then eluted into DEPC-water. The purified PCR products were sent to Macrogen (Seoul, Korea) for Sanger sequencing (Extended Data Table 2). For validation of the NRAS mutation-enrichment coupled Sanger sequencing, the digested products were amplified using Q5 DNA polymerase (New England Biolabs, Ipswich, MA, USA) with an index primer. Index PCR amplicons were purified with AMPure XP beads (Beckman Coulter, Brea, CA, USA) and sequenced using the iSeq 100 Sequencing System (Illumina).
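The mass and molar amounts quoted for the digestion can be related with a short calculation, which is handy when scaling the reaction. The sketch below uses only the four quantities stated above; the molecular weights it prints are those implied by the stated amounts, not independently reported values, and the function names are illustrative.

```python
def implied_mw_kda(mass_ng: float, amount_pmol: float) -> float:
    """Molecular weight implied by a mass/amount pair.
    1 ng per pmol = 1e-9 g / 1e-12 mol = 1000 g/mol = 1 kDa."""
    return mass_ng / amount_pmol

def pmol_from_ng(mass_ng: float, mw_kda: float) -> float:
    """Convert a mass in ng to pmol for a molecule of the given weight in kDa."""
    return mass_ng / mw_kda

# Amounts stated for the MUTE-Seq digestion
CAS9_NG, CAS9_PMOL = 500.0, 2.5
SGRNA_NG, SGRNA_PMOL = 200.0, 8.4

cas9_mw = implied_mw_kda(CAS9_NG, CAS9_PMOL)
sgrna_mw = implied_mw_kda(SGRNA_NG, SGRNA_PMOL)
ratio = SGRNA_PMOL / CAS9_PMOL

print(f"implied FnCas9-AF2 MW ~ {cas9_mw:.0f} kDa")
print(f"implied sgRNA MW ~ {sgrna_mw:.1f} kDa")
print(f"sgRNA:Cas9 molar ratio ~ {ratio:.2f}:1")
```

With these stated amounts the sgRNA is in roughly threefold molar excess over the nuclease.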

8. Blood sampling and cfDNA extraction from NSCLC patients’ plasma

Patients with lung cancer were included in the study, which was approved by the institutional review boards of Korea University Anam Hospital (IRB No. 2020AN0005) and Boramae Medical Center (IRB No. 20-2017-17). Blood samples were collected following the clinical diagnosis of lung cancer, and only cases classified as non-small cell lung carcinoma (NSCLC) after tissue confirmation were included. A total of 10 mL of blood was collected in a Streck Cell-Free DNA blood collection tube (cfDNA BCT; Streck, La Vista, NE, USA). The collected blood was transferred to a Falcon tube and centrifuged at 1,900 × g. The supernatant (plasma) was then transferred to Eppendorf tubes and centrifuged again at 16,000 × g. The cfDNA was isolated from 1 mL of plasma using the Maxwell RSC cfDNA Plasma Kit (Promega, Madison, WI, USA), following the manufacturer's instructions. The cfDNA was eluted in 60 μL of elution buffer from the Maxwell RSC cfDNA Plasma Kit and was then analyzed with the Cell-free DNA ScreenTape assay (Agilent 4150 TapeStation system). The concentration and purity of the cfDNA were determined using the software associated with the Agilent TapeStation System.

9. Multiplexed MUTE-Seq (mMUTE-Seq) for wild-type DNA depletion and ctDNA enrichment

The Multiplex I cfDNA Reference Standard Set (Horizon; 3 ng) or cfDNA extracted from patient samples (5–10 ng) was used as the amplification template. Five genes containing hotspots of interest were amplified using Q5 DNA polymerase (New England Biolabs) and a multiplexed mix of 10 primers (Extended Data Table 2). To remove the WT allele, a 1-μL fraction of the multiplexed PCR product (diluted 10-fold with DEPC-treated water) was treated with 4 μg (20 pmol/10 μL) FnCas9-AF2 (or 8 μg of a Cas variant) and 2 μg (66 pmol/10 μL) multiplexed sgRNA mix (Extended Data Table 3) in 10 μL of 1× Remov RXN buffer (GeneCker, Seoul, Korea) at 45°C for 1 h; the reaction was terminated by adding 10× STOP buffer (GeneCker, Seoul, Korea). The WT-depleted products were purified with AMPure XP beads (Beckman Coulter, Brea, CA, USA) and then amplified using Q5 DNA polymerase (New England Biolabs) with an index primer. Index PCR amplicons were purified with AMPure XP beads (Beckman Coulter, Brea, CA, USA) and sequenced using an Illumina iSeq sequencer.
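A short Python sketch shows why depleting the WT allele raises the apparent mutant fraction before index PCR. It is a simplified model under stated assumptions, not a description of measured performance: the function `vaf_after_depletion` and the cleavage efficiencies are hypothetical, and the model ignores PCR bias and sequencing error.

```python
def vaf_after_depletion(initial_vaf, wt_cleavage, mutant_cleavage=0.0):
    """Estimate the mutant fraction among molecules that survive digestion.

    initial_vaf     -- mutant fraction before digestion (0..1)
    wt_cleavage     -- fraction of wild-type molecules cleaved (0..1, hypothetical)
    mutant_cleavage -- fraction of mutant molecules cleaved off-target (0..1, hypothetical)
    """
    for name, value in (("initial_vaf", initial_vaf),
                        ("wt_cleavage", wt_cleavage),
                        ("mutant_cleavage", mutant_cleavage)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1")
    mutant = initial_vaf * (1.0 - mutant_cleavage)        # surviving mutant molecules
    wild_type = (1.0 - initial_vaf) * (1.0 - wt_cleavage)  # surviving wild-type molecules
    remaining = mutant + wild_type
    if remaining == 0.0:
        raise ValueError("no DNA left after digestion")
    return mutant / remaining


# Illustrative inputs only: 0.1% starting VAF, hypothetical WT cleavage efficiencies.
for efficiency in (0.0, 0.9, 0.99, 0.999):
    enriched = vaf_after_depletion(0.001, efficiency)
    print(f"WT cleavage {efficiency:.1%}: apparent VAF {enriched:.2%}")
```

Under this model the enrichment is limited by two quantities: how completely the perfectly matched WT target is cleaved, and how rarely the single-mismatch mutant target is cleaved, which is why mismatch discrimination by the nuclease matters.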

10. Next generation sequencing data analysis

Adapter sequences were removed from the raw sequencing data using BBDuk version 38.96. The trimmed reads were aligned against the human genome reference (GRCh38) with BWA-MEM version 0.7.17. The aligned reads were partitioned by amplicon using a custom script prior to variant calling. Somatic variants and short indels were detected with Mutect2 version 4.2.6.1 and VarDict version 1.8.2. Variants were annotated using Variant Effect Predictor version 108.
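The custom script used for amplicon partitioning is not described, so the Python sketch below shows only one plausible approach: assign each aligned read to the amplicon it overlaps most. The amplicon names and coordinates are hypothetical placeholders, and reads are represented as plain tuples rather than parsed from a BAM file.

```python
from collections import defaultdict

# Hypothetical amplicon coordinates (0-based, half-open); not the study's targets.
AMPLICONS = {
    "amplicon_A": ("chr1", 1000, 1150),
    "amplicon_B": ("chr1", 1100, 1260),
    "amplicon_C": ("chr2", 500, 640),
}


def overlap(a_start, a_end, b_start, b_end):
    """Number of bases shared by two half-open intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start))


def assign_amplicon(chrom, start, end, amplicons=AMPLICONS):
    """Return the name of the amplicon with the largest overlap, or None."""
    best_name, best_overlap = None, 0
    for name, (a_chrom, a_start, a_end) in amplicons.items():
        if a_chrom != chrom:
            continue
        shared = overlap(start, end, a_start, a_end)
        if shared > best_overlap:
            best_name, best_overlap = name, shared
    return best_name


def partition(reads, amplicons=AMPLICONS):
    """Group read IDs by assigned amplicon; unassigned reads fall under None."""
    groups = defaultdict(list)
    for read_id, chrom, start, end in reads:
        groups[assign_amplicon(chrom, start, end, amplicons)].append(read_id)
    return dict(groups)


# Toy alignments: (read_id, chromosome, start, end).
reads = [
    ("r1", "chr1", 1000, 1140),
    ("r2", "chr1", 1110, 1255),
    ("r3", "chr2", 510, 630),
    ("r4", "chr3", 0, 100),
]
print(partition(reads))
```

Partitioning before variant calling keeps reads from overlapping or neighboring amplicons from being pooled, so each hotspot can be called and its depth assessed separately.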

Acknowledgements

This study was supported by grants from Korea University, Republic of Korea (KR) [K2125811], the Korea Medical Device Development Fund (KR) [RS-2021-KD000007], the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) funded by the Ministry of Health & Welfare (KR) [HR22C1302], and the Gene Editing Control Restoration-based Technology Development Project through the National Research Foundation (NRF) (KR) [RS-2023-00262309] to J.W.H.

Author contributions

Conceptualization: S.Y., J.K.H., and J.W.H. Data curation: S.Y., K.-Y.K., Y.-H.W., T.P., H.J., S.J.K., J.Y.H., H.J.C., I.S.L., H.J.C., J.W.R., J.-S.K., M.Y.K., M.K., Y.K., S.L., J.H.C., J.-A.G., W.H., J.K.H., and J.W.H. Formal analysis: S.Y., K.-Y.K., Y.-H.W., T.P., H.J., S.J.K., J.Y.H., H.J.C., I.S.L., J.-A.G., W.H., J.K.H., and J.W.H. Clinical sample acquisition: J.-S.K., M.Y.K., M.K., Y.K., S.L., J.H.C., and J.W.H. Funding acquisition: S.Y., H.J.C., J.W.R., J.K.H., and J.W.H. Writing: S.Y., J.K.H., and J.W.H.

Competing interests

S.Y. filed patent applications based on this study.

Correspondence

Correspondence to Junho K Hur or Junseok W Hur

There is a potential competing interest: Sunghyeok Ye filed patent applications based on this study.

ExtendedDataTablesandFigures.docx
