1.1 Sample extraction and data
From January 2019 to March 2022, bone marrow (BM) and peripheral bloods (PB) of all MM patients, never receiving treatment, were collected in our hospital. All patients were selected based on IMWG 2014 criteria [29], estimated survival time greater than 5 years. All patients signed an informed consent form which approved by the ethics committee. Blood samples and BM were collected before the first cycle of chemotherapy treatments. All samples were screened hematopoietic malignancy associated genetic mutations from cfDNA and BM, using the Next-generation sequencing. As the high price of the NGS, the mutated genetic, checked in both cfDNA and BM, were monitored at the different time points during the treatment by ddPCR. Paraprotein were checked at the same time to compare the fractional abundance (FA) of mutations. The following data were collected in a prospective database: clinical characteristics (gender, age, ECOG-PS), biological data (type of Myeloma, ISS and DS at diagnosis, Cytogenetic abnormalities) and follow-up data (paraprotein、fractional abundance (FA) of mutation、data of relapse、data of death or last follow-up).
1.2 gene target sequencing
1.2.1 Genomic DNA extraction
Peripheral bloods and BM of all subjects were collected in cell-free DNA storage tubes and EDTA tubes respectively. Genomic DNAs were extracted using QiaAmp Blood DNA Mini kits (Qiagen, CA, USA). The concentration and quality of genomic DNAs were examined with a NanoDrop® ND-1000 (Thermo Fisher Scientific, MA, USA), the Qubit® 3.0 Fluorometer (Thermo Fisher Scientific, MA, USA) and 1% agarose gel electrophoresis.
1.2.2 DNA Library Preparation
The paired-end DNA sequencing libraries were prepared through genomic DNA shearing, use a Covaris™ (Woburn, MA, USA) sonicator, followed by peak detection, end repair, poly A-tailing, paired-end adaptor ligation, and amplification. The qualified genomic DNA sample was randomly fragmented by Covaris ™ (Woburn, MA, USA) and the size of the library fragments is mainly distributed between 250 bp and 300 bp. End-Repair and A-tailing were then applied to facilitate ligation of the adapters, containing unique barcodes for each sample, specific to the Illumina technology for amplification and sequencing. KAPA Hyper Prep kit was used for these steps, according to the manufacturer’s instructions.
1.2.3 Targeted sequencing
Target gene capture was performed using SureSelect custom designs (Agilent Technologies, Inc., Santa Clara, CA) targeting whole exons of the 393 most frequent hematologic genes. Each captured library was then loaded on novase6000 platform (Illumina, Inc., San Diego, CA) at the Wuhan Kindstar Global gene Technology (Kindstar, Wuhan, China). Each sample was sequenced at the mean depth of 1500–2500× to achieve high sensitivity and accuracy for mutations detection.
1.2.4 Data analysis
Raw image files were processed by Illumina basecalling Software for base-calling with default parameters and the sequences of each individual were generated as 150 bp pair-end reads. Paired-end reads were aligned to the human reference genome (GRCh37/HG19) using the Burrows-Wheeler Aligner(BWA, v0.7.17). The strict data quality control (QC) was performed in the whole analysis pipeline for the clean data, the mapping data, the variant calling, etc. The Genome Analysis ToolKit ༈GATK, v4.1.1.0༉ was used for insertion/deletion realignment, quality score recalibration, and variant identification with duplicate reads removed by the Picard-tools (v4.1.1.0). ANNOVAR (v201804) was used to annotate mutations.
After sequence alignment and variant calling, synonymous variants, intronic variants far away from the exon/intron boundaries, and variants with a minor allelic frequency (MAF) ≥ 1% in the 1000 Genomes Project, the dbSNP database, and the Exome Aggregation Consortium (ExAC) database were removed from further analysis. NGS reads were visualized using an integrated genomic viewer (IGV).
1.2.5 Variant classification
Accurate classification of somatic genetic alterations detected by next-generation sequencing (NGS) was of paramount importance to ensure the provision of high-quality clinical data. Clinical significance of variants can be assessed and tiered based on guidelines from the Association for Molecular Pathology (AMP), the American Society of Clinical Oncology, and the College of American Pathology for the interpretation of somatic sequence variants identified in cancer. A four-tiered system to categorize somatic sequence variations based on their clinical significances is proposed: tier I, variants with strong clinical significance; tier II, variants with potential clinical significance; tier III, variants of unknown clinical significance; and tier IV, variants deemed benign or likely benign. Tiers I to III must be reported in descending order of clinical importance. It is not recommended to include tier IV or benign/likely benign variants/alterations in the report.
Then, we placed verified germline variants into the following categories according to guidelines from the American College of Medical Genetics and Genomics (ACMG) and Association of Molecular Pathology (AMP) : pathogenic (P), likely pathogenic (LP), variant of uncertain significance (VUS), likely benign (LB) and benign (B).
1.3 Droplet Digital PCR (ddPCR) Technology
1.3.1 Introduction
The unique sample partitioning step of digital PCR, paired with Poisson statistical data analysis, allows higher precision than traditional PCR and qPCR methods. Accordingly, digital PCR is particularly well suited for applications that require the detection of small amounts of input nucleic acid or finer resolution of target amounts among samples, for example, rare sequence detection, copy number variation (CNV) analysis, and gene expression analysis of the rare targets.
Droplet Digital PCR (ddPCR) is a method for performing digital PCR that is based on water-oil emulsion droplet technology. A sample is fractionated into 20,000 droplets, and PCR amplification of the template molecules occurs in each individual droplet. ddPCR technology uses reagents and workflows similar to those used for most standard TaqMan probe-based assays. This technique has a smaller sample requirement than other commercially available digital PCR systems, reducing cost and preserving precious samples.
1.3.2 QX200 Droplet Digital PCR System Workflow
1) Genomic DNA extraction
Genomic DNAs were extracted using QIAamp Circulating Nucleic Acid Kit (Qiagen, CA, USA) according to the manufacturer’s instructions.
2) Prepare PCR-Ready Samples prior to Starting ddPCR
Combine DNA sample and primers and probes with the ddPCR supermix to create prepared sample on ice. Load 20µl of our prepared sample into individual well.
3) Droplet Generation
Prior to droplet generation, nucleic acid samples are prepared as they are for any real-time assay: using primers, fluorescent probes (TaqMan probes with FAM and HEX or VIC), and a proprietary supermix developed specifically for droplet generation. Samples are then placed into the QX200 Droplet Generator, which utilizes proprietary reagents and microfluidics to partition the samples into 20,000 nanoliter-sized droplets. The droplets created by the QX200 Droplet Generator are uniform in size and volume.
4) PCR Amplification of Droplets
Droplets are transferred to a 96-well plate for PCR amplification with end point(40cycles)using thermal cycler (Table 1).
5) Droplet Reading
Following PCR amplification of the nucleic acid target in the droplets, the samples are placed in the QX200 Droplet Reader, which analyzes each droplet individually using a two-color detection system (set to detect FAM and either HEX or VIC), enabling multiplexed analysis for different targets in the same sample. The droplet reader and its bundled QuantaSoft™ software count the PCR-positive and PCR-negative droplets.
6) Analyze Results
Positive droplets, containing at least one copy of the target, exhibit increased fluorescence over negative droplets. In ddPCR, the QuantaSoft software measures the numbers of droplets that are positive and negative for each fluorophore (for example, FAM and HEX) in a sample. The fraction of positive droplets is then fitted to a Poisson distribution to determine the absolute initial copy number of the target DNA molecule in the input reaction mixture in units of copies/µl.