NGS technology has been applied to various hematologic diseases. Many gene panels and several methods have been used to detect not only sequence variants but also large gene deletions and duplications or fusion genes [12, 20, 21]. In this study, we found a significant agreement rate between NGS and FISH for CDKN2A/B CNV detection. We also found significant IKZF1 deletions using an NGS CNV analysis. A presumed diagnosis of BCR-ABL1-like ALL can be enabled by using an NGS CNV analysis to test for a high prevalence of genetic alterations in IKZF1 and JAK2 because IKZF1 alterations (~ 80%) and JAK2 alterations (~25%) are more prevalent in BCR-ABL1-like ALL than in other B-ALL sub-types [2]. Although we did not perform gene express profiling and FISH or RT-PCR for alterations commonly found in BCR-ABL1-like ALL, we found seven cases with IKZF1 alterations in non-ALL with BCR-ABL, and they had adverse clinical effects. We assume that the seven patients with non-ALL with BCR-ABL1 and an IKZF1 alteration probably had BCR-ABL1-like ALL.
In B-ALL, RAS signaling (NRAS, KRAS, PTPN11, FLT3, NF1, et al.) and B-cell differentiation and development (PAX5, IKZF1, EBF1, et al.) are the most common pathogenic pathways [11, 22], and we found a similar distribution of genetic changes in this study. In the adult high-risk cytogenetic group, IKZF1 deletion was the most common alteration (6/12, 50%). This high prevalence might have occurred because most cases (11/12) were ALL with BCR-ABL1.
In T-ALL, we found recurrent somatic sequence variants and CNVs similar to those reported in previous studies [4, 5]. Sequence variants of NOTCH1, IL7R, and FBXW7 and CDKN2A/B deletion are common mutations in T-ALL [5, 23]. Although only two cases of ETP were enrolled in this study, one of those patients had an FLT3-ITD and WT1 mutation [4]. More ETP cases are needed to reveal genetic alterations with T-ALL among Koreans.
Detection of gene mutations that predispose individuals to cancer is important for several reasons, including modification of cancer management, early detection of secondary malignancies or non-malignant complications, and genetic counseling. The characteristic clinical and physical manifestations of multiple cancers and familial cancer histories can be suggestive of predisposing gene mutations. Some researchers have published a guideline for referral indications for cancer predisposition assessment [8]. However, cancer predisposition assessments can be delayed or ignored for several reasons. Heterogeneous clinical and physical manifestations can be missed by physicians. Sometimes, patients have no familial history because they have a de novo mutation [24]. Therefore, the germline mutation detection rate from guideline-based referrals for cancer predisposition assessment (using clinical, physical, and familial history criteria) is lower than from universal, targeted panel sequencing of cancer genes using tumor–normal matched samples [25]. In this study, we analyzed unselected sporadic ALL cases to investigate the overall germline mutation rate in Korea.
In this study, we used CR-state bone marrow slides for germline mutation detection. Skin fibroblasts are the only recommended control sample for germline mutations. Peripheral blood (PB) and bone marrow can be contaminated with leukemic cells. Other samples, such as saliva or buccal swab, can be also contaminated with PB. Furthermore, Age-related clonal hematopoiesis can be observed in ~10% of the healthy population, which increases with age [26]. However, a skin biopsy is an invasive procedure, so such samples are not easily available. Therefore, we used CR-state bone marrow slides (acquired to test for residual leukemic cells) as the control for germline mutations. No apparent leukemic samples were obtained in our review of bone marrow morphology or the FISH, chromosome, flow-cytometry, and RT-PCR results. The variant allele fraction (VAF) and public population databases were used as filtering tools [27]. Germline variants can have a VAF of >33% even in tumor samples [27]. Variants within that range have a high possibility of germline origin. Variants registered in public databases, such as the Single Nucleotide Polymorphism database (dbSNP), the 1000 Genomes Project, the Exome Aggregation Consortium (ExAC) database, and the Human Gene Mutation Database (HGMD), are probably also of germline origin. We double checked the germline variants both in CR and leukemic samples. True germline variants (identified in CR samples) were also identified in paired leukemic samples with similar VAF. On the contrary, true somatic variants (identified in leukemic samples) were not found or were found with very low VAF in paired CR samples.
At initial diagnosis, only leukemic samples without a normal control might be available to analyze genetic alterations. In that case, discrimination of somatic and germline genetic alterations is difficult. Some genes could have both somatic and germline alterations. For example, PAX5 is a predisposition gene for ALL, and a PAX5 alteration can also be found in B-ALL as a somatic alteration. Moreover, if the VAF of a PAX5 alteration is around 50%, it is difficult to determine whether its origin is somatic or germline. Many clinically important genes in ALL, such as IKZF1, ETV6, and RUNX1, can have both somatic and germline alterations.
To overcome the difficulty of tumor-only analysis, some researchers have sought computational or automatic algorithms to discriminate somatic and germline alterations using VAF and population databases. The accuracy of those methods depends on tumor purity, CNVs, and population allele frequency cut-off values. Sensitivity and specificity of more than 90% were reported [28, 29], but no consensus guidelines or cut-off values for tumor-only filtering strategies have been established. Therefore, further studies are still needed to accurately discriminate somatic and germline mutations in tumor-only samples.
Various syndromes increase the risk of ALL, with variable penetrance and preference. DS is the most common genetic cause of childhood leukemia. In an analysis of the National Registry of childhood tumors in the United Kingdom, 131 of 142 leukemia patients with underlying genetic causes were DS patients [6], and an analysis of approximately 18,000 European childhood ALL cases found that 2.4% of ALL patients also had DS [7]. Other genetic diseases with connections to ALL are ataxia telangiectasia, Nijmegen breakage syndrome, neurofibromatosis type 1, familial ALL, and Noonan syndrome [7]. Germline PAX5 and ETV6 mutations carry a high risk (high penetrance) of cancer, mainly ALL [7, 9, 30]. Bloom syndrome and constitutional mismatch repair deficiency syndrome also carry a moderate risk of ALL. Constitutional mismatch repair deficiency syndrome is more associated with T-cell lineage leukemia and lymphoma than B-cell lineage [31]. However, in this study, we did not find pathogenic or likely pathogenic variants among those gene mutations (PAX5, ETV6, NF1, BLM, ATM, et al.), which have high penetrance for ALL [7], possibly because we enrolled a relatively small number of unselected sporadic cases. Moriyama et al. reported that only 0.79% (35/4,405) of sporadic childhood ALL cases have a potentially pathogenic ETV6 variant [30].
In this study, we did identify four pathogenic or likely pathogenic variants (TP53, CTC1, TINF2, and LIG4) among 82 genes associated with 23 syndromes well known for their connection to hematologic malignancy. All four of the variants were heterozygous. Li-Fraumeni syndrome (TP53) is a well-known rare cancer syndrome. The most common cancers in patients with Li-Fraumeni syndrome are solid cancers (such as breast cancer, lung cancer, and bladder cancer) [15]. However, a somatic TP53 alteration is strongly associated with low hypodiploidy ALL (~90%), disease relapse, and germline origin (~40%) [32]. Although hypodiploid ALL accounts for only 5% of childhood ALL cases, hypodiploid ALL patients should be tested for Li-Fraumeni syndrome because of its poor prognosis and the possibility of a germline TP53 mutation [33].
CTC1, TINF2, and, LIG4 are genes associated with inherited bone marrow failure syndrome, whose clinical features overlap with DC [16]. Fanconi anemia, DC, Diamond-Blackfan anemia, and Shwachman-Diamond syndrome are also associated predominantly with solid cancers, whereas myelodysplastic syndrome and acute myeloid leukemia are predominant among hematologic malignancies rather than ALL [34]. Therefore, it is unclear whether these variants are causative genetic factors of ALL in our patients. A functional study or familial study, along with a physical and clinical investigation, is needed to confirm the correlations between ALL and these variants.
We also found 31 germline pathogenic or likely pathogenic variants in genes other than those associated with the 23 syndromes, including a CNV (CASP10). Among these 31 germline variants, ADA, CASP10, IL12RB1, JAK3, LPIN2, MEFV, and TYK2 are genes associated with PID [35]. Given that CTC1, TINF2, and, LIG4 are also associated with PID, nine of our patients (9.7%) had pathogenic or likely pathogenic heterozygous variants associated with PID. All these genes are inherited in the AR pattern except MEFV (AR or AD, familial Mediterranean fever) and TINF2 (AD, DC).
More than 300 distinct disorders and genes of PID have been classified by the International Union of Immunological Societies PID expert committee [35]. Familial Mediterranean fever (MEFV) is an auto-inflammatory disorder causing polyserositis, abdominal pain, arthritis, and other symptoms. ADA, JAK3, and LIG4 are causative genes for severe combined immunodeficiencies (SCID), which affect cellular and humoral immunity. ALPS (CASP10) is a disease of immune dysregulation. CTC1 is a causative gene of COATS plus syndrome, which overlaps with DC. Chronic recurrent multifocal osteomyelitis and congenital dyserythropoietic anemia (Majeed syndrome, LPIN2) are auto-inflammatory disorders. IL-12 and IL-23 receptor b1 chain deficiency (IL12RB1) and Tyk2 deficiency (TYK2) cause mendelian susceptibility to a mycobacterial disease associated with defects in intrinsic and innate immunity.
An increase in leukemia/lymphoma with PID is well known. A large analysis of PID patients in the United States Immune Deficiency Network database found significant increases in lymphoma (>8 fold-changes in PID cancer incidence over what was expected) [36]. Leukemia also increased, with 1.43 fold-changes in men and a 1.0 fold-change in women. The mechanism of leukemogenesis in PID is unclear, but intrinsic (cancer predisposition parallel to the immunological defect) and extrinsic (following chronic infections, inflammation, or loss of immunosurveillance) mechanisms were proposed by Hauck et al [37].
We were unable to determine whether our patients with variants associated with PID were primary immunodeficiency patients, carriers, or carrying non-pathogenic variants because all of them were heterozygous, and appropriate clinical/laboratory findings or functional studies for the identified variants were unavailable. Nevertheless, the high prevalence of PID-associated gene variants in our ALL patients suggests that PID-associated germline mutations are important in hematologic malignancies in Koreans, especially ALL. However no comprehensive study of the prevalence of cancer or lymphoma/leukemia among Korean PID patients has been done. There is not even a national registry for PID in Korea. One large study of PID in Korea observed 152 PID patients (<19 years) from 2001 to 2005 [38]. The most common diseases were antibody deficiencies (such as congenital agammaglobulinemia or IgA deficiency) and chronic granulomatous disease, though one patient died from lymphoma [38]. Further study of PID and the risk of malignancies is needed because of the relatively high prevalence of genetic alterations association with PID that we found in Korean ALL patients.
In addition to gene variants associated with hematologic/immunologic disorders, we also identified various pathogenic or likely pathogenic variants in this study. Genes associated with bilirubin metabolism (UGT1A1), thrombophilia (THBD), thalassemia (HBD), platelet glycoprotein IV deficiency (CD36), and medulloblastoma (PTCH2), among others, were identified in this study. As reported in previous studies, variants not directly associated with hematologic malignancies are frequently identified incidentally [7, 24]. The genes/diseases identified are representative of common genetic diseases in this ethnic group. The clinical significance of these variants is the diagnosis of unexpected co-morbid genetic diseases and supporting the management of specific complications [7].
Our study has several limitations. First, we enrolled a relatively small number of cases. We enrolled few or no patients with ETP, hypodiploid ALL, and adult ALL. Second, skin fibroblasts were not used in our search for germline mutations. Third, we did not perform a familial study or a clinical or physical investigation of the identified germline variants.