The Landscape of Gene Co-Expression Modules Correlating with Prognostic Genetic Abnormalities in AML

doi:10.21203/rs.3.rs-52517/v1

Download PDF

Research article

The Landscape of Gene Co-Expression Modules Correlating with Prognostic Genetic Abnormalities in AML

https://doi.org/10.21203/rs.3.rs-52517/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background: The heterogenous cytogenetic and molecular variations were harbored by AML patients, some of which are related with AML pathogenesis and clinical outcomes. We aimed to uncover the intrinsic expression profiles correlating with prognostic genetic abnormalities by WGCNA.

Methods: We downloaded the clinical and expression dataset from BeatAML, TCGA and GEO database. Using R (version 4.0.2) and ‘WGCNA’ package, the co-expression modules correlating with the ELN2017 prognostic markers were identified (R² ≥ 0.4, p < 0.01). ORA detected the enriched pathways for the key co-expression modules. The patients in TCGA cohort were randomly assigned into the training set (50%) and testing set (50%). The LASSO penalized regression analysis was employed to build the prediction model, fitting OS to the expression level of hub genes by ‘glmnet’ package. Then the testing and 2 independent validation sets (GSE12417 and GSE37642) were used to validate the diagnostic utility and accuracy of the model.

Results: A total of 37 gene co-expression modules and 973 hub genes were identified for the BeatAML cohort. We found that 3 modules were significantly correlated with genetic markers (the ‘lightyellow’ module for NPM1 mutation, the ‘saddlebrown’ module for RUNX1 mutation, the ‘lightgreen’ module for TP53 mutation). ORA revealed that the ‘lightyellow’ module was mainly enriched in DNA-binding transcription factor activity and activation of HOX genes. The ‘saddlebrown’ module was enriched in immune response process. And the ‘lightgreen’ module was predominantly enriched in mitosis cell cycle process. The LASSO- regression analysis identified 6 genes (NFKB2, NEK9, HOXA7, APRC5L, FAM30A and LOC105371592) with non-zero coefficients. The risk score generated from the 6-gene model, was associated with ELN2017 risk stratification, relapsed disease, and prior MDS history. The 5-year AUC for the model was 0.822 and 0.824 in the training and testing sets, respectively. Moreover, the diagnostic utility of the model was robust when it was employed in 2 validation sets (5-year AUC 0.743-0.79).

Conclusions: We established the co-expression network signature correlated with the ELN2017 recommended prognostic genetic abnormalities in AML. The 6-gene prediction model for AML survival was developed and validated by multiple datasets.

Cancer Biology

Oncology

acute myeloid leukemia

weighted co-expression network analysis

prognostic signature.

The prognosis of AML is characterized by clonal cytogenetic and molecular variations harbored by leukemic cells, the prognostic significance of which is validated by previous studies and integrated into the ELN 2017 risk-stratification system[1], which is recommended by the NCCN AML guideline. The prognostic markers include: 1) fusion genes, like RUNX1-RUNX1T1/MLLT3-KMT2A/CBFB-MYH11/DEK-NUP24/BCR-ABL; 2) cytogenetic variants, like − 5 or del(5q), -17 or abn(17p), -7; 3) molecular variations, like CEBPA mutation, FLT3-ITD, RUNX1, ASXL1, TP53 etc. The treatment option and prognosis are based on the risk stratification of individual patients. It is reported that specific expression signatures are strongly associating and interacting with some cytogenetic/molecular variation. For instance, the NPM1 mutation was reported to be correlated with over-expression of PBX3 and HOXA gene cluster, which is required for the maintenance of leukemia cells harboring NPM1 mutation [2]. To our knowledge, the association of expression signatures with other important mutations, such as FLT3-ITD/TP53/etc, have not been fully elucidated. Moreover, risk stratification of AML patients is not determined only by signal gene variation, but the combination of multiple gene status. Different mutation status of NPM1 and FLT3-ITD, demonstrate low risk (mutated NPM1 without FLT3-ITD), intermediate risk (mutated NPM1 with FLT3-ITD, or wildtype NPM1 without FLT3-ITD), and adverse risk (wildtype NPM1 with FLT3-ITD). Therefore, this study also aims to uncover the difference of transcriptomic signatures between mutation combinations. The correlation analysis between molecular variant and transcription profiles, will promote the identification of potential valuable prognostic markers and therapeutic targets[3].

Over the recent years, the increasing genomic and expression data has emerged by next generation sequencing, which provided us the chance to identify genetic and transcriptomic markers relating to clinical outcomes. To improve the power of test for correlation analysis, we utilized the BeatAML database, which was the largest RNAseq dataset for AML patients by now[4]. BeatAML database included the clinical/cytogenetic/mutation/expression data originated from 672 patients. The WGCNA was used to identify the co-expression modules based on the scale-free network, and calculate the first principal component of gene modules as module eigengenes (MEs) [5, 6]. Then the target modules were identified, the ME of which was significantly correlated with prognostic markers of ELN2017. The sequential ORA was conducted to reveal the enriched cell signaling pathways for target modules. Furthermore, the LASSO regression analysis was used to reduce the dimensionality and fit survival data to the prediction model based on expression level of hub genes. Due to the batch effect between individual expression datasets, the accuracy of prediction model was limited. However, the successful establishment of the model provided us key genetic variables associating with survival, which will uncover the potential crucial expression signatures.

In this study, we found significantly correlated gene modules to NPM1, RUNX1 and TP53 mutation, the enriched cell signaling pathways of which were identified. The integrated hub genes across modules were input into LASSO analysis and established prediction model of OS, which was validated by 2 external datasets. Our work offered the landscape of expression signatures relating to ELN2017 prognostic markers and revealed the key hub genes and pathways.

Datasets download

The clinical, genetic and transcription matrix was downloaded from BeatAML database [4] (http://www.vizome.org/aml/). 14 available markers were selected for further analysis, including ELN2017 risk stratification, complex karyotype, del(7), RUNX1-RUNX1T1, CBFB-MYH11, biallelic CEBPA mutation, MLL3-KMT2A, DEK-NUP214, GATA2-MECOM, FLT3-ITD, NPM1 mutation, ASXL1 mutation, RUNX1 mutation and TP53 mutation. Then the expression dataset was download in the form of RPKM. After the data was integrated, 421 non-APL AML patients with prognostic markers and RNAseq data were selected for WGCNA. Since the patients in BeatAML accepted various experimental target therapy and non-standard treatment, which lead to unignorable bias for survival analysis. So, we included other expression datasets with survival data to establish and validate the prediction model. We downloaded the OS and expression data from TCGA database (https://portal.gdc.cancer.gov/) (IlluminaHiSeq_RNASeqV2 platform,136 non-APL AML cases). The microarray data and survival information was obtained from GEO database (https://www.ncbi.nlm.nih.gov/geo/), for GSE12417[7] (Affymetrix Human Genome U133 Plus 2.0 Array, 79 cytogenetic normal non-APL AML cases) and GSE37642 [8]( Affymetrix Human Genome U133 Plus 2.0 Array, 140 non-APL AML cases). Because no detail allelic ration information was provided by BeatAML database, we followed the recommendation of ELN2017 in such situation, and considered the presence of FLT3-ITD as high risk, unless it co-occurred with NPM1 mutation which was considered as intermediate risk.

All datasets supporting our findings were available from public databases, the last visit was on June 22nd, 2012.

WGCNA

The whole gene set of RNAseq data were used to construct the co-expression network, by R software (version 4.0.2) and ‘WGCNA’ package[5]. The hierarchical clustering by average link, was implemented to detect outliers. To construct the scale-free network, the minimal beta value setting the scale free R² > 0.85, was defined as soft threshold power. Then the matrix of gene adjacency was generated from the calculation of inter-gene correlation coefficients by Pearson’s method, which was subsequentially turned into the topological overlap matrix (TOM). After the minimal module size was set as 30 genes, the average linkage hierarchical clustering was performed to divide the whole gene set into modules based on TOM-based dissimilarity. Then module membership (the correlation coefficients between individual gene and eigengene in the same module) and gene significance (the correlation coefficients between gene and prognostic markers) were calculated by Pearson’s method. Modules eigengenes (MEs), the first principal component of expression matrix, were correlated to the target prognostic markers. Correlation coefficient R²≥ 0.4 and p value < 0.01 were set as the criteria for significant correlation between MEs and prognostic markers. Then the hub genes were identified by gene significance ≥ 0.2, module membership ≥ 0.8 and q. Weighted < 0.01 (local FDR adjusted weighted p value of correlation between genes and prognostic markers).

Protein-protein interaction network of genes in selected modules

STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) database (https://string-db.org/) was used to predict PPI (protein-protein interaction) network information based on the previous evidence and experiments. After mapping the gene symbols into STRING database, minimal criteria for extracting PPI pairs was 0.4. The nodes were calculated and ranked by connectivity degree method, 10 top nodes were screened by cytoscape software (version 3.7.2) and cytohubba plugin.

To demonstrate the biological function and implication on cell signaling of significantly correlated MEs, DAVID[9] (Database for Annotation, Visualization and Integrated Discovery) online tool (https://david.ncifcrf.gov/) was used for gene enrichment analysis based on the Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The package ‘ReactomePA’ was implemented for analysis based on Reactome database (https://reactome.org/)[10]. The enriched q value (local FDR adjusted p value) < 0.05 is set as cut-off value for ORA.

Prediction model for AML survival

The univariate Cox proportional hazard regression analysis was employed to investigate the association of hub genes and OS of AML patients in TCGA AML cohort. Then to minimize the overfitting, we performed the iterative regression analysis (LASSO) to reduce dimensionality of inputted variables, and establish the prediction model of variables with non-zero coefficients. We conducted a bootstrap aggregation approach, and the 10-fold cross-validation by ‘glmnet’ package. The TCGA AML cohort was randomly assigned into training (50%) and testing (50%) sets. The risk scores of individual patients were calculated based on the expression level of gene variables with non-zero coefficients. The cutoff value was determined by function ‘surv_cutpoint’ of package ‘survminer’. Kaplan-Meier analysis and time-dependent ROC were performed by ‘survival’ and ‘survivalROC’ packages. Furthermore, the testing set and dependent validation cohorts, GSE12417 and GSE37642, were employed to validate the robustness of diagnostic accuracy on overall survival by the prediction model. Then the risk scores of AML patients were compared by unpaired t test, between ELN2017 risk groups, with or without MDS history, newly diagnosed or relapsed patients. Furthermore, the risk scores and other possible prognostic factors, like age, blast percentage, FAB subtypes, type of induction treatment, race, sex, risk stratification of ELN2017 and transplantation, were inputted into multivariate Cox analysis to validate whether the risk score was an independent risk factor for AML survival.

Results of WGCNA

The clinical and genetic features of included AML cases in BeatAML and TCGA database were shown in Table 1. The expression data of 421 non-APL AML patients were inputted into WGCNA. No outliers were detected after all samples were hierarchically clustered using average distance and Pearson’s method, the dendrogram for which was shown in Supplementary Figure 1. The lowest soft threshold power = 6, by which the scale free R² > 0.85 (Figure 1A). The calculation based on TOM-based dissimilarity divided the whole gene set into 37 gene modules (Figure 1B), after we merged the modules with dissimilarity less than 20% by setting the mergeCutHeight as 0.20. 400 randomly selected genes were grouped into modules and generate the heatmap of topological overlap (Figure 1C), indicating high topological overlap degree of co-expression network in individual modules. Moreover, the correlation of co-expressed modules was demonstrated by module eigengenes adjacency heatmap (Figure 1D). Finally, the relationship of modules and traits was shown in Figure 1E. whereas we noticed that the status of gene mutation combinations will also be of prognostic value rather than the single gene, such as NPM1 and FLT3. Therefore, we performed correlation analysis for modules with different combinations of NPM1/FLT3-ITD mutation status (Figure 1F). 3 pairs of module-trait were identified to be significant correlation, and were studied sequentially, including ‘lightyellow’ module (R² = 0.41/ p = 2e-18 for FLT3-ITD, R² = 0.66/ p = 4e-53 for NPM1 mutation), ‘saddlebrown’ module (R² = 0.6/ p = 2e-43, for RUNX1 mutation) and ‘lightgreen’ module (R² = 0.41/ p = 2e-10, for TP53 mutation). The ‘violet’ module was initially identified as significantly correlated with NPM1 mutation (R² = 0.46/ p = 3e-23), while the following detail analysis on NPM1/FLT3-ITD combinations indicated that this module was not significantly related to any combination. So, the ‘violet’ module was excluded from further analysis. According to the criteria of hub genes, 973 hub genes were identified for further analysis.

The ‘lightyellow” co-expressed module

The ‘lightyellow’ module included 143 genes and was positively correlated with FLT3-ITD (R² = 0.41, p = 2e-18) and NPM1 mutation (R²= 0.66, p = 4e-53) respectively. The further analysis on combinations of NPM1 mutation and FLT-ITD, the ‘lightyellow’ module was only found to be related to NPM1 mutation, regardless of the presence of FLT3-ITD. Despite the correlation coefficients didn’t meet the criteria, the potential negative correlations were detected for the ‘lightyellow’ module with ELN2017 risk stratification, complex karyotype, RUNX1-RUNX1T1, CBFB-MYH11, CEBPA biallelic mutation, ASXL1, RUNX1 and TP53 by p value less than 0.01.

The results of ORA for genes in the ‘lightyellow’ module were shown in Figure 2A. According to GO analysis, the genes were significantly enriched in biological processes like positive regulation of transcription, negative regulation of cell differentiation, etc. And the genes were enriched in molecular functions like RNA polymerase II regulatory region sequence-specific DNA binding, DNA binding, etc. Based on KEGG pathway analysis, the genes were enriched in transcriptional misregulation in cancer. The Reactome analysis indicated the genes were enriched in activation of HOX genes, activation of HOX genes during differentiation, etc.

Moreover, the genes of ‘lightyellow’ module were inputted into STRING analysis, resulting in the PPI network shown in Supplementary Figure 2. Top 10 genes with highest connectivity degrees among the PPI network were MEIS1, HOXA5, HOXA3, HOXA7, HOXA6, HOXA10, HOXB3, HOXA9, PBX3 and HOXB4 (Figure 2B).

The ‘saddlebrown’ co-expressed module

This module included 60 genes and was significantly correlated with RUNX1 mutation (R² = 0.6/ p = 2e-43). The results of ORA for genes in the ‘saddlebrown’ module were shown in Figure 3A, meanwhile the relationship of enriched Reactome pathways were shown in Figure 3B. GO analysis revealed that genes were significantly enriched in the following molecular functions: cytokine-mediated signaling pathway, positive regulation of immune response, etc. The biological processes analysis indicated the genes were enriched in MHC protein complex binding, MHC class II protein complex binding, etc. Based on cell component analysis, the genes were enriched in endocytic vesicle, lysosomal membrane, etc. KEGG pathway demonstrated the genes were significantly enriched in hematopoietic cell lineage, antigen processing and presentation, phagosome, etc. According to Reactome analysis, the genes were enriched in interferon signaling, PD-1 signaling, etc.

Furthermore, the genes of ‘saddlebrown’ module were inputted into STRING analysis, resulting in the PPI network shown in Supplementary Figure 3. Top 10 genes with highest connectivity degrees among the PPI network (Figure 3C) included HLA-DRB5, HLA-DMA, HLA-DRB1, CD74, HLA-DQA1, HLA-DPB1, CIITA, HLM-DMB, HLA-DRA and HLA-DQB1.

The ‘lightgreen’ co-expressed module

This module included 352 genes and was significantly correlated with TP53 mutation (R² = 0.41/ p = 2e-10). The results of ORA for genes in the ‘lightgreen’ module were shown in Figure 4A, meanwhile the relationship of enriched Reactome pathways were shown in Figure 4B. By GO analysis on biological processes, the genes were identified to be enriched in cell cycle phase transition, mitotic nuclear division, etc. Molecular function analysis based on GO database, indicated the genes were enriched in RNA binding, protein binding, etc. And cell component analysis according to GO database, revealed the genes were enriched in chromosome, ribosome, etc. KEGG pathway analysis demonstrated the genes were enriched in DNA replication, p53 signaling pathway, etc. In Reactome analysis, the genes were enriched in Mitotic G2-G2/M phases, transcriptional Regulation by TP53, etc.

Furthermore, the genes of ‘lightgreen’ module were inputted into STRING analysis, resulting in the PPI network shown in Supplementary Figure 4. Top 10 genes with highest connectivity degrees among the PPI network (Figure 4C) included CCNA2, CDK1, BUB1, NCAPG, KIF11, BUB1B, CCNB1, TOP2A, CDC20 and AURKA.

Results of LASSO penalized regression analysis

The hub genes were inputted into LASSO penalized regression analysis, to fit OS of AML patients to the prediction model. After 1000 times of iteration between training and testing set in TCGA AML cohort, an optimized model of 6 gene with non-zero coefficient were identified, including NFKB2, NEK9, HOXA7, APRC5L, FAM30A and LOC105371592. The prediction model for AML OS were established by the 6-gene expression signature, the coefficients of which were listed in Table 2.

The association of 6-gene expression signature with traditional risk factors of AML

The risk scores were calculated for 6-gene expression signature for the individual patients in TCGA AML cohort. The risk scores of patients in the adverse ELN2017 risk group, were significantly higher than that of favorable group (t = 2.041, df = 250, p = 0.0423, Figure 5A). meanwhile, AML patients with MDS history, had higher risk scores than that without MDS history (t = 2.268, df = 394, p = 0.0239, Figure 5B). The relapsed AML patients had higher risk scores than de novo AML patients (t = 2.347, df = 394, p = 0.0194, Figure 5C).

The validation of 6-gene signature model

The diagnostic utility was evaluated in the training set, testing set and 2 external independent validation sets. After the cut-off values for risk scores were calculated in each cohort and divided patients into low and high risk groups, the Kaplan-Meier plots were used to compare the OS between groups by log-rank test. The OS in low risk group was significantly longer than that in high risk group in all 4 sets (Figure 6A-D). In the TCGA training set, median OS was not reached in low risk group, and 8.08 months in high risk group (HR = 4.203, 95%CI 2.251-7.847, p < 0.0001). The median OS was also not reached in low risk group, and 9.30 months for high risk group (HR = 3.342, 95%CI 1.817-6.145, p = 1e-4). Similar results were uncovered in GSE12417 (HR = 3.188, 95%CI 1.720-5.910, p < 0.0001) and GSE37642 (HR = 2.489, 95%CI 1.658-3.737, p < 0.0001). The distribution of risk scores, survival time and gene-survival heatmap were shown for the training set (Figure 7) and the testing set (Figure 8). As AUC is a very crucial indicator for utility of a prognostic model, the time dependent ROC analysis was performed for the 4 set (Figure 9A-D). The 5-year AUC for the training and testing set were 0.822 and 0.824 respectively, whereas it is 0.79 and 0.743 in GSE12417 and GSE37642, respectively, which demonstrated the superiority and robustness in expression datasets generated from different platforms. The results of multivariate COX regression analysis were shown in Table 3, indicating the risk factor was an independent risk factor for OS of AML patients.

The heterogeneity of expression profiles has been studied for AML patients harboring different mutations or chromosomal abnormality[11–14], but the co-expressed gene modules have not been rarely linked to the genetic markers using WGCNA. Ravasz et al. demonstrated hierarchical network modularity for metabolic network, based on which they proposed the utility of Topological Overlap Matrix to measure how strongly the nodes are connected[15]. Then the concept was transplanted to gene expression network to investigate the scale-free properties by developing a clustering method (WGCNA), by which the gene modules were identified, with high co-expression and strong network connectivity[6].

In the present study, the results of WGCNA indicated that the ‘lightyellow’, ‘saddlebrown’ and ‘lightgreen’ were significantly correlated with NPM1, RUNX1 and TP53 mutation, respectively. In Fig. 1E, the ‘lightyellow’ module was insignificantly or negatively correlated with all genetic variations, except NPM1 and FLT3-ITD. Then the correlation analysis with NPM1/FLT3 combinations indicated that ‘lightyellow’ module was the NPM1-specific co-expression module. ORA indicated that genes in this module were mainly enriched in regulation of transcription and cell differentiation, the molecular function of which involved in RNA polymerase II-specific DNA binding transcription factor activity. In the ‘lightyellow’ module, the HOXA family genes (HOXA1-7/HOXA9-11/HOXA13) and HOXB family genes (HOXB2-9), and the cofactors of HOXA (PBX3/MEIS1) were involved in the abovementioned pathways[16, 17]. The HOX9-11, PBX3 and MEIS1 were also involved in the ‘Transcriptional misregulation in cancer’ pathway (KEGG database), and ‘Activation of HOX genes during differentiation’ pathway (Reactome database). The sequential PPI network analysis also supported the central position of HOXA/B family genes and their cofactors, which demonstrated they have the highest connectivity degrees among the ‘lightyellow’ module. The deregulation of HOX genes and their cofactors was initially reported in AML patients harboring MLL fusion genes[18–20], and played a role in leukemogenesis in this type of AML[21]. The similar expression signature was revealed in NPM1 mutated AML, in which upregulation of PBX3 and HOX9 was required to maintain the survival of leukemic cells[2, 22], and was related to unfavorable clinical outcomes[11, 23]. Additionally, similar WGCNA was performed using TCGA AML data, the results of correlation analysis between co-expression modules and genetic variants were shown in Supplementary Fig. 4. Notably, we noticed there was also a strongly significantly correlated module with NPM1 mutation (R² = 0.71/ p = 4e-28), 45 out 109 genes (Supplementary Table 2) in which were overlapped with the ‘lightyellow’ module, including HOXA/B gene clusters and PBX3/MEIS1. These results from clinical and experimental studies confirmed the accuracy of our module analysis and identification of key genes. Considering NPM1 mutation is the most common genetic variant in AML, the specific expression signature characterized by HOXA/B-PBX3-MEIS1, will provided insights to further studies.

For RUNX1 mutation, the ‘saddlebrown’ module was the most correlated co-expression module (Fig. 1E). Corresponding to the mutant exclusivity of NPM1 and RUNX1 mutations reported previously[24], the ‘saddlebrown’ module seems to be underexpressed in patients harboring NPM1 mutation (R² = -0.34/ p = 6e-13). The results of ORA indicated the genes of ‘saddlebrown’ were mainly enriched in pathways involving in immune response, cytokine signaling and antigen presentation. These genes are also implicated in the enriched KEGG pathways, including antigen processing and presentation, hematopoietic cell linage, phagosome, and cell adhesion (Fig. 3A). Figure 3B showed the interaction network of ‘saddlebrown’ module enriched pathways by Reactome database, which predominantly composed of immune signaling. The core part with highest connectivity degrees in the ‘saddlebrown’ module, included HLA gene cluster, CD74 and CIITA (Fig. 3C). CD74 is known as the chaperone for MHC class II molecules implicating in antigen presentation[25], the role of which in AML remained unclear. While a recent single cell transcriptomic analysis demonstrated CD74 was expressed at high level in AML cells instead of normal myeloid cells[26]. The prognostic value of CD74 and association with RUNX1 mutation still need further study to validate. CIITA is a vital regulator of MHC class II gene expression including HLA-DR, HLA-DP and HLA-DQ, deregulation of which promoted abolishment of recognition from donor T cell, and lead to AML relapse after allo-HCT[27]. The dysregulation of immune signaling pathways and MHC class II regulator (CD74 and CIITA) may contribute to the inferior clinical outcomes in AML patients harboring RUNX1 mutation.

Since TP53 encodes a DNA-binding transcription factor inducing cell growth arrest and apoptosis upon various cellular stress[28]. Missense or null mutations of TP53 is one of the most powerful independent markers for adverse prognosis in AML[29, 30]. As the results of ORA, the upregulation of mitotic process related pathways in AML patients harboring TP53 mutation, probably resulted from abolishment of TP53 induced cell cycle arrest and apoptosis, which may promote the leukemic cell proliferation and disease progression. Therefore, the regulation of mitotic exit, which refers to the transition from mitosis to interphase, is crucial for TP53 mutated AML. Liu et al divided the regulator of mitosis exit into 4 groups: 1) APC/C; 2) cyclin B; 3) mitosis kinase and phosphatase; 4) kinesin and microtube-binding proteins[31], all of which were partly overlapped with genes in the ‘lightgreen’ model. Among the top genes in the module, CDC20, encoding cell division cycle protein 20 homolog, is required for anaphase promoting complex/cyclostome (APC/C) to confer full ubiquitin ligase activity and substate specificity. The aberrant expression of CDC20 was reported in AML and positively correlated with EZH2 and TET2 in AML[32], suggesting epigenetic factors participated in the regulation of CDC20. BI-D1870, the inhibitor of RSK, potentiated anti-leukemia activity of vincristine, which prevent the association of activator CDC20 with APC/C and impeded mitosis exit[33]. Other preclinical studies were performed to explore the anti-tumor mechanism of APC/C inhibitors, in cervical cancer, osteosarcoma, colorectal and lung cancer [34, 35]. The association of CDC20 upregulation with TP53 null mutation or functional silencing was uncovered in aneuploid AML [36], which suggested CDC20 as a potential target and biomarker in TP53 mutant AML. The regulatory subunit of cyclin B (encoded by CCNB1) and CDK1 (encoded by CDK1) constitutes the mitosis promoting factor (MPF), which controls the entry or exit of mitosis[37]. Overexpression of CCNB1 was associated with various cancer types [38–41], and predicted inferior response to mTOR inhibitor[42]. The direct association of CCNB1/CDK1 with TP53 has not been validated in AML. 3 main mitotic kinases included Aurora kinases, PLK1 and PP2A. AURKA was one of core genes in PPI network regarding TP53 mutation (Fig. 4C). Aurora Kinase A, encoded by AURKA, is implicated in cell mitosis process, including centrosome maturation and spindle formation[43, 44]. The anti-leukemic effect of Aurora kinase inhibitors was reported via inducing mitochondrial impairment[45], cell cycle arrest[46] etc. Preliminary clinical studies have been conducted to validate the efficacy and safety of Aurora kinase inhibitors in AML[47–49]. Although the association of TP53 mutation and AURKA expression has been rarely studied previously for AML, it was explored in solid tumors with high frequency of TP53 variants, including adrenocortical carcinoma[50], pancreatic cancer[51], head and neck cancer[52], hepatocellular carcinoma[53] and ovarian cancer[54]. Moreover, in cancer cell lines lacking p53, resulting from genetic engineering to express HPV16-E6 oncoprotein or siRNA targeting TP53, the inhibitor of Aurora kinases (VX680) induced apoptosis was enhanced in comparison with cell lines with wide-type p53[55]. PLK1 was also in the ‘lightgreen’ module, which encoded Polo-like kinase 1. PLK1 is a crucial regulator of multiple processes[56], including mitotic spindle assembly, chromatid separation, activation of Cyclin B/CDK1 complex[57], etc. PLK1 was found to be overexpressed in various AML cell lines and the majority of AML patients[58]. PLK1 inhibitor or siRNA targeting PLK1 blocked the proliferation of AML cell lines, while the normal hematopoietic progenitors were less sensitive to abolishment of PLK1[59]. This result provided the rationale for targeting PLK1 in the treatment of AML, the clinical trials had been conducted[60, 61]. The association of PLK1 over-expression and silencing of TP53 was reported in aneuploid AML[36]. Kinesin superfamily proteins (KIFs) function mainly as molecular motors binding to and moving across the microtube network [62, 63]. Seven members of KIF family (KIF11, KIF14, KIF15, KIF18A, KIF18B, KIF20A and KIF23) were included in the ‘lightgreen’ module. KIF 11 and KIF23 are the well-studies KIF family members and considered as an oncogene, inhibition of which can cause arrest at mitosis exit in hepatocellular cell carcinoma, lung cancer, pleural mesothelioma, and glioma cancer, breast cancer, meningiomas[64–69]. A phase I clinical trial on the highly selective kinesin spindle protein inhibitor, ARRY-520, has been conducted on advanced AML patients[70]. As the expression level of mitosis exit regulators was positively correlating with TP53 mutation, targeting at these regulators (APC/C, Aurora kinases, PLK1, KIFs) seems to be a reasonable strategy for TP53 mutant AML patients.

No significant correlating modules were detected for other genetic markers in ELN2017, including complex karyotype, del(7), RUNX1-RUNX1T1, CBFB-MYH11, CEBPA biallelic mutation, MLLT3-KMT2A, DEK-NUP214, GATA2-MECOM and ASXL1 mutation. This may partly be attributed to insufficient samples. A quantity of gene variations for one specific abnormality (complex karyotype, del(7), ASXL1) may also abolish the accuracy of our analysis.

The 6-gene signature with non-zero coefficients was identified by LASSO penalized regression analysis. AML patients with traditional risk factors (ELN2017 adverse risk stratification, with MDS history, relapsed disease) have higher risk scores of 6-gene signature (Fig. 5A-C). The Kaplan-Meier analysis demonstrated much better survival profiles of low risk group than that of high risk group for the training set, the testing set and 2 validation sets. Moreover, the multiple Cox proportional hazards regression analysis indicated that 6-gene risk scores were an independent risk factor for AML survival (Table 3). The time dependent ROC was employed to illustrate the diagnostic utility of the 6-gene model, which demonstrated the superior 5-year AUC in the training and testing sets (0.822 and 0.824, respectively). To evaluate the robustness of this model across different datasets, GSE12417 and GSE37642 were used to fit the OS data into 6-gene model. Owing to the different methods of measuring expression level in training/testing sets (RNAseq) and validation sets (microarray), a slight lower 5-year AUCs were estimated for the 2 validation sets (0.79 and 0.743), suggesting that this model was robust across different datasets. NFKB2, encoding nuclear factor NF-kappa-B p100 subunit, was in the ‘brown’ module, which was correlated with FLT3-ITD with a significant p value (6e-4, Fig. 1E). Through a non-canonical pathway, the phosphorylation of NFKB2/p100 leads to its proteolytic process and formation of NF-kappa-B RelB-p52 complex [71]. The study using MV-4-11 cell line, demonstrated FLT3-ITD promoted activation of NF-kappa-B RelB-p52 complex and repressed the expression of DAKP1 in association with HDACs[72]. Since the DAKP1 was acknowledged as tumor suppressor via inducing apoptosis and autophagy, and the abolishment of DAKP1 expression occurred in various cancers, including AML[73]. Therefore, the relationship between NFKB2 expression and FLT3-ITD was intriguing and provided potential biomarkers for HDAC inhibitors in FLT3-ITD AML. NEK9 encodes serine/threonine-protein kinase Nek9, which played a crucial role in G1/S phase transition and S phase progression[74]. Few studies focused on the association of NEK9 and AML, while Matthew et al identified NEK9 with increased activity or abundance in the imatinib resistant cell model of CML[75]. HOXA7 is a member of HOXA family, which is one of the most studied gene clusters in AML. HOXA7 is identified as potential prognostic markers in AML previously [76]. The association of ARPC5L, FAM30A and LOC105371592 with AML survival or pathogenesis, has not been described previously. Our LASSO analysis provided a robust prediction model for AML survival, and several potential biomarkers or therapeutic targets.

We identified the significantly correlated gene co-expression modules with prognostic cytogenetic or molecular markers of ELN2017. The sequential ORA illustrated the involved pathways for the key modules. Additionally, the novel prediction model was established with robust and superior diagnostic utility based on hub genes obtained from WGCNA.

AML, acute myeloid leukemia

ELN, European LeukemiaNet

WGCNA, weighted gene correlation network analysis

RPKM, Reads Per Kilobase per Million mapped reads

ORA, over-representation analysis

LASSO, least absolute shrinkage and selection operator

TCGA, the cancer genome atlas

OS, overall survival

AUC, area under curve

ROC, receiver operating characteristic curve

GEO, Gene Expression Omnibus

PPI, protein-protein interaction

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Availability of data and materials

The datasets analyzed during the current study are available in the GEO database (https://www.ncbi.nlm.nih.gov/gds/), TCGA database (https://portal.gdc.cancer.gov/) and BeatAML database (http://www.vizome.org/aml).

Competing interests

The authors declare that they have no competing interests.

Funding

This study was not supported by any funding.

Authors' contributions

GC, JQQ, ZCX, GM, LZL and GYY participated in the study design. GC performed most of the microarray and RNA-seq analyses. ZCX performed cluster and classification analyses. GC drafted the paper. GC, GM and LZL participated in the final preparation and revision of the paper. All authors read and approved this final manuscript.

Acknowledgements

Not applicable.

Dohner H, Estey E, Grimwade D, Amadori S, Appelbaum FR, Buchner T, Dombret H, Ebert BL, Fenaux P, Larson RA et al: Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood 2017, 129(4):424-447.
Zhang W, Zhao C, Zhao J, Zhu Y, Weng X, Chen Q, Sun H, Mi JQ, Li J, Zhu J et al: Inactivation of PBX3 and HOXA9 by down-regulating H3K79 methylation represses NPM1-mutated leukemic cell survival. Theranostics 2018, 8(16):4359-4371.
Handschuh L: Not Only Mutations Matter: Molecular Picture of Acute Myeloid Leukemia Emerging from Transcriptome Studies. J Oncol 2019, 2019:7239206.
Tyner JW, Tognon CE, Bottomly D, Wilmot B, Kurtz SE, Savage SL, Long N, Schultz AR, Traer E, Abel M et al: Functional genomic landscape of acute myeloid leukaemia. Nature 2018, 562(7728):526-531.
Langfelder P, Horvath S: WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 2008, 9:559.
Zhang B, Horvath S: A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 2005, 4:Article17.
Metzeler KH, Hummel M, Bloomfield CD, Spiekermann K, Braess J, Sauerland MC, Heinecke A, Radmacher M, Marcucci G, Whitman SP et al: An 86-probe-set gene-expression signature predicts survival in cytogenetically normal acute myeloid leukemia. Blood 2008, 112(10):4193-4201.
Hanzelmann S, Wang J, Guney E, Tang Y, Zhang E, Axelsson AS, Nenonen H, Salehi AS, Wollheim CB, Zetterberg E et al: Thrombin stimulates insulin secretion via protease-activated receptor-3. Islets 2015, 7(4):e1118195.
Huang da W, Sherman BT, Lempicki RA: Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009, 4(1):44-57.
Yu G, He QY: ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Mol Biosyst 2016, 12(2):477-479.
Falini B, Brunetti L, Sportoletti P, Martelli MP: NPM1-mutated acute myeloid leukemia: from bench to bedside. Blood 2020.
Stoner SA, Liu KTH, Andrews ET, Liu M, Arimoto KI, Yan M, Davis AG, Weng S, Dow M, Xian S et al: The RUNX1-ETO target gene RASSF2 suppresses t(8;21) AML development and regulates Rac GTPase signaling. Blood Cancer J 2020, 10(2):16.
Gu S, Zi J, Han Q, Song C, Ge Z: Elevated TNFRSF4 gene expression is a predictor of poor prognosis in non-M3 acute myeloid leukemia. Cancer Cell Int 2020, 20:146.
Cucchi DGJ, Bachas C, Klein K, Huttenhuis S, Zwaan CM, Ossenkoppele GJ, Janssen J, Kaspers GL, Cloos J: TP53 mutations and relevance of expression of TP53 pathway genes in paediatric acute myeloid leukaemia. Br J Haematol 2020, 188(5):736-739.
Ravasz E, Somera AL, Mongru DA, Oltvai ZN, Barabasi AL: Hierarchical organization of modularity in metabolic networks. Science 2002, 297(5586):1551-1555.
Li Z, Zhang Z, Li Y, Arnovitz S, Chen P, Huang H, Jiang X, Hong GM, Kunjamma RB, Ren H et al: PBX3 is an important cofactor of HOXA9 in leukemogenesis. Blood 2013, 121(8):1422-1431.
Nakamura T, Largaespada DA, Shaughnessy JD, Jr., Jenkins NA, Copeland NG: Cooperative activation of Hoxa and Pbx1-related genes in murine myeloid leukaemias. Nat Genet 1996, 12(2):149-153.
Ross ME, Mahfouz R, Onciu M, Liu HC, Zhou X, Song G, Shurtleff SA, Pounds S, Cheng C, Ma J et al: Gene expression profiling of pediatric acute myelogenous leukemia. Blood 2004, 104(12):3679-3687.
Zangrando A, Dell'orto MC, Te Kronnie G, Basso G: MLL rearrangements in pediatric acute lymphoblastic and myeloblastic leukemias: MLL specific and lineage specific signatures. BMC Med Genomics 2009, 2:36.
Li Z, Luo RT, Mi S, Sun M, Chen P, Bao J, Neilly MB, Jayathilaka N, Johnson DS, Wang L et al: Consistent deregulation of gene expression between human and murine MLL rearrangement leukemias. Cancer Res 2009, 69(3):1109-1116.
So CW, Karsunky H, Wong P, Weissman IL, Cleary ML: Leukemic transformation of hematopoietic progenitors by MLL-GAS7 in the absence of Hoxa7 or Hoxa9. Blood 2004, 103(8):3192-3199.
Brunetti L, Gundry MC, Sorcini D, Guzman AG, Huang YH, Ramabadran R, Gionfriddo I, Mezzasoma F, Milano F, Nabet B et al: Mutant NPM1 Maintains the Leukemic State through HOX Expression. Cancer Cell 2018, 34(3):499-512 e499.
Nagy A, Osz A, Budczies J, Krizsan S, Szombath G, Demeter J, Bodor C, Gyorffy B: Elevated HOX gene expression in acute myeloid leukemia is associated with NPM1 mutations and poor survival. J Adv Res 2019, 20:105-116.
Mendler JH, Maharry K, Becker H, Eisfeld AK, Senter L, Mrozek K, Kohlschmidt J, Metzeler KH, Schwind S, Whitman SP et al: In rare acute myeloid leukemia patients harboring both RUNX1 and NPM1 mutations, RUNX1 mutations are unusual in structure and present in the germline. Haematologica 2013, 98(8):e92-94.
Schroder B: The multifaceted roles of the invariant chain CD74--More than just a chaperone. Biochim Biophys Acta 2016, 1863(6 Pt A):1269-1281.
van Galen P, Hovestadt V, Wadsworth Ii MH, Hughes TK, Griffin GK, Battaglia S, Verga JA, Stephansky J, Pastika TJ, Lombardi Story J et al: Single-Cell RNA-Seq Reveals AML Hierarchies Relevant to Disease Progression and Immunity. Cell 2019, 176(6):1265-1281 e1224.
Toffalori C, Zito L, Gambacorta V, Riba M, Oliveira G, Bucci G, Barcella M, Spinelli O, Greco R, Crucitti L et al: Immune signature drives leukemia escape and relapse after hematopoietic cell transplantation. Nat Med 2019, 25(4):603-611.
Muller PA, Vousden KH: p53 mutations in cancer. Nat Cell Biol 2013, 15(1):2-8.
Papaemmanuil E, Gerstung M, Bullinger L, Gaidzik VI, Paschka P, Roberts ND, Potter NE, Heuser M, Thol F, Bolli N et al: Genomic Classification and Prognosis in Acute Myeloid Leukemia. N Engl J Med 2016, 374(23):2209-2221.
Prokocimer M, Molchadsky A, Rotter V: Dysfunctional diversity of p53 proteins in adult acute myeloid leukemia: projections on diagnostic workup and therapy. Blood 2017, 130(6):699-712.
Liu X, Chen Y, Li Y, Petersen RB, Huang K: Targeting mitosis exit: A brake for cancer cell proliferation. Biochim Biophys Acta Rev Cancer 2019, 1871(1):179-191.
Wang J, He N, Wang R, Tian T, Han F, Zhong C, Zhang C, Hua M, Ji C, Ma D: Analysis of TET2 and EZH2 gene functions in chromosome instability in acute myeloid leukemia. Sci Rep 2020, 10(1):2706.
Chae HD, Dutta R, Tiu B, Hoff FW, Accordi B, Serafin V, Youn M, Huang M, Sumarsono N, Davis KL et al: RSK inhibitor BI-D1870 inhibits acute myeloid leukemia cell proliferation by targeting mitotic exit. Oncotarget 2020, 11(25):2387-2403.
Zeng X, Sigoillot F, Gaur S, Choi S, Pfaff KL, Oh DC, Hathaway N, Dimova N, Cuny GD, King RW: Pharmacologic inhibition of the anaphase-promoting complex induces a spindle checkpoint-dependent mitotic arrest in the absence of spindle damage. Cancer Cell 2010, 18(4):382-395.
Sackton KL, Dimova N, Zeng X, Tian W, Zhang M, Sackton TB, Meaders J, Pfaff KL, Sigoillot F, Yu H et al: Synergistic blockade of mitotic exit by two chemical inhibitors of the APC/C. Nature 2014, 514(7524):646-649.
Simonetti G, Padella A, do Valle IF, Fontana MC, Fonzi E, Bruno S, Baldazzi C, Guadagnuolo V, Manfrini M, Ferrari A et al: Aneuploid acute myeloid leukemia exhibits a signature of genomic alterations in the cell cycle and protein degradation machinery. Cancer 2019, 125(5):712-725.
Chang L, Barford D: Insights into the anaphase-promoting complex: a molecular machine that regulates mitosis. Curr Opin Struct Biol 2014, 29:1-9.
Santala S, Talvensaari-Mattila A, Soini Y, Santala M: Prognostic value of cyclin B in endometrial endometrioid adenocarcinoma. Tumour Biol 2015, 36(2):953-957.
Nozoe T, Korenaga D, Kabashima A, Ohga T, Saeki H, Sugimachi K: Significance of cyclin B1 expression as an independent prognostic indicator of patients with squamous cell carcinoma of the esophagus. Clin Cancer Res 2002, 8(3):817-822.
Aaltonen K, Amini RM, Heikkila P, Aittomaki K, Tamminen A, Nevanlinna H, Blomqvist C: High cyclin B1 expression is associated with poor survival in breast cancer. Br J Cancer 2009, 100(7):1055-1060.
Koliadi A, Nilsson C, Holmqvist M, Holmberg L, de La Torre M, Warnberg F, Fjallskog ML: Cyclin B is an immunohistochemical proliferation marker which can predict for breast cancer death in low-risk node negative breast cancer. Acta Oncol 2010, 49(6):816-820.
Tsaur I, Makarevic J, Hudak L, Juengel E, Kurosch M, Wiesner C, Bartsch G, Harder S, Haferkamp A, Blaheta RA: The cdk1-cyclin B complex is involved in everolimus triggered resistance in the PC3 prostate cancer cell line. Cancer Lett 2011, 313(1):84-90.
Abdelbaki A, Akman HB, Poteau M, Grant R, Gavet O, Guarguaglini G, Lindon C: AURKA destruction is decoupled from its activity at mitotic exit but is essential to suppress interphase activity. J Cell Sci 2020, 133(12).
Joukov V, De Nicolo A: Aurora-PLK1 cascades as key signaling modules in the regulation of mitosis. Sci Signal 2018, 11(543).
Wang JX, Zhang L, Huang ZW, Zhang XN, Jiang YY, Liu FJ, Long L, Xue MJ, Lu G, Liu Q et al: Aurora kinase inhibitor restrains STAT5-activated leukemic cell proliferation by inducing mitochondrial impairment. J Cell Physiol 2020.
Qi J, Gao X, Zhong X, Zhang N, Wang R, Zhang H, Pan T, Liu X, Yao Y, Wu Q et al: Selective inhibition of Aurora A and B kinases effectively induces cell cycle arrest in t(8;21) acute myeloid leukemia. Biomed Pharmacother 2019, 117:109113.
Kantarjian HM, Schuster MW, Jain N, Advani A, Jabbour E, Gamelin E, Rasmussen E, Juan G, Anderson A, Chow VF et al: A phase 1 study of AMG 900, an orally administered pan-aurora kinase inhibitor, in adult patients with acute myeloid leukemia. Am J Hematol 2017, 92(7):660-667.
Lowenberg B, Muus P, Ossenkoppele G, Rousselot P, Cahn JY, Ifrah N, Martinelli G, Amadori S, Berman E, Sonneveld P et al: Phase 1/2 study to assess the safety, efficacy, and pharmacokinetics of barasertib (AZD1152) in patients with advanced acute myeloid leukemia. Blood 2011, 118(23):6030-6036.
Yee KW, Chen HW, Hedley DW, Chow S, Brandwein J, Schuh AC, Schimmer AD, Gupta V, Sanfelice D, Johnson T et al: A phase I trial of the aurora kinase inhibitor, ENMD-2076, in patients with relapsed or refractory acute myeloid leukemia or chronic myelomonocytic leukemia. Invest New Drugs 2016, 34(5):614-624.
Ikeya A, Nakashima M, Yamashita M, Kakizawa K, Okawa Y, Saitsu H, Sasaki S, Sasano H, Suda T, Oki Y: CCNB2 and AURKA overexpression may cause atypical mitosis in Japanese cortisol-producing adrenocortical carcinoma with TP53 somatic variant. PLoS One 2020, 15(4):e0231665.
Feng Y, Liu H, Duan B, Liu Z, Abbruzzese J, Walsh KM, Zhang X, Wei Q: Potential functional variants in SMC2 and TP53 in the AURORA pathway genes and risk of pancreatic cancer. Carcinogenesis 2019, 40(4):521-528.
Lee JW, Parameswaran J, Sandoval-Schaefer T, Eoh KJ, Yang DH, Zhu F, Mehra R, Sharma R, Gaffney SG, Perry EB et al: Combined Aurora Kinase A (AURKA) and WEE1 Inhibition Demonstrates Synergistic Antitumor Effect in Squamous Cell Carcinoma of the Head and Neck. Clin Cancer Res 2019, 25(11):3430-3442.
Caruso S, Calatayud AL, Pilet J, La Bella T, Rekik S, Imbeaud S, Letouze E, Meunier L, Bayard Q, Rohr-Udilova N et al: Analysis of Liver Cancer Cell Lines Identifies Agents With Likely Efficacy Against Hepatocellular Carcinoma and Markers of Response. Gastroenterology 2019, 157(3):760-776.
Alcaraz-Sanabria A, Nieto-Jimenez C, Corrales-Sanchez V, Serrano-Oviedo L, Andres-Pretel F, Montero JC, Burgos M, Llopis J, Galan-Moya EM, Pandiella A et al: Synthetic Lethality Interaction Between Aurora Kinases and CHEK1 Inhibitors in Ovarian Cancer. Mol Cancer Ther 2017, 16(11):2552-2562.
Gizatullin F, Yao Y, Kung V, Harding MW, Loda M, Shapiro GI: The Aurora kinase inhibitor VX-680 induces endoreduplication and apoptosis preferentially in cells with compromised p53-dependent postmitotic checkpoint function. Cancer Res 2006, 66(15):7668-7677.
Gjertsen BT, Schoffski P: Discovery and development of the Polo-like kinase inhibitor volasertib in cancer therapy. Leukemia 2015, 29(1):11-19.
Chopra P, Sethi G, Dastidar SG, Ray A: Polo-like kinase inhibitors: an emerging opportunity for cancer therapeutics. Expert Opin Investig Drugs 2010, 19(1):27-43.
Ikezoe T, Yang J, Nishioka C, Takezaki Y, Tasaka T, Togitani K, Koeffler HP, Yokoyama A: A novel treatment strategy targeting polo-like kinase 1 in hematological malignancies. Leukemia 2009, 23(9):1564-1576.
Renner AG, Dos Santos C, Recher C, Bailly C, Creancier L, Kruczynski A, Payrastre B, Manenti S: Polo-like kinase 1 is overexpressed in acute myeloid leukemia and its inhibition preferentially targets the proliferation of leukemic cells. Blood 2009, 114(3):659-662.
Ottmann OG, Muller-Tidow C, Kramer A, Schlenk RF, Lubbert M, Bug G, Krug U, Bochtler T, Voss F, Taube T et al: Phase I dose-escalation trial investigating volasertib as monotherapy or in combination with cytarabine in patients with relapsed/refractory acute myeloid leukaemia. Br J Haematol 2019, 184(6):1018-1021.
Kobayashi Y, Yamauchi T, Kiyoi H, Sakura T, Hata T, Ando K, Watabe A, Harada A, Taube T, Miyazaki Y et al: Phase I trial of volasertib, a Polo-like kinase inhibitor, in Japanese patients with acute myeloid leukemia. Cancer Sci 2015, 106(11):1590-1595.
Vale RD, Reese TS, Sheetz MP: Identification of a novel force-generating protein, kinesin, involved in microtubule-based motility. Cell 1985, 42(1):39-50.
Liu X, Gong H, Huang K: Oncogenic role of kinesin proteins and targeting kinesin therapy. Cancer Sci 2013, 104(6):651-656.
Sun X, Jin Z, Song X, Wang J, Li Y, Qian X, zhang Y, Yin Y: Evaluation of KIF23 variant 1 expression and relevance as a novel prognostic factor in patients with hepatocellular carcinoma. BMC Cancer 2015, 15:961.
Kato T, Wada H, Patel P, Hu HP, Lee D, Ujiie H, Hirohashi K, Nakajima T, Sato M, Kaji M et al: Overexpression of KIF23 predicts clinical outcome in primary lung cancer patients. Lung Cancer 2016, 92:53-61.
Kato T, Lee D, Wu L, Patel P, Young AJ, Wada H, Hu HP, Ujiie H, Kaji M, Kano S et al: Kinesin family members KIF11 and KIF23 as potential therapeutic targets in malignant pleural mesothelioma. Int J Oncol 2016, 49(2):448-456.
Takahashi S, Fusaki N, Ohta S, Iwahori Y, Iizuka Y, Inagawa K, Kawakami Y, Yoshida K, Toda M: Downregulation of KIF23 suppresses glioma proliferation. J Neurooncol 2012, 106(3):519-529.
Jungwirth G, Yu T, Moustafa M, Rapp C, Warta R, Jungk C, Sahm F, Dettling S, Zweckberger K, Lamszus K et al: Identification of KIF11 As a Novel Target in Meningioma. Cancers (Basel) 2019, 11(4).
Jiang M, Zhuang H, Xia R, Gan L, Wu Y, Ma J, Sun Y, Zhuang Z: KIF11 is required for proliferation and self-renewal of docetaxel resistant triple negative breast cancer cells. Oncotarget 2017, 8(54):92106-92118.
Khoury HJ, Garcia-Manero G, Borthakur G, Kadia T, Foudray MC, Arellano M, Langston A, Bethelmie-Bryan B, Rush S, Litwiler K et al: A phase 1 dose-escalation study of ARRY-520, a kinesin spindle protein inhibitor, in patients with advanced myeloid leukemias. Cancer 2012, 118(14):3556-3564.
Dobrzanski P, Ryseck RP, Bravo R: Differential interactions of Rel-NF-kappa B complexes with I kappa B alpha determine pools of constitutive and inducible NF-kappa B activity. EMBO J 1994, 13(19):4608-4616.
Shanmugam R, Gade P, Wilson-Weekes A, Sayar H, Suvannasankha A, Goswami C, Li L, Gupta S, Cardoso AA, Baghdadi TA et al: A noncanonical Flt3ITD/NF-kappaB signaling pathway represses DAPK1 in acute myeloid leukemia. Clin Cancer Res 2012, 18(2):360-369.
Guzman ML, Upchurch D, Grimes B, Howard DS, Rizzieri DA, Luger SM, Phillips GL, Jordan CT: Expression of tumor-suppressor genes interferon regulatory factor 1 and death-associated protein kinase in primitive acute myelogenous leukemia cells. Blood 2001, 97(7):2177-2179.
Tan BC, Lee SC: Nek9, a novel FACT-associated protein, modulates interphase progression. J Biol Chem 2004, 279(10):9321-9330.
Cooper MJ, Cox NJ, Zimmerman EI, Dewar BJ, Duncan JS, Whittle MC, Nguyen TA, Jones LS, Ghose Roy S, Smalley DM et al: Application of multiplexed kinase inhibitor beads to study kinome adaptations in drug-resistant leukemia. PLoS One 2013, 8(6):e66755.
Chen SL, Qin ZY, Hu F, Wang Y, Dai YJ, Liang Y: The Role of the HOXA Gene Family in Acute Myeloid Leukemia. Genes (Basel) 2019, 10(8).

Table 1. The clinical and genetic features of TCGA and BeatAML cohorts

	BeatAML	TCGA
patient number	421	136
female/male	180/241	61/75
median age (yr)	61	58
relapsed disease	23(5.46%)	NA
MDS history	41(9.74%)	NA
ELN2017 risk stratification
favorable	101(24.0%)	17(12.5%)
intermediate	156(37.1%)	80(58.8%)
poor	163(38.7%)	36(26.5%)
unknown	1(0.2%)	3(2.7%)
ELN2017 prognostic markers
complex karyotype	69(16.4%)	18(13.2%)
del(7)	6(1.4%)	NA
RUNX1-RUNX1T1	11(2.6%)	6(4.4%)
CBFB-MYH11	10(2.4%)	8(5.9%)
CEBPA_Biallelic	7(1.7%)	13(9.6%)
MLLT3-KMT2A	13(3.1%)	2(1.5%)
DEK-NUP214	3(0.7%)	NA
GATA2-MECOM	8(1.9%)	NA
FLT3-ITD	95(22.6%)	38(27.9%)
NPM1	108(25.7%)	38(27.9%)
ASXL1	31(7.4%)	2(1.5%)
RUNX1	32(7.6%)	14(10.3%)
TP53	27(6.4%)	11(8.1%)
5- or del(5q)	NA	3(2.2%)
NPM1 (+) FLT3-ITD (-)	59	NA
NPM1 (+) FLT3-ITD (+)	49	NA
NPM1 (-) FLT3-ITD (-)	100	NA
NPM1 (-) FLT3-ITD (+)	46	NA

Table 2. The prediction model for AML OS. The risk score of individual patients equals to the summation of products of included gene expression level and the corresponding coefficient.

ENSEMBL ID	gene symbol	gene name	coefficent
ENSG00000077150	NFKB2	nuclear factor kappa B subunit 2	0.04296
ENSG00000119638	NEK9	NIMA related kinase 9	0.070743
ENSG00000122592	HOXA7	homeobox A7	0.055637
ENSG00000136950	ARPC5L	actin related protein 2/3 complex subunit 5 like	0.748162
ENSG00000226777	FAM30A	family with sequence similarity 30 member A	0.294823
ENSG00000262050	LOC105371592	uncharacterized LOC105371592	0.031033

Table 3. The results of univariate and multivariate Cox analysis of traditional prognostic factors and risk scores generated by the prediction model.

	Univariate Cox analysis			Multivariate Cox analysis
Characteristics	Hazard Ratio	95% CI	p value	Hazard Ratio	95% CI	p value
age	1.03	1.02-1.05	0	1.01	0.99-1.03	0.298
blast_percentage	1	0.99-1.02	0.443	NA	NA	NA
FAB	1.06	0.94-1.2	0.351	NA	NA	NA
induction	3.51	2.19-5.61	0	1.84	0.98-3.46	0.057
race	0.86	0.66-1.13	0.287	NA	NA	NA
risk_cytogenetic	1.73	1.23-2.41	0.001	0.64	0.22-1.92	0.43
risk_molecular	1.83	1.31-2.56	0	3.36	1.2-9.44	0.021
riskscore	1.01	1-1.02	0.012	1.01	1-1.02	0.023
sex	1.05	0.69-1.57	0.831	NA	NA	NA
transplantation	0.41	0.27-0.62	0	0.42	0.25-0.72	0.002

Download PDF

Version 1

posted

You are reading this latest preprint version

The Landscape of Gene Co-Expression Modules Correlating with Prognostic Genetic Abnormalities in AML

Status:

Version 1

Abstract

Figures

Background

Methods

Results

Discussion

Conclusion

Abbreviations

Declarations

References

Tables

Supplementary Files

Status:

Version 1