DOI: https://doi.org/10.21203/rs.3.rs-321154/v1
Background: N6-methyladenosine is the most abundant RNA modification, which plays a prominent role in multiple biology processes, including tumorigenesis. Multiple myeloma (MM) is the second most frequent hematological cancer. However, the expression status and value of m6A-related genes in multiple myeloma remain elusive.
Methods: m6A-related gene expression data and clinical information of MM patients were obtained from the Gene Expression Omnibus (GEO) database. Based on differentially expressed analysis, protein-protein interaction (PPI) analysis, Pearson correlation analysis, Kaplan-Meier survival analysis and Cox regression analysis, the prognostic m6A-associated genes were identified. The receiver operation characteristic (ROC) curves were used to verify the prognostic and diagnostic efficiency. The molecular mechanisms were investigated by differentially expressed mRNA and microRNA analyses with the help of online database ENCORI and POSTAR.
Results: Among 22 m6A-related genes, HNRNPC, RBM15B, RBM15, YTHDF3, YTHDF2, HNRNPA2B1 and IGF2BP2 were significantly upregulated, while ZC3H13, FTO, IGF2BP3, ALKBH5 and YTHDC1 were significantly downregulated in MM patients. The high expression of HNRNPC, HNRNPA2B1, YTHDF2 and the low expression of ZC3H13 were associated with adverse survival. Furthermore, the expression level of YTHDF2 was the independent prognostic factor of MM. ROC curves suggested the great prediction performance for MM patients. Differentially expressed mRNA and microRNA analyses indicated the probable involvement of miR-205/YTHDF2/EGR1 axis.
Conclusions: Our study first systematically analyzed the expression status of m6A-related genes in multiple myeloma, identified HNRNPC, HNRNPA2B1, YTHDF2 and ZC3H13 that could be the potential prognostic biomarkers, especially YTHDF2, which may be implicated in the miR-205/YTHDF2/EGR1 axis.
Multiple myeloma is a neoplastic hematological disease characterized by clonal proliferation of malignant plasm cells in bone marrow (BM). This furious proliferation results in the expansion and accumulation of monoclonal proteins to cause the typical CRAB clinical features: hypercalcemia, renal insufficiency/failure, anemia and bone lesions. MM has been the second most frequent hematological malignancy and accounts for 1.79% of all new cancer cases and 2.11% of all cancer deaths worldwide[1]. Proteasome inhibitor (PI)-based combination paradigm and the great introduction of novel targeted drugs and immune therapies, such as monoclonal antibodies (mAbs) and chimeric antigen receptor-engineered T cells (CAR-T), have become established as a cornerstone of therapy throughout the myeloma treatment algorithm[2]. However, multiple myeloma is still an incurable cancer. The two leading obstacles are: the high recurrence rate and highly heterogeneous cell population whose subclone content evolves over time, especially in the high-risk MM patients. Thus, it is urgent to explore the underlying mechanisms of cancer progression and identify potential biomarkers and therapeutic targets for clinical appliances.
N6-methyladenosine (m6A) was first discovered in eukaryotic mRNA in the 1970s, and it has been recognized as the most abundant RNA modifications among more than 160 types of distinct chemically posttranscriptional modifications[3-5]. However, it was not until 2012 that the first whole transcriptome high-throughput sequencing of m6A modification was finished, initiating the detailed investigation of m6A in various diseases, especially in cancers[6, 7]. Similar to DNA and histone methylation, m6A modification is dynamically reversible with the involvement of m6A “writers”, “erasers” and “readers”[8]. The installation of m6A is catalyzed by writers/methyltransferase complex (MTX), including methyltransferase-like protein 3 (METTL3), METTL14, METTL16, WT1-associated protein (WTAP), RNA-binding motif protein 15 (RBM15), RBM15B, KIAA1429 (or named VIRMA) and zinc finger CCCH-type containing 13 (ZC3H13) [9, 10]. Eraser is a class of demethylases, including fat mass and obesity-associated protein (FTO), α-ketoglutarate-dependent dioxygenase homolog 5 (ALKBH5) and ALKBH3 [11-13]. Read proteins can recognize specific m6A sites and regulate the biofunction of target RNA. YT521-B homology domain family proteins 1/2/3 (YTHDF1/2/3), and YTH domain containing 1/2 (YTHDC1/2) are associated with RNA splicing, translation efficiency and degradation[14, 15]. Insulin-like growth factor 2 mRNA-binding proteins 1/2/3 (IGF2BP1/2/3) can promote mRNA stability and translation[16]. Heterogeneous nuclear ribonucleoprotein A2B1 (HNRNPA2B1) plays an important role in microRNA maturation. HNRNPG and HNRNPG, which can mediate RNA abundance and splicing, are considered as “indirect” readers, given that their tendency to preferentially bind to an RNA structure switch induced by m6A modifications [17, 18]. Emerging studies have demonstrated that aberrant m6A modifications of coding and non-coding RNAs play critical roles in tumorigenesis, metastasis and resistance of various cancers, including glioblastoma (GBM), acute myeloid leukemia (AML), hepatocellular carcinoma (HCC) and breast cancer [19-22]. Several m6A genes are involved in both normal and malignant hematopoiesis, such as METTL3, YTHDF2 and FTO [23]. However, the function of m6A in multiple myeloma is still extremely elusive, and there are only a few published articles focusing on this field up to now[24-26]. In addition, previous articles have only taken into consideration one or few m6A-related regulators, no research have been able to conduct a systematic study about the field.
In the present study, we first used the RNA-seq datasets and clinical data of multiple myeloma from the GEO database to explore the differential expression m6A-related genes. The interactions among these regulators were assessed via PPI network and Pearson correlation analysis. Furthermore, Kaplan-Meier survival analysis and Cox regression model were established to identify prognostic predictors, which were further verified with the use of ROC curves. Finally, YTHDF2 was identified as the independent prognostic factor of MM, and the underlying molecular mechanism was investigated by differential mRNA/microRNA expression analyses. Our study first dissected the systematically abnormal expression status of m6A-related genes in MM and may provide novel and promising biomarkers and therapeutic targets.
2.1 Data preparation and processing
The gene expression profiles containing clinical information of multiple myeloma patients were downloaded from the NCBI GEO database (https://www.ncbi.nlm.nih.gov/geo/). The raw count data of mRNA expression profiles were preprocessed with normalization and log2 transformation. The overall design and workflow of this study are given in Fig. 1.
GSE47552 upon on the GPL6244 platform (Affymetrix Human Gene 1.0 ST Array transcription gene version) was selected to investigate m6A-related differentially expressed genes (DEGs) and the expression level of m6A-related genes in different stages of multiple myeloma. GSE47882 contains 5 normal plasma cell (NPC) samples, 20 patients with monoclonal gammopathy of unknown significance (MGUS), 33 with high-risk smoldering multiple myeloma (SMM) and 41 with MM. All samples are newly diagnosed without any treatment;
GSE24080 based on GPL570 platform (Affymetrix Human Genome U133 Plus 2.0 Array) was selected to conduct the survival analysis and the analysis of the relationship between expression level and clinical characteristics. GSE24080 includes totally 559 MM patients, and to ensure the homogeneity of this analysis, 313 IgG type MM were finally included. The detailed information of this dataset is shown as below: a. gender:195 (62.3%) males and 119 (37.7%) females; b. age: the median age is 57.75 (29.70 to 76.50); c. race: 270 (86.3%) white people and 43 other races (13.7%);
Four datasets from GEO database were employed to identify DEGs grouped by the YTHDF2 expression level (1) GSE47552; (2) GSE24080; (3) GSE19784 based on GPL570 includes 320 samples obtained from newly diagnosed MM patient; (4) GSE16558 based on GPL6244 platform contains 60 MM patients.
The GSE17498 dataset from GEO database was selected to identify differentially expressed microRNAs (DEMi) in MM; it upon GPL8227 platform (Agilent-019118 Human miRNA Microarray 2.0 G4470B, miRNA ID version) includes 38 multiple myeloma samples, 2 plasma cell leukemia (PCL) samples and 3 normal donors.
2.2 Identification of differentially expressed m6A genes
We systematically analyzed the expression of 22 m6A genes, including seven m6A writers (METTL3, METTL14, WTAP, KIAA1429, RBM15, RBM15B and ZC3H13), three m6A erasers (FTO, ALKBH5, ALKBH3) and twelve m6A readers (IGF2BP1/2/3, YTHDF1/2/3, YTHDC1/2, HNRNPA2B1, HNRNPC, RBMX) in GSE47552 by using the LIMMA package [27] (version 3.46.0; http://bioinf.wehi.edu.au/limma/) from the R Studio (version 1.4.1103; https://rstudio.com/). P-value < 0.05 was used as a threshold parameter to identify differentially expressed m6A genes. The heatmap was generated via PHEATMAP package (version 1.0.12; https://cran.rstudio.com/web/packages/pheatmap/index.html).
2.3 PPI network and Pearson correlation analysis
The PPI network among the differentially expressed m6A regulatory genes was constructed using the STRING online database (www.string-db.org). Cytoscape software (version 3.8.2) was used for visualization[28]. Pearson correlation analysis was performed to demonstrate the association among different m6A-related genes based on the Pearson’s correlation coefficient (r) value between -1 and +1. The CORRPLOT package (version 0.84; https://github.com/taiyun/corrplot) was used to visualize the correlation matrix.
2.4 Survival analysis
The expression level of each gene was divided into high expression group and low expression group according to the mean value. Kaplan-Meier curve analysis with log-rank test was used to analysis survival rates between different subgroups. Univariate and multivariate Cox regression analysis was performed to identify the independent prognostic factor. Briefly, variables with P-value<0.10 in univariate analysis were eligible for multivariate Cox regression analysis with forward likelihood ration (LR) method. R studio with SURVIVAL package (version 3.2-7; https://github.com/therneau/survival) was used for visualization.
2.5 Investigation of underlying molecular mechanisms of YTHDF2
Firstly, 60 MM patients in ISS-III stage selected from GSE24080 dataset, 41 MM patients from GSE47552, 60 MM patients from GSE16558 and 320 MM patients from GSE19784 were stratified into high and low YTHDF2 expression groups according to the mean value. Differential expression analysis was performed by LIMMA package, and an absolute log2 fold change (FC)>0.6 and P-value<0.05 was used as the cutoff for DEGs. The microRNA expression data of GSE17498 was used to observe DEMi between MM patients and normal samples. Then, the online Venn diagram website (https://bioinfogp.cnb.csic.es/tools/venny/index.html) was used to obtain the intersection of significantly downregulated genes from these four datasets. Next, the datasets downloaded from the POSTAR3 database were used to identify YTHDF2-targeting RNA among these common downregulated genes [29] (http://lulab.life.tsinghua.edu.cn/postar/). Finally, the information of DEMi and the dataset of their downstream genes from ENCORI database [30] (http://starbase.sysu.edu.cn/) were used to identify the upstream microRNA of YTHDF2. An absolute log2 fold change (FC)>2.0, and P-value<0.05 were used as the threshold for significantly DEMi.
2.6 Statistical analysis
Statistical analyses were conducted using SPSS 26.0, R Studio and GraphPad Prism software (version 8.0.2). Continuous variables were described as the mean ± standard deviation (SD); Mann-Whitney test and Kruskal-Wallis test were used to test the difference of subgroups as appropriate; Student’s t test and One-way ANOVA followed by Bonferroni post hoc comparison were employed to compare the difference in expression of m6A genes. ROC curves and the area under the ROC curve (AUC) were examined to determine the predictive value in MM diagnosis and prognosis. P<0.05 was considered statistically significant if not specified. All statistical tests were two-sided.
3.1 Twelve of twenty-two m6A-related genes were differentially expressed in multiple myeloma
The transcriptome data from the GSE47552 dataset was employed to identify the differential expression of twenty-two m6A-related genes in MM (n=41) compared with NPC (n=5). The heatmap showed that the expression of twelve m6A methylation regulators (HNRNPC, ZC3H13, RBM15B, FTO, RBM15, YTHDF3, IGF2BP3, ALKBH5, YTHDF2, YTHDC1, HNRNPA2B1 and IGF2BP2) were differentially expressed in MM patients compared to normal volunteers (Fig. 2a). We observed that HNRNPC, RBM15B, YTHDF3, RBM15, YTHDF2, HNRNPA2B1 and IGF2BP2 were significantly upregulated, while ZC3H13, FTO, YTHDC1, ALKBH5 and IGF2BP3 were significantly downregulated in MM patients (Fig. 2b). In addition, the expression of METTL3, KIAA1429, YTHDF1, METTL14, METTL16, YTHDC2, ALKBH3, WTAP, RBMX and IGF2BP1 was similar between MM and NPC samples (Additional file 1: Table S1).
3.2 PPI network and Pearson correlation analysis among m6A RNA methylation regulators
To further investigate the relationship among these 12 m6A RNA methylation regulators, PPI network was established via STRING database and CYTOSCAPE software (Fig. 3a). Of note, the PPI network demonstrated that RBM15, ALKBH5, YTHDC1, YTHDF2 and YTHDF3 were the hub genes. In addition, IGF2BP3 had no protein-protein association with other regulators, which was hidden in the figure. Moreover, the correlation of mRNA expression was analyzed by the Pearson correlation analysis (Fig. 3b). The correlation between YTHDC1 and HNRNPA2B1 was most positively significant (r=0.72); and the relationship between YTHDC1 and IGF2BP2 was most significantly negative (r=-0.38).
3.3 Identification the prognostic roles of m6A-related genes
To evaluate the prognostic roles of the m6A RNA methylation regulators in MM, Kaplan-Meier survival analysis was conducted among these 12 genes. GSE24080 dataset was selected, which containing 313 IgG type MM patient samples with clinical and survival information. Patients were divided into high expression group and low expression group according to the average expression level of each gene. We found that the high expression levels of YTHDF2, HNRPNC and HNRNPA2B1 were associated with poorer over survival (OS), albeit the OS of the low ZC3H13 expression group was significantly prolonged than that of the high expression group (Fig. 4a-d). However, the expression levels of RBM15B, FTO, RBM15, YTHDF3, IGF2BP3, ALKBH5, YTHDC1 and IGF2BP2 were not correlated with OS (Additional file 2: Fig. S1).
Furthermore, univariate Cox regression analysis demonstrated that International Staging System (ISS) stage, cytogenetic abnormalities, bone lesions, proportion of bone marrow plasm cell (BMPC), ZC3H13, HNRNPA2B1, HNRNPC and YTHDF2 were significantly associated with the OS, which were consistent with the results of Kaplan-Meier survival analysis. Based on these results, multivariate Cox regression analysis showed that ISS stage, cytogenetic abnormalities, bone lesions and the expression of YTHDF2 were independent negative prognostic factors (Table 1). Moreover, the performance of this prognostic predictor was confirmed by ROC curves. As shown in Additional file 3: Fig. S2, the AUC were 0.5611, 0.5815 and 0.5904 in the 3-, 4- and 5-year survival, respectively, which indicated some predictive power of YTHDF2.
Subgroup survival analysis of YTHDF2 showed that higher YTHDF2 expression level was correlated with poorer OS especially in ISS-III stage patients and patients with cytogenetic abnormalities (Fig. 5a-e). The subgroup analysis results of HNRNPC, HNRNPA2B1 and ZC3H13 (Additional file 4: Fig. S3; Additional file 5: Fig. S4) showed that low expression of ZC3H13 was correlated with poorer OS in ISS-II stage patients (P=0.00229).
Table 1. Univariate and multivariate analysis of OS using the Cox proportional hazard regression model
variables |
Number (%) all=360 |
univariate analysis |
multivariate analysis |
||
HR (95%CI) |
P value |
HR (95%CI) |
P value |
||
Age, years |
|
|
|
|
|
<65 |
236 |
||||
>=65 |
77 |
0.953(0.593-1.531) |
0.842 |
||
Gender |
|||||
female |
118 |
||||
male |
195 |
1.103(0.733-1.660) |
0.638 |
||
Race |
|||||
white |
270 |
||||
others |
43 |
1.145(0.639-2.052) |
0.649 |
||
ISS stage |
|||||
I |
194 |
||||
II+III |
119 |
2.531(1.705-3.759) |
<0.001 |
2.323(1.553-3.476) |
<0.001 |
BMPC a |
|||||
<46% |
157 |
||||
>=46% |
156 |
2.118(1.404-3.197) |
<0.001 |
0.054 |
|
Cytogenetic abnormalities |
|||||
yes |
117 |
||||
no |
196 |
2.662(1.788-3.961) |
<0.001 |
2.178(1.450-3.271) |
<0.001 |
Bone lesions |
|||||
yes |
210 |
||||
no |
103 |
2.246(1.374-3.670) |
0.001 |
2.372(1.447-3.890) |
0.001 |
ZC3H13 |
|||||
low |
169 |
||||
high |
144 |
0.602(0.405-0.893) |
0.012 |
0.245 |
|
HNRNPA2B1 |
|||||
low |
157 |
||||
high |
156 |
1.562(1.047-2.331) |
0.029 |
0.091 |
|
HNRNPC |
|||||
low |
144 |
||||
high |
169 |
1.594(1.062-2.394) |
0.025 |
0.600 |
|
YTHDF2 |
|||||
low |
155 |
||||
high |
158 |
1.849(1.233-2.773) |
0.003 |
1.684(1.115-2.541) |
0.013 |
a 46% is the mean value of the percentage of bone marrow plasm cell of 313 MM patients.
HR: hazard ratio
3.4 Correlation of prognostic m6A-related genes and clinical characteristics
Next, we analyzed the correlation between the expression of these four prognostic risk signature genes and the clinicopathological features of MM patients in the GSE24080 cohort (Table 2). We found gender-related differences in the expression of HNRNPC and ZC3H13, while each of the four was age-independent genes. High expression of YTHDF2 and HNRNPA2B1 were associated with advanced ISS stage. In addition, the higher expression of HNRNPA2B1 was correlated with higher BMPC ratio. Multiple myeloma patients with cytogenetic abnormalities were observed to have higher HNRNPC expression and lower ZC3H13 expression.
HNRNPC gene is located at 14q11.2, thus, we used the GSE14080 dataset to investigate the correlation between aberrant expression of HNRNPC and chromosome 14 translocations. However, although the HNRNPC expression level was increased in patients with chromosome 14 translocations, we found no significant difference between normal FISH (fluorescence in situ hybridization) MM patients and patients with t(4;14)/(14;16)/(11;14) (p=0.576, Additional file 6: Fig. S5A). Furthermore, ZC3H13 and RB1 gene (retinoblastoma-1) are co-located at chromosome 13q14, Del (13q) is considered as the adverse predictor.[31]. We observed the expression level of ZC3H13 was significantly decreased in MM patients with RB deletion compared to normal FISH patients (p=0.008, Additional file 6: Fig. S5B).
Table 2. The correlation of prognostic m6A RNA methylation regulators and clinical characteristics
variables |
YTHDF2, mean (SD) |
P value |
HNRNPC, mean (SD) |
P value |
HNRNPA2B1, mean (SD) |
P value |
ZC3H13, mean (SD) |
P value |
|||||
Age, years |
0.220 |
0.146 |
0.242 |
0.348 |
|||||||||
<65 |
11.6718(0.31443) |
10.2541(0.36895) |
10.5382(0.39030) |
9.4182(0.42882) |
|||||||||
>=65 |
11.7052(0.38974) |
10.3141(0.37951) |
10.4734(0.35631) |
9.4729(0.39573) |
|||||||||
Gender |
0.700 |
0.019 |
0.309 |
0.014 |
|||||||||
female |
11.7026(0.38356) |
10.2045(0.38197) |
10.4957(0.38109) |
9.3573(0.42531) |
|||||||||
male |
11.6937(0.36648) |
10.3079(0.36105) |
10.5384(0.38372) |
9.4766(0.41293) |
|||||||||
Race |
0.564 |
0.675 |
0.694 |
0.352 |
|||||||||
white |
11.6925(0.37966) |
10.2711(0.37270) |
10.5208(0.39060) |
9.4431(0.41324) |
|||||||||
others |
11.7251(0.32592) |
10.2548(0.37061) |
10.5319(0.33259) |
9.3598(0.46536) |
|||||||||
ISS stage |
0.045 |
0.266 |
0.044 |
0.106 |
|||||||||
I |
11.6661(0.33939) |
10.2704(0.34648) |
10.4892(0.36052) |
9.4680(0.41989) |
|||||||||
II |
11.7066(0.40567) |
10.2152(0.42793) |
10.5373(0.43813) |
9.3743(0.44538) |
|||||||||
III |
11.7875(0.42817) |
10.3167(0.39157) |
10.6143(0.38487) |
9.3701(0.39227) |
|||||||||
BMPC |
0.361 |
0.518 |
0.022 |
0.344 |
|||||||||
<46% |
11.6864(0.34063) |
10.2912(0.34700) |
10.4804(0.35906) |
9.4553(0.41615) |
|||||||||
>=46% |
11.7077(0.40273) |
10.2464(0.39516) |
10.5644(0.40183) |
9.4077(0.42575) |
|||||||||
cytogenetic abnormalities |
0.116 |
0.009 |
0.210 |
0.035 |
|||||||||
yes |
11.7491(0.38145) |
10.3312(0.36061) |
10.5672(0.39261) |
9.3598(0.45651) |
|||||||||
no |
11.6659(0.36437) |
10.2316(0.37440) |
10.4954(0.37507) |
9.4744(0.39323) |
|||||||||
bone lesions |
0.760 |
0.231 |
0.163 |
0.429 |
|||||||||
yes |
11.7017(0.38267) |
10.2908(0.37122) |
10.5039(0.37900) |
9.4185(0.42481) |
|||||||||
no |
11.6874(0.35224) |
|
10.2241(0.37095) |
|
10.5597(0.38924) |
|
9.4583(0.41375) |
|
3.5 Identification the diagnostic roles of m6A-related genes
The development of MM has been widely identified as a multistage process starting with the premalignant condition MGUS and followed by the intermediate stage SMM. The underlying mechanism of this continuous disease spectrum is considered as the genetic instability [32]. To figure out the roles of m6A RNA methylation regulators in the development of MM, GSE47552 dataset, including 5 NPC patients, 20 MGUS patients and 33 SMM patients, was employed. The results showed that the expression of HNRNPA2B1 was significantly different between NPC and SMM, albeit no significant difference was observed between NPC and MM (Fig. 6a). As for HNRNPC, statistically significant increase was observed in the development of MM (Fig. 6b). The expression of YTHDF2 was significantly different in MM compared with MGUS and SMM, while no significant difference was found between MM and NPC (Fig. 6c). In addition, ZC3H13 expression level was significantly decreased with the disease development (Fig. 6d).
Furthermore, ROC curve analysis was conducted to verify the diagnostic value of these four genes (Fig. 6e-h). The AUC value of HNRNPA2B1 in MM for NPC, MGUS and SMM were 0.6293, 0.6402 and 0.8004, respectively (Fig. 6e) As fig. 6f shows, the AUC value of HNRNPC in MM for NPC, MGUS and SMM were 0.9902, 0.7561 and 0.5595, respectively. The AUC value of YTHDF2 were 0.6537, 0.7805, 0.7502 for NPC, MGUS and SMM, respectively (Fig. 6g). In addition, The AUC value of ZC3H13 were 1.0000, 0.7646, 0.7302 for NPC, MGUS and SMM, respectively (Fig. 6h). Our results demonstrated that the expression levels of HNRNPA2B1, HNRNPC, YTHDF2 and ZC3H13 have accurate discernibility ability of the continuous disease spectrum of MM.
3.6 Potential molecular mechanisms of YTHDF2 in MM
Firstly, 481 MM patients from four different datasets (60 in ISS-III stage from GSE24080 dataset, 41 from GSE47552, 60 from GSE16558 and 320 from GSE19784) were stratified into high and low YTHDF2 expression groups according to the mean value, subsequently, differential expression analyses were performed (Additional file 7: Table S2-S5).
The primary biofunction of YTHDF2 is identified as the regulation of m6A-containing RNA degradation[33]. Therefore, to explore the target RNA of YTHDF2, we only selected significantly downregulated genes for the next study. The Venn diagram showed the intersection of significantly downregulated genes from these four datasets (Fig. 7a), which indicated that eight RNAs (EGR1, RNASE2, CLC, S100A12, RNASE3, HBA2///HBA1, HBB and CXCL12) were significantly downregulated among these four cohorts. Next, YTHDF2-targeting RNA was obtained through intersecting eight common downregulated RNAs and the data from the POSTAR3 database (Fig. 7b), which showed that EGR1 is the probable target of YTHDF2 (Additional file 8: Table S6). As expected, the expression of EGR1 was significantly decreased in MGUS, SMM and MM compared to NPC (Fig. 7c).
Finally, GSE17498 dataset including 722 human microRNAs was used to identify DEMi between MM patients and normal samples. As Table 3 shows, 4 miRNAs were upregulated and 10 miRNAs were downregulated in MM patients compared to healthy individuals. Furthermore, to investigate the cause of abnormal YTHDF2 expression, we screened these DEMi and their target mRNA in the ENCORI database. The results indicated that YTHDF2 was the target of miR-205 (Fig. 7d), which was significantly downregulated in MM patients. Furthermore, unsurprisingly, Pearson correlation analyses in GSE24080 and GES19784 indicated that the expression level of YTHDF2 was negatively associated with miR-205 in MM (r=-0.175 and -0.351, respectively; Fig. 7e-f).
Taken together, we found the probable mechanism of miR-205/YTHDF2/EGR1 in the development of MM.
Table 3. The DEMi in GSE17498 dataset between MM patients and healthy dornors
Up-regulated miRNAs |
||||||
|
logFC |
AveExpr |
t |
P.Value |
adj.P.Val |
B |
hsa-miR-148a |
4.353993 |
9.578942 |
12.33122 |
8.27E-16 |
2.99E-13 |
25.82094 |
hsa-miR-365 |
2.539799 |
5.981127 |
4.097674 |
0.000178 |
0.007091 |
0.27561 |
hsa-miR-193b |
2.329963 |
6.846663 |
3.487717 |
0.001125 |
0.03125 |
-1.47862 |
hsa-miR-451 |
2.312865 |
6.549557 |
3.042909 |
0.00396 |
0.084087 |
-2.65702 |
Down-regulated miRNAs |
||||||
hsa-miR-363 |
-3.31986 |
2.266472 |
-12.6704 |
3.26E-16 |
2.36E-13 |
26.7257 |
hsa-miR-205 |
-3.08017 |
1.481864 |
-9.31884 |
6.14E-12 |
1.48E-09 |
17.09942 |
hsa-miR-28-5p |
-3.13096 |
3.011141 |
-9.2066 |
8.75E-12 |
1.58E-09 |
16.75138 |
hsa-miR-203 |
-3.0607 |
1.811558 |
-7.5428 |
1.95E-09 |
2.35E-07 |
11.43158 |
hsa-miR-200c |
-2.14786 |
2.10884 |
-6.22333 |
1.64E-07 |
1.58E-05 |
7.076365 |
hsa-miR-342-3p |
-3.82554 |
3.944036 |
-6.20442 |
1.75E-07 |
1.58E-05 |
7.013807 |
hsa-miR-92a |
-2.67398 |
6.138967 |
-5.43078 |
2.35E-06 |
0.000155 |
4.468974 |
hsa-miR-151-5p |
-3.00543 |
3.450716 |
-4.26654 |
0.000105 |
0.005056 |
0.784062 |
hsa-miR-155 |
-4.53579 |
4.23315 |
-3.95619 |
0.000276 |
0.009972 |
-0.14346 |
hsa-miR-21 |
-2.41606 |
7.872264 |
-2.83685 |
0.006886 |
0.130841 |
-3.16799 |
AveExpr: average expression; adj.P.Val: adjusted P value.
S100A12 (S100 calcium binding protein A12), RNASE3 (ribonuclease A family member 3), HBA2///HBA1 (hemoglobin subunit alpha 2///1), HBB (hemoglobin subunit beta), CXCL12 (C-X-C motif chemokine ligand 12).
Thanks to the introduction of triple-drug combinations and the approval of new agents against MM, the survival of MM patients has dramatically extended in the last decade. However, the medical needs of high-risk patients remain unmet, given the difficulty of eradicating genetically high heterogeneity of constantly evolving subclones [34]. The present risk stratification models are mostly based on tumor load and genetic abnormalities, however, ignore the epigenetic and epitranscriptomic alterations. New inhibitor targeting significant epigenetic regulators histone deacetylase (HDAC) has been applied for relapsed/refractory MM in clinical trials, which showed effective and tolerable anti-tumor ability [35]. Thus, faced with this scenario, it will become increasing urgent to reformulate these models with epigenetic/epitranscriptomic prognosis predictors to allow the currently available therapeutic arsenal to be more individualized [36]. m6A RNA methylation, the most remarkable epitranscriptomic modification, has been widely considered to be involved in tumorigenesis and resistance [8], while its biofunction in MM progression is little known yet. In this study, we first explored the comprehensive roles of m6A-associated genes in the regulation of MM.
The expression status of 22 m6A RNA methylation genes were analyzed using the GEO database. We found 12 out of them were significantly expressed in MM patients (HNRNPC, RBM15B, YTHDF3, RBM15, YTHDF2, IGF2BP2 and HNRNPA1B1 were upregulated, ZC3H13, FTO, YTHDC1, ALKBH5 and IGF2BP3 were downregulated). These findings implicated that m6A-related genes, acting as oncogenes or tumor-inhibiting factors, were dysregulated in MM. Furthermore, we elucidated that the high expression level of HNRNPA2B1, HNRNPC, YTHDF2 and low expression level of ZC3H13 were negatively associated with the OS of MM patients through Kaplan-Meier survival analysis. Univariate and multivariate Cox regression analysis indicated that the ISS stage, cytogenetic abnormality, bone lesion and the expression level of YTHDF2 were independent prognostic factors. Additionally, ROC curves verified the prognostic prediction value of YTHDF2. The initiation and progression of multiple myeloma is a multistep process including MGUS and SMM. Our results also showed m6A-related genes were involved in this development with significantly diagnostic capacity confirmed by ROC curves.
Consistent with the published articles, HNRNPC, as an oncogene, was overexpressed in various cancers and identified as the indispensable factor for tumorigenesis in breast cancer [37]. ZC3H13 was downregulated in colorectal cancer and found to suppress proliferation and invasion via inactivating Ras-ERK pathway [38]. The overexpression of YTHDF2 was observed in liver cancer, AML and GBM, which was implicated in the stemness maintenance of cancer stem cells [19, 39, 40]. HNRNPA2B1 was highly expressed in multiple types of cancer, and it promoted the proliferation of breast cancer cells through STAT3 pathway [41]. However, our results are inconsistent with a few studies. Our results showed that ALKBH5 was upregulated in MM patients; the expression level of FTO and METTL3 had no significant difference. Xu et al. revealed that FTO was upregulated in myeloma and extramedullary myeloma patients, and promoted cancer progression by regulating heat shock transcription factor 1 (HSF1) in a YTHDF2-dependent way [26]. In contrast, Bach et al. emphasized that the overexpression of METTL3 in MM, while no significant differences in the expression of FTO and ALKBH5 [24]. Divergences among these studies are probably associated with sample size, clinical features, tumor heterogeneity and detection methods. Therefore, what remains to be established is the systematic and detailed function of each member in the elaborate m6A methylation network in mediating the progression of multiple myeloma and relative mechanisms by which m6A modification alters gene expression posttranscriptionally.
Noticeably, we confirmed the remarkable relationship between some m6A-related genes and clinical features, especially cytogenetic abnormalities. Our results showed HNRNPC was increased while ZC3H13 was reduced in MM patients with cytogenetic abnormalities. The most common translocations of MM are those involving the IGH gene (14q32); t(14;16) and t(4;16) have been identified as the high-risk predictors by the International Myeloma Group (IMWG)[42]. However, although the HNRNPC expression level was increased in patients with chromosome 14 translocations, we found no significant difference between MM with and without chromosome 14 abnormalities. Additionally, the low expression of ZC3H13 was significantly associated with 13p deletion, which may be one of the mechanisms of the negative prognostic impact of del(13p). Del(13p) was observed in about 46% of MM patients, including monosomy 13 and interstitial deletion [43]. Although chromosome 13 deletion has consistently been associated with poor prognosis [44], a retrospective study observed an intriguingly opposite effect of monosomy 13 and partial deletion. The OS and progression-free survival (PFS) of patients with monosomy 13 were significantly shorten, while the presence of partial deletion was associated with significantly better OS [45]. The tumor-inhibiting factor RB1 is simultaneously lost with chromosome 13 abnormality. A recent research found that RB1 could inhibit programmed death ligand 1 (PD-L1) expression and promote tumor immunity, which may provide new insight to overcome the tumor immune evasion [46]. In contrast, “eraser” FTO was demonstrated to rectify PD-L1 expression and promote immune evasion [47, 48]. However, the roles of other m6A regulators in immune escape remain unclear, and the deepen understanding of m6A RNA methylation may explain the dissatisfactory effect of immune check point inhibitors in certain cancers.
YTHDF2, as a member of m6A “readers”, can selectively recognize and bind to m6A-containing RNA to initiate its degradation in a deadenylation-dependent manner by interacting directly with the SH domain of CNOT1, the scaffolding subunit of the CCR4-NOT deadenylase complex [33]. Accumulating evidences have demonstrated that YTHDF2 can act as an oncogene cooperating with dysregulated “writers” or “erasers” via degrading m6A-containg tumor-inhibiting factors [49]. For example, the overexpression of METTL3 and YTHDF2 in bladder cancer can promote cell proliferation and migration via the decay of tumor suppressors Kruppel-like factor 4 (KLF4) and SET domain containing 7 (SETD7) [50]. In addition, the downregulation of ALKBH5 abrogated the anti-tumor function of period circadian protein 1 (PER1) in a YTHDF2-dependent manner, which was involved in the reactivation of ATM-CHK2-P53/CDC25C signaling pathway [51].
Therefore, to investigate the regulatory mechanism of YTHDF2, we first conducted PPI network and Pearson correlation analyses. The results demonstrated that YTHDF2 can interplay with the m6A “writers” RBM15/RBM15B and “erasers” ALKBH5. The reduction of ALKBH5 cooperating with increased RBM15/RBM15B in MM may result in the elevated m6A level of YTHDF2-targeting mRNAs. Then, we divided MM patients into two groups according to YTHDF2 expression, and found that EGR1, a tumor suppressor significantly downregulated in MM, may be the downstream of YTHDF2. EGR1 plays a significant role in the regulation of cell growth, differentiation, apoptosis and has potent anti-tumor function [52] via participating the tumor suppressor network, including p53 [53] , phosphatase and tensin homolog (PTEN) [54] and AP-1 [55]. Low expression of EGR1 is correlated with high risk and poor survival of MM; and overexpression of EGR1 in MM cell lines leaded to profound proliferation inhibition and apoptosis with more sensitive to bortezomib [55]. Moreover, the combination of lenalidomide and dexamethasone synergistically induced cell cycle arrest and apoptosis via promoting the expression of a series of tumor suppressors including EGR1 [56].
MicroRNA is a class of non-coding RNA (ncRNA) which plays an important role in gene silencing ; and the aberrant microRNA expression is one of the key determinants of oncogenesis, metastasis and resistance. Not only do m6A modifications have regulatory effects on miRNAs, such as HNRNPA2B1, but also miRNAs can regulate m6A modifications by a sequence pairing mechanism[22]. The involvement of microRNA dysregulation in YTHDF2 overexpression has been recognized in cancers. miR-145 was implicated in the regulation of m6A levels of HCC by targeting YTHDF2 [57]. Furthermore, decreased expression of miR-495 and miR-493-3p leaded to upregulated YTHDF2 and enhanced proliferation and invasion in prostate cancer [58, 59]. Therefore, we finally performed differential microRNA expression analysis and observed that YTHDF2 was the target of miR-205, which was decreased in MM and negatively correlated with YTHDF2. Taken together, we observed the potential pathway miR-205/YTHDF2/EGR1 leading to the adverse survival of MM.
However, our study does have some limitations. First, the majority of patients included in our study were white, the prognostic value of m6A-related genes remains to be further verified in Asian and African cohorts. Second, given that our results are based on the RNA-seq data, the expression of these genes at protein level should be further clarified. Third, some important clinical information, including R-ISS stage, mSMART risk stratification, therapeutic strategies, was not available in the GEO database. Thus, experiment verification and clinical investigation with innovative design and orchestrated implement are needed to translate these descriptive results into clinical benefits.
In conclusion, our study identified m6A-associated genes ZC3H13, HNRNPC, HNRNPA2B1 and YTHDF2, especially YTHDF2, that could be the potential diagnostic and prognostic predictors in multiple myeloma. Furthermore, the potential pathway miR-205/YTHDF2/EGR1 was constructed to explain the oncogenic role of YTHDF2 in MM. Epitranscriptomics, as a novel subfield of epigenetics, may provide new targets for cancer treatment. Future research focusing on this field is pressing.
Additional file 1: Table S1. The results of differential expression m6A-related gene analysis
Additional file 2: Figure S1. The prognostic roles of m6A-related genes. Kaplan-Meier survival curve for YTHDF3, YTHDC1, IGF2BP3, IGF2BP2, RBM15B, RBM15, FTO and ALKBH5.
Additional file 3 Figure S2. The verification of the prognostic role of YTHDF2 in multiple myeloma. ROC curves in GSE24080 dataset.
Additional file 4: Figure S3. Subgroup analysis of ZC3H13, HNRNPA2B1 and HNRNPC for overall survival in MM. Kaplan-Meier survival curves comparing the OS between high and low expression among ISS-I/II/III patients.
Additional file 5: Figure S4. Subgroup analysis of ZC3H13, HNRNPA2B1 and HNRNPC for overall survival in MM. Kaplan-Meier survival curves comparing the OS between high and low expression among patients with/without cytogenetic abnormalities.
Additional file 6: Figure S5. The expression level of HNRNPC and ZC3H13 in MM patients. (A) The expression level of HNRNPC in MM patients with normal FISH and with chromosome 14 translocations. (B) The expression level of ZC3H13 in MM patients with normal FISH and with RB deletion. ns means no significant difference.
Additional file 7: Table S2. The results of significantly downregulated genes of GSE24080; Table S3. The results of significantly downregulated genes of GSE47552; Table S4. The results of significantly downregulated genes of GSE19784; Table S5. The results of significantly downregulated genes of GSE16558.
Additional file 8: Table S6. The binding information of YTHDF2 and EGR1 from POSTAR3 database
ALKBH5: α-ketoglutarate-dependent dioxygenase homolog 5; AML: acute myeloid leukemia; AUC: area under the ROC curve; BM: bone marrow; BMPC: bone marrow plasm cell; CAR-T: chimeric antigen receptor-engineered T cells; CLC: Charcot-Leyden crystal galectin; CXCL12: C-X-C motif chemokine ligand 12; DEGs: differentially expressed genes; DEMi: differentially expressed microRNA; EGR1: early growth response protein 1; FTO: fat mass and obesity-associated protein; GBM: glioblastoma; GEO: Gene Expression Omnibus; HBA2///HBA1: hemoglobin subunit alpha 2///1; HBB: hemoglobin subunit beta; HCC: hepatocellular carcinoma; HDAC: histone deacetylase;
HNRNPA2B1: Heterogeneous nuclear ribonucleoprotein A2B1; HSC: hematopoietic stem cell;
HSF1: heat shock transcription factor 1; IGF2BP1/2/3: Insulin-like growth factor 2 mRNA-binding proteins 1/2/3; IMWG: International Myeloma Group; KLF4: Kruppel-like factor 4; LSC: leukemia stem cell; m6A: N6-methyladenosine; mAbs: monoclonal antibodies; METTL3: methyltransferase-like protein 3; MGUS: monoclonal gammopathy of unknown significance; MM: Multiple myeloma; mSMART: Stratification for Myeloma and Risk-Adapted Therapy; MTX: methyltransferase complex;
ncRNA: non-coding RNA; NPC: normal plasma cell; OS: overall survival; PCL: plasma cell leukemia;
PER1: Period circadian protein 1; PFS: progression free survival; PI: Proteasome inhibitor; PPI: Protein-protein interaction; RBM15: RNA-binding motif protein 15; RNASE2: ribonuclease A family member 2; ROC: receiver operation characteristic; S100A12: S100 calcium binding protein A12; SD: standard deviation; SETD7: SET domain containing 7; SMM: smoldering multiple myeloma; WTAP: WT1-associated protein; YTHDC1: YT521-B homology domain containing 1; YTHDF1/2/3: YT521-B homology domain family proteins 1/2/3; ZC3H13: zinc finger CCCH-type containing 13.
Ethics approval and consent to participate
Not applicable.
Consent to publication
Not applicable.
Availability of data and materials
The datasets of this article were generated from the GEO database, STRING database, POSTAR database and ENCORI database.
Acknowledgments
None.
Competing interests
The authors declare that there is no conflict of interests.
Author Contribution
RL, DW and MZ processed data and performed the analyses. RL prepared all the tables and figures. RL, YS, XMW and ALH conceived, designed, and wrote the manuscript. YS, XMW, DW, MZ, JB and ALH revised the manuscript. All authors approved the final manuscript.
Funding:
This work was supported by Natural Science Foundation of Shaanxi Province (Grant/Award Number: No. S2020-JC-QN-1401).