3.1. Genome-Wide Identification of m6A Regulatory Genes in Phaseolus vulgaris
To identify m6A components and their protein families in common bean, the amino acid sequences of m6A related proteins reported in Arabidopsis thaliana, including writers, erasers, and readers, were used as queries to perform BLASTP against the bean genomic sequences in TBtools. Following the removal of repeated sequences, 36 putative candidates were retained with their gene identifications.3 of the 36 candidates did not have the 2OG_Fe(II)_Oxy domain (PF13532), and 2 of the 36 candidates did not have the typical YTH domain (PF04146), which were all eventually removed. The remaining 31 candidate genes were renamed based on our subsequent evolutionary analysis. The identified all genes and transcripts ID of common bean were shown in Supplementary Table S1. The amino acid sequence length, relative molecular weights (MWs), and isoelectric points (PIs) of each amino acid were listed in Table 1. In detail, the lengths of the listed proteins ranged from 224 (PvALKBH7) to 2188 (PvVIR) amino acids, and the corresponding range for MWs was 25.44-240.79 KDa. The predicted pI values ranged from 5.25 (PvECT5) to 9.51 (PvECT11). Among these genes, PvFIP37, PvHAKAI and PvVIR, three putative catalytic subunits of the m6A methyltransferase complex, only had one copy in the common bean genome, respectively. However, the MT-A70, ALKBH, and YTH domain protein families consisted of multiple members.
Table 1
m6A genes detected in the P.vulgaris genome.
Gene name
|
Locus ID
|
Group
|
Chr
|
Protein property
|
Length(aa)
|
PI
|
MW (kDa)
|
PvMTA
|
Phvul.010G102500.1
|
MT-A70
|
10
|
761
|
6.01
|
84.32
|
PvMTB
|
Phvul.007G073300.1
|
7
|
1086
|
6.86
|
120.67
|
PvMTC
|
Phvul.001G016200.1
|
1
|
427
|
8.04
|
48.64
|
PvALKBH1A
|
Phvul.001G262100.1
|
ALKBH
|
1
|
358
|
6.11
|
40.36
|
PvALKBH1B
|
Phvul.009G262600.1
|
9
|
344
|
9.47
|
37.40
|
PvALKBH1C
|
Phvul.001G131400.1
|
1
|
443
|
8.89
|
49.26
|
PvALKBH2A
|
Phvul.006G137400.1
|
6
|
235
|
9.21
|
27.06
|
PvALKBH2B
|
Phvul.006G137611.1
|
6
|
235
|
9.21
|
27.02
|
PvALKBH6
|
Phvul.004G131600.1
|
4
|
259
|
5.79
|
29.74
|
PvALKBH7
|
Phvul.008G264300.1
|
8
|
224
|
6.36
|
25.44
|
PvALKBH8
|
Phvul.002G123600.1
|
2
|
342
|
7.59
|
38.29
|
PvALKBH9A
|
Phvul.001G044000.2
|
1
|
476
|
5.92
|
53.60
|
PvALKBH9B
|
Phvul.006G214800.1
|
6
|
425
|
8.83
|
48.48
|
PvALKBH10A
|
Phvul.001G147800.1
|
1
|
516
|
5.89
|
56.99
|
PvALKBH10B
|
Phvul.007G168900.1
|
7
|
505
|
6.44
|
55.72
|
PvALKBH10C1
|
Phvul.002G181800.1
|
2
|
671
|
6.36
|
73.15
|
PvALKBH10C2
|
Phvul.003G014200.1
|
3
|
691
|
6.18
|
74.54
|
PvECT1
|
Phvul.001G110200.1
|
YTH
|
1
|
658
|
7.96
|
72.22
|
PvECT2
|
Phvul.002G247000.1
|
2
|
677
|
6.02
|
74.03
|
PvECT3
|
Phvul.004G080300.1
|
4
|
638
|
5.96
|
70.15
|
PvECT5
|
Phvul.010G165400.1
|
10
|
634
|
5.25
|
69.57
|
PvECT6
|
Phvul.006G121600.1
|
6
|
649
|
5.84
|
71.55
|
PvECT7
|
Phvul.003G119300.1
|
3
|
698
|
7.59
|
77.24
|
PvECT8A
|
Phvul.005G045600.1
|
5
|
575
|
6.76
|
63.27
|
PvECT8B
|
Phvul.004G132700.1
|
4
|
231
|
9.34
|
26.62
|
PvECT11
|
Phvul.006G218800.1
|
6
|
557
|
9.51
|
62.58
|
PvECT12
|
Phvul.002G152600.1
|
2
|
379
|
5.38
|
42.68
|
PvCPSF30
|
Phvul.006G130200.1
|
6
|
697
|
6.28
|
76.46
|
PvFIP37
|
Phvul.002G107400.1
|
|
2
|
337
|
5.41
|
38.15
|
PvVIR
|
Phvul.007G267500.1
|
|
7
|
2188
|
5.35
|
240.79
|
PvHAKAI
|
Phvul.008G108800.1
|
|
8
|
436
|
6.16
|
47.61
|
3.2. Chromosomal Location and Collinearity Analysis of m6A Regulatory genes in common bean
All 31 genes were distributed on ten chromosomes in the common bean (a total of 11 chromosomes), and they were primarily located at the proximal or distal ends of the chromosomes. The MT-A70, ALKBH, and YTH family genes were highlighted in different colors (Fig. 1A). Among these genes, PvALKBH2A and PvALKBH2B were adjacent on chr06, however, no tandem duplication event was identified. Gene family expansion in common bean driven by duplication was searched using genome-wide synteny analysis, and three gene pairs, PvALKBH10A- PvALKBH10B, PvALKBH10C1-PvALKBH10C2 and PvECT6-PvECT7 were identified as segmental duplication/whole genome duplication (WGD) (Fig. 1B). Therefore, segmental duplication/WGD event appeared to be the primary factor involved in expanding the common bean m6A gene family. Ka/Ks calculation was used to evaluate the extent and type of selective pressure of each gene pair. All three gene pairs were under purifying selection, and the earliest differentiation (PvALKBH10A-PvALKBH10B) occurred 28.42 million years ago (Table 2). To further investigate the phylogenetic mechanisms of bean m6A components and their protein families, a synteny analysis between common bean and Arabidopsis was constructed. Eighteen orthologous pairs consisting of 14 common bean genes and 13 Arabidopsis genes were identified (Fig. 1C), indicating these orthologous pairs' existence before the divergence of Arabidopsis and common bean.
Table 2
༎The evolutionary analysis of duplicated m6A regulatory genes and estimated divergence times.
GeneⅠ
|
Location
|
GeneⅡ
|
Location
|
Type of duplication
|
Ka
|
Ks
|
Ka/Ks
|
T = Ks/2r(MYA)
|
PvALKBH10A
|
1
|
PvALKBH10B
|
7
|
Segmental/WGD
|
0.126348392
|
0.852865956
|
0.148145662
|
28.42
|
PvECT6
|
6
|
PvECT7
|
3
|
Segmental/WGD
|
0.174929236
|
0.683184232
|
0.256049874
|
22.77
|
PvALKBH10C1
|
2
|
PvALKBH10C2
|
3
|
Segmental/WGD
|
0.189372849
|
0.669639206
|
0.28279833
|
22.32
|
3.3. Evolutionary and functional key residue Analyses of m6A components of common bean
To analyze the evolutionary relationship among bean MT-A70 family proteins, a phylogenetic tree was constructed using common bean, Arabidopsis MT-A70 sequences, and human reference sequences (Table S2). Phylogenetic analysis suggested that the MT- A70 proteins of common bean could be divided into three clades: METTL3 subfamily (PvMTA), METTL14 subfamily (PvMTB), and METTL4 subfamily (PvMTC) (Fig. 2A). Compared to Arabidopsis and humans, no orthologs were found in MT- A70 proteins of common bean, suggesting little functional diversity or redundancy. A further multiple sequence alignment showed that many functional sites, including residues involved in AdoMet interactions and RNA binding, were conserved (Fig. 2B), indicating that a core heterodimer catalyzing mechanism might be similar among common bean, human, and Arabidopsis.
To analyze the evolutionary relationship among ALKBH family proteins in common bean, a phylogenetic tree was constructed using common bean, Arabidopsis ALKBH sequences, and human reference sequences (Table S2). ALKBH proteins of common bean could be divided into seven subfamilies (Fig. 3A). In bean and Arabidopsis, ALKBH9 and ALKBH10 subfamily proteins were orthologs of the m6A methylase HsALKBH5. Further multiple sequence alignment of HsALKBH5 and ALKBH9 subfamily proteins showed that many functional sites, including residues involved in 2OG and metal-binding, were conserved (Fig. 3B), suggesting that PvALKBH9 subfamily proteins may have a similar catalyzing mechanism for methyl group removal as human HsALKBH5.
To analyze the evolutionary relationship among YTH family proteins in common bean, a phylogenetic tree was constructed using the reference sequences of common bean, Arabidopsis YTH sequences, and humans (Table S2). The YTH proteins of common bean could be divided into YTHDF and YTHDC subfamilies, and the YTHDC subfamily comprised two subclades: PvYTHDC and PvCPSF30 (Fig. 4A). Similar to Arabidopsis, two YTH proteins belonged to the YTHDC subfamily in the common bean (also two in Arabidopsis). In contrast, fewer belonged to the YTHDF subfamily (nine in bean and 11 in Arabidopsis). Moreover, two copies of AtECT8 orthologs were found in common beans, suggesting functional diversity or redundancy. Additional multiple sequence alignment of YTHDF and YTHDC subfamily proteins displayed that many functional sites, including residues involved in the aromatic cage, contact with m6A, and RNA binding, were conserved, suggesting that PvYTHDFs might share a similar m6A binding structure and have a similar read mechanism to human and Arabidopsis YTHDF proteins. (Fig. 4B). Additionally, multiple sequence alignment of YTHDC subfamily proteins also displayed a conserved aromatic cage, suggesting the ability of the m6A read mechanism (Figure S1).
3.4. Phylogenetic and gene structure Analyses of m6A components of common bean
To further explore the structure and sequence characteristics of MT-A70 family genes in common beans, a maximum likelihood phylogenetic tree (Fig. 5A) was constructed using full-length amino acid sequences to classify visualized analyses. In the conserved motif analysis, compared to PvMTA and PvMTB, PvMTC lacked some motifs (3,4,8,9), whereas PvMTA and PvMTB basically had the same motifs of the same arrangement (Fig. 5A). Meanwhile, except for the MT-A70 protein domain (CDD: pfam05063), or MT-A70 superfamily domain (CDD: pfam05063), PvMTB had extra PRK12678 superfamily (CDD: PRK12678) and Med15 superfamily (CDD: pfam09606) domains (Fig. 5A), indicating that PvMTB may participate in different regulatory pathways. A number of studies have shown that structural similarity and diversity play an important role in gene family evolution. Using the TBtools, we generated an exon-intron map for the MT-A70 gene family. Figure 5A showed a detailed representation of the gene's coding region, introns and untranslated regions. A key component of eukaryotic genomes, introns are actively involved in genetic recombination that results in gene rearrangements and evolution of the genome. The MT-A70 family genes have 5 to 8 introns.
To further explore the structure and sequence characteristics of ALKBH genes in common beans, a maximum likelihood phylogenetic tree (Fig. 5B) was constructed using full-length amino acid sequences to classify visualized analyses. In the conserved motif analysis, the PvALKBH9 and PvALKBH10 subfamilies had the same motifs 1–3,5,8 and motif 10 with the same arrangement, and PvALKBH10 subfamily had two additional motifs, 4 and 7. The PvALKBH2 subfamily had two different motifs 6 and 9. PvALKBH8 had three motifs 1, 4 and 8.PvALKBH6 had three motifs and possessed a extra motif 1 than PvALKBH7 (Fig. 5B).All of the above suggested a potential loss of function or functional differentiation of these proteins. Among the PvALKBH proteins, all had a conserved domain of either 2OG- Fe(II)_Oxy, 2OG- Fe(II)_Oxy2, or 2OG-Fe(II)_Oxy superfamily. PvALKBH10C1 and PvALKBH10C2 had a different domain of PHA03378, PvALKBH8 had an extra domain of RRM_SF superfamily and PvALKBH10B had a special domain of MSCRAMM_ClfB superfamily (Fig. 5B). PvALKBH genes displayed relatively different gene structures, including the number and position of exons and introns (Fig. 5B). The ALKBH family genes have 1 to 8 introns.
To further explore the structure and sequence characteristics of YTH genes in common beans, a maximum likelihood phylogenetic tree (Fig. 5C) was constructed using full-length amino acid sequences to classify visualized analyses. Most PvYTHDF subclade proteins exhibited the conserved motifs 1–7, except PvECT11 lacked motifs 5–6, and PvECT8B lacked motif 1 and motif 6. PvECT1-3 and PvECT5 had an extra conserved motif 8, whereas PvECT8A had an extra motif 10, and PvECT6-7 had two additional motifs 9 and 10. PvYTHDC subclade proteins all had conserved motifs 2–3, whereas PvCPSF30 had an extra motif 5. Together, these results indicated that the YTH family proteins were highly conserved in common beans and that there was a slight evolutionary divergency in the subfamily or subclade. All PvYTHs had a typical YTH conserved domain (CDD: pfam04146) of similar length. Additionally, PvCPSF30 had the YTH1 superfamily domain (CDD: COG5084). PvYTH genes clustered in the same clade shared a similar gene structure, including the number and position of exons and introns (Fig. 5C).
3.5. Cis-acting element Analyses of MT-A70 Family of common bean
To better study the responsive regulation and potential biological functions of m6A-related genes from common bean plants, the PlantCare database was used to analyze the cis-acting elements in 31 m6A regulatory gene promoter regions (Fig. 6). In m6A regulatory components, light responsive, and phytohormone responsive and stress responsive elements are more common than plant growth and development related elements.
In the light responsive category, Box 4 was identified in all MT-A70, ALKBH, and YTH family members except PvCPSF30 and encompassed the highest proportion (39.4%,32.6%,36.4%), followed by the G-box (10.6%,22.3%,15.5%). The most prevalent cis-element in the phytohormone group was ABRE, which was responsible for abscisic acid responsiveness, followed by CGTCA and TGACG motifs associated with methyl jasmonic acid (MeJA) responsiveness. Several stress-responsive elements were identified in the promoter regions of m6A regulatory genes. The most prevalent cis-element in this group was ARE in response to anaerobic induction. GC motif in response to anoxic specific inducibility was only present in the promotor region of PvCPSF30. The plant growth and development response related cis-elements like CAT-box, circadian, GCN4 motif, HD-Zip 1, O2-site and RY-element were revealed (Fig. 7), which are linked with meristem expression, circadian control, endosperm expression, differentiation of palisade mesophyll cells, zein metabolism regulation and seed-specific regulation. More plant growth and development responsive elements were identified in the promoter regions of ALKBH family members than in the upstream region of other m6A regulatory components.
3.6. Go annotation and putative protein-protein interaction Analyses of MT-A70 Family of common bean
To predict the localization and potential functions of m6A regulatory protein of common bean, subcellular localization prediction and Go ontology annotation analysis was conducted. A total of 31 proteins were categorized into 55 functional groups, and they were divided into four categories: Biological process, Molecular function, Cellular component, and Subcellular localization (Fig. 8). About the molecular function, we found that approximately 18.99% of annotated proteins function in the activity of binding, followed by nucleic acid binding (11.39%), RNA-binding (10.76%) and metal ion binding (9.49%). In the biological process, m6A members involved in the nucleic acid metabolic processes (8.96%) and RNA metabolic processes (7.96%), macromolecule modification (7.46%), regulation of gene expression (6.47%), followed by negative regulation of biological process (5.97%), RNA modification (5.97%), positive regulation of cellular process (5.47%) and negative regulation of gene expression (5.47%). Subsequently, predicted m6A proteins are also annotated with regulation of mRNA metabolic process (5.47%), negative regulation of cellular process (4.98%), mRNA metabolic process (4.48%), mRNA destabilization (4.48%), methylation, macromolecule methylation, RNA processing and RNA methylation (< 4%) in biological process annotation. In the cellular component, the m6A proteins were annotated with the cellular anatomical entity and intracellular (both 19.35%), intracellular organelles (15.48%), cytoplasm and nucleus (both 14.84%). Furthermore, in terms of subcellular localization, our results demonstrated that the proportion of m6A proteins located in nuclear was approximately 89.66%, whereas the extracellular, cytoplasmic, and plasma membranes contained the same amount (3.45%).
A total of 31 m6A regulatory proteins including writers, erasers, and readers were aligned to the thirty STRING proteins, which were used to construct the interaction networks. All m6A writer components, with the exception of PvMTC, had significant connections, suggesting that they might have contributed in m6A modification by building protein complexes. In addition, we found that putative homologs of ALKBH1A and ALKBH2A were frequently mentioned together in other organisms, which could be explained by their comparable functions in removing m6A modification and lowering the overall m6A level (Fig. 9).
3.7. Transcriptomic analysis
To discover the potential functions of m6A pathway genes in response to viral infection, the expression profiles of 31 genes in true primary leaf and systemic leaf were obtained from our transcriptome datasets (Fig. 10). Among six m6A writer genes, we found that almost all of the m6A regulatory genes displayed high expression in control and treatment (FPKM > 10). On the other hand, PvMTC expressed at moderate levels (FPKM > 1 and FPKM < 10) in eight groups of samples. Among the ALKBH family genes, PvALKBH1B and PvALKBH1C showed extremely low expression levels(FPKM < 1), suggesting that they were nonfunctional or had temporal and spatial-specific expression patterns. In the resistant cultivars, the expression of PvALKBH9B increased significantly after viral infection. In the sensitive varieties, the expression level of PvALKBH10A in the local leaves decreased significantly after viral infection.PvECT2 showed predominant expression among all 31 genes in all groups of samples(FPKM> 100), and its expression level in RGLC primary true leaf was significantly higher than that in systemic leaf, which could also be found in PvECT8A.Besides, in the YTH family, only PvECT8B was basically not expressed(FPKM < 1). Cluster analysis of all samples showed that the samples from different parts of the susceptible cultivar infected by the virus were clustered together, while the healthy control of the resistant cultivar were clustered together with the samples from the same parts treated by the BCMV.
3.8. Expression analysis
It has shown that m6A modification is involved in plant response to various biotic stresses. In this study, bean common mosaic virus was used as a biotic stress factor to study the reaction of m6A regulatory genes in healthy and treated plants at 2/4/8/16/32/64h and 5/7/14/21 d after viral inoculation (Fig. 11). The expression patterns of 28 genes were diverse. The components of m6A writer complex (PvMTA, PvMTB, PvHAKAI and PvVIR) exhibited generally the same trend of expression, and these genes in virus-treated plants were basically upregulated compared with healthy controls before 16 hours, and significantly downregulated at 64 h and 5 days. In the ALKBH family, except for PvALKBH9B and PvALKBH10C1/C2, all other members were highly expressed in the early stages of viral infection (2–4 hours). The ALKBH1 subfamily was significantly upregulated at 32/64 h and then downregulated at 14/21d. PvALKBH9A was upregulated for one day after viral stimulation, after that its expression was not substantially different from that of the control. The expression of PvALKBH10A and PvALKBH10B fluctuated after viral infection, but the changing pattern of the two was quite similar. The expression of PvALKBH10C1-2 was enhanced in the early stage of BCMV infection, and basically downregulated in the later stage, but they were significantly upregulated at 21 days. Compared with other genes, PvALKBH9B showed a unique expression trend. It was slightly downregulated within 8 hours after viral inoculation, and then this gene was continuously upregulated, with the highest expression observed at 7 dpi. With the exception of ECT5, all identified reading proteins were downregulated at 5d. PvECT1 was significantly upregulated within the first four hours of viral inoculation, with little change in most later periods. The expression profile of PvECT2 is relatively unique, exhibited sustained descending expression at all time points. The expression of PvECT3/5/8B fluctuated slightly in the early stage of viral infection and was basically downregulated after 5 days. The expression of PvECT6 was significantly downregulated in 2 h after viral infection and was significantly upregulated at 21 days. The expression of PvECT11 and PvCPSF30 increased within four hours of viral infection and was downregulated in the later stage. The expression variation of genes involved in m6A modification in common beans were complex and diverse, which may be closely related to their biological functions when responding to viral infection in plants.