Identification of MBF1 genes in the M. truncatula and M. sativa genome
Based on domain confirmation and homology search, a total of four putative MBF1s (MtMBF1a.1, MtMBF1a.2, MtMBF1b, MtMBF1c) from M. truncatula, and two putative MBF1s (MsMBF1a and MsMBF1c) from M. sativa, were identified, respectively (Table 1). The molecular property analysis revealed that most MBF1 proteins were between 110aa and 150aa, which is similar to the MBF1 profile of tomato [26]. Differently, the MtMBF1c and MsMBF1a proteins are shorter in length, 56aa and 74aa, respectively. The predicted pI values ranged from 8.6 (MtMBF1b) to 10.24 (MsMBF1c). The predicted molecular weights ranged from 6.18 kDa (MtMBF1c) to 15.6 kDa (MtMBF1a.1). The corresponding homologous MBF1 genes of M. sativa and M. truncatula were identified in parallel by sequence alignment. The predicted subcellular locations suggested that two proteins (Mt/MsMBF1c) were located in the nuclear, MtMBF1a.2 were located in the cytoplasmic, and the remaining three proteins (MtMBF1a.1, MtMBF1b and MsMBF1a) were located in extracellular.
Table 1 Properties of the predicted MBF1 proteins in M. truncatula and M.sativa
Gene number | Gene Name | TIGR Locus | Start site | End site | Homologous Gene | pI | MW(kDa) | Protein Length | Subcellular Localization |
MtMBF1-1 | MtMBF1a.1 | MtrunA17_Chr2g0320211 | 41480478 | 41484655 | MsMBF1a | 9.95 | 15.35 | 141 | Extracellular |
MtMBF1-2 | MtMBF1a.2 | MtrunA17_Chr4g0041301 | 38998265 | 39003781 | MsMBF1a | 9.91 | 15.6 | 143 | Cytoplasmic |
MtMBF1-3 | MtMBF1b | MtrunA17_Chr4g0057871 | 50949726 | 50950391 | MsMBF1a | 8.6 | 13.75 | 121 | Extracellular |
MtMBF1-4 | MtMBF1c | MtrunA17_Chr6g0485281 | 39789414 | 39792731 | MsMBF1c | 9.9 | 6.18 | 56 | Nuclear |
MsMBF1-1 | MsMBF1a | MsG0480021982.01.T01 | 66798008 | 66798612 | MtMBF1a.1 | 9.45 | 7.8 | 74 | Extracellular |
MsMBF1-2 | MsMBF1c | MsG0680035636.01.T01 | 109794752 | 109795081 | MtMBF1c | 10.24 | 11.87 | 110 | Nuclear |
Multiple sequence alignment, phylogenetic analysis and classification of MBF1 genes in Medicago
Sequence alignments of the multiple amino acids between MtMBF1s and MsMBF1s indicated that the sequences are highly conserved (Fig. 1): all the MBFs containing MBF1 domain in N-terminal (MtMBF1b was an incomplete domain) and HTH domain was present in the C-terminal (except MsMBF1a).
Sequence-based phylogenetic analysis among M. truncatula, M. sativa, Arabidopsis, O. sativa, Z. mays, V. unguiculata and B. rapa showed that these proteins were grouped into 3 distinct clades (a, b, c, Fig. 2). MBF1a and MBF1b were detected to have a closer relationship, which suggested that they might conduct more similar functions. However, MBF1c was displayed to be a relatively independent group (Fig. 2), which indicated that it might play some specifc roles related to the development process in plants.
Analyses of conserved motif and gene structure
In order to understand the structural characteristics of MBF1 genes of Medicago, the motifs and exons/introns structure analyses of the MBF1 family genes from the Medicago were conducted and the results are consistent with the phylogenetic analyses (Fig. 3). Ten preserved motifs were detected in these MBF1 proteins. Motif 1 and motif 2 were detected in all members’ MBF1 domain, where motif1 is MBF1 domain and motif 2 is HTH domain (Fig. 3B). Therefore, the similar motif distribution of the MBF1 proteins in Medicago may promote to the prediction of the functions of MBF1s.
The gene structures of exon numbers ranged from one to four, and the intron numbers ranged from zero to three for two species (Fig. 3C). MtMBF1a. 2 and MtMBF1c contain more UTR sequences, accounting for 55% and 46% of the gene sequences respectively, while MtMBF1b and MsMBF1c do not contain UTR, and differences in exon or intron length were also observed between orthologous members.
Analysis of chromosome location and collinearity of MBF1 genes
The chromosome location data of each MBF1 gene were download from the Phytozome database, based on which these MBF1 gene were anlysised by TBtools and mapped on their corresponding chromosomes (Fig. 4A, B). As a result, the distribution of MBF1 genes were not even in either M. truncatula or M. sativa. MtMBF1 were distributed on chromosome 2, 4, 6, and two genes (MtMBF1a.2 and MtMBF1b) are distributed on chromosome 4 (Fig. 4A). MsMBF1a and MsMBF1c were distributed on chromosome 4 and 6 respectively. (Fig. 4B). In addition, there were no segmental duplication and tandem duplication events in both M. truncatula and M. sativa.
Furthermore, three comparative syntenic maps of M. sativa and M. truncatula associated with the representative plant species Arabidopsis were constructed to illustrate the evolution relationship of MBF1 genes family (Fig. 4C). Two pairs of genes are covalently related between M. sativa and M. truncatula. MtMBF1a.2 was collinear with MsMBF1a, and MtMBF1c was collinear with MsMBF1c. Moreover, to better understand the evolutionary selection pressure during the formation of MBF1 gene families, the non-synonymous/synonymous substitution (Ka/Ks) values of MBF1 gene pairs were analyzed between M. sativa and M. truncatula. Generally, Ka/Ks > 1, Ka/Ks = 1, and Ka/Ks < 1 indicate positive selection, neutral evolution, and purifying selection, respectively [27]. The ratios of two (MtMBF1a.2-MsMBF1a, MtMBF1c-MsMBF1c) homologous pairs were even smaller than 0.2, suggesting that these genes have undergone purifying selection after segmental and whole genome duplications.
Analysis of cis-acting element of MBF1 genes
To further determine the molecular functions and expression pattern of the MBF1 family, we focused on hormones, abiotic stress and growth & development related cis-acting elements (Fig. 5 and Addition file 1). In phytohormone pathways, most of the MBF1 genes were detected to have more cis-element involved in Methyl jasmonate (MeJA) and ethylene responsiveness (Fig. 5A). In addition, a series of cis-acting elements were identifed with abiotic stress. All MBF1 genes were mainly related to drought and light responsiveness. Combining hormone and abiotic stress cis-elements, it was concluded that MBF1 gene family had strong response to external environmental stimuli. Furthermore, MBF1 gene family contains a large number of promoter enhancement regions and transcription initiation elements, which may play a key role in the rapid response of plants to external stimuli. Notably, an exclusive cis-element associated with circadian control was obtained in MtMBF1a.2 and MsMBF1a, suggests that MBF1 genes may play a role in regulating plant cyclic rhythms.
Expression profiles of MBF1 genes in different tissues and validation by qRT-PCR
We investigated the expression patterns of MBF1s in various tissues of M. truncatula with genechip dataset from the MtGEA web server, including roots, stems, petioles, leaves, flowers, buds, pods, seeds and seeds coat (Fig. 6A, Additional file 2). MtMBF1a.1 and MtMBF1a.2 showed relatively high expression level in these tissues, MtMBF1b and MtMBF1c were expressed at relatively low level in different tissues (Fig. 6A). As for M. sativa, six tissues of different type were analyzed based on transcriptome data, including roots, elonged-stems, pre-elonged-stems, leaves, flowers and nodules (Fig. 6C, Additional file 3). Among them, there was no significant difference in the expression of MsMBF1a in different tissues, but the expression of MsMBF1c in leaves and flowers was significantly higher than that in roots, stems and nodules (Fig. 6C).
To understand the potential function of the Medicago MBF1 genes, the expression pattern were examined using qRT-PCR in different organs, including the roots, stems, leaves and flowers (Fig. 6B, D, Additional file 2, 3). All of the MBF1 genes were detected in these four tissues (Fig. 6B, D). Remarkably, the expression levels of all genes in leaves were significantly higher than those in other tissues, followed by flowers in M. truncatula and roots in M. sativa.
Expression profile of MBF1 gene in Medicago under stress
Expression profiles of MBF1 genes from M. truncatula were initially analyzed based on the genechip data, including samples from roots and shoots under drought treatment, and roots under vitro culture salinity and under hydroponic salinity (Additional File 2). As for M. sativa, transcriptome data for samples from roots and shoots under NaCl and drought treatment were also analyzed (Additional File 3).
Under NaCl conditions, the expression changes of four genes showed the similar trend under two different NaCl treatments (Fig. 7A). Among them, MtMBF1a.1 and MtMBF1a.2 showed relatively high expression level, MtMBF1b and MtMBF1c were expressed at relatively low level (Fig. 7A), this is consistent with the expression of genes in tissues (Fig. 6A). MtMBF1a showed an increasing and then decreasing trend under vitro culture salinity conditions, while it showed a decreasing and then increasing trend under hydroponics salinity conditions. Notably, under drought conditions, the expression trends of these genes were consistent in both roots and stems, and both gradually increased with increasing stress time. (Fig. 7B). Contrary to roots drought treatment, the expression of MtMBF1a.2 and MtMBF1b showed a downward trend after rewatering in shoots drought treatment, (Fig. 7B). This may be related to the special response mechanism of MBF1 genes to abiotic stress. In M. sativa, the expression of both members (MsMBF1a and MsMBF1c) showed an increasing trend under NaCl and drought stress treatments, with MsMBF1a showing a decrease in expression at 24 h of drought.
Validation of the expression profile of stress-responsive MBF1 gene by qRT-PCR
In order to verify the expression profiles of MBF1 genes from genechip data and microarray data, seedlings were treated with NaCl, mannitol, hot and cold at 1h, 3h, 6h, 12h, 24h, 48h and used for qRT-PCR analysis (Fig. 8, Additional File 4). As expected, most of the MBF1 genes responded to different stress treatments. For example, MtMBF1b tended to increase and then decrease under all stress treatments and peaked at 3-6h (Fig. 8). The expression level of MtMBF1a.1 decreased obviously under NaCl treatment, which was consistent with the genechip data (Fig. 7). The expression level of MtMBF1c increased significantly under NaCl treatment, but remained unchanged under drought stress (Fig. 8). In the face of heat stress, MtMBF1b was induced during the early stage, the other genes have no obvious changes. Notably, MsMBF1a and MsMBF1c showed extremely low levels of expression at all times in response to NaCl and drought treatments.