Database mining identifies EsMYB90, a candidate regulator for anthocyanin synthesis
Eutrema Salsugineum is a stress-tolerance halophyte, which produces purple flower buds after vernalization [52]. Since MYB genes are required for anthocyanin synthesis [13,15,36] we identified which MYB controls this purple phenotype in E. salsugineum. After comparing MYB genes obtained from the transcriptome of E. salsugineum based on our previously published results[46], with 72 MYB genes known acting as proanthocyanin (PA) and anthocyanin regulators in other plants, we found one candidate MYB gene, named as EsMYB90.
To determine the relationship of EsMYB90 to characterized flavonoid and PA MYBs, we performed similarity analysis at the protein level. Our results showed that EsMYB90 has 80.5%, 78.9%, 78.4%, 74.4%, 69.4%, 65.9%, 50% identities respectively to 7 MYB proteins, i.e. BoMYB1, AtMYB90, BrMYB114, AtMYB75, CrMYB114, AtMYB113, and AtMYB114 (Fig. 1A). In addition, similarities between EsMYB90 and other 10 MYB proteins range from 44.1% to 39.0% (Fig. 1A). Those MYB proteins with high similarities to EsMYB90 belong to the class of R2R3-MYB,which have a conserved DNA-binding domain (R2 and R3 repeats) in the N-terminal and a variable C-terminal region [41,53,54]. The ANDV motif (marked by red A box in Fig.1A), a characteristic identifier for anthocyanin-promoting MYBs in dicots [10], existed in EsMYB90, AtMYB90, AtMYB75, AtMYB113, AtMYB114, AmROSEA1, StMYB113, EsMYBA1(AGT39060), VvMYBA1, MrMYB1, and FaMYB10, while the C-terminal-conserved motif KPRPR [S/T]F for Arabidopsis anthocyanin-promoting MYBs [25,36] (marked by blue B box in Fig.1A), was only found in EsMYB90, AtMYB75, AtMYB90, AtMYB113, and MrMYB1. Moreover, EsMyb90 has a conserved [D/E]Lx2[R/K]x3Lx6Lx3R motif (marked by black arrows in Fig.1A), which is required for interaction with R/B-like bHLH proteins [10,16].
To further identify the relationship of EsMYB90 to other MYB proteins, we generated a phylogenetic tree with 29 MYB proteins involved in anthocyanin synthesis in 16 plants. Our results demonstrated that EsMYB90 was clustered in the clade I (Fig. 1B), which consists of AtMYB75, AtMYB90, AtMYB113, AtMYB114, BoMYB1 and BrMYB114 that are important for anthocyanin accumulation [17,19,25,26,27,55]. EsMYB90 has a relatively farer phylogenetic relationship to MYB proteins in clades II, III, and IV, although those MYBs promote biosynthesis of PA and anthocyanin, except for EsMYB5(XP_006407201) have no research reports yet.
In summary, our results suggest that EsMYB90 is a R2R3-MYB, which may function in proanthocyanin and anthocyanin synthesis.
Expression pattern of EsMYB90 in E.salsugineum and subcellular localization of the protein
In order to detect the expression pattern of EsMYB90, we collected leaves, petioles, stems, roots and flowers of E.salsugineum, and performed qRT-PCR. Our result showed that EsMYB90 was expressed in all examined tissues of E.salsugineum, among which it has the highest expression level in petiole (Fig. 2B), followed by stems and flowers (Fig. 2C,D), but a relative lower expression in leaves and roots (Fig. 2A,E). This result is consistent with the color phenotype of different tissues observed, suggesting the expression of EsMYB90 is related to the synthesis of anthocyanins (Fig. 2A-F).
To test the subcellular localization of EsMYB90, we examined the transient expression of YFP-EsMYB90 fusion protein in onion epidermal cells. Our results showed that YFP signals were observed in both cytoplasm and nucleus of the onion epidermal cells expressing 35S:YFP (Fig. 2G-I), while the YFP signal was only detected in the nucleus in cells expressing 35S:YFP-EsMYB90 (Fig. 2J-L). Our result showed EsMYB90 is localized to the nucleus, suggesting that as other MYB proteins EsMYB90 also functions as a transcriptional regulator.
Ectopic expression of EsMYB90 promotes anthocyanin accumulation in tobacco and Arabidopsis
To investigate the possible function of EsMYB90 in anthocyanin biosynthesis, we generated the 35S:EsMYB90 construct to ectopically expression EsMYB90 in tobacco and Arabidopsis (Additional file 1: Fig.1A). Eighteen 35S:EsMYB90 transgenic tobacco and 15 35S:EsMYB90 transgenic Arabidopsis plants were obtained, respectively (Additional file 1: Fig. 1B,C).
We found that in all developing stages, leaves and stems in 35S:EsMYB90 tobacco plants appeared purple-red, and the color became deepened with development (Fig. 3A-C). In addition, 35S:EsMYB90 tobacco plants produced purple-red corollas, purple-black sepals, and purple-black fruit pods, whereas wild-type corollas were pink, with green sepals and fruit pods (Fig. 3D-F). Our results from examining anthocyanin production showed that the total anthocyanin contents in three 35S:EsMYB90 tobacco lines were significantly increased in stems, young leaves (YL), mature leaves (ML), flowers, fruit pods, and mature seeds, compared with the wild type (Fig. 3G). Among L1, L2 and L4 three tested lines, the L4 transgenic line had the highest anthocyanin contents. Compared with the wild type, the total anthocyanin contents in young leaves (YL), mature leaves (ML), stems, flowers, fruits pods, and mature seeds of the L4 line were increased 95.2, 45.7, 48.8, 4.9, 17.8, and 2.6 folds, respectively (Fig.3G). These results indicate that the enhanced pigmentation in 35S:EsMYB90 tobacco plants was caused by the increased synthesis of anthocyanins.
We observed similar phenotypes in 35S:EsMYB90 Arabidopsis transgenic plants. In comparison to the wild-type plants, the color of leaves, roots, stems, flowers, fruit pods, and seeds became light-purple to dark-purple in 35S:EsMYB90 Arabidopsis plants (Fig. 4A-G). In particularly, seeds from 35S:EsMYB90 Arabidopsis plants exhibited black color (Fig. 4H,I). Furthermore, the contents of anthocyanins in the roots, stems, leaves, flowers, and fruit pods at the bolting stage, and the mature seeds from three 35S:EsMYB90 Arabidopsis transgenic lines (L1, L2, and L3) were significantly higher than that in wild-type plants (Fig. 4J).
Collectively, our results suggest that EsMYB90 functions as a transcription factor to promote anthocyanin biosynthesis in plants.
Transcriptomic analyses show that EsMYB90 is a key regulator in the proanthocyanidin and anthocyanin pathway
To examine the molecular mechanisms by which EsMYB90 controls anthocyanin biosynthesis in the genome wide, we performed RNA-seq analysis using the leaves from wild-type and 35S:EsMYB90 tobacco transgenic plants. We identified 51,202 differentially expressed genes (DEGs) in the comparison of wild-type plant with 35S:EsMYB90 transgenic tobacco plants, among which 2,446 DEGs have log2 Fold Change ≥1 or ≦-1 and a Padj ≤ 0.05 (Additional file 2). Furthermore, 1199 out of 2446 DEGs were up-regulated, while 1247 genes were down-regulated(Additional file 2). Moreover, 476 unique DEGs were annotated into 43 GO terms, wherein the GO terms with the top 3 of the number of DEGs encoding the binding (249 genes), catalytic activities (237 genes) and metabolic processes (236 genes) (Fig. 5A, Additional file 3).
We revealed that among 2446 DEGs, 1023 unique genes were annotated into the 128 KEGG pathways (Additional file 4). The most prominent KEGG-enriched genes are involved in secondary metabolite biosyntheses, followed by plant hormone signaling, flavonoid biosynthesis (Additional file 1: Fig. 2). According to the q value of DEGs, the top 20 of enrichment paths included the flavonoid biosynthesis (ko00941), anthocyanin biosynthesis (ko00942),flavone and flavonol biosynthesis (ko00944), plant circadian rhythm (ko04712),and glutathione metabolism (ko00480;Fig.5B, Additional file 5). Moreover, the anthocyanin biosynthesis pathway has the largest enrichment factor, followed by the pathway of flavonoid biosynthesis. The flavone and flavonol biosynthesis pathways also has a large enrichment factor (Fig.5B, Additional file 5).
Mapping to the KEGG reference pathways found that a total of 57 significantly differential expression genes were assigned to five secondary metabolic pathways, i.e. phenylpropanoid biosynthesis (ko00940), flavonoid biosynthesis (ko00941), anthocyanin biosynthesis (ko00942), isoflavonoid biosynthesis (ko00943), and flavone and flavonol biosynthesis pathways (ko00944). The gene names, gene ID, and the combined functional annotations were seen in Additional file 6. Out of 57 genes, 42 genes encode PA and anthocyanin biosynthesis enzymes, such as PAL (107802063, 107761482 and 107820497), CHS (107826422, 107801774 and 107813613), CHI (107779699, 107810515 and 107825576), F3H (107770893, 107806462), DFR (107803097,107797232), and ANS/ LDOX (107819370, 107778118, 107787193, 107787195 and 107808500). Particularly, six genes encoding UFGT (107781346, 107822886, 107781522, 107831042, 107767212 and 107819220) in the anthocyanin biosynthesis pathway (ko00942) were all strongly up-regulated (Fig. 6, Additional file 6). However, in ko00940-ko00944 pathways, only 15 genes including that encoding flavonol synthase/flavanone 3-hydroxylase (107794305, 107814657), trans-resveratrol di-O-methyltransferase-like (107785995, 107797481), and flavone 3'-O-methyltransferase 1-like (107792977), were down-regulated (Fig. 6, Additional file 6).
Taken together, our RNA-seq results demonstrated that identified DEGs are significantly enriched in the flavonoid and anthocyanin synthesis pathway (ko00941-ko00944), suggesting that EsMYB90 play an important regulatory role in proanthocyanidin and anthocyanin synthesis.
Validation of RNA-seq results by qRT-PCR
To validate the RNA-seq results, we performed quantitative reverse transcription PCR (qRT-PCR) for 18 genes which are assigned to 5 groups related to anthocyanin biosynthesis, antioxidant production, signal transduction, transcription regulation, and ion channel in tobacco (Additional file 7). Our results showed that expression level changes of 5 anthocyanin biosynthesis genes [NtDFR (107803097), NtLDOX54 (107778118), Nt3GT12 (107781346), Nt3GT36 (107781522), and Nt3GT53 (107831042)] detected by qRT-PCR were in agreement with the RNA-seq data (Fig.7A). We obtained similar qRT-PCR results from examining expression of 4 antioxidant-related genes [NtP450 (107772738), NtCu-ZnSOD (107806960), NtPOD44-1 (107827231), and NtPOD44-2 (107797651); Fig.7B], 4 genes encoding transcription factors [NtbZIP (107795590), NtMYB3R-1 (107795213), NtMYB4 (107802984), and NtWRKY53 (107825953); Fig.7C], NtAKT2/3(107761230) encoding a potassium channel protein (Fig.7C), and 4 genes related to signal transduction and ion channel [NtMAPK3 (107782983), NtMAPK6 (107806359), NtAX15A (107805986), and NtCaM1 (107803626); Fig.7D]. We found similar differential expression patterns for the DEGs in the qRT-PCR and RNA-seq data, with a lower pearson’s coefficient (R2) as 0.9232. Therefore, qRT-PCR results support that our transcriptome results are reliable.
EsMYB90 promotes expression of anthocyanin biosynthetic genes in tobacco
To further elucidate the molecular function of EsMYB90 in proanthocyanin and anthocyanin biosynthesis, we examined expression of key anthocyanin biosynthesis genes PAL, CHS, CHI, F3H, F3´H, DFR, ANS/LDOX, and UFGT in the stems, young leaves(YL) and flowers from 35S:EsMYB90 tobacco transgenic lines (L2, L4) and wild-type tobacco plants at the flowering stage by qRT-PCR.
PAL is the first key enzyme in the metabolic pathway of phenylpropanoid [9]. Expression levels of NtPAL in stems, leaves and flowers from the L4 line increased 4.6, 6.1, and 6.3 times, respectively, than that of wild type (Fig. 8A). CHS catalyzes the first step of anthocyanin biosynthesis, while CHI catalyzes the cyclization of chalcone molecules to form naringenin [8]. Expression levels of both NtCHS and NtCHI in stems, leaves and flowers from the L4 line were significantly increased compared to that in the wild type (Fig. 8B,C). Whereas, the relative transcript level of NtF3H in the flowers was slightly down-regulated in L4 transgenic line(Fig. 8D), and NtF3'H transcripts in the stems were down-regulated in L2 and L4 transgenic lines (Fig. 8E). Finally, the anthocyanin biosynthesis genes NtDFR, NtANS and NtUFGT which are required for anthocyanin biosynthesis at later steps were also significantly upregulated by EsMYB90 (Fig. 8F-H).
In summary, our results suggest that EsMYB90 controls anthocyanin biosynthesis by promoting expression of anthocyanin biosynthesis genes, particularly LBGs.