3.1. Identification and physicochemical properties CtCYP81E subfamily in safflower
In total, 15 full-length safflower CtCYP81E genes with c. tinctorius location markers were extracted from the draft genome database selected on the basis of P450 Pfam00067 domain. In order to further classify CtCYP81E genes into subfamily, we extensively carried out comparitive analysis with the overall P450 genes of Arabidopsis genome using Phytozome 4.0. A total of 246 P450s and 26 pseudogenes were retrieved for further characterization. In addition, the physicochemical properties were also determined with the help of ProtParam online tool (Table 1). The result of the protein size for all 15 CtCYP81E encoded amino acids was found between 115–516 amino acids. The expected molecular weight was recorded theoretically resulting within the range of 12.97 kDa (CtCYPE9) to 59.01 kDa (CtCYPE14), with an average value of 45.59 kDa. The values of isoelectric points (pI) ranged from 4.74 (CtCYPE9) to 9.35 (CtCYP81E10). Furthermore, the grand average of hydropathicity (GRAVY) index was also determined revealing that most of the CtCYP81E proteins were allocated to hydrophilic nature. Out of all, the most stable protein was CtCYPE4, comprising a stability index equals to 40.66.
Table 1
The physiochemical properties of safflower CYP81E subfamily proteins. The data was collected using the online tool of ExPASy (available online: http://web.expasy.org/protparam/).
Gene Name | Gene ID | Protein Length | PI | MW(Da) | Subcellular Localization | IS index | Arabidopsis Homology | GRAVY |
CYP81E1 | CCG011365.1 | 516 | 8.38 | 58237.60 | Plasma membrane | 49.02 | AT4G37370.1 | -0.111 |
CYP81E2 | CCG011366.1 | 501 | 8.96 | 57297.86 | Plasma membrane | 46.06 | AT4G37370.1 | -0.125 |
CYP81E3 | CCG011367.1 | 501 | 8.73 | 57298.61 | Plasma membrane | 43.78 | AT4G37370.1 | -0.162 |
CYP81E4 | CCG011368.1 | 339 | 5.61 | 38282.11 | Plasma membrane | 40.66 | AT1G66540.1 | -0.332 |
CYP81E5 | CCG007304.1 | 506 | 6.78 | 57579.54 | Plasma membrane | 53.61 | AT4G37370.1 | -0.173 |
CYP81E6 | CCG007305.1 | 506 | 6.78 | 57579.54 | Plasma membrane | 53.61 | AT4G37370.1 | -0.173 |
CYP81E7 | CCG007306.1 | 440 | 6.45 | 50001.45 | Plasma membrane | 54.20 | AT4G37370.1 | -0.255 |
CYP81E8 | CCG007309.1 | 337 | 6.49 | 38489.90 | Plasma membrane | 44.14 | AT1G66540.1 | -0.320 |
CYP81E9 | CCG007984.1 | 115 | 4.74 | 12976.39 | Plasma membrane | 70.48 | AT4G37370.1 | -0.640 |
CYP81E10 | CCG008862.1 | 246 | 9.35 | 27858.38 | Plasma membrane | 47.65 | AT4G37320.1 | -0.147 |
CYP81E11 | CCG008863.1 | 177 | 6.83 | 19946.29 | Endoplasmic reticulum | 47.57 | AT1G66540.1 | -0.194 |
CYP81E12 | CCG013393.1 | 435 | 8.49 | 49493.63 | Plasma membrane | 44.98 | AT4G37360.1 | -0.143 |
CYP81E13 | CCG013395.1 | 437 | 8.84 | 49746.09 | Plasma membrane | 42.55 | AT4G37340.1 | -0.127 |
CYP81E14 | CCG021962.1 | 516 | 7.68 | 59005.54 | Plasma membrane | 45.29 | AT4G37340.1 | -0.156 |
CYP81E15 | CCG004067.1 | 434 | 8.41 | 50124.07 | Plasma membrane | 46.92 | AT4G37370.1 | -0.286 |
3.2. Phylogenetic reconstruction of CtCYP81E and Arabidopsis P450 genes
A neighbor-joining (N-J) phylogenetic tree was created using 15 CtCYP81E candidate protein sequences obtained from safflower genome and 272 Arabidopsis P450s (Table S2). The division of cluster groups between these two plant species by means of P-distance method determined their phylogenetic origin using MEGA-X package [25]. The clustering of P450 genes were initially divided into two widely known classess known as A-type and non-A-type P450 sequences. These two clades were further subdivided into nine different clans including, clan71, clan51, clan710, clan85, clan711, clan86, clan97, clan72, and clan74 (Fig. 1). The CYP71 clan was found the largest A-type class which contains 131 genes (48.51%) succeeded into 10 more subfamilies such as CYP71AH, CYP71AT, CYP71AU, CYP71AX,CYP71D, CYP71BE CYP71BG,, CYP71BL, CYP71BN and CYP71BP. Mostly, these subfamilies of CYP71 clan symbolize the presence of plant-specific enzymes that are involved during secondary metabolic reactions mainly in flavonoid biosynthesis. Our phylogenetic analysis indicated that CtCYP81E gene family of safflower is consistently clustered with the CYP71 clan of Arabidopsis. Hence, it is possible that the CtCYP81E subfamily in safflower is most likely involved in flavonoid biosynthesis as found in the CYP71 clan of the model plant. [26, 27].
3.3. Analysis of CtCYP81E gene structure, motifs and promoter
The details analysis of CtCYP81E subfamily including gene organization, signature motifs and cis regulatory elements were carried out. As described in (Fig. 2), most of the CtCYP81E genes (Table S3) shared a common organization of exon/intron makeup. Nonetheless, a small number of CtCYP81E genes did not contain the basic gene structures. For instance, the gene structure organizations of CtCYP81E6 and CtCYP81E1 showed that both of these genes consist of a short exon and long intron when compare with the other candidate genes present in the same family. Our findings were found consistent with CYP450 organization in A. thaliana [28] which confirrns that the number of exons in CYP71 clans of A. thaliana was ranged between 2–5. In addition, the shortest exon (27 bp) in the CYP71 clan (CYP71B32) of Arabidopsis was found longer than the shortest exon of CtCYP81E6 (16 bp) in safflower. Safflower CYP81E genes most likely contain 1–3 exons as demonstrated in (Fig. 2B). Mostly, these genes shared two (60%, 9/15), three (26.6%, 4/15), and one (13.3%, 2/15) of exon arrangments. Whereas the length of the introns in safflower P450 genes were estimated from 126 to 5946 bp indicating parallel results with the A. thaliana [28] and C. elegans [29].
In the next level, all 15 CtCYP81E genes from saflower were analyzed for the identification of naturally conseved protein motifs. The multiple sequence alignemt of these proteins demonstrated that almost all 15 candidate safflower P450 genes contained the basic signature motifs of P450 family such as heme binding region, PERF region, K-helix region, and I-helix motif (Fig. S1). A total of 10 conserved motifs of CtCYP81E proteins have been found consistent with the Arabidopsis P450s MEME online investigation. The output results implied that nearly all candidate CtCYP81E proteins contained these conserved motifs exluding CtCYP81E4, CtCYP81E8, CtCYP81E9, CtCYP81E11, which described a different conservation pattern. In addition, the position of few conserved motifs in CtCYP81E was not inlined with the Arabidopsis P450 s. The sequences and organization of these signatory motifs of CtCYP81E proteins were demonstrated in (Fig. 2C). These results suggested that CtCYP81Es in safflower inherit basic structural and functional domians during the process of evolution. Moreover, the cis- regulatory elements organizations of these CtCYP81Es specifically in the 2 kb 5′flanking region upstream to the start codon (Table S4) was thoroghly analyzed. Altogether, six major types of cis-elements were found in the 2 kb 5′flanking region of the promoter (Fig. 2D). Among these genes promoter, the result of a few CtCYP81Es members contain endosperm expression elements (AACA_motif; GCN4_motif), some of these CtCYP81Es contained element essential for the anaerobic induction (ARE), another group of CtCYP81Es showed the presence of abscisic acid (ABA) responsive element (ABRE), and low-temperature responsiveness elements (P-box; TATC-box), while the remaining CtCYP81Es consist hormonal responsive elements (methyl jasmonate (MeJA), and light responsiveness element (G-box; AAAC-motif). Conclusively, the occurrence of these cis-elements identified in safflower CtCYP81Es genes indicated that they are most likely to be involved during plant growth, development and plant adaptation to various stress responses and hormone signal pathways.
3.4. Functional classification of the safflower CtCYP81E subfamily
The functional categorization of the 15 CtCYP81E transcripts in safflower was carried out using GO analysis. The results of the in silico classfication revealed that all 15 CtCYP81E transcripts were allocated into one or additional GO terms. These CtCYP81E transcripts were found in all three fundamental functional categories including biological process indicated as (BP), molecular function indicated as (MF), and cellular component indicated as (CC). Moreover, eight functional subcategories were also demonstarted in the next level wherein two CC subcategories: integral compoent of membrane and membrane were detected. Five MF subcategories: oxioreductase activity, iron ion binding, heme binding, mrtal binding and isoflavone 2'-hydroxylase activity; and one BP subcategories: oxidation-reduction process (Fig. 3). Despite the fact that CtCYP81E gene belongs to type A, as previously reported, there is no significant difference found in the functional annotation of the type A and non-type A P450 sequences [30].
3.5. Protein Clustering Networks
The monooxygenases (cytochrome P450) play important roles in xenobiotic metabolism and biosynthesis of internal nutrients such as flavonoids, vitamins, steroids, hormones, and fatty acids. The capability of the P450 encoded enzymes to catalyze important substrates that involve interaction with its redox protein counterparts. This biochemical catalysis can be altered in association with the membrane-bind heme protein cytochrome b5 [31]. With the help of AtCYP81E orthologous, we systematically predicted the PPI interaction network of CtCYP81Es subfamily in safflower. We confirmed 10 widely spread proteins co-associated with these CtCYP81Es, which include translocation (2), membopane lipoprotein (1), aquaporin-like (1), ABC transporter (1) and protein kinase (1) (Fig. 4). The independent interactor protein networks indicated that CtCYP81E protein (1, 2, 3, 5, 6, 7, 9 and 15) interacts with the UGT74E2 protein, which is mainly involved in the biosynthesis of IBA (indole-3-butyric acid) and directly influence the homeostasis of auxin. Additionally, CtCYP81E proteins (1, 2, 3, 5, 6, 7, 9 and 15) work together with AT5G25930 proteins which are largly associated with the protein amino acid phosphorylation. The CtCYP81E proteins (4, 8 and 11) interact with the AT5G48605 protein and can enhance plant defense mechanism. Besides this, CtCYP81E protein (4, 8 and 11) interacts with the AT1G59660 protein which acts as key regulator in the water channel. Notably, we found that other CtCYP81E orthologous except CtCYP81E (4, 8 and 11) proteins interact with ABCD1 proteins and may be involved in the transportation mechanism. The PPI network of CtCYP81E orthologous highlights its potential role in several physiological and biosynthetic process occurred simultaniously in plants system.
3.6. Expression analysis and functional annotation of CtCYP81E subfamily genes
Expression levels of P450 variants in safflower were initially determined with the help of RNA-seq data (whole Transcriptome Shotgun Sequencing) in different tissue specifying five selected tissues/organs (root, stem, seed, flower and leaf tissues). The expression level was calculated according to kilobase model of exon model per million mapped read (RPKM) method according to the instruction given by (Mortazavi et al. 2008). The RNA-seq data was obtained from the safflower genome database (PRJNA399628; posted to NCBI on August 23, 2017). In general, the expression signals of almost all selected safflower P450 genes were detected in all organs but with different patterns. As revealed in (Fig. 5A), the expressed P450 genes in safflower were clustered into five groups including, G1 (6.6%, 1/15), G2 (6.6%, 1/15), G3 (33.3%, 5/15), G4 (33.3%, 5/15), and G5 (13.3%, 2/15) that were more preferably expressed in the leaves, stems, seeds, flowers, and roots, respectively. Furthermore, to validate the transcript abundance of CtCYP81E genes and their correlation in biosynthetic processes, we extensively carried out qRT-PCR analysis of these 15 genes at different flowerering stages such as bud, initial, flower and fading stage. Across CtCYP81E subfamily, the expression level of CtCYP81E2, CtCYP81E8, and CtCYP81E15 were abundantly detected at flower stage indicating that there might be a strong link between the regulation of transcription of CtCYP81E genes and cellular metabolism in safflower.In addition, the transcripts of CtCYP81E1, CtCYP81E2, CtCYP81E5, and CtCYP81E7 were identified in high expression level at the fading stage of flowering suggesting the transcription regulation of these genes at a later flower developmental period. The transcripts of CtCYP81E14, and CtCYP81E15 showed high expression level at initial flowering of safflower (Fig. 5B). Altogether, the qRT-PCR assay suggested a differential expression pattern and fold-change values of the selected CtCYP81E subfamily genes highlight their decisive roles in plant defense systems and developmental processes.
3.7. Subcellular localization and transcriptional regulation of CtCYP81E8
Based on our previous study on a CtCYP82G24, [32], we aimed to investigate the correlation between the quantitative expression trend of CtCYP81E8 gene and metabolite accumulation at different flowering stages of safflower. As described in (Fig. 6A&B), the expression level of CtCYP81E8 was detected consistent with the accumulation rate of total metabolites content in safflower petals. These findings provide a practical basis for the funtional characterization of CtCYP81E8, which could be a crucial modulator in the biosythetic pathway of flavonoid biosynthesis in safflower. Taking into consideration the functional importance and differential expression pattern of CtCYP81E8 gene, we therefore, cloned the full length sequence of CtCYP81E8 from safflower (Fig. 6C) and then constructed a fusion vector of CtCYP81E8 and GFP gene under the control of the 35S promoter (pCAMBIA1302- CtCYP81E8-GFP-35S) in order to determine the experimental subcellular localization. After the efficient construction of the plant overexpression vector fusion, the recombinant vector was then transiently transformed intothe onion epidermal cells through agrobacterium mediated transformation system. Fluorescence imaging of infected epidermal cells of onion bombarded with CtCYP81E8-GFP showed cell membrane localization (Fig. 6D). These findings revealed important evidence to support the assumption that CtCYP81E is able to catalyze cellular based biological reactions occurred in Carthamus tinctorius.
3.8. The induction of CtCYP81E8 transcription under variable stress conditions
The transcriptional regulatory network of CtCYP81E8 mRNA under variable stress environments has been demonstrated to confirm the underline notion of Cytochrome P450s involvement during a variety of plant secondary metabolites biosynthesis (Mizutani) Ohata 2010; Nelson, Werck-Reichhart 2011). By exploiting the temporal transcriptional regulatory channels of CtCYP81E8 under artificial environmental switches, we demonstrated a multi-regulation control system using qRT-PCR assays. The treatment group with methyl jasmonate at 0–12 hours, compared with the control group, the expression level of CtCYP81E8 gene showed an upward trend, among which the expression of CtCYP81E8 showed a unique increase at 8 h where, the transcription level was reached to its maximum. In contrast, at 12 h timepoint, the expression decreased significantly (Fig. 7A). Under drought stress conditions, the CtCYP81E8 gene expression was significantly induced at 4–8 h than the control plants. The expression level was reached to its maximum at 8 h timepoint under PEG induced stress however, the transcription of CtCYP81E8 was down-regulated at 12 h treatment times (Fig. 7B). Under strong light irradiation, the gene expression level of CtCYP81E8 at different treatment times 12–60 h was surprisingly down-regulated compared with control plants. The down-regulation was most significant at 36 h treatment time indicating intense susceptibility towards light stress (Fig. 7C). The transcription level of this gene after dark treatment was expectedly upregulated in all treatment times reaching to the maximum at 36 h. In general, the expression level was consistently rising from 12–36 h, and suddenly drops sharply after 48 h, but the overall expression level is up-regulated compared to the control group (Fig. 7D). These findings unanimously represented the multi-dimensional periodic regulatory network of CtCYP81E8 transcriptional system upon different stress conditions, highlighting cruicial blueprints in the molecular regulation system of plants adaptation to biotic and abiotic stress responses.
3.9. Transcriptional regulation system CtCYP81E8 overlapping with flavonoid accumulation in wild and mutant safflower
The correlation between the transcription level of CtCYP81E8 and accumulation pattern of total metabolite content through multiple flower developmental stages of wild and mutatnt safflower varieties was extensively investigated using qRT-PCR assay. Simultaneously, the accumulation content of total metabolites was purposely investigated using the same phases of the two naturally occuring flowers types in safflower including red and yellow flower. Interestingly, the expression profile of CtCYP81E8 showed a programmed expression system correspondantly during flowering developmental phases both in wild and mutant type of safflower (Fig. 8A&B). The transcription control level of CtCYP81E8 during the red flowering development stages except at the bud flowering phase (R1), confirmed that the increased trend of CtCYP81E8 transcript simultaneously affect the accumulation level of total metabolite content in red-typed wild safflower. In the same way, the yellow-typed wild safflower showed a consistent network of increased trend in the accumulation of metabolite content with the increase in the expression level of CtCYP81E8 excluding the bud flowering stage (Y1). The estimated theme was further confirmed by conducting simillar analysis in the mutant safflower line, suggested almost a similar type of correlation, however, the opposite trend was also found as in the wild type safflower but through a different flower developmental stage. It was suggested that the accumulation content of safflower metabolites was significantly increased with the increase in the transcription level of CtCYP81E8 during all three flowering stages of the red-typed mutant safflower. Nonetheless, a discontinued scheme of the metabolite accumulation in the white-typed mutant safflower was observed at bud flowering (M5) and full flowering (M7) stages with an axception to intitial flowering phase (M6), indicating a reverse order in comprison to their corresponding flower stages. As mentioned earlier, the expression level of CtCYP81E8 was significantly exploited under different stress conditions, indicating the concept of secondary metabolic activation in plants under variable abiotic stress conditions. Conclusively, these results insistingly suggested that the transcription regulation of CtCYP81E8 has a certain relationship with the accumulation profiling of metabolite content of different safflower varieties. Though assumuingly, but these findings could be cruicial in understanding the core concept of molecular regulatory signals that strategically switch on the secondary metabolic flux by intervening through a bulk of genetical and particularly, transcriptional events, to ensure plant's survival under acute environmental drifts.
3.10. Heterologous Expression and in vitro enzymatic assay of CtCYP81E8
In order to validate the potential function of CtCYP81E8 in vitro, the full length cDNA of CtCYP81E8 was cloned into the prokaryotic expression vector (pET28a+), and then transformed into E. coli BL21DE3 cells by thermal and electric shock transformation, and then induced by adding different concentrations of IPTG. The bacterial solution without IPTG induction and no load were used as control. The heterologous expression of CtCYP81E8 recombinant protein was mainly detected by Coomassie blue staining SDS-PAGE and Western Blot hybridization. The analysis of SDS-PAGE showed that the recombinant CtCYP81E protein was expressed at the 36.4 KDa site, but it seemed that the concentration of IPTG did not affect the protein expression (Fig. 9A). Then we purified the recombinant CtCYP81E8 protein and further identified the expression of the protein at different IPTG concentrations by western blot hybridization. As shown in (Fig. 9B), we found a single purpose band and a change in the protein expression with the concentration of IPTG induction. From our findings, we deduced that target protein of CtCYP81E8 was stably detected on SDS-PAGE and western blot hybridization on nylon membrane, moreover, the product size was also consistent with theoretical molecular weight of CtCYP81E8 protein (36.4 kDa), suggesting that the target protein was effetiently expressed in prokaryotic system, however, the different concentrations of IPTG could potentially affect the expression level of the target protein [16].
The primary objective of the DPM assay was designed to explore the complete consumption of oxygen by CtCYP81E8 recombinant enzyme for the oxidation of 100 mM DMP by direct absorbance method under different time periods. For the present study, by adding hydrogen peroxide in the reaction, the highest removal of 100 mM DMP concentrations (60%) in CtCYP81E8 batch was variably detected including the optimum removel at 24 h followed by 48 h respectively as compare to the control group (Fig. 10). For 12 h and 72 h, the removal of 100 mM DMP was less found than 15–40%, but it was also observed that after 36 h reaction the removal of DMP was reached to almost 40%. These findings depicts that insufficient absorbance of oxygen in a uniform reaction batch may be due to the poor efficiency rate. It was also suggested that the rate of reaction was independent of dissolved oxygen at the start of the reaction, until enough oxygen was present. But, after the sufficient utilization of the dissolved oxygen in the aforesaid reaction, the dependency of the reaction becomes essential for exogenous oxygen addition. Hence, more efforts are still needed to provide further insights in obtaining more efficient removal of DMP during in vitro activity assay.