Determination of photosynthetic and anthocyanin content
To elucidate the mechanism of biosynthesis of pigments in three-leaf colors of L. chinense. var. rubrum, the GL, ML, and PL were chosen for this study (Fig. 1). The contents of chlorophyll and anthocyanin in these three leaf colors were distinct. Compared to the GL and ML, PL had relatively lower levels of carotenoid and total chlorophyll but higher anthocyanin accumulation, which indicated that anthocyanin might be the main pink-colored material in leaves.
Differential metabolites from different leaves
Since anthocyanin content is a determinant for leaf color change in the plant, we profiled the metabolome of the leaf samples of GL, PL, and ML using the LC-MS/MS analysis and annotated it in the MS2 database. A total of 207 compounds were detected in the L. chinense. var. rubrum leaf, which could be grouped into eight classes, including Proanthocyanins, Polyphenol, Isoflavone, Flavonol, Flavanone, Anthocyanins, Flavonoid, Flavone, etc. (Additional file 1) All metabolites were analyzed by principal component analysis and hierarchical cluster analysis to determine the influence of leaf-color change on the change of metabolites. (Fig. 2A) [32]. A Fold change (≥2 or ≤0.5) among the metabolites with a VIP>1 was used as an identification criterion. In all group comparisons, a total of 11 metabolites showed significant differences in each combination. (Fig. 2B). There were 35 differential metabolites (31 up-regulated and 4 down-regulated) between GL and PL (Fig. 2C), 37(33 up-regulated and 4 down-regulated) between GL and ML (Fig. 2D). Two cluster heat maps of 46 flavonoid compounds were obtained. (Fig. 2E, F). Interestingly, all the differentially accumulated anthocyanins were up-accumulated in ML and PL samples.
Differential metabolic pathways in different leaves
The enrichment of differential metabolites in the metabolic pathway by the KEGG database helps us to understand the changes in the mechanisms of metabolic pathways in different leaf colors. (Additional file 2). Notably, different metabolites were enriched in the KEGG metabolic pathway. Compared with GL and ML, PL was mainly involved in biosynthesis, including flavonoid biosynthesis and anthocyanin biosynthesis.
Transcriptome de novo assembly, annotation
Three standardized cDNA libraries were constructed from RNA from GL, ML, and PL. After the cDNA library was cleaned and characterized, 186,694,570,149,946,386 and 123,143,062 clean reads were obtained, respectively. More than 98.09% of them have Q20(an error probability of 0.02%) for GL, PL and ML were 97.9%, 97.48%. The GC contents were approximately 43.73% 43.84%, and 43.98% for GL, PL, and ML (Additional file 3). These clean reads were assembled into 231,810 unigenes with lengths ranging from 65 to 21,135 bp (average length 1271bp) and an N50 of 2608 bp. There were 110,962 unigenes (47.87%) with a length of<500 bp; 31,726 unigenes (13.69%) with a length of 500-1000 bp; 38,050 (16.41%) unigenes with a length of 1000-2000 bp; 23,499 unigenes (10.14%) with a length of 2000-3000 bp, and the remaining 27,573 unigenes (11.89%) with a length of>3000 bp (Additional file 4). The sequencing quality was able to cover the vast majority of expressed genes in GL, ML, and PL, which provided the foundation for further analysis.
For comprehensive annotation, all the assembled unigene sequences were matched against NR, SwissProt, KEGG, KOG, GO, NT and Pfam databases produced 118,518(17.88%);96,572(14.57%);115,058(17.36%);75,254(11.36%);84,662(12.77%);88,001(13.28%) and 84,662(12.77%) respectively (Additional file 5).
With GO classification, 391,895 matched unigenes were classified into 3 major categories: biological process, molecular function, and cellular component (Additional file 6A). In the group of biological processes, the majority of unigenes involved in the metabolic process (43,655) and cellular process (41,359); in the cellular components group, the cell (21,255) and cell part (21,255); and the molecular function group, binding (47,320) and catalytic activity (41,565).
The annotated unigenes were assigned to the KEGG database for the identification of the biochemical pathways (Additional file 6B). A total of 72,251 unigenes were divided into 46 metabolic pathways under six main branches: cellular processes (6,659), environmental information processing (5,400), genetic information processing (11,381), human diseases (13,719), metabolism (25,260), and organismal systems (9,832). The top ten metabolic genes in subclasses included the following: signal transduction, translation, carbohydrate metabolism, global and overview maps, folding sorting and degradation, transport and catabolism, viral, energy metabolism, amino acid metabolism, and overview.
In total, pathways related to cellular processes and environmental information processing were also well represented by unigenes from L. chinense. var. rubrum. This result should play an important role in our metabolic pathway research in L. chinense. var. rubrum leaves.
Analysis of the differential expression genes in different colors
We applied the standard |Log2FC|≥2 to make all pairwise comparisons of the different expressions between GL, PL, and ML leaf colors. We identified 6743 significant DEGs between the GL and ML and 1636 significant DEGs between the GL and PL. With GL set as control, the numbers of up-regulated and down-regulated genes in ML were 2855 and 3888, respectively. 910 DEGs were up-regulated and 726 DEGs were down-regulated between GL and PL (Fig. 3A, B).
Go allocation was applied for predicting the function of gene expression that is involved in the flavonoid biosynthesis pathway. The most important enrichment Go terms are selected and displayed in figure (Additional file 7), and the most significant enrichment was found in the correlation between GL and ML, such as metabolic process, organic substance, metabolic process, primary metabolic process, and cellular metabolic process. Oxidation-reduction process, single-organic metabolic process, oxidoreductase activity and catalytic are significant between GL and PL.
To further investigate the biochemical pathways of these DEGs, all of which were mapped to terms in the KEGG database, which was a pathway-related database that could help better understand specific processes, gene functions, and gene interactions at the transcriptomic level[33]. There were 591 (GL vs ML) and 247 (GL vs PL) DEGs mapped onto the top 20 enriched KEGG pathways in the L. chinense. var. rubrum transcriptomes from different colors of leaves. In these samples, flavone and flavanol biosynthesis(ko00942) and Flavonoid biosynthesis (ko00941) were closely related to anthocyanin synthesis (Additional file 8).
The focus of this study is the difference in anthocyanin accumulation among different leaf colors. Previous anthocyanin determination results showed that the anthocyanin content in GL was significantly lower than in ML and PL (Fig. 1A), which could prove the color substance of L. chinense var. rubrum leaves were anthocyanin. Therefore, we focused on the metabolic pathways related to anthocyanin synthesis. Genes enriched in this pathway were 2 ANR (EC:1.3.1.77; ANR1467 and ANR1398); 4 CYP75A (EC:1.14.14.81; CYP75A1815 CYP75A2846, CYP75A2909 and CYP75A1716); 1 DFR (EC:1.1.1.219; DFR92899), and 2 UFGT (EC:2.4.1.115; UFGT1876 and UFGT3273).In addition to the structural genes, MYB, bHLH and WD40 great influence on anthocyanin biosynthesis[34]. Because the leaves appear to be of different colors and have different anthocyanins, we will focus on the transcription factors and select 3 MYB, 3 bHLH, and 3 WD40 transcription factors.
qRT-PCR validation transcriptome results
To verify the expression levels of structural genes and transcription factors related to anthocyanin synthesis in L. chinense var. rubrum, β-actin was selected as the reference gene. The results showed that the expression was similar to what had been obtained by RNA-Seq (Fig. 4A). ANR and DFR were significantly up-regulated in ML and PL, and the expression of CYP75A in PL and ML was higher than it in GL, but the difference was not significant. Interestingly, the genes associated with stable anthocyanin synthesis were differentially expressed in the GL, ML, and PL, and the expression of the UFGT gene in the GL was much higher than that in the PL and ML. However, qRT-PCR results showed that the UFGT gene behaved extremely differently between the GL and PL. The high expression level of the UFGT gene in PL was consistent with the formation of stable anthocyanins, suggesting that it should be involved in the biosynthesis of anthocyanins[35]. Unlike structural genes, the expression level of candidate regulatory genes (MYB, BHLH, and WD40) was also higher in PL except for BHLH2379, BHLH2808 and WD3208(Fig. 4B). Three MYB genes (MYB1131, MYB1811, and MYB3057) and WD40 proteins (WD2173 and WD3291)were up-regulated in PL, suggesting that MYB and WD factors might be involved in the pigmentation of PL.
Integrative metabolomic and transcriptomic analysis
As some differentially expressed genes were located in multiple branches, the intensity of metabolite accumulation could not be judged. Therefore, a joint analysis of transcriptome and metabolome was needed to analyze large-scale secondary metabolites and pathway regulation. We focused on the flavonoid biosynthesis pathway and the anthocyanin biosynthesis pathway (Fig. 5).
There were 9 genes involved in anthocyanin biosynthesis, including ANR, CYP75A, DFR, and UFGT. Besides, 11 different metabolites also participated in this process. Based on the differentially expressed genes and metabolites, the metabolic pathway of anthocyanins in L. chinense. var. rubrum was summarized and analyzed. Interestingly, all differentially accumulated anthocyanins, such as cyanidin 3-O-glucoside, cyanidin O-syringic acid, cyanidin 3,5-O-diglucoside, pelargonidin, petunidin 3-O-glucoside, and peonidin 3-sophoroside-5-glucoside, increased in ML and PL, indicating the presence of anthocyanin branching in ML and PL. Surprisingly, UFGT is directly involved in the stable biosynthesis of anthocyanin synthesis in PL, which is consistent with the accumulation of anthocyanins. In summary, our study showed that DFR, ANR, and UFGT genes played an important role in promoting the accumulation of anthocyanins, including the rise of cyanidin 3-O-glucoside, cyanidin O-syringic acid, cyanidin 3,5-O-diglucoside, pelargonidin, petunidin 3-O-glucoside, and peonidin 3-sophoroside-5-glucoside which brought the leaves of L. chinense. var. rubrum different colors. Typically, TFs activate genes in secondary metabolic pathways to coordinate their expression[36]. While bHLH was not significantly expressed, MYB (gene1131,3057) and WD40(gene2173,3219) were highly upregulated in PL (Fig. 4B). This is consistent with the trend of the late synergistic genes DFR, ANR, and UFGT. The results had shown that they formed a ternary complex with bHLH and WD40 transcription factors to up-regulate the expression of the DFR, ANR, and UFGT genes, which ultimately formed anthocyanins and greatly improved the level of anthocyanins[37, 38]. In summary, our study showed that DFR, ANR, and UFGT genes played an important role in promoting the accumulation of anthocyanins, including the rise of cyanidien 3-O-glucoside, cyanidin O-syringic acid, cyanidin 3,5-O-diglucoside, pelargonin, petunidin 3-O-glucoside, and peonidin 3-sophoroside-5-glucoside which caused the L. chinense. var. rubrum leaves to have different colors.