Qualitative and quantitative analysis of anthocyanins
Table 1
Chromatographic and spectral data of anthocyanina from 'Qiansiban'.
Peak | Retention time (min) | Molecularion | Fragmention | Tentative identification |
1 | 11.4 | 465.1 | 303.05 | Delphinidin-3-O-glucoside |
2 | 13.2 | 449.1 | 287.05 | Cyanidin-3-O-glucoside |
3 | 13.95 | 479.11 | 317.06 | Petunidin-3-O-glucoside |
4 | 14.84 | 433.11 | 271.06 | Pelargonidin-3-O-glucoside |
5 | 16.3 | 493.13 | 331.08 | Malvacin-3-O-glucoside |
6 | 11.28 | 465.1 | 303.05 | Quercetin-3-glucoside |
7 | 13.07 | 449.1 | 287.05 | Luteolin-7-O-glucoside |
8 | 14.14 | 451.12 | 289.07、199.01 | Santianol-7-O-glucoside |
9 | 16.28 | 611.15 | 499.1 | Kaempferol-O-hexose-C-hexoside |
10 | 16.66 | 595.16 | 433.11、313.07 | Apigenin-C-diglucoside |
11 | 17.13 | 611.15 | 499.1 | Luteolin-3,7-diglucoside |
12 | 19.9 | 579.16 | 433.11 | Apigenin-7-O-rutoside |
The color of flower samples varies from S1 to S5. (B): Total contents of individual anthocyanidins at S1 to S5. The bars represent the average data of the three operations of HPLC‒MS/MS.
Flower color of 'Qiansiban' is purplish red at S1 stage, and as the flowers bloom, the color gradually changes to pinkish purple and to purple finally (Fig. 1A).
Anthocyanins and flavones/flavonols were identified by comparison with the HPLC retention time, elution order, UV–vis spectrum and MS fragmentation pattern in published data (Table 1).
Through the analysis of mass spectrometry molecular ions and fragment ions and in combination with references [10, 11, 12], five anthocyanins were detected in the petal of 'Qiansiban' at five stages, namely, delphinidin-3-O-glucoside (De3G), cyanidin-3-O-glucoside (Cy3G), petunidin-3-O-glucoside (Pe3G), pelargonidin-3-O-glucoside (Pg3G), and malvacin-3-O-glucoside (Ma3G) (Table 1). It is speculated that the 7 flavonoes and flavonols were quercetin-3-glucoside, luteolin-7-O-glucoside, santianol-7-O-glucoside, kaempferol-O-hexose-C-hexoside, apigenin-C-glucoside, luteolin-3,7-diglucoside, and apigenin-7-O-rutoside (Table 1) [13, 14, 15].
H. syriacus petals undergo major changes in flavonoid and anthocyanin accumulation from S1 to S5. Anthocyanidin is the main contributor to the color of H. syriacus, thus the type, content, and biosynthesis process of flavonoids in H. syriacus petals are analyzed emphatically.
The highest flavonoid content was found at S1, which gradually decreased with the flower opening process. The higher content of anthocyanidin in S1 was Ma3G (Fig. 1B), and the higher content of flavonoe and flavonol were apigenin-C-diglucoside (Supplementary Table S1). The accumulation trend of flavonoes and flavonols were S1 > S2 > S3 > S4 > S5, showing a gradually decreasing trend. The analysis based on HPLC‒MS/MS shows that all anthocyanidins have the highest content in S1 and the lowest content in S5. The contents of these five anthocyanins gradually decreased with the flowering process; from S1 to S5, they decreased by 46%, 52%, 40%, 64%, and 39%, respectively.
In summary, the accumulation of flavonoids and anthocyanidin reached highest and lowest at S1 and S5 respectively. Ma3G (anthocyanidin content with high accumulation content), De3G, Pe3G, Cy3G, and Pg3G are the main chromogenic anthocyanidins. With the decrease in anthocyanidin concentration, the color of the petals changed from purplish red to purple (Fig. 1).
Transcriptome profiling of H. syriacus petals in different states
Overview of transcriptome sequencing
Transcriptome sequencing was performed on 15 samples at 5 stages of 'Qiansiban'. A total of 89 Gb clean data was obtained, and the clean data of each sample reached 6.616 Gb. The GC content was 44.34% − 45.62%, and the percentage of the Q30 base was 93.62% − 95.03% (Supplementary Table S2).
Functional annotation of new genes
The DIAMOND [16] code was used to match the newly discovered sequence with the NR [17], Swiss professional [18], COG [19], KOG [20], and KEGG [21] databases, and the KEGG origin and alternative results of the new sequence were obtained. InterProScan [22] used the info integrated by InterPro to investigate the GO [23] GO Orthology results of the latest genes, foreseen the aminoalkanoic acid sequence of the latest genes, and then compared them with the Pfam [25] database victimization HMMER [24] code to obtain annotation data of the latest genes.
Based on the chosen order, the mapped reads were spliced with the NeighborTie code, and compared with the first-order annotation data, 32956 new genes were discovered, of which 23322 were annotated. The genes annotated in different databases range from 4319 to 23235, with TrEMBL annotation having the highest number of genes and COG annotation having the lowest number of genes (Table 2).
Table 2
Statistics of annotation results of new gene functions
Annotated databases | COG | GO | KEGG | KOG | Pfam | Swiss-Prot | TrEMBL | eggNOG | NR | All |
New Gene Number | 4,319 | 15,725 | 13,795 | 11,717 | 12,552 | 14,343 | 23,235 | 18,006 | 23,052 | 23,322 |
Analysis of DEGs identified in the four libraries
With thresholds FDR < 0.01 and FC > 2, 3456 DEGs (3 053 up- and 403 downregulated) were found between the FS1 and FS2 libraries. Meanwhile, the number of differential expressed genes between the FS1 and FS3 libraries is the most, with 27 736 DEGs (12 655 up- and 15 081 downregulated) found. 25 968 DEGs (12 124 up- and 13 844 downregulated) were found between the FS1 and FS4 libraries, and 20 937 DEGs (10 013 up- and 10 924 downregulated) between the FS1 and FS5 libraries were detected (Fig. 2A). The common differential expressed genes among 5 comparisons were 2702 (Fig. 2B ).
GO enrichment analysis was conducted on the screened differentially expressed genes, and Fig. 3A shows the top 20 classifications with the highest GO enrichment of differentially expressed genes. The differentially expressed genes are distributed in the three primary classifications of biological process, cell composition, and molecular operation. Mainly enriched in cell process, biological process, and single-organism process in biological process, catalytic activity, binding, and transporter activity in cell composition, and cell, cell part, and organelle in molecular operation.
The differential factor KEGG metabolic pathway map is especially integrated into a whole metabolic pathway by classifying the differential genes. KEGG metabolic pathway of the ‘Qiansiban’ differential factor primarily focuses on essential amino acid synthesis, flavonoid synthesis, cyanide metabolism, flavonoid and flavonol synthesis, essential amino acid metabolism, and anthocyanin synthesis. Among them, the metabolic pathways associated with flower color formation are essential amino acid synthesis and metabolism, flavonoid and flavonol synthesis, and anthocyanin synthesis (Fig. 3B).
By analyzing the sequences of the transcriptome of H. syriacus, combined with KEGG annotation, sixty-four differentially expressed genes were obtained at the 5 stages of 'Qiansiban'. The utmost variety of UFGT genes was fourteen, and the minimum variety of F3'5'H genes was one (Fig. 3C).
Identification of structural genes of interest in the flavonoid synthesis pathway
Transcriptome analysis showed that sixty-four genes were considerably enriched in the flavonoid biogenesis pathway. The accumulation patterns of these anthocyanidins at different stages were consistent. The relative expression level of PAL sequentially decreases when reaching its most worth at S2 and increased at S5. The relative expression levels of CHS1 and ANS1 genes additionally reached their most throughout the S2 amount, with the CHS1 sequence step by step decreasing throughout the S2-S5 amount and the ANS1 sequence increasing throughout the S4 amount and decreasing throughout the S5 amount. The relative expression level of the CHI1 sequence step by step decreased throughout the S1-S4 amount and increased throughout the S5 amount. The relative expression level of the FLS1 sequence is strictly the other, step by step increasing throughout the S1-S4 amount and decreasing throughout the S5 amount. The relative expression level of the DFR1 sequence step by step decreased throughout the S1-S5 amount. Among the relative expression levels of the UFGT1 sequence, the relative expression level was the lowest within the S2 stage and increased from the S2-S5 stage. The relative expression levels of F3H3 associated with nursingd F3'5'H genes showed an increasing decreasing trend, reaching their most values throughout the S4 and S2 periods (Figs. 4 and 5, Supplementary Table S3 and Table S6).
According to the GO enrichment map and KEGG enrichment map of DEGS, nine structural genes were screened, which played a key role in the flower color change of 'Qiansiban', namely, UFGT1(gene-LOC120143098), DFR1(gene-LOC120216676), CHI1(gene-LOC120193176), FLS1(gene-LOC120188023), ANS1(gene-LOC120214326), CHS1(Hibiscus_syriacus_newGene_21723), PAL1(Hibiscus_syriacus_newGene_32513), F3'5'H(Hibiscus_syriacus_newGene_29100) and F3H(gene-LOC120154124).
Changes in the expression of transcription factors
Transcription factors such as MBW (MYB, bHLH, WD40) affect the color of H. syriacus flowers by regulating the synthesis of anthocyanins. According to transcriptome sequencing and classification statistics, the number of five transcription factors, including AP2-ERF-EFR, MYB, bHLH, C2H2, and NAC (Supplementary Figure S1), are relatively high in the 'Qiansiban'. Among them, there are 279 MYB transcription factors and 203 bHLH transcription factors (Supplementary Table S4).
Based on GO and KEGG enrichment, transcription factors (TFs) coded by DEGs and seven structural genes were subjected to protein‒protein interaction analysis to screen crucial TFs control anthocyanin synthesis. The results are displayed (Supplementary Table S4): bHLH2 (gene-LOC120149108), bHLH4 (gene-LOC120171442), bHLH6 (gene-LOC120209233), bHLH7 (gene-LOC120215242), MYB1 (Hibiscus_syriacus_newGene_28382), MYB3 (gene-LOC120154943), and MYB5 (gene-LOC120214348) would possibly participate in the regulation of the seven structural genes.
qRT‒PCR analysis validation of anthocyanin synthesis structural genes
To verify the accuracy of the RNA-Seq results, we designed seven structural genes and seven regulatory genes in flavonoid metabolism and analyzed the relative expression quantities in the five flowering stages through qRT‒PCR (Fig. 5). The change trend of relative expression was consistent with that of substance content, which indicated that the result of RNA-Seq was reliable.