Polysaccharide accumulation in different ginseng cultivars
In this study, the content of nine monosaccharides in the roots, stems and leaves of four cultivars were determined by HPLC. The total content of nine monosaccharides in roots was higher than that in stems and leaves, and that in stems and leaves were similar (Fig. 1). Among those samples, the total content of nine monosaccharides was varied. In the roots, the total content of nine monosaccharides was the highest in CM, followed by the BT and GL, and the lowest in SZ (P < 0.05, Fig. 1A). The total content of nine monosaccharides was similar of the four cultivars in the stems and leaves (P < 0.05, Fig. 1A). PCA score plot of four cultivars based on the nine monosaccharides content, and the results showed that the first two principal components (PC) explained 96% of the total variation (PC1 = 86%, PC2 = 10%). The all stems and leaves from four cultivars were clustered together, and clearly separated from roots, indicating significant differences in polysaccharides accumulation among tissues (Fig. 2A). In addition, Glc was the most component of monosaccharides in all samples (P < 0.05, Table 1).
Table 1
The content of nine monosaccharides in each sample.
|
|
GL (mg/g)
|
|
|
CM (mg/g)
|
|
|
SZ (mg/g)
|
|
|
BT (mg/g)
|
|
|
Root
|
Stem
|
Leaf
|
Root
|
Stem
|
Leaf
|
Root
|
Stem
|
Leaf
|
Root
|
Stem
|
Leaf
|
Man
|
0.0290
|
0.0277
|
0.0339
|
0.0358
|
0.0267
|
0.0277
|
0.0268
|
0.0271
|
0.0272
|
0.0307
|
0.0276
|
0.0288
|
GlcA
|
0.0587
|
0.0508
|
0.0501
|
0.0657
|
0.0500
|
0.0498
|
0.0588
|
0.0502
|
0.0503
|
0.0681
|
0.0510
|
0.0524
|
Rha
|
0.0382
|
0.0292
|
0.0292
|
0.0477
|
0.0283
|
0.0279
|
0.0340
|
0.0281
|
0.0287
|
0.0455
|
0.0288
|
0.0306
|
GalA
|
0.0389
|
0.0304
|
0.0298
|
0.0497
|
0.0293
|
0.0324
|
0.0365
|
0.0294
|
0.0323
|
0.0488
|
0.0294
|
0.0304
|
Glc
|
15.6215**
|
1.5288**
|
1.3461**
|
17.2789**
|
1.1827**
|
1.5607**
|
4.2758**
|
1.5825**
|
1.1975**
|
15.0773**
|
1.3242**
|
1.4523**
|
Gal
|
0.0513
|
0.0631
|
0.0639
|
0.0561
|
0.0537
|
0.0895
|
0.0481
|
0.0539
|
0.0722
|
0.0499
|
0.0578
|
0.0868
|
Xyl
|
0.0394
|
0.0349
|
0.0337
|
0.0388
|
0.0334
|
0.0334
|
0.0361
|
0.0346
|
0.0335
|
0.0373
|
0.0346
|
0.0346
|
Ara
|
0.0452
|
0.0470
|
0.0485
|
0.0475
|
0.0443
|
0.0557
|
0.0441
|
0.0450
|
0.0511
|
0.0458
|
0.0465
|
0.0591
|
Fuc
|
0.0451
|
0.0448
|
0.0450
|
0.0451
|
0.0448
|
0.0452
|
0.0449
|
0.0449
|
0.0451
|
0.0452
|
0.0450
|
0.0451
|
Note: The significance of glucose content in nine monosaccharides |
*Significant at the 0.05 probability level. |
**Significant at the 0.01 probability level. |
Differential expression genes in four cultivars
To comprehensively investigate the differences in gene expression levels of four ginseng cultivars, we performed transcriptome sequencing for GL, CM, SZ and BT. In total, we sequenced 33 libraries from the roots, stems and leaves of four cultivar samples (Additional file 2: Table S1). For further analysis, low-quality sequences were filtered out, and 241.37 G clean reads were obtained from the 33 libraries. Using the ‘Chunpoong’ genome as a reference genome, we mapped an average of 79.27%, 78.33% and 77.65% of clean reads for the roots, stems and leaves, respectively (Additional file 3: Table S2). The heatmaps of PCC values showed that the biological replicates had similar expression patterns and an extremely high PCC value (PCC > 0.80), except for BT1_R (average PCC = 0.44; Additional file 1: Fig. S1). Therefore, sample BT1_R was discarded from all subsequent analyses.
Based on the transcriptomic profile, PC1 and PC2 together explained 56% and 17% of gene expression variances among all samples, respectively. It is worth noting that the PCA score map showed stems and leaves tissue are clustered together and significant segregation from roots, indicating that gene expression at the transcriptome level responded to tissue changes, which was consistent with the results of the polysaccharides content (Fig. 2B). DEGs were identified from the samples of different tissues of each ginseng cultivars, we found that the number of DEG between GL and SZ samples was the highest in roots and stems, while the number of DEGs was the highest between CM and BT samples in leaves. The fewest DEGs were detected between BT and SZ samples in roots and leaves. In stems, there were the fewest DEGs between GL and BT (Additional file 1: Fig. S2).
Next, to better understand the functions of DEGs, we performed the KEGG enrichment analysis and GO category enrichment analysis. The DEGs of roots, stems and leaves were enriched in some secondary metabolic pathways, such as glutathione metabolism and flavonoid biosynthesis, as well as MAPK signaling pathway, protein processing in endoplasmic reticulum and plant-pathogen interaction et al (Fig. 3). In the GO enrichment analysis, the enriched terms of the DEGs included response to chitin, response to high light intensity, photosynthesis, and phenylalanine ammonia-lyase activity in roots, stems and leaves (Additional file 1: Figs. S3, S4, S5).
Analysis of ginseng polysaccharides biosynthetic pathway
To identify genes involved in the biosynthetic pathway of ginseng polysaccharides, we annotated the genes related to starch and sucrose metabolism (ko00500), fructose and mannose metabolism (ko00051), galactose metabolism (ko00052) and amino sugar and nucleotide sugar metabolism (ko00520). Based on the main monosaccharide components in ginseng polysaccharides, we outlined potential biosynthetic pathways for the formation of ginseng polysaccharides from sucrose. Sucrose was converted to D-fructose, then D-fructose-6phosphate (D-fructose-6p) to D-mannan-6p indirectly, and from D-mannan-1p to GDP-D-Man, subsequently, GDP-4-oxo-6-deoxy-D-Man to GDP-L-Fuc. In addition, sucrose was instantaneously transformed into UDP-Glc, UDP-glcA to UDP-D-xyl, and then UDP-D-xyl into UDP-L-Ara. Moreover, UDP-Glc was converted to D-glucose-6p and then to GDP- Fuc. In addition, UDP-Gal was also directly derived from UDP-Glc, and UDP–4-keto-6-deoxy-D-Glc was converted to UDP-4-keto-Rha and UDP-Rha (Fig. 3). The components of ginseng polysaccharides included Glc, Gal, Rha, Man, Xyl, Ara, GlcA, GalA and Fuc (Fig. 4).
Relationship between gene expression and metabolite accumulation in polysaccharides biosynthesis
In biosynthetic pathway of ginseng polysaccharides, we found 102 genes encoding 19 key enzymes that control the synthesis of ginseng polysaccharides. According to structural genes extracted from polysaccharides biosynthesis pathway, the expression levels of these genes in different samples are significantly district. We found the most of the genes encoded UTP-glucose-1-phosphate uridylyltransferase (UGP2), phosphoglucomutase (PGM) and sucrose synthase (SUS) in the root of GL and CM expressed at higher levels than that of BT and SZ. The expression level of genes encoded UDP-glucose 4-epimerase (GALE) were higher in stems and leaves of GL and SZ than of CM and BT. In addition, the genes encoded mannose-6-phosphate isomerase (MPI) and GDP-mannose 4,6-dehydratase (GMDS) expressed at highest levels in the stems of SZ. Other genes in the pathway that synthesize polysaccharides, such as UDP-glucuronate decarboxylase (UXS1), UDP-arabinose 4-epimeras (UXE), UDP-glucose 6-dehydrogenase (UGDH) and hexokinase (HK) et al, The Expression levels varied in different tissues across samples (Fig. 4). These results suggested that the synthesis of ginseng polysaccharides may be a pathway for multigene cooperative regulation.
Through the correlation analysis of polysaccharides synthesis related genes and content of polysaccharides, suggesting that 17 enzymes [PGM, fructokinase (scrK), beta-fructofuranosidase (sacA), UXE, UXS1, mannose-1-phosphate guanylyltransferase (GMPP), UGP2, GALE, MPI, GDP-L-fucose synthase (TSTA3), SUS, UDP-glucuronate 4-epimerase (GAE), HK, phosphomannomutase (PMM), UGDH, GDP-mannose 4,6-dehydratase (GMDS) and glucose-6-phosphate isomerase (GPI)] were obviously correlated to the content of monosaccharide content and total polysaccharides content. In addition, the expression of genes encoding scrK (Pg_S0635.5, Pg_S1306.14, Pg_S1495.1, Pg_S0588.13, Pg_S5155.1, Pg_S2241.31 and Pg_S3153.2) was positively corrected to most of monosaccharide content and polysaccharides content, which of HK (Pg_S4434.4, Pg_S3346.1, Pg_S4929.12 and Pg_S0234.21) was negatively correlated (Additional file 4: Table S3).
Co-expression modules related to the content of polysaccharides
In our study, the difference in total content of nine monosaccharides between CM and SZ was most obvious in the root, and the number of DEGs were the most of the root in CM_vs_SZ. We screened a co-expression module by WGCNA of 49877 genes and content of nine monosaccharides, which is come from the root of CM and SZ. This analysis identified 12 co-expression modules, and each containing 919 to 9747 genes (Fig. 5A). Pearson correlation analysis between module eigengenes and the content of Glc, Gal, Rha, Man, Xyl, Ara, GlcA, GalA, Fuc and total content of nine monosaccharides (total), indicated the biological importance of the two modules (greenyellow and brown). The greenyellow module obviously correlated with the content of Man (r = 0.99, P = 0.002), Glc (r = 0.91, P = 0.03) and total (r = 0.92, P = 0.03). The brown module was highly correlated with the content of Gal (r = -0.91, P = 0.03), Xyl (r = -0.95, P = 0.02) and Ara (r = -0.98, P = 0.004) (Fig. 5B). These results suggested that the two modules obviously correlated with the content of polysaccharides accumulation in ginseng.
The KEGG annotation showed that genes in these two modules were mainly related to metabolite pathways, such as phenylpropanoid biosynthesis (ko00940), starch and sucrose metabolism (ko00500) and amino sugar and nucleotide sugar metabolism (ko00520) (Fig. 5C, D). In addition, a large number of genes for polysaccharides biosynthesis were found in these two modules, such as genes encoding PGM, GPI, scrK, UGP2, GMPP, PMM, GALE, sacA, and SUS (Table 2).
Table 2
Genes are involved in the ginseng polysaccharide synthesis pathway in the module
Module Gene_id Gene family
|
greenyellow
|
Pg_S2017.3
|
PGM
|
Pg_S7036.4
|
GPI
|
Pg_S1306.14
|
scrK
|
Pg_S1124.2
|
UGP2
|
Pg_S0167.13
|
GPI
|
brown
|
Pg_S0635.5
|
scrK
|
Pg_S0953.13
|
GMPP
|
Pg_S4516.21
|
PMM
|
Pg_S0889.28
|
GALE
|
Pg_S2031.4
|
sacA
|
Pg_S0897.14
|
GALE
|
Pg_S1886.12
|
GMPP
|
Pg_S0061.8
|
sacA
|
Pg_S0219.46
|
UGDH
|
Pg_S3876.17
|
sacA
|
Pg_S3604.8
|
GALE
|
Pg_S3338.6
|
GMPP
|
Pg_S2762.11
|
SUS
|
Pg_S0455.9
|
GMPP
|
Note: PGM, phosphoglucomutase; GPI, glucose-6-phosphate isomerase; scrK, fructokinase; UGP2, UTP–glucose-1-phosphate uridylyltransferase; GMPP, mannose-1-phosphate guanylyltransferase; PMM, phosphomannomutase; GALE, UDP-glucose 4-epimerase; sacA, beta-fructofuranosidase; SUS, sucrose synthase. |
In order to find the key regulatory TFs related to polysaccharides biosynthesis from these two modules, we constructed a gene correlation network for each module by Cytoscape. In the greenyellow module, 6 TFs were identified, GRAS (Pg_S2354.13), MADS (Pg_S4852.3), AP2/ERF (Pg_S4672.9), MYB (Pg_S1414.8 and Pg_S4889.3) and HSF (Pg_S3558.9) (Fig. 6A). It was found that these TFs highly related to GPI, PGM and UGP2 (Fig. 6C). A total of 18 genes encoding 8 TFs were found in the brown module, including MYB (Pg_S3722.2, Pg_S7293.3 and Pg_S2010.18), bZIP (Pg_S1242.23), AP2/ERF (Pg_S6406.9, Pg_S3071.2, Pg_S0253.9, Pg_S3048.23 and Pg_S4277.1), bHLH (Pg_S0724.61, Pg_S0734.14 and Pg_S0817.8), NAC (Pg_S2569.3, Pg_S1059.27 and Pg_S3248.6), MADS (Pg_S1390.1), GRAS (Pg_S0325.10) and C2H2 (Pg_S6161.2) (Fig. 6B). These TFs were highly related to all the genes that encode scrK, GMPP, PMM, GALE, sacA and SUS in the brown module, except for Pg_S3338.6 encoding GMPP (Fig. 6D). These results suggested that these TFs might regulate the expression of genes related to ginseng polysaccharides synthesis.
qRT-PCR validation
To verify the gene expression levels produced by RNA-Seq, we performed qRT-PCR analysis on ten independent samples. We selected 10 DEGs in six compared groups, and as expected, 10 DEGs exhibited similar expression tendencies. Finally, the results show that the RNA-Seq data are accurate and useful (Additional file 1: Fig. S6).