Characteristics of ‘Damaohua’ and ‘Yujin 2’ transcriptome
A total of 42 samples (‘Damaohua’ and ‘Yujin 2’, respectively, have seven stages, each with three independent replicates) were sequenced using the Illumina HiSeq X Ten sequencer (Illumina Inc., USA), from which 311 Gb of raw data was harvested. Using a pair-end sequencing strategy, 2G raw reads with mean lengths of 150 nt were obtained from the 42 samples with 47.62 M reads for each sample. After the filtering of raw reads, 1.95 G reads were counted, with an average of 46.43 M reads per sample (Additional file 1: Table S1). Following the removal of redundant sequences and quantity control, 133,487 unigenes were assembled. The average length of the unigenes was 1,047 bp with 45.75% GC and 1642nt N50. The longest transcript was 16,864 bp, and the shortest was 301 bp.
The unigenes were annotated to obtain functional information using seven separate databases, including NCBI non-redundant protein (NR), Clusters of Orthologous Groups (COG) for eukaryotic complete genomes, Gene Ontology (GO), Swiss-Prot, evolutionary genealogy of genes: Non-supervised Orthologous Groups (eggNOG), and the Kyoto Encyclopedia of Genes and Genomes (KEGG) databases by Diamond Software, and mapped to Pfam databases by HMMER. Overall, 66.79% (133,487 unigenes) were successfully annotated, among which most unigenes (66.46%) were assigned to annotation terms in the NR database (Table 1). Further, 61.43% of the unigenes were annotated on eggNOG, 46.62% on Swiss-prot, 43.46% on GO, 35.76% on KOG, and 26.02% on the KEGG pathway (Table 1).
Table 1
Percentage of annotated unigenes.
Database | Count of annotated unigenes% |
NR | 66.46% |
KOG | 35.76% |
GO | 43.46% |
Swiss-prot | 46.62% |
eggNOG | 61.43% |
KEGG | 26.02% |
Pfam | 0.13% |
Total | 66.79% |
According to the NR annotation results, the current study compared L. japonica with taxonomic species stored in the NR database. A total of 97247 sequences were matched with L. japonica unigenes, which covered 1512 taxonomy categories (Additional file 2: Table S2). The highest ranked species was Alternaria alternate, which took up 12.82%, whereas the second highest-ranked was Vitis vinifera, which contributed 8.80% (Fig. 1).
Differentially expressed genes (DEGs)
When comparing seven flowering stages of expressed genes between ‘Damaohua’ and ‘Yujin 2’, 73,088 differentially expressed genes exhibited remarkable changes. A total of 5,355 genes were differentially expressed, with 2,864 genes up-regulated and 2491 genes down-regulated at the S1 stage. At the S2 stage, 21,159 genes were differentially expressed, with 12,734 genes up-regulated and 8,425 genes down-regulated. At the S3 stage, 19,126 genes were differentially expressed, with 10,343 genes up-regulated and 8,783 genes down-regulated. At the S4 stage, 11,153 genes were differentially expressed, with 6,029 genes up-regulated and 5,124 genes down-regulated. At the S5 stage, 27,718 genes were differentially expressed, with 12,102 genes up-regulated and 15,616 genes down-regulated. At the S6 stage, 18,652 genes were differentially expressed, with 8,486 genes up-regulated and 10,166 genes down-regulated. Finally, at the S7 stage, 38,162 genes were differentially expressed, with 21,153 genes up-regulated and 17,009 genes down-regulated (Fig. 2). As shown in the figures, the number of up-regulated DEGs was higher than those that were down-regulated.
KEGG-pathway analysis of differentially expressed genes
Unigenes were assigned to KEGG pathways based on annotation data. DEGs were enriched into 181, 202, 202, 191, 207, 204, and 210 pathways (S1, S2, S3, S4, S5, S6, and S7) (Additional file 3: Table S3). At the S1 stage, 53 pathways were enriched with a statistically significant enrichment p-value. Plant hormone signal transduction (ko04075), and phenylpropanoid biosynthesis (ko00940) during the entire process, as well as up-regulation and down-regulation processes, played active roles, and the expression was significant. Amino sugar and nucleotide sugar metabolism (ko00520) were also active during the entire process and up-regulation, while they were not active in the down-regulation process. Starch and sucrose metabolism (ko00500) were active during the entire process and down-regulation process, while they were not active in the up-regulation process (Figs. 1A).
At the S2 stage, 72 pathways were enriched with a statistically significant enrichment p-value, which was more than the other six paired comparison phases. Endocytosis (ko04144), plant hormone signal transduction (ko04075), and starch and sucrose metabolism (ko00500) during the entire process, as well as up- and down-regulation processes played an active role, and the expression was significant. Phenylpropanoid biosynthesis (ko00940) was active during the entire process and the down-regulation process, while it decreased in the up-regulation process (Figs. 1B).
At the S3 stage, 66 pathways were enriched with a statistically significant enrichment p-value. Plant hormone signal transduction (ko04075), phenylpropanoid biosynthesis (ko00940), starch and sucrose metabolism (ko00500) during the entire process, and the up- and down-regulation processes played an active role, and the expression is significant. Glycolysis/Gluconeogenesis (ko00010) was active during the entire process and the down-regulation process, while it was not active in the up-regulation process (Figs. 1C).
At the S4 stage, 71 pathways were enriched with a statistically significant enrichment p-value, and plant hormone signal transduction (ko04075) during the entire process, the up- and down-regulation processes played an active role, and the expression was significant. Phenylpropanoid biosynthesis (ko00940) and cell cycle (ko04110) were active during the entire process and the up-regulation process, while they were not active in the down-regulation process. Starch and sucrose metabolism (ko00500) were active during the entire process and the down-regulation process, while they were not active in the up-regulation process. Cell cycle-yeast (ko04111) was triggered in the up-regulation process, while carbon metabolism (ko01200) was triggered in the down-regulation process (Figs. 1D).
At the S5 stage, 63 pathways were enriched with a statistically significant enrichment p-value. Plant hormone signal transduction (ko04075) and phenylpropanoid biosynthesis (ko00940) played active role during the entire process as well as up- and down-regulation processes, and the expression was significant. Glycerophospholipid metabolism (ko00564), starch and sucrose metabolism (ko00500) were active in the total process and the up-regulation process, which were not active in the down-regulation process. Glutathione metabolism (ko00480), drug metabolism - cytochrome P450 (ko00982), and metabolism of xenobiotics by cytochrome P450 (ko00980) were triggered in the down -regulation process (Figs. 1E).
At the S6 stage, 31 pathways were enriched a with statistically significant enrichment p-value. Plant hormone signal transduction (ko04075), phenylpropanoid biosynthesis (ko00940), starch and sucrose metabolism (ko00500), glycolysis/gluconeogenesis (ko00010) played an active role during the entire process, the up- and down-regulation process, and the expression was significant. Valine, leucine and isoleucine degradation (ko00280) was triggered in the up-regulation process (Figs. 1F).
At the S7 stage, 39 pathways were enriched with a statistically significant enrichment p-value. Fatty acid metabolism (ko01212) was active during the entire process, but was not in the up- and down-regulation process. Plant hormone signal transduction (ko04075), phenylpropanoid biosynthesis (ko00940), starch and sucrose metabolism (ko00500) were active during the entire process and up-regulation process, while they were not in the down-regulation process. Tyrosine metabolism (ko00350) was active during the entire process and the down-regulation process, while it was not in the up-regulation process. Spliceosome (ko003040) was triggered in the up-regulation process, whereas MAPK signaling pathway-yeast (ko04011), as well as glycine, serine and threonine metabolism (ko00260) were triggered in the down -regulation process (Figs. 1G).
Summarizing the above KEGG pathways enriched during the flowering period, it was found that phenylpropanoid biosynthesis and flavonoid biosynthesis were active through the entire flowering process and nearly up-regulated at each stage, Peroxisome(ko04146), Carbon metabolism༈ko01200༉ and Amino sugar and nucleotide sugar metabolism༈ko00520༉can be found in each period(Fig. 3A) ,and carotenoid biosynthesis was down-regulated from S1-6. The anthocyanin precursor, phenylalanine, was enhanced at the S1 phase; however, the synthesis of anthocyanin was weakened at the S5 and S6 phases (Table 2).
Table 2
Enriched KEGG pathways by up-regulated and down-regulated genes.
KEGG Pathways | S1 | S2 | S3 | S4 | S5 | S6 | S7 |
Phenylpropanoid biosynthesis | ↑↓ | ↑↓ | ↑↓ | ↑ | ↑↓ | ↑↓ | ↑ |
Carotenoid biosynthesis | ↓ | ↑↓ | ↑↓ | ↓ | | ↑↓ | ↑ |
Flavonoid biosynthesis | ↑ | ↑↓ | ↑ | ↑ | ↑ | ↑ | ↑ |
Phenylalanine metabolism | ↑ | | | | | | ↓ |
Phenylalanine, tyrosine and tryptophan biosynthesis | ↑ | | | | | | |
Anthocyanin biosynthesis | | | | | ↓ | ↓ | |
*↑represents KEGG pathways enriched by up-regulated genes;↓represents KEGG pathways enriched by down-regulated genes. |
|
Gene Ontology (GO) analysis of differentially expressed genes
Differentially expressed genes were annotated by Gene Ontology terms (Additional file 4: Table S4). Genes were classified into three GO categories, that is, Biological Process, Cellular Component, and Molecular Function. From the Biological Process, auxin-activated signaling pathway (GO:0009734), response to abscisic acid (GO:0009737), response to wounding (GO:0009611) were remarkable at the S1 stage. The negative regulation of the hydrogen peroxide metabolic process (GO:0010727), positive regulation of the receptor biosynthetic process (GO:0010870), cellular response to topologically incorrect protein (GO:0035967) were remarkable at the S2 stage. The regulation of the transcription of nuclear large rRNA transcript from RNA polymerase I promoter (GO:1901836), nucleolar chromatin organization (GO:1990700), and regulation of superoxide anion generation (GO:0032928) were remarkable at the S3 stage.
The priming of the cellular response to stress (GO:0080136), cellular response to sucrose starvation (GO:0043617), and the cellulose metabolic process (GO:0030243) were remarkable at the S4 stage. The negative regulation of the gibberellin biosynthetic process (GO:0010373), guanosine tetraphosphate metabolic process (GO:0015969), and the regulation of the transcription of nuclear large rRNA transcript from RNA polymerase I promoter (GO:1901836) were remarkable at the S5 stage. The regulation of the transcription of nuclear large rRNA transcript from RNA polymerase I promoter (GO:1901836), nucleolar chromatin organization (GO:1990700), and the establishment of protein localization to endoplasmic reticulum membrane (GO:0097051) were remarkable at the S6 stage. The negative regulation of the extrinsic apoptotic signaling pathway in absence of ligand (GO:2001240), the phosphatidylinositol-3- phosphate biosynthetic process (GO:0036092), and the regulation of the embryo sac egg cell differentiation (GO:0045694) were remarkable at the S7 stage (Figs. 2).
From the Cellular Component process, apoplast (GO:0048046), plant-type cell wall (GO:0009505), and cell wall (GO:0005618) were remarkable at the S1 stage. Inclusion body (GO:0016234), nitrite reductase complex [NAD(P)H] (GO:0009344), and plasma membrane (GO:0005886) were remarkable at the S2 stage. Nucleolar chromatin (GO:0030874) vacuole (GO:0005773), and chloroplast (GO:0009507) were remarkable at the S3 stage. Anchored component of plasma membrane (GO:0046658), apoplast (GO:0048046), and plasma membrane (GO:0005886) were remarkable at the S4 stage. Nucleolar chromatin (GO:0030874), P4 peroxisome (GO:0019822), chloroplast (GO:0009507) were remarkable at the S5 stage. Nucleolar chromatin (GO:0030874), chloroplast (GO:0009507), vacuole (GO:0005773) were remarkable at the S6 stage. The DNA topoisomerase complex (ATP-hydrolyzing) (GO:0009330), nuclear ubiquitin ligase complex (GO:0000152), and TRAPPII protein complex (GO:1990071) were remarkable at the S7 stage (Figs. 2).
From the Molecular Function process, DNA binding transcription factor activity (GO:0003700), aspartic-type endopeptidase activity (GO:0004190), hydrolase activity, and acting on ester bonds (GO:0016788) were remarkable at the S1 stage. Tau protein binding (GO:0048156), euchromatin binding (GO:1990188), and IAA-Phe conjugate hydrolase activity (GO:0010210) were remarkable at the S2 stage. IAA-Phe conjugate hydrolase activity (GO:0010210), IAA-Leu conjugate hydrolase activity (GO:0010211), and dehydroquinate synthase activity (GO:0102042) were remarkable at the S3 stage. Glucuronoxylan 4-O-methyltransferase activity (GO:0030775), DNA binding transcription factor activity (GO:0003700), sequence-specific DNA binding (GO:0043565) were remarkable at the S4 stage. Euchromatin binding (GO:1990188), malonyl-CoA decarboxylase activity (GO:0050080), and glutamine-phenylpyruvate transaminase activity (GO:0047316) were remarkable in the S5 stage; euchromatin binding (GO:1990188), IAA-Phe conjugate hydrolase activity (GO:0010210), IAA-Leu conjugate hydrolase activity (GO:0010211) were remarkable at the S6 stage. Arylacetonitrilase activity (GO:0047428), euchromatin binding (GO:1990188), and calcium-independent phospholipase A2 activity (GO:0047499) were remarkable at the S7 stage (Figs. 2).
From seven periods,defense response to bacterium (GO:0042742)༌transcription, DNA-templated (GO:0006351)༌regulation of transcription, DNA-templated (GO:0006355)༌vacuole (GO:0005773)༌vacuolar membrane (GO:0005774)༌extracellular region (GO:0005576)༌chloroplast (GO:0009507)༌nucleus (GO:0005634)༌integral component of membrane (GO:0016021)༌plasma membrane (GO:0005886)༌cytoplasm (GO:0005737)༌sequence-specific DNA binding (GO:0043565)༌DNA binding transcription factor activity (GO:0003700)༌metal ion binding (GO:0046872)༌ATP binding (GO:0005524) were found at the every period(Fig. 3B)
Anthocyanin metabolism analysis of differentially expressed genes
Sequencing results revealed that 114 unigenes were associated with anthocyanin metabolism, and 72 were significantly upregulated or downregulated across the seven flowering stages. The number of up-regulated unigenes was highest at the S7 stage, while the number of down-regulated unigenes was highest at the S5 stage (Fig. 4A). According to the analysis of TFs in anthocyanin metabolism, we obtained 47 genes relating to transcription factors in anthocyanin metabolism that belonged to 18 families, of which bHLH had the largest number, followed by GeBP (Additional file 5: Table S5, Fig. 4B). According to the anthocyanin heatmap analysis, the expression of genes at S7 was different from that at other stages, with the highest up-regulated expression. The expression of genes at S6 was similar to that at S5, while those at S4, S3, S2, and S1 were similar, among which the expression of genes in S1 and S2 were the most similar (Fig. 4C).
qRT-PCR verification
To validate the accuracy and repeatability of our RNA-Seq data, the current study selected six genes for quantitative real-time polymerase chain reaction (qRT-PCR) analysis, with gene-specific primers designed using Primer software (version 5.0), as shown in Additional file 6: Table S6. As shown in Fig. 5, the results of qRT-PCR indicated that most of these genes had expression patterns that correlated with the RNA-Seq data, which confirmed the reliability of our data. The genes LjDFR (TRINITY_DN37612_c0_g1_i2_29), LjABCB1 (TRINITY_DN23592_c0_g2_i2_8), LjMYC6 (TRINITY_DN17985_c0_g1_i2_8), LjDDB2 (TRINITY_DN25019_c0_g2_i6_8), and LjANS (TRINITY_DN27734_c0_g1_i1_12) rapidly increased during the first three stages. However, the expression of only LjF3'5'H (TRINITY_DN27086_c0_g1_i1_29) was significantly down-regulated at S5, which was consistent with anthocyanin accumulation.