Morphological changes and ascorbic acid accumulation at six different stages of Actinidia latifolia fruit development and ripening
Visual inspection of six different stages corresponding to 30, 60, 90, 120, 150, and 170 days after flowering (DAF) of A. latifolia fruit development and ripening is presented in Fig. 1a. The quantification of AsA content was evaluated by HPLC (Fig. 1b). Statistical evaluation results of ANOVA showed that the different stages of A. latifolia fruit development and ripening had no significant effect on the AsA content, however, it can be found that the initial expansion stage was characterized by a rapid increase of AsA content, reaching the maximum level at 60 DAF (1194.21 ± 69.25 mg 100 g− 1 FW). Later, a decrease at the veraison stage (60–90 D AF) was followed by a gradual increase in the early ripening stage (90–120 DAF). The AsA content showed a declining trend from 120 to170 DAF and reached the minimum levels (1028.76 ± 31.19 mg 100 g− 1 FW) at full maturity. The details of AsA content changes over the time course expressed as mean ± standard error (SE) in FW basis are shown in Fig. 1b.
PacBio SMRT sequencing-based full-length transcriptome atlas of Actinidia latifolia fruit
To capture as many transcripts of A. latifolia fruit as possible, a total of 18 RNA samples from six stages of A. latifolia fruit maturation were equally pooled together for library preparation (1–6 kb libraries). A total of 19.50 Gb subreads were obtained from offline data. The strict screening criteria (full passes greater than 1.0 and accuracy greater than 0.90) led to 358,138 CCS reads, comprising 743,399,287 read bases, with an average read length of 2,087 bp. After error correction, a total of 34,148 polished high-quality isoforms were obtained, among which, 91.80% (326,926 reads) were filtered as the FLNC reads of A. latifolia fruit transcriptome. The FLNC read length distribution ranged from 700 to 3500 bp, and its overall distribution of each size bin agreed with the size of its cDNA library (Fig. 2a).
Functional annotation of Actinidia latifolia transcripts with multiple databases
Transcripts were scanned against and successfully annotated by Nr, GO, COG, KOG, eggNOG, KEGG, Pfam, and Swiss-Prot databases, whose integrated alignment results are summarized in Table S1. Based on Nr functional annotation, the best blast hit of homologous species with A. latifolia fruit was Vitis vinifera (5689 isoforms, 31.33%), followed by Sesamum indicum (1260, 6.73%) and Theobroma cacao (1082, 5.78%) (Fig. 2b). A total of 15,113 transcripts were successfully annotated with GO database and classified into three categories, including biological process (BP), cellular component (CC) and molecular function (MF). The genes under cellular process (8291 matched genes, 54.86%), metabolic process (8072, 53.41%), and single-organism process (5540, 36.66%) were highly represented in the BP category. Cell (8924 matched genes, 59.05%) was the most abundant subcategory within CC, followed by cell part (8911, 58.96%) and organelle (6340, 41.95%). In the category of MF, catalytic activity (7823 matched genes, 51.76%) was the most prominently represented, followed by binding (7672, 50.76%), transporter activity (1023, 6.77%) (Fig. 2c).
Structural analysis of the full-length transcriptome of Actinidia latifolia
A total of 15,505 CDS were identified. The frequencies for each length of CDS were evaluated with the most frequent length ranging from 100 to 1200 bp (Fig. S1a). Furthermore, a total of 2436 genes were predicted to be TFs by predicting non-redundant transcript via iTAK software. The TFs were classified into different families, among which the most abundant type identifed was C3H (84 matched genes), followed by GRAS (76), AP2/ERF-ERF (68), MYB-related (56), B3-ARF (56), CAMK_CDPK (55), C2H2 (54), and RLK-Pelle_LRR_Xl-1 (51) (Fig. S1b). A total of 3328 lncRNA candidates were identified, and 447, 445, 2459, and 1332 of them were identified by CNCI, CPC, Pfam and CPAT prediction results, respectively. Comparison results proved that 169 transcripts were simultaneously identified by the four computational approaches (Fig. 2d). In addition, transcripts were subjected to SSR analysis via MISA, and a total of 18,797 SSR including seven SSR types (i.e., mononucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide, hexanucleotide and compound nucleotides) were detected. The dinucleotide SSR loci exhibited the highest frequency, followed by mononucleotide and the compound SSR types (Fig. S1c). A detailed breakup showing SSR types from PacBio is summarized in Table S2.
Differentially expressed genes identified in the comparative Illumina transcriptome analysis
The non-redundant transcripts obtained above was used a reference for Illumina sequence alignment and subsequent analysis. A total of 18 cDNA libraries for Illumina sequencing were generated from six different stages, with three biological replicates per stage. DEG identification was independently performed using a pair-wise comparison between developmental stage and the baseline control (DAF30). Based on the adopted cutoff (FDR < 0.01 and absolute fold change ≥ 2), the numbers of all genes displaying both significantly up- and downregulated profiles are presented in Fig. 3. By using FPKM values, a total of 3788, 3485, 4315, 5265 and 6079 DEGs were identified in the comparisons of DAF30_vs_60, DAF30_vs_90, DAF30_vs_120, DAF30_vs_150, and DAF30_vs_170 (the former is used as a baseline control, and the latter is the treatment group), respectively. The highest number of DEGs was identified in the comparison of DAF30_vs_170 (baseline_vs_ripening), comprising 2632 upregulated and 3447 downregulated DEGs (Fig. 3).
Functional annotation and categorization of differentially expressed genes
The annotation, pathway and functional categorization of the DEGs were analyzed based on COG, GO, KEGG, KOG, Pfam, Swiss-Prot, eggNOG and Nr databases. Assignments indicated that at least 97.79% DEGs were functionally annotated at multiple databases (Table 1). Analysis of GO categories showed that the functional distribution of DEGs in the comparable groups was similar. In detail, the BP category of GO terms were primarily grouped into cellular process, metabolic process and single-organism process. The CC category were mainly assigned to cell, cell part and membrane. Catalytic activity, binding and transporter activity were prominent in the MF category (Fig. S2). A pathway-based categorization of orthologous genes, KEGG pathway enriched analysis, was carried out to predict the functional profiles and biological significances of DEGs identified during the six different stages of A. latifolia fruit development and ripening. Plant hormone signal transduction, carbon metabolism, biosynthesis of amino acids, starch and sucrose metabolism, and protein processing in endoplasmic reticulum were the most significantly enriched metabolic pathways (Fig. S3).
Table 1
Annotation summary of every DEG set between six developmental stages of Actinidia latifolia fruit development and ripening
DEG_seta
|
Annotatedb
|
COGc
|
GOd
|
KEGGe
|
KOGf
|
Pfamg
|
Swiss-Proth
|
eggNOGi
|
Nrj
|
DAF30_vs_DAF120
|
4,232
|
1,993
|
3,435
|
1,687
|
2,207
|
3,823
|
3,474
|
4,142
|
4,209
|
DAF30_vs_DAF150
|
5,176
|
2,477
|
4,187
|
2,093
|
2,771
|
4,663
|
4,232
|
5,093
|
5,156
|
DAF30_vs_DAF170
|
5,991
|
2,832
|
4,841
|
2,386
|
3,305
|
5,383
|
4,807
|
5,893
|
5,964
|
DAF30_vs_DAF60
|
3,718
|
1,736
|
2,940
|
1,455
|
2,053
|
3,306
|
2,983
|
3,664
|
3,709
|
DAF30_vs_DAF90
|
3,405
|
1,542
|
2,704
|
1,234
|
1,760
|
3,045
|
2,756
|
3,326
|
3,381
|
aDEG: differentially expressed gene. bAnnotated: number of differentially expressed transcripts annotated. cCOG: clusters of orthologous groups. dGO: gene ontoglogy. eKEGG: kyoto encyclopedia of genes and genomes. fKOG: eukaryotic orthologous groups. gPfam: protein family. hSwiss-Prot: a manually annotated and reviewed protein sequences database. ieggNOG: evolutionary genealogy of genes, non-supervised orthologous groups. jNr: non-redundant protein sequence database. |
Identification of key network modules associated with AsA content by weighted gene co-expression network analysis
To further unveil the crucial regulatory genes associated with AsA accumulation during A. latifolia fruit development and ripening, we performed WGCNA with all DEGs identified. The co-expression network was constructed, and the genes with similar expression patterns were clustered together into eight distinct modules with the average linkage hierarchical clustering (Fig. 4a). Eight different colors including blue, brown, cyan, yellow, salmon, red, tan and grey were used to represent distinct modules, containing 764, 2688, 316, 212, 77, 993, 80, and 27 transcripts, respectively (Table S3). WGCNA also allowed the assessment of the overall correlation of the modules with AsA content. The blue and brown modules that possibly related to AsA content were found (Fig. 4b). Analysis of the module and AsA content relationship revealed the blue module (r = 0.5, p = 0.03) was significantly (p < 0.05) associated with AsA content during A. latifolia fruit development and ripening (Fig. 4c). In the following analysis, the gene significance and module membership of the blue (cor = 0.7, p = 1.6e-113), brown (cor = 0.019, p = 0.32), cyan (cor = 0.2, p = 0.00035), yellow (cor = 0.14, p = 0.042), red (cor = 0.45, p = 1.1e-50), salmon (cor = 0.031, p = 0.79), and tan (cor=-0.25, p = 0.025) modules were calculated (Fig. S4).
The highest association in the module-trait relationship was found between blue module and AsA content, followed by brown module (Fig. 4b). Therefore, these two modules were selected as module of interest and studied in subsequent biological and functional interpretations (Fig. 5). Other modules without statistical significance with AsA content were not further considered. To evaluate the biological significance of module of interest, KEGG pathway enrichment analysis was performed. The blue module was enriched with genes associated with protein processing in endoplasmic reticulum (ko04141), glycolysis/gluconeogenesis (ko00010), and carbon metabolism (ko01200) (Fig. 5a). However, genes associated with ascorbate and aldarate metabolism (ko00053) was only enriched in the brown module, and the absolute value of gene significance in the brown module enriched with genes involved in ascorbate and aldarate metabolism (ko00053) is generally lower than that in several pathways including plant hormone signal transduction (ko04075), starch and sucrose metabolism (ko00500), phenylpropanoid biosynthesis (ko00940), and alpha-linolenic acid metabolism (ko00592) (Fig. 5b).
Expression patterns of genes involved in ascorbic acid biosynthesis during fruit development and ripening of Actinidia latifolia verified by qRT-PCR analysis
To investigate the molecular mechanism regulating AsA biosynthesis in A. latifolia fruit development and ripening, the expression levels of AsA biosynthesis genes in different developmental stages were further examined by qRT-PCR, as illustrated in Fig. 6. L-galactose pathway, L-gulose, myo-inositol and D-galacturonic acid pathways were identified. The overall gene expression levels of eight enzymes, including L-galactose dehydrogenase (GalDH), GDP-L-galactose phosphorylase (GGP), GDP-mannose-3’,5’-epimerase (GME), L-galactose-l-phosphate phosphatase (GPP), GDP-mannose pryophosphorylase (GMP), phosphoglucoisomerase (PGI), myo-inositol oxygenase (MIOX), and ascorbate peroxidase (APX), exhibited a similar trend with the cumulative content of AsA described above (Fig. 1). The L-galactose pathway and myo-inositol pathway were the predominant pathways regulating the high AsA content in A. latifolia fruit development and ripening (Fig. 6).