Transcriptome sequencing and assembly
A total of 118.98 Gb clean data from four floral parts with three biological replications each was acquired following a systematic quality check and data filtering. The base number of each sample was greater than 8.5 Gb (Table 1). The percentages of the GC bases of all 12 samples were greater than 44%, and the percentages of the reads with an average quality value greater than 30 were over 89%. These parameters confirmed that sequencing data were reliable and appropriate for further analysis.
A total of 76,007 unigenes were obtained with median contig (N50) length of 1639 bp and the mean length of 965.14 bp (Table 2). The distribution of the unigene numbers in different length ranges were highly uniform (Additional Fig. 1). Hence, it is reasonable to assume that all assembled unigenes have a high degree of integrity.
Tabal 1. Summary of the sequencing results for four floral organs in tree peony |
Samples | Number | Read Number | Base Number | GC Conten | %≥Q30 |
Petal | PE1 | 30,505,090 | 9,088,941,390 | 44.52% | 89.99% |
| PE2 | 34,668,124 | 10,331,991,108 | 44.57% | 90.58% |
| PE3 | 33,988,044 | 10,138,469,592 | 45.40% | 91.04% |
Sepal | SE1 | 32,785,787 | 9,774,077,598 | 44.89% | 91.20% |
| SE2 | 31,070,742 | 9,253,816,890 | 44.76% | 91.13% |
| SE3 | 29,309,927 | 8,742,359,852 | 44.83% | 90.84% |
Stamen | ST1 | 36,870,554 | 10,982,297,828 | 45.40% | 92.12% |
| ST2 | 32,823,379 | 9,781,335,460 | 45.41% | 92.15% |
| ST3 | 35,848,484 | 10,696,258,334 | 45.46% | 91.78% |
Carpel | CA1 | 34,168,656 | 10,192,298,092 | 44.54% | 91.52% |
| CA2 | 35,251,448 | 10,516,365,094 | 44.32% | 91.63% |
| CA3 | 31,763,113 | 9,479,948,186 | 44.49% | 91.36% |
Note: %≥Q30 represents the percentage of the base with the quantity be equal or greater than 30 in clean data. |
Table 2. Summary of the assembly for four floral organs in the tree peony |
Length Range | Transcript | Unigene |
200–300 | 23,645(13.32%) | 18,966(24.95%) |
300–500 | 26,606(14.99%) | 15,324(20.16%) |
500–1000 | 45,226(25.48%) | 18,128(23.85%) |
1000–2000 | 47,716(26.88%) | 13,952(18.36%) |
2000+ | 34,315(19.33%) | 9,637(12.68%) |
Total Number | 177,508 | 76,007 |
Total Length | 223,165,477 | 73,357,029 |
N50 Length | 1,903 | 1,639 |
Mean Length | 1257.21 | 965.14 |
Note: N50 Length represents the length of the unigene of N50. |
Functional annotation and clustering of gene expression profiles of tree peony floral parts transcriptome
BLAST was used to annotate unigenes by databases including Nr, Swiss-Prot, GO, COG, KOG, eggNOG4.5 and KEGG, resulting in 30,505 (40%) successfully annotated unigenes (Fig. 2A). Nighty eight percent of total annotated unigenes, or 29,872 of 30,505 were perfectly matched in Nr, while the smallest number of unigenes (8,321) were hit in KEGG. All unigenes were classified using a COG database search. In total, 8,321 unigenes were annotated in 25 COG classifications. Among them, the term ‘general function prediction only’ (2042, 24.5%) represented the highest number of unigenes, while ‘extracellular structures’ was represented by zero (Fig. 2B).
For the GO annotation, a total of 17,184 annotated unigenes were classified into three principal categories which were further subdivided into 52 functional terms (Fig. 2C). Among them, ‘cell’ (7113, 41.4%), ‘cell part’ (7113, 41.4%) and ‘organelle’ (5304, 30.9%) were the dominant function terms in ‘cellular component’ principal category. The term which was most represented in ‘molecular function’ principal category was ‘binding’ (9171, 53.4%), followed by ‘catalytic activity’ (8950, 52.1%). Within the ‘biological process’ principal category, unigenes (11857, 69%) belonged to ‘metabolic process’.
Identification of different expression unigenes and clustering of gene expression profiles of tree peony floral parts transcriptome
Differences of gene expression profiles among four floral parts were examined by pairwise comparisons of the 12 libraries. A total of 11,801 (36.7%) unigenes were shared among all libraries, while approximate 14.3% of unigenes (4355) was detectable in individual floral parts. For petal, there were 328 unique unigenes which was the smallest number of unigenes of all four parts. Pistil had the largest number of unique unigenes, followed by stamen (1129) and calyx (935) (Fig. 3A). Those genes expressed in certain floral parts, probably played important roles in development of the coincident floral parts.
Based on the six comparisons of Petal Vs Sepal, Petal Vs Stamen, Petal Vs Carpel, Sepal Vs Stamen, Sepal Vs Carpel and Stamen Vs Carpel, we identified 9592 different expression unigenes (DEGs) in total (Fig. 3C). Among them, comparison of Petal Vs Sepal had the smallest number of 1706 (17.8%) DEGs, while comparison of Sepal Vs Stamen had the largest number of 5517 (57.5%) DEGs followed closely by Stamen Vs Carpal 5310 (55.4%) and Petal Vs Stamen 3505 (36.5%).
To further investigate the gene expression profiles, we performed K-Means clustering of gene expression levels of DEGs. We identified 9 gene clusters with distinctive expression patterns (Fig. 3B). Cluster 7 and Cluster9 containing 499 and 452 unigenes respectively were showing opposite patterns. While Cluster 7 showed gradual increase from petal to pistil, cluster 9 showed gradual decrease, indicating different roles regarding the floral organ development. Cluster 1, 5 and 6 including 1342, 894 and 550 unigenes were showing similar patterns with up-regulated unigenes in stamen, on the contrary, Cluster 2, 3, 4 and 8 containing 1320, 1311, 1096 and 466 respectively with down-regulated unigenes in stamen. These facts indicted that stamen might be the floral part contained more complex molecular process than others.
KEGG pathway enrichment of DEGs
In order to identify active pathways in the four floral organs in tree peony, the DEGs were annotated in the KEGG pathways. KEGG pathway enrichment analysis was performed to search the dominant pathways in the four floral organs. Specific enrichment of genes was obtained for 102 pathways at Petal vs Calyx, for 113 pathways at Petal vs Stamen, for 102 pathways at Petal vs Pistil, for 116 pathways at Calyx vs Stamen, for 111 pathways at Calyx vs Pistil and for 115 pathways at Stamen vs Pistil. Among the six paired comparisons, a total of 5,616 unigenes were mapped in KEGG pathways including 324, 505, 361, 821, 571 and 835 DEGs respectively in Petal Vs Calyx, Petal Vs Stamen, Petal Vs Pistil, Calyx Vs Stamen, Calyx Vs Pistil and Stamen Vs Pistil. The results of each comparison are listed in the Fig. 4. These pathways were divided into five groups, which include “Cellular Process,” “Environmental Information Processing,” “Genetic Information Processing,” “Metabolism,” and “Organizational systems.” A common theme was identified through these six paired comparisons and the dominant pathways were annotated with either a circle or a triangle (Fig. 4). In the “Metabolism” group, the total numbers of DEGs regulating carbon metabolism, photosynthesis, biosynthesis of amino acids, starch and sucrose metabolism and phenylpropanoid biosynthesis were shown in greater abundance and these DEGs may be critical in the formation of flower parts in tree peony. In the “Environmental Information Processing” group, the total number of DEGs for “plant hormone signal transduction” was also high.
Key genes involved in regulating flowering time in tree peony
The KEGG pathway analysis revealed a general view of the involvement of many biological processes among the floral parts development. To explore the detail expression patterns of the key genes involving in flowering time pathways, BlastT was used to search the orthologous of these key genes in Arabidopsis thaliana. A total of 24,163 pairs of orthologs were identified. Among them, key genes related to flowering time were selected and performed expression analysis (Fig. 5). We found that genes such as red/far-red light signaling pathways (PHYA, PHYB, CRY1, ZTL) were lower expressed in stamen, which indicated that light signaling was less involved in stamen development. The expression level of genes related to floral integrators and flora parts development (AP1, SOCI, FT, FLC, LHY, SVP) were showed expected patterns which were high expressed in petal or in sepal. The blast search also displayed that genes involved in autonomous pathways (FCA, FY, FLD, FVE, FPA) were all shared similar expression patterns which reached peaks in carpal and throughs in stamen, which indicated that these genes more involved in carpal development.
Analysis of MADS-box genes in tree peony
A total of 1833 TFs were identified in our study, which represented 6% of annotated unigenes and fall into 205 TF families classified by plant transcription factor database PlantTFDB. Among these TF gene families, AP2/ERF-ERF (68) was the most abundant TF family, followed by C2H2 (63) and bHLH (61). Especially, a total of 29 MADS-box genes including 16 MADS-MIKC type were found in tree peony transcriptome data (Fig. 6A).
The MADS-box genes have been involved in many aspects of floral development and flower organs differentiation. To further examine the phylogenetic relationships among the tree peony MADS-box genes and to classified them into the established subfamilies, a phylogenetic Neighbor-joining tree containing MADS-box genes in tree peony, grape, kiwifruit and arabidopsis was made (Fig. 6B). The phylogenetic analysis revealed that 16 MADS-box genes were classified into 10 subfamilies including a new clade which was not reported in arabidopsis. In addition, a few key MADS-box genes described in ABCDE models were also included, such as AP1 (APETALA1), FUL (FRUITFULL), PI (PISTILATA), AP3 (APETALA3), AG (AGAMOUS), STK (SEEDSTICK), SHP (SHATERPROOF) and SEP (SEPALLATA). In the AP1/FUL subfamily (A class), c58746 was belonged to AP1 clade, while c87227 fell within AGL79. Two genes, c70349 and c80146, were assigned to PI/AP3 subfamily, which were functionally classified as B class. c57631, c77234 and c85161 belonged to AG/SHP subfamily in C/D class, which included AG, SHP and STK three subclade. c77234 was classified into AG clade in C class, while c57631 and c85161 were sorted into STK and SHP subclade respectively in D class. Only one gene c72216 was assigned to SEP subfamily in the E functional group. In conclusion, the phylogenetic relationships showed that the tree peony contained all five classes flower parts differentiation MADS-box genes.
To further explore how these regulators related to the flower developmental process, we examined the expression patterns of the 16 MADS-box genes (Fig. 6C). Genes including c77234 (C class) and c87227 were high expressed in all four floral parts, while c60425, c91686 and 75076 were nearly no expressed in all four floral parts. For stamen, c85161 (D class), c72042, c70349, and c80146 were of a high expression, meanwhile c70349 (B class), c80146 (B class), c87227, c58746 (A class) and c72216 (E class) were overexpressed in petal. Notably, c70164 and c69592 were specific expressed in sepal and c57631 (D class) in carpal.
Closer Examination of Six Key Genes Involved in Controlling the Development of Flower Forms Using Stamen Petalody Types
Out of the 16 MADS-Box genes examined is this study, six of them were specifically selected for determining their specific roles in controlling stamen petalody based on two criteria. They include significant gene expression in the specific flower parts and their fit for the ABCDE model. These six keys genes were AG, AP1-1, AP1-2, AP3, PI, and SEP, respectively. Eight tree peony cultivars classified to stamen petalody and non-stamen petalody groups were used to verify the functions of the six candidate genes by real-time PCR (Fig. 7).
The results showed that SEP had a high-level expression of sepals in both stamen petalody and non-stamen petalody cultivars. Meanwhile, AG showed similar expression profiles as SEP, but in carpals instead of sepals. AP3 showed high level expressions in petals and much lower level expressions in stamens in the non-stamen petalody group. However, the expression profiles of AP3 showed high level in both the ow-petals and the iw-petals in the stamen petalody group. The expression profiles of PI are similar to those of AP3. PI showed lower expression in stamen in the non-stamen petalody group and very higher expression in the stamen petalody cultivars. This indicates that PI likely has the similar function as AP3 in regulating stamen petalody in tree peony. The expression profiles of AP1-1 and AP1-2 were quite inconsistent in both stamen petalody types and non-stamen petalody types.