Developmental stages of cucumber fruit trichomes
Trichomes were widely distributed on the leaves, tendrils, stems, calyx and ovaries of cucumber. The trichomes could be divided into two types. Type I was generally visible to the naked eye, and it had an obvious spiny structure (Fig. 1A). The characteristics of type I trichomes included a pyramidal apical cell connected to a cylindrical structure consisting of 5~8 single cells, which formed the stalk. Below the stalk was a base made of hundreds of spherical cells. The structures of type I trichomes on different tissues were generally similar, but type I trichomes on the cucumber ovary were larger and harder than those on other tissues, so they were called fruit spines. Contrary to type I trichomes, type II trichomes were much smaller, and their top was composed of four cells that were not very distinct from each other. This top was connected to the epidermis by 3~5 cells (Fig. 1B).
Since the fruit spines on ovaries were more consistent than other type I trichomes on other tissues in the development state and their cells were harder and larger, making them convenient for slicing, we decided to focus on fruit spines as the research object in order to explore the developmental process of type I trichomes. Combined with the findings of previous studies [30], microscopy and SEM observations were used to divide the developmental process of spines into four stages (Fig. 2A-I).
In stage I, there were pus-like protrusions on the cucumber epidermis where the spines would eventually develop, and then the apical cells developed first in these protrusions; in stage II (Fig. 2A, D, G), the development of the stalk was initiated. The mode of development was as follows: the single cells that made up the stalk developed in the form of a relay; when one cell began to develop, the next cell was flat, similar to a cake. This ‘cake-like’ cell would not begin to lengthen until the last cell completely developed. This stage continued until the ovary grew to approximately 0.25~0.30 cm in length and the stalk of the spines had four complete cells. During stage I and stage II, the ovary was relatively thin, and because of the low rigidity of the newly developed spines, they clung to the ovary like “fetal hair”, while the population of cells that later developed into the base also underwent modest elongation but did not complete the initiation of development. At the start of stage III (Fig. 2B, E, H), the ovary swelled obviously and grew to 0.35 cm in length. The fifth single cell completed development, and the rigidity of the spines began to increase, causing the stalk to become erect. This stage continued until the ovary length was approximately 0.65 cm. The end of stage III was also the end of stalk development. The mature stalk was composed of 7~8 fully elongated single cells. Morphologically, the stalk was narrow at the top and wide at the bottom. The cells that developed later were larger than the ones that grew before, but the cell connected to the base (the last single cell) was shorter than other single cells, ensuring structural stability. In stage IV (Fig. 2C, F, I), the population of cells that made up the base began to multiply and expand in volume until development was completed, forming a spherical pedestal.
CsTs influences the development of spines after stage II
To determine the specific stages in which differences in spine form between the wild type (CsTs) and mutant (Csts) occurred, we continuously compared the spine development state at different stages on the ovary.
In stages I and II (Fig. 3A, D, G), the spine development state of the mutant was consistent with that of the wild type, and both had four complete single cells that clung to the ovary, similar to “fetal hair”. In stage III (Fig. 3B, E, H), starting with the fifth cell, all single cells that made up the stalk were not fully elongated like those in the wild type; instead, they were stacked together like a multilayer pie. At the same time, the growth and division mode of single cells (after the fifth single cell) also changed, with some cells becoming shorter and some dividing into multiple cells. In the mutant, stage IV (Fig. 3C, F, I) no longer seemed to exist, and the boundary between the base and stalk was no longer obvious. The base of spines was no longer a spherical structure but an oval structure that linked a shorter stalk and the epidermis of cucumber. The narrow lower part of the oval structure led to the unstable binding of spines on the epidermis, resulting in the spines of the mutant appearing to be soft and elastic. Finally, the mature spines were distorted due to disorder of the cell division mode, the spines were curved, and a large vacuole formed in the fifth single cell. From stage III, although the stalk (or spines) was erected due to an increase in its rigidity, the insufficient development of cells led to a disorganized arrangement of spines on the mutant ovary surface compared to the wild type ovary surface. We also observed hairs on receptacles of the wild type and mutant and found that they also had this structural difference (Fig. 4). However, no difference in root hair structure was found between the wild type and the mutant by either microscopy or SEM (Fig. 5).
Transcriptome analyses of cucumber fruit spines
To explore the gene networks involved in different stages of spine development in the wild type and mutant, we collected spines from three different developmental stages (stage II, stage III and stage IV) for RNA-seq. RNA-seq was performed for three biological replicates in each stage. On average, approximately 5.94 Gb bases was generated from each sample by the Illumina HiSeq platform. After mapping sequenced reads to the reference genome (cucumber 9930 v2 genome), a total of 23,943 genes were obtained from all samples, including 695 new genes. The data quality of RNA-Seq is summarized in Table 1. We also calculated correlations between the samples from different stages and materials to test whether the samples chosen were reliable. The three replicates at the three stages showed a good correlation (Fig. 6).
Using the thresholds of a false discovery rate < 0.05 and an absolute log2 ratio value ≥1.5, we divided the transcriptome data into two groups for analysis by comparing Fragments Per Kilobase of transcript per Million fragments mapped (FPKM) values from different libraries (Additional file 1). For group 1, we identified Differentially Expressed Genes (DEGs) with expression specificity at different developmental stages of fruit spines in the wild type. As shown in Fig. 7A, the expression levels of 389 genes were significantly upregulated in stage II compared with stage III and stage IV. A total of 414 DEGs were significantly downregulated in stage IV compared with stage II and stage III. There were also 722 DEGs with the highest expression levels in stage IV (Fig. 7B). For group 2, because stage III was the critical period for the difference between the wild type and mutant, we focused on DEGs that were not differentially expressed at stage II but were significantly differentially expressed at stage III between the wild type and mutant. A total of 628 genes showed this expression pattern in the transcriptome data, among which 249 genes were upregulated and 379 genes were downregulated in the mutant (Additional file 1). Although the mutation of CsTs (Csa1G056960) is the main reason for the phenotypic difference between wild-type and mutant, there was no significant difference in the expression level of CsTs between wild-type and mutant. At the same time, the expression of CsTs in both the mutant and the wild type did not change during the three developmental stages of fruit spines.
Functional annotation of key DEGs in stalk development stages of fruit spines
There were 389 DEGs with the highest expression levels at stage II in wild type (for convenience, we called them DEG-IIs). To evaluate the functional categories of the DEG-IIs, we compared these DEGs against the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases (Fig. 8A). At the same time, we also performed in-depth tracking by a GO acyclic graph to analyze the GO categories in which the DEG-IIs were enriched (Additional file 2). The GO categorization of DEG-IIs according to biological processes, molecular functions, and cellular components was analyzed. Categories based on biological processes demonstrated that the DEG-IIs were mainly enriched in lipid metabolic process, amino acid, iron transport, signal transduction and anatomical structure development and morphogenesis. When we classified the DEG-IIs according to cellular components, we found that the DEG-IIs were mainly involved in plasma membrane, cell wall and intracellular organelle. In the molecular function category, oxidoreductase activity, transferase activity and hydrolase activity were the subgroups with the highest numbers of DEG-IIs. We also mapped these DEG-IIs to the KEGG database to identify pathways in which DEG-IIs might be involved. The DEG-IIs were mainly enriched in amino acid metabolism, linoleic acid metabolism and plant pathogen interaction (Additional file 3 and Additional file 4). To date, a variety of phytohormones have been shown to be involved in the initiation and branching of plant trichomes [31, 32]. We also found that the DEG-IIs were involved in signal transduction and response for almost all kinds of phytohormones (Additional file 5), including auxin, ethylene (ET), abscisic acid (ABA), gibberellin (GA), cytokinin (CK), brassinolide (BR), jasmonic acid (JA), and salicylic acid (SA). Specifically, auxin polar transport was the category involving the most genes, including 19 genes. In addition, we found that most of the DEG-IIs related to phytohormones were also involved in tissue pattern formation (Additional file 5). Previous studies also reported that genes involved in meristem regulation may play an important role in the development of spines [23]. Among the DEG-IIs, we identified 30 genes involved in meristem regulation and cell cycle (Additional file 6). Some of their homologues have also been shown to participate in the polarity development of tissues. For example, AGO10 (encoded by Csa3G144740) regulates leaf polarity establishment by repressing miR165 and miR166 in Arabidopsis [33]. ATHB-14 (encoded by Csa6G525430) is involved in the fate regulation of adaxial-abaxial polarity in the ovule primordium in Arabidopsis [34]. Transcription factors (TFs) are the key nodes of gene regulatory networks, and expression level changes of transcription factor will affect the expression levels of many downstream genes. We identified 54 transcription factors among the DEG-IIs (Additional file 7). The 54 TFs could be subdivided into 20 categories according to their transcription factor family. The most common category was the HD-ZIP transcription factor family, which contains 6 genes. Some homologues of TFs in other plants have been identified to be involved in the initiation and development of tissues. For example, the homolog of Csa7G428260, encoding a C2H2 transcription factor, regulates the normal pattern of cell division, expansion and differentiation during morphogenesis of the silique in Arabidopsis [35]. The homolog of MYB21, encoded by Csa3G168940, regulates stamen filament elongation in the late developed flowers in Arabidopsis [36].
In addition, 414 genes showed no significant difference in expression level between stage II and stage III but were significantly downregulated at stage IV (for convenience, we called them DEG-II-IIIs), indicating that these genes may play a role in maintaining the development of the stalk. In the GO category biological process, DEG-II-IIIs were mainly enriched in lipid metabolic process, amino acid, ion transport, signal transduction, multicellular organismal development and response to endogenous hormone (Fig. 8B and Additional file 8). When classifying the DEG-II-IIIs according to GO cellular components, we found that DEG-II-IIIs were mainly involved in plasma membrane, cell wall and intracellular organelle. Compared to the DEG-IIs, more DEG-II-IIIs were associated with the cytoskeleton. Analysis of the GO category molecular function demonstrated that oxidoreductase activity, transferase activity, hydrolase activity and lyase activity were subgroups with the highest number of DEG-II-IIIs. The results of KEGG analysis for DEG-II-IIIs indicated that the biosynthesis pathways of amino acid metabolism, linoleic acid metabolism and phenylpropanoid biosynthesis were significantly enriched for the DEG-II-IIIs (Additional file 4 and Additional file 9). The DEG-II-IIIs included 36 TFs (Additional file 7), and these TFs belonged to 20 transcription factor families. HD-ZIP was still the family with the largest number of enriched genes, indicating that HD-ZIP transcription factors play an important role in the development of cucumber spines. For example, the homologs of Csa1G031750 and Csa1G064670, GL2 and GL11, are well-known TFs that can affect the development of trichome branching in Arabidopsis [37]. There were also genes associated with plant hormones found among the DEG-II-IIIs, while compared with the DEG-IIS, these genes were involved not only in polar transport of hormones (Additional file 5), but also in biosynthesis and transport of substances, as well as stress resistance. A few genes that regulate the development of fruit trichomes by interacting with the cytoskeleton were also found among the DEG-II-IIIs. For example, KIC (encoded by Csa6G106800) can interact with kinesin motor protein in a calcium-dependent manner to regulate the pattern of trichome branching in Arabidopsis [38]. The expression level of Csa6G106800 was downregulated 1.6-fold in stage IV compared with stages II and III.
We found only 26 genes whose expression levels were significantly higher in stage III than in stage II and stage IV (for convenience, we called them DEG-IIIs). According to GO analysis and KEGG analysis (Additional file 1), these 26 genes were mainly related to secondary metabolism and protein processing and transportation. At the same time, we did not find any genes related to hormones or meristems among the DEG-IIIs.
These results indicate that the stalk development of fruit spines depends on the polar transport of phytohormones regulated by the cytoskeleton. This developmental process is continuous, and most genes reach their peak expression levels at the initial developmental stage of fruit spines.
Functional annotation of key DEGs in the base developmental stages of fruit spines
Stage IV was the key period for the base development of fruit spines, and the pattern of cell division and proliferation was different from those of other stages, implying a big change in the involved gene regulatory network. There were 722 genes with the highest expression levels in stage IV (for convenience, we called them DEG-IVs) (Additional file 1). The results of GO and KEGG analyses also showed that the function of DEG-IVs was different from those of DEG-IIs and DEG-IVs.
In the biological process category, DEG-IVs were enriched in single-organism process, cellular process and metabolic process (Fig. 8C). The GO directed acyclic graph showed clearer details of DEG-IV enrichment (Additional file 10). Signal transduction, multicellular organism, transport, anatomical structure, lipid metabolism and amino acid metabolism were the most common biological processes in which DEG-IVs were involved. When we classified the DEG-IVs according to cellular components, we found that in addition to being enriched in plasma membrane and cell wall, DEG-IVs were also enriched in protein complex and intracellular membrane. Furthermore, compared with DEG-IIs and DEG-II-IIIs, DEG-IVs also had more genes related to the cytoskeleton. For molecular function, unlike the DEG-IIs and DEG-II-IIIs, which were only enriched in catalytic activity (oxidoreductase activity, transferase activity, hydrolase activity and lyase activity), the DEG-IVs also included some genes enriched in transporter activity. The results of KEGG analysis of the DEG-IVs indicated that the biosynthesis pathways of carbon metabolism, glyoxylate and dicarboxylate metabolism and glycine, serine and threonine metabolism were significantly enriched. (Additional file 4 and Additional file 11). Another difference from the stalk developmental stage was that bHLH was the transcription factor family with the most genes among the DEG-IVs, with a total of 13 genes (Additional file 7). Although none of these bHLHs have been shown to be involved in the development of plant trichomes, several bHLH transcription factors have been shown to be involved in the specialization of epidermal cells in Arabidopsis. For instance, SCRM (encoded by Csa1G051760) mediates stomatal differentiation in the epidermis by controlling MUTE and FAMA in Arabidopsis [39-41]. Coincidentally, the homologous genes of FAMA and MUTE, Csa3G150010 and Csa3G131940, were also present among the DEG-IVs, suggesting that this set of bHLH complexes may be involved in the development of cucumber trichomes. Similar to the results for the DEG-IIs and DEG-II-IIIs, we also identified multiple phytohormones related to the DEG-IVs. We found a total of 71 genes involved in the response of eight phytohormones (Additional file 5). Among them, auxin response-related genes were still the most abundant (20 genes), indicating the important role of auxin in the development of cucumber trichomes. In addition, there were 50 genes involved in the meristem and cell cycle among the DEG-IVs (Additional file 6). However, we did not find any genes among the DEG-IVs known identified to be involved in trichome development in Arabidopsis, suggesting that the base is a unique structure of cucumber trichomes and the development mechanism is very different from that of unicellular trichomes.
Auxin polar transport and cytoskeletal signaling are the main factors causing phenotypic differences between the wild type and mutant
Stage III was the critical period for the difference between the wild type and mutant. In stage III, the development of cells that made up the stalk was disrupted and incomplete in the mutant (Fig. 3), indicating that the normal gene regulatory network was disturbed, so we focused on DEGs that were not differentially expressed at stage II between the wild type and mutant but were significantly differentially expressed at stage III. A total of 628 genes showed this expression pattern in the transcriptome data, among which 249 genes were upregulated and 379 genes were downregulated in the mutant (Additional file 1).
Then, we performed GO and KEGG analyses to evaluate the functions of these DEGs. In the biological process category, upregulated genes in mutant were more enriched in the processes of multicellular organismal development and anatomical structure development (Additional file 12), while downregulated genes in mutant were more enriched in the processes of endogenous hormone response and amino acid metabolism (Additional file 13). When classifying the DEGs according to molecular function, we found that the downregulated genes were enriched not only in the activities of oxidoreductase, transferase and hydrolase, similar to upregulated genes, but also in lyase activity. The analysis of cellular components demonstrated that the enriched processes of upregulated genes and downregulated genes were roughly the same, except that the upregulated genes were also enriched in the processes of intracellular membrane-bound organelle and plasma membrane. The results of KEGG analysis showed that the upregulated and downregulated genes participated in different pathways (Additional file 14 and Additional file 15). Pentose and glucuronate interconversions, phenylalanine metabolism and tryptophan metabolism were significantly enriched for upregulated genes. Downregulated genes were mainly enriched in glutathione metabolism, plant-pathogen interaction and alpha-linolenic acid metabolism.
By further exploring the functions of DEGs, we found that auxin polar transport was associated with the upregulated genes in mutant. A series of auxin carrier proteins, namely, LAX4 (encoded by Csa4G308640), LAX5 (encoded by Csa7G010800), PID2 (encoded by Csa2G006100), and PIN1 (encoded by Csa4G430820), were detected among the upregulated genes. NPY1 (encoded by Csa2G382680) can regulate cotyledon development through the control of PIN1 polarity in Arabidopsis [42], and was also found among the upregulated genes. Among the downregulated genes in mutant, there were far more genes related to the meristem and cell cycle than there were among the upregulated genes. Among those downregulated genes, genes related to calcium signaling in the cytoskeleton were very noteworthy. A series of calcium sensors, such as CML46, CML38, and CML11, were found among the downregulated genes. PCM1 (encoded by Csa3G167380), a kind of calmodulin that mediates the control of a large number of protein kinases, ion channels and other proteins by calcium signaling, was also detected among the downregulated genes. Furthermore, genes that directly affect the cell cycle by interacting with the cytoskeleton were also detected among the downregulated genes. KIC (encoded by Csa6G106800) is a calmodulin that can interact with kinesin in the cytoskeleton to directly affect the branch number of trichomes in Arabidopsis [38]. RANGAP1 (encoded by Csa2G355000) is an activator of the cell polarity molecule Ran GTPase and plays a role in spatial signaling during cell division [43].
qRT-PCR validation of differentially expressed genes
To validate the accuracy of the RNA-seq data, quantitative RT-PCR was conducted on the randomly selected 10 DEGs from our RNA-seq data to detect their expression patterns in three development stages. These DEGs included Csa3G748220 (CsMict [9] from down-regulated DEGs of mutant), Csa5G577350 (CsTu [28] from DEG-IVs), Csa3G824850 (CsMYB6 [44] in DEG-IIs), Csa2G006270 (ERF TF from DEG-II-IIIs), Csa3G020060 (Aspartic proteinase from DEG-II-IIIs), Csa1G006300 (signal response regulator from DEG-IIs), Csa1G267240 (Chitinase from DEG-IIs), Csa3168940 (MYB TF from DEG-IIs), Csa3G824850 (MYB TF from DEG-II-IIIs) and Csa5G623360 (Aquaporin from DEG-IIs). The results showed there were high expression pattern correlation between the RNA-seq data and qRT-PCR in all selected genes, whether in wild type or mutants (R = 0.82~0.99) (Additional file 16).