Copy number variation in lipid- and carbohydrate-related genes of rapeseed, castor bean, and maize
To eliminate the effect of species ploidy, the relative copy numbers of key regulators that participate in seed oil biosynthesis were analyzed in these three species. Copy numbers of genes that encoding proteins involving in FA biosynthesis, TAG assembly, and oil body formation were higher in rapeseed than in castor bean and maize (Fig. 1a). This result probably reflects the fact that rapeseed is an allopolyploid and contains duplicates of most genes [45]. All lipid biosynthesis genes had more copies in maize than in castor bean except for ACCase, the key rate-limiting enzyme of FA biosynthesis (Fig. 1a). It has been well-known that ACCase contains the homomeric ACC2 and heteromeric ACCase complex composed of CT-α, CT-β, BCCP, and BC subunits. Our findings revealed that the castor bean and rapeseed genomes contained the heteromeric ACCase but not the homomeric ACC2, whereas the maize genome contained only homomeric ACC2 (Fig. 1a). In addition to different copy numbers of lipid biosynthesis genes, the three species also had different copy numbers of genes encoding oil biosynthesis regulators. All lipid-related TFs had fewer copy numbers in castor bean than in rapeseed (Fig. 1a). The relative copy numbers of lipid-related TF genes in maize were inconsistent: maize had the greatest numbers of WRI1, ABI3, and VAL2 genes among three species, but the lowest copy numbers of LEC1 and FUS3. More importantly, LEC2 and PKL genes were not present in the maize genome (Fig. 1a). In addition, rapeseed had the highest copy numbers of carbohydrate-related genes that functioned in sucrose metabolism, glycolysis, the PPP, and the Calvin Cycle, followed by maize and castor bean (Fig. 1b). These findings revealed that the copy numbers of most lipid- and carbohydrate-related genes were highest in rapeseed, lower in maize, and lowest in castor bean.
Identification of DEGs during seed oil biosynthesis in rapeseed, castor bean, and maize
Rapeseed embryo, castor bean endosperm, and maize embryo tissues were collected at four developmental stages (S1–S4) for SOC and transcriptome analysis (Fig. 2a). Seed oil content increased as seed development progressed and reached its highest level at S3 in rapeseed, castor bean, and maize (Fig. 2a). After transcriptome sequencing and data processing, relative gene expression levels at stages 2, 3, and 4 were compared to that at stage 1 to identify the differentially expressed genes (DEGs). In total, 47 408, 10 708, and 7144 genes were differentially expressed over the course of seed development in rapeseed, castor bean, and maize, respectively (Fig. 2b). Moreover, lipid- and carbohydrate-related genes accounted for 3.5% and 1.3% of all DEGs in the rapeseed embryo, and similar numbers were found in the endosperm of castor bean or the embryo of maize (Fig. 2b). In contrast, when the number of lipid- and carbohydrate-related DEGs was compared with the number of lipid- and carbohydrate-related genes in the entire genomes, 61.8% (937/1516) of the lipid-related genes and 68.0% (607/892) of the carbohydrate-related genes in rapeseed were regulated in embryo; 61.2% (224/366) of the lipid-related genes and 61.7% (142/230) of the carbohydrate-related genes in castor bean were regulated in endosperm (Table 1). However, only 20.1% (112/558) of the lipid-related genes and 22.5% (82/365) of the carbohydrate-related genes in maize were regulated in the embryo (Table 1). To further investigate the regulation of every homologous lipid- and carbohydrate-related gene in these species, we calculated a regulation ratio for each homologous gene by dividing the number of differentially expressed gene copies in a species by the total number of gene copies in the species. The comparison of the regulated ratio between three species showed that most genes had a higher regulation ratio in rapeseed and castor bean than in maize (Fig. S1). We therefore concluded that fewer genes involved in seed oil accumulation were regulated during seed development in maize.
Table 1
The number of total and differential expressed lipid-related and carbohydrate-related genes in rapeseed, castor bean, and maize.
| All lipid-related genes | Differential expressed lipid-related genes | All carbohydrate-related genes | Differential expressed carbohydrate-related genes |
Rapeseed | 1516 | 937 (61.8%) | 892 | 607 (68.0%) |
Maize | 558 | 112 (20.1%) | 365 | 82 (22.5%) |
Castor bean | 366 | 224 (61.2%) | 230 | 142 (61.7%) |
We next performed Gene Ontology (GO) analysis to identify differentially regulated pathways between three species during seed oil accumulation. GO terms enriched in the rapeseed DEGs were mainly related to photosynthesis, response to stresses such as cold, water deprivation and abscisic acid, and lipid metabolic processes (Data S1). Castor bean DEGs were mainly enriched in GO terms related to protein biosynthesis and modification, lipid metabolic processes, and DNA replication (Data S1). In contrast, maize DEGs were enriched in GO terms such as regulation of transcription, DNA-templated, and spermine biosynthetic process (Data S1). Thus, more lipid related genes were regulated in rapeseed and castor bean than in maize during seed development.
Next, homologous genes were identified in rapeseed, castor bean, and maize, and 2108 genes were found to be conserved among these species. Furthermore, there were 5040 dicot-specific genes (shared by rapeseed and castor bean) and 1256 embryo-specific genes (shared by rapeseed and maize) (Fig. 2c). Enriched GO terms in these 2108 conserved genes included acetyl-CoA biosynthetic process from pyruvate, long-chain fatty acid biosynthetic process, carbohydrate metabolic process, and acyl-CoA metabolic process (Data S1). Those 5040 dicot-specific genes were enriched in GO terms such as negative regulation of fatty acid biosynthetic process, lipid catabolic process, and lipid oxidation, while those 1265 embryo-specific genes were enriched in GO terms including response to salt stress, response to auxin, regulation of jasmonic acid-mediated signaling pathway, and ethylene biosynthetic process (Data S1). Therefore, we speculated that oil biosynthesis and carbon partitioning were differentially regulated between rapeseed, castor bean, and maize seeds and that fewer oil biosynthesis genes were regulated in maize than in rapeseed and castor bean.
Differential expression of lipid-related genes in rapeseed, castor bean, and maize during seed development
Subsequently, we focused on the gene expression profiles of lipid-related genes that have been reported to regulate lipid biosynthesis, and classified these genes into different categories based on lipid-related metabolic pathways [3]. Comparison of lipid-related DEGs (LDEGs) between species revealed that fewer lipid biosynthesis genes such as plastidial fatty acid synthesis and TAG synthesis genes were regulated in maize than in rapeseed and castor bean (Fig. 3a). By contrast, more lipid metabolism genes such as those encoding cutin synthesis, aliphatic suberin synthesis, GDSL, galactolipid degradation, and lipase were regulated in maize (Fig. 3a). All DEGs in three species could be grouped into different clusters based on their gene expression patterns (Fig. S2). Nevertheless, rapeseed lipid biosynthesis genes were clustered into four expression profiles; these genes reached their highest expression levels at stage 2 (Fig. 3b). Likewise, maize and castor bean lipid biosynthesis genes were clustered into one and two expression profiles, respectively, and their highest expression levels were also appeared at stage 2 (Fig. 3b). Maize lipid metabolism genes were clustered into four expression profiles and showed increasing expression trends during seed development. By contrast, lipid metabolism genes in the four expression profiles of castor bean tended to decrease in expression (Fig. 3c). Genes in five out of eight rapeseed expression profiles also tended to decrease in expression during development, except profiles 34, 39 and 44, whose genes showed a temporary increase in expression at S2 or S3 (Fig. 3c). Thus, we concluded that fewer lipid biosynthesis genes but more lipid metabolism genes were regulated in the maize embryo during seed oil accumulation.
More genes involved in FA biosynthesis and oil body formation were differentially expressed in rapeseed and castor bean than in maize (Fig. S3), even though these genes had higher copy numbers in maize than in castor bean (Fig. 1a). Interestingly, almost all key TAG biosynthesis enzymes (e.g., GPDH, GPAT9, DGAT, PDAT, and PDCT) showed no regulation during maize embryo development (Fig. 4a). Similarly, key TFs that participate in lipid biosynthesis (e.g., LEC1/2, ABI3, VAL1/2, ASIL1, and PKL) showed little regulation in the maize embryo (Fig. 4b). Therefore, we concluded that lack of regulation of lipid biosynthesis enzymes, especially key TAG assembly enzymes and lipid-related TFs, may lead to the low abundance of oil in maize embryo.
In addition to LDEG numbers, gene expression levels also differed between these species and may have led to differences in seed oil biosynthesis. There were significant differences in the fold changes of many lipid-related genes between rapeseed, castor bean, and maize. The key lipid biosynthesis regulator genes PKL and VAL2 showed opposite regulation patterns between rapeseed and castor bean at the four developmental stages, and they were not regulated in maize (Fig. 5a). The rate-limiting FA biosynthesis ACCase enzymes were up-regulated at S2 and S3 in rapeseed, but ACCase homologs were down-regulated or not regulated in castor bean and maize at S2 and S3 (Fig. 5a). Similarly, numerous lipid-related genes (e.g., FUS3, WRI1, LEC1/2, PDCH, GPDHC, and DGAT) were also differentially regulated in rapeseed, castor bean, and maize (Fig. S4). To compare the gene expression levels of lipid-related genes in three species, the average FPKM of all expressed genes was introduced to normalize the relative gene expression levels in each species. We found that most lipid-related genes had higher relative expression levels in castor bean than in rapeseed and maize (only homologous ACCase and DGAT genes were shown), a finding that might account for the rapid accumulation of oil in castor bean endosperm (Fig. 5b).
Differential expression of carbohydrate-related genes in rapeseed, castor bean, and maize during seed development
In addition to lipid-related genes, carbohydrate-related genes also play vital roles in seed oil biosynthesis. Comparison of carbohydrate-related metabolic pathways between species indicated that glycolysis, the PPP, and the Calvin Cycle were up-regulated in rapeseed and castor bean. Other carbohydrate metabolic pathways were up-regulated in castor bean and maize, while sucrose metabolism and sugar transport pathways were up-regulated in maize (Fig. 6a). The numbers of carbohydrate-related DEGs (CDEGs) associated with sugar transport, plastid transport, organic acids, the TCA, and other carbohydrate metabolic pathways were the highest in rapeseed, followed by castor bean and maize (Fig. S5).
Most enzymes that participate in the PPP and Calvin Cycle (e.g., 6-phosphogluconate dehydrogenase (6PGDH), ribulose-phosphate 3-epimerase (RPE), transketolase (TK), transaldolase (TA), ribulose bisphosphate carboxylase (RBCS), and phosphoribulokinase (PRK)) were not regulated in maize embryo. Numerous glycolytic enzymes (including key enzymes such as hexokinase (HXK), phosphofructokinase (PFK), phosphoglycerate kinase (PGK), and PK) were regulated only in the cytosol in maize embryo. By contrast, they were differentially expressed in the cytosol and plastid of rapeseed embryo and castor bean endosperm (Fig. 6b). More interestingly, invertase (INV) genes that play important roles in sucrose metabolism functioned only in the cytosol in castor bean endosperm. This differed from the situation in rapeseed and maize embryos, in which INV genes were differentially expressed in the cytosol, plastid, cell wall, and vacuole (Fig. 6b). Another vital sucrose metabolism enzyme, sucrose synthase (SuSy), plays important roles in carbon fixation and metabolism. We found that more SuSy homologs were regulated in castor bean and rapeseed than in maize: 9 of 14 SuSy genes were regulated in rapeseed, and all five SuSy genes were regulated in castor bean, but only 3 of 14 SuSy genes were regulated in maize (Fig. 2b and 6b). Analysis of gene expression patterns showed that most carbohydrate-related genes were highly expressed at S1 and S2 in castor bean and rapeseed, but were highly expressed at S2 and S3 in maize (Fig. 6c). We therefore speculated that reduced regulation of carbohydrate-related genes may limit the supply of substrates for oil biosynthesis in maize.