Comparison of the phenotypic characteristics, cytology, and physiology of the XX048 and NZ199 plants
The plant height of the two varieties of XX048 and NZ199 was 420.27 ± 5.2cm and 219 ± 4.57cm, respectively, and the difference was extremely significant (Fig. 1). August to September is the period of rapid growth in cassava plant height. During this period, the height of XX048 plants increased by nearly 150cm, while NZ199 plants increased by about 55cm, XX048 plants grew faster than NZ199 plants, and the difference in height between the two was the largest during this period.
To observe the anatomical differences between XX048 and NZ199, we performed paraffin section staining in the stem of the top internode tissue. The woody area of XX048 is 414.87 ± 14.28mm2, while that of NZ199 is 911.10 ± 82.62mm2, with significant differences between the two. The average length and width of XX048 xylem cells were 27.04 ± 4.37mm and 22.11 ± 4.36mm, respectively, while the average length and width of NZ199 xylem cells were 35.25 ± 3.81mm and 29.12 ± 5.15mm(Fig. 2), respectively. There was no significant difference in length and width between XX048 and NZ199. These results indicate that the xylem of NZ199 is more developed and has a higher degree of lignification.
The content of lignin and cellulose in the stem of the top internode tissue of XX048 and NZ199 is shown in Fig. 3. The lignin content of NZ199 and XX048 is 162.37 ± 1.81mg/g and 132.01 ± 3.83mg/g, respectively. The cellulose content of NZ199 and XX048 is 73.47 ± 2.69mg/g and 70.91 ± 1.45mg/g, respectively. The difference in lignin content between the two is significant, while the difference in cellulose content between the two is not significant. The results showed that the cell wall lignification development of the two was different, and the main stem of NZ199 cassava entered the secondary wall lignification development state earlier. This result is consistent with the cytological results.
Transcriptomic analysis
To explore the key genes related to plant height development of cassava, six cDNA libraries were constructed. Based on Illumina sequencing, after removing splices and low-quality reads, 121791451 clean readings were obtained(Supplementary Table 1), and the percentages of Q20, and Q30 base rates were more than 97.82% and 93.65%, respectively. The clean reads of each sample were compared with the cassava reference genome, and the comparison efficiency ranged from 94.56–94.88%. The sample correlation heat map showed that the R2 value among three biological repetitive samples was greater than 0.89, and that of most of the samples was greater than 0.9, indicating that this experiment was highly repeatable and the data were reliable (Fig. 4).
To study the expression pattern of genes related to plant height development of cassava, the transcriptome map of cassava with tall and short stems was compared, and the differentially expressed genes of cassava with tall and short stems were identified by log2 | Fold Change | ≥ 2 and p - value < 0.01. A total of 2368 DEGs were differentially expressed between XX048 and NZ199, of which 1112 genes were up-regulated and 1256 genes were down-regulated (Fig. 5).
To infer the biological processes of all DEGs, We have classified DEGs into GO functions, the identified DEGs were assigned into three main GO functional categories(Fig. 6), including biological process(BP), cellular component(CC), and molecular function(MF). The three most abundant sub-categories in which the BP category was divided were “metabolic process”(753 DEGs), “cellular process”(590 DEGs), and “single-organism process” (458 DEGs). For the CC category, the three most abundant sub-categories were “membrane”(521 DEGs), “membrane part” (480 DEGs), “cell part” (377 DEGs), and “cell” (377 DEGs). The majority of DEGs of the MF category were assigned to “binding” (829 DEGs), “catalytic activity”(795 DEGs), and “transporter activity”(105 DEGs).
The KEGG enrichment analysis found that the five most abundant pathways as "Plant-pathogen interaction" (117 DEGs), “Plant hormone signal transduction" (47 DEGs), "Starch and sucrose metabolism" (46 DEGs), "phenylpropanoid biosynthesis" (43 DEGs), and "Carbon metabolism (41)". In addition, the “Flavonoid biosynthesis”, “Brassinosteroid biosynthesis”, and “Zeatin biosynthesis”, pathways were also enriched, and the “Flavonoid biosynthesis” pathway was the most significantly enriched(Fig. 7).
DEGs involved in plant hormone biosynthesis and signal transduction pathways
Plant hormone biosynthesis and signal transduction pathways include auxin, gibberellin, cytokinin, BL, abscisic acid, ethylene, jasmonic acid, and salicylic acid synthesis and transduction. Many important genes involved in auxin, cytokinin, gibberellin, and BL biosynthesis and signal transduction pathways were significantly down-regulated in dwarf cassava (Fig. 8, Supplementary Table 2). In the IAA biosynthesis and transduction pathway, auxin response factor ( ARF ), auxin-responsive GH3 gene family ( GH3 ), and small auxin-up RNA(SAUR) were generally down-regulated. In the cytokinin pathway, two-component response regulators ARR-A and ARR-B families were generally down-regulated. In the GA pathway, one DELLA protein was up-regulated, one was down-regulated, phytochrome-interacting factor 3 ( TF ) was down-regulated, and gibberellin receptor GID1 ( GID1 ) was up-regulated. Several BL biosynthesis and signal transduction genes were also down-regulated, including brassinosteroid insensitive 1-associated receptor kinase 1 ( BAK1 ), protein brassinosteroid insensitive 1 ( BRI1 ), and cyclin D3 ( CYCD3 ). Many important genes involved in ETH, SA, JA, and ABA biosynthesis and signal transduction pathways were significantly up-regulated in dwarf cassava. In the ABA pathway, serine / threonine-protein kinase SRK2 ( SnRK2 ) and ABA-responsive element binding factor ( ABF ) were generally up-regulated. In the JA pathway, transcription factor MYC2 (MYC2) was generally up-regulated. In the ETH and SA pathway, serine / threonine-protein kinase CTR1 ( CTR1 ), mitogen-activated protein kinase kinase 4 / 5 ( MKK4 _ 5 ), regulatory protein NPR1 ( NPR1), pathogenesis-related protein 1 ( PR1 ) were up-regulated.
DEGs involved in flavonoid biosynthesis and phenylpropanoid biosynthesis
The flavonoid biosynthesis pathway was the most significantly enriched in the KEGG enrichment analysis. In the flavonoid biosynthesis pathway, most of the genes were significantly up-regulated in the XX048 Vs.NZ199 group, including 4 chalcone synthase genes EC: 2.3.1.74 ( CHS ), 5 shikimate O-hydroxycinnamoyltransferase EC: 2.3.1.133 ( HCT ). 1 phlorizin synthase genes EC: 2.4.1.357 ( PGT1 ), 2 chalcone isomerase EC: 5.5.1.6, 1 naringenin 3-dioxygenase EC: 1.14.11.9 ( F3H ), 1 bifunctional dihydroflavonol 4-reductase/flavanone 4-reductase EC:1.1.1.219 1.1.1.234 (DFR), 1 flavonoid 3',5'-hydroxylase EC:1.14.14.81 (CYP75A), 1 flavonol synthase EC:1.14.20.6(FLS), 1 anthocyanidin synthase EC:1.14.20.4 (ANS),1 anthocyanidin reductase EC:1.3.1.77(ANR), 1 leucoanthocyanidin reductase EC:1.17.1.3(LAR). Only 1 chalcone synthase gene EC: 2.3.1.74 ( CHS ), 1 caffeoyl-CoA O-methyltransferase gene EC: 2.1.1.104, and 5 phlorizin synthase genes EC: 2.4.1.357 ( PGT1 ) were significantly down-regulated in the XX048 Vs.NZ199 group(Fig. 9, Supplementary Table 3).
The phenylpropanoid biosynthesis pathway is also one of the KEGG enrichment pathways. The products of the phenylpropanoid biosynthesis pathway include lignin monomers such as p-coumarol, coniferyl alcohol, and sinapyl alcohol. It showed 37 significantly differentially expressed genes involved in lignin monomer synthesis (Fig. 10, Supplementary Table 4), including 4 phenylalanine ammonia-lyase ( PAL ), 6 4-coumarate-CoA ligase ( 4CL ), 5 cinnamoyl-CoA reductase ( CCR ), 4 cinnamyl-alcohol dehydrogenase ( CAD ), 2 peroxidases ( POD ), 5 shikimate O-hydroxycinnamoyltransferase ( HCT ), 5 caffeic acid 3-O-methyltransferase / acetylserotonin O-methyltransferase ( COMT ), 1 caffeoyl-CoA O-methyltransferase ( CCoAOMT ), 1 ferulate-5-hydroxylase ( F5H ) and 4 laccase genes. Twenty-eight genes were significantly up-regulated in the XX048 Vs.NZ199 group and only 9 genes were significantly down-regulated.
DEGs involved in cell wall biosynthesis and expansion
To provide mechanical strength to newly elongated cell walls and internodes, the synthesis of new cell wall components is essential. We analyzed the differential expression of genes involved in cell wall synthesis or modification, such as cellulose synthase ( CES ), expansin ( EXP ), xylosidase ( XYL ), pectin lyase ( PL ), pectin esterase ( PM), pectin acetyl esterase ( PA ) and polygalacturonase ( PG ), which may be involved in the elongation and expansion of cassava stems.
The relaxation of the cell wall is a physiological process that must occur during cell expansion and elongation during plant growth and development. Expansins (EXPs) are nonhydrolytic cell wall-loosening proteins that enable cells to expand while allowing tissues to differentiate and grow. Two members of the expansin gene family were differentially expressed in
tall and dwarf cassava and were significantly up-regulated in dwarf cassava(Fig. 11, Supplementary Table 5 ). Xyloglucan endoglycosyltransferase / hydrolase ( XTH ) has also been shown to participate in cell expansion by relaxing and rearranging cell wall fibers in growing tissues. Four XTHs were differentially expressed between tall and dwarf cassava, of which 3 XTHs were significantly down-regulated and 1 XTH was up-regulated in the tall vs. dwarf group(Fig. 11, Supplementary Table 5 ). Pectin is a complex polymer, and its degradation is the initiation of secondary wall lignification and cell wall disintegration. A total of 11 related genes were significantly differentially expressed in tall and dwarf cassava, including 5 pectin esterase genes ( PM ), 3 polygalacturonase genes ( PG ), 2 pectin lyase genes ( PL ), 1 pectin acetyl esterase gene ( PA ). In addition, 1 cellulose synthase gene and 2 XYL xylosidase genes were also differentially expressed in the tall vs. dwarf group(Fig. 11, Supplementary Table 5 ).
Transcription factor analysis
Transcription factors play an important role in gene expression. In this study, we identified 92 transcription factors differentially expressed in XX048 Vs.NZ199, representing 34 transcription factor families, of which 56 were down-regulated and 36 were up-regulated(Supplementary Table 6). These transcription factors mainly included bHLH ( 9 ), MYB ( 9 ), C2H2 ( 7 ), NAC ( 5 ), MYB-related ( 5 ), bZIP ( 5 ), AP2 / ERF-ERF ( 5 ) and WRKY ( 4 ) transcription factor families.
qRT-PCR validation
To verify the accuracy of RNA-Seq results, 11 differentially expressed genes were randomly selected for qRT-PCR verification. The results showed that the expression patterns of these 11 differentially expressed genes were similar to RNA-Seq, with R2 of 0.9804(Fig. 12), which shows that RNA-Seq is reliable.