Analysis of physiological characteristics of overwintering branches of Walnut
The most intuitive manifestation of the shoot shriveling is that the epidermis of the branch shrinks due to water loss. According to the phenotype in the figure below, the epidermal shrinkage of 'Liaohe' was more obvious than that of 'Xiangling'. As we can seen from Fig.1-a, the soluble starch of the two branches increased first and then decreased in the whole overwintering stage, and the content of soluble starch in 'Liaohe' was higher than that of 'Xiangling'. The Fig.1-b showed the change trend of soluble sugar content in 'Xiangling' and 'Liaohe' were basically the same (with a gradual decline from P1 to P5), and the soluble sugar content in 'Liaohe' is always higher than that of 'Xiangling' during the whole overwintering period. The soluble sugar content of 'Xiangling' and 'Liaohe' reach the highest value in P1 (6.585% and 7.347%), while it drops to the lowest in P5 (4.848% and 5.798%). As can be seen from the sucrose content (Fig.1-c), the sucrose content of 'Xiangling' showed a trend of first increasing and then decreasing, reaching the highest point in P2. The sucrose content of 'Liaohe' was higher than that of 'Xiangling', and the sucrose content was 1.25% in P1.
It is showes in Fig.1-d that the glucose content of 'Xiangling' showed a trend of gradual decrease, while that of 'Liaohe' showed a trend of first increase and then decrease. The glucose content of 'Liaohe' was always higher than that of 'Xiangling' from P2. As we can see from Fig.1-e that the sucrose phosphorylase in two walnut varieties generally increased first and then decreased with the extension of overwintering time, and reach the maximum value of 0.0101 mg/g ('Xiangling') and 0.0129 mg/g ('Liaohe') at the same time. It can be seen from Fig.1-f that the amylase activity of the branches of 'Xiangling' and 'Liaohe' increased first and then decreased during the overwintering period, and from P1 to P5, the amylase activity of 'Xiangling' was higher than that of 'Liaohe' in the same period, and the amylase activity of the two branches reach the highest in P4 and P3, respectively, where the amylase activity of 'Xiangling' was 0.1830mg/g, while the activity of amylase in 'Liaohe' is 0.1660mg/g. The activity of sucrose synthase in 'Xiangling' decreases during the overwintering process, while that in 'Liaohe' increase first and then decrease (Fig.1-g). The sucrose invertase activity of 'Liaohe' was higher than that of 'Xiangling' from P1 to P3, and reach a maximum value of 25.485mg/g in P3, while the sucrose invertase activity of 'Liaohe' was always lower than that of 'Xiangling' from P4 to P5 (Fig.1-h).
Transcriptomic analysis of overwintering branches of Walnut
In order to further explore the molecular and related metabolic pathways of shoot shriveling in overwintering, three key periods were selected for transcriptomic analysis. The clustering diagram of sequencing samples obtained according to gene expression is shown in Fig.2-a, and the correlation between the same biological repeats are better. The density distribution of FPKM reflects the expression patterns of protein-coding genes in each sample. It could be seen that 'Liaohe' and 'Xiangling' showed similar change curves during the three overwintering periods, showing non-standard normal distribution (Fig.2-b). In order to clarify the repeatability and overall change trend of transcriptome samples of 'Liaohe' and 'Xiangling' after overwintering stress, intersample principal component analysis (PCA) was conducted using the FPKM values of all annotated genes in six transcriptomic samples as parameters. The results showed that the horizontal, vertical and vertical coordinates could explain 78.59% (PC1), 6.12% (PC2) and 6.37% (PC3) of the total sample variation, the explainable overall variation rate is 91.08%, and there is a significant aggregation trend among samples, indicating that the samples have good repeatability and could be used for subsequent difference analysis (Fig.2-c).
DEGs involved in the plant hormone, cutin and wax biosynthesis pathways
According to KEGG analysis, phytohormone metabolism, cutin and wax play an important role in the overwintering process of plants (Fig.3-a,c). The metabolism of abscisic acid mainly involves PYR, PP2C, SnRK2, and ABF, which can inhibit growth and accelerate aging. XP-035550417 was downregulated in both 'Liaohe' and 'Xiangling', while XP-0188085889 was not significantly different in 'Liaohe', and was up-regulated first and then downregulated in 'Xiangling'. The expression of genes such as AUX1, SUAR and ARF in the auxin pathway regulated cell division and plant growth, and they all showed different trends during overwintering. The metabolism of cytokinin mainly involves CRE1, AHP, B-ARR and A-ARR which reach shoot initiation and cell division, the expression of these genes showed diversity, and XP-018808887 was up-regulated in 'Liaohe' and 'Xiangling', while XP-018819058 showed a trend of rising first and then falling in 'Liaohe', and continue to rise in 'Xiangling' (Fig.3-b).
The formation of long-chain acyl-coA from acetyl-coA can be derived into other fatty compounds through three processes (initialization, elongation, and termination) through three pathways. Acetyl-coA forms long chain acyl-coA by 2.3.1.85 and 6.2.1.3 gene, in which XP-018815548 in 'Liaohe' and 'Xiangling' were progressively down-regulated, while XP-018808861 and XP-018809212 had no significant changes. The other two pathways are more complex and involve multiple genes. For example, acetyl-coA forms 3-Oxoacyl-acp through 2.3.1.180 or Malonyl-CoA through 2.3.1.39, and finally through 3.1.2.14 forms a Long-chain acyl-coA, in with a rising trend in 3.1.2.14 (Fig.3-d).
In the process of lignin biosynthesis, many genes participate in the formation of different types of lignin, among which 4CL gene playes an important role. XP-018812460.2, a gene essential for the formation of P-Coumarcyl CoA from P-Coumaric acid, showed different expression trends in 'Liaohe' and 'Xiangling', suggesting that the expression of lignin genes may affect lignin biosynthesis to regulate plant stress adaptability (Fig.3-e).
qRT-PCR validation of differentially expressed genes
In order to further verify the reliability of transcriptome data, major differential genes involved in plant hormone signal transduction, lignin and wax biosynthesis were selected for qRT-PCR analysis (Fig.4). Transcriptomic analysis revealed that plant overwintering was related to lignin and flavonoid metabolic pathways. 4CL, as a functional gene with both properties, showed a gradual decline in 'Xiangling', while an increase in 'Liaohe' during the overwintering process. In plant signal transduction, SnRK2 and other genes play an important role and the expression trend changed significantly during overwintering. The B-ARR of 'Liaohe' and 'Xiangling' decreased gradually during overwintering, while the change was not obvious in qRT-PCR. CYP86, 1.2.1.84, CER1 and other genes played an important role in fatty acid metabolism and can regulate overwintering. CYP86 of 'Liaohe' increased gradually with the extension of overwintering time, while that of 'Xiangling' decreased gradually. In general, the expression trend of most genes in RNA-seq were more obvious than that in qRT-PCR, but the expression patterns of the two genes are similar, which further proves the reliability of sequencing results.
Identification and analysis of physicochemical properties of Jr4CL family members
AMP-binding-C (PF13193.9) hidden Markov model was used to search and screen walnut 4CL family members, and the analysis was carried out in combination with SMART, NCBI CDD and pfam, and the final retained sequences were identified as members of the walnut Jr4CL gene family, designated as Jr4CL1 to Jr4CL52 according to their chromosomal locations (Table 2). The physicochemical property analysis of the protein sequence showed that the amino acid length is between 268 and 1219 aa using the ProtParam tool. Jr4CL45 has the smallest molecular mass of 29.4 kD. Jr4CL1 has the largest molecular weight of 135.7 kD, the theoretical isoelectric point is between 5.39 and 9.26. The total average hydrophilicity of the protein is 0.120 - 0.295.
Table. 2 Analysis of physicochemical properties of Jr4CL gene family
Gene name
|
Gene ID
|
Amino acid
number
|
Molecular
weight/D
|
Isoelectri
point
|
Instability
index
|
GRAVY
|
Jr4CL1
|
JreChr01G10261
|
1219
|
134709.30
|
6.04
|
43.12
|
0.087
|
Jr4CL2
|
JreChr01G10401
|
565
|
61938.44
|
6.79
|
35.70
|
-0.027
|
Jr4CL3
|
JreChr01G10828
|
603
|
66935.68
|
6.34
|
45.64
|
-0.156
|
Jr4CL4
|
JreChr01G10888
|
653
|
72914.57
|
6.28
|
45.57
|
-0.153
|
Jr4CL5
|
JreChr01G11687
|
694
|
76587.12
|
6.91
|
31.00
|
-0.069
|
Jr4CL6
|
JreChr01G12667
|
554
|
60772.60
|
5.63
|
37.18
|
-0.154
|
Jr4CL7
|
JreChr01G13040
|
573
|
62260.32
|
5.86
|
41.61
|
-0.020
|
Jr4CL8
|
JreChr01G13626
|
544
|
59499.91
|
6.06
|
41.19
|
-0.013
|
Jr4CL9
|
JreChr02G10700
|
800
|
88525.12
|
8.70
|
31.41
|
-0.245
|
Jr4CL10
|
JreChr02G11270
|
573
|
62514.69
|
6.31
|
34.47
|
-0.157
|
Jr4CL11
|
JreChr02G11899
|
394
|
42854.74
|
5.39
|
36.17
|
0.047
|
Jr4CL12
|
JreChr03G10018
|
832
|
92312.08
|
6.26
|
36.28
|
-0.203
|
Jr4CL13
|
JreChr03G12827
|
730
|
80196.55
|
8.56
|
30.20
|
-0.062
|
Jr4CL14
|
JreChr03G13128
|
582
|
64270.92
|
7.60
|
34.50
|
-0.136
|
Jr4CL15
|
JreChr03G13375
|
555
|
60992.05
|
6.83
|
38.14
|
-0.087
|
Jr4CL16
|
JreChr03G13461
|
523
|
56465.37
|
5.64
|
32.84
|
-0.025
|
Jr4CL17
|
JreChr04G11976
|
587
|
65052.87
|
8.17
|
32.67
|
-0.147
|
Jr4CL18
|
JreChr05G11706
|
572
|
62357.74
|
5.54
|
37.10
|
0.067
|
Jr4CL19
|
JreChr05G11730
|
697
|
76624.15
|
6.91
|
31.28
|
-0.054
|
Jr4CL20
|
JreChr05G12314
|
574
|
62574.92
|
5.42
|
33.61
|
0.014
|
Jr4CL21
|
JreChr05G12962
|
558
|
60999.74
|
5.81
|
36.82
|
0.023
|
Jr4CL22
|
JreChr06G11268
|
661
|
73865.53
|
5.94
|
40.28
|
-0.164
|
Jr4CL23
|
JreChr06G11338
|
573
|
62384.45
|
7.16
|
35.08
|
0.084
|
Jr4CL24
|
JreChr06G11877
|
697
|
76527.42
|
6.85
|
29.46
|
0.006
|
Jr4CL25
|
JreChr07G11075
|
660
|
74219.24
|
6.39
|
30.63
|
-0.208
|
Jr4CL26
|
JreChr07G11306
|
563
|
61944.21
|
8.39
|
32.91
|
-0.132
|
Jr4CL27
|
JreChr07G11325
|
550
|
59855.19
|
8.63
|
34.35
|
0.044
|
Jr4CL28
|
JreChr07G11607
|
1106
|
122555.35
|
9.04
|
39.53
|
-0.095
|
Jr4CL29
|
JreChr07G11893
|
550
|
60700.08
|
5.86
|
33.68
|
-0.011
|
Jr4CL30
|
JreChr07G11894
|
542
|
59598.00
|
5.53
|
30.86
|
-0.009
|
Jr4CL31
|
JreChr07G11896
|
542
|
59548.01
|
5.70
|
30.09
|
0.007
|
Jr4CL32
|
JreChr08G10014
|
712
|
79988.84
|
6.11
|
31.42
|
-0.220
|
Jr4CL33
|
JreChr08G11472
|
694
|
76403.81
|
6.81
|
38.21
|
-0.068
|
Jr4CL34
|
JreChr08G11573
|
590
|
65128.47
|
6.80
|
33.78
|
-0.162
|
Jr4CL35
|
JreChr08G11599
|
546
|
59807.53
|
6.34
|
32.74
|
-0.110
|
Jr4CL36
|
JreChr08G12150
|
534
|
57489.92
|
8.41
|
39.25
|
-0.050
|
Jr4CL37
|
JreChr09G10404
|
577
|
63214.78
|
8.94
|
43.99
|
-0.058
|
Jr4CL38
|
JreChr10G10412
|
564
|
62168.99
|
7.00
|
43.88
|
0.077
|
Jr4CL39
|
JreChr10G10423
|
594
|
65639.98
|
7.88
|
44.52
|
0.028
|
Jr4CL40
|
JreChr10G10964
|
542
|
59454.94
|
8.76
|
41.64
|
0.028
|
Jr4CL41
|
JreChr11G10166
|
561
|
61032.36
|
7.18
|
48.20
|
0.047
|
Jr4CL42
|
JreChr11G10952
|
573
|
62656.33
|
6.25
|
35.24
|
0.120
|
Jr4CL43
|
JreChr11G11200
|
662
|
74108.18
|
5.74
|
32.05
|
-0.196
|
Jr4CL44
|
JreChr11G11369
|
524
|
56485.44
|
6.36
|
37.46
|
-0.019
|
Jr4CL45
|
JreChr12G10316
|
268
|
29351.72
|
9.26
|
47.48
|
-0.295
|
Jr4CL46
|
JreChr12G10871
|
639
|
71566.70
|
6.67
|
34.22
|
-0.152
|
Jr4CL47
|
JreChr12G11007
|
525
|
56369.53
|
6.22
|
34.54
|
0.015
|
Jr4CL48
|
JreChr13G10323
|
544
|
59467.97
|
8.80
|
38.52
|
0.083
|
Jr4CL49
|
JreChr13G11269
|
727
|
80820.81
|
6.82
|
37.27
|
-0.116
|
Jr4CL50
|
JreChr14G10240
|
540
|
59238.40
|
8.22
|
38.87
|
0.066
|
Jr4CL51
|
JreChr14G11032
|
727
|
80459.38
|
6.97
|
37.88
|
-0.063
|
Jr4CL52
|
JreChr16G11309
|
545
|
60360.72
|
6.74
|
35.30
|
0.060
|
Bioinformatics analysis of the Jr4CL family
The gene structure analysis results of the members of Jr4CLs showed that Motif 2 and Motif 6 were common in the family. According to the number and species of Motifs, the evolutionary tree can be roughly divided into three groups, among which Group I contained Motif 1-Motif 7, Motif 9 and Motif 10, without Motif 8, and Group II and Group III do not contain Motif 8. Conservative motif analysis found that most members of the same group have similar Motif, indicating that the genes in the same group have functional similarity (Fig. 5-a). Motif analysis is performed on 52 Jr4CL family member genes using MEME online tool, the results show that the motif 1 and motif 2 have typical AMP-binding -c domains (Fig.5-b).
In order to study the evolutionary relationship of Jr4CL in walnut, phylogenetic tree analysis is performed on 52 4CL amino acid sequences (Fig.5-c). According to sequence similarity and topological structure, the evolutionary tree is divided into 7 4CL subfamilies (I, II, III, IV, V, VI, VII). Subclass I has 8 Jr4CL amino acid sequences, subclass II has 5 Jr4CL amino acid sequences, and subclass III has 7 Jr4CL amino acid sequences. Subclass IV has 4 Jr4CL amino acid sequences, Subclass V has 9 Jr4CL amino acid sequences, Subclass VI has 7 Jr4CL amino acid sequences, and subclass VII has 12 Jr4CL amino acid sequences.
The Fig.5-d showes the chromosome localization and gene replication of the Jr4CL gene family in Walnut. Members of the 4CL gene family are unevenly distributed on 16 chromosomes, and only one gene is found on chromosomes 4, 9 and 16, namely Jr4CL17, Jr4CL37 and Jr4CL52. Eight genes (Jr4CL8, Jr4CL1, Jr4CL2, Jr4CL3, Jr4CL4, Jr4CL5, Jr4CL6, Jr4CL7) are most distributed on chromosome 1, while the genes on chromosome 15 are least distributed. A total of 4 pairs of tandem repeats are identified, including 1 pair of tandem repeats in chromosome 8 and 10, and 2 pairs of tandem repeats in chromosome 7.
To analyze how Jr4CL in walnut is involved in gene expression regulation, cil-acting element analysis of the promoter region (2000 bp) of the Jr4CL family gene in walnut is performed using PlantCARE online (Fig.5-e,f).The results show that the genes promoter of Jr4CL family in walnut contain photoresponsive elements, abscisic acid, gibberellin, auxin and other hormone response elements, and the response elements related to low temperature and defense. Among the many response elements, the number of light response elements is relatively large (ACE, G-box, AE-box, I-box, AT1-motif, GA-motif, etc.), which may play an important role in the growth and development of plants, and the Jr4CL gene of walnut may be able to resist various stresses, speculating that the gene function is diverse.
Expression of Jr4CLs gene under drought stress
In order to identify the expression characteristics of Jr4CLs under stress, drought stress is used as the background. Fig.6 showes that drought stress can significantly induce the expression of most Jr4CLs genes, among which Jr4CL44 and Jr4CL37 have the highest expression. In addition, genes represented by Jr4CL36 and Jr4CL21 are down-regulated.
Bioinformatics analysis of Jr4CL44 gene
The identification of the secondary structure of Jr4CL44 protein showes that it mainly included Alpha helix (Hh) and random coil (Cc), in which α-helix accounted for the largest proportion (Figure .7-a). The hydrophilicity/hydrophobicity of Jr4CL44 protein is predicted by Protscale online software. The results showes that the whole polypeptide chain encoded by Jr4CL44 was less than zero, and it is a hydrophilic protein (Figure .7-b).
The analysis of cis-acting elements on the Jr4CL44 promoter (Figure .7-c,d) showes that it contained a variety of abiotic stress and hormone response elements, such as drought response elements (MBS) and anaerobic induced regulatory elements (ARE), suggesting that Jr4CL44 may be involved in the regulation of abiotic stress tolerance. At the same time, there are several hormone-related elements such as Methyl Jasmonate (CGTCA-motif) and Abscisic acid (ABRE), which indicated that this gene could respond to various external hormone signals, and may play an important role in plant growth and development.
Cloning of Jr4CL44 gene and identification of transgenic Arabidopsis
We clone Jr4CL44 gene using Juglans regia L. cDNA as template, and its band size is 1734bp (Figure .8-a). In order to verify the successful transfer of Jr4CL44 into Arabidopsis Thaliana, RNA and DNA levels of the transformed plants are verified. RNA level verification showes that the PCR band size is about 1734bp, which is consistent with the expected results, indicating that transgenic Arabidopsis thaliana plants are obtained (Figure .8-b). At the DNA level, there are bands in transgenic Arabidopsis using 18s as a reference, but no bands in WT (Figure .8-c).
Determination of physiological indexes of transgenic Arabidopsis thaliana under drought stress
To determine the response ability of Jr4CL44 to drought stress, three Jr4CL44-OE A. thaliana lines and the WT control are cultured under normal and drought stress for 20 days, respectively. As shown in Fig.9-a, WT and transgenic A. thaliana grow well without significant differences in the control of pouring water, while under drought stress, transgenic A. thaliana leaves wilte and yellow more seriously than WT, suggesting that overexpression of Jr4CL44 increased Arabidopsis tolerance to drought. After obtaining the transgenic Arabidopsis thaliana, we quantitatively detect the expression level of Jr4CL44 in WT and overexpressed lines using real-time fluorescence. The results show that the expression level of Jr4CL44 in transgenic Arabidopsis thaliana plants is significantly higher than that in WT ( Fig.9-b).
In order to investigate the function of Jr4CL44 in lignin synthesis pathway, the lignin content in leafs of Arabidopsis thaliana lines at 6 weeks and 8 weeks of age is determined by thioglycolic acid method. The results show that the lignin content of Jr4CL44-OE A. thaliana lines is significantly increased compared with the WT (the content of S-type lignin increased), indicating that Jr4CL44 plays a positive regulatory role in lignin synthesis ( Fig.9-c,d).
The lignin synthesis pathway is closely related to the flavonoid synthesis pathway. In order to study the effect of Jr4CL44 on the flavonoid synthesis pathway, the flavonoid content in Arabidopsis thaliana lines growing normally for 1 month is determined according to the above method. The results show that the flavonoid content of OE-Jr4CL44 is not significantly different from that of WT (Fig.9-e). To investigate the effect of overexpression of Jr4CL44 on downstream gene expression of lignin synthesis pathway, we analyze the expression changes of genes encoding multiple enzymes downstream of 4CL in WT and OE-Jr4CL44 Arabidopsis thaliana. The result showes that C3H, F5H, COMT and CCR genes are significantly up-regulated, indicating that overexpression of Jr4CL44 affectes the expression of some genes downstream of the lignin pathway ( Fig.9-f).
Through DAB and NBT staining of Arabidopsis leaves, it is found that under normal growth conditions, the staining of WT and transgenic Arabidopsis are relatively light and have no significant difference. However, the color of overexpressed Arabidopsis leaves is light under drought stress (Fig.10-a), indicating that the degree of oxidative damage suffered by transgenic Arabidopsis is low under drought stress.
The results of chlorophyll content determination show that there is no significant difference between WT and overexpressed lines under normal conditions, but the chlorophyll content of overexpressed Jr4CL44 is significantly higher than that of WT under drought stress. Electrolyte leakage showes that the electrolyte content of WT is higher than that of OE1, OE2 and OE3 under drought stress, indicating that overexpression of Jr4CL44 reduces the electrolyte leakage of Arabidopsis thaliana under drought stress. Under normal conditions, there is no significant difference in fresh weight between WT and transgenic plants, but the fresh weight of WT and overexpressed lines decrease significantly after drought stress, and the fresh weight content of transgenic lines are higher than that of WT (Fig.10-b).
Additionally, we can know that overexpression of OE-Jr4CL44 could increase the content of various substances to resist stress after drought stress by measuring SOD, POD of two types of Arabidopsis thaliana, but the difference is not significant with WT under normal condition. Therefore, overexpression of Jr4CL44 remarkably increases tolerance to drought stress in Arabidopsis thaliana (Fig.10-b).