Characterization of variegated testa by stereo microscope.
Pigmentation in colored area (F) of the variegated peanut testa began DAF30, which is the active period of related gene expression. The “a” and “b” values of the non-colored area (B) were lower than in colored area during testa development with a slight change, and the “L” value in non-colored area was higher than in colored area, possibly indicative of the white color in non-colored area. As the pod matured after DAF45, the color of variegated testa stabilized. The values of “a” and “b” in colored area changed slightly and “L” increased 12.85% (Fig. 1, Table S1). The colored areaand the non-colored areaare located in the same seed coat with identical genetic backgrounds, which is ideal for the study of DEGs in variegated testa pigmentation.
Measurement of anthocyanins.
The colored (F1 and F2) and the non-colored (B1 and B2) areas were compared and analyzed horizontally and vertically at DAF 30 and DAF45 In the comparison of F1-B1, the content of procyanidin A1, A2, B2, B3, delphinidin, cyanidin, cyanidin 3-O-galactoside, and rosinidin O-hexoside in colored area increased by 185.40-895.58%, while the relative content of petunidin 3-O-glucoside and cyanidin O-syringic acid in colored area were lower than in non-colored area, which decreased by 62.70-76.92% (Fig. 2a). In the comparison of F2-B2, the relative content of procyanidin A1, A2, B2, B3, delphinidin 3-O glucoside, delphinidin, cyanidin, cyanidin 3-O-galactoside, rosinidin O-hexoside in the color areas increased by 106.54-1,759.77%, while the relative content of cyanidin 3-O-galactoside and cyanidin O-syringic acid in non-colored area increased by 35.25-80.35%, compared with that in colored area (Fig. 2b). In the comparison of B1-B2, the relative content of petunidin 3-O-glucoside, delphinidin, cyanidin 3-O-galactoside at DAF45 increased by 43.73-81.62%, while the relative content of cyanidin O-syringic acid, cyanidin 3-O-glucoside, procyanidin A1, rosinidin O-hexoside at DAF45 decreased by 152.39-313.82% compared with DAF30 (Fig. 2c). The metabolites between F1 and F2 were almost identical. The relative content of delphinidin 3-O-glucoside, petunidin 3-O-glucoside, and rosinidin O-hexoside at DAF45 was 48.44-92.25% higher than at DAF30, while the relative content of cyanidin O-syringic acid at DAF45 decreased by 386.24% (Fig. 2d, Table S2).
RNA Sequencing.
Transcriptome sequencing was performed on F1, B1, F2, and B2 of the variegated peanut accession VG-01. The contaminated adapter sequences and low-quality read sequences were all removed from the sequencing data. A total of 64.33Gb clean data were obtained, and the clean data of each sample reached 6.53Gb. The base recognition error rate was about 0.02%, the Q20 value of the obtained sequence was above 97.45%, the Q30 base percentage was above 93.23%, and the GC content was above 43.90% (Table S3)
Detection of DEGs. Comparison to the cultivar tetraploid reference genome of peanut and hierarchical clustering analysis based on the FPKM of the DEGs, a total of 214 differential genes in F1-B1 were identified, of which 53 were up-regulated and 161 were down-regulated. There were 348 differential genes identified in F2-B2, including 97 up-regulated and 251 down-regulated. There was a total of 152 differential genes in B1-B2, including 82 up-regulated and 70 down-regulated genes, and 213 differential genes in F1-F2, with 169 up-regulated and 44 down-regulated (Fig. 3, Table S4).
GO enrichment analysis of DEGs.
1,050 DEGs (accounting for 82.48% of all differential genes) were compared with GO public databases of tetraploid cultivated peanut and diploid wild peanut, respectively. The comparison results of the tetraploid reference genome showed that F1-B1, F2-B2, F1-F2, and B1-B2 comparison groups were all enriched in 51 GO entries and the diploid genome comparison showed that 4 comparison groups were all enriched in 41 GO entries (Table S5). Following comparative analysis, the enrichment items in 3 branches of 4 comparison groups were horizontally similar. Analysis of the tetraploid cultivar transcriptome data revealed that the enriched items in the cellular components category were membrane, and membrane part, the molecular function were enriched in catalytic activity, binding, and transporter activity. In addition, biological processes showed that DEGs enriched in metabolic, single-organism, and cellular processes (Fig.4).
COG enrichment analysis of DEGs.
The annotated genes were compared with the COG database and it was found that F1-B1 contained the largest DEGs proportion (R-General functional prediction only), accounting for 12.61%, followed by Transcription, accounting for 6.54%. Amino acid transport and metabolism accounted for 5.61 %. Energy production and conversion, and Defense mechanisms accounted for 4.67%. F2-B2 had the largest DEGs proportion (R-General functional prediction only), accounting for 10.63%, followed by Transcription, accounting for 6.32%. Amino acid transport and metabolism accounted for 5.61 %. Energy production and conversion, and Defense mechanisms accounted for 4.89%. DEGs in B1-B2 contained the largest DEGs proportion (R-General functional prediction only), accounting for 5.92%, followed by Transcription, accounting for 4.61%. The synthesis, transport, and metabolism of Secondary metabolites biosynthesis accounted for 3.29%. F1-F2 had the largest DEGs proportion (R-General functional prediction only), accounting for 7.51%, followed by Modification and transport after transcription and protein translation, accounting for 5.63% (Fig. 5).
KEGG pathways enrichment analysis of DEGs.
In order to understand the biological functions of DEGs discussed above, the transcriptome sequencing results were compared with the KEGG public database of tetraploid cultivated peanut and diploid wild peanut. The results showed that 363 DEGs (28.52% of all differential genes) were annotated in the KEGG database. Among them, the metabolic pathways associated with testa color enrichment in F1-B1 included phenylalanine metabolism, phenylpropanol biosynthesis, flavonoid biosynthesis, and plant circadian rhythm. Testa color-related enrichment metabolic pathways in F2-B2 included phenylpropanol biosynthesis, flavonoids and flavonol biosynthesis, and flavonoid biosynthesis. Testa color-related enrichment metabolic pathways in B1-B2 included plant hormone signals transduction and biosynthesis of phenylalanine, tyrosine and tryptophan. Testa color-related enrichment pathways in F1-F2 included phenylalanine, tyrosine, and tryptophan biosynthesis. The diploid and tetraploid reference genomes were compared based on different parts of the same period and different periods of the same part and 6 metabolic pathways related to anthocyanin biosynthesis were screened out, including phenylpropane acid metabolism, phenylpropanol biosynthesis, flavonoids and flavonol biosynthesis, flavonoid biosynthesis, plant hormone signal transduction, and circadian rhythm in plants. (Fig. 6, Table S6).
Structural genes related to testa pigment synthesis.
The Veen diagram analysis showed that 71 DEGs co-expressed between F1-B1 and F2-B2, and 143 DEGs presented specifically F1-B1 while 277 DEGs particularly expressed in F2-B2(Fig.7c, Table S7). 15 DEGs co-expressed between B1-B2 and F1-F2, and 137 DEGs presented specifically B1-B2 while 198 DEGs particularly expressed in F1-F2 (Fig.7c, Table S7). KEGG analysis showed that both F1-B1 and F2-B2 involved differential genes related to peanut testa color and the expression of DEGs was closely related to testa development. Genes involved in the pigment synthesis included 3 PAL,1 C4H, 2 CHS, 1 F3H, 1 F3'H, 2 DFR, 2 LAR, and 2 IAA (Table 1). Three PAL, 1 C4H, 2 CHS, and 1 F3'H in non-colored area were down-regulated compared with the colored area. Two DFR, 2 LAR, and 2 IAA in non-colored area were up-regulated compared with the colored area (Fig. 7a), involving the entire anthocyanin metabolism pathway key enzyme gene metabolic pathway.
Table 1 Genes related to anthocyanin synthesis in the variegated peanut testa among different comparison groups.
gene ID F1-B1
|
F1 FPKM
|
B1 FPKM
|
FDR
|
log2FC
|
regulated
|
annotation
|
arahy.Tifrunner.gnm1.ann1.0B4MFB
|
9.054759
|
1.449418
|
0.000731142
|
-2.718222922
|
down
|
PAL
|
arahy.Tifrunner.gnm1.ann1.NB2RRK
|
9.630893
|
0.917825
|
8.56E-11
|
-3.465122425
|
down
|
PAL
|
arahy.Tifrunner.gnm1.ann1.PUMP6L
|
12.603869
|
3.956659
|
0.004951254
|
-1.736346256
|
down
|
PAL
|
arahy.Tifrunner.gnm1.ann1.AYA1A5
|
2.3203895
|
0.26419
|
0.00547686
|
-3.06931628
|
down
|
C4H
|
arahy.Tifrunner.gnm1.ann1.CY60UM
|
22.02194
|
6.7419675
|
0.003171101
|
-1.771575358
|
down
|
DHS2
|
arahy.Tifrunner.gnm1.ann1.6J3HHE
|
10.014363
|
0.6830215
|
5.04E-10
|
-3.917616027
|
down
|
CHS
|
arahy.Tifrunner.gnm1.ann1.UDJX6I
|
91.9906315
|
20.076782
|
1.65E-05
|
-2.256203498
|
down
|
CHS
|
arahy.Tifrunner.gnm1.ann1.CB6084
|
0.550337
|
9.0050005
|
4.54E-09
|
3.958349247
|
up
|
IAA14
|
arahy.Tifrunner.gnm1.ann1.QUY0YV
|
0.6811405
|
5.7989815
|
0.000156105
|
3.016455081
|
up
|
IAA14
|
arahy.Tifrunner.gnm1.ann1.X2EPYD
|
0
|
14.942496
|
8.91E-43
|
Inf
|
up
|
CSNK2B
|
gene ID F2-B2
|
F2 FPKM
|
B2 FPKM
|
FDR
|
log2FC
|
regulated
|
annotation
|
arahy.Tifrunner.gnm1.ann1.NB2RRK
|
5.903561
|
0.920466
|
8.56E-11
|
-3.465122425
|
down
|
PAL
|
arahy.Tifrunner.gnm1.ann1.K8H9R8
|
11.108361
|
26.2737925
|
0.005593953
|
1.202514755
|
up
|
F3'H
|
arahy.Tifrunner.gnm1.ann1.HE8J5U
|
27.034563
|
62.0989035
|
0.009349418
|
1.160958266
|
up
|
F3H
|
arahy.Tifrunner.gnm1.ann1.X8EVF3
|
29.294177
|
152.0960005
|
1.81E-05
|
2.334863116
|
up
|
DFR
|
arahy.Tifrunner.gnm1.ann1.7JZ58T
|
11.6955015
|
144.9762875
|
4.15E-08
|
3.60358824
|
up
|
DFR
|
arahy.Tifrunner.gnm1.ann1.M05HM0
|
0.8104485
|
8.8487325
|
8.14E-13
|
3.265464934
|
up
|
LAR
|
arahy.Tifrunner.gnm1.ann1.T1J2UZ
|
1.2703215
|
18.3464365
|
2.87E-08
|
3.848946241
|
up
|
LAR
|
arahy.Tifrunner.gnm1.ann1.D1ITL5
|
1.7008935
|
0.1929725
|
0.000903522
|
-3.16215485
|
down
|
BG
|
arahy.Tifrunner.gnm1.ann1.13Q8F8
|
30.508549
|
0
|
2.99E-32
|
#NAME?
|
down
|
EC
|
arahy.Tifrunner.gnm1.ann1.49I11A
|
12.856979
|
3.036938
|
2.45E-07
|
-2.118638966
|
down
|
POD
|
arahy.Tifrunner.gnm1.ann1.D687JL
|
0.4383385
|
5.6579165
|
8.33E-08
|
3.520180624
|
up
|
ORR10 X1
|
arahy.Tifrunner.gnm1.ann1.CB6084
|
0.550337
|
9.0050005
|
4.54E-09
|
3.958349247
|
up
|
IAA14
|
arahy.Tifrunner.gnm1.ann1.QUY0YV
|
0.6811405
|
5.7989815
|
0.000156105
|
3.016455081
|
up
|
IAA14
|
Analysis of transcription factors differentially expressed in transcriptome.
The transcription factors in the sequencing results were analyzed and it was found that 9 differentially expressed MYBs were screened from 410 MYB transcription factors, 4 differentially expressed bHLH were screened from 278 bHLH transcription factors, and no differentially expressed in WD40 family transcription factors(Fig.7b). All MYB regulatory factor genes were enriched in the 2 metabolic pathways of splicing and plant circadian transduction,and all bHLH regulatory factor genes were enriched in the two metabolic pathways of plant hormone signal transduction and plant circadian rhythm.However, the differentially expressed MYB and bHLH transcription factors were not annotated clearly among the metabolic pathways, and the regulatory role of these differential transcription factors in the metabolic pathway was unclear (Table S8).
Joint analysis of transcriptome and metabolome.
The combined analysis of metabolome and transcriptome showed that the differential genes in the flavonoid biosynthetic pathway are directly related to the synthesis of delphinidin and cyanidin. The F1-B1 correlation results showed that higher delphinidin and cyanidin content in colored area compared with the non-colored area caused variegated testa, due to the up-regulation of 2 CHS and 1 C4H in colored area produces. The F2-B2 correlation results showed that the up-regulation of 2 DFRs, 1 F3'H,1 F3H, and 2 LARs in non-colored area caused the accumulation of procyanidin, but leading to the significant increase in the content of delphinidin and cyanidin in colored area compared with the non-colored area (Fig. 8).
qPCR analysis of differential gene expression levels.
Fluorescence quantitative qPCR verification was performed on 20 genes related to anthocyanin metabolism. The results showed that nine genes were verified in F1-B1 (Fig. 9a), F2-B2 (Fig. 9b), B1-B2,(Fig. 9c), and 11 genes were verified in F1-F2 (Fig. 9d),which were consistent with transcriptome results. The 20 selected differential genes in the four comparison groups showed similar qPCR expression trends to the transcriptome detection results (Fig. 9).