Principal Coordinate Analysis (PCO) of marker data
The cabbage collection utilized in this study included accessions representing different crop types obtained from multiple sources (open-pollinated accessions from gene bank collections and modern F1 hybrids from companies) and collected from different geographical regions. The number of cabbage accessions included in each trial varied (Supplementary Table S1) and as a consequence the SNP set for each trial also varied in number after filtering for a GC ≥ 80% and MAF≥ 2.5%. The SNP filtering resulted in the identification of 10,712, 12,803, and 10,112 SNPs over the nine physical chromosomes of the JZS v1 genome, and 3307, 3830, and 3081 over “chromosome zero” respectively for 2017, 2018, and 2019 SNP datasets. Most of these SNPs occur in the three data sets. These SNP datasets were utilized to assess the genetic diversity among the cabbage accessions included in each trial. Overall, the PCO plots of 2017 (Fig. 2), 2018 (Supplementary Fig. S1), and 2019 (Supplementary Fig. S2) show similar patterns. The first two principal components show three clusters of accessions (Fig. 2, Supplementary Fig. S1 and S2). The cluster at the left consists almost exclusively of white cabbages coming mostly from the Balkan Peninsula (Bulgaria, Greece, Macedonia, and Turkey). The cluster at the top right includes a dense group of red cabbages. The cluster at the bottom right comprises partly overlapping groups of white, pointed, and savoy cabbages. The more sparsely populated area in the middle includes white, pointed, savoy, and red cabbages. On closer inspection, the first two principal components show that most of the modern hybrid’s cluster at the right, while the genebank accessions are distributed over the whole range from left to right (Fig. 2, Supplementary Fig. S1 and S2). Few accessions dissociate from their peers (Fig. 2, Supplementary Fig. S1 and S2). Most of these accessions show intermediate phenotypes, like red cabbages (Supplementary Fig. S3) with low red pigmentation, or a pointed cabbage and a savoy cabbage (Supplementary Fig. S4) with red (-dish) leaves.
Phenotypic assessment
The adjusted trait estimates for each cabbage accession obtained with SpATS (Rodriguez et al. 2018) were used in all further analyses. Descriptive statistics were calculated for the common traits with all the accessions included in each trial (Table 3) and with only the 139 common accessions (Supplementary Table S5), and the trial-specific traits for all accessions (Supplementary Table S6). For the area, length, and width of rosette leaves the white, red, savoy, and pointed accessions scored values within the same range. This was not the case for the head traits area, width, and height, where the pointed accessions were significantly larger and red significantly smaller (Table 3 and Supplementary Table S5). The PCAs for phenotypic traits (Supplementary Fig. S5a, S5b and S5c) also show that red cabbages have the smaller heads, mainly grouping in the quadrants opposite to the direction of the vectors of the head area, head width, and head height, and opposite to the areas containing the white, savoy and pointed cabbage accessions.
The descriptive statistics (Table 2 and Supplementary Table S5 and S6) and the PCAs (Supplementary Fig. S5a, S5b and S5c) show phenotypic relationships between and within cabbage rosette leaf and head traits. To visualize these relationships scatter plots, histograms, and correlation coefficients were calculated for the common traits with all the accessions included in each trial (Fig. 3) and the 139 common accessions (Supplementary Fig. S6). Additionally, the complete set of correlations was calculated for all the cabbage accessions including the eight phenotypic traits scored in 2017, 12 in 2018, and 14 in 2019 (Supplementary Fig. S7). The PCA plots (Supplementary Fig. S5a, S5b and S5c), the year-specific correlations, and the overall correlations -correlations values without year distinction- (Fig. 3, Supplementary Fig. S6 and S7) show a high positive correlation (≥ 0.5) between the area, width, and height of the heads except for head width with head height in 2017 (0.345) and 2019 (0.479). The correlations between the area, width, and length of rosette leaves are high except for leaf width with leaf length (0.383) in 2018 (Figure 3 and Supplementary Fig. S6 and S7). Interestingly, in 2018, correlations between rosette leaf traits and head traits were identified with R-squared (R2) values ranging from -0.410 till 0.514, while in 2017 from -0.197 till 0.293 and in 2019 from -0.407 till 0.262. In 2018, the leaf width showed a slightly higher correlation with both the head area (0.514), and head height (0.514) than the leaf area with the head area (0.486), head height (0.449), and head width (0.405) (Fig. 3 and Supplementary Fig. S6 and S7), while correlation between these head traits and leaf length were low (-0.020 till 0.085). These latter observations indicate that cabbage plants with larger and wider rosette leaves produced larger heads in 2018. Moreover, the four cabbage crop types (white, pointed, savoy, and red) show similar leaf ratio values, meaning that their rosette leaf shapes are similar. This is not the case for head ratio, as the white and savoy cabbages show lower ratios than the pointed and red cabbages (Table 3 and Supplementary Table S5).
The histograms of the 2017 and 2019 trials (where heads were harvested based on maturity) show lower values with less variation for the head traits area, height, and width and higher values with more variation for the rosette leaf area, length, and width scored at the heading stage than the histograms for the 2018 trial (where heads were harvested in a narrow time window (Fig. 3 and Supplementary Fig. S6 and S7).
Table 3 Descriptive statistics of the phenotypic values scored at heading stage for leaf and head traits. The number of accessions (n), Standard Deviation (S.D.), Coefficient of Variation (C.V.), grouping based on Least Significant Difference (LSD). LSD groups are limited within year and trait
All cabbage accessions
|
Trait
|
Year
|
Sub-morphotype
|
n
|
Mean ± S.D.
|
C.V.
|
Overall mean
|
C.V.
|
LSD group
|
Leaf Area
|
2017
|
White
|
200
|
889.36 ± 324.69
|
0.37
|
905.59 ± 321.42
|
0.35
|
a
|
Red
|
40
|
959.99 ± 274.62
|
0.29
|
a
|
Savoy
|
40
|
947.31 ± 368.63
|
0.39
|
a
|
Pointed
|
11
|
850.63 ± 216.21
|
0.25
|
a
|
2018
|
White
|
122
|
590.40 ± 125.74
|
0.21
|
563.03 ± 131.19
|
0.23
|
a
|
Red
|
27
|
498.89 ± 104.53
|
0.21
|
b
|
Savoy
|
24
|
503.46 ± 133.69
|
0.27
|
a
|
Pointed
|
7
|
647.52 ± 147.83
|
0.23
|
a
|
2019
|
White
|
153
|
823.67 ± 242.37
|
0.29
|
837.80 ± 236.66
|
0.28
|
a
|
Red
|
43
|
885.49 ± 184.13
|
0.21
|
a
|
Savoy
|
39
|
801.79 ± 246.64
|
0.31
|
a
|
Pointed
|
11
|
892.02 ± 294.28
|
0.33
|
a
|
Leaf Length
|
2017
|
White
|
200
|
38.15 ± 8.62
|
0.23
|
38.42 ± 8.35
|
0.22
|
a
|
Red
|
40
|
39.96 ± 6.51
|
0.16
|
a
|
Savoy
|
40
|
38.83 ± 9.28
|
0.24
|
a
|
Pointed
|
11
|
36.18 ± 5.15
|
0.14
|
a
|
2018
|
White
|
122
|
34.56 ± 4.56
|
0.13
|
33.87 ± 5.04
|
0.15
|
a
|
Red
|
27
|
35.12 ± 4.12
|
0.12
|
a
|
Savoy
|
24
|
29.57 ± 5.91
|
0.20
|
b
|
Pointed
|
7
|
35.31 ± 6.30
|
0.18
|
a
|
2019
|
White
|
153
|
39.68 ± 8.03
|
0.20
|
40.80 ± 8.14
|
0.20
|
b
|
Red
|
43
|
46.09 ± 6.13
|
0.13
|
a
|
Savoy
|
39
|
39.92 ± 8.32
|
0.21
|
b
|
Pointed
|
11
|
38.79 ± 8.61
|
0.22
|
b
|
Leaf Width
|
2017
|
White
|
200
|
33.03 ± 6.70
|
0.20
|
33.31 ± 6.54
|
0.20
|
a
|
Red
|
40
|
34.25 ± 5.89
|
0.17
|
a
|
Savoy
|
40
|
33.98 ± 6.95
|
0.20
|
a
|
Pointed
|
11
|
32.47 ± 4.08
|
0.13
|
a
|
2018
|
White
|
122
|
26.98 ± 3.84
|
0.14
|
26.23 ± 3.79
|
0.14
|
a
|
Red
|
27
|
24.14 ± 2.82
|
0.12
|
a
|
Savoy
|
24
|
25.13 ± 3.21
|
0.13
|
a
|
Pointed
|
7
|
27.85 ± 3.82
|
0.14
|
a
|
2019
|
White
|
153
|
32.80 ± 5.27
|
0.16
|
32.79 ± 5.03
|
0.15
|
a
|
Red
|
43
|
33.65 ± 4.21
|
0.13
|
a
|
Savoy
|
39
|
31.57 ± 4.51
|
0.14
|
a
|
Pointed
|
11
|
33.65 ± 5.90
|
0.18
|
a
|
Leaf Ratio
|
2017
|
White
|
200
|
1.22 ± 0.29
|
0.24
|
1.21 ± 0.26
|
0.21
|
a
|
Red
|
40
|
1.16 ± 0.20
|
0.17
|
a
|
Savoy
|
40
|
1.19 ± 0.14
|
0.12
|
a
|
Pointed
|
11
|
1.15 ± 0.10
|
0.09
|
a
|
2018
|
White
|
122
|
1.28 ± 0.24
|
0.19
|
1.29 ± 0.24
|
0.19
|
b
|
Red
|
27
|
1.44 ± 0.18
|
0.13
|
a
|
Savoy
|
24
|
1.20 ± 0.19
|
0.16
|
b
|
Pointed
|
7
|
1.26 ± 0.21
|
0.17
|
b
|
2019
|
White
|
153
|
1.22 ± 0.20
|
0.16
|
1.25 ± 0.20
|
0.16
|
a
|
Red
|
43
|
1.37 ± 0.19
|
0.14
|
a
|
Savoy
|
39
|
1.26 ± 0.16
|
0.13
|
a
|
Pointed
|
11
|
1.16 ± 0.19
|
0.16
|
a
|
Head Area
|
2017
|
White
|
166
|
355.38 ± 75.93
|
0.21
|
347.87 ± 74.41
|
0.21
|
a
|
Red
|
40
|
298.06 ± 51.22
|
0.17
|
b
|
Savoy
|
32
|
349.48 ± 63.12
|
0.18
|
a
|
Pointed
|
11
|
411.01 ± 65.99
|
0.16
|
a
|
2018
|
White
|
105
|
390.10 ± 142.44
|
0.37
|
367.88 ± 150.39
|
0.41
|
b
|
Red
|
27
|
213.98 ± 77.90
|
0.36
|
c
|
Savoy
|
24
|
389.73 ± 126.80
|
0.33
|
b
|
Pointed
|
7
|
559.56 ± 95.08
|
0.17
|
a
|
2019
|
White
|
133
|
232.35 ± 73.53
|
0.32
|
206.95 ± 81.35
|
0.39
|
a
|
Red
|
42
|
122.49 ± 57.79
|
0.47
|
c
|
Savoy
|
34
|
193.10 ± 61.98
|
0.32
|
b
|
Pointed
|
9
|
277.92 ± 51.70
|
0.19
|
a
|
Head Width
|
2017
|
White
|
166
|
22.31 ± 3.20
|
0.14
|
21.56 ± 3.37
|
0.16
|
a
|
Red
|
40
|
18.42 ± 2.69
|
0.15
|
b
|
Savoy
|
32
|
21.60 ± 2.88
|
0.13
|
a
|
Pointed
|
11
|
21.47 ± 3.11
|
0.14
|
a
|
2018
|
White
|
105
|
24.51 ± 5.72
|
0.23
|
23.60 ± 6.23
|
0.26
|
b
|
Red
|
27
|
16.75 ± 4.10
|
0.24
|
c
|
Savoy
|
24
|
25.20 ± 4.85
|
0.19
|
ab
|
Pointed
|
7
|
31.31 ± 2.92
|
0.09
|
a
|
2019
|
White
|
133
|
18.46 ± 4.06
|
0.22
|
16.74 ± 4.55
|
0.27
|
a
|
Red
|
42
|
11.64 ± 3.45
|
0.30
|
c
|
Savoy
|
34
|
16.23 ± 3.03
|
0.19
|
b
|
Pointed
|
9
|
17.04 ± 2.39
|
0.14
|
ab
|
Head Height
|
2017
|
White
|
166
|
19.96 ± 2.60
|
0.13
|
20.39 ± 2.84
|
0.14
|
b
|
Red
|
40
|
20.07 ± 2.02
|
0.10
|
b
|
Savoy
|
32
|
20.81 ± 2.46
|
0.12
|
b
|
Pointed
|
11
|
26.70 ± 2.52
|
0.09
|
a
|
2018
|
White
|
105
|
21.23 ± 4.57
|
0.22
|
20.81 ± 4.82
|
0.23
|
b
|
Red
|
27
|
16.72 ± 2.67
|
0.16
|
c
|
Savoy
|
24
|
21.71 ± 4.92
|
0.23
|
b
|
Pointed
|
7
|
27.41 ± 3.23
|
0.12
|
a
|
2019
|
White
|
133
|
15.66 ± 2.65
|
0.17
|
15.39 ± 3.18
|
0.21
|
b
|
Red
|
42
|
13.33 ± 1.99
|
0.15
|
c
|
Savoy
|
34
|
14.79 ± 2.81
|
0.19
|
bc
|
Pointed
|
9
|
23.27 ± 3.34
|
0.14
|
a
|
Head Ratio
|
2017
|
White
|
166
|
0.86 ± 0.13
|
0.15
|
0.92 ± 0.18
|
0.20
|
d
|
Red
|
40
|
1.08 ± 0.16
|
0.15
|
b
|
Savoy
|
32
|
0.94 ± 0.13
|
0.14
|
c
|
Pointed
|
11
|
1.27 ± 0.24
|
0.19
|
a
|
2018
|
White
|
105
|
0.88 ± 0.13
|
0.15
|
0.92 ± 0.14
|
0.15
|
b
|
Red
|
27
|
1.05 ± 0.14
|
0.13
|
a
|
Savoy
|
24
|
0.93 ± 0.13
|
0.14
|
b
|
Pointed
|
7
|
0.93 ± 0.12
|
0.13
|
ab
|
2019
|
White
|
133
|
0.87 ± 0.15
|
0.17
|
0.96 ± 0.22
|
0.23
|
c
|
Red
|
42
|
1.16 ± 0.23
|
0.20
|
b
|
Savoy
|
34
|
0.93 ± 0.16
|
0.17
|
c
|
Pointed
|
9
|
1.37 ± 0.29
|
0.21
|
a
|
Association of loci to leaf and head traits in cabbage.
The genotypic dataset produced by Cai et al. 2022 provided high-quality SNPs for association studies. In general, these markers are well distributed over the JZS v1 cabbage genome with some genomic regions covered more densely (Supplementary Fig. S8). These markers are utilized for the GWAS of rosette leaf and head traits scored over the three field trials. In total, the 68 GWAS identified 1,703 significant SNP-trait associations (Supplementary Table S7) over the JZS v1 (Liu et al. 2014) genome for the three complete sets of traits (8 for 2017, 12 for 2018, and 14 for 2019) and with both methods to correct the population structure (kinship matrix as default and with or without using the PCO dissimilarity matrix). These 1,703 SNPs represent 481 QTLs (see Material and Methods) and 52 of these QTLs are detected in multiple years so are defined as hotspots (see Material and Methods). These 52 hotspots include 469 SNPs (Supplementary Table S8) and their -10log(P) average (3.76) is significantly higher (p.value = 1.6x10-12) than that of the 1,234 SNPs (3.53) not included in these hotspots. Fourteen hotspots localize at “chromosome zero” (scaffolds not assigned to a chromosome, see Material and Methods) and 38 are distributed over the nine physical chromosomes (Supplementary Table S8). These 52 hotspots are re-mapped to the updated JZS v2 (Cai et al. 2020) reference genome (see Material and Methods) and all fourteen hotspots on chromosome zero could be mapped to the nine physical cabbage chromosomes (Supplementary Table S9). Adjacent hotspots with a separation of less than 55 kb are considered the same hotspot. Thus, the 52 hotspots identified over the JZS v1 genome result in 50 hotspots over the JZS v2 (Supplementary Table S9). These 50 hotspots include 25 associations with both leaf and head traits, 15 associations with only leaf traits, and 10 with only head traits (Fig. 4 and Supplementary Table S8). The correction for population structure influences the number of hotspots identified (Supplementary Table S8). The kinship matrix is utilized in all analyses to correct the population structure, either with or without additional correction based on the PCO dissimilarity matrix. From the 50 hotspots, 17 are identified both with and without using this dissimilarity matrix, 13 only with and 15 only without using this matrix. Five hotspots are detectable only by combining QTLs from analyses with and without the PCO dissimilarity matrix (Supplementary Table S8). To evaluate the validity of both correction methods for population structure, 34 comparisons of the two Q-Q plots of the two GWAS (one for each correction method) are performed (Supplementary Fig. S9, S10 and S11). In these comparisons, 18 Q-Q plots show a slightly more linear distribution of the SNPs -10log(P) values when both correction methods are utilized and 10 when only the kinship matrix is utilized. Six comparisons show no clear difference between both correction methods.
We selected hotspots with allele frequencies >0.20 and marker-trait associations explaining >5% of the variation to visualize the effects of the QTLs using boxplots. Hotspot 10 (in JZS v1; hotspot 27 in JZS v2) includes SNP C00_102232378 significantly associated with the 2018 and 2019 Leaf Ratio at rosette stage and 2018 Leaf Ratio at heading stage, and these associations show an allele frequency of approximately 0.25 and explains eight, six and nine percent, respectively, of the variation (Supplementary Table S8). In these effect plots (Fig. 5, Supplementary Fig. S12a and S12b), the red and white cabbage accessions homozygous for the alternative allele show, overall, higher values in leaf ratio than the accessions homozygous for the reference allele or heterozygous. The differences between the accessions homozygous for the alternative allele and the accessions heterozygous or homozygous for the reference allele is significant. This observation suggests that the reference allele is dominant.
Candidate gene mining
The border limits of each of the 50 hotspots (JZS v2) are expanded 55 Kb up- and downstream for gene mining (Supplementary Table S10). In total, these expanded regions cover 5.7 Mb of the 561 Mb (1.0%) of the JZS v2 genome and include 763 of the 59,064 predicted genes (1.3%). These 763 cabbage genes were blasted in the BRAD database (http://brassicadb.cn/; accessed 26 April 2021) to identify their orthologous genes in A. thaliana and inspect their annotation in the TAIR database (http://arabidopsis.org, accessed 11 May 2021) (Supplementary Table S11). In total, 591 cabbage genes were found to have an ortholog in Arabidopsis and 550 of these are unique and have an ontology/domain/functional annotation (Supplementary Table S11). Sixteen of these 550 genes match the annotation search terms of a candidate gene (material and methods): six are related to phytohormone pathways, eight to leaf development, and two to both sets of annotation terms (Table 4 and Supplementary Table S12).
Table 4 Genes of interest within 55 Kb up-/downstream of QTL hotspots borders. RS = rosette stage; HS = heading stage
Hotspot
|
Hotspot identified with traits
|
Bol name in JZS v2/v1
|
Other names
|
Tair or others description
|
1
|
Leaf Length (HS)
|
BolC01g004800.2J Bol013627
|
CER9 and SUD1
|
Encodes a protein involved in cuticular wax biosynthesis. Lines carrying a recessive mutation in this locus show downward cupped leaves.
|
9
|
Leaf Ratio (RS and HS)
Leaf Width (RS and HS)
Head Weight
|
BolC02g001500.2J Bol005467
|
LMI1
|
Encodes a homeodomain leucine zipper class I meristem identity regulator. Acts together with LFY to induce CAL expression. Has a role in leaf morphogenesis.
|
10
|
Leaf Ratio (RS and HS)
|
BolC02g009950.2J Bol035882
|
OFP8
|
Regulates cell elongation by suppressing the expression of Gibberellin 20 oxidase 1.
|
11
|
Leaf Length (HS)
Leaf Ratio (RS and HS)
Head Ratio
|
BolC02g037390.2J
|
WDS1
|
Its expression is induced by IAA, ABA, and ethylene.
|
14
|
Leaf Area (RS and HS)
Leaf Width (HS)
|
BolC02g059800.2J Bol020633
|
TCP7
|
Transcription factor involved in leaf development.
|
18
|
Leaf Ratio (RS)
Head Area
|
BolC03g013160.2J Bol025978
|
SWC6 and SEF
|
Arabidopsis mutants show serrated leaves.
|
BolC03g013530.2J Bol026004
|
CPL4
|
RNAi suppression mutant lines show epinasty leaves.
|
19
|
Rosette Leaf Number
Head Width
|
BolC03g014880.2J Bol027883
|
AGC1-1 and D6PK
|
Protein kinase involved in polar auxin transport and asymmetrical distribution.
|
21
|
Leaf Area (HS)
Leaf Length (HS)
Leaf Width (HS)
|
BolC03g043570.2 Bol025737
|
TCP4
|
Transcription factor regulated by miR319. Involved in the regulation of leaf differentiation.
|
BolC03g043610.2J Bol025740
|
CTL02 and TEAR2
|
Ligase protein that positively regulates CIN-like TCP activity to promote leaf development.
|
25
|
Leaf Ratio (RS)
Head Ratio
|
BolC04g067970.2J Bol021779
|
ATGH3.11, FIN219, and JAR1
|
JAR1 is an auxin-induced gene. Loss of function mutants are defective in a variety of responses to jasmonic acid.
|
26
|
Leaf Area (RS and HS)
Leaf Length (HS)
Leaf Width (RS and HS)
|
BolC05g007470.2J Bol036810
|
HYL1
|
Encodes a nuclear dsRNA binding protein involved in mRNA maturation. Mutant show hyponastic leaves and less sensitivity to auxin and cytokinin.
|
42
|
Leaf Length (HS)
Head Height
|
BolC08g038570.2J Bol044169
|
AMP1, COP2, HPT, MFO1, and PT
|
Encodes glutamate carboxypeptidase. Various alleles show an increased rate of leaf initiation and cytokinin biosynthesis.
|
44
|
Leaf Ratio (RS and HS)
|
BolC09g002000.2J Bol006070
|
AXR6, CUL1, ETA1, and ICU13
|
Encodes a cullin that is a component of SCF ubiquitin ligase complexes involved in mediating responses to auxin and jasmonic acid.
|
46
|
Leaf Area (HS)
Leaf Length (RS and HS)
Leaf Width (HS)
|
BolC09g026780.2J Bol018919
|
HAT2
|
Homeobox-leucine zipper gene induced by auxin. Involved in regulating auxin-mediated morphogenesis.
|
BolC09g026820.2J Bol018917
|
KUA1 and MYBH
|
Transcription factor that specifically controls cell expansion during leaf development by controlling ROS homeostasis.
|