Fine mapping of REnBo2 locus using BC2F2 population
The phenotypes of one hundred and fifty-eight plants of BC2F2 populations (Fig. S1) were investigated in terms of stem height, maximum diameter and stem grade. A wide ranged distribution of the mean diameter for stem swelling was observed in the recombinant populations (Fig. S2). The phenotypic data of stem swelling related trait fit a normal continuous distribution, but was slightly skewed toward the less radially enlarged grades. The plants were categorized into four grades (1–4) according to the degree of stem swelling, showing a uniform segregation of the stem swelling trait from grade 1; slim-cylindrical broccoli type, to grade 4; round-spherical kohlrabi type. The recombinant plants were further genotyped using the additional nine CAPS and InDel markers within the REnBo2 interval (Table S2). Consequently, different types of recombinants were observed based on their marker genotypes in this region, and graphical genotypes of the recombinants showing grade 1 and grade 4 stem swelling were created (Fig. S3). However, narrowing down the REnBo2 region proved challenging, as we observed a few individuals whose graphical genotypes did not match their respective phenotypes. This discrepancy may be attributed to the heterogeneous genetic background of BC2F2 plants. Thus, we did a QTL analysis on C03 chromosome using the phenotypic and genotypic data of all the recombinant plants. A linkage map with a total length of 59.5cM was created using the AntMap ver. 1.2 (Iwata and Ninomiya 2006) with an average marker distance of 6.59Mbp (Fig. 1A). Consequently, a significant LOD peak with a LOD value of 4.8 was detected between the markers Bol012989 and Bol017480 (Fig. 1B). On the linkage map, the range was 10.9cM, and the physical distance was 0.73Mbp in B. oleracea (Braol_JZS_V1.1) reference genome. Thus, through QTL mapping, we were able to narrow down the REnBo2 region, flanked by the markers Bol012989 and Bol017480.
Fine mapping of REnBo2 using BC3F2 and BC3F3 populations
To further delimit the target region, we utilized a total of 592 BC3F2 plants derived from the DH lines of kohlrabi and broccoli for fine mapping. Two flanking markers Bol017491 and Bol035216 were used for preliminary screening. Sixty-seven recombinants from the BC3F2 populations were further genotyped and used to refine the target locus. In addition to the above-mentioned nine markers, another six CAPS and InDel markers were designed at the target region (Table S1); and a total of fifteen markers were used to genotype all these recombinants. Stem swelling related traits of all the recombinants, which were grown in the incubator controlled at the temperature of 18°C, were assessed at 75 DAG. A wide range of segregation was observed among the recombinant populations for all the stem swelling traits (Fig. S4). We created graphical genotypes for the recombinants that had either kohlrabi or broccoli homozygous alleles at the target region, along with the phenotype data (Fig. 2, details in table S3). The REnBo2 locus was restricted in a 42.8-kb interval flanked by the markers Bol012971 and Bol012974. The correlation between the phenotypes and genotypes of a marker (Bol012972) within the fine-mapped region was observed (Fig. S5). All the recombinants revealed a distinct correlation, where the individuals having kohlrabi-homozygous alleles showed a maximum stem diameter with kohlrabi (grade 4)-type phenotype. On the other hand, the recombinants showing broccoli-type (grade 1) stem with a maximum diameter of 20 mm, possessed broccoli-homozygous alleles at the target locus (Fig. S5). The heterozygous individuals exhibited a wider variation in stem diameter and grade, ranging from grade 1 (broccoli type) to grade 4 (kohlrabi type) swelling pattern and thus the recombinants having heterozygous alleles at the target region were excluded from the graphical genotypes in Fig. 2.
Furthermore, we reconfirmed the REnBo2 fine mapping using the self-progeny (BC3F3) of twelve selected recombinants from the BC3F2 populations (Fig. S1). The criteria for the selection of BC3F2 recombinants were mentioned in Materials and methods section. A total of 216 BC3F3 plants were developed and genotyped using the all fifteen markers within the REnBo2 interval (Table S4). The segregation patterns of these selected plants in the BC3F3 populations for stem swelling were shown in Fig. S6. Graphical genotypes were created using the recombinants having either kohlrabi or broccoli homozygous alleles at the target region (Fig. 3A). The graphical genotypes, together with the phenotypic data of the recombinants of each line suggested the confirmation of the previously delimited region flanked by the marker Bol012971 and Bol012974, and the physical distance was 42.8 kb.
Table 1
Prediction and annotation of candidate genes in the fine-mapped region of REnBo2 in three different Brassica oleracea reference genomes. The details of the candidates are shown in the Supplementary table 5.
Gene name in Braol JZS V1.1 | Gene name in Braol HDEMV1 | Gene name in ‘TO100’ | Orthologue in A. thaliana | Gene annotation |
Bol012972 | BolC3t21148H | Bo3g175290 | AT4G30960 | CBL-interacting protein kinase 6, CIPK6 |
Bol012973 | BolC3t21150H | Bo3g175310 | AT4G30950 | Fatty acid desaturase 6, FAD6 |
Bol012974 | BolC3t21152H | Bo3g175320 | AT4G30935 | WRKY transcription factor 32, WRKY32 |
Candidates of REnBo2
According to the B. oleracea reference genome (Braol_JZS_V1.1, Liu et al. 2014), the fine-mapped interval contained four predicted genes, designated as Bol012971 through Bol012974. In the corresponding region, there are three, four and five genes in the other B. oleracea reference genomes, Braol_JZS_V2.0 (B. oleracea var. capitate, Cai et al. 2020), TO1000’ (B. oleracea var. alboglabra, Lv et al. 2020) and Braol_HDEM_V1.0 (B. oleracea var. italica, Belser et al. 2018), respectively (Fig. 3B). Among the four genes on Braol_JZS_V1.1, Bol012971 exist only in the Braol_JZS_V1.1 but not in Braol_JZS_V2.0 and in two other B. oleracea genomes (TO1000 and Braol_HDEM_V1.0), indicating that it might be a mistake in gene prediction on the Braol_JZS_V1.1, which is the older version of ‘Braol_JZS_V2.0’ genome. Thus, we speculated that Bol012971 might not exist in REnBo2 and discarded it from further analyses. The rest three genes, Bol012972, Bol012973 and Bol012974 encodes CBL interacting protein kinase 6 (CIPK6), omega-6 fatty acid desaturase protein (FAD6) and WRKY DNA binding transcription factor 32 (WRKY 32) protein, respectively (Table 1). Besides, three small-sized annotated genes, BolC3t21149H and BolC3t21151H in ‘Braol_HDEM_V1.0’ and Bo3g175300 in ‘TO1000’ genome were also detected (Table S5).
Structure analyses of the candidates
To further assess the candidates for the REnBo2, the coding (CDS) and amino acid sequences of the three candidates were compared between the parents. Genomic DNA sequence for the candidate Bol012972 (CIPK6) was 1309 bp, encoding 435 amino acids in both KDH and GCP04. A total of 13 SNPs between the parents were detected, of which six showed non-synonymous substitutions resulted in the change in four amino acids at the kinase domains (STKc_SnRK3 and SPS1) (Fig. 4A). To further study the function of CIPK6, the 3-kb region upstream of the start codon in parental lines, Arabidopsis and three B. oleracea subspecies were analyzed. The hormone related cis-elements including auxin, gibberellin and ABA responsiveness were repeatedly detected in promoter of KDH and GCP04 (Fig. S7A). A single non-synonymous mutation was observed in second exon of Bol012973 (FAD6) causing an amino acid substitution (Proline to Leucine) between the parents (Fig. 4B). A total of six SNPs between the parents in the first, second, third and fourth exons of Bol012974 (WRKY32) were observed. Among them, five were non-synonymous type causing five AA substitutions including one substitution (Alanine to Threonine) at WRKY domain (Fig. 4C).
To analyze the conserved regions of the candidate genes in the parents, A. thaliana and three other Brassica subspecies, their conserved domains and motifs were examined, and their putative protein sequences were aligned. The AA sequences of CIPK6 were analyzed with the CD-Search software, showing that they each had four domains including three kinases (STKc_SnRK3, SPS1 and CIPK_C) and one NAF domain (Fig. S7B). Three conserved domains including DesA, Delta12_FADS like and FA desaturase in FAD6 and two WRKY domains in WRKY32 were identified in the parents, Arabidopsis and three other Brassica subspecies (Fig. S8). Ten conserved motifs along the whole protein sequences for each of CIPK6, FAD6 and WRKY32 of parental lines, Arabidopsis and other three B. oleracea subspecies were shown in Fig. S8. Regarding the other three small-sized annotated genes in fine-mapped region, we did not find any homologous sequence of Bo3g175300 in parental lines, and the AA sequences of BolC3t21149H and BolC3t21151H were similar between the parents and in a B. oleracea subspecies, broccoli (HDEM) (Fig. S9). Therefore, these small-sized genes are not candidate genes for the REnBo2. In addition, Bol012973 (FAD6) is unlikely to be a candidate gene due to its function as a fatty acid desaturase.
Homology and phylogeny analyses of CIPK6 and WRKY32
To reveal the evolutionary relationships of two candidate genes, the global homology and phylogenetic analyses of these genes in Arabidopsis and different Brassica species including B. oleracea, B. rapa, B. napus, B. juncea, B. nigra and B. carinata were performed with the putative protein sequences using MEGA (version 5) software. (Fig. S10). Three homologs of CIPK6 located on C01, C03 and C07 and two homologs of WRKY32 on C03 and C07 were detected in each B. oleracea subspecies (cabbage, broccoli and Chinese kale), respectively. In overall, the phylogeny analysis revealed that the parents, kohlrabi and broccoli were close to the cabbage and broccoli reference genomes, respectively, in regard to CIPK6 and WRKY32.
Expression pattern analysis of candidate genes
To determine if the mRNA expression levels in the candidate genes might lead to functional differences, their expression patterns in early developmental stages were analyzed by both transcriptome and qRT-PCR (Figs. 5 and S11). Based on qRT-PCR data, CIPK6: C03 at chromosome C03 was up-regulated in kohlrabi stem from 10 to 20 DAG with a significantly lower expression at 10 DAG compared to the broccoli. On the contrary, in broccoli stem, CIPK6: C03 was constantly highly expressed at all the growth stages tested as revealed by qRT-PCR analysis. In the transcriptome analysis using RNA-seq, CIPK6: C03 showed significantly lower expression in kohlrabi stem at both growth stages compared to the broccoli (Fig. 5A). Regarding the homologs, one homolog of CIPK6: C01 showed up-regulation in kohlrabi stem at 10 DAG, but at 20 DAG, there was no significant difference between the stem of kohlrabi and broccoli (Fig. S11A). Another homolog of CIPK6: C07 was down-regulated in kohlrabi stem at both growth stages compared to broccoli (Fig. S11B). WRKY32: C03 was up-regulated in kohlrabi stem at all the growth stages tested in both qRT-PCR and RNA-seq, but there was no significant difference in expression of its only homolog on chromosome C07 (Bo7g114840) (Figs. 5B and S11C). FAD6 showed significantly higher expression in broccoli stem at 20 DAG, but non-significant at 10 DAG (Fig. S11D).