Phenotypic variation for SW of an association panel of B. napus
SW was investigated for the association panel of 403 B. napus accessions in 2018–19 (Trial 1) and 2019–20 (Trial 2) field trials. Extensive phenotypic variations for SW were observed in the association panel of B. napus (Fig. S1; Table 1, Table S1). For example, SW ranged from 3.30 to 4.60 g/ 1000 seeds (1.4-fold) in Trial 1 and from 3.00 to 5.08 g/ 1000 seeds (1.69-fold) in Trial 2 (Table 1). In addition, high h2 values were observed for SW (Table 1). The correlation coefficient (r) of SW between Trial 1 and Trial 2 were 0.64, which showed that the phenotype had good repeatability in two trials (Fig. S2).
Table 1
Mean, maximum (max), minimum (min), coefficient of variation (CV, %) and heritability (h2) of 1000-seed weight (SW) in an association panel of B. napus in Trial 1 and Trial 2
Year | Mean | Min | Max | CV | h2 (%) |
Trial 1 | 3.46 | 3.30 | 4.64 | 16.57% | 78.64% |
Trial 2 | 4.04 | 3.00 | 5.08 | 12.55% |
Genome-wide association study of SW of B. napus
We performed GWAS with GLM and MLM approaches to identify SNPs associated with SW in B. napus. A total of 340 SNPs were significantly associated with SW across two trials (P < 6.25 × 10− 07) (Fig. 1; Table S2; Table S3). Among the 340 SNPs, 180 were identified in Trial 1, 16 in Trial 2, and 144 were identified by the mean values of two trials (Fig. 1; Table S2; Table S3). The GLM analysis detected a total of 340 SNPs significantly associated with SW, distributed on 18 of the 19 B. napus chromosomes (excluding A08). Chromosomes C06 and A02 had the largest number of significant SNPs (32) and the second largest number of significant SNPs (61 SNPs), respectively (Table S2). Since MLM considers both population structure and kinship, only 3 SNPs were detected by MLM, which distributed on chromosomes A02, C06 and C07 of B. napus, explaining the PVE of 9.72–10.25% (Fig. 1D, E, F; Table S3). These three SNPs were detected simultaneously by GLM and MLM (Table S2; Table S3). Additionally, 10.30% (35/340) of the significant SNPs were identified in more than one trial (including the mean value of two trials), which showed high reliability (Fig. 1, Table S2). For example, the significant SNP marker (chrA02__3219827) was detected by Trial 1, Trial 2 and the mean value of two trials, which could explain 12.81% of the PVE of SW (Fig. 1A, B, C; Table S2). The significant SNP marker (chrC06__29452686) was detected by Trial 1, Trial 2 and the mean value of two trials, which could explain 9.28% of the PVE of SW (Fig. 1A, B, C; Table S2).
Comparison of significant SNPs by GWAS with reported QTLs for SW in B. napus
Based on the Darmor-bzh reference genome, we analysed the co-localization of the significant SNPs detected in our study and the QTLs detected by previous studies (Basunanda et al. 2010; Yang et al. 2012; Li et al. 2014b; Fu et al. 2015; Liu et al. 2015; Wang et al. 2020). Nine significant SNPs detected by GWAS co-localized with the intervals of the QTLs for SW in a previous linkage analyses, including seven SNPs on A03, and two SNPs on A09 (Table 2). For example, the SNP ‘chrA03__18945635’ on chromosome A03 was detected for SW in Trial 1, and explained 8.83% of the PVE, which co-located with the reported QTL (cqSW.A03-2) (Table 2). The SNP ‘chrA09__27535146’ on chromosome A09 was detected for SW with the mean value of the two trials, and explained 9.34% of the PVE (Table 2), which co-located with the reported QTL (cqSW.A09-3) (Table 2).
Table 2
Co-location of SNPs associated with 1000-seed weight (SW) detected by GWAS with an association panel of B. napus and QTLs for SW detected in previous studies.
SNP | Chr | Pos. (bp) | P value | PVE (%) | Experiment | References (Previous QTL loci) |
---|
chrA03__18945635 | A03 | 18945635 | 9.56E-08 | 8.83% | Trial 1 | Wang et al. 2020 |
chrA03__19955110 | A03 | 19955110 | 3.24E-07 | 8.53% | Trial 1 |
chrA03__19955115 | A03 | 19955115 | 2.09E-07 | 8.67% | Trial 1 |
chrA03__19955118 | A03 | 19955118 | 1.76E-07 | 8.98% | Trial 1 |
chrA03__18022289 | A03 | 18022289 | 6.37E-07 | 7.48% | The mean value of two trials |
chrA03__18034286 | A03 | 18034286 | 2.02E-07 | 8.90% | The mean value of two trials |
chrA03__18091274 | A03 | 18091274 | 4.28E-07 | 7.39% | The mean value of two trials |
chrA09__27535146 | A09 | 27535146 | 2.26E-07 | 8.76% | The mean value of two trials | Basunanda et al. 2010; Yang et al. 2012; Li et al. 2014; Fu et al. 2015; Liu et al. 2015 |
chrA09__27915575 | A09 | 27915575 | 1.16E-07 | 9.34% | The mean value of two trials |
New loci and candidate genes for SW in B. napus
The SNP of chrA02__3219827 on chromosome A02 was associated with SW in both Trial 1, Trial 2 and the mean value of two trials (Fig. 1A, B, C; Table S2). The LD decay of chromosome A02 in this association panel was 174 kb (Liu et al. 2021). Based on the LD decay, 174 kb up/down-stream of the significant SNP (chrA02__3219827) was selected to identify candidate genes on A02 and 82 candidate genes were detected (Table S4). Previously, a transcriptome analysis of the immature seeds of two B. napus lines with extremely different SW has been conducted to identify the candidate genes related to SW (Geng et al. 2018). In this study, a comparison of our GWAS and transcriptome sequencing results by Geng et al. (2018) revealed 2 common genes (BnaA02g06330D and BnaA02g06870D) on A02 chromosome (Table 3). In the seeds of large-seed line, the expression levels of BnaA02g06330D and BnaA02g06870D were 4.18 and 3.02 fold higher than those of small-seed line, respectively (Table 3). Twelve SNPs were located within the 2 kb promoter region and the entire coding region of BnaA02g06870D (Table S4; Table S5). However, no SNP was identified within the corresponding region of BnaA02g06330D (Table 3). We performed candidate gene association analysis of BnaA02g06870D, and four SNPs in BnaA02g06870D were significantly associated with SW in Trial 1, Trial 2 and the mean value of two trials (Fig. 2A, B, C). Further analysis demonstrated that the A allele of chrA02__3257485, C allele of chrA02__3257503, C allele of chrA02__3257506 and C allele of chrA02__3257518 were the large-seed alleles (Fig. 3). Two major haplotypes were detected, and cultivars with BnaA02g06870Hap1 (ACCC) had much higher SW than cultivars with BnaA02g06870Hap2 (TTTT), in Trial 1, Trial 2 and the mean value of two trials. Thus BnaA02g06870Hap1 was confirmed as the favorable haplotype (Fig. 2D, E, F).
Table 3
Common candidate genes associated with 1000-seed weight identified by GWAS and transcriptome sequencing
Gene ID | Log2(fold change) | P value | Number of SNPs in genes | Arabidopsis homologue | Function description |
BnaA02g06330D | 4.18 | 1.28E-07 | 0 | AT5G58470.2 | TBP-associated factor 15B (TAF15b); FUNCTIONS IN: binding, nucleotide binding, zinc ion binding, nucleic acid binding |
BnaA02g06870D | 3.02 | 1.14E-07 | 12 | AT5G59740.1 | UDP-N-acetylglucosamine (UAA) transporter family; FUNCTIONS IN: galactose transmembrane transporter activity |
BnaC06g28920D | 3.04 | 1.11E-06 | 20 | AT1G17770.1 | SU(VAR)3–9 homolog 7 (SUVH7); FUNCTIONS IN: zinc ion binding, histone-lysine N-methyltransferase activity |
The confidence region significantly associated with SW in both Trial 1, Trial 2 and the mean value of two trials, on chromosome C06 ranged from 29.47 to 31.25 Mb (Fig. 1A, B, C; Table S2). The lead SNP ‘chrC06__30203895’ on chromosome C06 was detected for SW in Trial 1, and explained 11.30% of the PVE (Table S2). In previous study, the LD decay of C06 chromosome in this association panel was 229 kb (Liu et al. 2021). Based on the LD decay, 229 kb upstream and downstream regions of the significant SNP (chrC06__30203895) were selected and found to contain 81 genes (Table S6). One significantly differentially expressed gene (BnaC06g28920D) was detected by combining GWAS and the transcriptome data by Geng et al. (2018) (Table 3). The expression levels of BnaC06g28920D in the seeds of large-seed line were 3.04 fold higher than that of small-seed line (Table 3). Twenty SNPs located within the 2 kb promoter region and the entire coding region of BnaC06g28920D (Table 4). Candidate gene association analysis showed that, five SNPs in BnaC06g28920D were detected to be significantly associated with the SW in Trial 1, six in Trial 2 and seven with the mean value of two trials (Fig. 4A, B, C). Notably, chrC06__30094959 (T/C) were located in the exon region of the gene BnaC06g28920D and resulted in amino acid changes from isoleucine to threonine; chrC06__30095553 (T/C) were located in the exon region of the gene BnaC06g28920D and resulted in amino acid changes from leucine to proline; chrC06__30095570 (G/T) were located in the exon region of the gene BnaC06g28920D and resulted in amino acid changes from valine to leucine; and chrC06__30096038 (G/C) were located in the exon region of the gene BnaC06g28920D and resulted in amino acid changes from valine to leucine (Table 4). We observed that the T allele of chrC06__30094959, T allele of chrC06__30095553, G allele of chrC06__30095570, and G allele of chrC06__30096038 were large-seed alleles (Fig. 5). These four significant SNPs revealed four haplotypes, and BnaC06g28920Hap1 (TTGG) had significantly greater SW value than other haplotypes in Trial 1, Trial 2 and the mean value of two trials. BnaC06g28920Hap1 was a favorable haplotype (Fig. 4D, E, F). In addition, a total of 44 B. napus cultivars contained these eight favorable alleles, and they had higher SW than B. napus cultivars with other alleles (Fig. 6).
Table 4
List of synonymous and non-synonymous SNP variants identified in the candidate gene of BnaC06g28920D
SNP | Major allele | Minor allele | SNP location | SNP types | Amino acid changes |
chrC06__30094868 | C | T | Exon | Non-synonymou | Proline to serine |
chrC06__30094959 | T | C | Exon | Non-synonymou | Isoleucine to threonine |
chrC06__30095014 | A | G | Exon | Synonymous | - |
chrC06__30095467 | T | C | Exon | Synonymous | - |
chrC06__30095553 | T | C | Exon | Non-synonymous | Leucine to proline |
chrC06__30095570 | G | T | Exon | Non-synonymous | Valine to leucine |
chrC06__30095585 | T | C | Exon | Synonymous | - |
chrC06__30095596 | T | C | Exon | Synonymous | - |
chrC06__30095632 | G | A | Exon | Synonymous | - |
chrC06__30095668 | C | T | Exon | Synonymous | - |
chrC06__30095711 | A | G | Exon | Synonymous | - |
chrC06__30095762 | A | C | Exon | Non-synonymous | Lysine to glutamine |
chrC06__30095770 | C | T | Exon | Synonymous | - |
chrC06__30095926 | A | G | Exon | Synonymous | - |
chrC06__30095928 | A | T | Exon | Non-synonymous | Tyrosine to Phenylalanine |
chrC06__30095944 | A | G | Exon | Synonymous | - |
chrC06__30096004 | G | A | Exon | Synonymous | - |
chrC06__30096013 | T | C | Exon | Synonymous | - |
chrC06__30096037 | G | A | Exon | Synonymous | - |
chrC06__30096038 | G | C | Exon | Non-synonymous | Valine to leucine |
Note: SNP, Single-nucleotide polymorphism. |