Segregation population development and phenotyping analysis
Phenotypic values of F2 population were collected for two kernel size traits i.e., kernel length (KL) and kernel width (KW) in 2016 while phenotypic values of F2:3 populations were collected for the two traits in 2018 and 2019. Phenotypic mean values of KL and KW in F2 and F2:3 were shown in Table 1. The difference of KL between SG5 and SG7 is extremely significant (P<0.01, Figure 3(a), Figure 3(b)). The phenotypic value of SG5 and SG7 were significantly different in both KL and KW traits. The two observed kernel size traits both displayed as bell shaped normal distribution (Supplemental Fig.1). Pearson’s correlation coefficients between KL and KW is 0.20, 0.25 and 0.24 in F2-2016, F2:3-2018 and F2:3-2019, respectively. The correlation coefficients between KL and KW were all significant in three environments.
QTL mapping
CIM procedure was applied for mapping QTLs conferring KL and KW. Ten QTLs were mapped for KL and KW in F2 and F2:3 populations. In F2 population, two KL QTLs were mapped and both located on maize chromosome 9, three KW QTLs were mapped and located on maize chromosomes 3 and 8. In F2:3-2018 population, four KL QTLs were mapped and located on maize chromosome 3,7, and 9, three KW QTLs were mapped and located on maize chromosomes 3 and 8. In F2:3-2019 population, three KL QTLs were mapped on maize chromosome 7, and 9, three KW QTLs were mapped and located on maize chromosomes 3 and 8. Manhattan plots of QTL mapping results were shown in Fig. 1. The physical intervals for all ten detected QTLs spanned from 2.30 to 14.35 Mb by comparison to reference genome (B73 RefGen_v3). The phenotypic variation explained by each QTL ranged from 8.4 to 23.0% in a trait, with means of 14.25, 14.03 and 13.67% for traits KL in F2-2016, F2:3-2018 and F2:3-2019 respectively while means of 14.63, 12.97 and 10.83% for traits KW in three environments. The LOD scores range from 4.0 (qKL-7) to 9.5(qKW-1). Information of QTL identified are summarized in Table 2. The results showed that, qKL-2 for KL and qKW-1 were both detected in all three environments (highlighted with green color circle in Fig. 2). Major QTLs qKW-2, qKL-7, qKW-3 and qKL-10 were detected repeatedly in two environments. (highlighted with blue color circle in Fig. 2). To sum up, QTLs qKL-2 and qKW-1 detected in this study are considered to be stable QTLs (Table 2). However, results above are primary mapping results, which need to be validated and fine mapped in further study.
NILs development and qKL-2 fine mapping
In 2017 to 2019, a total of 998 BC3F1 NILs of qKL-2 with SG7 background were developed by continuous backcross method. A secondary linkage map with six separated markers around qKL-2 was generated. The sequences of the associated markers were listed Supplemental Table 1. The total length of the map was 43.35cM and the genetic distances between each two adjacent markers were 16.75, 8.39, 0.80, 5.67 and 11.74 cM, respectively. The physical positions of the six markers were 115.23, 130.51, 133.34,135.29, 139.75 and 153.88Mb on chromosome 9 by blasting maize B73 RefGen_v4(Figure 3(c)). Major QTL qKL-2 was detected on the secondary linkage map of NILs based on CIM method in QTL Cartographer v2.5. The peak of LOD value indicating that qKL-2 was most likely located between SSR3 and SSR4 (Figure 3 (c)). The additive effect of qKL-2 was 0.97mm and the phenotypic variation was 16.0%. To confirm the narrowed qKL-2 interval, Five kind of recombinant types named Class 1 to Class 5 were selected from 998 BC3F1 individuals, and the progeny test of homozygous segregants indicated the interval to a 1.95-Mb region flanked by the markers SSR3 and SSR4 (table 3). There are significant difference between phenotypic values of the two set of recombinant types Class 2,3 and Class 1, 4, 5(Figure 3 (d)). The loci of SSR3 and SSR4 are homozygous in Class 2,3 while heterozygous in Class 1, 4, 5. The results that qKL-2 located in SSR3 to SSR4 interval also supported by selected overlapping recombinant chromosomes.
Candidate genes prediction for qKL-2
RNA-seq procedure was conducted on an Illumina NovaSeq instrument with 18 RNA samples from grains in different developmental stages of plants. The 1.95-Mb physical intervals of qKL-2, encompass 40 protein coding genes, (figure 3(e)). After DEGs analysis, a total of 11 protein coding genes left in the physical intervals (Table 4). Previous studies indicated that FERONIA receptor kinase controls seed size in Arabidopsis thaliana[41]. GRMZM2G006080 encodes receptor-like protein kinase FERONIA and was predicted as candidate gene of qKL-2 which most likely responsible for kernel length.