Plant materials
Zhongmai 895 (hereafter ZM895) with a current production area around 0.7 million ha annually is a leading cultivar in the Yellow and Huai River Valleys Winter Wheat Region of China. ZM895 and Zhongmai 871 (hereafter ZM871), developed by pedigree selection and bulked as fixed lines at F5, are sister lines that can be traced back to a single F2 plant of cross Zhoumai 16/Liken 4. The detailed information of ZM895 and ZM871 was described in a previous study (Yang et al, 2020). One residual heterozygous line (L2925) within the marker interval of QTgw.caas-5B was selected from BC1F6 generation of the ZM871/ZM895//ZM871 population (Fig. 1a, Fig. 1b). A heterozygous recombinant plant (HRL2925) from L2925 was self-pollinated, and 12 heterozygous recombinant plants (designated RL1 to RL12) and 119 homozygous plants with 57 having 5B+ alleles (ZM895 genotype) and 62 having 5B− (ZM871 genotype) were selected from the selfed progenies by markers flanking QTgw.caas-5B, in which two groups of homozygous plants were used for a preliminary evaluation of the phenotypic effects of QTgw.caas-5B on TGW, GL and GW (Fig. 1c, Fig. 1d). Twenty 5B+ near-isogenic lines (NILs) and 20 5B− generated from two kinds of homozygous plants, respectively, were used to measure TGW, GL and GW at different grain developmental stages and for RNA-sequencing. In addition, 52 to 77 NILs from each of RL1 to RL12 were identified using QTgw.caas-5B-flanking markers to narrow the region of candidate genes (Fig. 1d, Table 2). A diverse panel of 166 cultivars (Li. 2019a) was used to validate the effects of QTgw.caas-5B.
Field trials and trait measurement
The progeny from HRL2925 were sown in ten 3.0 m rows spaced 0.3 m apart with 30 seeds per row at Xinxiang (Henan province) during the 2017–2018 cropping season. Twenty homozygous NILs with 5B+ and 20 with 5B− selected from 119 selfed homozygous plants of HRL2925 were evaluated at Xinxiang and Anyang (Henan province), during the 2018–2019 cropping season. The lines were grown as plots in randomized complete blocks with three replications. Each plot comprised two 1.0 m rows spaced 0.3 m apart, with 30 seeds per row. The NILs from each of RL1 to RL12 were sown in plots of seven 3.0 m rows spaced 0.3 m apart with 30 seeds per row at Xinxiang (Henan province) during the 2018–2019 cropping season. The panel of 166 cultivars was sown in three 1.5 m rows spaced 0.2 m apart in 50 seeds each row with three replications at Suixi (Anhui province) during the 2012–2013, 2013–2014 and 2014–2015 cropping seasons, at Anyang (Henan province) during the 2012–2013 and 2013–2014 cropping seasons, and at Shijiazhuang (Hebei province) during the 2014–2015 cropping season.
Wanshen SC-G seed detector (Hangzhou Wanshen Detection Technology Co., Ltd) was used to record TGW, GL and GW. Thirty random spikes were harvested from each plot of all 20 homozygous individuals in the contrasting 5B+ and 5B − groups to measure TGW, GL and GW. The same parameters were measured on the 119 homozygous plants from HRL2925 were measured on grains from 6-10 spikes of each plant and 52 to 77 homozygous progenies from each of RL1 to RL12. For the diversity panel TGW was determined by weighting 500 grains, and GL and GW were measured on 20 random grains from each plot to obtain mean length and width values, respectively (Li et al. 2019b).
Grain sampling
Main stems at anthesis were marked with red tags at 09:00–11:00 am every day. Ten grains from outer florets of five spikelets in the middle regions of tagged spikes were sampled at 09:00–11:00 am at 4, 8, 12, 16, 20, 25 and 30 days post-anthesis (DPA). At each time point 200 grains were sampled (20 spikes × 10 grains) from each of the 40 homozygous NILs. GL and GW were measured using image analysis software (Image-Pro Plus 6.0, http://www.mediacy.com/) after scanning the 200 grain samples placed on a scanner panel with grain creases placed downwards. Following measurement, the grains were dried in an oven at 135ºC for 15 min and then at 65ºC until a constant weight. The TGW of the dried grain samples harvested at various DPA was determined Three biological replications were performed for each time point.
RNA and DNA extraction and RNA-sequencing
Two grains sampled from the outer florets of spikelets in the middle of tagged spikes of main stems at 4, 8, 12, 16, 20 and 25 DPA were snap frozen in liquid nitrogen and stored at -80 °C. For 20 homozygous NILs with 5B+ and 20 with 5B− selected from 119 selfed homozygous plants of HRL2925 at every time point, 40 grains were sampled (20 spikes × 2 grains) with three biological replications.
Total RNA was isolated using the TRIzol protocol (Invitrogen, Carlsbad, CA, USA). After quality testing, a single RNA library for each sample was constructed, and the library preparations were sequenced on an Illumina Hiseq platform with 250–300 bp paired-end reads at Novegene Bioinformatics Technology in Beijing (http://www.novogene.com/). Differential expression analysis of 5B+ and 5B− genotypes was performed with three biological replications at each stage using the DESeq2 R package (1.10.1), which provides the statistical routines for determining differential expression in digital gene expression data using a model based on a negative binomial distribution. The resulting P-value was adjusted using the Benjamini and Hochberg’s approach for controlling false discovery rate. Genes with an adjusted P <0.05 determined by DESeq2 were considered as differentially expressed. Genomic DNA was extracted from young leaves of experimental lines using the CTAB method.
Whole genome resequencing
Quanified DNA samples of ZM871 and ZM895 were randomly fragmented by Covaris and the fragments were collected by magnetic beads. Ligation products with end-repair and addition of 3’ adenine DNA fragments were cycled and amplified by linear isothermal Rolling-Circle Replication and DNA NanoBall technology. Sequencing of these DNA libraries was performed by the BGISEQ sequencing platform at Shenzhen Huada Gene Technology (Shenzhen, https://www.genomics.cn/). The remaining high-quality paired-end reads following filtering of a high proportion of adaptor and low-quality reads in the raw data were mapped to the Chinese Spring reference genome (IWGSC, https://urgi.versailles.inra.fr/blast_iwgsc/) using the Burrows-Wheeler Aligner Tool. SNPs and small Insertion/Deletions (InDels) were detected by GATK (https://www.broadinstitute.org/gatk/) with filtration parameters of "QD < 2.0 || FS > 60.0 || MQ <40.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0" for SNP calling and "QD < 2.0 || FS > 200.0 || SOR > 10.0 || MQRankSum < -12.5 || ReadPosRankSum < -8.0" for InDel calling.
Array-based SNP markers or SNPs from RNA-sequencing and whole genome resequencing upstream of the physical location of the Kasp_5B2 locus and between Kasp_5B2 and Kasp_5B6 were converted to Kompetitive Allele Specific PCR (KASP) markers for fine mapping of QTgw.caas-5B. Allele-specific and common reverse primers for each KASP marker were designed using PolyMarker (http://www.polymarker.info/).
Statistical Analysis
Phenotypic differences between the allelic pairs in progeny tests were determined by Student’s t-tests with SAS 9.2 software (SAS Institute Inc, Cary, NC, USA).