Plant materials and phenotyping
Phenotype records for six quality traits from hexaploid bread wheat breeding lines measured in laboratory assays along with NIR records of approximately 27,000 lines were used in this study. Wheat lines were evaluated for eight years from 2012–2019 grown in NSW, VIC, WA, and SA in more than 150–197 trials of end-product traits and 409–450 trials for NIR records. Lines were evaluated for crumb yellow/blueness (b*, colour), flour water absorption (Wab, %), hardness /particle size index of flour (PSI, %), flour yield (FlrYld, %), protein (Protein, %) and flour swelling volume (FSV, ml/g). A summary of the number of records and lines in each trait and their range and means is provided in Tables 1 and 2.
Laboratory assays were conducted on 2–4kg composite samples assembled by blending all replicated plots of each line per trial. The samples were conditioned at 16% moisture content for 24 hours prior to milling using a Buhler Laboratory Mill (MLU 202). End-use quality tests were performed using approved methods of the American Association of Cereal Chemists International (AACC International 2008). Particle size index (PSI, %), a measure of wheat hardness, was determined after grinding and sieving of grain samples (AACC Method 55-30.01). Flour yield (FlrYld, %), is the percentage by weight recovered of the total products as straight-grade white flour. Flour yellowness (b*) was analyzed with a Minolta Chroma Meter (C-100, Minolta Camera Co. Ltd, Osaka, Japan) (Oliver et al. 1992). Flour swelling volume (FSV, ml/g) test was performed using AACC Method 56-21.01 and water absorption (Wab, %) was measured using a Farinograph (Brabender) following AACC Method 54-21.02. Near-infrared spectroscopy data was also acquired for each of the end-use quality traits. NIR predictions were generated by loading 100 grams of sample into the XDS Rapid Content Analyzer (FOSS). AACC Methods 39-25.01 and 39-70.02 were used to determine NIR predicted protein content and PSI of whole grains using a local calibration.
Genotyping
Genomic DNA was extracted from 6 seeds per sample using a modified CTAB method (Tibbits et al. 2006), where a magnetic bead clean-up step replaced isopropanol precipitation. In brief, the modifications consisted of mixing 120 µL of upper aqueous phase with 120 µL of 10X diluted AMPure XP beads (Beckman Coulter, USA) (diluted with a solution containing 20% PEG and 2.5M NaCl), followed by one wash with 200 µL of 50% DNAzol® ES (Molecular Research Centre, USA) and 42.5% ethanol, and two washes with 70% ethanol. DNA was eluted in 15 µL of 10 mM Tris-HCl, pH 8.0.
The reference panel were genotyped with the Illumina Infinium Wheat Barley 40K XT SNP array v1.0 (Keeble-Gagnère et al. 2021) according to manufacturer’s instructions (Illumina Ltd., CA, United States), with modifications detailed in Keeble-Gagnère et al. 2021. Genotype calling was performed using a custom pipeline (Maccaferri et al. 2019; Keeble-Gagnère et al. 2021).
Genotypes were imputed to exome density following the procedure described in Keeble-Gagnère et al. 2021. Briefly, sporadic missing data was filled in with Beagle v4.1(Browning and Browning 2016), before converting SNP coordinates to positions in the IWGSC RefSeq v2.0 (O’Leary et al. 2016) assembly and imputing to 435,404 SNPs with Minimac 3 (Das et al. 2016) using the reference haplotypes described in Keeble-Gagnère et al. 2021, together with 102 exome-sequenced historical lines from InterGrain’s breeding program. This target set of 435,404 SNPs was defined as the set of SNPs with \({r}^{2}>0.7\) to the set of genotyped SNPs, based on the LD in the InterGrain historical lines. This set of SNPs was further reduced to 330,169 after selecting the SNP in common with the transcriptome genotyping-by-sequencing (tGBS) genotypes after imputation.
Broad-sense heritability
Phenotypic records were edited for possible outliers (mean ± 4SD). Phenotypes were also corrected for fixed effects using a linear mixed model (Eq. 1) in ASReml (Butler et al. 2017), where y was a vector of quality phenotypes, \(\mu\) was the trait mean, Trial was fixed effect for environment, a group effect of year, location and nursery.
(1)
Line was fitted as fixed to estimate BLUEs as corrected phenotypes in GEBV estimation and random to estimate the variance due to lines in order to determine the broad sense heritability using:
where \({\sigma }_{g}^{2}\) and \({\sigma }_{e}^{2}\) are line and residual variances, respectively. T and R are the mean number of trials and replications per line.
Genomic predictions
Genomic Restricted Maximum Likelihood (GREML) was used for estimating GEBVs and variance components in the MTG2 software (Lee and Van Der Werf 2016).
Single trait genomic prediction
The single trait GREML model used in this study was a linear mixed model described as y = 1nµ + Zu + e, where y is the vector of adjusted BLUEs for the trait, µ is the overall mean, 1n is a vector of ones, Z is a design matrix relating records to breeding values, u is a vector of GEBVs, and e is a vector of residual effects. It was assumed that u ~ N(0, Gσ2g), where σ2g is additive genetic variance and G is the genomic relationship matrix calculated from the 330,169 SNP markers (Yang et al. 2010), and e ~ N (0, Iσ2e), where σ2e is the residual variance and I is the n × n identity matrix. We performed a Principal Component Analysis (PCA) of G using the prcomp function in R (R Core Team 2020) to investigate the population structure in the sample. A plot of first two components was visualised using the ggplot2 package (Wickham 2016) in R. Narrow-sense heritability (h2) was calculated as the ratio of the additive variance to the total phenotypic variance using the GREML model.
Single trait genomic prediction accuracy was evaluated with 5-fold random cross validation. In each cross validation run, one-fold was used as validation set with their phenotype data masked in the analysis (Fig. 1). The accuracy of genomic prediction was calculated as the Pearson correlation coefficient between corrected phenotypic values and GEBVs in the validation subset. This process was repeated 10 times and average prediction accuracy across all folds (50) was reported.
Multi-trait genomic prediction
A basic multi-trait mixed model was used as follows:
$$\left[\begin{array}{c}{\mathbf{y}}_{\text{E}\text{P}}\\ {\mathbf{y}}_{\text{N}\text{I}\text{R}}\end{array}\right]=\left[\begin{array}{cc}{1}_{\text{E}\text{P}}& 0\\ 0& {1}_{\text{N}\text{I}\text{R}}\end{array}\right]\left[\begin{array}{c}{{\mu }}_{\text{E}\text{P}}\\ {{\mu }}_{\text{N}\text{I}\text{R}}\end{array}\right]+\left[\begin{array}{cc}{\mathbf{Z}}_{\text{E}\text{P}}& 0\\ 0& {\mathbf{Z}}_{\text{N}\text{I}\text{R}}\end{array}\right]\left[\begin{array}{c}{\mathbf{u}}_{\text{E}\text{P}}\\ {\mathbf{u}}_{\text{N}\text{I}\text{R}}\end{array}\right]+\left[\begin{array}{c}{\mathbf{e}}_{\text{E}\text{P}}\\ {\mathbf{e}}_{\text{N}\text{I}\text{R}}\end{array}\right]$$
2
where yEP and yNIR are the vectors of adjusted phenotypes, 1 is vector of ones, µ EP and µNIR are general means, ZEP and ZNIR are the design matrices of breeding values, \({\mathbf{u}}_{\text{E}\text{P}}\) and \({\mathbf{u}}_{\text{N}\text{I}\text{R}}\) are the vectors of genomic breeding values, and eEP and eNIR are the vectors of random residual effects, for traits “End-Product” and “NIR”, respectively. Residuals (\(\mathbf{e}=\left[{\mathbf{e}}_{\text{E}\text{P}},{\mathbf{e}}_{\text{N}\text{I}\text{R}}\right]\)), are assumed to follow a normal distribution, e | R0 ~ N(0, R0 ⊗ I) and \({{R}}_{0}=\left[\begin{array}{cc}{\sigma }_{{e}_{EP}}^{2}& {\sigma }_{{e}_{EP-NIR}}\\ {\sigma }_{{e}_{NIR-EP}}& {\sigma }_{{e}_{NIR}}^{2}\end{array}\right]\).
Wheat end-product quality traits are costly to assay directly and therefore cannot be rapidly determined for large number of lines in breeding programs. NIR data can be generated easily across many lines and used to predict end-use traits, making them an interesting test case for examining bivariate models. The benefit of using NIR to predict end-product quality traits was investigated using three different cross-validation scenarios.
-
Cross validation scenario S1: End-product quality phenotypes of validation lines were masked and predicted with a reference population with equal numbers of end-product and NIR phenotypes. This aimed to assess the effect of multi-trait analysis in improvement of prediction accuracy of end-product quality traits where both traits are available on the same lines (i.e., their assessment is costly and time consuming).
-
Cross validation scenario S2: Validation lines were masked from both end-product and NIR traits, however, the reference set included all additional lines with only NIR. This aimed to study the effect of extra NIR data on improving prediction accuracy.
-
Cross validation scenario S3: Assessment of adding NIR on validation lines to increase prediction accuracy, assuming that NIR are easy and cost effective to generate on all lines. In this scenario, end-product phenotypes were masked only in the validation set, while their corresponding NIR measurements were included in the reference set.
Forward genomic prediction
In breeding programs, the aim is to predict future years/environments using previous years’ data. These independent predictions were performed by training the model on previous years data and predicting future environments using GREML. 2012–2015 data was used to predict 2016, 2012–2016 to predict 2017, and 2012–2018 to predict 2019.