Rice (Oryza sativa L.) is one of the most important food crops in the world and in Asia in particular. About 3.5 billion people depend on rice as their main food source. As the world's population increases, the demand for rice will be under pressure as an estimated 116 million additional tons of rice will be needed to meet demand by 2035 (Seck et al., 2012). In this context, genetic improvement for yield potential is considered to be one of the most effective strategies to meet this growing demand and also to address the growing impact of climate change on rice production (Saito et al., 2021). Rice breeders, therefore, must increase yield potential at a greater pace (Cobb et al., 2019). However, the use of conventional breeding methods is time-consuming and can take up to ten years to develop and evaluate new elite varieties (Collard & Mackill, 2008). To some extent, the advances of marker-assisted selection (MAS) enable faster development of new varieties but are limited to the introgression favorable alleles of major genes or quantitative trait loci (QTLs) with large effects mainly related to abiotic (e.g. submergence, salinity) or biotic (e.g. blast, bacterial leaf blight) stress tolerance into elite backgrounds (Gregorio et al., 2013; Jena & Mackill, 2008). MAS is not tailored to enhance the effectiveness of breeding strategies for quantitative traits like grain yield which are governed by a large number of genes or QTLs with small effects (Jena & Mackill, 2008).
With the reduction in genotyping costs, genomic selection (GS) has arisen as a more efficient option for breeding program optimization (Ahmadi et al., 2020; Heffner et al., 2009). GS can accelerate the rate of genetic gain without significantly increasing the size of the breeding program by reducing the length of the breeding cycle (Cobb et al., 2019). GS uses genome-wide markers (mainly SNPs markers) to predict the genomic estimated breeding values (GEBV) of selection candidates based on statistical models trained on a reference population that is both genotyped and phenotyped (Ahmadi et al., 2020; Jannink et al., 2010; Meuwissen et al., 2001). Since 2010, many GS studies have been published on small grain crops such as wheat, barley, oats, or rice, indicating that GS has been successfully applied in cereals breeding programs to increase the rate of genetic gain (Crossa et al., 2017). More recently, genomic prediction models integrating multi-environment data have emerged in the plant breeding community in order to increase accuracy by modeling the genotype-by-environment interactions (G×E) rather than ignoring them (Burgueño et al., 2012; Heslot et al., 2013; Jarquín et al., 2014; Lopez-Cruz et al., 2015). The G×E interactions in plant breeding are usually evaluated through multi-environment trials and refer to changes in the ranking of genotypes between environments (Freeman, 1973). The G×E analysis also plays a key role in evaluating the stability of genotypes across environments (Cooper et al., 1993; Elias et al., 2016). Crossa et al. (2022) have recently reviewed the evolution of genomic prediction models that consider G×E interactions. Burgueño et al. (2012) and Schulz-Streeck et al. (2013) proposed the first multi-environment prediction models. These models were subsequently enhanced by using different statistical regressions and kernel methods (Crossa et al., 2019; Cuevas et al., 2016, 2019; Lopez-Cruz et al., 2015, p.; Montesinos et al., 2016, 2018), or by using crop growth models (Cooper, 2015; Heslot et al., 2014; Messina et al., 2017; Rincent et al., 2017) and recently by using reaction-norm models integrating the information of environmental covariates, such as weather and soil information of the experimental trials, for prediction in the context of G×E (Costa-Neto et al., 2020, 2021; de los Campos et al., 2020; Jarquín et al., 2014; Ly et al., 2018; Millet et al., 2019; Morais Júnior et al., 2017). In this latter approach, G×E is accounted for by using the interaction between markers and environmental covariates (ECs) and has been shown to increase the accuracy of genomic prediction in plant breeding. For example, Jarquín et al. (2014), using wheat data, reported an increase in the accuracy of the reaction-norm model integrating ECs compared to models with main effects alone. The effectiveness of the use of ECs in GS is also discussed in the literature (Costa-Neto et al., 2020; Heslot et al., 2014; Millet et al., 2019; Monteverde et al., 2019; Morais Júnior et al., 2017). In rice, a large number of GS studies have been published since 2014, when the first empirically based study was published (see a review by Bartholomé et al., 2022). Through these studies, we gained a better understanding of the benefits and limitations of GS in the context of rice breeding. The impact of trait architecture, population structure, the training set size, and composition, as well as marker density, has been well covered. However, the impact of G×E has received somewhat less attention. Indeed, only a few studies using breeding material have used multi-environment models including G×E (Ben Hassen et al., 2018; Monteverde et al., 2018, 2019; Morais Júnior et al., 2017). The conclusions arising from these works based on a relatively small number of environments are that multi-environment models tend to give higher prediction accuracies.
This study aimed to assess the efficiency of multi-environment genomic prediction models in the context of an applied breeding program. We used an elite core panel that represents the elite diversity managed by the irrigated rice breeding program at the International Rice Research Institute (IRRI). This panel was phenotyped in 15 environments in Asia and Africa regions from 2018 to 2020. This information from multi-environment trials (phenotypic data and environmental covariates) was used to characterize the level of G×E interaction and to cluster the environments. We then compared seven genomic prediction models to evaluate the impact of modeling G×E and environmental covariates on predictive abilities when new environments were predicted.