Genomic selection (GS) has garnered increased attention for its potential to deliver significant genetic gains per unit of time and cost. In traditional genetic evaluation, phenotypic performance is regressed on the identity-by-descent (IBD) probabilities inferred from pedigrees, producing the best linear unbiased predictions (BLUP) of additive genetic (breeding) values. However, when genetic covariance is estimated from DNA markers (e.g., SNP), genomic-based predictions (GBLUP) of true breeding values may provide additional benefits over that of the traditional BLUP, mainly when used in the context of GS.
In forest trees, recurrent GS is based on establishing training populations (i.e., marker-trait associations). These, in turn, eliminates, to a certain extent (loss of accuracy over time), the needed phenotypic evaluation phase of selection candidates. Using computer simulation, the theoretical potential of GS in tree breeding was demonstrated with the conclusion that it could radically increase breeding efficiency1-2. They assessed GS scenarios under variable narrow-sense heritability (h2), number of QTLs, marker density, and effective population size (Ne). Recent studies utilizing empirical data reported additional prediction accuracies3-4.
Unlike the studies mentioned above, the genetic response to selection was recently simulated in a seed production population, providing a more realistic basis for comparing actual genetic gains available in forest reproductive material5. In particular, they expressed the efficiency of GBLUP/BLUP protocols based on the respective ranking of selection candidates and genetic gains provided in forest reproductive material. Furthermore, a combination of low h2, high Ne, and dense marker coverage resulted in the maximum genomic prediction efficiency and added within-family selection accuracy (exploitation of the Mendelian sampling term). Adoption of GS in operational breeding programs is challenging in predominantly outbred forest tree species. These are characterized by: 1) long generation intervals, 2) sensitivity to inbreeding, i.e., breeding relying on high Ne, 3) fast gametic-phase disequilibrium decay, 4) considerable temporal and spatial environmental sensitivity, and 5) large genome sizes (especially in conifers), requiring dense SNP genotyping involving many individuals6. Regarding forest tree species' spatial and temporal sensitivity, excessive field experiments are required to assess genotype-by-environment interactions and age-age correlations on top of complicated covariance structure among multiple traits and its shift with selection. These issues are specific to populations, traits, ages, and environmental conditions and require testing hundreds of families in several environments over an excessive time scale7.
Genetically improved seed production predominantly relies on seed orchards, i.e., bulk seed from open-pollinated crosses, capturing additive genetic variance. Additionally, one can exploit non-additive genetic effects through the mass deployment of full-sib families (dominance) or clonal mixtures (dominance + epistasis). Apart from deploying improved forest reproductive material, one can clonally replicate selection candidates (offspring genotypes) in progeny trials, enhancing the precision of forward selection8,9. In Sweden, clonal replication in progeny testing has provided operational benefits in the Norway spruce breeding program by boosting the within-family response to selection while minimizing genetic diversity loss9,10. Under fixed progeny test size, a trade-off exists between the family size and the number of clonal propagules per genotype (NR), i.e., clonal size11.
Here, building on our earlier stochastic simulations5, we evaluated the impact of clonal replication in progeny testing on the efficiency of BLUP and GBLUP evaluation and the actual genetic response realized in seed orchards. Specifically, we assessed the combined effect of marker density, effective population size (Ne), family size, and NR.