Stalk strength is a highly important agronomic trait in maize because of its relationship with stalk lodging and grain yield. However, RPR, as a crucial measurement index, can efficiently and precisely evaluate stalk strength to improve the lodging-resistance of breeding lines. Hence, the genetic dissection of RPR can provide powerful assistance for the selection of candidate lines with high stalk strength based on functional molecular marker detected by association and linkage mapping [1, 16, 20, 28, 29]. Furthermore, the utilization of genomic selection can also accelerate the breeding process of complex traits without phenotyping in later breeding phases [35–38]. Taking full advantage of genomic information led to better genomic prediction of RPR in this study.
The relatively higher stalk strength in 2012B compared to that in other environments is likely attributed to the lower planting density in that environment, indicating that a high planting density may reduce RPR, which is consistent with a previous study [6]. According to the ANOVA results, RPR has relatively high broad-sense heritability, which is supported by several previous studies [1, 19, 20, 26, 27], illustrating that genetic effects can account for the most proportion of phenotypic variance in RPR, and that better selection of RPR can be achieved in early generations if target lines are used as parents to construct breeding populations to screen out varieties with high stalk strength. However, RPR, a complex quantitative trait, is controlled by multiple genes with minor effects, which has been discussed in previous studies [1, 19, 28, 29]. The breeding scheme of population recurrent selection may be more efficient for the pyramid of favorable alleles related to RPR [8, 15, 25]. In brief, combining early generation selection and population improvement can enhance the breeding efficiency of selecting breeding lines with high stalk strength. But a lower broad-sense heritability was estimated in stage V10 in each RIL population, which may be attributed to the fact that stage V10 is a vegetative growth stage and nutrient and dry weights greatly increase in this stage [39]. Individual plants have weak stalk strength due to the rapid growth of the internodes, which can be affected by nutrient deficiencies, heat, and drought [40]. As illustrated by the ANOVA results in the present study, nongenetic effects account for a higher proportion of phenotypic variance of RPR in stage V10. According to the results of the phenotypic clustering and correlation analyses, stage V10 was individually separated from other stages and was associated with other stages with lower correlation coefficients, which further indicated that RPR in various stages may be controlled by different genetic factors and that the last six stages might have a similar genetic basis. Nevertheless, DTS was not classified into common subgroup with other stages after silking, which was likely attributed to the fact that the latter stages belonged to the phase of kernel development undergoing grain filling to maturity. On the other hand, the difference in RPR values between stages except for stage V10 was small, as shown by the distribution boxplot of each RIL population. Moreover, the broad-sense heritability in these stages was relatively higher than it was in stage V10. Hence, RPR measured in the silking phase or stage after silking can be used to evaluate stalk strength, as shown by several previous studies that had been provided evidence directly and were performed in silking phase or a few weeks after flowering [4, 20, 23, 26, 28, 29, 41]. Finally, inbred lines with high stalk strength in this study can be selected as novel germplasms to make candidate crosses in the future.
Genetic maps of each RIL population were constructed by the R package based on the Kosambi mapping function. Classical and cloned genes, including P1 [31, 32] and R1 [33, 34], were detected in each RIL population, indicating that these constructed linkage maps had high quality and accuracy to allow subsequent analysis of QTL mapping. The broad-sense heritability of RPR varied from stage to stage and was positively correlated with the number of QTL detected in each stage. It is implied that more QTL can be identified to better dissect the genetic basis of complex traits if a high broad-sense heritability is estimated for the target traits. On the other hand, more overlapped and common QTL for RPR can be obtained between different stages when the genetic correlation coefficient of both stages is increasingly large. In general, the higher the genetic correlation between traits, the more common the QTL, which may be illustrated by the fact that these traits were controlled by alike or linked genes or had common metabolic pathways [42]. The position and number of QTL detected in each experimental population were generally different across stages and environments. It is implied that discrepant genetic mechanisms may exist for RPR, which has been investigated in various situations, and it is further indicated that gene expression may be characterized by spatiotemporal specificity and is activated at specific times during plant development. Besides, the phenotypic variance explained by each detected QTL was lower than 15%, which was consistent with the results of other studies [1, 19], indicating that RPR is controlled by multiple alleles with minor effects and that there is a lack of major QTL for this trait. However, there were 18 pleiotropic QTL with overlapped genomic regions that were identified in multiple stages. In particular, pQTL6-1 in the HO population was repeatedly detected 16 times in different stages across environments. In addition, the pleiotropic QTL, namely, pQTL8, was identified 9 times across various phases and environments, including 8 times in the LR population and one time in the HO population. This phenomenon illustrates that certain alleles related to RPR are steadily expressed across stages during the development of maize and contribute to the formation of stalk strength throughout the entire growth period. From another perspective, several QTL detected in this study, including qAhb1-2, qAhg2, qAhe2, qBhe2-1, and qBhc3, were identified and consistent with previous studies in which discrepant populations and genotypic data were used to perform association or linkage mapping to explore the genetic architecture of RPR [20, 27–29], which provides further support for the topic mentioned above that some QTL associated with RPR are steadily expressed in diverse experimental populations. These loci in the genome may be regarded as candidate genomic regions and can likely be used to perform fine mapping and identify functional genes to dissect the genetic mechanism of RPR. Additionally, the relatively obvious difference in QTL mapping for RPR among the experimental populations was determined according to the results of this study and other previous studies [1, 19, 21, 26]. A reasonable explanation of this difference is as follows: first, RPR is regarded as a complex quantitative trait with an intricate genetic mechanism. There may be epistatic effects in which a QTL can interact with one QTL in this experimental population and with another locus in other genetic background, so that the QTL will produce different genetic effects in different populations; the second explanation is that the QTL related to target traits can be legitimately detected following segregation and recombination within this region. In other words, the associated QTL cannot be identified in a situation in which both parents of an experimental population have identical alleles at a QTL; the third explanation is that many QTL with minor genetic effects will not be detected repeatedly because they likely lack sufficient statistical power for QTL mapping [1, 43]. Hence, further research is needed to break RPR into a few direct components or sub-factors that can be used to more effectively dissect the genetic basis and explore candidate genes for stalk strength with the purpose of providing advice for marker-assisted breeding.
As an efficient approach for exploring the genetic architecture of target traits, linkage mapping has been widely applied to identify QTL and explore functional genes in molecular genetics research. The identified QTL can be used to develop molecular markers to assist the practical breeding and accelerate the selection process. Several primary candidate genes were found in the MaizeGDB database that corresponded to RPR in this study. One candidate gene within pQTL6-2 with the gene model ID GRMZM2G031200 is located on chromosome 6 with a physical position of 164.69 Mb. The homologs of this gene in Arabidopsis encode regulated transcription factors, namely, secondary wall-associated NAC domain protein1 (SND1), which is required for the normal biosynthesis of the secondary wall and is a critical transcriptional switch to activate this developmental program. The SND1 combines with other transcription factors to constitute a transcriptional network that regulates downstream targets that affect the biosynthesis of the secondary wall in fibers [44]. Moreover, two candidate genes were detected within the genomic region of pQTL6-1 consisting of seven RPR-related QTL with the model IDs GRMZM2G027723 and GRMZM2G135108 that are relevant to the formation of cell wall components. The first gene is ZmCesA-2, which is required to produce cellulose and is involved in primary wall biosynthesis [45, 46]. The another gene, namely ZmPox3, is a critical gene in the process of lignin biosynthesis and is involved in monolignol polymerization and exerts a positive effect on cell wall digestibility [47]. In addition, a candidate gene located in pQTL4-2, ZmFBL41, has the biological function of resistance to banded leaf and sheath blight and indirectly influences the accumulation of lignin. This gene encodes an F-box protein (ZmFBL41) that interacts with the protein ZmCAD, and its knockout has a negative effect on ZmCAD degradation and thus promotes lignin biosynthesis and restricts lesion expansion [48]. These descriptions indicate that candidate genes corresponding to cell wall components may regulate and determine the formation of RPR. On the other hand, several studies have reported the results of QTL mapping for cell wall components [26, 49–51], and some of these QTL have overlapped confidence intervals with the QTL identified in this study. Regarding pQTL6-2 detected in the HO population, its genomic region is consistent with the physical position of QTL associated with lignin, acid detergent fiber (ADF), neutral detergent fiber (NDF), acid detergent lignin/NDF, and in vitro dry matter digestibility (IVDMD) identified in previous studies [49, 50, 52–54]. Based on the results of related studies, pQTL4-2 has a physical region that overlaps with other loci that are associated with IVDMD and lignin [48, 50], and the interval of pQTL8 is consistent with the QTL related to IVDMD, which have negative relationships with lignin content [26, 51]. Hence, this evidence implies that certain QTL have pleiotropic effects and can control both RPR and the content of cell wall components, which likely indicates that RPR is closely associated with cell wall components, such as cellulose, hemicelluloses, and lignin, consistent with the results of previous studies [20, 26, 28]. In addition, the results of the GO and KEGG analyses provide further support for the abovementioned scenario because the enrichment items and metabolic pathways associated with cellar components and the formation of the cell wall were identified in this study. Consequently, candidate genes relevant to RPR are likely involved in the regulation and control of cell wall components, which may exert an important effect to improve RPR.
Genomic selection has been recognized as an efficient approach to select for complex traits in comparison with conventional marker-assisted selection [36, 37, 55, 56]. In the present study, the prediction accuracy estimated in each stage and population was obviously different when the UV model was used to perform cross-validation, which was likely attributed to the different estimates of broad-sense heritability in various situations. This phenomenon is in accordance with previous studies illustrating that broad-sense heritability is an important factor that impacts the evaluation of prediction accuracy [57–60]. The information on functional loci identified by linkage mapping can be used as fixed effects in the GS model to improve the predictive ability of models, which was performed in this study and in previous research [61–64]. However, the prediction accuracy was increased when the fixed effects model was implemented using the QTL that explained the proportion of phenotypic variance lower than 10%, which was consistent with a previous study [65]. This result illustrates that QTL related to target traits have the potential ability to improve prediction accuracy and should be assigned important roles in the models. A remarkable improvement of prediction accuracy was achieved in this study when the multivariate model was applied to perform the GS. Several previous studies have shown that using correlated traits as auxiliary variates in the GS model can efficiently enhance the prediction accuracy and is obviously superior to the univariate model [66–70]. The increase of prediction accuracy estimated by the FIXED and multivariate models is mainly attributed to the higher proportion of genetic variance captured by these models than in the univariate model, as shown in the present study and previous researches [66, 71]. Another explanation may be that multivariate models likely capture both additive and nonadditive interaction effects by using auxiliary covariates in the models [72]. In brief, information on genetic dissection or additional auxiliary variates can be integrated into improved models to enhance the selection efficiency of complex agronomic traits, such as yield, RPR and other resistance-relevant traits. Hence, several advices for GS-assisted breeding programs can be concluded to improve selection efficiency and further enhance the genetic gain per breeding cycle. Regarding the target traits with complex genetic architecture, the first point is that the information of functional markers developed based on cloned genes or validated QTL can be applied to modified models to improve the precision of the estimated marker effects; the second point is that the information from genetic correlated traits can be used to achieve a higher prediction accuracy for the target trait, namely, by using other traits as auxiliary variates in statistical models; and the last point is that historical data accumulated by breeding experiments can be used to capture the interaction effects between the environment and genotype for the purpose of increasing the predictive ability of GS models. These points may allow better selection of candidate lines with good performance in practical GS-assisted breeding schemes.