Trial conditions and experimental design
The field trial was established in 2014 at the INRAE unit in Estrées-Mons (49° 72’ N, 3° 00’ E) in northern France, relying on the frame of the Biomass For the Future (BFF) project. It was repeated twice in 2014 and in 2015 using a staggered-start design [21]. The field soil type was characterized as deep loam (Ortic luvisol, FAO, classification). The trial received no nitrogen input and weeds were regularly manually removed. Meteorological data were recorded daily at a short distance from the trial (about 1 km) and downloaded from the CLIMATIC database (INRAE AGROCLIM, 2020).
Population and experimental design
The population consisted of the F1 progeny of 159 individuals generated from an intraspecific cross between two diploid M. sinensis varieties, “Silberspinne” and “Malepartus”. Plantlets were initially produced from the seeds and grown in a greenhouse in order to provide clonal replicates of each genotype using in vitro propagation method [24]. The population was established with single plants organized in a staggered-start design [23]. An initial group of genotypes (coded G1), that contained a total of 111 genotypes and included the two parents, was established in 2014 using an initial incomplete block design [25]. A second group of 130 genotypes (coded G2) included the two parents as well and was established in 2015 using a second incomplete block design. 82 genotypes were common to both groups and established twice in 2014 and 2015 to allow the partition of the year effect into age and climate effects. The staggered-start design comprised a total of 1155 plants. Each established group comprised 5 blocks of 90 individuals each (6 rows of 15 plants each). The two parents were grown in each block while the G1 remaining genotypes were randomly spread into four blocks on average and the G2 genotypes into at least three blocks. The two groups of genotypes, which were adjacent in the field, were separated using 75 plants (15 x 5) and surrounded by 180 border plants, to guarantee similar competition effects. Individuals were randomly planted using the Excel package « BICA » wh,ich was based on VBA programming language and developed for the randomization of incomplete blocks (JC Bastien, personal communication). A density of 1 plant/m² was applied at regular intervals of 1 m × 1 m between the plants.
Phenotyping of traits related to flowering
The phenotypic observation of the individuals within the population for flowering-related traits was conducted in 2018 and 2019. The following traits were recorded on each plant and targeted stems belonging to the higher part of the plant canopy (above 80%): the start of panicle emergence (SPE), which corresponds to the date on which the first panicle was visible at least 1 cm above the plant stem; anther appearance (AA), which corresponds to the date on which anthers appeared and began to shed pollen -this observation was carried out on the first panicle that emerged in the plant; the mid-point panicle emergence (MPE), which corresponds to the date on which 50% of the panicles had emerged in the plant, and finally, the complete panicle emergence (CPE), which corresponds to the date on which over 80% of the panicles of the entire plant had emerged. In addition, one variable was calculated: the interval between heading and flowering (IHF) that was defined as the difference between the heading and flowering dates observed on the first panicle that emerged and flowered. To reduce bias and increase accuracy in the data To guarantee data accuracy, phenotypic observations of the traits were made three times a week, starting with the first plant that showed panicle emergence. The first panicle to appear on the plant was marked and used for further determination of the date of anther appearance to facilitate determination and avoid sampling bias. In addition, other traits were observed in the population: the emergence date (ED), the senescence date (SD) that corresponded to the date on which the first stem cohort had become completely senescent, and the date of the first winter frost (FD).
To further calculate weather indicators, extensive observations were conducted on Malepartus, one of the two population parents, in order to determine the date of the floral transition. This clone was established in 2014 in a specific trial not far from the population trial, at a plant density of 2 plants/m² (see Leroy et al., submitted to BioEnergy research). The sampling was carried out on three plots of about 75 m² each. The floral transition corresponded to the differentiation of the terminal vegetative apex into a panicle and was observed by dissecting 30 apices per day on average from May 24th onwards. It was determined as the date of the first floral transition appearance, which was expected to correspond to the floral transition of the first stem that showed a panicle above the canopy.
Based on previous results obtained in maize, leaf appearance rate and duration responded to temperature in approximately the same way as for primordia and were constant across temperature regimes, at least for the appearance of the first 12 leaves [26]. According to Hesketh and Warrington [27], the use of a degree-day model must predict the appearance rate of new organs, such as leaf primordia, tassel, and ears, as a function of air temperature or degree days. These results relating to constant degree-day durations were applied here in miscanthus to predict floral transition (referring to leaf primordia results in maize) and flowering (referring to tassel and ear results in maize). We found that the sum of temperatures calculated from emergence to flowering for the Malepartus clone with a base temperature between 6 and 8.5°C were almost equal in 2018 and 2019 when the base temperature used was of 8°C. This meant that the base temperature was close to 8°C for Malepartus and could be applied for further calculations. Therefore, growing-degree days (GDD) were estimated using 8°C as a base temperature for Malepartus and its offspring. Using the floral transition date determined from the apices observations in Malepartus, the ratio between the GDD for the emergence-to-floral transition period and the GDD for the emergence-to-flowering period was calculated for Malepartus in 2018 and reached 0.17. This ratio was then applied to the population in order to estimate floral transition dates based on flowering date observations.
Determination of plant growth periods
Five periods of the plant cycle were taken into account to calculate the weather indicators (Table 1). The first period corresponded to the pre-emergence phase (coded Period 1) and the calculations started at senescence, which occurs in the autumn, and precedes shoot emergence. The second period (Period 2) started with shoot emergence and ended with floral transition. The third period (Period 3) corresponded to the interval between floral transition and anthesis. The fourth period (Period 4) was from anthesis to senescence. Lastly, the fifth period (Period 5) covered the duration from senescence to the date of the first winter frost, the plant cycle being stopped by the first frosts.
Table 1 Description of the periods used for the weather indicator calculations. The dividing point of the growth stages was determined by mean Malepartus growth stages.
Period
|
Description
|
Year
|
Beginning
|
Ending
|
Duration
|
Period 1
|
Pre-emergence from the senescence of the year (n-1) to emergence year (n)
|
2018
|
324
|
97
|
138
|
2019
|
289
|
81
|
157
|
Period 2
|
Emergence to beginning of floral transition
|
2018
|
98
|
134
|
36
|
2019
|
82
|
147
|
65
|
Period 3
|
Beginning of floral transition to beginning of anthesis
|
2018
|
135
|
204
|
69
|
2019
|
148
|
228
|
80
|
Period 4
|
Beginning of anthesis to senescence
|
2018
|
205
|
288
|
83
|
2019
|
229
|
327
|
98
|
Period 5
|
Senescence to the date of frost day of the year
|
2018
|
289
|
346
|
57
|
2019
|
328
|
337
|
9
|
Weather indicator description
In order to explain the climate effect, the following meteorological indicators were determined: air minimum temperature (MinT), air maximum temperature (MaxT), air mean temperature (MeanT), soil minimum temperature (MinTs), precipitation (Prec), cumulated precipitation (Prec_S), mean humidity (MeanH), vapour pressure deficit (VPD), maximum VPD (VPD_M), Penman evapotranspiration (ETPP), photosynthetically active radiation (PAR), cumulated growing degree-days (CGDD). The summary of the meteorological indicators per growing period over the two years is shown in table 2.
Table 2 The mean value of weather indicators in five growth periods in 2018 and 2019.
Weather indicator
|
Period 1
|
Period 2
|
Period 3
|
Period 4
|
Period 5
|
2018
|
2019
|
2018
|
2019
|
2018
|
2019
|
2018
|
2019
|
2018
|
2019
|
Prec_S/mm
|
312.5
|
327.5
|
26.0
|
54.5
|
136.5
|
112.5
|
126.5
|
208.5
|
116.5
|
21.5
|
Prec/mm
|
2.25
|
2.09
|
1.08
|
0.97
|
1.64
|
1.22
|
1.38
|
2.11
|
2.38
|
2.39
|
MeanH/mm
|
88.5
|
87.5
|
75.4
|
72.1
|
72.0
|
72.1
|
73.3
|
81.4
|
89.8
|
91.4
|
MinTs/℃
|
4.1
|
5.6
|
11.9
|
9.5
|
18.1
|
18.1
|
16.7
|
13.1
|
7.3
|
6.6
|
MinT/℃
|
2.3
|
3.5
|
8.2
|
5.2
|
12.2
|
12.2
|
11.6
|
9.0
|
4.8
|
4.2
|
MaxT/℃
|
7.8
|
9.6
|
18.3
|
15.3
|
23.8
|
24.0
|
23.1
|
17.4
|
10.4
|
9.1
|
MeanT/℃
|
4.8
|
6.3
|
13.1
|
10.1
|
17.9
|
18.0
|
16.9
|
12.9
|
7.4
|
6.5
|
VPD/%
|
0.1
|
0.13
|
0.41
|
0.37
|
0.61
|
0.63
|
0.57
|
0.33
|
0.11
|
0.08
|
VPD_M/%
|
0.27
|
0.37
|
1.16
|
1
|
1.66
|
1.74
|
1.64
|
0.96
|
0.31
|
0.23
|
ETPP/mm
|
0.63
|
0.57
|
2.88
|
2.81
|
4.27
|
4.42
|
2.84
|
1.78
|
0.38
|
0.23
|
PAR/d cm-2
|
230.4
|
217.5
|
747.9
|
720.2
|
989.0
|
932.6
|
678.6
|
438.5
|
163.2
|
120.6
|
CGDD/d ℃
|
-
|
-
|
209.2
|
203.1
|
963.5
|
957.6
|
590.6
|
453.1
|
-
|
-
|
Prec_M: mean precipitation; Prec_S: accumulated precipitation; MeanH: mean humidity; MinTs: soil minimum temperature; MinT: minimum temperature; MaxT: maximum temperature; MeanT: mean temperature; VPD: vapor pressure deficit; VPD_M: maximum VPD; ETPP: Penman evapotranspiration; PAR: photosynthetically active radiation; CGDD: cumulated growing degree-days.
Statistical analyses
Two different analyses for unbalanced datasets were performed, according to the “age” effect and the “climate” effect models (Fig. 1). 159 genotypes were included in each of these analyses, with 82 in common between the G1 and G2 genotype groups.
An initial round of analyses of Linear Mixed Models consisted of a full statistical model which took into account block and spatial effects using the “breedR” package implemented in the R software [28]. It accounted for the environment heterogeneity within each trial among the blocks and diagnosed spatially distributed patterns of remaining residual variations using variograms. All variance components of the progeny were estimated by using the remlf90 function of the package, based on restricted maximum likelihood estimations. The equation of Model 1 was as follows:
where Yijkl represents the phenotypic value measured on plant k of genotype i at age j in block l; µ is the overall mean; αi is the random effect of genotype i; βj is the fixed effect of age j; (αβ)ij is the random interaction between genotype i and age j; δl is the effect of block l and εijkl is the random residual for plant k of genotype i at age j in block l. An autoregressive spatial component was also considered in the model based on x and y coordinates in each plot in order to partition the residual εijkl into a spatially dependent parameter, θikl, for plant k of genotype i in block l, and an independent remaining residual [29].
This full model was compared with two sub-models for each trait: a sub-model with no decomposition of the residual term into spatially dependent and independent effects and another sub-model with no block effect but with the decomposition of the residual term into spatially dependent and independent effects. The performance of the three models was compared on their Akaike information criterion, for which a lower value corresponded to a better performance. For all traits, the last sub-model, which dropped the block effect but took into account the spatial effect, was found to have a slightly better performance than the full model or than the sub-model which dropped the spatial effect but took into account the block effect. Accordingly, it was retained afterwards, which simplified the models.
The “age” effect was assessed using the following Linear Mixed Model, fitted for each studied trait and considering each year in which the progeny grew. It was referred to as “age effect model”:
where Yijk represents the phenotypic value measured on the plant k of genotype i at age j, μ is the overall mean; αi is the random effect of the genotype i; βj is the fixed effect of age j; (αβ)ij is the random interaction between genotype i and age j; θik is the random spatial effect for plant k of genotype i; and εijk is the random residual error for plant k of genotype i at age j. This modeling was carried out by using two subsets of the data (rectangles colored in blue in Fig. 1). The year 2018 was considered for G1 and G2 and the age effect was taken into account by 4-year old plants with G1 and 3-year old with G2. Plants of different ages grew in the same climate during a same year (2018), which makes it possible to take into account the age effect in the same climate conditions. The age effect was also analyzed in 2019 in the same manner when the genotypes were one year older.
The performance of the full model including the interaction term was compared with the corresponding sub-model with no interaction by using the Akaike information criterion. Based on the AIC, the analysis showed that a model that included interaction effect was always more performant than the sub-model for all traits.
Then, the “climate” effect was assessed using the following Linear Mixed Model, for each age of the progeny. It was referred to as “climate effect model”:
where each term is similar to model (1), except that the age effect βj is replaced by γl, the climate effect of year l and the interaction between genotype i and age j by the interaction between genotype i and climate l . It can be noted that the spatial effect term was a different estimate in comparison to the estimate from model 1. In this modeling, only 4-year old plants were considered and corresponded to G1 genotypes which grew in 2018 and G2 genotypes in 2019 (orange rectangle in Fig. 1). Plants of the same age grew in two different climate conditions, related to each year considered. The performance of the full model including the interaction term was tested in the same manner as previously and the results showed a higher performance of the model that included interaction effect for all traits.
According to these two models, genotypic broad-sense heritability was estimated as follows:
where is the variance attributed to the genotype, is the variance of the genotype × age interaction, is the spatial variance and is the residual variance.
Here, each term is similar to the previous formula, except for which is the variance of the genotype × climate interaction.
According to the two previous models, progeny-mean heritability [30, 31] was also estimated for the age and climate effect models, respectively as:
where J are the two ages considered, L are the two climates (of each year) and K is the mean number of repetitions per genotype.
Due to the unbalanced feature of the design, as all genotypes were not common between G1 and G2 (i.e. not established twice in 2014 and 2015), most previous models were applied twice as a second round of analyses was restricted to the 82 common genotypes between G1 and G2. It allowed a more precise comparison of the climate and age interaction effects, thus avoiding biases due to non-common genotypes between the comparisons.
Genetic and phenotypic correlations were then assessed for each trait, considering a given age or a given year. Genetic correlations were estimated as described by Howe et al [32]. To calculate phenotypic correlations, Pearson correlation coefficients were computed using R package “stats” and visualized using R package “corrplot” [33].
A Principal Component Analysis (PCA) was performed on the weather indicators determined in Table 2 using the R “FactoMineR” package. This PCA was followed by a clustering that was applied to the coordinates of the individuals in the two first components of the PCA. These individuals corresponded to combinations between the periods and the two years. Within a period, the number of clusters was defined when the two years of a given period were separated. Then, the climatic features of the separated combinations were pointed out.