Genetic variation in plantation of seed orchard crops of Larix kaempferi
The number of alleles and levels of heterozygosity in the three groups of seed orchard (SO) and offspring (HC and HS) were estimated using seven microsatellite markers. The average number of alleles (NA) and effective alleles (NE) were 9.7 and 3.6, respectively (Table 1). The estimated genetic parameters of the mother tree group were analyzed by the National Forest Seed and Variety Center [8]. Genetic variation in the L. kaempferi was slightly lower than that in the Japanese natural population (0.717–0.762) based on the average expected heterozygosity (HE) [31]. However, the difference was considered to be due to the low number of polymorphic markers used, as the estimates were in the 0.518–0.860 range. The inference was supported by the even lower HE in the introduced 140 plus trees and the offspring in China with 17 microsatellite markers, at 0.525 and 0.557, respectively [32]. The plantation of the seed orchard crops, with the considerable genetic variation compared to that in the group of mother trees, was considered the appropriate base population.
N A and NE were similar between SO and HC, with averages of 10.0 and 3.7, respectively (Table 1). The differences in NA and NE between SO and offspring (HC or HS) were not significant, with HS having slightly lower estimates of 9.1 and 3.3, respectively. Furthermore, neither HO nor HE were significantly different between SO and offspring. HO in HC and HS were 0.683 and 0.669, respectively, showing little difference from the 0.677 in SO. HE in the two offspring groups were 0.681 and 0.656, with no significant difference from 0.672 in SO. In a comparison between the two offspring groups (HC and HS), the differences in genetic variation were not significant, although some of the estimates were slightly lower in HS. The latter had lower NA, NE, HO, HE values in except in the analysis with bcLK033 and LK4178. NA in bcLK033 was similar between the two groups, but NE was somewhat higher in HS. Both estimates of LK4178 were higher in HS than in HC, exhibiting distinctiveness from the other markers.
Table 1
Genetic variation across populations of Larix kaempferi based on microsatellite marker analysis
Group
|
Locus
|
NA
|
NE
|
HO
|
HE
|
FIS
|
Plus trees1)
|
Total
|
10.0
(1.7)
|
3.7
(0.7)
|
0.677
(0.059)
|
0.672
(0.058)
|
-0.010
(0.027)
|
Offspring
(Hongcheon)
|
Total
|
10.0
(1.9)
|
3.7
(0.7)
|
0.683
(0.055)
|
0.681
(0.050)
|
-0.006
(0.035)
|
bcLK033
|
12.0
|
4.0
|
0.754
|
0.754
|
-0.002
|
bcLK224
|
12.0
|
5.0
|
0.802
|
0.804
|
-0.001
|
bcLK235
|
17.0
|
3.2
|
0.659
|
0.694
|
0.048
|
bcLK241
|
4.0
|
2.1
|
0.572
|
0.526
|
-0.092
|
LK4146
|
6.0
|
2.2
|
0.591
|
0.537
|
-0.104
|
LK4170
|
14.0
|
7.0
|
0.908
|
0.860
|
-0.059
|
LK4178
|
5.0
|
2.4
|
0.492
|
0.592
|
0.166
|
Offspring
(Hwaseong)
|
Total
|
9.1
(1.4)
|
3.3
(0.5)
|
0.669
(0.053)
|
0.656
(0.051)
|
-0.026
(0.033)
|
bcLK033
|
12.0
|
4.6
|
0.723
|
0.785
|
0.075
|
bcLK224
|
12.0
|
4.8
|
0.785
|
0.795
|
0.010
|
bcLK235
|
11.0
|
3.1
|
0.645
|
0.683
|
0.052
|
bcLK241
|
4.0
|
1.8
|
0.452
|
0.439
|
-0.034
|
LK4146
|
6.0
|
2.1
|
0.577
|
0.529
|
-0.093
|
LK4170
|
13.0
|
4.0
|
0.881
|
0.752
|
-0.177
|
LK4178
|
6.0
|
2.6
|
0.621
|
0.612
|
-0.018
|
Overall
|
Total
|
9.7
(0.9)
|
3.6
(0.3)
|
0.676
(0.031)
|
0.671
(0.029)
|
-0.014
(0.018)
|
Pedigree reconstruction of individuals in the L. kaempferi plantation
The average polymorphic information content (PIC) was 0.638 and the non-exclusion probability of identity was as low as 4.1E-07, based on the seven microsatellite markers. Each individual was identifiable as none of them showed the coincident genotypes in the seven loci. The non-exclusion probability of the first parent was 0.055. Ninety trees corresponding to 27% of the overall trees were assigned at relaxed confidence level (Table 2). The assignment of the other 244 trees was accomplished but did not fall within the confidence level. The proportions of assignments within the relaxed confidence level were 23% and 25% in HC and HS, respectively (Table 2). The low confidence level in the present study was considered to be derived from the characteristics of the used markers in the form of similar proportions of assignments at confidence levels in the two groups. Both exclusion probability and likelihood are considered in the maternity analysis by CERVUS. The low confidence level of assignment is caused by the negative score of the logarithm-of-odds (LOD) in the case of frequent major alleles or discordant genotypes in relation to the mother candidates.
The maternity analysis assigned the offspring to 92 and 79 candidate mothers in HC and HS respectively (Table S1). The total number of the candidate mothers was 112. The number of mutually exclusive mother trees was 33 in HC and 20 in HS, displaying the diverse constitution of the mother trees. The results potentially indicated that the genetic constitution would be distinguished by the reproduction year considering the samples of the seed orchard crops were produced in different years. The number of progenies by each mother tree were 1–6 trees in HC. The mother trees with six progenies were CN05, JPR-4, and CN09, and the 42 mother trees had only the one offspring. The number of progenies by each mother tree were 1–5 trees in HS. The assignment revealed that the mother trees GB12 and GB27 each had five progenies, and 36 mother trees had one offspring in HS. The offspring by mother trees were relatively evenly dispersed in the plantation (Fig S1–S5). Similarly, majority of the mother trees had one offspring according to the results of pedigree reconstruction analyses for Abies nordmanniana and Larix occidentalis [15, 17].
Table 2
Maternity analysis of individual trees by population of Larix kaempferi
Level
|
Confidence (%)
|
Observed assignments (%)
|
|
|
Hongcheon
|
Hwaseong
|
Strict
|
95
|
10 ( 5%)
|
6 ( 4%)
|
Relaxed
|
80
|
53 ( 23%)
|
37 ( 25%)
|
Low
|
< 80
|
135 ( 71%)
|
109 ( 75%)
|
Total
|
|
188 (100%)
|
146 (100%)
|
Genetic testing in the plantation of L. kaempferi
The prediction of breeding values was performed for each mother tree and the individual tree in the plantation although the confidence level of the maternal assignment was rather low. The estimated variance of the genetic effect was infinitesimal, at 4.9E-05, and the estimate for the residual was 25.07. Consequently, differences among the individuals were imperceptible (Table S2). The maximum predicted breeding value was 0.00003 for hc1485, with a DBH 29.7 cm, and the minimum estimates was − 0.00002 for hc1442, with a DBH of 6.6 cm. The correlation coefficient of the observed and predicted value was 0.94, which could be explained with predictions considering not only observations but also maternal information.
To separate the genetic effects with increased accuracy, the environmental effects were investigated to be accounted in the model of genetic testing. The spatial autocorrelation of phenotype was significant using Moran’s I index (p < 7.4E-12). The results of seven types of spatial models, including exponential, gaussian, spherical, linear, rational, AR1, AR1⊗AR1, and the basic animal model were compared to assess the model that would most adequately explain the phenotypic spatial structure of the plantation. The distribution of the phenotype was rasterized for use in the spatial analysis with AR1⊗AR1 (Fig. 1). The rasterization was cautiously processed to prevent data loss due to adjacency between individuals, considering their irregular distribution.
The integration of the animal and AR1⊗AR1 model (animal + AR1⊗AR1) yielded the highest fitness followed by the basic animal model, based on the log likelihood and AICc (Table 3). The AICc of the animal + AR1⊗AR1 model was largely distinct from those of the other models, highlighting its superior fitness. The fitness of the model with spatial AR1 was very low, suggesting the need for the use of a two-dimensional first-order autoregression model. The fitness and residual of the animal model and animal + AR1⊗AR1 model were confirmed in the present study (Fig. 2, Fig. 3). The residual was reduced in the animal + AR1⊗AR1 model when compared with in the animal model. In the meanwhile, the fitness was relatively low for the individuals with both extreme observations (individuals with either large or small DBH). The results implied that the trees with poor growth were more inferior than predicted, and trees with strong growth were more superior. The reason for this phenomenon was potentially the exceeded spatial autocorrelation for those individuals. However, no distortion of the selection was expected due to over or underestimation of the superior (high DBH) and inferior (low DBH) individuals.
Table 3
Comparison of the fitness of spatial autoregression models of DBH distribution in Larix kaempferi stand
Spatial model
|
Intercept
|
Degree of freedom
|
Log likelihood
|
AICc1)
|
AR1⊗AR1
|
18.07
|
3
|
-553.6
|
1113.1
|
Animal model
|
17.751
|
2
|
-568.9
|
1141.7
|
Rational
|
17.8
|
5
|
-568.1
|
1146.5
|
Gaussian
|
17.76
|
5
|
-568.1
|
1146.6
|
Spherical
|
17.76
|
5
|
-568.3
|
1147.0
|
Exponential
|
17.8
|
5
|
-568.4
|
1147.2
|
Linear
|
17.76
|
5
|
-568.4
|
1147.2
|
AR1
|
18.18
|
4
|
-603.9
|
1216.0
|
1)Second-order Akaike information criterion |
The variance component of genetic factor was 9.086, and the heritability was estimated as 0.344 (Table 4). The accuracy of the genetic testing improved following the elimination of the environmental effects, with a heritability estimate of 0.224 for L. kaempferi DBH [1]. The predicted breeding value (PBV) of each individual also reflected the enhanced power of explanation of the animal + AR1⊗AR1 model (Table S2). The maximum and minimum PBV were 5.655 and − 5.937, respectively, showing the apparent difference between the animal model and the animal + AR1⊗AR1 model. The Spearman correlation coefficient (r) between the observation and the prediction using the animal + AR1⊗AR1 model was 0.87 (Fig. 4). The weakened correlation was considered to be due to the consideration of the environmental effects when evaluating the phenotypes. The r between the prediction by the animal model and the animal + AR1⊗AR1 model was 0.89, revealing the dissimilar estimates obtained by the two models. The breeding values predicted by the two models were compared to confirm the frequent changes in rank (Table S2) reflecting the practicality of spatial analysis approaches in genetic testing.
Table 4
Variance components based on animal + AR1⊗AR1 model analysis of DBH growth of Larix kaempferi
|
Estimated variances
|
Standard error
|
Genetic
|
9.086
|
7.634
|
Spatial
|
12.478
|
3.894
|
Residual
|
4.885
|
7.393
|
PBVs of mother trees were estimated, with the highest value obtained in KW02, followed by JB10, NO05, NO03, and GB25 (Table S3). The PBV of GB23 was the lowest, at − 3.90, followed by JB08, CN15, and KW56. In the case of KW02, which had the highest PBV, five trees were assigned to its family in HC (Table S1, Table S3). All the five individuals, including hc1327, hc1427, hc1485, hc1538, and hc1549, had positive PBVs of 1.507, 4.846, 3.621, 2.171, and 0.117, respectively. JB10, which had the second highest PBV, was the mother tree of hc1335, which showed the highest PBV among the offspring, at 5.655. The progenies of NO05 were hc1436 and hc1552, with PBVs of 3.971 and 2.604, which were the 6th and 24th highest PBVs among the 188 trees. NO03 had two progenies (hc1437 [PBV: 3.795], hc1463 [PBV: 2.442]), and GB25 had five progenies (hc1423 [PBV: 2.390], hc1468 [PBV: -1.163], hc1519 [PBV: 0.875], hc1520 [PBV: 3.900], and hc1559 [PBV: 2.390]). GB23, the mother tree with the lowest PBV, had two progenies, hc1509 and hc1516, with PBVs of -3.812 and − 5.937, respectively. The latter had the lowest breeding value among the 188 trees. The PBVs of mother trees were considered to reflect the estimates of their progenies (Fig. 5), however the accuracy of analysis was limited due to the insufficient number of mother trees and their progenies.
The individual trees and their mother trees were evaluated with PBVs based on the animal + AR1⊗AR1 model. Considering the spatial autoregression as the products of innumerous biotic and abiotic factors proved useful in the plantation genetic testing, in combination with the pedigree reconstruction analysis.
Regression analysis of growth and environmental factors of L. kaempferi
The environmental factors of individual trees in the plantation of L. kaempferi assessed were elevation, slope, solar irradiance, and soil characteristics (Table S4, Fig S6). The average elevation was 390 m, and mostly distributed in the 374–418 m range. The aspects were east, south, and southwest, with an average slope of 30.7°, and mainly in the 17°–36° range. The wetness index of the geographic environment for each individual was 3.51 on average, ranging from 1.41 to 16.85, indicating that the trees were located near the valley section with low humidity. The average percentages of sand, silt, and clay were 69.7%, 21.8%, and 8.5%, respectively, and were within the 67.8–70.9%, 21.1–22.6%, and 8.0–11.1% ranges. The average pH was 5.4 and the average organic matter, total N, and available P contents, and CEC were 8.5%, 0.36%, and 65.6 mg/kg, and 13.6 cmolc/kg respectively. Among the exchangeable cations, K+, Na+, and Ca2+ were within the optimal ranges for tree growth, while Mg2+ was relatively low.
Geographically weighted regression (GWR) analysis was conducted using total N, aspect, wetness, clay content, and Na+, which were selected as the key factors influencing growth based on the correlation and regression analysis results. The AICc was sequentially reduced as the inclusion of the five factors showing the adequacy of factors (Fig. 6). The fitness of the GWR model was higher than that of the global regression model, indicating the significance of local spatial differences within the plantation (Table 5).
Table 5
Comparison between global regression model and geographically weighted regression (GWR) model results
Variable
|
Global model
|
GWR model
|
Coefficient
|
Probability (>|t|)
|
mean of coefficient
|
Probability(>) [33]
|
F1
|
F3
|
(Intercept)
|
-40.88
|
0.006
|
-767.27
|
0.051
|
0.000
|
Total Nitrogen
|
163.34
|
0.000
|
2303.95
|
0.000
|
Aspect1)
|
-1.57
|
0.001
|
-1.23
|
0.000
|
Wetness
|
1.25
|
0.000
|
0.72
|
0.001
|
Clay
|
16.61
|
0.232
|
-28.10
|
0.000
|
Na+
|
-2699.57
|
0.218
|
3613.8
|
0.000
|
AIC2)
|
1109.1
|
1065.7
|
AICc3)
|
1109.8
|
1087.8
|
R2
|
0.20
|
0.38
|
adjusted R2
|
0.18
|
0.30
|
The ranges of GWR coefficients of the environmental factors were estimated (Table 6) and the changes were visualized by color to indicate the appropriate interval (Fig. 7). The changes in regression coefficients by grade were expressed through the comprehensive sectioning of the plantation. The GWR coefficients were distinguished among the north, south, and the valley section within the south, indicating the underlying disparity. The change of the effect, not the change of the factor per se, was thought to be the product of the interaction among the numerous environmental factors. The GWR analysis, conceived from the existing spatial autoregression, was useful in detecting the environmental gradients and facilitating a comprehensive understanding of the plantation characteristics. The plantation information could in turn facilitate systematic management and exploitation, and address some limitations of genetic testing with plantation.
Table 6
Range of the coefficient of independent variables of geographically weighted regression model
Variable
|
Minimum
|
1st Quantile
|
Median
|
3rd Quantile
|
Maximum
|
Intercept
|
-5536.0
|
-47.9
|
28.3
|
70.3
|
139.6
|
Total N
|
60.2
|
86.6
|
118.8
|
293.2
|
15214.1
|
Aspect
|
-2.3
|
-1.8
|
-1.3
|
-0.6
|
-0.2
|
Wetness
|
-0.1
|
0.3
|
0.7
|
1.1
|
1.5
|
Clay
|
-88.6
|
-59.0
|
-28.7
|
-0.3
|
43.5
|
Na+
|
-7476.3
|
-150.2
|
3881.0
|
7578.5
|
13644.9
|