Traits of Interest
The vegetative parameters, carotene content and FACs, observed in the 2.6-1 and 2.6-5 mapping families are summarized in Table 1. All the traits showed wide segregation and followed normal distribution (P<0.05, Shapiro-Wilk test) in the 2.6-5 population. However, for population 2.6-1, rachis length (RL), height increment (HI), iodine value (IV) and oleic acid (C18:1) were not normally distributed. The means for RL, petiole cross section (PCS) and IV were slightly higher in 2.6-1, but HI and carotene content were higher in 2.6-5. However, both RL and HI were considerably lower than in the commercial DxP, whereby, they are generally >5 and >0.45 meters, respectively [27]. For FAC, population 2.6-1 had higher stearic (C18:0), oleic (C18:1) and linoleic (C18:2) acids, whereas palmitic (C16:0) was slightly higher (35.33%) in the 2.6-5 population.
The relationships between the individual fatty acids were evaluated using Pearson’s correlation analysis and consistent results were obtained for both mapping families (Table 2). The most abundant saturated fatty acid, C16:0, was negatively correlated with the unsaturated fatty acids (C18:1 and C18:2). A negative correlation was also observed between C18:0 and C16:0. As expected, IV, as an indicator for unsaturation, was positively correlated with C18:1 and C18:2, and negatively with C16:0 and C18:0. In addition, correlations were positive between C18:0 and C18:2 and negative between C18:1 and C18:2. The results corroborated with those from other studies [14,16,20]. Correlation trends for vegetative parameters were similar in both populations (Table 3). PCS was positively correlated with HI and RL, while RL and HI appeared to be negatively correlated, although not significant at P ≤ 0.05.
Table 1: Summary of vegetative traits and fatty acid composition (FAC) in the 2.6-1 and 2.6-5 BC2 mapping populations
Population
|
2.6-1
|
2.6-5
|
Category
|
Variable
|
Mean
|
SD
|
CV
|
N
|
Range
|
Mean
|
SD
|
CV
|
N
|
Range
|
Vegetative
|
RL (m)
|
5.28
|
0.67
|
12.78
|
72
|
2.45 – 6.80
|
5.17
|
0.61
|
11.75
|
69
|
3.53 – 6.56
|
HI (m)
|
0.34
|
0.07
|
19.82
|
72
|
0.21 – 0.60
|
0.38
|
0.06
|
16.18
|
69
|
0.27 – 0.53
|
PCS (cm2)
|
26.62
|
5.58
|
20.95
|
72
|
10.80 – 40.85
|
26.49
|
5.20
|
19.61
|
69
|
15.68 – 40.50
|
FAC
|
Carotene (ppm)
|
1031
|
343
|
33.27
|
54
|
221- 1965
|
1162
|
468
|
40.27
|
58
|
441 – 2738
|
IV
|
65.13
|
2.55
|
3.92
|
54
|
60.19 – 69.86
|
63.40
|
3.22
|
5.07
|
58
|
57.12 – 71.60
|
C16:0 (%)
|
31.06
|
2.47
|
7.94
|
54
|
24.73 – 36.68
|
35.33
|
2.85
|
8.06
|
58
|
26.85 – 41.69
|
C18:0 (%)
|
6.15
|
1.31
|
21.38
|
54
|
3.29 – 9.43
|
3.79
|
0.92
|
24.30
|
58
|
2.11 – 6.48
|
C18:1 (%)
|
48.61
|
2.87
|
5.91
|
54
|
40.92 – 57.18
|
47.02
|
3.67
|
7.80
|
58
|
37.58 – 54.48
|
C18:2 (%)
|
12.86
|
1.41
|
10.95
|
54
|
9.60 – 16.29
|
12.67
|
2.05
|
16.16
|
58
|
8.15 – 17.65
|
Table 2: Pearson’s correlations between individual fatty acids in the 2.6-1 and 2.6-5 mapping populations
Population
|
|
2.6-1
|
|
|
|
2.6-5
|
|
|
C18:0
|
C18:1
|
C18:2
|
IV
|
C18:0
|
C18:1
|
C18:2
|
IV
|
C16:0
|
-0.26
|
-0.74*
|
-0.03
|
-0.75*
|
-0.32*
|
-0.71*
|
-0.04
|
-0.71*
|
C18:0
|
|
-0.24
|
0.03
|
-0.22
|
|
-0.16
|
0.26*
|
-0.18
|
C18:1
|
|
|
-0.48*
|
0.51*
|
|
|
-0.62*
|
0.20
|
C18:2
|
|
|
|
0.50*
|
|
|
|
0.55*
|
|
|
|
|
|
|
|
|
|
|
|
*significant at p ≤ 0.0
Table 3: Pearson’s correlations between vegetative parameters in the 2.6-1 and 2.6-5 mapping populations
Population
|
2.6-1
|
2.6-5
|
RL
|
HI
|
RL
|
HI
|
HI
|
-0.21
|
|
-0.08
|
|
PCS
|
0.45*
|
0.10
|
0.28*
|
0.48*
|
*significant at p ≤ 0.05
BC2 Consensus Genetic Map
In this study a total of 4491 SNP markers were utilized to test for polymorphism in both populations, while 515 and 715 SSR markers were screened for informativeness in population 2.6-1 and 2.6-5, respectively. Only polymorphic markers which met the expected segregation ratios at 5% significance level (p<0.05) and had nearest-neighbor stress value of > 3cM were considered suitable for utilization in construction of genetic map [see Additional file 1]. The consensus genetic map for both families is shown in Figure 1 while Table 4 summarizes the marker compositions and lengths of the individual linkage groups (LGs). The 2.6-1 and 2.6-5 genetic maps had 1,744 and 1,254 markers covering 1,505 and 1,564 cM, respectively. Both maps were integrated into a consensus map [see Additional file 2] of 16 LGs with 1,963 markers (1,814 SNPs and 149 SSRs), spanning a total map length of 1793 cM. The lengths of the individual LGs on the consensus map ranged from 57 - 195 cM, with a mean of 112 cM. The average distance between markers was 0.91 cM. Initially, LG5 in population 2.6-1 and LGs 1 and 15 in population 2.6-5 had two sub-groups. However, these subgroups were successfully integrated into a consensus linkage group.
QTLs Associated with Vegetative Parameters, Carotene and FAC
Results of the QTL analysis obtained using Genstat are summarized in Table 5 and Additional File 3, and distribution of QTLs on the respective LGs is shown in Figure 1. The study revealed eight significant QTLs associated with HI, RL and PCS in the integrated genetic map of BC2 interspecific hybrid populations. The three traits in combination contribute to shorter and more compact palms. Interestingly, a single genomic locus in LG4 was associated with two of the traits, namely HI and PCS, while the QTL for the third trait RL, was located in close proximity in the same LG. An additional QTL for PCS was identified in LG4 for population 2.6-5 (Additional file 3), which also appeared on the integrated LG4. The two QTLs for PCS on LG4 were more than 150 cM apart, so they are likely not linked.
In addition, ten other QTLs were identified for traits associated with fatty acid composition (FAC) namely, IV, C16:0, C18:1 and C18:2 content across six LGs. Single QTLs for IV and C16:0 were, as in a previous study [20], located in LG1. Relatively high LOD levels were detected for some of the compactness and FAC traits, namely HI (7.77), PCS (7.70), C18:2 (8.19) and IV (13.02). Interestingly, this study also for the first time revealed four QTLs associated with carotene content across four different LGs.
Table 4: Distribution of markers on the 16 linkage groups (LG) of the BC2 genetic map
Population
LG
|
2.6-1
|
2.6-5
|
Integrated
|
Map Length (cM)
|
No. Markers
|
Map Length (cM)
|
No. Markers
|
Map Length (cM)
|
No. Markers
|
1
|
115
|
121
|
88+23*
|
95+9*
|
138
|
146
|
2
|
101
|
87
|
97
|
62
|
113
|
107
|
3
|
55
|
48
|
58
|
42
|
57
|
54
|
4
|
174
|
178
|
212
|
117
|
195
|
202
|
5
|
25.2 + 14.8*
|
21+17*
|
53
|
22
|
100
|
44
|
6
|
106
|
129
|
126
|
82
|
122
|
135
|
7
|
78
|
173
|
85
|
117
|
113
|
189
|
8
|
122
|
132
|
175
|
97
|
152
|
154
|
9
|
82
|
56
|
37
|
15
|
83
|
157
|
10
|
88
|
129
|
93
|
93
|
116
|
138
|
11
|
128
|
166
|
133
|
125
|
145
|
179
|
12
|
108
|
130
|
124
|
91
|
124
|
144
|
13
|
77
|
127
|
57
|
96
|
80
|
135
|
14
|
98
|
77
|
88
|
69
|
99
|
94
|
15
|
69
|
119
|
52 + 46*
|
79+10*
|
98
|
142
|
16
|
50
|
34
|
63
|
33
|
58
|
43
|
Total
|
1505
|
1744
|
1564
|
1254
|
1793
|
1963
|
* Sub-groups
Table 5: QTLs associated with compactness traits and FAC identified via Genstat on the interspecific BC2 integrated map
|
|
BC2 Integrated Map
|
HI
|
Closest marker to QTL peak
|
LG
|
Position (cM)
|
LOD
|
QTL Interval (cM)
|
SNPM00563
SNPM04928
|
4
7
|
4.27
110.78
|
7.77
3.83
|
0 – 17.36
101.46 – 113.25
|
RL
|
SNPM03201
SNPM03772
SNPM03676
|
4
8
11
|
11.30
92.56
38.69
|
3.79
3.17
4.17
|
8.06 – 17.36
81.63 – 99.73
37.43 – 47.36
|
PCS
|
SNPM00563
SNPM03375
sEg00213
|
4
4
8
|
4.27
192.47
139.86
|
7.70
3.25
3.81
|
0 – 17.36
187.44 – 193.13
137.88 – 147.19
|
C16:0
|
SNPM00796
|
1
|
133.4
|
3.59
|
132.5 – 133.9
|
C18:1
|
SNPM02507
SNPM03249
SNPM00274
|
4
8
12
|
169.17
44.0
31.38
|
3.44
3.19
5.97
|
169.17
44.0 – 47.48
22.00 – 34.15
|
C18:2
|
SNPM01602
SNPM00249
SNPM01190
|
1
4
15
|
124.87
3.43
70.32
|
4.62
8.19
5.14
|
122.00 – 138.78
0 – 17.36
58.20 – 98.11
|
IV
|
SNPM01452
SNPM04197
SNPM03285
|
1
3
15
|
132.53
15.78
98.11
|
13.02
3.72
5.98
|
91.79 – 138.78
12.36 – 19.94
80.21 – 98.11
|
Carotene
|
SNPM02349
SNPM00729
SNPM03960
SNPM03921
|
3
4
7
10
|
4.35
181.83
108.53
77.39
|
5.12
3.32
3.73
3.63
|
0 – 22.98
180.97 – 183.63
107.42 – 113.25
63.65 – 77.39
|
Common and Population Specific QTL
Figure 2A shows a major genomic region in LG4 influencing vegetative parameters namely HI, PCS and RL in the interspecific hybrid populations evaluated in the study. The closest marker to the QTL peak for HI and PCS was the same, namely SNPM00563, both in the two independent maps and the integrated map. Genstat revealed that the QTL related to RL for population 2.6-1 and the integrated map was also detected in the same region, about 7cM away from HI and PCS. Interestingly the QTL for C18:2, one of the most abundant unsaturated fatty acids in oleifera and interspecific hybrids also mapped around the same region for the two independent populations and the integrated maps. The SNP markers corresponding to the QTLs for HI, PCS, C18:2 and RL were also physically positioned on the genome build, spanning about 3600 kb (Figure 2B). This confirmed that the QTLs influencing the traits were located in close proximity, thus suggesting a major genomic region influencing compactness and unsaturation level in interspecific hybrids. Another QTL for PCS, specific to population 2.6-5, was also located at the other end of LG4.
It was also clear that there were other population specific QTLs [see Additional file 3], where for example, a significant QTL for RL was located in LG8 for population 2.6-1 as well as the integrated map. Similarly, there were population specific QTLs for HI, PCS and fatty acids compositions (C18:1, C18:2 and IV) reflecting the diversity of the 2 populations used in the study. The QTLs linked to carotene content were also specific to population 2-6-5 (also in the integrated map), likely because the variation for the trait was higher in population 2-6-5 (40.27%) than that observed in 2-6-1 (33.27%) (Table 1).
The integrated map also proved useful in detecting QTL not detected in an individual population, especially for FAC. QTL was revealed for C18:2 in the integrated map (confirmed as minor QTL in MapQTL analysis below in Table 6), and correspond to the same linkage group previously linked to FAC in other studies [17]. Although a QTL was associated with RL in LG13 in the population 2.6-5 (Additional File 3), this was not reproducible in the integrated map. Interestingly, the integrated map while maintaining the population specific QTL for RL in LG4, revealed a new QTL for RL in LG11, which was not detected in the individual populations. The QTL in LG11 was also confirmed as a minor QTL based on analysis carried out using MapQTL (Table 6).
Further Confirmation of QTLs via Interval Mapping (IM)
The interval mapping approach implemented via MapQTL was used to further validate the QTLs identified earlier. The analysis was restricted to the integrated map and the results are presented in Table 6 and Additional file 4. Generally, the QTLs detected earlier (with one exception), were also revealed as influencing the specific traits and could be further divided into major QTLs (significant at genome-wide) and suggestive minor QTLs (significant at chromosome-wide). The QTLs linked to HI in LG4 and LG7 as well those linked to PCS in LG4 and LG8 were major QTLs, significant at genome-wide level. The genomic region associated with RL in LG8 was also a major QTL. However, the QTLs linked to RL via Genstat in LG04 and LG11, were considered as minor QTLs (significant at chromosome wide) in the MapQTL analysis. In terms of FAC, QTLs linked to C18:1 (LG04 and LG08), C18:2 (LG4), C16:0 (LG1) and IV (LG1 and LG15) were also significant at genome-wide, as were the QTLs linked to carotene content in LG03, LG04, LG07 and LG10. However, the QTLs linked to C18:2 in LG1 and IV (LG03) were only significant at chromosome-wide, indicating they were minor (suggestive) QTLs pointing to a region that may be regulating the traits concerned. The only exception was the QTL linked to C18:1 in LG12, which was not significant in the MapQTL analysis, and as such, not considered as representing a genomic region associated with the trait. A point to emphasize is that the QTLs revealed in integrated maps, but not in the individual maps, particularly C18:2 (LG1) and RL (LG11), were actually revealed as minor QTLs. This clearly demonstrates that the detection power for minor QTLs is significantly enhanced in the integrated map.
The phenotypic variation observed is also indicated in Table 6. The phenotypic variation described by the major QTL for PCS, HI and Carotene content were above 20%, indicating their major influence on the traits concerned, while that for minor QTLs linked to RL was below 20%. Similarly, the phenotypic variation observed for the major QTLs linked to FAC were mostly between 20 – 35 %, similar to that reported by [39], but slightly higher than reported by [16], where the range was 11 – 20% for similar FAC. The highest phenotypic variation observed was that for IV (measure of level unsaturation) consistent with [16], who also observed highest phenotypic variation for IV.
Segregation profiles of markers associated with QTLs
Relationships between the genotype profile of the closest marker (to the QTL peak) and the traits of interest was determined. Palms were differentiated by their genotype (“aa”, “ab” or “bb”), and the phenotype means compared between the different genotypes (Figure 3 and Additional file 5). All the selected markers associated with FAC and vegetative traits were significantly different (P<0.05) in the traits for the different genotypes. Interestingly, the markers SNPM01452 and SNPM00796 in LG01 linked to IV and C16:0 content respectively are located in close proximity (<1cM apart) and show significant difference for the traits (P<0.05) between the genotypes. The “aa” genotype for the respective markers had higher C16:0 content (saturated fatty acid) and lower IV content (lower levels of unsaturation), indicating the segregation profile of the markers in close proximity are in agreement with the negative correlation between both traits. Similarly, PCS and HI are positively correlated and have the same marker, SNPM00563 linked to both traits in LG04. The “aa” genotype results in smaller PCS and lower HI than the “ab” genotype, also in agreement with their positive correlation.
Table 6. QTLs detected using both the Genestat and MapQTL methods
QTL
|
Genestat
|
MapQTL
|
Marker
|
LG
|
Position (cM)
|
LOD Score for Genestat
|
LOD threshold for MapQTL
|
LOD score
|
Adjusted Phenotypic variation
|
HI
|
SNPM00563
SNPM04928
|
4
7
|
4.27
110.78
|
7.77
3.83
|
3.4
|
5.69
4.33
|
30.68
29.07
|
RL
|
SNPM03201
SNPM03772
SNPM03676
|
4
8
11
|
11.30
92.56
38.69
|
3.79
3.17
4.17
|
3.3
|
3.0*
4.28
3.05*
|
14.44
22.78
16.44
|
PCS
|
SNPM00563
SNPM03375
sEg00213
|
4
4
8
|
4.27
192.47
139.86
|
7.70
3.25
3.81
|
3.5
|
4.61
3.87
3.94
|
24.30
21.71
24.19
|
C16:0
|
SNPM00796
|
1
|
133.4
|
3.59
|
3.6
|
4.20
|
28.73
|
C18:1
|
SNPM02507
SNPM03249
SNPM00274#
|
4
8
12
|
169.2
44.03
31.38
|
3.44
3.19
5.97
|
3.3
|
4.33
4.25
-
|
29.82
29.32
-
|
C18:2
|
SNPM01602
SNPM00249
SNPM01190
|
1
4
15
|
124.87
3.43
70.32
|
4.62
8.19
5.14
|
3.6
|
3.31*
5.55
3.80
|
21.58
34.21
26.12
|
IV
|
SNPM01452
SNPM04197
SNPM03285
|
1
3
15
|
132.53
15.78
98.11
|
13.02
3.72
5.98
|
3.5
|
8.93
3.4*
8.51
|
52.00
22.09
74.16
|
Carotene
|
SNPM02349
SNPM00729
SNPM03960
SNPM03921
|
3
4
7
10
|
4.35
181.83
108.53
77.39
|
5.12
3.32
3.73
3.63
|
3.7
|
4.24
4.36
4.07
4.83
|
27.51
28.22
33.04
30.85
|
*Significant at chromosome wide
# SNPM00274 was considered not significant, as association with C18:1 was not reproducible in the MapQTL analysis
|
Candidate genes identified within the QTL intervals
Candidate genes residing within the QTL interval were identified using the existing oil palm genome assembly [28]. Blast results to the genome build identified 21 candidate genes within the QTL confidence intervals affecting the vegetative traits and FAC. The ERECTA gene [GenBank: XM_010910431.1] was found in the QTL interval linked to PCS, RL and HI on LG4. In addition, the QTL region for HI in LG7 also revealed an interesting gene with high similarity to the auxin transport protein BIG [GenBank: XM_010943964.2] in oil palm. Similarly, we identified BAM1 [GenBank: XM_010914345.2] which co-localized with markers in the QTL region associated with RL on LG11. For FAC, two 3-ketoacyl-CoA synthase genes in Arabidopsis, CUT1 [GenBank: XM_010917870.2] and KCS11 [GenBank: XM_010916640.2] flanked the QTLs for IV, C16:0 and C18:2 on LG1. Details of all 21 genes identified are provided [see Additional file 6].