Phenotypic variation in the F2 population
To understand the genetic basis of fruit traits, pumpkin inbred lines of N137 with long fruit and N29 with oblate fruit were selected as the parents to generate an F2 segregating population. The long and oblate fruits are the dominant fruit shapes in the commercial pumpkin cultivars consuming for cuisine. The fruits of two parental lines, F1 hybrid, and several F2 individuals are shown in Fig. 1. Phenotypic variations of FL, FD, FSI, FTH, SCS, FW, and TSS were recorded for the parental lines, F1 hybrids and individuals in the F2 population at fruit commercially matured stage. Distributions of these traits are shown in Fig. 2. The FL of N137 was 5 folds higher than that of N29. While, the FD of N29 was 2 folds higher than that of N137. The FSI for N137 and N29 were 3.65 and 0.34, respectively. The FTH and SCS in N29 were higher than that of N137. The FW and TSS were higher in N137 compared with N29. Remarkable differences of these fruit-related traits between N137 and N29 demonstrated that different alleles harbored by the parental lines. Based on the performance of F1 hybrid, FL, FD, FSI, and SCS had high degree of dominance, while FTH, FW, and TSS exhibited transgressive segregation. Both the histograms and density plots displayed the continuous distributions of these fruit-related traits in the F2 population, suggesting their quantitative inheritances.
Person correlation coefficients were calculated between the traits, which are presented in Fig. 2. Highly positive correlations were observed between FSI and FL, FD and SCS, as well as FD and FTH, respectively, suggesting their potentially common genetic basis. While, highly negative correlations was observed between FSI and FD, FSI and SCS, FD and FL, respectively, implying different genetic basis between these traits. TSS exhibited very low or no significant correlation with the other traits, demonstrating its distinct genetic basis. Moderate correlations were observed between the remaining pairs of traits, suggesting different genetic factors may regulate them.
GBS sequencing and SNP calling
To determine the genotype of each plant, a total of 202 GBS libraries, including the 200 F2 individuals and the two parent lines, were sequenced and yielded 158.0 Gb paired-end reads. The clean reads for the parental lines of N29 and N317 were 3.4 Gb and 3.7Gb, representing 12.5 × and 13.7 × sequencing depth, respectively. The clean reads generated by the F2 individuals ranged from 0.45 Gb to 0.90 Gb, with the mean of 0.66 Gb. The corresponding sequencing depths of the F2 individuals varied from 1.7 × to 3.4 × with the mean of 2.5 ×.
The clean reads were aligned to the C. moschata reference genome using bwa-mem2. The successful mapping rates varied from 98.1–99.7% with the mean of 99.3%. Using GATK HaplotypeCaller, a total of 2073737 variants, including 1649104 SNPs and 431304 InDels, were identified. BCFtools identified 2355207 variants, including 2117313 SNPs and 237894 InDels. Intersection analysis was conducted between the variants identified by GATK and BCFtools. The results showed that a total of 1423580 variants, including 1391130 SNPs and 32450 InDels were collectively identified by both GATK and BCFtools. The shared variants were selected and further filtered. The biallelic SNP loci with homozygous genotypes in each parental line and exhibiting polymorphism between the parental lines were kept. The loci with larger than 20% genotype missing ratio were filtered out, resulting in a total of 7082 high-quality SNPs with the Ts/Tv ratio of 2.13.
Genetic map construction
Lep-MAP3 was used to construct the genetic map. The parental genotypes firstly were called using the ParentCall2 module. The loci with significant segregation distortion (p <0.001) were removed. The remaining SNPs were separated into LGs with the LOD scores varying from 20 to 40. At LOD score of 27, a total of 6918 SNPs were classified into 20 LGs and the number of single markers was 13. The other LOD scores resulted in number of LGs different from 20 or larger number of single markers. As a result, LOD score of 27 was adopted to separate the SNPs into LGs. The SNPs were then ordered in each LG and the Kosambi function was used to calculate their genetic distances. The SNPs locating in the interval without recombination were collapsed into a bin. The length and genotype for each bin are show in Fig. S1. All the SNPs were grouped into their corresponding chromosome, demonstrating a chromosome-level genetic map was established (Fig. S2). The number of crossovers in the F2 population varied from 33 to 64 with the mean of 46, which was reasonable in a F2 segregating population with 200 individuals (Fig. S3). The recombination fractions and the corresponding LOD scores for tests of linkage between all pairs of markers are shown in Fig. S4. The largest LOD was observed between the adjacent markers on the LGs, indicating the high quality of the genetic map. Good agreements were observed between the genetic marker positions and physical marker positions on the reference genome, adding confidence in the quality of this map (Fig. 3). All these evidences proved that a high-quality and chromosome-level genetic map was constructed for pumpkin. Summary statistics on the genetic map are list in Table 1. The genetic map comprised of 2413 bins spanning a total length of 2252.10 cM with an average genetic distance of 0.94 cM. Numbers of bins in the LGs varied from 74 (Chr20) to 192 (Chr4). Lengths of the LGs ranged from 78.85 cM (Chr20) to 197.28 cM (Chr4), with the average of 112.61 cM. The averaged spacing in each LG ranged from 0.74 cM to 1.18 cM with the mean of 0.94 cM. The maximum spacing in each LGs varied from 5.27 cM (Chr15) to 11.45 cM (Chr17). The high-density genetic map was integrated with the data of fruit-related traits for QTL analysis.
Table 1
Summary statistics on the genetic map
Chromosome/ linkage group | Number of bins | Length (cM) | Average spacing (cM) | Maximum spacing (cM) |
1 | 171 | 141.93 | 0.83 | 9.36 |
2 | 138 | 124.13 | 0.91 | 6.54 |
3 | 115 | 114.89 | 1.01 | 6.03 |
4 | 192 | 197.28 | 1.03 | 7.30 |
5 | 118 | 99.85 | 0.85 | 6.54 |
6 | 123 | 112.60 | 0.92 | 5.52 |
7 | 101 | 101.39 | 1.01 | 6.79 |
8 | 88 | 102.95 | 1.18 | 9.62 |
9 | 125 | 97.32 | 0.78 | 6.28 |
10 | 125 | 107.07 | 0.86 | 5.52 |
11 | 172 | 140.86 | 0.82 | 6.28 |
12 | 148 | 108.61 | 0.74 | 6.54 |
13 | 82 | 89.28 | 1.10 | 11.45 |
14 | 119 | 125.86 | 1.07 | 5.27 |
15 | 105 | 106.85 | 1.03 | 5.27 |
16 | 98 | 98.10 | 1.01 | 6.28 |
17 | 108 | 103.49 | 0.97 | 11.45 |
18 | 124 | 114.85 | 0.93 | 6.28 |
19 | 87 | 85.96 | 1.00 | 9.88 |
20 | 74 | 78.85 | 1.08 | 5.27 |
Overall | 2413 | 2252.10 | 0.94 | 11.45 |
Identification of QTLs underlying fruit traits
QTL analysis for the fruit traits was performed in R/qtl using the CIM method. Genome-wide views of QTLs detected in the F2 segregating population are illustrated in Fig. 4. Detailed information on the QTLs is reported in Table 2. In total, 30 QTLs were detected for the 7 traits.
Table 2
QTLs for fruit traits detected in pumpkin
Traits | QTL | Chr | Peak marker position (cM) | Peak LOD | P value | 1.5 LOD interval left maker | 1.5 LOD interval right maker | Additive effects | Dominant effect | PVE |
FL | fl2.1 | 2 | 1.25 | 10.08 | 0.000 | Chr02_15483 | Chr02_479657 | -4.45 | -2.18 | 8.02 |
fl6.1 | 6 | 97.34 | 6.95 | 0.004 | Chr06_9951181 | Chr06_10445766 | 3.93 | 0.68 | 5.67 |
fl14.1 | 14 | 40.02 | 5.73 | 0.026 | Chr14_3471947 | Chr14_3832932 | -1.92 | 3.67 | 3.45 |
FD | fd1.1 | 1 | 65.39 | 7.09 | 0.001 | Chr01_5335115 | Chr01_7205006 | 0.97 | 1.02 | 6.66 |
fd5.1 | 5 | 18.76 | 6.49 | 0.010 | Chr05_1998816 | Chr05_2265078 | 0.01 | 1.61 | 6.05 |
fd12.1 | 12 | 24.79 | 6.95 | 0.001 | Chr12_1829011 | Chr12_1931323 | -0.38 | -1.38 | 4.55 |
fd15.1 | 15 | 54.00 | 6.15 | 0.014 | Chr15_4022423 | Chr15_4922896 | -0.99 | 0.38 | 4.97 |
fd19.1 | 19 | 28.81 | 6.33 | 0.013 | Chr19_2048733 | Chr19_2518096 | 0.07 | 1.19 | 3.26 |
FSI | fsi2.1 | 2 | 1.25 | 8.02 | 0.000 | Chr02_15483 | Chr02_479657 | -0.37 | -0.17 | 8.46 |
fsi6.1 | 6 | 97.84 | 11.38 | 0.000 | Chr06_9354982 | Chr06_10262644 | 0.28 | 0.13 | 4.95 |
fsi15.1 | 15 | 80.00 | 6.55 | 0.002 | Chr15_7914605 | Chr15_8691834 | 0.17 | -0.43 | 6.84 |
FTH | fth1.1 | 1 | 71.40 | 5.43 | 0.038 | Chr01_8382741 | Chr01_9215814 | 0.15 | 0.20 | 4.22 |
fth4.1 | 4 | 53.83 | 6.39 | 0.012 | Chr04_4341605 | Chr04_5663770 | 0.04 | 0.28 | 3.86 |
fyh5.1 | 5 | 11.00 | 5.56 | 0.031 | Chr05_1409933 | Chr05_1998816 | 0.03 | 0.31 | 4.46 |
fh6.1 | 6 | 77.56 | 7.30 | 0.003 | Chr06_8853925 | Chr06_9460491 | 0.19 | 0.10 | 4.14 |
fth14.1 | 14 | 8.25 | 7.07 | 0.004 | Chr14_975739 | Chr14_1306550 | 0.10 | 0.29 | 4.88 |
fth16.1 | 16 | 36.53 | 6.10 | 0.016 | Chr16_2836084 | Chr16_3412954 | 0.10 | 0.18 | 2.55 |
SCS | scs5.1 | 5 | 19.01 | 11.28 | 0.000 | Chr05_2129410 | Chr05_2265078 | -0.05 | 1.07 | 5.23 |
scs12.1 | 12 | 95.60 | 8.11 | 0.000 | Chr12_10532303 | Chr12_10661046 | -0.34 | -0.78 | 3.52 |
FW | fw1.1 | 1 | 13.26 | 9.19 | 0.000 | Chr01_1149445 | Chr01_1255673 | 0.30 | 0.25 | 5.05 |
fw2.1 | 2 | 14.76 | 11.2 | 0.000 | Chr02_1355009 | Chr02_1712624 | -0.08 | 0.37 | 4.01 |
fw12.1 | 12 | 61.33 | 5.84 | 0.020 | Chr12_7581002 | Chr12_11670428 | -0.08 | -0.38 | 4.33 |
fw16.1 | 16 | 37.53 | 8.14 | 0.001 | Chr16_2513448 | Chr16_3198950 | 0.09 | 0.41 | 5.13 |
TSS | tss2.1 | 2 | 56.54 | 8.96 | 0.000 | Chr02_4883539 | Chr02_5529654 | -0.29 | -0.13 | 2.60 |
tss3.1 | 3 | 9.76 | 5.46 | 0.043 | Chr03_1131739 | Chr03_1436846 | 0.05 | -0.54 | 3.99 |
tss4.1 | 4 | 137.23 | 6.28 | 0.010 | Chr04_16131184 | Chr04_16757538 | 0.25 | -0.59 | 6.03 |
tss5.1 | 5 | 7.25 | 6.71 | 0.007 | Chr05_1078419 | Chr05_1769395 | -0.05 | -0.68 | 6.06 |
tss7.1 | 7 | 82.59 | 6.02 | 0.016 | Chr07_6470983 | Chr07_6998095 | 0.44 | -0.28 | 6.58 |
tss8.1 | 8 | 8.00 | 9.55 | 0.000 | Chr08_1038422 | Chr08_1574490 | 0.20 | 0.53 | 4.98 |
tss9.1 | 9 | 89.00 | 9.99 | 0.000 | Chr09_8704222 | Chr09_10729153 | 0.19 | 0.45 | 3.96 |
Fruit size was evaluated in terms of FL and FD. Three QTLs, namely fl2.1, fl6.1, and fl14.1, were detected for FL, which could explain 8.02%, 5.67%, and 3.45% of observed phenotypic variance (PVE), respectively. The fl2.1 had negative additive and dominant effects on FL, whereas, fl6.1 had positive additive and dominant effects on FL. A total of 5 QTLs were identified for FD, which located on the chromosomes of 1, 5, 12, 15, and 19, respectively. The total PVE of the 5 QTLs reached up to 25.45%. No co-localized QTLs were found between FL and FD, demonstrating their different genetic basis.
Fruit shape was defined using the ratio of FL to FD, namely FSI. Three QTLs with the total PVE of 20.25% were detected for FSI. The fsi2.1 and fsi6.1 were overlapped with fl2.1 and fl6.1, respectively, suggesting that FL plays a major role in determining fruit shape. Meanwhile, fsi15.1 was closely linked with fd15.1.
Fruit internal quality was evaluated in terms of FTH and SCS. Six QTLs with the total PVE of 24.11% and 2 QTLs with the total PVE of 8.75% were detected for FTH and SCS, respectively. The fth1.1 was closely linked with fd1.1. The fth5.1 and fth6.1 were co-localized with fd5.1 and fsi6.1, respectively. While, scs5.1 was overlapped with fd5.1. These results demonstrated that the genetic basis of fruit shape and internal quality were closely related.
FW is an important trait that determines the yield. A total of 4 QTLs for FW were identified. These QTLs showed similar effects on FW with the total PVE of 18.52%. The fw12.1 and fw16.1 were co-localized with scs12.1 and fth16.1, respectively, indicating that FTH and SCS share the same genetic basis with FW to some extent.
TSS is an important factor determining fruit sensory quality. Seven QTLs were detected for TSS with the total PVE of 34.20%. All the QTLs had small effects on TSS. The tss5.1 was overlapped with fth5.1 and closely linked with fd5.1 and scs5.1. These results suggested that this region possibly had pleiotropic effect on fruit size and quality.