DOI: https://doi.org/10.21203/rs.3.rs-365411/v1
Wheat grain protein content (GPC) is an important quality indicator. The GPC of wheat grown in the middle and lower reaches of the Yangtze River is often low. Marker-assisted selection (MAS) is an effective tool for improving quantitative traits; however, except Gpc-B1, most markers have not been effectively applied in GPC improvement, although many related loci have been identified. Linkage analysis using a recombinant inbred line population from the cross of core parents of Ningmai 9 and Yangmai 158 and association mapping using the local cultivated varieties were performed and nine candidate intervals were identified. The appropriate kompetitive allele specific PCR (KASP) markers associated with GPC were successfully developed and applied in 1163 F4 breeding lines. Three markers, Kgpc-2B, Kgpc-2D, and Kgpc-4A, were validated to be significantly related to GPC by large-scale association mapping, and they were combined to achieve the highest efficiency to enhance GPC. We applied these markers in 164 F6 breeding lines and obtained 15 lines with high GPC, indicating their high selective efficiency. Further, strategies for gene exploration in the three significant intervals were proposed. These results were expected to provide a novel route for improving GPC in wheat quality breeding.
Linkage and association mapping identified 9 candidate intervals for wheat GPC, and large-scale association mapping based on 9 corresponding KASP markers and 1163 F4 breeding lines revealed 3 significant markers.
Wheat is one of the most widely grown food crops and provides 20% of human food energy requirements and 25% of proteins consumed on a daily basis, worldwide (Wheat Initiative, www.wheatinitiative.org/). Grain protein content (GPC) of wheat is also related to the end-use quality. Therefore, improving GPC is an important breeding objective for programs that aim to increase the nutritional value and quality of wheat.
GPC is a quantitative indicator of an important multi-gene trait (Balyan et al. 2013), and the quantitative trait loci (QTL) associated with GPC have been reported on all chromosomes (Fatiukha et al. 2020; Groos et al. 2003; Kunert et al. 2007; Ma et al. 2012). Echeverry-Solarte et al. (2015) used 163 recombinant inbred lines (RILs) derived from two widely divergent parents and detected 11 QTL on 9 chromosomes, and two of these QTL had major effects, explaining up to 16.5% and 16.9% of phenotypic variation. Fatiukha et al. (2020) identified 12 GPC QTL in an RIL population derived from the elite durum cultivar Svevo and a wild emmer wheat accession Y12-3, and four major QTL with the highest LOD scores and relatively high and stable phenotypic variance explained (PVE) were found on chromosomes 4BS, 5AS, 6BS, and 7BL. The QTL on 6BS had similar physical position as that on the cloned QTL, Gpc-B1 (Uauy et al. 2006). A worldwide bread wheat core collection, including 372 accessions, was genotyped using DArT, SSR, and single nucleotide polymorphism (SNP) markers on the whole wheat genome, and 14 stable markers across 15 chromosomes were found to be associated with GPC (Bordes et al. 2011). A genome-wide association study was conducted using 189 spring bread wheat genotypes from the International Center for Agricultural Research in the Dry Areas, and 37 significant GPC SNPs were identified, some of which also exhibited an effect on gluten content (Suliman et al. 2021). However, the mapping results were variable with different mapping populations, methods, and environments, and most studies mainly focused on the identification of related loci, but the consequent validation and application were seldom reported except for Gpc-B1.
SNP markers have emerged as powerful tools for many genetic applications due to their low assay cost, high genomic abundance, locus specificity, and codominant inheritance. Kompetitive allele specific PCR (KASP) assay provides a rapid and cost-effective way to precisely determine SNP genotypes that perfectly fit the marker-assisted selection (MAS) on a large-scale (Semagn et al. 2014). Rasheed et al. (2016) developed and validated 70 functional KASP markers that are associated with wheat adaptability, grain yield, quality, and biotic and abiotic stress resistances, revealing that the KASP assays were 45 times faster compared with gel-based PCR markers. Lin et al. (2021) identified a major QTL related to grain number per spikelet on chromosome 2D and generated a closely linked KASP marker to validate its effects in different genetic backgrounds. Zhang et al. (2021) identified 22, 22, and 23 loci associated with seedling resistance to leaf rust, adult plant resistance to leaf rust, and adult plant resistance to stripe rust, using GWAS, respectively, and they converted 12 associated SNPs into KASP markers and verified in bi-parental populations.
A combination of linkage mapping and GWAS analysis can provide richer and more accurate information for QTL detection. Dhakal et al. (2018) identified a QTL for wheat curl mite resistance on chromosome 6DS in the wheat cultivar TAM 112 using linkage and association analyses, thus demonstrating the effectiveness of this locus to reduce symptom severity. QTrl.saw-2D.2 that controls total root length, was identified in the RILs from SY95-71×CH7034 using linkage mapping and was validated in another population comprising 215 diverse lines, which suggests an effective and reliable result (Zheng et al. 2019).
The middle and lower reaches of the Yangtze River is the largest soft wheat production area and the second-largest wheat belt in China (He et al. 2002). The quality traits of wheat, including GPC, have received more attention in recent years; however, limited progress has been made in genetic improvement of GPC in this area (He et al. 2018; Zhao et al. 2020). In this study, we performed linkage and association mapping in the wheat grown in this area to identify the genetic loci related to GPC, and successfully developed and validated the corresponding KASP markers. On the basis of these results, we obtained some effective molecular markers which can be utilized in MAS for GPC in the middle and lower reaches of the Yangtze River, and are expected to provide assistance for the quality breeding.
Materials
A RIL population derived from Ningmai 9 × Yangmai 158 (282 F8 RILs) was used for linkage mapping. Ningmai 9 is a soft wheat variety, whereas Yangmai 158 is a hard wheat variety. The RILs and their parents were planted using an augmented design with single row plots, 60 seeds per row with 1.6 m row length, 0.25 m row spacing in the growing seasons of 2015–2016 (E1), 2016–2017 (E2), 2017–2018 (E3), and 2018–2019 (E4) at Nanjing, China. Here 30 RILs were replicated twice and randomly distributed across the population (Lan et al. 2015).
A natural population comprising 103 local varieties released in the period of 1972–2016 in the middle and lower reaches of the Yangtze River (Supplemental Table 1) was planted in a randomized complete block design experiment with the same plots as that of RIL population, and two replicates in the growing seasons of 2015–2016 (E1), 2016–2017 (E2), 2017–2018 (E3), 2018–2019 (E4), and 2019–2020 (E5) at Nanjing, China.
All mature grains were harvested for GPC testing, and the 1163 F4 lines and 164 F6 lines were harvested from the breeding population in 2019–2020 at Nanjing, China. The GPC was measured using a Perton DA7200 (Sweden) device, according to the AACC Method 39-10.
Genotyping
Total genomic DNA was isolated from young leaf tissues using the CTAB method (Porebski et al. 1997). The RILs and their parents were genotyped using Illumina 90 K SNP assay (Wang et al. 2014). The genetic map containing 2285 bin markers with a total length of 3002 cM has been published in a previous study (Jiang et al. 2020). The natural population was genotyped using Affymetrix 50 K assay (CapitalBio Technology, Beijing, China). All markers were blasted in IWGSC Reference Sequence v1.0 to obtain the physical positions (The International Wheat Genome Sequencing Consortium 2018).
Phenotypic analysis and QTL analysis
Preliminary statistical analysis and correlation analysis for phenotype were performed in Microsoft Excel 2016. The test for analysis of variance (ANOVA) was conducted using IBM SPSS Statistics 19.0. The broad sense heritability (h2) was estimated according to He et al. (2016): h2 = δg2/(δg2 +δg*y2/y +δe2/ry). δg2 was genetic variance, δg*y2 was genotype × environment variance, δe2 was error variance, and y was number of environments, r was number of replication in each environment.
QTL mapping was conducted using the BIP function in IciMapping 4.1, and the algorithm of inclusive composite interval mapping (ICIM) was selected (Li et al. 2007). The walking step for QTL detection was set as 0.1 cM, and the threshold of LOD scores was set as 2.5.
Association mapping
Quality control for the genotype data from Affymetrix 50 K assay was performed in TASSEL software V5.2.13 (Bradbury et al. 2007), and the SNP markers with a minor allele frequency ≤ 5% and missing data >10% were removed.
The population structure was evaluated using STRUCTURE software 2.3.4 (Hubisz et al. 2009). The number of subsets (k) was set from 1 to 8 with five iterations each. Markov Chain Monte Carlo was used with a burn-in period of 10,000 repeats and 100,000 repeats after burn-in. The number of subgroups in the population (△K) was evaluated according to Evanno et al. (2005). The kinship matrix was calculated using the VanRaden method (VanRaden 2008).
GWAS was performed using R software through the GAPIT package (Lipka et al. 2012), and Bayesian-information and linkage-disequilibrium iteratively nested keyway method was selected (Huang et al. 2018). Principle components (PCs) calculated using TASSEL software V5.2.13 were used as covariates to reduce false positives. The threshold of the significant P values was set at 10−3.
Development, validation, and application of KASP markers
KASP primers which contained two allele-specific forward primers and one common reverse primer were designed based on the sequences of identified SNPs using PolyMarker (Ramirez-Gonzalez et al. 2015) and synthesized by Shanghai Sangon Biotech Co., Ltd. (Supplemental Table S2). The KASP assays were performed in 384-well PCR plates with 2.5 μL KASP 2X Reaction Mix, 0.07 μL assay mix (12 μL each allele-specific forward primer [100 μM], 30 μL reverse primer [100 μM], and 46 μL TRIS [10 mM, pH 8.3]), and 2.43 μL 30 ng μL−1 genomic DNA. The PCR program was as follows: 94°C for 15 min; 10 touchdown cycles of 94°C for 20 s, 61°C–55°C for 60 s (decreasing by 0.6°C per cycle); and 26 cycles of 94°C for 20 s and 55°C for 60 s. The PCR reactions was carried out in Hydrocycler 16 water instrument for PCR (LGC Genomics, Beverly, USA), and the PCR Fluorescence detection was performed using a PHERAstar (BMG LABTECH, Germany) microplate reader. KlusterCaller software (LGC Genomics, Beverly, USA) was used to analyze the data.
All the F4 lines were screened with the developed KASP markers, and a large-scale association mapping was performed to identify significant markers by TASSEL software V5.2.13 based on general linear model (GLM). Then we used the significant KASP markers to screen the F6 lines to test the select effect.
Exploration of candidate genes
To investigate candidate genes within the identified intervals, high confidence (HC) genes were extracted from the JBrowse website (https://urgi.versailles.inra.fr/jbrowseiwgsc/gmod_jbrowse/) according to IWGSC Reference Sequence v1.0 (The International Wheat Genome Sequencing Consortium 2018). The gene expression pattern was determined by blasting the predicted gene-id with WheatEXP (https://wheat .pw.usda.gov/WheatExp/) (Ramirez-Gonzalez et al. 2018).
For sequencing the candidate genes, multiple pairs of primers were designed to amplify the whole gene using Primer 6.0, and the sequencing services were provided by Shanghai Sangon Biotech Co., Ltd.
Phenotype analysis
The description of genotype for the RIL and natural populations is shown in Table 1. In the RIL population, the parent Ningmai 9 showed a lower GPC than Yangmai 158 in all environments, which was consistent with their production performance. The max–min and coefficient of variation (CV) were above 5%, except for E1. The GPC in the natural population also presented a great variation, with CV ranging from 6.28% to 10.40% across the environments. GPC showed a significant positive correlation among almost all the environments in RIL and natural populations (Table 2), and the heritability reached 0.56 and 0.74, respectively (Table 3). Both genotype and environment had significant influence on the GPC of the RIL and natural populations, and their interaction also had significant influence on GPC of the natural population, but not on that of the RIL population (Table 3).
QTL mapping
Using QTL mapping, we identified 17 QTL on 10 chromosomes, including 1A (1), 1B (1), 2B (1), 2D (1), 3A (2), 3B (3), 4A (1), 5A (2), 5B (3), and 7A (2), which explained 3.94%–9.21% of phenotypic variance (Table 4 and Fig. 1). The additive effect of eight QTLs, including Qgpc-3B.1, Qgpc-3B.3, Qgpc-4A, Qgpc-5A.1, Qgpc-5A.2, Qgpc-5B.3, Qgpc-7A.1, and Qgpc-7A.2, was from Ningmai 9 and that of the other 9 QTL was from Yangmai 158. Two of the QTLs, Qgpc-2D and Qgpc-3B.2, were identified in two environments, and the other QTL were identified only in single environment.
Association mapping
The natural population was genotyped using Affymetrix 50 K assay. After quality control, 36360 SNPs were retained for further analysis. The SNPs on the A, B, and D genomes were 11415, 13133, and 11202, respectively, and the position of other 610 SNPs was uncertain. The number of SNPs per chromosome ranged from 875 (chromosome 4D) to 2698 (chromosome 3B) (Supplemental Table S3). According to the population structure analysis, the population was divided into three distinct subgroups (△K = 3) (Fig. 2A), and a similar result was obtained from the kinship matrix analysis (Fig. 2C). Approximately one fifth of the materials was assigned to different subgroups by the two programs because of the different statistical methods used (Supplemental Table S1).
Using association mapping, we obtained 17 chromosome intervals containing significant markers associated with GPC distributed on 14 chromosomes, including 1B (1), 1D (1), 2A (1), 2B (1), 2D (1), 3A (2), 3B (1), 3D(2), 4A (1), 4B (1), 5A (2), 6B (1), 7A (1), and 7D (1) (Table 5 and Fig. 3). We then divided the intervals into three types for further analysis: Type A, P values below 10−4 and in multiple environments; Type B, P values below 10−4 and in single environment; and Type C, P values below 10−3 and in multiple environments.
Development and validation of KASP markers for candidate intervals
To further utilize the related QTL and associated markers, we tried to convert them into friendly KASP marker. Firstly, nine of these intervals were selected out according to the following three criterions (Table 6): (1) QTL detected in multi-environments; (2) Type A associated intervals; (3) repeatedly regions between QTL mapping and association mapping. Secondly, the loci with low homology in these intervals were chosen for marker development (Supplemental Table S2). At last, KASP genotyping was performed in the materials from RIL population and natural population to test the developed markers.
Association mapping was performed in a large breeding population (1163 F4 lines) with the nine successfully developed KASP markers based on GLM (Fig. 4). Then, we compared the GPC of the lines with 1~9 GPC-increased alleles and GPC-decreased alleles, and the selected order of the markers was according to the P values from lowest to highest (Fig. 5). It was found that the difference between the lines with GPC-increased alleles and GPC-decreased alleles increased as the number of markers increased, and remained relatively stable as the three markers with lowest P values were used in selection. Therefore, the three markers of Kgpc-2B, Kgpc-2D, and Kgpc-4A with low P values (<10−10) were applicable for GPC selection, and their combination presented more effective.
Application of significant KASP markers
Further, we used the three markers to test 164 F6 lines, and 15 lines with GPC-increased alleles showed an average GPC of 14.85%, which was significantly higher than 13.15% for eight lines with GPC-decreased alleles (Fig. 6 and Supplemental Table S4), thus indicating a good selection effect.
Exploration of candidate genes in the significant intervals
The published IWGSC reference sequence provided detailed information for the identification of candidate genes, and 607, 42, and 235 high confidence (HC) genes were identified in the intervals of Kgpc-2B, Kgpc-2D, and Kgpc-4A, respectively (Supplemental Table S5). We also analyzed the homologous genes of Gpc-B1, and a homologous gene of TraesCS4A01G242700 was found in the interval of Kgpc-4A. Further, TraesCS4A01G242700 of the parents, Ningmai 9 and Yangmai 158, was sequenced (Supplemental Table S6), however, no sequence difference were found. Gene expression data of TraesCS4A01G242700 was extracted from WheatEXP, indicating that TraesCS4A01G242700 was expressed in the grain, leaf, spike, and roots, and it showed the highest expression level in roots (Supplemental Table S7). In addition, 23 of the HC genes in the interval of Kgpc-2D expressed in different tissues at reproductive stage, and 17 expressed in the grain were analyzed (Supplemental Table S8).
MAS is an effective route for increasing wheat GPC in the middle and lower reaches of the Yangtze River
The middle and lower reaches of the Yangtze River is the second largest area for wheat production in China. It was considered as a region with low GPC with a warm and humid climate (He et al. 2002), and little attention was paid to the selection of high GPC in wheat breeding in this area (Zhang et al. 2013a). The quality traits of common parents in this area were not outstanding, and the GPC were mostly at a low level (Qi et al. 2012; Wen et al. 2020). However, high GPC received more attention in local markets due to a rise in demand in recent years (Guo et al. 2020).
In traditional breeding, a high GPC parent from other ecological areas was used to improve local varieties, and quality testing is usually performed after the agronomic selection in field, leading to some lines with excellent quality being abandoned in early breeding generations. In addition, many quality traits, including GPC, are sensitive to environment and show significant difference among years or locations (Conti et al. 2011; Li et al. 2009; Ma et al. 2012). Therefore, the evaluation in a single environment may be inaccurate. MAS is independent of the environment and has a great potential in wheat quality breeding. There have been many reports on the application of high-molecular weight glutenin subunit related to dough rheological properties (Gautam et al. 2020; Izadi-Darbandi and Yazdi-Samadi 2012; Zhang et al. 2013b). Many studies have shown that the heritability of GPC was relatively high (Conti et al. 2011; Fatiukha et al. 2020; Li et al. 2009), which was sufficient for genetic selection and consistent with our study. Gpc-B1, which was identified in a wild emmer wheat, was previously reported as being used for GPC selection (Carter et al. 2012; Joppa et al. 1997). However, it was absent in most Chinese varieties (Rasheed et al. 2016), indicating that it was difficult to be directly utilized in our practical breeding.
Varieties with low GPC also contained some GPC-increased alleles, and the lines with expected high GPC could be produced as polymerizing more GPC-increased alleles. In this study, neither the parents of the RIL population showed good performance in GPC. The additive effect of eight and nine QTL was contributed from Ningmai 9 and Yangmai 158, respectively, and some derived RILs, which polymerize their GPC-increased alleles showed high GPC, which is consistent with many previous studies (Fatiukha et al. 2020; Li et al. 2009). Therefore, it is more feasible to improve GPC for exploring and polymerizing the GPC-increased alleles in local varieties.
Comparison of the candidate intervals for GPC with previous studies
In this study, we identified 17 QTL and 17 associated intervals using linkage mapping and association mapping, respectively. Most of the loci had a similar or close physical position to those reported in previous studies, particularly the ones selected for KASP marker development. Li et al. (2009) identified an interval on 1B related to GPC, grain hardness and wet gluten content, and the physical position was close to Kgpc-1B. The interval of Xgwm210–Xwmc25 on 2B related to GPC, flour protein content, and some mixograph parameters covered Kgpc-2B (McCartney et al. 2006). Laperche et al. (2007) found a QTL controlling GPC and grain yield near Ppd-D1 on 2D, and it was also close to Kgpc-2D. Kgpc-3A was physically close to QGpc.caas-3A as reported by Li et al. (2012), which also affected mixograph parameters. A GPC QTL on 3B was co-located with those controlling the quantity of Glu-A1 and Glu-A3 (Zhang et al. 2011) and was near Kgpc-3B. Wang et al. (2012) identified a QTL on 4A, which was close to Kgpc-4A. QGpc-5A1 contributed by Jing 411 had a similar position to Kgpc-5A (Xu et al. 2012). Liu et al. (2017) identified an associated marker neighboring Kgpc-7A. For Kgpc-7D, no adjacent loci have been reported, whereas a QTL on flour protein content was nearby (Smith et al. 2011). Using high-density SNP assays, some QTL, such as Kgpc-2D and Kgpc-5A, were mapped to a small interval, which provided closer markers and facilitated gene exploration. Some adjacent loci were reportedly related to several quality traits and were worthy of further research and utilization. In addition, some loci, such as Qgpc-5B.1 and Qgpc-5B.2, and the associated intervals on 3D, were first reported to enrich the genetic study of GPC.
Development of available KASP markers for GPC
For quantitative traits, polymerizing more loci may produce a better effect. However, as the number of unlinked loci increased, the breeding population needed to be large enough to obtain a target genotype. When the number of unlinked loci was three or four, the minimum population sizes were 293 and 1177, respectively. As the number of loci increased to five and six, the population sizes expanded rapidly to 4714 and 18861, respectively (Wang et al. 2007). In our practical breeding program, a large number of loci and population sizes can make selection impracticable due to the high labor and time cost involved; therefore, 3–4 loci may be acceptable.
In this study, we obtained three efficient molecular markers, and their combination could produce a good selection effect, which depended on the appropriate materials, a reliable mapping method, and strict validation. First, both the parents of the RIL population, Ningmai 9 and Yangmai 158, were the main varieties; parents in the middle and lower reaches of the Yangtze River, and approximately 80% of the approved varieties in this area were derived from them during the past decade (Jiang et al. 2021). Therefore, their genetic information is widely distributed in the present breeding lines. The natural population, including most commercial varieties that were used during 1972–2016, covered the main genetic information from the middle and lower reaches of the Yangtze River. Second, linkage mapping and association mapping are both powerful methods in marker–trait association analysis, and their combination has successfully identified many loci associated with important traits, such as wheat agronomic traits and disease resistance (Liu et al. 2018; Shi et al. 2020). The statistical power of linkage mapping and association mapping were significantly related to population size, and in this study, their statistic powers were estimated to be approximately 90% and 80%, respectively (Hu et al. 2014; Li et al. 2010). Third, substantial artificial selection along with genetic recombination occurred in the separated generations of F2–F5, and the trait stability in the breeding population increased constantly, whereas the genetic heterozygosity and richness degraded gradually along with the generations. F4 lines showing relatively stable traits and rich genetic information were chosen for marker validation and produced broadly applicable markers. Moreover, the large population (>1000) provided rich variations and greatly improved the accuracy and precision of the genetic effect estimates and false discovery rate (Hu et al. 2014), thus producing a more reliable result. In addition, all the markers were based on KASP genotyping arrays, which substantially facilitated MAS for GPC.
In the application trial, 15 lines with GPC-increased alleles were selected from 164 F6 lines, and they were used as candidate lines with high GPC. In practical breeding, to obtain more candidate lines, it was better to apply the markers in early generations, such as F2 and F3. F2 enrichment increased the frequency of selected alleles and was a considerable strategy in MAS (Bonnett et al. 2005).
Exploration of candidate genes for GPC in the significant intervals
We obtained three significant markers by genetic analysis, and gene exploration in the intervals would be the next topic of study. On one hand, gene exploration is the precondition of the development of functional markers, whereas on the other hand, it can help to clarify the mechanism underlying grain protein synthesis. The physical sizes and HC genes differed greatly, so the method used for further gene exploration may be different. The database of high-quality genome and transcriptome provided great support (Kuzay et al. 2019; Zheng et al. 2019). Gpc-B1, which encoded an NAC transcription factor, has been previously cloned (Uauy et al. 2006). We found a homologous gene, TraesCS4A01G242700, in the interval of Kgpc-4A, but both the sequencing and expression pattern of TraesCS4A01G242700 objected to a possible role in GPC accumulation. Therefore, fine mapping work should be considered for too many genes included in the interval of Kgpc-4A and that of Kgpc-2B. Combining the transcriptome data, 17 candidate genes expressed in the grain were identified in the interval of Kgpc-2D; however, further genetic and functional studies are needed.
In this study, we identified 17 QTL and 17 associated intervals by linkage and association mapping, respectively, and 9 of them were selected for KASP marker development. Large-scale association mapping was conducted on the basis of nine KASP markers and 1163 F4 breeding lines, and the markers Kgpc-2B, Kgpc-2D, and Kgpc-4A were proved to be applicable for GPC selection and their combination was found to be more effective. The three markers were then used to test 164 F6 lines, and candidate lines with high GPC were successfully obtained. Further, the strategies for gene exploration in the three significant intervals were discussed. These results are expected to be useful for wheat quality breeding in the middle and lower reaches of the Yangtze River.
Acknowledgments This work was financially supported by the National Key Research and Development Program of China (2016YFD0101802 and 2017YFD0100806), and Jiangsu Agricultural Science and Technology Innovation Fund (CX(18)1001).
Author contribution statement PJ, HXM and XZ conceived and designed the research. PZ, LW and YH conducted the investigation of phenotype. PJ and CL conducted molecular marker analysis. PJ and PZ analyzed the data and wrote the article. HXM and XZ revised the article. All the authors read and approved the final version for publication.
Compliance with ethical standards
Conflicts of interest The authors declare that they have no conflict of interest.
Table 1 Phenotypic analysis of grain protein content in RIL and natural populations
Population |
Environment |
Ningmai 9 (%) |
Yangmai 158 (%) |
Max (%) |
Min (%) |
Mean (%) |
Stdev |
CV (%) |
RIL population |
E1 |
12.79 |
14.4 |
15.26 |
11.61 |
13.30 |
0.63 |
4.76 |
E2 |
11.62 |
13.88 |
16.85 |
10.23 |
13.75 |
1.14 |
8.26 |
|
E3 |
12.17 |
14.25 |
14.71 |
9.23 |
12.19 |
1.00 |
8.19 |
|
E4 |
11.7 |
12.49 |
16.74 |
11.03 |
14.05 |
0.84 |
6.00 |
|
|
|
|
|
|
|
|
|
|
Natural population |
E1 |
- |
- |
15.13 |
9.31 |
12.27 |
1.28 |
10.40 |
E2 |
- |
- |
16.62 |
10.64 |
13.30 |
1.37 |
10.27 |
|
E3 |
- |
- |
15.42 |
11.31 |
13.35 |
0.84 |
6.28 |
|
E4 |
- |
- |
16.46 |
11.44 |
13.75 |
1.18 |
8.56 |
|
E5 |
- |
- |
15.24 |
10.41 |
13.15 |
1.03 |
7.82 |
CV, coefficient of variation; RIL, recombinant inbred lines; Stdev, standard deviation
Table 2 Correlation analysis of grain protein content in the RIL and natural populations
|
E1 |
E2 |
E3 |
E4 |
E5 |
E1 |
- |
0.182** |
0.215** |
0.135* |
- |
E2 |
0.295** |
- |
0.528** |
0.329** |
- |
E3 |
0.465** |
0.321** |
- |
0.033 |
- |
E4 |
0.479** |
0.502** |
0.371** |
- |
- |
E5 |
0.378** |
0.259** |
0.387** |
0.352** |
- |
* and ** indicates significant differences at the 0.05 and 0.01 probability level, respectively.
RIL, recombinant inbred lines
Table 3 ANOVA of grain protein content by F value
Item |
RIL population |
Natural population |
Genotype |
2.21** |
9.14** |
Environment |
233.26** |
85.94** |
Genotype * Environment |
0.91 |
2.34** |
Heritability |
0.56 |
0.74 |
** indicates significant differences at the 0.01 probability level.
Table 4 QTL analysis of grain protein content
No. |
QTL |
Environment |
Genetic position (cM) |
Physical interval (Mb) |
Interval |
LOD |
PVE(%) |
AE |
1 |
Qgpc-1A |
E3 |
19.95-25.35 |
15.4-18.5 |
IAAV1469~Tdurum_contig61492_684 |
3.23 |
4.84 |
-0.97 |
2 |
Qgpc-1B |
E1 |
90.55-91.85 |
673.1-673.9 |
Tdurum_contig93330_263~Excalibur_c11190_617 |
3.03 |
4.48 |
-0.13 |
3 |
Qgpc-2B |
E4 |
25.85-27.85 |
12.8-17.4 |
BobWhite_c19945_341~RAC875_rep_c69222_996 |
3.04 |
4.90 |
-0.16 |
4 |
Qgpc-2D |
E2/E4 |
1.55-6.25 |
64.9-69.6 |
BS00073229_51~BobWhite_c47086_539 |
4.58 |
4.77 |
-0.27 |
5 |
Qgpc-3A |
E4 |
53.65-54.35 |
624.5-686.8 |
GENE-3346_167~RAC875_rep_c109433_782 |
3.15 |
5.01 |
-0.21 |
6 |
Qgpc-3A |
E2 |
69.65-73.75 |
697.2-701.0 |
Kukri_c49927_151~Excalibur_c96921_206 |
4.96 |
5.67 |
-0.28 |
7 |
Qgpc-3B.1 |
E3 |
59.55-62.45 |
24.9-101.4 |
BS00070456_51~BS00032830_51 |
4.21 |
5.21 |
1.02 |
8 |
Qgpc-3B.2 |
E2/E3 |
68.15-71.75 |
39.9-133.9 |
BS00087825_51~BS00092492_51 |
6.93 |
6.91 |
-0.32 |
9 |
Qgpc-3B.3 |
E4 |
106.25-106.4 |
146.8-153.7 |
IAAV1079~Kukri_c65979_1410 |
5.00 |
7.48 |
0.33 |
10 |
Qgpc-4A |
E1 |
13.55-15.75 |
544.8-570.3 |
wsnp_Ex_c24443_33688268~GENE-2947_175 |
3.00 |
4.50 |
0.13 |
11 |
Qgpc-5A.1 |
E2 |
16.45-17.15 |
390.3-398.2 |
RAC875_rep_c113386_131~wsnp_Ex_c14839_22961659 |
4.05 |
3.94 |
0.23 |
12 |
Qgpc-5A.2 |
E1 |
43.65-46.05 |
503.1-503.8 |
GENE-3314_78~IAAV5294 |
3.89 |
5.84 |
0.14 |
13 |
Qgpc-5B.1 |
E1 |
6.45-8.95 |
4.8-9.0 |
BS00065164_51~Kukri_c65921_274 |
3.75 |
5.57 |
-0.14 |
14 |
Qgpc-5B.2 |
E4 |
45.35-46.15 |
545.4-546.9 |
Tdurum_contig25513_195~wsnp_Ex_c8659_14515623 |
3.23 |
4.74 |
-0.16 |
15 |
Qgpc-5B.3 |
E4 |
129.25-129.75 |
682.6-697.7 |
Excalibur_c6346_266~IAAV8023 |
6.07 |
9.14 |
0.24 |
16 |
Qgpc-7A.1 |
E3 |
55.25-56.8 |
205.4-210.9 |
Kukri_c10197_186~Tdurum_contig4676_3592 |
4.65 |
5.77 |
1.07 |
17 |
Qgpc-7A.2 |
E2 |
0-2.55 |
670.7-679.9 |
Ku_c62742_888~wsnp_Ku_c42539_50247333 |
9.11 |
9.21 |
0.39 |
Positive effect and negative effect were contributed by Ningmai 9 and Yangmai 158, respectively.
PVE, phenotypic variance explained; AE, additive effect; QTL, quantitative trait loci, LOD, logarithm of the odds
Table 5 Association mapping for grain protein content
Type |
Chromosome |
Physical interval (Mb) |
Peak marker |
P value |
Environment |
A |
1B |
633.1-656.7 |
AX-109109358 |
1.99×10-5 |
E1/E2 |
|
3A |
682.2-727.3 |
AX-109840225 |
1.04×10-5 |
E1/E2/E5 |
|
7D |
232.6-280.4 |
AX-110444859 |
6.02×10-5 |
E4/E5 |
|
|
|
|
|
|
B |
1D |
460.7-471.4 |
AX-94759528 |
5.97×10-5 |
E1 |
|
3A |
1.9-9.9 |
AX-110464884 |
2.10×10-5 |
E4 |
|
3B |
0-4.4 |
AX-94806155 |
3.01×10-5 |
E4 |
|
3D |
0-5.5 |
AX-110403551 |
8.78×10-6 |
E4 |
|
4A |
543.2-543.3 |
AX-108860370 |
3.43×10-5 |
E4 |
|
6B |
6.6-7.8 |
AX-94768259 |
8.19×10-6 |
E5 |
|
7A |
688.9-694.1 |
AX-109879999 |
1.01×10-5 |
E3 |
|
|
|
|
|
|
C |
2A |
763.3-770.0 |
AX-111461751 |
5.91×10-4 |
E1/E2/E3 |
|
2B |
31.4-49.4 |
AX-109372952 |
1.86×10-4 |
E2/E3/E5 |
|
2D |
8.7-29.9 |
AX-94514671 |
2.47×10-4 |
E2/E3/E4/E5 |
|
3D |
363.7-382.3 |
AX-94457474 |
2.31×10-4 |
E3/E5 |
|
4B |
9.9-22.0 |
AX-110976982 |
1.19×10-4 |
E3/E4/E5 |
|
5A |
2.8-14.3 |
AX-110963298 |
3.12×10-4 |
E4/E5 |
|
5A |
504.8-512.1 |
AX-111524034 |
2.18×10-4 |
E2/E5 |
Table 6 Candidate markers for grain protein content based on QTL mapping and association mapping
Name |
Marker |
SNP |
GPC-increased allele |
GPC-decreased allele |
Physical interval (Mb) |
Method |
Kgpc-1B |
AX-111461175 |
A/G |
G |
A |
633.1-656.7 |
A |
Kgpc-2B |
BobWhite_c19945_341 |
T/G |
G |
T |
12.8-49.4 |
L+A |
Kgpc-2D |
BS00067757_51 |
A/G |
A |
G |
64.9-69.6 |
L |
Kgpc-3A |
AX-89352667 |
T/C |
C |
T |
682.2-727.3 |
L+A |
Kgpc-3B |
BS00087825_51 |
T/C |
C |
T |
39.9-133.9 |
L |
Kgpc-4A |
GEN×102947_175 |
T/C |
C |
T |
543.2-570.3 |
L+A |
Kgpc-5A |
Excalibur_rep_c111546_167 |
A/G |
G |
A |
503.1-503.8 |
L+A |
Kgpc-7A |
AX-108794609 |
A/G |
A |
G |
670.7-694.1 |
L+A |
Kgpc-7D |
AX-110278908 |
A/G |
A |
G |
232.6-280.4 |
A |
L: QTL mapping; A: association mapping