Physical mapping&nbsp;and&nbsp;InDel&nbsp;marker&nbsp;development&nbsp;for&nbsp;the&nbsp;restorer gene&nbsp;Rf2&nbsp;in&nbsp;cytoplasmic male sterile&nbsp;CMS-D8&nbsp;cotton&nbsp;

doi:10.21203/rs.3.rs-57917/v3

Download PDF

Research article

Physical mapping and InDel marker development for the restorer gene Rf2 in cytoplasmic male sterile CMS-D8 cotton

https://doi.org/10.21203/rs.3.rs-57917/v3

This work is licensed under a CC BY 4.0 License

Journal Publication

published 06 Jan, 2021

Read the published version in BMC Genomics →

You are reading this latest preprint version

Background: Cytoplasmic male sterile (CMS) with cytoplasm from Gossypium Trilobum (D8) fails to produce functional pollen. It is useful for commercial hybrid cotton seed production. The restore line of CMS-D8 containing Rf2 gene can restore the fertility of the corresponding sterile line. This study combined the whole genome resequencing bulked segregant analysis (BSA) with high-throughput SNP genotyping to accelerate the physical mapping of Rf2 locus in CMS-D8 cotton.

Methods: The fertility of backcross population ((sterile line×restorer line)×maintainer line) comprising of 1623 individuals was investigated in the field. The fertile pool (100 plants with fertile phenotypes, F-pool) and the sterile pool (100 plants with sterile phenotypes, S-pool) were constructed for BSA resequencing. The selection of 24 single nucleotide polymorphisms (SNP) through high-throughput genotyping and the development insertion and deletion (InDel) markers were conducted to narrow down the candidate interval. The pentapeptide repeat (PPR) family genes and upregulated genes in restore line in the candidate interval were analysed by qRT-PCR.

Results: The fertility investigation results showed that fertile and sterile separation ratio was consistent with 1:1. BSA resequencing technology, high-throughput SNP genotyping, and InDel markers were used to identify Rf2 locus on candidate interval of 1.48 Mb on Chromosome D05. Furthermore, it was quantified in this experiment that InDel markers co-segregated with Rf2 enhanced the selection of the restorer line. The qRT-PCR analysis revealed PPR family gene Gh_D05G3391 located in candidate interval had significantly lower expression than sterile and maintainer lines. In addition, utilization of anther RNA-Seq data of CMS-D8 identified that the expression level of Gh_D05G3374 encoding NB-ARC domain-containing disease resistance protein in restorer lines was significantly higher than that in sterile and maintainer lines.

Conclusions: This study not only enabled us to precisely locate the restore gene Rf2 but also evaluated the utilization of InDel markers for marker assisted selection in the CMS-D8 Rf2 cotton breeding line. The results of this study provide an important foundation for further studies on the mapping and cloning of restorer genes.

Epigenetics & Genomics

Cotton

CMS

Rf2

BSA

SNP

InDel

The cytoplasmic male sterility(CMS) system plays an important role in utilization of crop heterosis. CMS is a maternally inherited trait, that includes degenerate anthers, aborted pollen with carpelloid and petaloid stamens[1]. Current research has determined that the CMS phenotype is caused by mutations in the mitochondrial genome linked genes and reserved by fertility restorer genes in the nuclear genome[2-4].The CMS system avoids the removal of anthers, thereby enabling the generation of dramatically superior F1 progeniesthrough hybrid technology. These offsprings display significant advantages over their parents and existing popular cultivars in terms of yield, stress tolerance, adaptability, etc.[5]. The CMS phenomenon exists in more than 150 plants and is also used for hybrid breeding of crops, such as maize[6, 7], rice[8, 9],pepper[10]and sorghum[11].

Cotton (Gossypium hirsutum L.) is a vital source of fibre, oil, and the most important economic crop for the textile industry in the world.In cotton, the CMS system is an ideal way to improve hybrid yields[12],Harknessii(D2-2) cytoplasmic male sterile (CMS-D2) lines [13, 14], Trilobum(D8) cytoplasmic male sterile (CMS-D8) lines [15], and upland cotton cytoplasmic male sterile (104-7A, Xiangyuan A, Jin A) have been established and utilized [16]. Normally, different CMSlines could be recovered by different restorer genes. In cotton,therestorer gene Rf1of CMS-D2could restore the fertility of CMS-D2and CMS-D8 sterile lines, while fertility of CMS-D8 sterile lines could only be restore with Rf2[17].Furthermore,the Rf1 gene functions in sporophytes, whereasthe Rf2 gene has a gametophytic restoration system.Previous studies revealed that Rf1gene loci and Rf2 gene loci are not allelic, but these genes are tightly linked at a genetic distance of 0.93 cM on chromosome D05. The mapping and identification of the molecular markerslinked with the Rf1restorer gene in cotton has alreadybeen progressed[18-23].However,there are few researches about the Rf2, compared with Rf1.

With the increase in crop functional genome research, Rf genes have been successfully cloned in maize(Rf2)[24],petunia(Rf-PPR592)[25],radish(Rfo)[26, 27], rice(Rf1a, Rf1b, Rf2)[28-31],sorghum(Rf1) [32], and sugar beet(Rf1) [33]. Most of these genes encode PPR proteins, but Rf2in maize CMS-T, Rf17 in rice CMS-CW and Rf2 in rice CMS-LD encode aldehyde dehydrogenase, 178-amino-acid mitochondrial sorting protein and mitochondrial glycine-rich protein, respectively[24, 34, 35]. At present, the major bottleneck of cotton CMS breeding system is a narrow source of restorer genes and lack of excellent restorer lines compatible with a given sterile line. Unfortunately, no restorer gene has been cloned in cotton. Therefore, fine mapping and isolation of the restorer geneRf2 in upland cotton are highly needed for efficient breeding.Interestingly, bulked segregant analysis (BSA) make it possible to quickly locate molecular markers closely linked to the target gene by analysing the differences between SNPs and InDels in segregating population pools[36]. This method has already been used in gene mapping of Arabidopsis thaliana[37],rice[38-40] maize[41] and tomato[42]. The SNP and theInDelare the most abundant type of DNA sequence polymorphisms, found within the genomic sequence of each species[43, 44], and used in QTL analysis. These markers have widely been used in cultivar identification, construction of genetic maps, genetic diversity,map-based cloning, the detection of genotype/phenotype associations, and marker-assisted breeding (MAS) [45-47]. In recent years, the release of the upland cotton genomic sequence [48-50] and the rapid development of sequencing technology have enhanced the detection and application of SNP and InDel. Furthermore, the application of high-throughput genotyping methods makes SNP highly attractive genetic markers[51, 52].

The objectives of this study were to physically map restorer gene Rf2andto developInDelmarkers co-segregated with Rf2. A 1.88Mb candidate interval was obtained by combining BSA with high-throughput SNP genotyping using a BC1F1segregation population. Based on the InDel variation in the 1.88 Mb interval, the InDel markers were developed and used to narrow down a 1.48 Mb candidate interval. The PPR family genes and the genes selected by transcriptome data in candidate region were analysed by qRT-PCR. The InDel markers co-segregated with Rf2will be useful to trace Rf2 breeding restorer lines in cotton.

Anther observation and BC1F1 fertility analysis

The anthers of fertile plants had a large amount of pollen, while the sterile plants had no pollen, and their anthers did not crack. Overall, a total of 1623 BC1F1 plants were classified as 850 fertile and 773 sterile plants, and the ratio of the number of fertileplants (850) to the numberof sterile plants (773) fit a 1:1 segregation (χ2=3.6531 < χ2(0.05,1) = 3.84), confirming thatfertility restoration is conditioned by one dominantrestorer gene, Rf2.This result is consistent with the results of Zhang et al.[53].

Whole genome resequencing data analysis and evaluation

The two parent lines (maintainer line B and restorer line R), F-pool,and S-pool of the BC1F1 segregation population were sequenced.The Illumina platform was selected to constructthe paired-end (PE) library, and the PE fragment was between 300 and 500 bp; 1,251,289,091 reads were obtained(Table 1).The reads from samples were aligned to the referencegenome using BWA software, with >82.57% normal efficiency.For the sequencing results, the averageQ30 was 94.95%, and the average GC content was37.22%. A total of 177,874,504 reads were obtained for the R restorer line, with a Q30 value of 94.69%, and average GCcontent of 37.77%. On the other hand, 174,907,610 reads were obtained for the B maintainer line, witha Q30 value of 94.29%, and an average GC content of 36.69%.Finally, 465,660,282and 432,846,695 reads were obtained for the filial BC1F1 generation (fertile and sterile) with Q30 values of 94.83% and95.97%, and average GC content of 36.93% and 37.22%, respectively (Table 1).

BSA combining SNP-index and G’ values

The average sequencing depth of the parent lines and the offspring pools was 30.92×. Of these, the R restorer linehas a sequencing depth of 16.62×. The Bmaintainer line sequencing depth was 16.03×, whereas the sequencing depth of the filial BC1F1 generation was 47.77× + 43.26× (Table 2).

These reads were mapped onto the reference genomeof Gossypium hirsutum (Tm-1, http://mascotton.njau.edu.cn/info/1054/1118.htm).A total of 798,286 SNPs was obtained from the two mixed pools, and 72,108small InDelwere obtained from the mixed pools. We used two different methods to map the Rf2locus responsible for restoring fertility. As shown in Fig. 1 and Fig. 2, only one locus was identified, and both the SNP-index and G' value association algorithms mapped thislocus to chromosome D05. More specifically, thislocus was located in the region of 25.61Mb-59.94Mb(34.33Mb) using the SNP-index and G' value method.

Fine mapping of the Rf2 gene

It was difficult to determine the candidate gene of Rf2, since the candidate range of 34.33 Mb contains a large amount of genetic information. Thus, itwas necessary to fine map Rf2. We developed 24 SNP markers, and 23 valid SNP markers in this region were used for genotyping an additional 1423 individuals by high-throughput SNP genotyping. We found 6 recombinant plants in the BC1F1 population. The position of theRf2locus was narrowed down and was located between SNP563981 and SNP597385, a 1.88Mb region (Supplementary Table S2, the information of the SNP site). Next, we developed InDel markers on the correlated region, and InDel marker analyses revealed that 16InDel markers were polymorphic.These InDelmarkersnarroweddownthecandidate interval to 1.48 Mb forexisting recombinants atInDel marker sites. Finally, 10InDel markers were co-segregated with Rf2(Fig.3,Fig.4, Supplementary Table S3, InDel primers).

Marker-assisted breeding of restorer lines and CMS-D8 hybrid identification

Subsequently, 500 plants were randomly selected from the BC4F1 population of CMS-D8, and the InDel1327 marker was used for genotype analysis. The BC4F1 population wastypedbyvisual fertility investigations. The PCR products were analysed by agarose gel electrophoresis, and the results of agarose gel electrophoresis showed two different banding patterns. A single small PCR product was considered homozygous and lacked the restorer gene allele [S (rf2rf2)], indicating sterile plants, whereas two fragments were considered heterozygous at the restorer gene locus [S(Rf2rf2)], indicating fertile plants. Furthermore, the segregation ratio followed a 1 (Rf2rf2):1 (rf2rf2) (254 Rf2rf2: 246 rf2rf2, c20.05 = 0.1667 < 3.841), and the results were consistent with those of the fertility survey.

The plants were scanned with an atpA SCAR marker[2] and InDel1327 markers, as the hybrids of the CMS-D8 system have sterile cytoplasm and Rf2 heterozygous sites. The InDel1327 primer amplification product produced two bands, and the atpA SCAR marker amplification product produced a band with a size of 611 bp (Fig.5b). Therefore, the CMS-D8 hybrids with heterozygous restorer gene sites and sterile cytoplasm were differentiated by the genotyping of restorer genes and the identification of cytoplasm type.

Candidate gene selection and expression pattern analysis

To determine candidate genes, we adopted a method that combined the 67 genes in the interval with the functional annotation of Arabidopsis orthologues and transcriptome data [54]. The 67 genes were subjected to Gene Ontology (GO)analysis. The GO analysis indicated that most of the genes are involved in binding (Fig. 6).According to the successfully isolated restorer genes of other crops belonging to the PPR family, we used qRT-PCR to analyse PPR family genes in candidate interval. Interestingly, the candidate region oftheRf2 locus was found to contain 8 PPR genes (Gh_D05G3356, Gh_D05G3357, Gh_D05G3359, Gh_D05G3378, Gh_D05G3389, Gh_D05G3391, Gh_D05G3392, Gh_D05G3380) in the region of 1.48 Mb. The relative expression levels of the eight PPR family genes in the restorer line were not significantly higher than those of the maintainer and the sterile lines. However, the relative expression of the Gh_D05G3391 gene in the restorer line was significantly lower than that in the sterile and maintainer lines (Fig. 7).

Furthermore, the Gh_D05G3374, Gh_D05G3407 and Gh_D05G3417geneswerechosenbasedon FDR< 0.05 and |log2FC|>= 1 bythe RNAsequencedata (Supplementary Table S4). Since, the expression level in the restorer line was significantly higher than that in the sterile and maintainer lines.TheqRT-PCRresults showedthat theGh_D05G3417 gene in the restorer line was significantly higher than that in the sterile and maintainer lines(Fig. 7). Finally, two genes (Gh_D05G3391 and Gh_D05G3374) were selectedas possible candidate genes.

CMS is a common phenomenon that occurs in flowering plantsdue to interactions between the mitochondrial genome and the nuclear genome[55].CMS systems have been proven to be a proficient tool in hybrid seed production. Considering the importance of the CMSand restoration systems, numerous molecular mappingstudies have been performed on restorer genes in crops, and Rf geneshave alreadybeen isolatedinothercrops[24-33].With CMS systemsin cotton, fertilitycan be restored by restoring the genes Rf1 or Rf2. However, these two genes have not yet been identified and cloned. With the availability of upland cotton whole genome sequencing[48-50]and cotton mitochondrial genome sequencing[56], breakthroughs in the study of cotton CMS and restoration of fertility mechanisms can be realized in recent years.

Molecular marker discovery and fine mapping of the fertility restoring gene of CMS cotton

Some researchers have recently studiedcotton CMS systems for molecular marker development and fine mapping of fertility restoration genes. For instance, Liu et al. [18] identified 2 RAPD and 3 SSR markers closed linked with Rf1. Feng et al.[19] developed 4 STS markers associated with Rf1.Yin et al. [20] constructed a BAC library ofCMS-D2 restorer lines and reported that Rf1 was located 100 kb between two BAC clone overlapping regions. Yang et al. [57] identified 6 EST-SSR markers (NAU2650, NAU2924, NAU3205, NAU3652, NAU3938, and NAU4040) with a genetic distance of 0.327 cM linked to Rf1 of CMS-D2.Wu et al. [21] screened 13 molecular markers closely linked to Rf1and located Rf1 between the SSR markers BNL3535 and NAU3652, with genetic distances of 0.049 cM and 0.078, respectively. Recently, they have reported co-segregated InDel markers such as InDel-1891, InDel-3434, InDel-7525, InDel-9356 and InDel-R [22, 58]. Zhao et al. [23]used super-BSA and successfully mappedRf1to 1.35 Mb region of chromosome D05.Previous studies have shown that Rf1and Rf2 are tightly linked at a genetic distance of 0.93 cM on chromosome D05[17]. The findings of Wang et al. [59] revealed that CIR179-250 was strictly linked with both Rf1 and Rf2, which was located on chromosome D05(19th chromosome). The present study on theRf2 gene identified the location of the chromosome D05 base sequence as 54.3-55.78 Mb. Furthermore, the present study developed 10 InDel markers in the correlated region. These markers laid the foundation for locating and fine mapping Rf2in CMS-D8 cotton.

Mapping Rf2 using an efficient strategy

Traditional map-based cloning is an efficient approach to isolate genes/QTLs responsible for desired agronomic traits[60-62]. Usually, a genetic map of F2, double haploid (DH) or recombinant inbred line (RIL) populations based on hundreds of SSR or InDel markers is used to make a primary map.Then a near-isogenic line (NIL) is developed which based on MAS to narrow down the region of interest to a sufficient size to screen for a few candidate genes. Unfortunately, this workflow requires relatively more labour and time[63]. Compared with genetic mapping, the next generation sequencing (NGS) is a faster and reliable method for mapping [64]. Nevertheless, one mixed pool typically contains approximately 20–100 individuals and generally maps the target region at a Mb-level interval[65-67].Because of insufficient meiotic recombination events, we still have to perform fine mapping or use omics methods such as RNA-seq to further screen the candidate genes[68, 69].

High-throughput SNP genotypingis one of the dimorphic methods in which genotypes are confirmed by direct sequencing[70]. Ithas been successfully used to genotype interesting traits in plants [71, 72]. In this regard, Yang et al. [73] developed 1,536 SNP markers tomeasure genetic diversity by a high-throughput SNP genotyping method.

In this study, SNP-index and InDel-index analyses were used to first position the Rf2genewithin a 34.33 Mb region. Later on,twenty-three SNP sites selected in this region helped to narrow the Rf2gene to a 1.88Mb region. We developed InDel markers based on InDel variationsand used these markers to locate the Rf2gene in a 1.48 Mb region. We thus put forward an approach that could rapidly fine map geneloci using only a large BC1F1segregation population, especially for those traits governed by single nuclear-encoded genes. This can be achieved bydeveloping a large segregation population, mapping by sequencing analysis, and high-throughput SNP genotyping in a short time. Moreover, rapid and accurate identification of phenotypecan be performed with progeny tests for desired objectives. Our study results suggested that BSA-seqcombined with SNP genotyping can accelerate the mapping of loci controlling quality traits.

Utilization of InDel markers for MAS

Development of DNA markers linked to agronomically important traits and their use for MAS plays the role in promoting variety[74]. Andvarious types of molecular markers closely linked to cotton restorer genes have been developed[19, 21, 57], but these markers are difficult to use for molecular marker-assisted breeding because ofthe complex experimental processes or low sensitivityof the markers [22]. Very recently, PCR based InDel have become a popular gel based genotyping solution, since InDel hasthe advantages of co-dominant, inexpensive, and highly polymorphic nature[44, 75].Inthisstudy, InDel markers co-segregated with restorer genestrackedRf2 for molecular marker-assisted breeding.InDel markers developed on the region showed a higheridentification rate of the Rf2 phenotype than previously developed markers, when applied to thebreeding improvement of restorer lines.

Characteristics of the potential candidate gene Rf2

Currently, Rf genes have been successfully isolated from different crop species [76]. Most of these restorer genes belong to the PPR genefamily. PPR-type fertility restorer genes have been cloned for petunia[25],Ogura and Kosena cytoplasm in Raphanus sativus[26, 27, 77], BoroII CMS in Oryza sativa[78], A1 cytoplasm in Sorghum bicolor[32], Honglian CMS in rice[28, 29], and nap CMS in Brassica napus[79]. In this study, we explored theexpression patterns of 8PPR genes of theCMS-D8 systeminthe candidate interval, and the expression level of most genesin the restorer line was not significantly different from that of the sterile and the maintainer lines.Interestingly, D05G3391, a PPR family gene had significantly lower expression in restorer line than in the sterile and maintainer lines.

However, non-PPR restorer genes also exist in other crops,Rf2 in the maize CMS-T system encodes aldehyde dehydrogenase [24], Rf17of CMS-CW system and Rf2 of CMS-LD system in rice encode 178-amino-acid mitochondrial sorting protein and mitochondrial glycine-rich protein [34, 35]. Likewise, transcriptome data (unpublished data)was used to select upregulated genes of restorer line and then analysed by qRT-PCR. The relative expression of D05G3374, an NB-ARC disease-resistant protein gene, was significantly higher in restorer line than in sterile line and maintainer line.Based on the results of qRT-PCR, the candidate genes were not determined with the desired results in this study. The reason may be that Rf2 is derived from the nuclear gene of G. trilobum[15] and might not be available in the reference genome of G.hirsutum. Therefore, cloning of fertility restorer genes in cotton CMS systems still needs further investigation.

In our study, the BC1F1 population was chosen as a genetic population to map Rf2 of CMS-D8. Integration of BSA, high-throughput SNP genotyping, and InDel markers identified 1.48 Mb candidate interval on chromosome D05. The InDel markers co-segregated with Rf2 can be used to trace Rf2 for molecular marker-assisted breeding of restorer lines or hybrids. The qRT-PCR analysis identifiedGh_D05G3391andGh_D05G3374genesas a putative candidate in this interval.The InDel markers co-segregated with Rf2 canbe usednot only to trace Rf2 for molecular marker-assisted CMS breeding but also cornerstone in fine mapping and cloning of restorer genes in cotton.

Materials and sample collection

TheCMS-D8 system, a sterile line (A), maintainer line (B) and restorer line (R) were provided by the Institute of Cotton Research (ICR), Chinese Academy of Agricultural Science, Anyang, Henan, China. A BC1F1((A line ×Rline) ×B line) segregation population was constructed, and all materials were grown at the Cotton Research Farm at the ICR. Fresh leaves were obtained from the parent lines and BC1F1 population. Anthers from buds1-2 mm, 3 mm, and 4 mm in length were collected and combined from 100 plants. All harvested samples were snap-frozen in liquid nitrogen and stored at −80°C before use.

Fertility segregation analysis

During anthesis, visual fertility surveys were conducted for 1623 individuals of the BC1F1 population of CMS-D8 under fieldtrial conditions, three times per plant. The presence of pollen in a plant indicated fertility and was determined by squeezing the anthers between the fingers because the male sterility of CMS-D8 occurs during meiosis, the S(rf) gametes are sterile, and S(rf2rf2) produces no pollen. So,the observed values of fertile and sterile plants were obtained, the fertility trait segregation ratio compatibility test was carried outusingExcel software, and the Chi-square value was obtained to determine the genetic model of Rf2.

DNA extraction, library construction andIllumina sequencing

DNA was extracted by the CTAB method[80], and the quality of DNA was assessed by 1.2% agarose gel electrophoresis. The purity of DNA was examined usingan Agilent Technologies 2100 Bioanalyzer. The DNA concentration was estimated using a Qubit® DNA Assay Kit in a Qubit® 2.0 Fluorometer (Life Technologies, CA, USA).Equal amounts of DNA (1.5 μg/sample) from 100 BC1F1 plants with fertile phenotypes were mixed to form the fertile sample (F-pool), and those from another 100 plants with sterile phenotypes were mixed to form the sterile sample (S-pool). Sequencing libraries were generated using the VAHTSTM Universal DNA Library Prep Kit for Illumina® V3(Vazyme Biotech) according to the manufacturer’s recommendations. Briefly, the DNA samples were fragmented by sonication to a size of 300-500 bp. Then, the DNA fragments were end-polished, A-tailed, and ligated with the full-length adapter for Illumina sequencing by PCR amplification. Consequently, the PCR products were purified (VAHTSTM DNA Clean Beads (Vazyme #N411)), and libraries were analysed for size distribution by an Agilent 2100 Bioanalyzer and quantified by real-time PCR. The libraries constructed above were sequenced by the Illumina HiSeq platform, and 150 bp paired-end reads were generated with an insert size of approximately 350 bp.Thesequenceraw reads were submitted to the SRA database of NCBI under the BioprojectPRJNA685585.

Data analysis, data filtering, and alignment

The Fast x-toolkit (v 0.0.14–1) was used to filter out the low-quality reads such as reads with ≥10% unidentified nucleotides (N), reads with > 50% bases having phredquality < 5,r reads with > 10 nt aligned to the adapter, allowing ≤10% mismatches, and putative PCR duplicates generated by PCR amplification in the library construction process (read 1 and read 2 of two paired-end reads were completely identical).The released genome of Gossypium hirsutum was downloaded from the Cotton Research Institute (CRI) of Nanjing Agricultural University of China (http://mascotton.njau.edu.cn/Data.htm, v1.1) and used as a reference genome[49].For mapping to the reference genome, BWA(Burrows-Wheeler Aligner)[81] was used to align the clean reads of each sample against thereference genome (settings: mem -t 4 -k 32 –M -R). Alignment files were converted to BAM files usingSAMtools software [82] (settings: –bS –t). In addition, potential PCR duplications were removed usingthe SAMtools command “rmdup”. If multiple read pairs had identical external coordinates, only thepair with the highest mapping qualitywas retained.

SNP/InDel detection and annotation

Variant calling was performed for all samples by using the unified genotyper function in GATK [83] software. SNP was used as the variant filtration parameter in GATK (settings: --filterExpression "QD < 4.0 || FS > 60.0 || MQ < 40.0", -G_filter "GQ<20", --cluster WindowSize 4). InDel was filtered by Variant Filtration parameter (settings: --filter Expression "QD < 4.0 || FS > 200.0||Read PosRankSum < -20.0 || Inbreeding Coeff < -0.8 "). ANNOVAR [84], an efficient software tool, wasused to annotate the SNPsandInDels based on the GFF3 files for the reference genome.

Determination of a candidate interval using SNP index and G’values

To obtain a highly accurate SNPset, a range of filters were also employed[85].The homozygous SNPs between two parents were extracted from the VCF files for SNPs.The read depth information for homozygous SNPs in the above offspring pools was obtained tocalculate the SNP index [86]. The SNP index method was used for the analysis, and the SNP-index dot matched curve was obtained by regression fitting as described by Abe et al.[87].The G’ values areused for noise reduction while also addressing linkage disequilibrium (LD) between SNPs[88].To rule out the effects of unreliable markers, we screened markers based on the SNPindex using the following conditions: 1) the sequencing depth of both parents was greater than 8; 2) both pools had a sequencing depth greater than 10;3) the SNPindex values of the two pools were not greater than 0.8 or less than 0.2 at the same time; and4) the SNPindex value difference was greater than 0.8.Sliding window methods were used to present theSNP index of the whole genome. Usually, we used a window size of 2 Mb as the default settings, andused QTL-seqr software to calculate Δ (SNPs-index) and G', based on the markers that were scanned[89].

Fine mapping ofRf2usinghigh-throughput SNP genotypingandInDel makers

For the analysis, DNA was isolated from the two parental lines and the BC1F1 population. The informative molecular markers were used for genotyping each plant of the BC1F1 population, various recombinants in the target region were identified, and the linkage relationship between markers and the Rf2 locus was analysed for gene mapping. According to the results of sequencing mutation detection and BSA of sequencing candidate intervals, one SNP was selected for approximately every 1.5 Mb of physical distance. The selected SNPs met the requirement of having a variation index close to 0.5 in the F-pool and0 in the S-pool.A subset of 24 selected SNPs was used for genotyping by the Illumina HiSeq PE150 sequence. Individual BC1F1 population (except 200 plants that formed pools) plants were genotyped using high-throughput SNP genotyping. Based on the SNP locus exchange of individual plants, we narrowed down the range in which the target gene was located.

TheInDel markers were developed based on the insertion deletion mutation within the selection interval. Primers were designed using Oligo7 software [90] and synthesized commercially (TSINGKE Biological Technology, Zhengzhou, China). The PCR system consisted of 20 μL of PCR mixturethat contained 1× reaction buffer, 2.0 mM MgCl2, 0.2 mM dNTPs, 0.5 mM each primer, 1 U Taq DNA polymerase (Takara, Japan), and 50 ng of DNA template.The PCR amplification conditions were as follows: 35 cycles of denaturation at 94°C for 30 s, annealing at 56°C-58°C for 30 s, and extension at 72°C for 30 s. Then, the reaction was held at 4°C. The PCR products were visualizedby3.5% agarose gel electrophoresis. Based on the difference between the genotypes as assessedusing polymorphic markers, recombinants were identified inthe BC1F1 population and used to fine mapRf2.

Marker-assisted breeding of restorer lines

The utility of the InDel markers for marker-assisted selection was determined in a segregating population. First, the restorer line [N(Rf2Rf2)] of CMS-D8 was crossed with the recurrent parent [N(rf2rf2)], which has excellent agronomic characteristics. Beginning in the BC1F1 generation, the codominantInDel marker 1327 was used to track the restorer gene in each generation, and the other markers were used for further verification. Only those individuals verified by the markers were chosen as the female parent for successive backcrosses. In the BC4F1 population, 120 individuals were randomly selected, and then InDel1327 was used to perform segregation analysis. The individuals verified by the markers as homozygous at the restorer gene locus were test-crossed with the sterile line [S(rf2rf2)] to determine the segregation of the fertility phenotype in the offspring under field conditions. The atpA SCAR marker distinguishes the CMS cytoplasm from other types of cytoplasm[2].Here, the InDel 1327 marker was combined with theatpASCAR markerto identify hybrids in the CMS-D8 system.

Real-time quantitative PCR (qRT-PCR) analysis

The annotated genesin the interval were analysed, and real-time quantitative PCR was performed to identify PPR family genes and differentially expressed genes that were selected based on anther transcriptome data of the CMS-D8 system(unpublished data).Total RNA was isolated using the Sigma Spectrum Plant Total RNA Kit (Sigma-Aldrich, USA) according to the manufacturer’s protocol. Reverse transcription was conducted using the PrimeScriptTM RT Reagent Kit (TaKaRa, Beijing, China). TransStartTopGreen qPCRSuperMix (Trans gen, Beijing, China) was used according to the manufacturer's instructions to conduct qRT-PCR of the genes.The internal control gene used for qRT-PCR was the cottonHis3 gene (i.e., histone 3) with primers of GhHIS3F and GhHIS3R, and relative gene expression levels were calculated using the 2-△△CT method[91],as described in detail in previous studies[92-94]. Each gene in each sample was analyzed with three replicates and two technical replicates. All primers are listed in Table S1.

Ethics approval and consent to participate

All the cotton lines used and analysed were public and available for noncommercial purposes. This article did not contain any studies with human participants or animals performed by any of the authors.

Availability of data and materials

Not applicable

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by Tianshan Youth Program (2018Q010).The funding body had no role in the design, analysis, interpretation of data, or writing of the manuscript.

Authors' contributions

JYW and CZX designed the experiments.LPG, TXQ, HNT,HYZ, XQQ, and HLW

constructed the BC1F1 population and did field management. JJF, XXZ, and MZ performed the fertility survey, data analysis and qRT-PCR. JJF and JYW contributed to the preparation of the manuscript. All authors read and approved the final manuscript.

Acknowledgments

The authors appreciate the field staff for their anonymous work.

Availability of data and materials

The datasets generated and analysed during the current study are available in the NCBI Sequence Read Archive (SRA) database under Bioproject PRJNA685585repository(https://www.ncbi.nlm.nih.gov/bioproject/PRJNA685585).

BSA: Bulked segregant analysis; CMS: Cytoplasmic male sterility; MAS: Marker assisted selection; PPR: Pentapeptide repeat; Rf: Restorer of fertility; SNP: Single nucleotide polymorphisms; InDel: insertions/deletions

Chase CD:Cytoplasmic male sterility: a window to the world of plant mitochondrial-nuclear interactions.Trends Genet. 2007, 23(2):81-90.
Wu J, Gong Y, Cui M, Qi T, Guo L, Zhang J, Xing C:Molecular characterization of cytoplasmic male sterility conditioned by Gossypium harknessii cytoplasm (CMS-D2) in upland cotton.Euphytica. 2011, 181(1):17-29.
Linke B, Börner TJM:Mitochondrial effects on flower and pollen development.Mitochondrion. 2005, 5(6):389-402.
Hanson MR:Plant mitochondrial mutations and male sterility.Annual review of genetics. 1991, 25:461-486.
Bohra A, Jha UC, Adhimoolam P, Bisht D, Singh NP:Cytoplasmic male sterility (CMS) in hybrid breeding in field crops.Plant Cell Rep. 2016, 35(5):967-993.
Laughnan JR, Gabay-Laughnan S:Cytoplasmic male sterility in maize.Annual review of genetics. 1983, 17:27-48.
Jaqueth JS, Hou Z, Zheng P, Ren R, Nagel BA, Cutter G, Niu X, Vollbrecht E, Greene TW, Kumpatla SP:Fertility Restoration of Maize CMS-C Altered by a Single Amino Acid Substitution within the Rf4 bHLH Transcription Factor.The Plant journal : for cell and molecular biology. 2019.
Akagi H, Sakamoto M, Shinjyo C, Shimada H, Fujimura TJCG:A unique sequence located downstream from the rice mitochondrial atp6 may cause male sterility.Current Genetics. 1994, 25(1):52-58.
Chang Z, Chen Z, Wang N, Xie G, Lu J, Yan W, Zhou J, Tang X, Deng X:Construction of a male sterility system for hybrid rice breeding and seed production using a nuclear male sterility gene.Proceedings of the National Academy of Sciences of the United States of America. 2016, 113(49):14145-14150.
Ma Y, Huang W, Ji J, Gong Z, Yin C, Ahmed SS, Zhao Z:Maintaining and restoring cytoplasmic male sterility systems in pepper (Capsicum annuumL.).Genet Mol Res. 2013, 12(3):2320-2331.
Tang HV, Pring DRShaw LC, Salazar RA, Muza FR, Yan B, Schertz KFJPJ:Transcript processing internal to a mitochondrial open reading frame is correlated with fertility restoration in male-sterile sorghum.the Plant Journal. 2010, 10(1):123-133.
Zheng J, Kong X, Li B, Khan A, Li Z, Liu Y, Kang H, Ullah Dawar F, Zhou R:Comparative Transcriptome Analysis between a Novel Allohexaploid Cotton Progeny CMS Line LD6A and Its Maintainer Line LD6B.International journal of molecular sciences. 2019, 20(24).
Meyer VG:Male Sterility From Gossypium harknessii.Journal of Heredity. 1975, 62(1).
Weaver DB, Weaver JB:Inheritance of Pollen Fertility Restoration in Cytoplasmic Male-Sterile Upland Cotton1.Crop Science. 1977, 17(4):497-499.
Stewart J, Dugger C, Richter D:A new cytoplasmic male sterility and restorer for cotton.Proceedings of Beltwide Cotton Conferences, Nashville. 1992; 610.
Zhang C:Preliminary studies and breeding on cytoplasmic male sterility three-line of Gossypium hirsutum L.HuaZhong Agricultural University; 2005.
Zhang J, Stewart JM:Inheritance and Genetic Relationships of the D8 and D2-2 Restorer Genes for Cotton Cytoplasmic Male Sterility.Crop Science,2001, 41(2).
Liu L, Guo W, Zhu X, Zhang T:Inheritance and fine mapping of fertility restoration for cytoplasmic male sterility in Gossypium hirsutum L.Theor Appl Genet. 2003, 106(3):461-469.
Feng CD, Stewart JMD, Zhang J:STS markers linked to the Rf1 fertility restorer gene of cotton.Theor Appl Genet. 2005, 110(2):237-243.
Yin J, Guo W, Yang L, Liu L, Zhang T:Physical mapping of the Rf1 fertility-restoring gene to a 100 kb region in cotton.Theor Appl Genet. 2006, 112(7):1318-1325.
Wu J, Cao X, Guo L, Qi T, Wang H, Tang H, Zhang J, Xing C:Development of a candidate gene marker for Rf 1 based on a PPR gene in cytoplasmic male sterile CMS-D2 Upland cotton.Molecular Breeding. 2014, 34(1):231-240.
Wu J, Meng Z, Zhang X, Guo L, Qi T, Wang H, Tang H, Zhang J, Xing C:Development of InDel markers for the restorer gene Rf1 and assessment of their utility for marker-assisted selection in cotton.Euphytica. 2017, 213(11):251.
Zhao C, Zhao G, Geng Z, Wang Z, Wang K, Liu S, Zhang H, Guo B, Geng J:Physical mapping and candidate gene prediction of fertility restorer gene of cytoplasmic male sterility in cotton.BMC Genomics. 2018, 19(1):6.
Cui X, Wise RP, Schnable PS:The rf2 nuclear restorer gene of male-sterile T-cytoplasm maize.Science (New York, NY). 1996, 272(5266):1334-1336.
Bentolila S, Alfonso AA, Hanson MR:A pentatricopeptide repeat-containing gene restores fertility to cytoplasmic male-sterile plants.PNAS. 2002, 99(16):10887-10892.
Brown GG, Formanova N, Jin H, Wargachuk R, Dendy C, Patil P, Laforest M, Zhang J, Cheung WY, Landry BS:The radish Rfo restorer gene of Ogura cytoplasmic male sterility encodes a protein with multiple pentatricopeptide repeats.The Plant journal. 2003, 35(2):262-272.
Desloire S, Gherbi H, Laloui W, Marhadour S, Clouet V, Cattolico L, Falentin C, GiancolaS, Renard M, Budar Fet al: Identification of the fertility restoration locus, Rfo, in radish, as a member of the pentatricopeptide-repeat protein family. EMBO reports. 2003, 4(6):588-594.
Hu J, Wang K, Huang W, Liu G, Gao Y, Wang J, Huang Q, Ji Y, Qin X, Wan Let al:The rice pentatricopeptide repeat protein RF5 restores fertility in Hong-Lian cytoplasmic male-sterile lines via a complex with the glycine-rich protein GRP162. The Plant cell. 2012, 24(1):109-122.
Huang W, Yu C, Hu J, Wang L, Dan Z, Zhou W, He C, Zeng Y, Yao G, Qi Jet al:Pentatricopeptide-repeat family protein RF6 functions with hexokinase 6 to rescue rice cytoplasmic male sterility. PNAS. 2015, 112(48):14984-14989.
Kazama T, Toriyama K:A pentatricopeptide repeat-containing gene that promotes the processing of aberrant atp6 RNA of cytoplasmic male-sterile rice.FEBS letters 2003, 544(1-3):99-102.
Komori T, Ohta S, Murai N, Takakura Y, Kuraya Y, Suzuki S, Hiei Y, Imaseki H, Nitta N:Map-based cloning of a fertility restorer gene, Rf-1, in rice (Oryza sativa L.).The Plant journal .2004, 37(3):315-325.
Klein RR, Klein PE, Mullet JE, Minx P, Rooney WL, Schertz KF:Fertility restorer locusRf1 of sorghum (Sorghum bicolor L.) encodes a pentatricopeptide repeat protein not present in the colinear region of rice chromosome 12. Theoretical and Applied Genetics. 2006, 112(2):388-388.
Matsuhira H, Kagami H, Kurata M, Kitazaki K, Matsunaga M, Hamaguchi Y, Hagihara E, Ueda M, Harada M, Muramatsu Aet al:Unusual and typical features of a novel restorer-of-fertility gene of sugar beet (Beta vulgaris L.). Genetics. 2012, 192(4):1347-1358.
Fujii S, Toriyama K:Suppressed expression of Retrograde-Regulated Male Sterility restores pollen fertility in cytoplasmic male sterile rice plants.PNAS. 2009, 106(23):9513-9518.
Itabashi E, Iwata N, Fujii S, Kazama T, Toriyama K:The fertility restorer gene, Rf2, for Lead Rice-type cytoplasmic male sterility of rice encodes a mitochondrial glycine-rich protein.The Plant journal. 2011, 65(3):359-367.
Takagi H, Abe A, Yoshida K, Kosugi S, Natsume S, Mitsuoka C, Uemura A, Utsushi H, Tamiru M, Takuno Set al:QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. The Plant journal.2013, 74(1):174-183.
Schneeberger K, Ossowski S, Lanz C, Juul T, Petersen AH, Nielsen KL, Jorgensen JE, Weigel D, Andersen SU:SHOREmap: simultaneous mapping and mutation identification by deep sequencing.Nature methods. 2009, 6(8):550-551.
Cai Z, Jia P, Zhang J, Gan P, Shao Q, Jin G, Wang L, Jin J, Yang J, Luo J:Genetic analysis and fine mapping of a qualitative trait locus wpb1 for albino panicle branches in rice.PLoS One. 2019, 14(9):e0223228.
Sun J, Yang L, Wang J, Liu H, Zheng H, Xie D, Zhang M, Feng M, Jia Y, Zhao Het al:Identification of a cold-tolerant locus in rice (Oryza sativa L.) using bulked segregant analysis with a next-generation sequencing strategy. Rice (New York, NY). 2018, 11(1):24.
Abe A, Kosugi S, Yoshida K, Natsume S, Takagi H, Kanzaki H, Matsumura H, Yoshida K,Mitsuoka C, Tamiru Met al: Genome sequencing reveals agronomically important loci in rice using MutMap. Nat Biotechnol. 2012, 30(2):174-178.
Klein H, Xiao Y, Conklin PA, Govindarajulu R, Kelly JA, Scanlon MJ, Whipple CJ, Bartlett M:Bulked-Segregant Analysis Coupled to Whole Genome Sequencing (BSA-Seq) for Rapid Gene Cloning in Maize.G3 (Bethesda, Md). 2018, 8(11):3583-3592.
Liu G, Zhao T, You X, Jiang J, Li J, Xu X:Molecular mapping of the Cf-10 gene by combining SNP/InDel-index and linkage analysis in tomato (Solanum lycopersicum).BMC Plant Biol. 2019, 19(1):15.
Rafalski A:Applications of single nucleotide polymorphisms in crop genetics.Current opinion in plant biology. 2002, 5(2):94-100.
Weber JL, David D, Heil J, Fan Y, Zhao C, Marth G:Human diallelic insertion/deletion polymorphisms.Am J Hum Genet.2002, 71(4):854-862.
Flint-Garcia SA, Thuillet AC, Yu J, Pressoir G, Romero SM, Mitchell SE, Doebley J, Kresovich S, Goodman MM, Buckler ES:Maize association population: a high-resolution platform for quantitative trait locus dissection.The Plant journal.2005, 44(6):1054-1064.
Simko I, Haynes KG, Ewing EE, Costanzo S, Christ BJ, Jones RW:Mapping genes for resistance to Verticillium albo-atrum in tetraploid and diploid potato populations using haplotype association tests and genetic linkage analysis.Molecular genetics and genomics. 2004, 271(5):522-531.
Szalma SJ, Buckler ESt, Snook ME, McMullen MD:Association analysis of candidate genes for maysin and chlorogenic acid accumulation in maize silks.Theor Appl Genet.2005, 110(7):1324-1333.
Li F, Fan G, Lu C, Xiao G, Zou C, Kohel RJ, Ma Z, Shang H, Ma X, Wu Jet al:Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat Biotechnol.2015, 33(5):524-530.

Zhang T, Hu Y, Jiang W, Fang L, Guan X, Chen J, Zhang J, Saski CA, Scheffler BE, Stelly DMet al:Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat Biotechnol.2015, 33(5):531-537.
Wang M, Tu L, Yuan D, Zhu, Shen C, Li J, Liu F, Pei L, Wang P, Zhao Get al:Reference genome sequences of two cultivated allotetraploid cottons, Gossypium hirsutum and Gossypium barbadense. Nat Genet.2019, 51(2):224-229.
De la Vega FM, Lazaruk KD, Rhodes MD, Wenz MH:Assessment of two flexible and compatible SNP genotyping platforms: TaqMan SNP Genotyping Assays and the SNPlex Genotyping System.Mutation research.2005, 573(1-2):111-135.
Shen R, Fan JB, Campbell D, Chang W, Chen J, Doucet D, Yeakley J, Bibikova M, Wickham Garcia E, McBride Cet al:High-throughput SNP genotyping on universal bead arrays. Mutation research 2005, 573(1-2):70-82.

Zhang JF, Stewart JMD:CMS-D8 restoration in cotton is conditioned by one dominant gene.Crop Science. 2001, 41(2):283-288.
Ma J, Liu J, Pei W, Ma Q, Wang N, Zhang X, Cui Y, Li D, Liu G, Wu M et al:Genome-wide association study of the oil content in upland cotton (Gossypium hirsutum L.)and identification of GhPRXR1, a candidate gene for a stable QTLqOC-Dt5-1. Plant science.2019, 286:89-97..
Touzet P, Meyer EH:Cytoplasmic male sterility and mitochondrial metabolism in plants.Mitochondrion. 2014, 19:166-171.
Tang M, Chen Z, Grover CE, Wang Y, Li S, Liu G, Ma Z, Wendel JF, Hua J:Rapid evolutionary divergence of Gossypium barbadense and G. hirsutum mitochondrial genomes.BMC Genomics 2015, 16:770.
Yang L:Map-based cloning of fertility restoring gene of CMS and analysis of PPR gene family in cotton. Nanjing: Nanjing agricultural university; 2009.
Zhang B,Zhang X, Guo L, Qi T, Wang H, Tang H,Qiao X, Kashif S, Xing C, Wu J:Genome-wide analysis of Rf-PPR-like(RFL)genes and a new InDel marker development for Rf1 gene in cytoplasmic male sterile CMS-D2 Upland cotton.Journal of Cotton Research. 2018, 1(03):12-22.
Wang F, Yue B, Hu JG, Stewart JM, Zhang JF:A target region amplified polymorphism marker for fertility restorer gene Rf1 and chromosomal localization of rf1 and Rf2 in cotton.Crop Science.2009, 49(5):1602-1608.
Zuo W, Chao Q, Zhang N, Ye J, Tan G, Li B, Xing Y, Zhang B, Liu H, Fengler KAet al:A maize wall-associated kinase confers quantitative resistance to head smut. Nature Genetics.2015, 47(2):151-157.
Lu X, Xiong Q, Cheng T, Li QT, Liu XL, Bi YD, Li W, Zhang WK, Ma B, Lai YCet al:A PP2C-1 Allele Underlying a Quantitative Trait Locus Enhances Soybean 100-Seed Weight. Molecular plant 2017, 10(5):670-684.
Zhang Z, Li J, Pan Y, Li J, Zhou L, Shi H, Zeng Y, Guo H, Yang S, Zheng Wet al:Natural variation in CTB4a enhances rice adaptation to cold habitats. Nature communications.2017, 8:14788.
Zhu Z, Tan L, Fu Y, Liu F, Cai H, Xie D, Wu F, Wu J, Matsumoto T, Sun C:Genetic control of inflorescence architecture during rice domestication.Nature communications.2013, 4:2200.
Schneeberger K:Using next-generation sequencing to isolate mutant genes from forward genetic screens.Nature reviews Genetics.2014, 15(10):662-676.
Haase NJ, Beissinger T, Hirsch CN, Vaillancourt B, Deshpande S, Barry K, Buell CR, Kaeppler SM, de Leon N:Shared Genomic Regions Between Derivatives of a Large Segregating Population of Maize Identified Using Bulked Segregant Analysis Sequencing and Traditional Linkage Analysis.G3 (Bethesda, Md).2015, 5(8):1593-1602.
Geng X, Jiang C, Yang J, Wang L, Wu X, Wei W:Rapid Identification of Candidate Genes for Seed Weight Using the SLAF-Seq Method in Brassica napus.PLoS One.2016, 11(1):e0147580.
Kayam G, Brand Y, Faigenboim-Doron A, Patil A, Hedvat I, Hovav R:Fine-Mapping the Branching Habit Trait in Cultivated Peanut by Combining Bulked Segregant Analysis and High-Throughput Sequencing.Front Plant Sci.2017, 8:467.
Song J, Li Z, Liu Z, Guo Y, Qiu LJ:Next-Generation Sequencing from Bulked-Segregant Analysis Accelerates the Simultaneous Identification of Two Qualitative Genes in Soybean.Front Plant Sci.2017, 8:919.
Gu A, Meng C, Chen Y, Wei L, Dong H, Lu Y, Wang Y, Chen X, Zhao J, Shen S:Coupling Seq-BSA and RNA-Seq Analyses Reveal the Molecular Pathway and Genes Associated with Heading Type in Chinese Cabbage.Frontiers in genetics.2017, 8:176.
Ohnishi Y, Tanaka T, Ozaki K, Yamada R, Suzuki H, Nakamura YJJoHG:A high-throughput SNP typing system for genome-wide association studies.Gan to kagaku ryoho Cancer & chemotherapy. 2002, 29(11):2031-2036.71.Grattapaglia D, Silva-Junior OB, Kirst M, de Lima BM, Faria DA, Pappas GJ: High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species. BMC Plant Biology.2011, 11(1):65.
Yan J, Yang X, Shah T, Sánchez-Villeda H, Li J, Warburton M, Zhou Y, Crouch JH, Xu Y:High-throughput SNP genotyping with the GoldenGate assay in maize.Molecular Breeding.2010, 25(3):441-451.
Yang X, Yan J, Shah T, Warburton ML, Li Q, Li L, Gao Y, Chai Y, Fu Z, Zhou Yet al:Genetic analysis and characterization of a new maize association mapping panel for quantitative trait loci dissection. Theor Appl Gene.t 2010, 121(3):417-431.
Xu Y, Crouch JH:Marker-Assisted Selection in Plant Breeding: From Publications to Practice.Crop science.2008, 48(2):391-407.
Vasemägi A, Gross R, Palm D, Paaver T, Primmer CR:Discovery and application of insertion-deletion (INDEL) polymorphisms for QTL mapping of early life-history traits in Atlantic salmon.BMC Genomics.2010, 11(1):156.
Chen L, Liu YG:Male sterility and fertility restoration in crops.Annual review of plant biology.2014, 65:579-606.
Koizuka N, Imai R, Fujimoto H, Hayakawa T, Kimura Y, Kohno-Murase J, Sakai T, Kawasaki S, Imamura J:Genetic characterization of a pentatricopeptide repeat protein gene, orf687, that restores fertility in the cytoplasmic male-sterile Kosena radish.The Plant journal.2003, 34(4):407-415.
Wang Z, Zou Y, Li X, Zhang Q, Chen L, Wu H, Su D, Chen Y, Guo J, Luo Det al:Cytoplasmic male sterility of rice with boro II cytoplasm is caused by a cytotoxic peptide and is restored by two related PPR motif genes via distinct modes of mRNA silencing. The Plant cell.2006, 18(3):676-687.
Liu Z, Dong F, Wang X, Wang T, Su R, Hong D, Yang G:A pentatricopeptide repeat protein restores nap cytoplasmic male sterility in Brassica napus.Journal of experimental botany.2017, 68(15):4115-4123.
Zhang JF, Stewart JM:Economical and rapid method for extracting cotton genomic DNA.J Cotton Sci.2000, 4(3):193-201.
Li H, Durbin R:Fast and accurate short read alignment with Burrows-Wheeler transform.Bioinformatics (Oxford, England). 2009, 25(14):1754-1760.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan JJB:The Sequence Alignment-Map format and SAMtools.Bioinformatics (Oxford, England). 2009, 25(16):2078-2079.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly Met al:The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome research.2010, 20(9):1297-1303.
Wang K, Li M, Hakonarson H:ANNOVAR: functional annotation of genetic variantsfrom high-throughput sequencing data. Nucleic acids research.2010, 38(16):e164.
Reumers J, De Rijk P, Zhao H, Liekens A, Smeets D, Cleary J, Van Loo P, Van Den Bossche M, Catthoor K, Sabbe Bet al:Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing. Nat Biotechnol.2011, 30(1):61-68.
Takagi H, Abe A, Yoshida K, Kosugi S, Natsume S, Mitsuoka C, Uemura A, Utsushi H, Tamiru M, Takuno Set al:QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. The Plant journal : for cell and molecular biology.2013, 74(1):174-183.
Abe A, Kosugi S, Yoshida K, Natsume S, Takagi H, Kanzaki H, Matsumura H, Yoshida K, Mitsuoka C, Tamiru Met al:Genome sequencing reveals agronomically important loci in rice using MutMap. Nat Biotechnol.2012, 30(2):174-178.88.Siepel A, Magwene PM, Willis JH, Kelly JK: The Statistics of Bulk Segregant Analysis Using Next Generation Sequencing. PLoS Computational Biology.2011, 7(11).
Mansfeld BN, Grumet R:QTLseqr: An R Package for Bulk Segregant Analysis with Next-Generation Sequencing.The plant genome.2018, 11(2).
Rychlik W:OLIGO 7 primer analysis software. In:PCR primer design. Springer; 2007: 35-59.
Schmittgen TD, Livak KJ:Analyzing real-time PCR data by the comparative C(T) method.Nature protocols.2008, 3(6):1101-1108.
Wu J, Zhang M, Zhang B, Zhang X, Guo L, Qi T, Wang H, Zhang J, Xing C:Genome-wide comparative transcriptome analysis of CMS-D2 and its maintainer and restorer lines in upland cotton.BMC Genomics.2017, 18(1):454.
Zhang M, Guo L, Qi T, Zhang X, Tang H, Wang H, Qiao X, Zhang B, Feng J, Zuo Zet al:Integrated Methylome and Transcriptome Analysis between the CMS-D2 Line ZBA and Its Maintainer Line ZB in Upland Cotton. Int J Mol Sci.2019, 20(23).
Zhang M, Zhang X, Guo L, Qi T, Liu G, Feng J, Shahzad K, Zhang B, Li X, Wang Het al:Single-base resolution methylome of cotton cytoplasmic male sterility system reveals epigenomic changes in response to high-temperature stress during anther development. J Exp Bot.2020, 71(3):951-969.

Table 1

Results of high-throughput resequencing data mining
Sample	Reads	Bases	GC(%)	Q20(%)	Q30(%)
B	174,907,610	52,032,498,409	36.69	98.43	94.29
R	177,874,504	52,866,884,655	37.77	98.57	94.69
fertility-bulk	465,660,282	138,509,668,729	37.50	98.53	94.83
sterility-bulk	432,846,695	128,892,417,044	36.93	98.90	95.97
Mean	312,822,273	93,075,367,209	37.22	98.61	94.95
Sum	1,251,289,091	372,301,468,837	-	-	-

Table 2

Sequencing coverage and depth data
Sample	Coverage(%)	Mean Depth
B	82.94	16.03
R	79.95	16.62
fertility-bulk	83.44	47.77
sterility-bulk	83.93	43.26
Mean	82.57	30.92

SupplementaryTable.docx
Table S1 The primer of qRt-PCR Table S2 Genotype of SNP site Table S3 The primer of InDel markers Table S4 The FPKM of 67 genes in CMS-D8 system
Fig.S2.tif
Fig.S1.tif
Fig.S4.tif
Fig.S3.tif
Fig.S5.tif

Download PDF

Journal Publication

published 06 Jan, 2021

Read the published version in BMC Genomics →

Editorial decision: Accept
18 Dec, 2020
Editor assigned by journal
17 Dec, 2020
Submission checks completed at journal
17 Dec, 2020
Editor invited by journal
17 Dec, 2020

You are reading this latest preprint version

Physical mapping and InDel marker development for the restorer gene Rf2 in cytoplasmic male sterile CMS-D8 cotton

Status:

Journal Publication

Version 3

Abstract

Figures

Background

Results

Discussion

Conclusions

Methods

Declarations

Abbreviations

References

Tables

Supplementary Files

Status:

Journal Publication

Version 3