Identification of cauliflower CMS types and genetic divergence
The 76 CMS accessions (63 indigenously developed CMS lines and 13 CMS based hybrids) of cauliflower (Table 1) were screened using nineteen pairs of mitochondrial markers. These markers consist of 13 pairs of mitochondrial gene-specific primers: P1-P13 and six pairs ofmt-SSRs: P1-P13, designed based on theB. napusmitochondrial genome(Additional file 1: Table S1)[23, 38, 50]. Among the 13 pairs of mitochondrial gene specific primers, 7 pairs specific to orf138 were amplified. Although, they did not show any amplification for the control non CMSline (Fig 1).However, none of the primers governing pol, nap and tour cytoplasm generate any amplification.The 6 pairs of mitochondrial SSRs were amplified across the studied Indian cauliflower CMS lines. The amplification pattern of the gene based primers indicated that the cytoplasmtype of all the cytolines in Indian cauliflower was derived from Ogura CMS system;however, there was variation at nucleotide level among the studied CMS accessions. Two pairs of mt-SSRs, P15and P16specific toBnTR4 and orf125 genomic regions respectively, exhibited striking polymorphism among the CMS accessions. The amplicon size for the primer P15 and P16 varied between 200-280 bp and 410-470bp, respectively.
The PCA (Principal component analysis) and Neighbour joining (NJ) cluster analysis based on sequence of the mitochondrial amplicons obtained fromOguraCMS and in case of Brassica napusmitochondrialgenome based mt-SSR markers was used to estimate the diversity in origin of male sterile cytoplasm in cauliflower accessions. In cluster analysis the CMS cauliflower lines could be categorized into three major distinct groups each with two sub-groups and further sub-clusters in each sub-group (Fig. 2a, 2b). Broadly, six distinct sub-groups were determined based on the origin of their cytoplasm source (Fig. 2b). The six CMS lines (A38 to A43) including four CMS based hybrids CFH1522, Kimaya, Pahuja and Snowpearl remained in one single group; the other eight CMS accessions (A44 to A51) belonged to different group. Ten CMS accessions (A52 to A61) belonged to third group and other fifteen CMS lines (A62 to A76) including six CMS based hybrids (SM, Indam, KTCF-10A, Casper, Ponder, Brahma) formed fourth group. The nine CMS accessions (A29 to A37) represented in fifth group and rest of the twenty-eight CMS accessions (A1 to A28) including three CMS based hybrids of AcsenHyVeg private limited (HVCF-29, HVCF-18 and HVCF-16) formed the largest group (Fig. 2b).
Sequence features of polymorphic amplicons
The fragments amplified by primer P15 were of 251 bp for 46 CMS accessions under study and 220 bp for the remaining cytolines.The polymorphic amplicons of both these groups depicting 251 and 220 bp size were purified, cloned in pGEM-T vector and sequenced. The obtained sequences were subjected to multiple sequence alignment with each other and reference mitochondrial genome sequences of Brassicaceae crops exhibiting high degree of sequence similarity (Table 2). The accession numbers for the sequenced fragments were obtained from GenBank NCBI as described in the method section. The sequence analysis revealed that MN549523shared highly conserved region with MN549524 and high degree of similarity with other reference genomic regions of mitochondrial genome sequences of KU831325.1, KJ820683.1, AB627043.1, Sequence 3 P15, AP012988.1 and JF920286.1(Fig.3; Table 2).While, the MN549525 exhibited high degree of similarity with MN549526 and reference mitogenome sequences used in sequence alignment analysis such as Sequence 4 P15, AP018472.1 and AP012989.1 (Fig.3). The sequence alignment revealed single nucleotide polymorphism (SNP) at position 78 (C/T) and in addition, the deletion of 31 nucleotide between positions 78 and 110 for MN549525 and MN549526 (Fig. 3).
The amplicons generated by primer P16 were of size 420 bp in 42 CMS accessions under study and 471 bp in remaining. The obtained sequences were subjected to multiple sequence alignment with other mitogenome sequences of Brassicaceae crops available in NCBI database (Table 3). The sequence similarity analysis revealed that amplicons of MN549527 to MN549530shares a high degree of sequence similarity with each other and CMS related proteins in other Brassicaceae accessions AP018472.1, MG872827.1, JF920287.1, Seq6 P16, AP012990.1, AP012989.1 and AB694744.1 (Fig.4). Likewise, MN549531 exhibited highlyconserved regionwith Seq5 P16, KU831325.1 and KJ820683.1 (Fig. 4). The sequence alignment analysis revealed a deletion of size 51 bp between positions 371 to 423 in the exonic regions of ORFs for MN549527 to MN549530. While normal sequence without deletion was observed for MN549531.
Cytoplasmic genetic variations and floral malformations
We sought to investigate whether cytoplasmic genetic diversity associated with SNPs or InDels at loci BnTR4 and orf125(Fig. 3; Fig .4) has specifically impacted floral phenotypes of CMS lines. The polymorphism at these loci categorized the cytolines into two distinct groups (Fig. 2). It was evident that the cytolines exhibiting SNPs and deletions of 31 and 51 nucleotidesin the ORFs exonic region was associated with varying degree of floral abnormalities (Fig. 5; Table 4). The CMS lines with the absence of these InDels and SNPs identified by polymorphic markers had normal flower structure with no deformities which indicated the role ofcytoplasmic genetic variations in determining floral phenotypes.The major deformities recorded in the CMS lines were (i) adherence of functional stamens with style (ii) homeotic-like floral transformationpetaloidy condition of stamens (iii) partial petaloidy of functional stamens (iv) splitted style along with adherence of stamens (v) stigma hidden inside the petals (vi) splitted style along with exposed ovules (vii) unopened flower (viii) stamens adherence with style and crooked stigma (ix) partially opened flowers (x) absence of non-functional stamens (xi) fused flower (xii) curved functional stamens with crooked stigma and (xiii) absence of nectaries. Most of the CMS lines with none of the above floral deformities had normal female fertility and seed set(Fig. 5; Table 4).
Analysis of ORFs and similarity of genes, phylogenetic relationships
The open reading frame (ORF) analysis revealed nucleotide deletion in the orf125 coding region ofthe male sterile cytoplasm. The protein sequence analysis of polymorphic amplicons and reference mitotypes showed deleted nucleotide encoded 17 amino acids (Fig. 6).Therefore, polymorphism of P15 (BnTR4) and P16 (orf125) could be ascribed to SNPs or fragment insertion/deletion near the mt-SSR loci. The amino acid sequence analysis in the region of ORF depicted the polymorphic region amplifiedby P16 mt-SSR is located in the coding region of orf125 protein of B. oleracea (wild cabbage), B. juncea, B. rapassp. oleifera, Erucavesicariasubsp. sativa, B. oleracea var. capitataand B. oleracea var. botrytis (Additional file 1: Table S2)besides its location in B. carinataexonic region of orf108c. The protein sequence analysis revealed deletion of amino acids related to array of floral deformities withhigh degree of similarity with orf108c in B. carinata(Additional file 1: Table S2).The protein sequences devoid of deletion exhibited high similarity with orf125 in B. oleracea.
Polymorphism exhibited with with P15mt-SSR and mt-DNA sequences of other 9 Brassicaceae mitotypes were broadly grouped into two major groups (Fig.7a). The phylogenetic analysis revealed close affinity of cauliflower CMS lines,Ogu16A with Ogu12A and Ogu307-33A with Ogu33-1A. These cytoplasmic sources were clustered in one major group with one diploid accession of B. oleracea. This clustering in one group concurs withrapa/oleracea lineage propounded by Warwick and Black [51] based on chloroplast genome. Another group represented diploid B. oleracea var. botrytis, B. oleracea var. capitata, B. oleracea var. italica, Raphanus sativus var. kosenaand B. nigra. This grouping also corresponded to rapa/oleracea lineage as proposed by Warwick and Black [51]except one species of nigralineage.Another group also confined to rapa/oleracea lineage with the exception of one species of nigralineage [51]. Similarly, the polymorphic amplicons based on P16 mitochondrial marker and corresponding mt-DNA sequences of ten Brassicaceae mitotypes were clustered two major groups (Fig.7b). The selected CMS sources, Ogu50A and Ogu17A were distantly placed from other CMS sources. OguHL-3A, Ogu12A and Ogu121-2A exhibited close affinity with each other and B. oleracea var. capitatabased on phylogenetic analysis. The CMS line Ogu50A depicted high affinity with KJ820683.1 (B.oleraceavar. botrytisL.)and Ogu17A showed high affinity with mitogenome of B. nigraand R. sativusvar. Kosena,and clusteredin one major group along with Raphanus sativus cv. Ms-gensuke, Ogu50A and B. oleracea var. botrytis (KJ820683.1). The black radish distantly placed itself from other members of its group.
Effects of mito-nuclear genomic interactions on floral-nectar phenotypes in different nuclear backgrounds
Effect of cytonuclear interactions on floral qualitative traits
In the present investigation we sought to analyze whether cytonuclear interactions specifically affected floral qualitative traits in the cytolines of Indian cauliflowers. Cauliflower cytolines along with their respective maintainers were characterized for five traitsnamelystyle shape, petal color, presence of floral nectaries, presence of viable pollen and type of ovary (Additional file 1: Table S3,Additional file 2: Fig.S1-S2).All the CMS lines had normal ovary and varied nectaries. None of the CMS line showed the presence of viable pollen grains and few lines were devoid of anther and pollen grains. The CMS accessions had straight to slightly curved or fully curved stigma. Yellow colored petals were predominant however white petals were recorded in Ogu33A, Ogu134-8A, Ogu13A, Ogu118-6A, OguKt-2-6A and Ogu118-2A.
Cytonuclear interactions influencing floral reproductive whorls
Comparative analysis for floral reproductive whorls and floral phenotypes of cauliflower cytolines and their respective male fertile counterparts was conducted to determine the effects of mito-nuclear co-adaptaion and disruption. The per se performance of several cytolines lines and their respective maintainers recorded significant variation for different floral phenotype (Additional file 1: Table S4, Additional file 2: Fig. S2).Petal length varied from 12.91 mm (Ogu76-4A) to 17.92 mm (Ogu2-6A). The longest petal was recorded in the CMS line Ogu2-6A followed by Ogu126-1A and Ogu310-8A. The petal width ranged from 4.18 mm (Ogu13-85-2A) to 7.83 mm (Ogu2-6A). The CMS line, Ogu2-6A followed by Ogu178-8A and Ogu33A-1301 had the widest petals. The ratio of petal length to petal width varied from 1.88 (Ogu33A-1301) to 3.82 (Ogu13-85-2A) and ratio was>2 except Ogu-13-01-5A, Ogu307-1A and Ogu-13-01-33A.The considerable differences were also observed for sepal size,and the sepal length and width varied from 6.23 mm (Ogu15A) to 10.28 mm (Ogu2-6A) and 2.21 mm (Ogu13-85-2A) to 3.53 mm (Ogu115-8A), respectively. The ratio of sepal length to sepal width ranged from 2.19 (Ogu15A) to 3.88 (OguKt-2-1A). Like petal size, sepal size too was significantly reduced in the male sterile lines. The length of short (non-functional) and long (functional) filament ranged from 2.30 mm (Ogu16A) to 6.38 mm (Ogu310-8A) and 3.40 mm (33A-1301) to 7.52 mm (Ogu310-8A), respectively. The introgression of sterile cytoplasm resulted in elimination of non-functional filaments in 9CMS lines. Marked differences were also observed for stamen length, as the short stamen (non-functional) length varied from 3.41 mm (Ogu16A) to 8.13 mm (Ogu310-8A) and long stamen (functional) length from 4.79 mm (33A-1301) to 9.48 mm (Ogu2-6A). The ratio of functional (long stamen) to non-functional anthers (short stamen) was determined to detect any changes in CMS types and it varied from 1.03 (OguKt-2-1A) to 2.07 (Ogu77-4A) and it varied from 1.03 to 1.26 in maintainer lines. The 9 CMS lines viz. Ogu121-1A, Ogu122-1A, Ogu126-1A, OguKt-9-2A, OguKt-8-2A, Ogu118-3A, Ogu2A, Ogu3A, and Ogu118-2A were completely devoid of nonfunctional stamens. The significant differences were also observed in all the CMS lines for style length and it varied from 5.32 mm (Ogu14A) to 9.44 mm (Ogu16A). However, in the male fertile counterparts the style length varied from 6.33 mm to 10.86 mm. These results clearly indicated the effect of cytonuclear genomic incompatibilities on reproductive whorls. Reduction in style length, stamen length and even absence of short stamens was also recorded in the cytolinesin different nuclear backgrounds. Generally, the position of style was higher than the stamens in majority of the cytolines except Ogu-Kt-2-1A, Ogu122-5A, Ogu118-4A, Ogu33-1A, Ogu33A, Ogu77-4A, OguKt-9-2A, Ogu3A and Ogu14A (Additional file 1: Table S4). Introgression of ogura cytoplasm invariably reduced the stamen length in call Indian cauliflower lines. The relative position of stamens and stigma was determined by estimating the ratio between functional stamens and style length. The ratio varied from 0.62 (33A-1301) to 1.30 (Ogu14A) in CMS lines and 0.98 to 1.46 in male fertile counterparts.
Impact of cytonuclear interactions on nectar production
Comparative nectar quantity in the cytolines and their respective male fertile maintainers was conducted to determine nuclear-cytoplasmic interaction for this trait. Significant variation was observed in the cytolines lines for nectar quantity (Table 5). The nectar quantity varied from 0.28 µl (Ogu15A) to 5.69 µl (Ogu1-8A) in the CMS lines, while in the male fertile counterpart the nectar quantity varied from 0.60 µl to 13.94 µl.The highest quantity of nectar was found in the CMS line, Ogu1-8A followed by Ogu308-6A and Ogu2-6A. The results (Table 5) revealed significant reduction in the nectar quantity ofcytolinesas compared to their respective male fertile counterparts.
Cluster analysis
The CMS lines were grouped into different clusters based on floral reproductive whorls and phenotypic traits and 2 major clusters (CI and CII) formed with two sub-clusters in cluster II (Fig.8). The sub-clusters (C-IIA and C-IIB) in the cluster-II were further grouped into sub-clusters C-IIA-1, C-IIA-2 and C-IIB-1, C-IIB-2. Majority of CMS lines were grouped into cluster II and cluster I had only 9 CMS lines. All the CMS accessions with the absence of non functional stamens remained in cluster-I. The 6 CMS lines with the functional stamens of length 8.5 mm to 9.48 mm remained in the sub-cluster C-IIA-1. The nine CMS accessions grouped into sub-cluster C-IIA-2 with nonfunctional filament length 3.1-3.8 mm and majority of the CMS lines in this cluster had sepal width < 2.45 mm. The sub-cluster C-IIB-1a contained five CMS lines with petal length < 14.5 mm, style length 6.28-6.6 mm and functional stamen length of 6.49-6.61 mm .The sub-cluster C-IIB-1b had ten CMS lines and majority of them (60%) have petal length 15-17 mm and sepal width 2.9-3.4 mm. The sub-cluster C-IIB-2a comprised of eight CMS lines and majority of them had sepal length > 8.7 mm (i.e. 8.7-9.8 mm). The remaining thirteen CMS lines were clustered into sub-cluster C-IIB-2b with petal length of >13 mm to < 15 mm in majority of the lines.