Phenotypic diversity
The variation in the phenotypes is a key factor for crop improvement. The phenotypic traits were divided into two groups’ i.e., germination parameters and growth parameters.
Germination parameters
Wide range of diversity was observed in germination parameters. The trait wise phenotypic variation was depicted by histogram (Figure 1). FGP spread from 10 to 100%, and peaked in frequency between 90 and 100%. The distribution was poorly fit with clear left skew (Figure 1a). MGT spread from 1 to 8 days, which peak frequency for MGT between 3 and 4 days. The distribution was slightly left skewed (Figure 1b). Germination Index (GI) GI spread from 0.1 to 1, with peak frequency recorded between 0.9 and 1. The distribution was left skewed (Figure 1c). Germination Energy (GE) spread from 0 to 100%, with frequency between 90 and 100%. The central distribution was not normal and left skewed (Figure 1d). Peak Value (PV) spread from 0 to 3.5, with marginal mode frequency between 2 and 2.5. The distribution were normally distributed (Figure 1e). Germination Speed (GS) spread from 0 to 7 seed /day, with frequency between 2 and 3 seed /day. The distribution was slight right skewed (Figure 1f). GR spread from 0 to 7 % /day, with peak in frequency for GR between 6 and 7% /day. The distribution was right skewed (Figure 1g). Germination Capacity (GC) spread from 10 to 100%, with peak in frequency between 90 and 100%. The distribution was left skewed (Figure 1h). All the data was validated by the normality test (p-value < AD-value) (Table 1). Based on these results, the null hypothesis is rejected which assumed the data follows a normal distribution. All these parameters was tested statistically and result showed that germination parameters (FGP, MGT, GI, GE, GS, GR and GC) were found to be statistically significant at α = 0.1% (Table 1). Outliers for FGP, GI, GS, GR and GC were identified and illustrated by boxplot (Supplementary Fig S1). Based on above results, it is concluded that BAS2000 and Hashemi Tarom were identified as expressing weak tolerant phenotypes consistently (high MGT, low GI, GS, and FGP, nil GE).
Growth parameters
Wide range of diversity was observed in growth parameters. The trait wise phenotypic variation was depicted by histogram (Figure 2). Total Length (TL) spread from 1 to 10 cm, and peaked in frequency between 5 and 6 cm. The TL were normally distributed (Figure 2a). Shoot Length (SL) spread from 1 to 6.5 cm, with peaked frequency between 4.5 and 5 cm. The distribution was slightly left skewed (Figure 2b). Root Length (RL) spread from 0 to 3 cm, with peaked frequency between 0.5 and 1 cm. The distribution were poorly fit with clear right skew (Figure 2c). Vigor Index (VI) spread from 0 to 1,000, with peaked frequency between 500 and 600. The VI were normally distributed (Figure 2d). Shoot Fresh Weight (SFW) spread from 0 to 35 mg, with peaked in frequency between 15 and 20 mg (Figure 2e). Root Fresh Weight (RFW) spread from 0 to 25 mg, with peaked frequency between 0 and 5 mg. The distribution was poorly fit with clear right skew (Figure 2f). Shoot Dry Weight (SDW) spread from 0.5 to 6 mg, with peaked frequency for SDW between 4 and 5 mg. The distribution was poorly fit with clear left skew (Figure 2g). Root Dry Weight (RDW) spread from 0 to 2.5 mg, with peaked frequency between 1 and 1.5 mg. The distribution were poorly fit with clear right skew (Figure 2h). Normality test indicated that the null hypothesis was rejected (p-value < AD-value). (Table 1). Outliers for FGP, GI, GS, GR and GC were identified and illustrated by boxplot. Boxplot suggested an outlier which complemented the AD test (Supplementary Fig. S2). All growth parameters (TL, SL, RL, VI, SFW, RFW, SDW, and RDW) varied statistically significant at α = 0.1%. Based on the results, it was concluded that BAS2000 variety was severely affected by salt stress. MCHKAB variety also indicated minimal growth in comparison to other varieties investigated. Interestingly, MR varieties including MR219, MR263 and MR211 ranked higher in terms of traits that suggest salt tolerant attributes.
Clustering analysis based on normalised phenotypic traits suggested significant variation between three subgroups. However, in this clustering, the weak and highly tolerant varieties grouping were significantly smaller (Figure 3). Shiroodi, Hashemi Tarom and BAS2000 were clustered together, presented as weak tolerant varieties. MR211 and MR219 are two Malaysian varieties which were clustered as highly tolerant traits. 87.5% of the remaining varieties clustered into moderately tolerant varieties (Table 2).
Genotyping Analysis
These genotyping markers collectively find 287 alleles. The number of alleles ranged from 2- 9 among which RM403, RM559, RM343, RM596 and RM479 each had the lowest (2) number of alleles, while RM481 and RM21 both shared the highest (9) number of alleles. The number of alleles averaged at 4.86 per locus (Table 3). Fifty markers were assigned as highly polymorphic (PIC > 0.50), seven as moderately polymorphic (0.50 < PIC < 0.25) and two markers were not categorized as no PIC value was calculated for them. RM21 had the highest PIC value i.e., 0.88, while RM462 had the lowest PIC value i.e., 4.0 (Table 3). The average PIC value is 0.69.
Population structure analysis
The model based analysis using STRUCTURE HARVESTER successfully interrogated the dataset using the evano approach, since there were 10 replicate sets (≥ 3), which allowed K to be analysed (Earl 2012). The result showed the number of K groups which best fit the dataset was three (where maximum Δk = 33.64). The K verses Δk graph indicated the presence of three groups in the rice germplasm (Figure 4a). The presence of the three groups was also illustrated by triangle cluster diagram (Figure 4b). Moreover, the presence of these groups was confirmed by QQ plot and results provide the clear indication of three groups shown by three different colours (Figure 4c). The groups represented moderately tolerant (Group A), highly tolerant (Group B) and weakly tolerant (Group C) subpopulations.
Comparatively, the dendrogram generated by DARwin using the distance-based approach establish a similar conclusion on the population structure among the varieties (Figure 5). There were clear sub clades present with increasing dissimilarity between them. Therefore, similar to the findings by Nachimuthu et al. (2015) suggested that both population structure approaches demonstrated similarity between the genotypes clustering. The dendrograms were labelled as Group I, Group II and Group III clusters, which correspond to weakly tolerant, highly tolerant and moderately tolerant clustering, respectively.
Genetic diversity
AMOVA test suggested that 95% genetic diversity was within the population, which implies significant genetic variation between each grouped subpopulation with salinity tolerant characteristics (Figure 6). The variation between all forty varieties were significantly smaller (5 %) given a relatively high PIC average and wide geographical origin of the cultivars tested, but are accounted by the large variation between those exhibiting similar salinity tolerance.