Genetic diversity of Zanthoxylum
Genetic diversity serves as the foundation for the long-term survival and evolutionary advancement of species. The extent of genetic diversity within a species determines its evolutionary potential and ability to withstand adverse environmental factors [29]. In the case of plants, research on genetic diversity is crucial for comprehending the level of genetic variation and genetic structure within species. This serves as a significant indicator for evaluating the genetic potential of germplasm resources. Additionally, these findings could lead to resource utilization, germplasm innovation, and varietal improvement while also providing recommendations for resource conservation and management [30, 31].
Molecular markers represent an effective method for studying species genetic diversity. There are various types of molecular markers with different characteristics. By combining different molecular markers, researchers can examine different segments of the genome, thereby enhancing the coverage and uniformity of polymorphic loci. This approach compensates for any limitations and drawbacks associated with using a single type of molecular marker, enabling researchers to gain a comprehensive understanding of the species' genetic information and enhancing the credibility of their findings [32].
The aim of this study was to assess the genetic diversity and relatedness among 80 Zanthoxylum accessions using SSR and iPBS molecular markers. SSR molecular markers are known for their superior variability and broad distribution within the genome. They are widely utilized across numerous genetic-related fields due to their codominance, high polymorphism, reproducibility, and consistent results [7]. In this study, we identified a total of 206 allelic variations among the 80 Zanthoxylum accessions using 32 selected SSR markers. Each marker displayed an average of 6.438 alleles (Na), an effective number of alleles (Ne) of 3.254, a Shannon's information index (I) of 1.336, and PIC values ranging from 0.400 to 0.827, with an average of 0.710. Notably, 30 markers exhibited high polymorphism levels (PIC > 0.5). Among the genetic diversity indices, Na and the PIC are particularly important for assessing molecular marker polymorphisms [33]. In this study, the values for these two indices were greater than those reported by Li et al. [9] (Na = 3.5; PIC = 0.48) and Feng et al. [13] (Na = 4.636) in Zanthoxylum. Taken together, these findings indicate that the SSR markers employed in this study exhibited overall high polymorphism, revealing the genetic diversity of the tested Zanthoxylum accessions.
Compared to SSR molecular labeling technology, iPBS molecular labeling technology offers a simpler, faster, and more cost-effective approach. Throughout this study, 10 iPBS primers were employed to amplify a total of 127 bands across the 80 Zanthoxylum accessions. The average polymorphism rate of the primers was 93.1%. The PIC values ranged from 0.201 to 0.324, with an average of 0.281, indicating a moderate level of polymorphism, consistent with research findings in Phoenix dactylifera [34] (PIC = 0.287) and Psidium guaJava [35] (PIC = 0.287). By combining the results of both sets of molecular markers, it was observed that the genetic diversity index obtained through iPBS markers was significantly lower than that obtained through SSR markers. This finding suggested that SSR markers possess greater polymorphism and are more suitable for analyzing the genetic diversity of Zanthoxylum germplasm resources. Such disparity is likely influenced by the number of markers used in this study; utilizing 32 SSR markers increases the likelihood of detecting greater genetic variation than does the use of only 10 iPBS markers. Moreover, SSR markers are codominant markers that distinguish between pure and heterozygous genotypes, thus conferring a greater advantage in revealing species genetic diversity than dominant markers. In summary, the utilization of both molecular markers revealed a considerable level of genetic diversity within the 80 Zanthoxylum accessions.
Genetic relationship of Zanthoxylum
The genetic similarity coefficient is a useful tool for evaluating genetic similarity. A higher genetic similarity coefficient indicates a closer genetic relationship and greater similarity between two individuals or groups, while a lower coefficient suggests greater genetic differentiation and greater genetic diversity [36]. Among the 80 Zanthoxylum accessions, the ranges of GS values obtained through the SSR, iPBS, and SSR + iPBS methods were 0.0947 ~ 0.9868, 0.2206 ~ 1.0000, and 0.1747 ~ 0.9921, respectively, with statistically significant differences. The average GS values were 0.3864, 0.5215, and 0.4422, respectively, indicating relatively rich genetic diversity and a high level of genetic variation among the tested Zanthoxylum accessions. SSR markers exhibited a wider range of GS variation and smaller average GS values than did the other markers, suggesting that SSR markers are more effective at detecting genetic variation. The genetic relationships revealed by the two marker types were consistent. For instance, in the iPBS results, GS values of 1 were obtained between 'Fengxiandahongpao' and 'Guojiadahongpao', 'Baishajiao' and 'Linzhouhonghuajiao', and 'Meishanqinghuajiao' and 'Wucitengjiao'. These same groups also had relatively large GS values (0.9744, 0.9868, and 0.9730) according to the SSR results, indicating very close genetic relationships. This may be attributed to inconsistent naming of the same variety in different regions, also known as the "same substance but different name" phenomenon. In summary, both SSR and iPBS markers can be employed to assess the phylogenetic relationships of the Zanthoxylum species. However, SSR markers showed greater diversity and a more comprehensive reflection of the phylogenetic relationships, suggesting it has greater polymorphism. Additionally, SSR + iPBS markers compensated for the limitations of iPBS markers and provided a more accurate representation of the genetic relationships among the tested Zanthoxylum accessions. The cluster analysis findings also supported these conclusions. Based on the SSR, iPBS, and SSR + iPBS markers, the 80 Zanthoxylum accessions were divided into three categories (Z. bungeanum, Z. armatum, and Z. piperitum), and closely related Zanthoxylum species were grouped together. However, when iPBS markers were used, 'Mianyangwuciqinghuajiao', which belongs to Z. armatum, was clustered with Z. bungeanum cultivars, indicating that SSR markers provided more accurate results. Furthermore, it is possible that the unique characteristics of 'Mianyangwuciqinghuajiao' contributed to this clustering result, as evidenced by the presence of multiple unique loci or band patterns. The calculated mean GS value of 'Mianyangwuciqinghuajiao' compared to those of the other 16 accessions of Z. armatum was only 0.391 (based on SSR + iPBS markers), indicating a distant relationship. These findings highlight the unique genetic variation of 'Mianyangwuciqinghuajiao', which may prove valuable in future efforts related to germplasm innovation and the development of new varieties. Additionally, on the clustering tree diagrams of both markers, it was observed that some Zanthoxylum accessions from the same region were not clustered together (Fig. 8). These findings suggest that long-term cultivation, domestication of Zanthoxylum species, and trading and introduction between different regions may have contributed to this phenomenon. Notably, the single Zanthoxylum accession from Germany was not grouped separately but instead clustered together with Chinese Zanthoxylum, indicating a shared origin, consistent with previous research conducted by Feng [37].
Genetic differentiation and genetic structure of Zanthoxylum
Gene differentiation (Fst) and gene flow (Nm) are crucial parameters for assessing genetic variation among populations, and they exhibit an inverse correlation wherein higher differentiation coefficients indicate lower levels of gene flow [38]. For Fst, the following categories are generally utilized: Fst ranges between 0 and 0.05, which suggests negligible genetic differentiation between populations; 0.05 and 0.15, which signifies a moderate degree of genetic differentiation; 0.15 and 0.25, which indicates a substantial degree of genetic differentiation; and Fst > 0.25, which signifies a high degree of genetic differentiation [39]. For Nm, it is generally accepted that Nm > 1 indicates that there is frequent gene exchange between populations, which prevents genetic differentiation of populations due to genetic drift and contributes to the maintenance of genetic stability of populations, while Nm < 1 indicates that gene flow is not sufficient to counteract the effects of genetic drift, thus contributing to the increase of genetic differentiation between populations [40]. In this study, we used SSR markers to analyze the genetic differentiation characteristics of three Zanthoxylum populations (Pop1, Pop2, and Pop3). The Fst values were 0.242, 0.335, and 0.429 between Pop1 and Pop2, Pop1 and Pop3, and Pop2 and Pop3, respectively, suggesting a high level of genetic differentiation among the three populations. Moreover, the mean Nm was 0.629 (< 1), indicating limited gene exchange among the populations. This can be attributed to the fusionless reproductive characteristics of Zanthoxylum species and the high levels of genetic differentiation among populations, which hinder gene flow [37]. Additionally, the AMOVA results indicated a high level of genetic differentiation among the tested Zanthoxylum accessions, with genetic variation predominantly arising within individuals (65%), while 35% of the genetic variation originated from between populations. Both cluster analysis and PCoA accurately categorized the 80 Zanthoxylum accessions into three groups corresponding to the three different Zanthoxylum species populations (Pop1, Pop2, and Pop3). The genetic analysis revealed substantial genetic distance (0.972) and low genetic concordance (0.383) among these three populations, further highlighting their high level of genetic differentiation. Geographical isolation is an important factor leading to population differentiation, due to environmental heterogeneity, genetic variation, and limited gene flow, resulting in the independent evolution of populations in different geographical regions [13, 41]. The distinct growth environments of these three groups contributed significantly to their differentiation, with Z. armatum found in frost-free regions of southwestern China characterized by warm and humid climates; Z. bungeanum exhibiting resilience and adaptability to wide areas with harsh climates (subtropical and temperate zones); mainly distributed in northern regions of the Qinling Mountains-Huaihe River in China [19]; and Z. piperitum concentrated in certain parts of Japan. Over an extended period, the combination of natural and artificial selection has limited genetic exchange between these Zanthoxylum populations, leading to significant differentiation. Generally, higher genetic diversity indicates greater complexity of plant diversity and greater potential for environmental adaptation [42]. Among the three populations, the Z. bungeanum population (Pop1) exhibited the highest genetic diversity, while the Z. piperitum population (Pop3) displayed the lowest. This discrepancy may be attributed to the number of samples and actual cultivars, as well as the stronger environmental adaptability and wider geographic distribution of Z. bungeanum. Consequently, Z. bungeanum germplasm resources can serve as crucial genetic breeding material for future cultivar selection and breeding endeavors.
Unlike the results of UPGMA cluster analysis and PCoA, Bayesian model-based population structure analysis classified the 80 Zanthoxylum accessions into two subgroups (Fig. 4), of which six Z. piperitum materials were not classified into a separate category, which may be related to the small number of materials of this species. Most of the 80 Zanthoxylum accessions (86%) had a single genetic component (Q-value ≥ 0.8), and only a few materials (14%) showed a mixture of both gene pools (Q-value < 0.8), suggesting a lack of genetic exchange between Zanthoxylum subgroups, which is consistent with the results of the analysis of population genetic differentiation.
Construction of DNA fingerprint map and fingerprinting power
DNA fingerprinting is a molecular-level method used to identify different biological individuals by utilizing molecular markers. It is not influenced by environmental factors or by the developmental stage of organisms. In the case of plants, DNA fingerprinting is valuable for accurately and rapidly identifying varieties, offering convenience for germplasm resource management, evaluation, protection of varietal rights, and crop breeding [44]. Among several molecular markers, SSR markers are widely regarded as the preferred method for constructing plant DNA fingerprints. They have been recognized as one of the most powerful marker systems for identifying plant variety and have been successfully applied across multiple species [8, 44]. For instance, He et al. [45] established the genetic fingerprints of 33 standard flue-cured tobacco varieties using 48 SSR markers and developed identification technology for new tobacco varieties based on SSR markers. Chen et al. [44] created a DNA fingerprinting database of 128 excellent oil camellia varieties using highly variable SSR markers.
PI and PIsibs are widely used as indicators of the fingerprinting power of molecular markers in studies of fingerprinting construction [28, 46]. In this study, the combined PI value of 32 SSR markers was 4.265 × 10− 27, and the low PI value showed high fingerprinting power. However, Waits et al. [28] argued that the assumption of independent segregation among sites does not hold because the substructure of plant populations is shaped by environmental and anthropogenic selection, leading to a possible overestimation of the theoretical PI, and thus PIsibs are usually used as a conservative upper limit for the PI; specifically, PI values of 1 × 10− 4 ~ 1 × 10− 2 are considered sufficient for application to the identification of individuals in natural populations. The PI and PIsibs values in this study were much lower than the putative values, indicating that the 32 SSR markers have a very high potential for fingerprinting. Therefore, we combined eight pairs of primers to construct DNA fingerprints for 80 Zanthoxylum cultivars, each of which was assigned a unique numerical code. However, it should be noted that the number of Zanthoxylum cultivars that can be identified by this fingerprint method is limited. As the number of Zanthoxylum accessions used for identification increases and new varieties are introduced and promoted, the number of new variant sites will increase as well. In such cases, timely and periodic updates to the fingerprint will be required to ensure its ongoing role in future research and application.
In comparison to SSR markers, iPBS markers have been less frequently employed to construct DNA fingerprints. Zeng et al. [47] successfully constructed fingerprints of 85 Cymbidium goeringii germplasm resources using two iPBS primers. Demirel et al. [48] used 17 iPBS markers to fingerprint and genetically analyze 151 potato genotypes. These studies demonstrated the feasibility of constructing plant fingerprints using iPBS markers. For our study, we selected 10 iPBS primers with high polymorphism and clear amplification bands from a pool of 83 primers. However, we found that these 10 iPBS markers were not sufficient to completely differentiate the 80 Zanthoxylum cultivars.
Notably, specific bands were observed in the amplification results for two markers, indicating that allelic loci, such as 'Hanyuanwuci ♀', 'Mianyangwuciqinghuajiao', 'Xizanghuajiao', 'Laiwuxiaohongpao', and 'Yaojiao', can serve as important molecular traits for cultivar identification. Considering factors such as the ease of banding, number of available markers, polymorphic information content of the primers, and amplification stability, we believe that SSR markers are more suitable for constructing DNA fingerprints of Zanthoxylum species. However, it is important to acknowledge that iPBS markers have valuable potential when genomic information is lacking for a species. Moreover, for materials that are difficult to identify using a single molecular marker, a combination of multiple markers can improve identification efficiency.
Currently, with the decreasing cost of high-throughput sequencing technology, the construction of DNA fingerprints using SSR and/or SNP markers has become the most popular choice [43]. Future research can focus on the development of these two marker types, as well as the collection of more comprehensive Zanthoxylum germplasm resources, to construct a more perfect fingerprint map. This endeavor holds significant importance for the conservation and development of Zanthoxylum germplasm resources.