Retrotransposon insertional polymorphism in sweet wormwood genome and IRAP analysis
Thirty-two IRAP primers were tested on 118 sweet wormwood accessions for initial screening (Table 1). The 12 single primers (Ty1-Copia, LTR6149, LTR6150, LTR1, Nikita, Sukkula, Tnt1.OL16, ToRTL1, 3ʹLTR, Tnt-1, 5ʹLTR1, 5ʹLTR2) and 20 primer combinations (3ʹLTR/LTR1, 3ʹLTR/Tnt-1, Bare1/LTR2, Nikita/LTR1, Nikita/Sukkula, Nikita/Ty1-Copia, Tnt-1/Sukkula, Tnt-1/5ʹLTR2, Tnt1.OL16/Ty1-Copia, Tnt1.OL16/Nikita, Tnt1.OL16/LTR6150, Tnt1.OL16/5ʹLTR1, Tnt1.OL16/5ʹLTR2, Tnt1.OL16/Sukkula, ToRTL1/Sukkula, ToRTL1/5ʹLTR2, LTR6150/5ʹLTR1, LTR6150/5ʹLTR2, 5ʹLTR1/3ʹLTR, 5ʹLTR1/5ʹLTR2) produced 849 distinct, scorable loci, of which 819 were polymorphic (mean polymorphism of 95.80% per primer; Table 1). Fourteen primers (LTR6149, LTR1, Sukkula, Tnt1.OL16, 3ʹLTR, Tnt-1, 3ʹLTR/LTR1, 3ʹLTR/Tnt-1, Bare1/LTR2, Nikita/LTR1, Nikita/Sukkula, Tnt-1/Sukkula, Tnt1.OL16/Ty1-Copia and ToRTL1/Sukkula) showed 100% polymorphism. The amplified fragments ranged from 100 to 3,000 bp, with most bands between 500 and 2,000 bp. The single primer Tnt1.OL16 generated the largest number of amplified and polymorphic loci and showed the highest Ne, He and I (Table 1, Supplementary Fig. 2). The lowest values of Ne, He and I were obtained for the primer combination ToRTL1/Sukkula. The number of scored loci per primer ranged from 12 to 60, with a mean of 26.53. Further details of the primers are given in Table 1. Supplementary Fig. 2 shows the banding patterns of the Tnt1.OL16 and LTR1 primers in some sweet wormwood accessions. RTN activity and a comparison of the RTN families are shown in Table 2.
Table 1
Characteristics of the IRAP primers used in this study
Primer | TNB | NPB | PPB (%) | Ne | I | He | Annealing temp. (°C) | Band size (bp) |
Ty1-Copia | 42 | 41 | 97.6 | 1.50 | 0.49 | 0.32 | 58 | 100–3000 |
LTR6149 | 18 | 18 | 100 | 1.56 | 0.48 | 0.32 | 65 | 150–2000 |
LTR6150 | 14 | 13 | 92.8 | 1.45 | 0.43 | 0.29 | 59 | 250–2000 |
LTR1 | 35 | 35 | 100 | 1.50 | 0.47 | 0.31 | 64 | 150–3000 |
Nikita | 20 | 19 | 95 | 1.44 | 0.43 | 0.27 | 65 | 200–2000 |
Sukkula | 23 | 23 | 100 | 1.52 | 0.48 | 0.31 | 64 | 100–3000 |
Tnt1.OL16 | 60 | 60 | 100 | 1.68 | 0.58 | 0.39 | 61 | 100–3000 |
ToRTL1 | 43 | 41 | 95.3 | 1.45 | 0.44 | 0.29 | 61 | 100–3000 |
3ʹLTR | 30 | 30 | 100 | 1.64 | 0.54 | 0.37 | 64 | 200–3000 |
Tnt-1 | 12 | 12 | 100 | 1.36 | 0.39 | 0.24 | 65 | 200–1500 |
5ʹLTR1 | 15 | 13 | 86.7 | 1.41 | 0.40 | 0.26 | 61 | 200–2000 |
5ʹLTR2 | 17 | 14 | 82.3 | 1.51 | 0.48 | 0.31 | 62 | 150–2000 |
3ʹLTR/LTR1 | 24 | 24 | 100 | 1.55 | 0.49 | 0.32 | 64 | 200–3000 |
3ʹLTR/Tnt-1 | 33 | 33 | 100 | 1.51 | 0.48 | 0.31 | 64 | 100–2500 |
Bare 1/LTR2 | 28 | 28 | 100 | 1.64 | 0.55 | 0.37 | 61 | 150–2500 |
Nikita/LTR1 | 21 | 21 | 100 | 1.55 | 0.49 | 0.32 | 64 | 150–2000 |
Nikita/Sukkula | 33 | 33 | 100 | 1.42 | 0.41 | 0.26 | 58 | 100–3000 |
Nikita/Ty1-Copia | 21 | 20 | 95.2 | 1.50 | 0.48 | 0.31 | 59 | 200–2500 |
Tnt-1/Sukkula | 25 | 25 | 100 | 1.39 | 0.40 | 0.25 | 65 | 150–2000 |
Tnt-1/5ʹLTR2 | 23 | 20 | 87 | 1.50 | 0.47 | 0.30 | 62 | 200–2000 |
Tnt1.OL16/Ty1-Copia | 42 | 42 | 100 | 1.31 | 0.36 | 0.21 | 58 | 100–2500 |
Tnt1.OL16/Nikita | 24 | 22 | 91.7 | 1.46 | 0.41 | 0.27 | 65 | 100–3000 |
Tnt1.OL16/LTR6150 | 15 | 14 | 93.3 | 1.43 | 0.41 | 0.29 | 65 | 100–1500 |
Tnt1.OL16/5ʹLTR1 | 21 | 20 | 95.2 | 1.49 | 0.46 | 0.30 | 60 | 100–2000 |
Tnt1.OL16/5ʹLTR2 | 26 | 24 | 92.3 | 1.51 | 0.48 | 0.31 | 59 | 100–2500 |
Tnt1.OL16/Sukkula | 17 | 16 | 94.1 | 1.41 | 0.44 | 0.31 | 60 | 200–2500 |
ToRTL1/Sukkula | 25 | 25 | 100 | 1.23 | 0.30 | 0.17 | 62 | 150–2500 |
ToRTL1/5ʹLTR2 | 31 | 30 | 96.8 | 1.55 | 0.49 | 0.31 | 63 | 100–2500 |
LTR6150/5ʹLTR1 | 34 | 32 | 94.1 | 1.56 | 0.49 | 0.31 | 60 | 100–3000 |
LTR6150/5ʹLTR2 | 25 | 23 | 92 | 1.48 | 0.49 | 0.30 | 64 | 100–2500 |
5ʹLTR1/3ʹLTR | 30 | 28 | 93.3 | 1.49 | 0.49 | 0.31 | 61 | 100–3000 |
5ʹLTR1/5ʹLTR2 | 22 | 20 | 90.9 | 1.48 | 0.47 | 0.30 | 61 | 200–2000 |
Total | 849 | 819 | - | - | - | - | - | - |
Average | 26.53 | 25.59 | 95.80 | 1.48 | 0.46 | 0.30 | - | - |
TNB total number bands, NPB number of polymorphic bands, PPB percentage of polymorphic bands, Ne effective number of alleles, I Shannon’s information index, He expected heterozygosity |
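The diversity indices reported in Table 1 can be illustrated for a single dominant (band present or absent) locus. The sketch below assumes the commonly used definitions Ne = 1/(p² + q²), He = 1 - (p² + q²) and I = -(p ln p + q ln q), with p taken directly as the band frequency. The software and exact estimator used in the study are not stated here, so the function name `locus_indices` and the input scores are illustrative only.

```python
import math

def locus_indices(bands):
    """Return (Ne, He, I) for one locus scored as 1 (band) or 0 (no band).

    Assumption: p is the observed band frequency and q = 1 - p. Other
    estimators for dominant markers exist and may give different values.
    """
    p = sum(bands) / len(bands)
    q = 1.0 - p
    homozygosity = p * p + q * q
    ne = 1.0 / homozygosity          # effective number of alleles
    he = 1.0 - homozygosity          # expected heterozygosity
    # Zero-frequency terms contribute nothing to Shannon's index.
    shannon = -sum(f * math.log(f) for f in (p, q) if f > 0)
    return ne, he, shannon

# Hypothetical scores for one locus across eight accessions.
ne, he, shannon = locus_indices([1, 0, 1, 1, 0, 0, 1, 0])
print(round(ne, 2), round(he, 2), round(shannon, 2))
```

Under these definitions a monomorphic locus gives Ne = 1, He = 0 and I = 0, while a locus with p = 0.5 gives the two-state maxima Ne = 2, He = 0.5 and I = ln 2 ≈ 0.69. The per-primer means in Table 1 fall between these bounds.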
Table 2
Comparison of RTN families in the studied sweet wormwood (A. annua) accessions from Iran
RTN family | TL | PL | PPL | He | Ne | I |
Ty1-Copia | 105 | 103 | 98 | 0.28 | 1.44 | 0.44 |
LTR6149 | 18 | 18 | 100 | 0.32 | 1.56 | 0.48 |
LTR6150 | 88 | 82 | 93 | 0.30 | 1.48 | 0.45 |
LTR1 | 80 | 80 | 100 | 0.32 | 1.53 | 0.48 |
Nikita | 119 | 115 | 97 | 0.28 | 1.47 | 0.44 |
Sukkula | 123 | 122 | 99 | 0.26 | 1.39 | 0.41 |
Tnt1.OL16 | 205 | 198 | 97 | 0.30 | 1.47 | 0.45 |
ToRTL1 | 99 | 96 | 97 | 0.26 | 1.41 | 0.41 |
3ʹLTR | 117 | 115 | 98 | 0.33 | 1.55 | 0.50 |
Tnt-1 | 93 | 90 | 97 | 0.27 | 1.44 | 0.43 |
5ʹLTR1 | 122 | 113 | 93 | 0.30 | 1.49 | 0.46 |
5ʹLTR2 | 144 | 131 | 91 | 0.30 | 1.50 | 0.48 |
LTR2 | 28 | 28 | 100 | 0.37 | 1.64 | 0.55 |
Bare 1 | 28 | 28 | 100 | 0.37 | 1.64 | 0.55 |
TL Total loci, PL Polymorphic loci, PPL Percentage of polymorphic loci, He Mean of expected heterozygosity, Ne Number of effective alleles, I Shannon’s information index |
Genetic diversity and population structure of Iranian sweet wormwood accessions
The properties of the IRAP loci amplified with the 32 IRAP primers are shown in Table 3. The percentage of polymorphic IRAP loci per population ranged from 26.02 (East Azerbaijan) to 95.35 (Mazandaran), with an average of 76.25. The number of private loci varied from zero in the East Azerbaijan population to 6 in the Mazandaran population, with an average of 3.5. The Golestan population had the highest genetic diversity (He = 0.26, I = 0.40), followed by Gilan (He = 0.24, I = 0.38), whereas East Azerbaijan showed the lowest diversity indices (He = 0.11, I = 0.16). To partition the total genetic variation within and among populations, AMOVA was carried out on the four populations using the 849 IRAP loci. The AMOVA was significant (PhiPT = 0.05, P ≤ 0.010), with most of the genetic variation residing within populations (94%) rather than among populations (6%) (data not shown). Gene flow estimated from PhiPT, Nm = 0.25[(1/PhiPT) - 1], was 3.987, indicating little genetic differentiation among the natural A. annua populations.
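The gene-flow estimate can be reproduced directly from the formula Nm = 0.25[(1/PhiPT) - 1]. The short sketch below evaluates it for the rounded PhiPT reported above and also inverts it for the reported Nm; the function names are illustrative, and the suggestion that Nm was computed from an unrounded PhiPT is an inference, not something stated in the text.

```python
def gene_flow(phi_pt):
    """Nm = 0.25 * (1 / PhiPT - 1), valid for 0 < PhiPT <= 1."""
    if not 0 < phi_pt <= 1:
        raise ValueError("PhiPT must lie in (0, 1]")
    return 0.25 * (1.0 / phi_pt - 1.0)

def phi_pt_from_nm(nm):
    """Invert the formula above: PhiPT = 1 / (4 * Nm + 1)."""
    return 1.0 / (4.0 * nm + 1.0)

# Nm implied by the rounded PhiPT of 0.05.
print(round(gene_flow(0.05), 3))
# PhiPT implied by the reported Nm of 3.987.
print(round(phi_pt_from_nm(3.987), 3))
```

The rounded PhiPT of 0.05 gives Nm = 4.75, whereas the reported Nm of 3.987 corresponds to a PhiPT of about 0.059, which is close to the 6% of variation found among populations.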
Table 3
Characteristics of amplified IRAP loci in the studied sweet wormwood (A. annua) populations
Parameters | East Azerbaijan | Gilan | Golestan | Mazandaran |
Number of loci | 566 | 828 | 815 | 830 |
Percentage of polymorphic loci | 26.02 | 93.31 | 90.33 | 95.35 |
No of loci with frequency ≥ 5% | 566 | 794 | 779 | 775 |
Number of private loci | 0 | 3 | 5 | 6 |
No of less common loci (≤ 25%) | 0 | 0 | 0 | 0 |
No of less common loci (≤ 50%) | 0 | 58 | 51 | 57 |
Mean of heterozygosity | 0.11 | 0.24 | 0.26 | 0.22 |
Shannon’s information index (I) | 0.16 | 0.38 | 0.40 | 0.36 |
Number of effective alleles (Ne) | 1.18 | 1.42 | 1.38 | 1.35 |
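Several rows of Table 3 are simple counts over a binary band matrix. The following sketch shows one way such counts could be obtained, assuming that a locus is "present" in a population when at least one accession carries the band, "polymorphic" when the band is neither absent from nor fixed in the population, and "private" when it is present in only one population. The denominator used for the percentage (all scored loci) is also an assumption; the study does not state these conventions, and the toy populations are hypothetical.

```python
def population_summary(populations):
    """populations: dict mapping name -> list of accessions (lists of 0/1)."""
    n_loci = len(next(iter(populations.values()))[0])
    present = {}   # name -> set of locus indices with at least one band
    summary = {}
    for name, rows in populations.items():
        counts = [sum(row[j] for row in rows) for j in range(n_loci)]
        present[name] = {j for j, c in enumerate(counts) if c > 0}
        polymorphic = sum(1 for c in counts if 0 < c < len(rows))
        summary[name] = {
            "loci": len(present[name]),
            "ppl": 100.0 * polymorphic / n_loci,  # assumed denominator
        }
    for name in populations:
        others = set().union(*(present[o] for o in populations if o != name))
        summary[name]["private"] = len(present[name] - others)
    return summary

# Hypothetical data: two populations, two accessions each, four loci.
toy = {
    "pop_A": [[1, 0, 1, 0], [1, 1, 1, 0]],
    "pop_B": [[1, 0, 0, 1], [1, 0, 0, 0]],
}
result = population_summary(toy)
print(result)
```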
To further elucidate the population structure, distance-based and model-based cluster analyses were performed. Neighbor-joining (NJ) clustering was used to examine the pattern of differences among the 118 accessions. Using the un-weighted neighbor-joining algorithm and the number-of-differences distance coefficient, the IRAP data grouped the 118 accessions of the four populations into three major clusters (Fig. 1). In the resulting tree, the accessions were grouped mainly according to their geographical origins, with minor mixing. Cluster I was dominated by accessions originating from the Mazandaran population and contained the largest number of accessions. In addition, three accessions from the Gilan population (Gilan-Raz Kan, Gilan-Rahim Abad and Gilan-Rahim Abad-Polam) were placed in cluster I. Cluster II comprised accessions originating from the Gilan population together with the two accessions from East Azerbaijan. In addition, five accessions from the Mazandaran population (Mazandaran-Mzarostaq, Mazandaran-Shirag, Mazandaran-Ncka-Chaleh Pola, Mazandaran-Ramsar-Ezaki and Mazandaran-Ramsar-Garesmasar) and two accessions from the Golestan population (Golestan-Tangrah2 and Golestan-Jokleh Bandan) were included in this cluster. Cluster III included most of the accessions from the Golestan population, along with three accessions from the Mazandaran population and three accessions from the Gilan population.
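The distance-based clustering step can be sketched as follows. The code computes the number-of-differences distance between binary IRAP profiles and then applies the standard neighbor-joining pair-selection criterion, Q(i, j) = (n - 2)d(i, j) - Σd(i, ·) - Σd(j, ·), to find the first pair to be joined. It is a minimal illustration only: the un-weighted variant used in the study may differ in later steps, the full tree construction is not shown, and the accession names and profiles are hypothetical.

```python
from itertools import combinations

def number_of_differences(a, b):
    """Count the loci at which two 0/1 band profiles differ."""
    return sum(x != y for x, y in zip(a, b))

def first_nj_pair(profiles):
    """Return the pair joined first by the NJ criterion and the distances."""
    names = list(profiles)
    n = len(names)
    d = {frozenset(p): number_of_differences(profiles[p[0]], profiles[p[1]])
         for p in combinations(names, 2)}
    # Sum of distances from each accession to all others.
    total = {i: sum(d[frozenset((i, j))] for j in names if j != i)
             for i in names}
    # The pair with the smallest Q value is joined first.
    q = {p: (n - 2) * d[frozenset(p)] - total[p[0]] - total[p[1]]
         for p in combinations(names, 2)}
    return min(q, key=q.get), d

# Hypothetical profiles for four accessions scored at six loci.
toy = {
    "acc_a": [1, 1, 0, 0, 1, 0],
    "acc_b": [1, 1, 0, 0, 0, 0],
    "acc_c": [0, 0, 1, 1, 1, 0],
    "acc_d": [0, 0, 1, 1, 1, 1],
}
pair, dist = first_nj_pair(toy)
print(pair, dist[frozenset(pair)])
```

In this toy example the two closest pairs tie on the criterion, so either could be joined first; a full implementation would then replace the joined pair by a new node and repeat until the tree is resolved.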
To resolve the pattern of variation, principal coordinate analysis (PCoA) was used to further elucidate relationships among the accessions and to assess population subdivision. The PCoA plot showed no distinct clustering pattern for the 118 sweet wormwood accessions studied (Fig. 2). The first three axes accounted for 27.00, 19.13 and 15.65% of the genetic variation, respectively, together explaining 61.78% of the total variation. The Mazandaran population was separated from the Golestan population on axis 1. The accessions of the Gilan population were scattered over a large area. The second coordinate did not clearly separate the accessions of Mazandaran, Gilan, Golestan and East Azerbaijan. In combination, coordinates 1 and 2 did not allow all three clusters of the dendrogram to be discriminated. The colour codes of the accessions in the two-dimensional PCoA plot agreed with the groups identified by the STRUCTURE analysis (Fig. 3). Because only 61.78% of the variation was captured by the first three coordinates, the sweet wormwood germplasm was also analysed with the model-based method implemented in the software STRUCTURE. Accessions within a cluster were represented by a single colour, while accessions with two different colours indicated admixed forms. The maximal ΔK was detected at K = 2, followed by K = 7. The ΔK value decreased with increasing K, with no peak at K > 7 (Fig. 3A). The estimated ΔK value was 222.23 for the 118 sweet wormwood accessions, indicating two subpopulations. More subgroups were observed at K = 7 (Fig. 3C), showing admixture among genotypes, consistent with the PCoA. Most sweet wormwood accessions showed admixture among the four populations in the STRUCTURE analysis at K = 7 (Fig. 3C).
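The ΔK statistic used to choose K is commonly computed with the Evanno method, as the absolute second-order change in the mean log-likelihood L(K) divided by the standard deviation of L(K) across replicate runs. The sketch below follows that convention using run means; the text does not describe how ΔK was calculated in this study, and the log-likelihood values are hypothetical toy numbers, not results from the sweet wormwood data.

```python
from statistics import mean, stdev

def delta_k(ln_prob):
    """ln_prob: dict mapping K -> list of L(K) values from replicate runs.

    Returns ΔK for every K that has both K - 1 and K + 1 available.
    """
    result = {}
    for k in sorted(ln_prob):
        if k - 1 in ln_prob and k + 1 in ln_prob:
            second_order = abs(
                mean(ln_prob[k + 1]) - 2 * mean(ln_prob[k]) + mean(ln_prob[k - 1])
            )
            result[k] = second_order / stdev(ln_prob[k])
    return result

# Hypothetical replicate log-likelihoods for K = 1..4 (three runs each).
toy_runs = {
    1: [-5000, -5002, -4998],
    2: [-4500, -4501, -4499],
    3: [-4450, -4455, -4445],
    4: [-4420, -4430, -4410],
}
dk = delta_k(toy_runs)
best_k = max(dk, key=dk.get)
print(dk, best_k)
```

ΔK is undefined for the smallest and largest K tested, so the range of K explored has to extend at least one step beyond the candidate values of interest.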