Genetic Diversity and population structure of Selected tef Core Germplasm Lines Based on Microsatellite Markers

Abstract

Tef is an indigenous and important food, feed and cash crop to the small-holder Ethiopian farmers. Knowledge of the natural genetic composition of the crop provides the option to further exploit its genetic potential through breeding. However, there are insufficient reports on the genetic variability of Ethiopian tef using a medium-throughput marker system. Hence, the current study was designed to evaluate the genetic variability of released and core germplasm which were collected earlier. Eighty-one tef genotypes collected from eight Ethiopian ecological zones and release variety were targeted using 14 SSR markers. The study ensued a total of 122.43 alleles through the entire loci and populations. All the used microsatellite loci were highly polymorphic with PIC ranging from 0.96 to 0.98 and mean of 0.976. The analysis showed the presence of high allelic diversity ranging from 0.94 to 0.97 with overall mean of 0.96. The molecular variance analysis indicated the existence of large genetic differentiation (FIS and FIT=0.87) with 86% and 13% of the total variation accounted by among genotypes within population and across from all population respectively. Whereas low genetic differentiation among population (FST=0.014. which accounts 1 %.) was observed. Multivariate analysis like clustering, and PCoA, analysis did not cluster genotypes into distinct groups according to their geographical areas of population. This is due to presumably gene flow among populations. The information of this study may be used as input for breeders in selecting breeding materials.

Introduction

Tef is well known as an indigenous and important food, feed and cash crop to the small-holder Ethiopian farmers. Among the cereals, tef covers 2.93 million hectares of land and this is equivalent to 27.8% of the total area allocated to cereal crop cultivation in Ethiopia (CSA, 2021). The same source indicates that the annual grain production of 5.51 million tons makes tef third following maize and wheat with 10.56 and 5.78 million tons, respectively.

In Ethiopia, tef grain is mainly used for making ‘injera’ and sometimes for making porridge, unraised bread (‘kita’), and local alcoholic drinks, called ‘tela’ and ‘katikala’. In general, tef is considered to be a highly nutritious grain having protein content in the range of 8 to 11 percent, similar to other more common cereals such as wheat (Baye et al., 2014), balanced amino acids with high lysine content (Jansen et al., 1962), and high iron content (Mengesh ,1966; Costanza et al., 1979). It has also got significantly higher calcium content than most major grains (Vohwinkel et al., 2002). Besides, tef straw is the most appreciated feed for cattle, while it also serves as a binding material to reinforce mud used for plastering the walls of huts and local grain storage facilities called gotera, and also as bedding material, mulch and fuel source. Moreover, tef grains are free of gluten which causes celiac disease. It is also a good alternative diet for diabetic people due to its low glycemic index as compared to most other cereals (Saturni et al., 2010).

Eragrostis tef (Zucc.) Trotter belongs to the class: Liliopsida, order: Poale, family: Poaceae, subfamily: Chloridodeae, genus: Eragrostis (Smale et al., 1996). It is allotetraploid (2n =4x= 40) originating from the hybridization of two distinct species followed by diploidization Cannarozzi et al. (2014).

Every individual is genetically unique by nature. Hence conservation of this unique ness is to maintain genetic diversity at many levels and to provide tools for population monitoring and assessment that can be used for conservation planning. Genetic diversity is generally defined as the extent of genotypic variability existing in a group of individuals. It provides opportunity for plant breeders to develop new and improved cultivars with desirable characteristics, which include both farmer-preferred traits (yield potential and large seed, etc.) and breeders preferred traits (pest and disease resistance and photosensitivity, etc.). It gives species that the ability to adapt to changing environments, including new pests and diseases and new climatic conditions, such as global warming. Hence assessment of genetic diversity in germplasm collections and genetic relationships among breeding materials is therefore essential in crop improvement strategies.

The indigenous tef genetic resources have been and will continue to be the major source of variability for its genetic improvement. In the early 1990s, a core tef germplasm set of 320 selected representative lines was established based on phenotypic evaluation and characterization of over 2255 tef germplasm collections (Ketema et al., 1993). Some studies detected tef genetic diversity at the molecular level using various types of DNA markers such as AFLP (Mulu Ayele et al., 1999; Bai et al., 1999), RAPD (Bai et al., 2000), ISSR (Assefa et al., 2003a), SSR (Zeid et al., 2012; Tesfaye et al., 2018; Fikre et al., 2018 and Jifar et al., 2020) and SNP (Girma et al., 2014). Among these, the studies so far indicate SSR markers are polymorphic in tef genotypes comparably (Girma et al., 2014)

A number of methods have been developed over the past decades for the assessment of genetic diversity in tef germplasm accessions, breeding lines and populations. This implies considerable progress in improving tef and developed and released more than 51 improved tef varieties (Chanyalew et al., 2021). However, the productivity of tef with its national average yield of 1.88 t/ha is still extremely low compared to other cereals (CSA, 2021). Among the major issues which cause low productivity are lodging, abiotic stresses such as drought and acidity, and biotic stresses (diseases and insect pests) (Assefa et al., 2015).

Therefore, germplasm screening and varietal development with the molecular markers in tef have been described as still insufficient (Misgana et al., 2018). Besides, conventional cereal breeding is time-consuming and very dependent on environmental conditions. Hence breeding a new variety takes between eight and twelve years and even then, the release of improved variety cannot be guaranteed. Hence, breeders are extremely interested in molecular markers.

Characterization and evaluation of core germplasm collections using SSR markers would be vital for enhancing genetic gain from tef breeding. The productivity of tef should be enhanced to compete with other more productive cereals. Consequently, this study is initiated with the aim of generating useful information by molecular diversity analysis for designing effective and efficient tef breeding strategies.

Materials And Methods

Plant Materials

The experimental plant materials for both the field and laboratory experiments comprised a total of 81 genotypes including 74 core germplasm lines and seven released varieties obtained from the working collections at the National Tef Research Program of DZARC. The germplasm materials originated from previous collections made from different areas of Ethiopia (Table 1)

Table 1. List of experimental tef genotypes

No.	Genotypes	Origin	No.	Genotypes		Origin
1	DZ-01-91	Jimma	42	DZ-01-247		E. Shoa
2	DZ-01-101	Jimma	43	DZ-01-1055		E. Shoa
3	DZ-01-86	Jimma	44	DZ-01-1057		E. Shoa
4	DZ-01-102	Jimma	45	DZ-01-663		E. Shoa
5	DZ-01-117	Jimma	46	DZ-01-691		E. Shoa
6	DZ-01-99	Jimma	47	DZ-01-695		E. Shoa
7	DZ-01-1193	Tigray	48	DZ-01-647		E. Shoa
8	DZ-01-1217	Tigray	49	DZ-01-1288		W. Shoa
9	DZ-01-1225	Tigray	50	DZ-01-1292		W. Shoa
10	DZ-01-1167	Tigray	51	DZ-01-1293		W. Shoa
11	DZ-01-1034	Gojam	52	DZ-01-1277		W. Shoa
12	DZ-01-1100	Gojam	53	DZ-01-979		W. Shoa
13	DZ-01-1102	Gojam	54	DZ-01-728		WShoa
14	DZ-01-1108	Gojam	55	DZ-01-725		W. Shoa
15	DZ-01-483	Wello	56	DZ-01-751		W. Shoa
16	DZ-01-527	Wello	57	DZ-01-759		W. Shoa
17	DZ-01-478	Wello	58	DZ-01-778		W. Shoa
18	DZ-01-512	Wello	59	DZ-01-787		W. Shoa
19	DZ-01-530	Wello	60	DZ-01-821		W. Shoa
20	DZ-01-504	Wello	61	DZ-01-866		W. Shoa
21	DZ-01-517	Wello	62	DZ-01-951		W. Shoa
22	DZ-01-647	Wello	63	DZ-01-974		W. Shoa
23	DZ-01-1130	Wello	64	DZ-01-801		W. Shoa
24	DZ-01-1233	Wello	65	DZ-01-829		W. Shoa
25	DZ-01-481	Wello	66	DZ-01-1382		W. Shoa
26	DZ-01-553	Wellega	67	DZ-01-886		W. Shoa
27	DZ-01-558	Wellega	68	DZ-01-1014		W. Shoa
28	DZ-01-569	Wellega	69	DZ-01-1015		W. Shoa
29	DZ-01-581	Wellega	70	DZ-01-1062		Arsi
30	DZ-01-1020	E. Shoa	71	DZ-01-1268		Arsi
31	DZ-01-11	E. Shoa	72	DZ-01-1310		Arsi
32	DZ-01-30	E. Shoa	73	DZ-01-10		Arsi
33	DZ-01-12	E. Shoa	74	DZ-01-9		Arsi
34	DZ-01-14	E. Shoa	75		Nigus		Released variety
35	DZ-01-16	E. Shoa	76		Bishoftu		Released variety
36	DZ-01-36	E. Shoa	77		Tesfa		Released variety
37	DZ-01-37	E. Shoa	78		Dagim		Released variety
38	DZ-01-58	E. Shoa	79		Flagot		Released variety
39	DZ-01-71B	E. Shoa	80		Eba		Released variety
40	DZ-01-18	E. Shoa	81		Bora		Released variety
41	DZ-01-46	E. Shoa

DNA Isolation and Purification

About 20-30 seeds of each of the 81 tef genotypes were planted in plastic pots comprising the combination of sand, forest soil, and compost. The plants were grown in the greenhouse at National Agricultural Biotechnology Research Center (NABRC), Holetta, Ethiopia.

Leaf samples were collected from a single plant of four-week-old seedlings. Hundred milligrams of fresh leaves were placed in 2 ml autoclaved and labeled Eppendorf tubes and freeze-dried for 24 hours at −40 °C, and then lyophilized. The samples were then ground using Geno Grinder (MM-200, Retsch) for 3 min. The genomic DNA was extracted from the ground fresh leaf samples using plant DNA extraction protocol based on the method of Diversity Array Technology (DArT) with some minor modification. Then, the DNA pellets were air dried and dissolved in 100 μl of TE buffer.

The quality and concentration of genomic DNA was checked with nano drop spectrophotometer (ND-8000, Thermos scientific). The level of DNA purity was determined by the 260/280 absorbance ratio. Further, the quality of DNA was tested using 1% (w/v) agarose gel and electrophoresis was carried out on horizontal electrophoresis set up (JYSPBT) using a standard DNA ladder. Hyper Ladder TM IV with known reference band to quantify the DNA amount and visualized using gel documentation, 3UV- Tran’s illuminator (Bio-Doc).

Those high band intensity, lesser smear, and high concentration were selected for PCR analysis and were stored in the refrigerator (4 ºC) till the PCR.

Microsatellite Primer Selection and Optimization

Twenty-five microsatellite primer-pairs previously developed by Zeid et al. (2011) were used for initial screening for amplification, polymorphism, and specificity to target loci. Fourteen markers that showed clear band and polymorphism were selected (Table 2). Concurring to the manufacturer's instruction, they were dissolved utilizing nuclease-free,water to a final concentration of 100mol/µl.

Table 2. Description of screened microsatellite makers that were used in the diversity analysis of 81 tef genotypes

No.	Marker name	Forward primer (5´ to 3´)	Reverse primer (3´ to 5´)	motif	Size (bp)	AT	Linked tef Chromosome No.	Source
1	CNLTS5	CCCAAAGTGATGCAAAAACA	GATAGATAGAGACACAGACACACAACA		250-310	55.4		Tadesse et al. (2021)
2	CNLTS6	AATTCGCAGCTGATCTACGC	CTCGTCGATATACGTGCAAAA		90-260	52.7		Tadesse et al. (2021)
3	CNLTs 12	TCATATGCAGCCCAACTCAA	ACGACGGGCTACTGGTATCT		180-280	60.6	12	Zied et al (2011)
4	CNLTS19	AAGACAATGCTACCCCAAGC	CCTCGGATTGTTGCTCTCTC	(AG)14	180-300	58.5		Tadesse et al. (2021)
5	CNLTs 20	GGGAATGAGCATGACCTGAT	TCTGTGGATGCTCAGCTACG		157-300	59.5	1
6	CNLTS22	CATTTCTTGCTGCTGGATCA	AGTATGGTGGCCTTGGTGAG		100-260	55.4
7	CNLTS25	GAGAGCGGTTTTGTCCTACG	GGAATAGGGAGGCGAGGTAG		150-320	51.3
8	CNLTS27	TTGGAATGAGATGGCATTTG	GAAGCGGGGTAAGATTTGAA		200-220	53.2	1
9	CNLTS28	GCTCAAGGGCAAGGACATC	CCTGGAATGTAATCGGAAAAA		100-200	59.3
10	CNLTS458	AGAGCCTTCGGCCTTGTACT	GTGAAGCGCGCAAATCTC		130-250	50.1	4
11	CNLTS463	TGCTAGGATGGTCCTGTTGAG	AGCACCAAATCCCTATGCAC		190-260	57.4
12	CNLTS 465	CAGCTTAGACGGGCAGAGAA	AGGAAAACAGAGAGAGGGAGAG		150-300	57.4
13	CNLTs 466	CAT CAT CAA TAA AAG ATC CGA GA	CAT CAT GAT ACT CGC CGT TG	(GA)16	246	54.1
14	CNLTs 484	GAGATCCTACCACGGCGATA	CGCTTTCCCCTCCTTTTGTA	(GA)18	157	59.5	1	Jifar et al. (2020)

PCR and Gel Electrophoresis

DNA amplification was performed in 12.5 μl reaction volume containing 6.25 μl, GoTaq®Red Master Mix (supplemented with all PCR reaction components, MgCl₂, PCR buffer, dNTPs, and Taq DNA polymerase), 0.5 μl forward primer and reverse primer, 3.25 μl nuclease-free water and 2 μl genomic DNA. The PCR was carried out using thermal cycler Gene Amp® PCR system 9700 (Applied Biosystems, USA) programmed at an initial denaturation at 95°C for 2 minutes followed by denaturation at 95°C for 30 seconds, annealing at 51.3°C up to 61.3°C depending on the primer used for 1 minute, extension at 72°C for 30 seconds and final elongation at 72°C for 5 minutes for 37 cycles. Before fragment analysis, amplification of the PCR products was checked on a 2% (w/v) agarose gel and visualized using gel documentation 3UV-transilluminator (Bio-Doc).

Data Analyses

Fragment scoring and data analyses were based on allelic forms. Clearly resolved and unambiguous fragments were scored for each locus using Gel analyzer 19.1 software. Then the allelic forms data of all entries combined were subjected for statistical analysis. Genetic diversity parameters of the 81 tef genotypes were analyzed using GenAlex version 6.5 (Peakall and Smouse, 2015) software package. Genetic diversity parameters considered in this study were number of effective alleles per locus (Ne), Shannon information index (I), fixation index (F) (Nei, 1978), gene flow (Nm), and percent polymorphism (% P). The number of alleles per locus (Na), polymorphic information content (PIC), observed (Ho), and expected (He) heterozygosity were analyzed using Power Marker v3, 25 (Liu and Muse, 2005) software package.

Furthermore, AMOVA was used to estimate F-statistics such as genetic differentiation (F_ST), fixation index or inbreeding coefficient (F_IS), and overall fixation index (F_IT) to compare the genetic structure among and within populations. The AMOVA procedures were done using Gene Alex version 6.5 (Peakall and Smouse, 2015) software package. To examine the degree of pair-wise population of F_ST and Nm values among the study materials were computed according to Nei’s (1978) using Gen Alex version 6.5 (Peakall and Smouse, 2015) software package.

Cluster analysis was carried out by using the UPGMA algorithm in DARwin version 6 software (Perrier and Jacquemoud Collet, 2010). A dendrogram for 81 tef test genotypes was then generated based on the dissimilarity matrix to visualize the pattern of a cluster within and among the selected tef germplasm lines and varieties. In addition, the population structure and admixture patterns of the 81 germplasm and release varieties were determined by the Bayesian model-based clustering method (Pritchard et al., 2000) using the Structure ver. 2.3.1 software.

Principal Coordinate Analysis (PCoA) was done using the Jaccard’s index to further validate the complementarity of clustering pattern revealed by the dendrogram using GenAlex version 6.5 (Peakall and Smouse, 2015) software package. The first three axes were used to plot the three-dimensional PCoA with STATISTICA version 6.0 software (Hammer et al., 2001)

Results

Molecular Genetic Diversity

In this study, the overall of 122.43 alleles were detected across the genome of 81 genotypes as revealed by 14 microsatellite loci. The number of alleles noticed ranged from 6.11 (CNLTS5) to 12.4 (CNLTS463) with a mean number of 8.75 alleles per locus through the genome of the entire populations. On the other hand, from all loci there was high inbreeding coefficient (f>0.5) except loci CNLTS 463 (0.3021) (Fig 1). In addition, the heterozygosity of the studied materials ranged from 0.to 0.69 with overall mean of 0.1457.

Relatively the markers CNLTS463, CNLTS22, CNLTS27, and CNLTS20 have high heterozygosity and Shannon´s index. The germplasm showed the highest genetic differentiation (F_st = 0.27) and the lowest value of gene flow (Nm =0.69) from CNLTS458 (Fig 1).

Four of the microsatellite loci (CNLTS22, CNLTS25, CNLTS27 and CNLTS463) did not significantly deviated from the HW-equilibrium over the entire populations.And the overall polymorphic information, from all the microsatellite loci utilized was found profoundly instructive (0.96-0.98) (Fig.1)

Analysis of Molecular Variance

The analysis of molecular variance (AMOVA) showed that 1% of the total genetic variation and low genetic differentiation (F_ST = 0.014) for the variation among populations, 86% among genotypes within populations and 13% variation among genotypes across all populations (Table 4). This study confirmed genetic differentiation (F_IS =0.870) of genotypes within population and (F_IT=0.872) among genotypes across all population (From 81 genotypes=one large grouped) with a small value of overall genetic differentiation coefficient (p = 0.001) (Table 4).

On the other side, released varieties and core germplasm collection of overall genetic differentiation coefficient among the population (p = 0.832), Fst=-0.004 and gene flow (Nm: 0) were observed (Table 4). A total genetic variation of 87% and 13% were explained by within populations and among genotypes across all populations respectively (Table 4).

Table 4: AMOVA results of 81 tef genotypes evaluated using 14 SSRs

Source	df	SS		MS	Est. Var.	%		Fstat		P-value
Among Pops	9		114.3	14.29	0.199		1%		Fst=0.014	0.002
Among Indiv	71		909.07	12.63	5.885		86%		Fis=0.87	0.001
Within Indiv	81		71.00	0.85	0.88		13%		Fit=0.87	0.001
Total	161		1094.37		6.85		100%
Gene flow among population (Nm= 17.02)
Among Pops	1		12.01	12.01	0.00		0%		Fst=0.004	0.832
Among Indiv	74		944.61	12.775	5.92		87%		Fis=0.865	0.001
Within Indiv	76		70.00	0.92	0.92		13%		Fit=0.865	0.001
Total	151		1026.62		6.84		100%
Gene flow among core germplasm and released varieties (Nm=0)

Where, df = degree of freedom, SS=sum of squares, MS=mean squares and Nm=gene flow, FST=inbreeding coefficient among population, FIS=inbreeding coefficient with population, FIT=inbreeding coefficient of within groups (across all population=one)

Genetic Distance and Gene Flow Among Populations

The pairwise population genetic distance and gene flow for both core germplasm and released varieties of tef genotypes are presented in (Table 5). The pairwise Fst standard genetic differentiation among populations varied from (0.000-0.036) which means 0,000 in between Wollo and Wellega, Gojam and Wollo ,Gojam and Wellega, Wollo and released varieties, Wellega and released varieties and 0.036 shows in between Arsi and Wellega.

Table 5: Fst values (below diagonal) and gene flow (above the diagonal) among tef genotypes originating from nine sources

Origins	JIM	TG	GOJ	WO	WEL	E. S	W. S	AR	R. Var
JIM	0.000	7.435	29.00	63.830	20.547	32.239	17.402	8.045	99.402
TG	0.027	0.000	24.846	12.423	7.022	10.686	8.448	7.042	13.039
GOJ	0.009	0.010	0.000	0.000	0.000	31.000	8.674	8.161	100.00
WO	0.005	0.020	0.000	0.000	0.000	20,000	11.691	7.393	0.000
WEL	0.013	0.034	0.000	0.000	0.000	29.000	12.631	6.605	0.000
ES	0.008	0.027	0.008	0.012	0.010	0.000	21.196	10.788	73.356
WS	0.014	0.020	0.026	0.019	0.020	0.020	0.000	79.772	20.52
AR	0.029	0.031	0.030	0.035	0.036	0.030	0.003	0.000	13.073
RV	0.003	0.020	0.002	0.000	0.000	0.003	0.012	0.022	0.000

JIM=Jimma, TG =Tigray, GOJ= Gojam, WO= Wollo, WEL=Wellega, ES= East Shoa, WS= West Shoa, AR=Arsi and R. Var=Released Variety

Allelic Diversity Among Populations

The number of observed alleles (Na) was higher for accessions collected from East Shoa, West Shoa and Wollo with values of 18.857, 12.57 and 10.714, respectively (Fig 2). Similarly, the germplasms from East Shoa, West Shoa and Wollo in that order have the highest numbers of effective alleles with 16.756, 11.592 and 9.732 and Shannon’s information indices with 2.855, 2.436 and2.306, and numbers of private alleles with 13.071, 8.500 and6.357. (Fig 2)

Small number of observed heterozygosity were observed for all of the populations. However, relatively high number of heterozygosity were noted for the lines from Jimma (0.193), Released varieties (0.19) Wellega (0.16) Tigray (1.5) and E.shoa (1.5) and the (Fig 2). And all core germplasm tef collection of populations used in this study showed similar value in percentage of polymorphism (100), except the materials originated from Arsi. (Fig 2)

Population Structure Analysis

Cluster Analysis

Cluster analysis based on UPGMA method clustered the 81 tef genotypes into three major groups consisting of 52%, 44% and 4% of the genotypes in clusters 1, 2 and 3, respectively (Figure.1A). It was observed that the genotypes in cluster 1 comprised in total of genotypes per population:16/25 from East Shoa, 4/7 from released varieties, 6/11 from Wollo, 3/4 from Wellega, 3/4 from Gojam, 1/5 from Arsi, 1/16 from Jimma, 3/4 from Tigray and 5/15 from West Shoa.

The genotypes in second cluster included 10 from West Shoa, 4 from Arsi, 7 from East Shoa, 2 from Wollo, 3 from the 7 released varieties 1 of the 4 from Tigray, and 4 from Jimma. On the other hand, only 3 of the germplasm lines from Wollo are grouped in cluster 3

Principal Coordinate Analysis

PCoA showed that the first three coordinates count in anti clockwise direction explained 93.34% of the gross genetic variation. The gross variation explained was 65.45%, 22.93% and 4.96% by the first, second, and third principal coordinates, respectively (Fig. 3).

Population Structure

The most function of structure in population hereditary qualities is the recognizable proof of hereditarily homogeneous bunches of individuals based on modular value of ΔK. These are determined by Bayesian algorithm implemented in the software STRUCTURE based on true number of cluster (K) suggested by Evanno et al. (2005). In this study the algorithm allows estimating the true number of clusters at (K=4) (Figure 3a) in the sample of 81 individuals included in the study. Which means that these samples (81 individuals) come from four ancestors. But the bar plot appeared that there is no clear geographic origin-based organizing of population this is what indicates there is admixture of tef genes.

Discussion

Allelic Variance from Loci in Populations

The allele number that found in this study relatively closer to the earlier reports with ranges of 8 to 23 (Zeid et al., 2012). However, these were different when compared to the reports of total number of alleles (164) using 10 SSR (Tesfaye et al., 2016), 168 allels using 10 SSR (Jifar et al., 2020) and, 52 using 9 SSR markers (Fikre et al., 2018). In the other case, the genetic distance in the present study is relatively high but with a narrower range than those reported by Zeid et al. (2012) 0.75-0.94 and Tesfay et al. (2016) 0.65-0.94. This shows the marker used from this study have high polymorphism and results to detect the differentiation of genotypes appropriately.

Regarding to inbreeding coefficient all loci show f>0.5 except loci CNLTS 463(0.3021). And as compared to the expected heterozygosity, the observed heterozygosity is less. This is dissimilar from the report of higher heterozygosity (Fikre et al., 2018). The cause is the differences in test genotypes used in the studies and due to the fact that tef is autogamous and improvement of tef using intra-specific crossing has not been yet exploited from most of the germplasm collections that included from this study. Here in even though the observed heterozygosis and genetic differentiation is not in line with the high range heterozygosity of in previous study of Fikre et al. (2018), the loci CNLTS463, CNLTS22, CNLTS27 and CNLTS20 have relatively high genetic variation in all the test genotypes. This gives chanced for breeders to alleles that are linked for desirable traits and then helps to establish new breeding strategy.

In the other case, four of the microsatellite loci (CNLTS22, CNLTS25, CNLTS27 and CNLTS463) did not significantly deviate from the HW-equilibrium over the entire test genotypes. This may be due to gene flow or inbreeding as it implies genetic drift within populations. This in turn reduces or increases the amount of genetic diversity within the gene pool (Kleinhans et al., 2019). In terms of the general polymorphic data, all the microsatellite loci utilized were found profoundly instructive (0.96-0.98) and all loci have greater than 0.5. The range is different from the previous reports of 0.75 to 0.91 (Zeid et al., 2012); 0.67 to 0.94 (Tesfaye et al., 2016), and 0.36 to 0.93 (Jifar et al., 2020).

Besides the markers used in the study are highly informative, and have high allelic variation in loci that are distributed within tef core germplasms and released varieties. Consequently, these markers would be useful for further genetic analysis diversity and interrelationships as well as linkage map and QTL mapping. And the study indicates private alleles are parallel with the number of genotypes within a population. This line with the study of Jifar et al. (2020), and the observed high number of private alleles will be important for further improvement of the tef crop.

Compared to the expected heterozygosity, low observed heterozygosity values were observed from all the test genotypes. However, relatively high heterozygosity occurred for germplasm lines from Jimma and Tigray, and also released varieties. Hence a good differentiation of genes may be from those populations. This will be source of good traits which help to design breeding program.

Genetic Differentiation of the Population

low genetic differentiation of among population from this study is due high variation of the genotypes within population. This has little bet discrepancy with the study of Jifar et al. (2020) and Tadesse et al. (2021) which is 3% and 5% respectively. But majority of the variation, is described among genotypes within one population and secondly explained by the distinction of among genotypes across all populations (within 81 genotypes =within a large group). These results show little bet high variation within population as compared to in earlier tef studies depicted Jifar et al. (2020), Tadesse et al. (2021) respectively. While it shows low variation of genotypes across all population comparing to the study of Jifar et al., (2020); Tadesse et al. (2021).

The Fst value ranged from 0 to 0.05 is considered as low, 0.05 to 0.15 moderate, and 0.15 to 0.25 large and greater than 0.25 very large genetic differentiations (Wright, 1951). Comparing to this, the current results showed low F_ST, (0.000-0.036)) genetic differentiation in pair of population which is low. While high genetic differentiation (within population and among individuals of genotypes across all populations have been observed in this study. The low genetic differentiation among populations is may be due to presence of high gene flow that led to high genetic differentiation within population. Those which have high gene differentiation of genotypes within population and among genotypes across all populations provides ample opportunities to select lines having good potentials to adapt different environments (Bekana et al., 2021).

On the other hand, in this study, the overall genetic differentiation of released variety versus core germplasm collections is very low among the population and the gene flow is (Nm: 0). Here in absence of gene flow is because of new varieties were released recently whereas, the core germplasm was collected earlier around 1994 (Berhe, 1995).

Clustering and Population Structure Analysis

Cluster analysis is used to establish the relationships among the genotypes based on their distance matrix. Based on this, the UPGMA method, the study confirms the reports of Fikre et al. (2018) and Tadesse et al. (2021), who obtained three clusters of the tef test genotypes whereas Jifar et al. (2020) obtained 5 clusters for 144 genotypes. Here indicates the number of genotypes and its diversity that used from study have influenced the number of clusters. And the patterns of grouping of the genotypes through all clusters obviously showed the existence of moderate to significant gene flow along geographical borders. This is due to contiguity and interchange of germplasm resources. Besides, for small seeded crops like tef, it is very hard to maintain truly pure seed stocks because admixtures can occur during any step of the seed production and handling process unless extreme care is practiced (Zeid et al., 2012).

On the other case the cluster of collected core germplasm versus release variety, the released varieties are distributed from different clusters and sub clusters rather than forms one cluster except Tesfa and Eba. This indicates, the variety may be developed for different parents which found in different agro ecologies whereas the variety Tesfa and Eba may have similar parents.

The most function of structure in population hereditary qualities is the recognizable proof of hereditarily homogeneous bunches of individuals. In PCoA of in this study showed that individuals make three groups with three coordinates explained 93.34% of the genetic variation. The gross variation explained in this study is unlike from the previous study of Fikre et al. (2016), 29% of gross variation 12%, 10%,7% Tadesse et al. (2021); 30.72% explained 13.13 %, 9.20%, 8.39% and Teshome (2019) 14.85% of the total variation explained 5.57%, 4.95% and 4.33% .

In the other case, based on Bayesian algorithm ,the structure of the individuals clustered into four since the algorithm allows estimating the true number of clusters at (K=4) (Figure 3a) in the sample of 81 individuals based on suggested by Evanno et al. (2005).Which means that these samples come from four ancestors. But the bar plot appeared in the admixture indicates the used sample of individuals are admixture and requires revision for its collection.

Conclusion And Recommendation

In conclusion, sufficient genetic variation offers alternatives from which selections are made for improvement of crop. A success in crop improvement depends on the quantity of genetic variation that exist in the genetic materials and selection for genetically superior genotypes.

In this study higly polymorphic and informative markers were used .Hence larger number of allels are generated. Among these, there may have private allels which might be linked to important agronomic traits. And it indicates there is relatively high heterozygosity occurred for germplasm lines which is found in Jimma ,Tigray and released varieties .Those may have allels which linked to good traits which help to design breeding program. While we observed lower heterozygosity than expected heterozygosity. To increase hetezygosity, artificial intracross and a broad variablity through artificial intercrossing should be supported by in vitro embryo rescue. And induced mutation will aslo help to increase the variablity of tef. As the population structure indicates from this study the genotypes found in all population are admixture which come from four ancestors. Hence re-collect the genotypes based on its origin may be good to conserve

The study did not cover all admistrative zones due to lack of the passport data.To facillitate the breeding for diffrenet environment, core germplasm collection should be revised and collected based on agroecologies

Declarations

Acknowledgements

The principal author acknowledges the assistance and support of the Ethiopian Institute of Agricultural Research (EIAR), Debre Zeit Agricultural Research Center and its staff, Holetta National Agricultural Biotechnology Research Center and its staff, Mr. Obssi Desalegn, Dr.Solomon Chanyalew, Prof.Gabriel Schaaf and his PhD student Yeshambel Emwedih,at Rhenish Friedrich Wilhelm University of Bonn, and Wagaw Sendeku (PhD) Student at Addis Ababa University.

Author contributions

Kebebew A. and Derejaw T. conceived the study; Derejaw.T did the whole lab work, data analysis and drafted the manuscript; Tileye F. and Kebebew A. give suggestion follow up the work and revised the manuscript; all authors reviewed the manuscript and gave final approval for publication

Funding

funding provided for working was by Ethiopian institution agricultural research and Addis Ababa university.

Conflict of Interest

No conflict of interest.

Data Availability Statement

The data that support the findings of this study are openly available in the Science Data Bank at https://www.doi.org/ [DOI 10.57760/science dB. j00143.00070] and Addis Ababa university library

References

Assefa K,Merker A, Tefera H (2003a). Inter simple sequence repeat (ISSR) analysis of genetic diversity in tef (Eragrostis tef (Zucc.) Trotter). Hereditas139:174–183.
Assefa K, Cannarozzi G, Girma D, Kamies R, Chanyalew S, Plaza-Wüthrich S, Regula Blösch R, Rindisbacher A, Rafudeen S, and Tadele Z (2015). Genetic diversity in tef [Eragrostis tef (Zucc.) Trotter]. Front. Plant Sci .6:177
Assefa K, Ketema S, Jifar H, Tesfaye K, Assefa K, Chanyalew S, Tadele Z (2017) SemiDwarf Tef (Eragrostis tef) Lines for High Seed Yield and Lodging Tolerance in Central Ethiopia. Afric. Crop Sci. J., 25 (4): 419 - 439.
Ayele M, Tefera H. Assefa K, Nguyen H (1999). Genetic characterization of two Eragrostis species using AFLP and morphological traits. Hereditas130: 33-40
Bai G, Ayele M, Tefera H, Nguyen H.T(1999). Amplified fragment length polymorphism analysis of tef [Eragrostis tef (Zucc.) Trotter]. Crop Sci. 39: 819-824.
Bai G, Ayele M, Tefera H, Nguyen H (2000). Genetic diversity in tef [Eragrostis tef (Zucc) Trotter] and its relatives as revealed by random amplified polymorphic DNAs. Euphytica112: 15–22.
Baye K (2014). Synopsis: Tef: nutrient composition and health benefits. Ethio. Stratag Support Program, 2013–2014.
Bekana T (2021). Role and economic importance of crop genetic diversity in food security. Int.l J. of Agril. Sci. and Food Techno. 7: 164–169.
Cannarozzi G, Girma D (2014). Genome and transcriptome sequencing identifies breeding targets in the orphan crop tef ( Eragrostis tef ), (August). concentration of six cultivars of tef [Eragrostis tef (Zucc.) Trotter]. Ethiopian J. Nat. Resour. 4, 21–36
Chanyalew S, Kebede W, Fikre T, Genet Y, Jifar H, Demissie M, Tolossa K,Tadesse M, Tadele Z, Assefa K (2021). Tef Breeding Manual. Ethiopian Institute of Agricultural Research, Debre Zeit Agricultural Research Center, National Tef Research Program, Debre Zeit, Ethiopia
Costanza S, Dewet J, Harlan J (1979). Literature‐review and numerical taxonomy of Eragrostis tef (t’ef). Economic Botany33: 413–424
CSA (2021). The Federal Democratic Republic of Ethiopia, Agricultural Sample Survey 2020/21 (2013 E.C.), Volume I, Report on Area and Production of Major Crops (Private Peasant Holdings, Meher Season), Statistical Bulletin 586, April 2021, Addis Ababa, Ethiopia
D'Andrea, A (2008). Tef (Eragrostis tef) in ancient agricultural systems of highlands of Ethiopia. Economic Botany62: 547-566
Fikre T, Tesfaye K, Assefa K (2018). Genetic Diversity of Ethiopian Tef [( Eragrostis tef ( Zucc .) Trotter ] Released and Selected Farmers ’ Varieties along with Two Wild Relatives as Revealed by Microsatellite Markers, J. of Crop Sci. and Biot.21(4):367–374.
Girma D, Cannarozzi G (2014.). Genotyping by sequencing reasserts the close relationship between tef and its putative wild Eragrostis progenitors. Diversity 10: 17
Girma D , Assefa K , Chanyalew S, Cannarozzi G, and Kuhlemeier C (2014). The origins and progress of genomics research on tef (Eragrostis tef), Plant Biot. J. 12: 534–540
Govindaraj M,Vetriventhan, Srinivasan M (2015). Importance of genetic diversity assessment in crop plants and its recent advances: an overview of its analytical perspectives. Genetics Res. Inter: Vol .2015
Harrower M, Mccorriston J, Andrea A (2010). General/specific, local, global /comparing of the begnning of agriculture in the horn of Africa (Ethiopia, Eriteria) and south west Arabiya (Yemen) . American Antiquity5(3): 452–472.
Jansen G, DiMaio L, Hause N (1962). Amino acid composition and lysine supplementation of teff. J. Agri. Food Chem. 10: 62-64.
Jansen G, DiMaio L, Hause L (1962). Amino acid composition and lysine supplementation of teff. J. Agri. Food Chem. 10: 62-64.
Jifar H, Tesfaye K, Assefa K (2020). Advances in Crop Science and Technology Genetic Diversity and Population Structure of Tef [ Eragrostis tef ( Zucc .) Trotter ] as Revealed by SSR Markers. Advan. in Crop Sci. and Tech. 8(1)
Ketema S (1993). Tef (Eragrostis tef): Breeding, Genetic Resources, Agronomy, Utilization and Role in Ethiopian Agriculture. Institute of Agricultural Research, Addis Ababa, Ethiopia.
Kleinhans C, Willows-Munro S (2019). Low genetic diversity and shallow population structure in the endangered vulture, Gyps coprotheres. Scientific Reports9(1): 1–11.
Mengesha H (1966). Chemical composition of tef (Eragrostis tef) compared with that of wheat, barley and grain sorghum. Econ. Bot. 20:268-273.
Merga M (2018). Achievements and Challenges of Tef Breeding in Ethiopia. J Agri Sci Food Res. 9:1
Nei M (1972). Genetic distance between populations. American Naturalist106: 283-292.
Nei M (1978). Estimation of average heterozygosity and genetic distance from small number of individuals. Genet.89: 583-590. .
Peakall R, Smouse P (2015). GENALEX6: Genetic Analysis in Excel. Population genetic software for teaching and research. Mol.Ecol. 6: 288-295.
Perrier X, Jacquemoud-Collet J (2010). DARwin software http://darwin.cirad.fr/.
Ponti J (1978). The systematics of Eragrostis tef (Graminae) and related species. PhD Thesis, University of London, London, UK
Pritchard JK, Stephens M, Donnelly P (2000). Inference of population structure using multilocus genotype data. Genetics155: 945–959.
Romero M, Mujica A, Pineda E, Ccamapaza Y, Nacional U, de Biología E P (2019). Genetic identity based on simple sequence repeat (SSR) markers for quinoa (Chenopodium quinoa Willd.). Cien. Inv. Agr. 46(2):166-178..
Saturni L, Ferretti G, Bacchetti T (2010). The gluten-free diet: safety and nutritional quality. Nutri.2: 16-34.
Smale M, Aquino P, Crossa J, del Toro E, Dubin J, Fischer T, Fox P, Khairallah M, Mujeeb-Kazi A, Nightingale K, OrtizMonasterio I, Rajaram S, Singh R, Skovmand B, van Ginkel M,Varughese G , Ward R (1996). Understanding global trends in the use of wheat diversity and international flows of wheat genetic resources. Economics Working Paper 96-02. Mexico, D.F.: CIMMYT. 61pp.
Tadesse M, Kebede M, Girma D (2021). Genetic diversity of tef [Eragrostis tef (Zucc.) Trotter ] as revealed by microsatellite markers, 2021. Int. J. of Genomics, Vol. 2021:
Tesfay M, Shimellis H, Nark D, Assefa A (2016). Assessment of the genetic relationship of tef (Eragrostis tef) genotypes using SSR markers. S Afr J Bot.Sci. 105: 106-110
Teshome A. (2019). Morphological and molecular characterization selected of (Eragrostis tef (Zucc.) Trotter) Accession from the gene bank of Ethiopia. MSc. Thesis, Addis Ababa University, Addis Ababa, Ethiopia
Vohwinkel F, Piepho HP, Heiligtag B, Richter C (2002).Comparative yield and nutrientmconcentration of six genotypes of tef (Eragrostis Tef (Zucc.)Trotter).Ethiop. J. Nat.Resour. 4, 21–36
Zeid M, Assefa K, Haddis A, Chanyalew S, Sorrells M (2012). Field Crops Research Genetic diversity in tef ( Eragrostis tef ) germplasm using SSR markers. Field Crops Res. 127, 64–70
Zeid, M., Belay, G., Mulkey, S., Poland, J., and Sorrells, M. (2011). QTL mapping for yield and lodging resistance in an enhanced SSR-based map for tef. Theor Appl Genet122:77–93.