Genetic Diversity and population structure of Selected tef Core Germplasm Lines Based on Microsatellite Markers

DOI: https://doi.org/10.21203/rs.3.rs-2770211/v1

Abstract

Tef is an indigenous and important food, feed and cash crop to the small-holder Ethiopian farmers. Knowledge of the natural genetic composition of the crop provides the option to further exploit its genetic potential through breeding. However, there are insufficient reports on the genetic variability of Ethiopian tef using a medium-throughput marker system. Hence, the current study was designed to evaluate the genetic variability of released and core germplasm which were collected earlier. Eighty-one tef genotypes collected from eight Ethiopian ecological zones and release variety were targeted using 14 SSR markers. The study ensued a total of 122.43 alleles through the entire loci and populations. All the used microsatellite loci were highly polymorphic with PIC ranging from 0.96 to 0.98 and mean of 0.976. The analysis showed the presence of high allelic diversity ranging from 0.94 to 0.97 with overall mean of 0.96. The molecular variance analysis indicated the existence of large genetic differentiation (FIS and FIT=0.87) with 86% and 13% of the total variation accounted by among genotypes within population and across from all population respectively. Whereas low genetic differentiation among population (FST=0.014. which accounts 1 %.) was observed. Multivariate analysis like clustering, and PCoA, analysis did not cluster genotypes into distinct groups according to their geographical areas of population. This is due to presumably gene flow among populations. The information of this study may be used as input for breeders in selecting breeding materials.

Introduction

Tef is well known as an indigenous and important food, feed and cash crop to the small-holder Ethiopian farmers.  Among the cereals, tef covers 2.93 million hectares of land and this is equivalent to 27.8% of the total area allocated to cereal crop cultivation in Ethiopia (CSA, 2021). The same source indicates that the annual grain production of 5.51 million tons makes tef third following maize and wheat with 10.56 and 5.78 million tons, respectively.

 In Ethiopia, tef grain is mainly used for making ‘injera’ and sometimes for making porridge, unraised bread (‘kita’), and local alcoholic drinks, called ‘tela’ and ‘katikala’. In general, tef is considered to be a highly nutritious grain having protein content in the range of 8 to 11 percent, similar to other more common cereals such as wheat (Baye et al., 2014), balanced amino acids with high lysine content (Jansen et al., 1962), and high iron content (Mengesh ,1966; Costanza et al., 1979). It has also got significantly higher calcium content than most major grains (Vohwinkel et al., 2002). Besides, tef straw is the most appreciated feed for cattle, while it also serves as a binding material to reinforce mud used for plastering the walls of huts and local grain storage facilities called gotera, and also as bedding material, mulch and fuel source. Moreover, tef grains are free of gluten which causes celiac disease. It is also a good alternative diet for diabetic people due to its low glycemic index as compared to most other cereals (Saturni et al., 2010).

Eragrostis tef (Zucc.) Trotter belongs to the class: Liliopsida, order: Poale, family: Poaceae, subfamily: Chloridodeae, genus: Eragrostis (Smale et al., 1996). It is allotetraploid (2n =4x= 40) originating from the hybridization of two distinct species followed by diploidization Cannarozzi et al. (2014). 

Every individual is genetically unique by nature. Hence conservation of this unique ness is to maintain genetic diversity at many levels and to provide tools for population monitoring and assessment that can be used for conservation planning. Genetic diversity is generally defined as the extent of genotypic variability existing in a group of individuals. It provides opportunity for plant breeders to develop new and improved cultivars with desirable characteristics, which include both farmer-preferred traits (yield potential and large seed, etc.) and breeders preferred traits (pest and disease resistance and photosensitivity, etc.). It gives species that the ability to adapt to changing environments, including new pests and diseases and new climatic conditions, such as global warming. Hence assessment of genetic diversity in germplasm collections and genetic relationships among breeding materials is therefore essential in crop improvement strategies.

The indigenous tef genetic resources have been and will continue to be the major source of variability for its genetic improvement. In the early 1990s, a core tef germplasm set of 320 selected representative lines was established based on phenotypic evaluation and characterization of over 2255 tef germplasm collections (Ketema et al., 1993). Some studies detected tef genetic diversity at the molecular level using various types of DNA markers such as AFLP (Mulu Ayele et al., 1999; Bai et al., 1999), RAPD (Bai et al., 2000), ISSR (Assefa et al., 2003a), SSR (Zeid et al., 2012; Tesfaye et al., 2018; Fikre et al., 2018 and Jifar et al., 2020) and SNP (Girma et al., 2014). Among these, the studies so far indicate SSR markers are polymorphic in tef genotypes comparably (Girma et al., 2014)

A number of methods have been developed over the past decades for the assessment of genetic diversity in tef germplasm accessions, breeding lines and populations. This implies considerable progress in improving tef and developed and released more than 51 improved tef varieties (Chanyalew et al., 2021). However, the productivity of tef with its national average yield of 1.88 t/ha is still extremely low compared to other cereals (CSA, 2021). Among the major issues which cause low productivity are lodging, abiotic stresses such as drought and acidity, and biotic stresses (diseases and insect pests) (Assefa et al., 2015). 

Therefore, germplasm screening and varietal development with the  molecular markers in tef have been described as still insufficient (Misgana et al., 2018). Besides, conventional cereal breeding is time-consuming and very dependent on environmental conditions. Hence breeding a new variety takes between eight and twelve years and even then, the release of improved variety cannot be guaranteed. Hence, breeders are extremely interested in molecular markers. 

 Characterization and evaluation of core germplasm collections using SSR markers would be vital for enhancing genetic gain from tef breeding. The productivity of tef should be enhanced to compete with other more productive cereals. Consequently, this study is initiated with the aim of generating useful information by molecular diversity analysis for designing effective and efficient tef breeding strategies.

Materials And Methods

Plant Materials

The experimental plant materials for both the field and laboratory experiments comprised a total of 81 genotypes including 74 core germplasm lines and seven released varieties obtained from the working collections at the National Tef Research Program of DZARC. The germplasm materials originated from previous collections made from different areas of Ethiopia (Table 1)

Table 1. List of experimental tef genotypes

No.

Genotypes

Origin

No.

Genotypes

Origin

 

1

DZ-01-91

Jimma

42

DZ-01-247

E. Shoa 

 

2

DZ-01-101

Jimma

43

DZ-01-1055

E. Shoa 

 

3

DZ-01-86

Jimma

44

DZ-01-1057

E. Shoa 

 

4

DZ-01-102

Jimma

45

DZ-01-663

E. Shoa 

 

5

DZ-01-117

Jimma

46

DZ-01-691

E. Shoa 

 

6

DZ-01-99

Jimma

47

DZ-01-695

E. Shoa 

 

7

DZ-01-1193

Tigray 

48

DZ-01-647

E. Shoa

 

8

DZ-01-1217

Tigray 

49

DZ-01-1288

W. Shoa 

 

9

DZ-01-1225

Tigray 

50

DZ-01-1292

W. Shoa 

 

10

DZ-01-1167

Tigray 

51

DZ-01-1293

W. Shoa 

 

11

DZ-01-1034

Gojam 

52

DZ-01-1277

W. Shoa 

 

12

DZ-01-1100

Gojam 

53

DZ-01-979

W. Shoa 

 

13

DZ-01-1102

Gojam

54

DZ-01-728

WShoa 

 

14

DZ-01-1108

Gojam 

55

DZ-01-725

W. Shoa 

 

15

DZ-01-483

Wello 

56

DZ-01-751

W. Shoa 

 

16

DZ-01-527

Wello 

57

DZ-01-759

W. Shoa 

 

17

DZ-01-478

Wello 

58

DZ-01-778

W. Shoa 

 

18

DZ-01-512

Wello 

59

DZ-01-787

W. Shoa 

 

19

DZ-01-530

Wello 

60

DZ-01-821

W. Shoa 

 

20

DZ-01-504

Wello 

61

DZ-01-866

W. Shoa 

 

21

DZ-01-517

Wello 

62

DZ-01-951

W. Shoa 

 

22

DZ-01-647

Wello 

63

DZ-01-974

W. Shoa 

 

23

DZ-01-1130

Wello 

64

DZ-01-801

W. Shoa 

 

24

DZ-01-1233

Wello 

65

DZ-01-829 

W. Shoa 

 

25

DZ-01-481

Wello 

66

DZ-01-1382

W. Shoa

 

26

DZ-01-553

Wellega 

67

DZ-01-886

W. Shoa 

 

27

DZ-01-558

Wellega 

68

DZ-01-1014

W. Shoa 

 

28

DZ-01-569

Wellega 

69

DZ-01-1015

W. Shoa 

 

29

DZ-01-581

Wellega 

70

DZ-01-1062

Arsi

 

30

DZ-01-1020 

E. Shoa

71

DZ-01-1268

Arsi 

 

31

DZ-01-11

E. Shoa 

72

DZ-01-1310

Arsi 

 

32

DZ-01-30

E. Shoa 

73

DZ-01-10

Arsi 

 

33

DZ-01-12

E. Shoa 

74

DZ-01-9

Arsi 

 

34

DZ-01-14

E. Shoa 

75

Nigus 

Released variety

35

DZ-01-16

E. Shoa 

76

Bishoftu

Released variety

36

DZ-01-36

E. Shoa 

77

Tesfa 

Released variety

37

DZ-01-37

E. Shoa 

78

Dagim 

Released variety

38

DZ-01-58

E. Shoa 

79

Flagot 

Released variety

39

DZ-01-71B

E. Shoa 

80

Eba 

Released variety

40

DZ-01-18

E. Shoa 

81

Bora

Released variety

41

DZ-01-46

E. Shoa 

 

 

 

 

 

 

 

DNA Isolation and Purification 

About 20-30 seeds of each of the 81 tef genotypes were planted in plastic pots comprising the combination of sand, forest soil, and compost. The plants were grown in the greenhouse at National Agricultural Biotechnology Research Center (NABRC), Holetta, Ethiopia. 

Leaf samples were collected from a single plant of four-week-old seedlings. Hundred milligrams of fresh leaves were placed in 2 ml autoclaved and labeled Eppendorf tubes and freeze-dried for 24 hours at −40 °C, and then lyophilized. The samples were then ground using Geno Grinder (MM-200, Retsch) for 3 min. The genomic DNA was extracted from the ground fresh leaf samples using plant DNA extraction protocol based on the method of Diversity Array Technology (DArT) with some minor modification. Then, the DNA pellets were air dried and dissolved in 100 μl of TE buffer.

The quality and concentration of genomic DNA was checked with nano drop spectrophotometer (ND-8000, Thermos scientific). The level of DNA purity was determined by the 260/280 absorbance ratio. Further, the quality of DNA was tested using 1% (w/v) agarose gel and electrophoresis was carried out on horizontal electrophoresis set up (JYSPBT) using a standard DNA ladder. Hyper Ladder TM IV with known reference band to quantify the DNA amount and visualized using gel documentation, 3UV- Tran’s illuminator (Bio-Doc).

Those high band intensity, lesser smear, and high concentration were selected for PCR analysis and were stored in the refrigerator (4 ºC) till the PCR. 

 Microsatellite Primer Selection and Optimization

Twenty-five microsatellite primer-pairs previously developed by Zeid et al. (2011) were used for initial screening for amplification, polymorphism, and specificity to target loci. Fourteen markers that showed clear band and polymorphism were selected (Table 2). Concurring to the manufacturer's instruction, they were dissolved utilizing nuclease-free,water to a final concentration of 100mol/µl.

Table 2. Description of screened microsatellite makers that were used in the diversity analysis of 81 tef genotypes

No. 

Marker name

Forward primer (5´ to 3´)

Reverse primer (3´ to 5´)

motif

Size (bp)

AT

 

Linked tef Chromosome No.

Source 

1

CNLTS5

CCCAAAGTGATGCAAAAACA

GATAGATAGAGACACAGACACACAACA

 

250-310

55.4

 

Tadesse et al. (2021)

2

CNLTS6

AATTCGCAGCTGATCTACGC

CTCGTCGATATACGTGCAAAA

 

90-260

52.7

 

3

CNLTs 12

TCATATGCAGCCCAACTCAA

ACGACGGGCTACTGGTATCT

 

180-280

60.6

12

Zied et al (2011)

4

CNLTS19

AAGACAATGCTACCCCAAGC

CCTCGGATTGTTGCTCTCTC

(AG)14

180-300

58.5

 

 

 

 

 

 

 

Tadesse et al. (2021)

5

CNLTs 20

GGGAATGAGCATGACCTGAT

TCTGTGGATGCTCAGCTACG

 

157-300

59.5

1

6

CNLTS22

CATTTCTTGCTGCTGGATCA

AGTATGGTGGCCTTGGTGAG

 

100-260

55.4

 

7

CNLTS25

GAGAGCGGTTTTGTCCTACG

GGAATAGGGAGGCGAGGTAG

 

150-320

51.3

 

8

CNLTS27

TTGGAATGAGATGGCATTTG

GAAGCGGGGTAAGATTTGAA

 

200-220

53.2

1

9

CNLTS28

GCTCAAGGGCAAGGACATC

CCTGGAATGTAATCGGAAAAA

 

100-200

59.3

 

10

CNLTS458

AGAGCCTTCGGCCTTGTACT

GTGAAGCGCGCAAATCTC

 

130-250

50.1

4

11

CNLTS463

TGCTAGGATGGTCCTGTTGAG

AGCACCAAATCCCTATGCAC

 

190-260

57.4

 

12

CNLTS 465

CAGCTTAGACGGGCAGAGAA

AGGAAAACAGAGAGAGGGAGAG

 

150-300

57.4

 

13

CNLTs 466 

CAT CAT CAA TAA AAG ATC CGA GA 

CAT CAT GAT ACT CGC CGT TG 

(GA)16 

246

54.1

 

 

14

CNLTs 484

GAGATCCTACCACGGCGATA

CGCTTTCCCCTCCTTTTGTA

(GA)18

157

59.5

1

Jifar et al. (2020)

PCR and Gel Electrophoresis 

DNA amplification was performed in 12.5 μl reaction volume containing 6.25 μl, GoTaq®Red Master Mix (supplemented with all PCR reaction components, MgCl2, PCR buffer, dNTPs, and Taq DNA polymerase), 0.5 μl forward primer and reverse primer, 3.25 μl nuclease-free water and 2 μl genomic DNA. The PCR was carried out using thermal cycler Gene Amp® PCR system 9700 (Applied Biosystems, USA) programmed at an initial denaturation at 95°C for 2 minutes followed by denaturation at 95°C for 30 seconds, annealing at 51.3°C up to 61.3°C depending on the primer used for 1 minute, extension at 72°C for 30 seconds and final elongation at 72°C for 5 minutes for 37 cycles. Before fragment analysis, amplification of the PCR products was checked on a 2% (w/v) agarose gel and visualized using gel documentation 3UV-transilluminator (Bio-Doc).

Data Analyses

Fragment scoring and data analyses were based on allelic forms. Clearly resolved and unambiguous fragments were scored for each locus using Gel analyzer 19.1 software. Then the allelic forms data of all entries combined were subjected for statistical analysis. Genetic diversity parameters of the 81 tef genotypes were analyzed using GenAlex version 6.5 (Peakall and Smouse, 2015) software package. Genetic diversity parameters considered in this study were number of effective alleles per locus (Ne), Shannon information index (I), fixation index (F) (Nei, 1978), gene flow (Nm), and percent polymorphism (% P). The number of alleles per locus (Na), polymorphic information content (PIC), observed (Ho), and expected (He) heterozygosity were analyzed using Power Marker v3, 25 (Liu and Muse, 2005) software package.

 Furthermore, AMOVA was used to estimate F-statistics such as genetic differentiation (FST), fixation index or inbreeding coefficient (FIS), and overall fixation index (FIT) to compare the genetic structure among and within populations. The AMOVA procedures were done using Gene Alex version 6.5 (Peakall and Smouse, 2015) software package. To examine the degree of pair-wise population of FST and Nm values among the study materials were computed according to Nei’s (1978) using Gen Alex version 6.5 (Peakall and Smouse, 2015) software package.

Cluster analysis was carried out by using the UPGMA algorithm in DARwin version 6 software (Perrier and Jacquemoud Collet, 2010). A dendrogram for 81 tef test genotypes was then generated based on the dissimilarity matrix to visualize the pattern of a cluster within and among the selected tef germplasm lines and varieties. In addition, the population structure and admixture patterns of the 81 germplasm and release varieties were determined by the Bayesian model-based clustering method (Pritchard et al., 2000) using the Structure ver. 2.3.1 software. 

Principal Coordinate Analysis (PCoA) was done using the Jaccard’s index to further validate the complementarity of clustering pattern revealed by the dendrogram using GenAlex version 6.5 (Peakall and Smouse, 2015) software package. The first three axes were used to plot the three-dimensional PCoA with STATISTICA version 6.0 software (Hammer et al., 2001)

Results

Molecular Genetic Diversity

In this study, the overall of 122.43 alleles were detected across the genome of 81 genotypes as revealed by 14 microsatellite loci. The number of alleles noticed ranged from 6.11 (CNLTS5) to 12.4 (CNLTS463) with a mean number of 8.75 alleles per locus through the genome of the entire populations. On the other hand, from all loci there was high inbreeding coefficient (f>0.5) except loci CNLTS 463 (0.3021) (Fig 1). In addition, the heterozygosity of the studied materials ranged from 0.to 0.69 with overall mean of 0.1457. 

Relatively the markers CNLTS463, CNLTS22, CNLTS27, and CNLTS20 have high heterozygosity and Shannon´s index. The germplasm showed the highest genetic differentiation (Fst = 0.27) and the lowest value of gene flow (Nm =0.69) from CNLTS458 (Fig 1). 

Four of the microsatellite loci (CNLTS22, CNLTS25, CNLTS27 and CNLTS463) did not significantly deviated from the HW-equilibrium over the entire populations.And  the overall polymorphic information, from all the microsatellite loci utilized was found profoundly instructive (0.96-0.98) (Fig.1)

Analysis of Molecular Variance 

The analysis of molecular variance (AMOVA) showed that 1% of the total genetic variation and low genetic differentiation (FST = 0.014) for the variation among populations, 86% among genotypes within populations and 13% variation among genotypes across all populations (Table 4). This study confirmed genetic differentiation (FIS =0.870) of genotypes within population and (FIT=0.872) among genotypes across all population (From 81 genotypes=one large grouped) with a small value of overall genetic differentiation coefficient (p = 0.001) (Table 4).     

On the other side, released varieties and core germplasm collection of overall genetic differentiation coefficient among the population (p = 0.832), Fst=-0.004 and gene flow (Nm: 0) were observed (Table 4). A total genetic variation of 87% and 13% were explained by within populations and among genotypes across all populations respectively (Table 4). 

Table 4: AMOVA results of 81 tef genotypes evaluated using 14 SSRs

Source

df

SS

MS

Est. Var.

%

Fstat

P-value

Among Pops

9

114.3

14.29

0.199

1%

Fst=0.014

0.002

Among Indiv

71

909.07

12.63

5.885

86%

Fis=0.87

0.001

Within Indiv

81

71.00

0.85

0.88

13%

Fit=0.87

0.001

Total

161

1094.37


6.85

100%



Gene flow   among population (Nm= 17.02)

Among Pops

1

12.01

12.01

0.00

0%

Fst=0.004

0.832

Among Indiv

74

944.61

12.775

5.92

87%

Fis=0.865

0.001

Within Indiv

76

70.00

0.92

0.92

13%

Fit=0.865

0.001

Total

151

1026.62


6.84

100%



Gene flow among core germplasm and released varieties (Nm=0)

Where, df = degree of freedom, SS=sum of squares, MS=mean squares and Nm=gene flow, FST=inbreeding coefficient among population, FIS=inbreeding coefficient with population, FIT=inbreeding coefficient of within groups (across all population=one)

 Genetic Distance and Gene Flow Among Populations

The pairwise population genetic distance and gene flow for both core germplasm and released varieties of tef genotypes are presented in (Table 5). The pairwise Fst standard genetic differentiation among populations varied from (0.000-0.036) which means 0,000 in   between Wollo and Wellega, Gojam and Wollo ,Gojam and Wellega, Wollo and released varieties, Wellega and released varieties  and  0.036  shows in between Arsi and Wellega. 

Table 5: Fst values (below diagonal) and gene flow (above the diagonal) among tef genotypes originating from nine sources

Origins

JIM

TG

GOJ

WO

WEL

E. S

W. S

AR

R. Var

JIM

0.000

7.435

29.00

63.830

20.547

32.239

17.402

8.045

99.402

TG

0.027

0.000

24.846

12.423

7.022

10.686

8.448

7.042

13.039

GOJ

0.009

0.010

0.000

0.000

0.000

31.000

8.674

8.161

100.00

WO

0.005

0.020

0.000

0.000

0.000

20,000

11.691

7.393

0.000

WEL

0.013

0.034

0.000

0.000

0.000

29.000

12.631

6.605

0.000

ES

0.008

0.027

0.008

0.012

0.010

0.000

21.196

10.788

73.356

WS

0.014

0.020

0.026

0.019

0.020

0.020

0.000

79.772

20.52

AR

0.029

0.031

0.030

0.035

0.036

0.030

0.003

0.000

13.073

RV

0.003

0.020

0.002

0.000

0.000

0.003

0.012

0.022

0.000

JIM=Jimma, TG =Tigray, GOJ= Gojam, WO= Wollo, WEL=Wellega, ES= East Shoa, WS= West Shoa, AR=Arsi and R. Var=Released Variety 

Allelic Diversity Among Populations

The number of observed alleles (Na) was higher for accessions collected from East Shoa, West Shoa and Wollo with values of 18.857, 12.57 and 10.714, respectively (Fig 2). Similarly, the germplasms from East Shoa, West Shoa and Wollo in that order have the highest numbers of effective alleles with 16.756, 11.592 and 9.732 and Shannon’s information indices with 2.855, 2.436 and2.306, and numbers of private alleles with 13.071, 8.500 and6.357. (Fig 2)

Small number of observed heterozygosity were observed for all of the populations. However, relatively high number of heterozygosity were noted for the lines from Jimma (0.193), Released varieties (0.19) Wellega (0.16) Tigray (1.5) and E.shoa (1.5) and the (Fig 2). And all core germplasm tef collection of populations used in this study showed similar value in percentage of polymorphism (100), except the materials originated from Arsi. (Fig 2)

Population Structure Analysis

 Cluster Analysis 

Cluster analysis based on UPGMA method clustered the 81 tef genotypes into three major groups consisting of 52%, 44% and 4% of the genotypes in clusters 1, 2 and 3, respectively (Figure.1A). It was observed that the genotypes in cluster 1 comprised in total of genotypes per population:16/25 from East Shoa, 4/7 from released varieties, 6/11 from Wollo, 3/4 from Wellega, 3/4 from Gojam, 1/5 from Arsi, 1/16 from Jimma, 3/4 from Tigray and 5/15 from West Shoa. 

The genotypes in second cluster included 10 from West Shoa, 4 from Arsi, 7 from East Shoa, 2 from Wollo, 3 from the 7 released varieties 1 of the 4 from Tigray, and 4 from Jimma. On the other hand, only 3 of the germplasm lines from Wollo are grouped in cluster 3

Principal Coordinate Analysis

PCoA showed that the first three coordinates count in anti clockwise direction  explained 93.34% of the gross genetic variation. The gross variation explained was 65.45%, 22.93% and 4.96% by the first, second, and third principal coordinates, respectively (Fig. 3). 

Population Structure

The most function of structure in population hereditary qualities is the recognizable proof of hereditarily homogeneous bunches of individuals based on modular value of ΔK. These are determined by Bayesian algorithm implemented in the software STRUCTURE based on true number of cluster (K) suggested by Evanno et al. (2005). In this study the algorithm allows estimating the true number of clusters at (K=4) (Figure 3a) in the sample of 81 individuals included in the study. Which means that these samples (81 individuals) come from four ancestors. But the bar plot appeared that there is no clear geographic origin-based organizing of population this is what indicates there is admixture of tef genes. 

Discussion

Allelic Variance from Loci in Populations 

The allele number that found in this study relatively closer to the earlier reports with ranges of 8 to 23 (Zeid et al., 2012). However, these were different when compared to the reports of total number of alleles (164) using 10 SSR (Tesfaye et al., 2016), 168 allels using 10 SSR (Jifar et al., 2020) and, 52 using 9 SSR markers (Fikre et al., 2018). In the other case, the genetic distance in the present study is relatively high but with a narrower range than those reported by Zeid et al. (2012) 0.75-0.94 and Tesfay et al. (2016) 0.65-0.94. This shows the marker used from this study have high polymorphism and results to detect the differentiation of genotypes appropriately.

Regarding to inbreeding coefficient all loci show f>0.5 except loci CNLTS 463(0.3021). And as compared to the expected heterozygosity, the observed heterozygosity is less. This is dissimilar from the report of higher heterozygosity (Fikre et al., 2018). The cause is the differences in test genotypes used in the studies and due to the fact that tef is autogamous and improvement of tef using intra-specific crossing has not been yet exploited from most of the germplasm collections that included from this study. Here in even though the observed heterozygosis and genetic differentiation is not in line with the high range heterozygosity of in previous study of Fikre et al. (2018), the loci CNLTS463, CNLTS22, CNLTS27 and CNLTS20 have relatively high genetic variation in all the test genotypes. This gives chanced for breeders to alleles that are linked for desirable traits and then helps to establish new breeding strategy. 

In the other case, four of the microsatellite loci (CNLTS22, CNLTS25, CNLTS27 and CNLTS463) did not significantly deviate from the HW-equilibrium over the entire test genotypes. This may be due to gene flow or inbreeding as it implies genetic drift within populations. This in turn reduces or increases the amount of genetic diversity within the gene pool (Kleinhans et al., 2019). In terms of the general polymorphic data, all the microsatellite loci utilized were found profoundly instructive (0.96-0.98) and all loci have greater than 0.5. The range is different from the previous reports of 0.75 to 0.91 (Zeid et al., 2012); 0.67 to 0.94 (Tesfaye et al., 2016), and 0.36 to 0.93 (Jifar et al., 2020). 

Besides the markers used in the study are highly informative, and have high allelic variation in loci that are distributed within tef core germplasms and released varieties. Consequently, these markers would be useful for further genetic analysis diversity and interrelationships as well as linkage map and QTL mapping. And the study indicates private alleles are parallel with the number of genotypes within a population. This line with the study of Jifar et al. (2020), and the observed high number of private alleles will be important for further improvement of the tef crop. 

 Compared to the expected heterozygosity, low observed heterozygosity values were observed from all the test genotypes. However, relatively high heterozygosity occurred for germplasm lines from Jimma and Tigray, and also released varieties. Hence a good differentiation of genes may be from those populations. This will be source of good traits which help to design breeding program.

 Genetic Differentiation of the Population 

low genetic differentiation of among population from this study is due high variation of the genotypes within population. This has little bet discrepancy with the study of Jifar et al. (2020) and Tadesse et al. (2021) which is 3% and 5% respectively. But majority of the variation, is described among genotypes within one population and secondly explained by the distinction of among genotypes across all populations (within 81 genotypes =within a large group). These results show little bet high variation within population as compared to in earlier tef studies depicted Jifar et al. (2020), Tadesse et al. (2021) respectively. While it shows low variation of genotypes across all population comparing to the study of Jifar et al., (2020); Tadesse et al. (2021).

The Fst value ranged from 0 to 0.05 is considered as low, 0.05 to 0.15 moderate, and 0.15 to 0.25 large and greater than 0.25 very large genetic differentiations (Wright, 1951). Comparing to this, the current results showed low FST, (0.000-0.036)) genetic differentiation in pair of population which is low. While high genetic differentiation (within population and among individuals of genotypes across all populations have been observed in this study. The low genetic differentiation among populations is may be due to   presence of high gene flow that led to high genetic differentiation within population. Those which have high gene differentiation of genotypes within population and among genotypes across all populations provides ample opportunities to select lines having good potentials to adapt different environments (Bekana et al., 2021).  

On the other hand, in this study, the overall genetic differentiation of released variety versus core germplasm collections is very low among the population and the gene flow is (Nm: 0). Here in absence of gene flow is because of new varieties were released recently whereas, the core germplasm was collected earlier around 1994 (Berhe, 1995). 

Clustering and Population Structure Analysis 

Cluster analysis is used to establish the relationships among the genotypes based on their distance matrix. Based on this, the UPGMA method, the study confirms the reports of Fikre et al. (2018) and Tadesse et al. (2021), who obtained three clusters of the tef test genotypes whereas Jifar et al. (2020) obtained 5 clusters for 144 genotypes. Here indicates the number of genotypes and its diversity that used from study have influenced the number of clusters. And the patterns of grouping of the genotypes through all clusters obviously showed the existence of moderate to significant gene flow along geographical borders. This is due to contiguity and interchange of germplasm resources. Besides, for small seeded crops like tef, it is very hard to maintain truly pure seed stocks because admixtures can occur during any step of the seed production and handling process unless extreme care is practiced (Zeid et al., 2012). 

On the other case the cluster of collected core germplasm versus release variety, the released varieties are distributed from different clusters and sub clusters rather than forms one cluster except Tesfa and Eba. This indicates, the variety may be developed for different parents which found in different agro ecologies whereas the variety Tesfa and Eba may have similar parents. 

The most function of structure in population hereditary qualities is the recognizable proof of hereditarily homogeneous bunches of individuals. In PCoA of in this study  showed that individuals make three groups with  three coordinates explained 93.34% of the genetic variation. The gross variation explained in this study is unlike from the previous study of Fikre et al. (2016), 29% of gross variation 12%, 10%,7% Tadesse et al. (2021); 30.72% explained 13.13 %, 9.20%, 8.39% and Teshome (2019) 14.85% of the total variation explained  5.57%, 4.95% and 4.33%  . 

In the other case, based on Bayesian algorithm ,the structure of the individuals clustered into four since the algorithm allows estimating the true number of clusters at (K=4) (Figure 3a) in the sample of 81 individuals based on suggested by Evanno et al. (2005).Which means that these samples come from four ancestors. But the bar plot appeared in the admixture indicates the used sample of individuals are admixture and requires revision for its collection.  

Conclusion And Recommendation

In conclusion, sufficient genetic variation offers alternatives from which selections are made for improvement of crop. A success in crop improvement depends  on the quantity of genetic variation that exist in the genetic materials and selection for genetically superior genotypes.

In this study higly polymorphic and informative markers were used .Hence larger number of allels are generated.  Among these, there may have  private  allels which might be linked to important agronomic traits. And it  indicates there is relatively high heterozygosity occurred for germplasm lines which is found in  Jimma ,Tigray and released varieties .Those may have allels which linked to good traits which help to design breeding program. While  we observed lower heterozygosity than  expected heterozygosity. To increase hetezygosity,  artificial intracross and a broad variablity through artificial intercrossing should be supported by in vitro embryo rescue. And induced mutation will aslo help to increase the variablity of tef. As the population structure indicates from this study the genotypes found in all population are admixture which come from four ancestors. Hence re-collect the genotypes based on its origin may be good to conserve

The study did not cover all admistrative zones due to lack of the passport data.To facillitate the breeding for diffrenet environment, core germplasm collection should be revised and collected based on agroecologies

Declarations

Acknowledgements

The principal author acknowledges the assistance and support of the Ethiopian Institute of Agricultural Research (EIAR), Debre Zeit Agricultural Research Center and its staff, Holetta National Agricultural Biotechnology Research Center and its staff, Mr. Obssi Desalegn, Dr.Solomon Chanyalew, Prof.Gabriel Schaaf and his PhD student Yeshambel Emwedih,at Rhenish Friedrich Wilhelm University of Bonn, and Wagaw Sendeku (PhD) Student at Addis Ababa University.

Author contributions

Kebebew A. and Derejaw T. conceived the study; Derejaw.T did the whole lab work, data analysis and drafted the manuscript; Tileye F. and Kebebew A. give suggestion follow up the work and revised the manuscript; all authors reviewed the manuscript and gave final approval for publication

Funding 

funding provided for working was by Ethiopian institution agricultural research and Addis Ababa university.

Conflict of Interest          

No conflict of interest.

Data Availability Statement

The data that support the findings of this study are openly available in the Science Data Bank at https://www.doi.org/ [DOI 10.57760/science dB. j00143.00070] and Addis Ababa university library 

References

  1.  Assefa K,Merker A, Tefera H (2003a). Inter simple sequence repeat (ISSR) analysis of genetic diversity in tef (Eragrostis tef (Zucc.) Trotter). Hereditas139:174–183. 
  2.  Assefa K, Cannarozzi G,  Girma D,   Kamies R,  Chanyalew S, Plaza-Wüthrich S, Regula Blösch R,   Rindisbacher A, Rafudeen S, and Tadele Z (2015). Genetic diversity in tef [Eragrostis tef (Zucc.) Trotter]. Front. Plant Sci .6:177
  3. Assefa K, Ketema S, Jifar H,  Tesfaye K, Assefa K, Chanyalew S, Tadele Z (2017) SemiDwarf Tef (Eragrostis tef) Lines for High Seed Yield and Lodging Tolerance in Central Ethiopia. Afric. Crop Sci. J., 25 (4): 419 - 439. 
  4.  Ayele M, Tefera H. Assefa K, Nguyen H (1999). Genetic characterization of two Eragrostis species using AFLP and morphological traits. Hereditas130: 33-40
  5.  Bai G, Ayele M, Tefera H, Nguyen H.T(1999). Amplified fragment length polymorphism analysis of tef [Eragrostis tef (Zucc.) Trotter]. Crop Sci. 39: 819-824.
  6.  Bai G, Ayele M, Tefera H, Nguyen H (2000). Genetic diversity in tef [Eragrostis tef (Zucc) Trotter] and its relatives as revealed by random amplified polymorphic DNAs. Euphytica112: 15–22.
  7. Baye K (2014). Synopsis: Tef: nutrient composition and health benefits. Ethio. Stratag Support Program, 2013–2014.
  8. Bekana T (2021). Role and economic importance of crop genetic diversity in food security. Int.l J. of Agril. Sci. and Food Techno.  7: 164–169. 
  9. Cannarozzi G,   Girma D (2014). Genome and transcriptome sequencing identifies breeding targets in the orphan crop tef ( Eragrostis tef ), (August).   concentration of six cultivars of tef [Eragrostis tef (Zucc.) Trotter]. Ethiopian J. Nat. Resour. 4, 21–36
  10. Chanyalew S, Kebede W, Fikre T, Genet Y, Jifar H, Demissie M, Tolossa K,Tadesse M, Tadele Z, Assefa K (2021). Tef Breeding Manual. Ethiopian Institute of Agricultural Research, Debre Zeit Agricultural Research Center, National Tef Research Program, Debre Zeit, Ethiopia
  11. Costanza  S, Dewet J, Harlan J (1979). Literature‐review and numerical taxonomy of Eragrostis tef (t’ef). Economic Botany33: 413–424
  12. CSA (2021). The Federal Democratic Republic of Ethiopia, Agricultural Sample Survey 2020/21 (2013 E.C.), Volume I, Report on Area and Production of Major Crops (Private Peasant Holdings, Meher Season), Statistical Bulletin 586, April 2021, Addis Ababa, Ethiopia
  13. D'Andrea, A (2008). Tef (Eragrostis tef) in ancient agricultural systems of highlands of Ethiopia. Economic Botany62: 547-566 
  14.  Fikre T,   Tesfaye  K, Assefa K   (2018). Genetic Diversity of Ethiopian Tef [( Eragrostis tef ( Zucc .) Trotter ] Released and Selected Farmers ’ Varieties along with Two Wild Relatives as Revealed by Microsatellite Markers, J. of Crop Sci. and Biot.21(4):367–374.
  15.    Girma D, Cannarozzi G (2014.). Genotyping by sequencing reasserts the close relationship between tef and its putative wild Eragrostis progenitors. Diversity 10: 17 
  16.  Girma D , Assefa K , Chanyalew S, Cannarozzi  G, and Kuhlemeier C  (2014). The origins and progress of genomics research on tef (Eragrostis tef), Plant Biot. J. 12: 534–540 
  17. Govindaraj M,Vetriventhan, Srinivasan M (2015). Importance of genetic diversity assessment in crop plants and its recent advances: an overview of its analytical perspectives. Genetics Res. Inter: Vol .2015 
  18. Harrower M, Mccorriston  J,   Andrea A (2010). General/specific, local, global /comparing of the begnning of agriculture in the horn of Africa (Ethiopia, Eriteria) and south west Arabiya (Yemen) . American Antiquity5(3): 452–472.
  19. Jansen  G,   DiMaio L, Hause N (1962). Amino acid composition and lysine supplementation of teff. J. Agri. Food Chem. 10: 62-64.
  20. Jansen G, DiMaio L, Hause L (1962). Amino acid composition and lysine supplementation of teff. J. Agri. Food Chem. 10: 62-64.
  21.  Jifar H, Tesfaye K, Assefa K  (2020). Advances in Crop Science and Technology Genetic Diversity and Population Structure of Tef [ Eragrostis tef ( Zucc .) Trotter ] as Revealed by SSR Markers. Advan.   in Crop Sci. and Tech. 8(1)
  22. Ketema S (1993). Tef (Eragrostis tef): Breeding, Genetic Resources, Agronomy, Utilization and Role in Ethiopian Agriculture. Institute of Agricultural Research, Addis Ababa, Ethiopia.
  23. Kleinhans  C,   Willows-Munro S (2019). Low genetic diversity and shallow population structure in the endangered vulture, Gyps coprotheres. Scientific Reports9(1): 1–11. 
  24. Mengesha H  (1966). Chemical composition of tef (Eragrostis tef) compared with that of wheat, barley and grain sorghum. Econ. Bot. 20:268-273.
  25. Merga M (2018). Achievements and Challenges of Tef Breeding in Ethiopia. J Agri Sci Food Res. 9:1 
  26. Nei  M  (1972). Genetic distance between populations. American Naturalist106: 283-292.
  27. Nei  M  (1978). Estimation of average heterozygosity and genetic distance from small number of individuals. Genet.89: 583-590. .
  28. Peakall  R, Smouse P (2015). GENALEX6: Genetic Analysis in Excel. Population genetic software for teaching and research. Mol.Ecol. 6: 288-295.
  29. Perrier X, Jacquemoud-Collet J  (2010). DARwin software http://darwin.cirad.fr/.
  30. Ponti J (1978). The systematics of Eragrostis tef (Graminae) and related species. PhD Thesis, University of London, London, UK 
  31.  Pritchard JK, Stephens M, Donnelly P (2000). Inference of population structure using multilocus genotype data. Genetics155: 945–959.
  32. Romero  M, Mujica A, Pineda  E, Ccamapaza  Y, Nacional U,   de Biología E P  (2019). Genetic identity based on simple sequence repeat (SSR) markers for quinoa (Chenopodium quinoa Willd.). Cien. Inv. Agr. 46(2):166-178..
  33. Saturni L, Ferretti G,  Bacchetti T (2010). The gluten-free diet: safety and nutritional quality. Nutri.2: 16-34. 
  34. Smale M, Aquino P, Crossa J, del Toro  E, Dubin  J, Fischer T, Fox P, Khairallah M, Mujeeb-Kazi A, Nightingale K, OrtizMonasterio I, Rajaram S, Singh R, Skovmand B, van Ginkel M,Varughese G , Ward R (1996). Understanding global trends in the use of wheat diversity and international flows of wheat genetic resources. Economics Working Paper 96-02. Mexico, D.F.: CIMMYT. 61pp.
  35. Tadesse  M, Kebede M,   Girma D (2021). Genetic diversity of tef [Eragrostis tef (Zucc.) Trotter ] as revealed by microsatellite markers, 2021. Int. J. of Genomics, Vol. 2021: 
  36. Tesfay M, Shimellis H, Nark D, Assefa A (2016). Assessment of the genetic relationship of tef (Eragrostis tef) genotypes using SSR markers. S Afr J Bot.Sci. 105: 106-110
  37.  Teshome A. (2019). Morphological and molecular characterization selected of (Eragrostis tef (Zucc.) Trotter) Accession from the gene bank of Ethiopia. MSc. Thesis, Addis Ababa University, Addis Ababa, Ethiopia
  38. Vohwinkel F, Piepho HP, Heiligtag B, Richter C (2002).Comparative yield and nutrientmconcentration of six genotypes of tef (Eragrostis Tef (Zucc.)Trotter).Ethiop. J. Nat.Resour. 4, 21–36
  39. Zeid M, Assefa K, Haddis  A, Chanyalew S,  Sorrells M (2012). Field Crops Research Genetic diversity in tef ( Eragrostis tef ) germplasm using SSR markers. Field Crops Res. 127, 64–70
  40. Zeid, M., Belay, G., Mulkey, S., Poland, J., and Sorrells, M. (2011). QTL mapping for yield and lodging resistance in an enhanced SSR-based map for tef. Theor Appl Genet122:77–93.