Morphological and microsatellite DNA diversity of Djallonké sheep in Guinea-Bissau

Background The present study aimed at characterizing the Djallonké Sheep (DS), the only local sheep breed raised in Guinea-Bissau. A total of 200 animals were sampled from four regions (Bafatá, Gabú, Oio and Cacheu) and described using 7 visual criteria and 8 measurements. These parameters have been studied by principal components analysis. The genetic diversity and population structure of 92 unrelated animals were studied using 12 microsatellite markers. Results The values of quantitative characters in the Bafatá region were significantly higher than those obtained in the other three regions. A phenotypic diversity of the DS population was observed and three genetic types distinguished: animals with “large traits” in the region of Bafatá, animals with “intermediate traits” in the regions of Gabú and Oio and animals with “small traits” in the Cacheu region. The hair coat colors are dominated by the white color, the shape of the facial head profile is mainly convex and the ears “erected horizontally”. Most of the morphobiometric characteristics were significantly influenced by the “region” and “sex of animals”. The average Polymorphism Information Content (PIC) of 0.65 ± 0.11 supports the use of markers in genetic characterization. Gabú subpopulation had the highest genetic diversity measures (He = 0.716 ± 0.089) while Cacheu DS subpopulation presented the smallest (He = 0.651 ± 0.157). Only Gabú and Bafatá subpopulations presented significant heterozygote deficiency across all loci indicating possible significant inbreeding. Mean values for FIT, FST, FIS and GST statistics across all loci were 0.09, 0.029, 0.063 and 0.043 respectively. The overall genetic differentiation observed between the four DS subpopulations studied was low. Bafatá and Gabú are the most closely related subpopulations (DS = 0.04, genetic identity = 0.96) while Bafatá and Cacheu were the most genetically distant subpopulations (DS = 0.14, genetic identity = 0.87). Using Bayesian approach, the number of K groups that best fit the data is detected between 2 and 3, which is consistent with the morphological analysis and the factorial analysis of correspondence. Conclusions The molecular results on DS population of Guinea-Bissau confirmed the ones obtained with morphological analysis. The three genetic types observed phenotypically might be due to a combination of the agro-ecological differences and the management of breeding rather than genetic factors. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-021-01009-7.


Conclusions:
The molecular results on DS population of Guinea-Bissau confirmed the ones obtained with morphological analysis. The three genetic types observed phenotypically might be due to a combination of the agro-ecological differences and the management of breeding rather than genetic factors.

Background
Livestock is an important source of income, livelihoods, nutrition and food security, as well as resilience in sub-Saharan Africa [1]. In the Republic of Guinea-Bissau, like other West African countries, the economy is dominated by the primary sector (agricultural production) with a contribution of approximately 62% to the Gross Domestic Product (GDP) and agriculture contributes to creating around 95% of jobs [2]. Livestock sector represents the second economic activity after agricultural crops and contributes to 17% of national GDP and 32% of agricultural GDP [3]. In its various forms, livestock occupies 72% of the rural population through multiple functions (economic, social, reserve and savings capital, labor power and improving soil fertility) [4].
The livestock population in Guinea-Bissau is relatively large, very diverse and includes cattle, goats, sheep, pigs, poultry and other animal species [5]. The farming system practiced is of extensive agro-pastoral type with certain specificities depending on the region.
Despite the socio-economic importance of livestock sector in Guinea-Bissau, the animal genetic resources are under-exploited and less valued. In recent years, the contribution of the livestock sub-sector to GDP decreased to 3.5% of national GDP and 7.8% of agricultural GDP [4]. The authors explain this decline by an absence of effective and sustainable strategies for the management of animal genetic resources despite the great potential and assets available to the country. The development of an efficient management strategy of domestic animal genetic resources in Guinea-Bissau requires the characterization and inventory of these genetic resources in order to guide decision-making [6,7].
In Guinea-Bissau, small ruminants are important in animal husbandry and play a social and nutritional role. Indeed, they are commonly used as a source of protein during social and religious ceremonies (birthday celebrations, baptisms, funerals, weddings) and constitute a savings strategy [8]. They are among the most dominant domestic animal species in the east and north of the country. Djallonké sheep (DS) represents the main local sheep breed of Guinea-Bissau. Despite their appreciation (hardiness, resistance, trypanotolerance, prolificacy and sexual precocity), information on the phenotypic characteristics is very little documented while the molecular characterization has never been done. The goal of the present study was to improve the knowledge on the local sheep genetic resources of Guinea-Bissau in order to develop sustainable strategies for their development. The specific objectives of this study were to determine the morphobiometric characteristics and to evaluate the genetic diversity of the local DS population in four regions in Guinea-Bissau.

Morphological characterization Quantitative characters
Basic statistics of quantitative traits in DS subpopulations in the four regions are presented in Table 1.
The values of the Chest Girth (CG), Chest Depth (CD), Height at withers (HW), Ear Length (EL) and Tail Length (TL) in Bafatá subpopulation were significantly higher (KW test, P < 0.001) than those of Cacheu, Gabú and Oio regions. In addition, the animals from the Bafatá region had significantly higher Body Length (BL) (ANOVA, P < 0.001) than those from other regions. The "region" or "location" had a significant effect on the most of the quantitative body characters of the DS in Guinea-Bissau as presented in Table 1, excepted the following traits: "Horn Length" and " Interval Length between the roots of the two horns". Three genetic types of DS were distinguished in the four regions: the type with "large traits" for animals in the Bafatá region, the type with "small traits" for animals in Cacheu region and the type with "intermediate traits" for animals in the Gabú and Oio regions. The three genetic types were revealed by the Principal Components Analysis (PCA). The Fig. 1 shows the individuals of Bafatá (black), the individuals of Cacheu (red) and a more heterogeneous population in Gabú (green) and Oio (blue).
In the studied population, 81.5% of animals sampled were females against 18.5% of males and all were 2 to 4 years. A sexual dimorphism was observed for some body parameters. Female animals had higher BL, CG and CD than their male counterparts (Table 1). Contrariwise, male animals had higher Horn Length and Interval Length between the roots of the two horns than the females.

Qualitative characters by region
Values of the qualitative characters of the DS by region are presented in Table 2. In the Gabú, Cacheu and Oio regions, the uniform white body coat color was predominant with 81.67, 76.00 and 50.00% respectively. In Bafatá, the eumelanin-black color with tan belly (49.33%) and the uniform white (37.33%) and then the pheomelanin-brown and tan belly (13.33%) were mainly found. The uniform red/fawn was not observed in this study. The type of melanin observed had a significant link with the region (Chi 2 -test, P < 0.001). For the coat color patterns, the uniform white pattern characterized the DS in Gabú, Cacheu and Oio regions, while in Bafatá region the patchy (white-black or white-red/fawn) and the spotted (white color with some black or red/fawn spots without regular distribution) patterns were mostly observed in the proportions of 37.33 and 33.33%, respectively. The patchy pattern with badger face, plain black/brown, black/brown and tan white belly patterns were observed in the Bafatá and Oio regions. Figure 2 illustrates the coat color patterns of black/ brown and tan, spotted pattern, patchy (white-black/ white-fawn) and uniform white color.
In Cacheu, Gabú and Oio regions, all the animals carried horizontally erected ears, while 2.67% of the animals in Bafatá region had semi-pendulous ears. The facial (chamfer) profile of animals was predominantly convex.
The different horn shapes and orientations observed in the DS are presented in Table 3. No significant difference was observed between the regions (P = 0.056).
The sexual dimorphism was observed for the horn presence and the chamfer profile (Table 3). Indeed, all males were horned against only 6.75% of horned females among which 45.45% were in the form of stumps.

Molecular genetic diversity
The number of alleles (Na), the allelic richness (AR), the expected (He) and observed (Ho) heterozygosities per locus and per DS subpopulation (region) are presented in Table 4. The 12 microsatellite loci used were polymorphic and a total of 89 alleles were detected. The allelic diversity was characterized by the number of alleles ranging from 3 (MAF214) to 10 (MAF10), with an average of 7.42 ± 2.19. The allelic richness estimated using rarefaction method ranged from 2.57 (SRCRSP1) to 4.49 (ILSTS5), with an average of 3.59 ± 0.67. Subpopulations from Bafatá and Gabú had higher genetic diversity with He values of 0.716 ± 0.089 and 0.697 ± 0.094, respectively compared to those from Oio (0.655 ± 0.143) and Cacheu  Table 4 Number of alleles (Na), allelic richness (AR), expected (He) and observed (Ho) heterozygosities per loci in the four subpopulations of Djallonké sheep.
The effective Ae, the Polymorphic Information Content (PIC) and the F-Statistics (F IT , F ST , F IS ) according to Weir and Cockerham (1984) for all the microsatellite markers analyzed over the four DS subpopulations are presented in Table 5. The effective Ae varied from 2 (SRCRSP1) to 5.24 (ILSTS5) with an average of 3.52 ± 1.04. SRCRSP1 locus was the lowest informative with a PIC of 0.45 while ILST S5 locus presented the highest value of PIC (0.78) and the average value was 0.65 ± 0.11.
The mean values of F IT , F ST , F IS were 0.09, 0.029 and 0.063, respectively. Values of G ST ranged from 0.015 for MAF65 to 0.152 for OarJMP58, with a mean of 0.043 showing that the gene variation among subpopulations is still low. The F ST value (0.029) showed that most of the total genetic variation corresponds to differences among individuals within subpopulation (97.10%) and only 2.90% result from differences among subpopulations.
The overall estimate of F IS was 0.063 ± 0.029. The subpopulation-wise F IS estimates were significantly (P < 0.01) greater than zero in Bafatá and Gabú subpopulations, suggesting a deviation from HWE ( Table 6). The exact tests also showed a significant deviation from HWE for some markers in the different subpopulations.
The overall differentiation level of the subpopulations was very low (F ST = 0.029 ± 0.016). Among the four subpopulations, the lowest genetic distance was observed between Bafatá and Gabú subpopulations (0.0406) and the highest between Bafatá and Cacheu subpopulations (0.1412). The genetic distances and the genetic identity according to Nei (1978) are summarized in Table 7.
From the unrooted neighbor-joining tree constructed using the genetic distances (Fig. 3), the subpopulation from Cacheu region relatively differed from the three other subpopulations.

Genetic structure of subpopulations by factorial correspondence analysis
The factorial correspondence analysis (Fig. 4) clustered the studied population in three groups: group 1 with Using Bayesian approach implemented in Structure Software and Evanno method [9], the number of K groups that best fit the data is detected between 2 and 3 ( Fig. 5).

Morphological diversity
Quantitative characters DS in Guinea-Bissau can be classified into three "genetic types" associated to three the "large animals" in the Bafatá region, "intermediate traits" for sheep in the Gabù and Oio regions and "small animals" in the Cacheu region. Indeed, the average values of the quantitative characters (CG, CD, HW, BL, EL and TL) of the Bafatá DS subpopulation were significantly higher than those obtained in the Gabú, Oio and Cacheu regions. This gradient in the size of the morphological traits could be explained by the differences in the agro-ecological conditions, the farming practices and genetic background. In fact, the agro-ecological area of the North-East, which includes the Bafatá, Gabú and Oio regions, is characterized by savannah trees and clear forests, which offer rich natural pastures to pastoralists who are Fulani and Mandingos. Moreover, the livestock is dominated by ruminant species. Contrariwise, in the North-West agro-ecological zone including the Cacheu region, ruminant species (sheep, goat and cattle) are mainly raised for ritual ceremonies by breeders who are rather animistic [10]. In addition, this zone is covered with wooded savannahs and dense forests hardly accessible by animals, hence the predominance of the sedentary system in the Cacheu region. At the cultural level, Bafatá region is mainly populated by Fulaspeaking people, practicing the Muslim religion and traditionally attached to animal husbandry compared to the other regions (Cacheu and Oio) where the populations are strongly Christianized and more attached to pig farming. The Bafatá region is also a large area of ruminant species concentration during the transhumance period and hosts the most important livestock market in the country. This region generally receives animals from Gabú and both Gabú and Bafatá regions have more than 70% of the country's ruminant livestock [4]. During the dry season (November to May), ruminants from the Gabú region migrate to the Bafatá and Oio regions [11]. Sheep from the Cacheu region had the smallest size in the study area. In fact, Cacheu is one of the regions of the North-West agro-ecological zone with high humidity   favorable to parasitism and vectors of pathogens such as tsetse flies which transmit the trypanosomes causing African animal trypanosomosis. DS subpopulations of the Gabú and Oio regions were highly heterogeneous with an "intermediate genetic type", probably due to the introduction of improving rams in these regions in the past [12]. This heterogeneity is observed not only between regions but also within region (Fig. 1). The effect of the agro-ecological zone on the morphological types of ruminants, especially sheep, has been previously reported in Côte d'Ivoire in DS [13], in Senegal with Peul-peul (Fulani) sheep [14] and in Togo in Vogan Sheep and DS [15]. A recent morphobiometric characterization of DS in the sudano-guinean zone of Cameroon revealed three genetic types [16] as observed in the present study in Guinea-Bissau. In Burkina Faso, Traoré et al. [17] described a sheep population named "Mossi sheep" which is a savannah DS found in an agro-ecological zone between the sudano-sahelian zone and the sudano-guinean zone with an "intermediate type" between DS and sahelian sheep.
The average values of HW obtained (55.67 ± 4.16 cm for the Bafatá region, 54.67 ± 3.44 cm for the Oio region, 53.44 ± 2.85 cm for the Cacheu region and 53.23 ± 3.77 cm for Gabú region) are closed to those reported by Dayo et al. [15] in DS in Togo (HW = 54.63 ± 8.23 cm; BL = 58.47 ± 6.30 cm and CG = 74.72 ± 8.28 cm) and Sangaré [18] in DS in West Africa and Gueye [19] in Senegal. Similar results have also been reported in other populations of DS in Ghana (HW = 57.06 ± 0.28 cm; BL = 54.87 ± 0.35 cm and CG = 69.19 ± 0.41 cm) by Birteeb et al. [20] and Asamoah-Boaheng and Sam [21] and in Côte d'Ivoire (HW = 59.60 ± 5.40 cm; BL = 57.80 ± 5.40 cm and CG = 70.80 ± 6.50 cm) by N'Goran et al.   [13]. However, the values of the present study were higher than those previously reported by Hadzi [22] in DS in Togo and in Guinea-Bissau [8]. These results could be explained by the differences of climatic conditions of the agro-ecological zones in which these studied populations are bred, the study periods of the year (season effect), the farming systems or the genetic variability that could be observed between DS populations across the countries. It has been reported the existence of two sub-categories of DS [23,24] and DS of savannah are larger than those of forest zones [25], demonstrating once more the effect of the agro-ecological zone on the morphological type of this sheep breed. The tail of the DS is thin and relatively long. The average TL (28.06 ± 4.36 cm) is similar to those reported by N'Goran et al. [13] in DS in Côte d'Ivoire (24.70 ± 3.40 cm) and in Togo (27.47 ± 8.05 cm) [15]. This TL is longer than those reported in the DS (West African Dwarf) by Gbangboche et al. [25] in Benin (17 cm), in Nigeria (19.42 ± 0.63 cm) [26] but shorter than those of the Sahelian sheep (48.20 ± 5.37 cm) and Vogan sheep from Togo (45.24 ± 6.23 cm) [15].
Concerning the ear length, the value obtained (9.85 ± 1.12 cm) is similar to value reported by Gbangboche et al. [25] in West-Africa, who found that DS has small ears, about 10 cm. However, the value in the present study is lower than those reported in DS in West Africa: 13.03 ± 0.39 cm in Nigeria [26], 11.61 ± 2.61 cm in Togo [15]) and in the Peul-peul (Fulani) sheep (13.30 ± 1.20 cm) in Senegal [14]; and significantly shorter than those recorded in Vogan sheep (18.45 ± 2.08 cm) and Sahelian sheep (21.63 ± 2.48 cm) [15]. No sexual dimorphism was observed for this trait contrary to Gueye [19] who showed that male sheep and goats had slightly longer ears than females in Senegal.

Qualitative characters
The coat color pattern in DS in Guinea-Bissau is dominated by the uniform white pattern and the spotted white and brown / fawn pattern in all regions. In the Bafatá region, the frequency of the spotted pattern is higher than in the other three regions. Indeed, for the Muslim populations in Bafatá and Gabú regions, the rams are preferentially slaughtered while the uniform white or spotted ewes are kept for the reproduction in order to have the offspring with white coat color. This explained the presence of only few rams in most of herds. The higher proportion of animals with uniform white color pattern could also be due to a strong selection of animals expressing the white coat color to meet the livestock market demands (higher price than other coat colors) and the cultural preference in the country (religious sacrifices or gifts during baptism celebrations and the "Eid El-Kebir" (Tabaski) celebration or for the dowry). The preferences for the coat color of animals differ from one society to another. For example, in southern Ethiopia, red coat color for ewes is the most suitable for market demands [27]. In Côte d'Ivoire, the DS had at 55.00% patchy white-black coat color compared to 24.00% uniform white coat [13], and only 5.88% of the DS were white in southern Togo [15]. This diversity for coat color in DS in West Africa is linked to the choices made by the societies in which these animals are raised. In Ferlo zone in Senegal, the dominant coat color of the Peul-peul sheep has evolved from patchy (whiteblack or white-red/fawn) [19] to spotted of white and black / red/fawn [28].
The ears of DS in Guinea-Bissau are mostly erected horizontally and only 2.67% of animals in the Bafatá region had slightly drooping ears. These results agree with those of Dayo et al. [15] in DS from Togo (86.27%) and N'Goran et al. [13] in Côte d'Ivoire (87.00%). Drooping ears in DS are considered to be the result of Sahelian sheep genes introgression [13,15]. Thus, the presence of animals with slightly drooping ears in the Bafatá region (2.67%) could be explained by crosses occurred with Sahelian sheep from neighboring countries, especially from Senegal.
Sexual dimorphism has been observed for the presence of horns with only 6.75% females horned in our study.  This proportion is higher than the 2.30% often reported for ewes wearing horns (most are stumps); but lower than the 14.60% of Mossi ewes carrying horns in Burkina Faso [17]. The horns are developed for rams and absent or in stumps in ewes. In the current study, the most of horned ewes were from the regions of Bafatá and Oio where small ruminants and cattle move during the transhumance in the dry season [11]. Horned ewes are thought to have come from crossing with transhumant animals. It is important to highlight that in half of these ewes, the horns are in stumps. The horn shapes were significantly different according to the zone: horns laterally straight were the most observed in Bafatá, Gabú and Oio regions while spiral horns facing forward predominated in the forest and humid Cacheu region similarly to the one reported by Dayo et al. [15] in the south of Togo.

Molecular genetic diversity
The current study provides the first information on molecular genetic characterization of DS in Guinea-Bissau and is complementary to the morphological characterization of this breed. This study presents a comprehensive genetic analysis of DS, the assumed only sheep breed of Guinea-Bissau, from four administrative regions covering two agro-ecological zones. The genetic diversity of subpopulations was influenced by the sociocultural practices and agro-ecological zones. Similar observations were reported by prior studies in West African DS [29]. Indeed, these authors had reported that Malian, Gambian and eastern Guinean DS populations had higher genetic diversity than those from Senegal and southern and western Guinean using expected heterozygosity (He) and the mean number of alleles (Na). Based on the He, Cacheu and Oio DS subpopulations would be closer to Senegalese, Gambian southern and western Guinean populations while Bafatá and Gabú DS presented similar expected heterozygosities to Malian and eastern Guinean DS. The Na in the current study (7.42 ± 2.19) was similar to those obtained by Wafula et al. [29] in Guinean and Malian DS and Agaviezor et al. [26] in West African Dwarf sheep in Nigeria. However, the allelic richness (adjusted mean number of alleles) values were lower than those reported by Wafula et al. [29] and Agaviezor et al. [26] and probably due to the small sample size used for genotyping in our study.
Genetic structure of the population Using different population differentiation parameters (F ST , G ST , genetic distance, genetic identity) and representation (NJ Tree and FCA), our results showed that the population differentiation over the 4 subpopulations is very low since the multi-locus F ST and G ST values indicated that only 2.9 and 4.3% respectively of the total genetic variation were due to the subpopulation differences. The remaining 97.1 for F ST and 95.7 for G ST corresponded to differences between individuals within the subpopulations. These values were lower than those (8.8% for F ST and 12% for G ST ) reported by Agaviezor et al. [26] in four sheep populations in Nigeria (Udah, Balami, Yankasa and West African Dwarf sheep also known as DS). Even though the genetic differentiation observed between the four DS subpopulations in Guinea-Bissau was low, the current study pointed that the subpopulation from Cacheu region slightly differs from those in Gabú, Bafatá et Oio regions. Indeed, these three subpopulations are genetically close even though they come from geographically different locations. This similarity is shown by: i) the high genetic identity (from 0.9603 to 0.9017) of the three subpopulations while this value decreased to 0.8683 between Bafatá and Cacheu subpopulations, ii) the low genetic distances between the three subpopulations. The closest Nei's [30] unbiased measures between Bafatá and Gabú, and the farthest between Bafatá and Cacheu may be due not only to their geographical locations but also to the breeding systems, the presence of the livestock market in Bafatá and the cultural behavior of the breeders in the different regions. Ira et al. [5] reported that Bafatá, Gabú and Oio regions had 95.88% of the sheep population of Guinea-Bissau and breeders practice transhumance breeding system, mixing cattle and sheep while in Cacheu region the breeding system is rather sedentary in association with agriculture (production of mangrove rice, sorghum, millet, beans, peanuts and cashew). The Bayesian approach implemented in STRUCTURE program detected the number of K that best fit the data between 2 and 3, suggesting an introgression of the Djallonké sheep of Guinea Bissau by an exotic sheep or the existence of "ecotypes". The two subpopulations from the eastern regions (Gabù and Bafatà) were separated from the western subpopulations (Cacheu and Oio) at K = 2. The heterogeneity of the DS in Guinea-Bissau is shown with K = 3. The molecular study on DS population of Guinea-Bissau confirmed the results obtained from phenotypic study.
Further investigations extended to other regions of Guinea-Bissau and other sheep breeds are required to determine the origin of the admixture and the existence of ecotypes of Djallonké sheep in this country.

Conclusions
In this primary phenotypic characterization of the DS in Guinea-Bissau, three genetic types of animals were distinguished, namely the largest animals in Bafatá, the smallest animals in Cacheu and the type with intermediate traits of animals in Gabú and Ohio. The values of the quantitative characters of the sheep of the region of Bafatá were significantly higher than those of the other regions. The molecular study confirmed the existence of three genetic groups in the DS population in Guinea-Bissau that could be related more to breeding system than a genetic differentiation which was very low. The current study provides sufficient data that could be used to develop strategies for the sustainable and efficient management of animal genetic resources in general and specifically of sheep genetic resources in Guinea-Bissau. To complete the morphological and molecular characterization reported in the current study, it would be necessary to collect and analyze the demographic parameters and the zootechnical data of the DS population in Guinea-Bissau.

Study area and population
The study was conducted in four administrative regions which are the largest agro-pastoral areas in Guinea-Bissau: Bafatá, Gabú, Cacheu and Oio. These four regions cover two agro-ecological zones [31]: -the North-East area comprising the regions of Gabú, Bafatá and Oio: characterized by a Sudanese climate with two distinct seasons: a dry season between November and May, and a rainy season from June to October. The annual rainfall ranges from 1200 to 1500 mm over an average of 107 days. The rate of evapotranspiration is 2507 mm and the annual average temperature is 27.4°C. Most of the soil is tropical iron and iron. However, hydromorphic soils derived from marine alluvium are found in the shallows, basins of rivers and rivers. The vegetation consists of wooded savannahs and clear forests dotted with grasses that provide excellent natural grazing for animals. Livestock is dominated by ruminants and associated with the cultivation of maize, plains rice, sorghum, millet, cotton, groundnuts and cashew nuts. DS is the only sheep breed used in these regions. Animal breeding is practiced by ethnic Peulh populations and Mandingoes with Muslim religious dominance (5); -the North-West area comprising the regions of Cacheu, Bissau and Biombo: moderately wet and warm Guinean maritime climate with 1500 1877 mm of average rainfall over 112 days. The average annual temperature is 26.6°C and the evapotranspiration is 137 mm [31]. This area offers good opportunities for diversified agricultural production. The soils are sandy-clay and hydromorphic. The vegetation is made up of wooded savannahs and dense forests. Livestock is dominated by pigs and poultry. The reduced size herds of ruminants are also met. Ruminants and poultry are much more used for traditional rituals than for sale at the market. This system is practiced by animist populations such as Pepels, Balantes, Manjaques, Diolas, Mancanhes and Bijagós. The husbandry is associated with the cultivation of low-lying rice, sorghum, millet, groundnuts, sweet potatoes, cassava and cashew nuts.
Animals belonging to Djallonké Sheep breed, both adult males and females were included in the study. Data collection was carried out between April and October 2017.
Morphobiometric data (qualitative and quantitative traits) were collected through single visits (primary characterization) in the different herds. A total of 200 animals were chosen in the four administrative regions: 75 animals in the Bafatá region, 25 in the Cacheu region, 60 in the Gabú region and 40 in the Oio region. The herds were chosen after sensitization of the breeders and their agreement. In each herd, the least related adult animals were chosen. Locations of the animal sampling have been included in Supplementary Fig. S1.

Description of animal morphological characters and body measurements
Body measurements (quantitative variables) concerned: (i) the Height at the Withers (HW), the Chest Depth (CD) and the Body Length (BL) using a sliding ruler; (ii) the Chest Girth (CG), Ear Length (EL), the Horn Length (HL), the Interval Length between the roots of the two Horns in males (ILH) measured between the roots of the two horns and the Tail Length (TL) were determined using a measurement tape. Body parameters measurements were taken early in the morning to avoid changing the animal's conformation after consuming water and food.
The morphological characteristics (qualitative variables) related to the sex of the animal (male / female), the type of melanin (eumelanin, phaeomelanin, absence of pigment), the coat color pattern, the coat color, the ear orientation, the facial (chamfer) profile, the presence or absence of horns and the shape of the horns were described using visual criteria by simple observation of the interviewers following the elements of the guidelines developed for the study using the guidelines of the Food and Agriculture Organization for the phenotypic characterization of Animal Genetic Resources [7].

DNA extraction, polymerase chain reaction and fragment analysis
Blood samples were collected on 92 unrelated animals: 26 animals in the Bafatá region, 15 in the Cacheu region, 27 in the Gabú region and 24 in the Oio region. Farmers were interviewed in detail to ensure unrelatedness among the sampled individuals. About 5 ml of whole blood samples were collected after jugular venipuncture in EDTA coated vacutainer tubes. Genomic DNA was extracted using Commercial PROMEGA Wizard purification kit. A total of 12 microsatellite markers chosen among those recommended by the FAO-ISAG consortium [32] were used to genotype all the individuals ( Table 8). The forward primer for each locus was labelled with one of the four fluorescent dyes FAM, VIC, NED and PET (Applied Biosystems, USA). Multiplexed polymerase chain reaction was performed with a total reaction volume of 12 μl containing 5 μl of mix primers of multiplex, 5 μl of mix of other reagents (Buffer, MgCl 2 , Taq polymerase) and 2 μl of DNA. The following thermal conditions, 94°C for 15 min, followed by 40 cycles of 94°C for 30 s, specific annealing temperature (58°C and 60°C according to the multiplex) for 1 min 45 s and 72°C for 1 min 30 s and a final extension at 72°C for 15 min was used for sample amplification by PCR. The amplified PCR products containing different dyes were then electrophoresed in four multiplexes (Table 8) in an automated DNA sequencer along with LIZ600 (Applied Biosystems, USA) as an internal lane control. The allele size data for each sample was generated using GENEMAPPER software version 5.

Data analysis Morphological data analysis
The statistical analysis of the qualitative and quantitative data was done using R 3.5.1 software [33].
For qualitative data, frequencies and proportions were analyzed by region and sex using the Chi-square test.
Means, standard deviations and extreme values (minimum, maximum) were computed for all studied traits.