The few analyses of genetic variation carried out so far in Creole pigs were very limited in scope as they dealt only with a small set of neutral markers (Vargas et al., 2016). On the contrary, whole-genome sequencing provides a more comprehensive resolution of the genetic variation within and across populations across all genomic regions (Ros-Freixedes et al., 2022). Here, we focused on a set of 34 candidate genes with reported effects on relevant adaptive and economical traits (Tables 1 and 2). Six variants in five of these genes were fixed in CR (CTSK_1, HSP70_3, MC1R_2, MC1R_3, RYR1_1 and TAS2R39_1), while seven variants in other six genes were fixed in at least two CR (CYB5A_1, HSP70_4, KIT_1, MUC4_1, NR6A1_1, PHKG1_1 and PPARD_1). Of these, variants in the MC1R, NR6A1, PPARD and TAS2R39 genes are missense mutations that might have been selected for environmental adaptation.
The MC1R gene has a great impact in the determination of coat color due to its key role regulating the synthesis of eumelanin (black/brown) and phaeomelanin (yellow/red) in the melanocytes (Barsh, 1996; Fang et al., 2009). At least six haplotypes, tagged by 5 SNPs (MC1R_1 to MC1R_5, Table 1) and one deletion (g.182126CC>*), have been described in this gene (Muñoz et al., 2018). In the CR breeds, we only found three of these six haplotypes (Supporting Table S2), the so-called MC1R*2 (GCGCA**), MC1R*3 (GCATG**) and MC1R*6 (GCATGCC), which are all associated with black coat or spotting. The predominance of MC1R*3 in SP (frequency of 85.7%) and in ZU (frequency of 71.4%) is compatible with the IB (likely, Lampiño) origin of these breeds (Ocampo-Gallego and Abuabara-Pérez, 2021), since this haplotype is fixed in old black-coated and hairless IB strains as Lampiño (Alves et al., 2007; Fernández et al., 2004). However, MC1R*3 was residual in CM, where MC1R*6 was the predominant haplotype (frequency of 85.7%), as happens in current IB commercial strains (Muñoz et al., 2018). The presence of the MC1R*2 in SP (frequency of 14.3%) and ZU (frequency 21.4%), which has been previously detected in Large Black, provides evidence of introgression of black alleles from Asian origin into these two CR breeds. Likewise, CM does not seem to be completely free of introgression from commercial breeds as indicates the presence of the T allele in the KIT_1 variant (belted phenotype), which is not found in IT breeds (Muñoz et al., 2018). The absence of the MC1R*4 (ATGTG**) haplotype in CR pigs suggests that they have not been crossbred with Duroc.
The TAS2R39 gene is a member of the bitter-taste receptor family that has been related to fatness (Ribani et al., 2017). In agreement with findings in European local breeds (Muñoz et al., 2018), the G allele at TAS2R39_1 is fixed in the three CR breeds, likely indicating a selective pressure towards defensive bitter taste. The A allele at NR6A1_1 was fixed in SP and ZU but not in CM. This allele increases the vertebrae number in pigs, resulting in longer carcasses (Mikawa et al., 2007). This suggests that CM pigs could have been less intensively selected for size than SP and ZU. On the other hand, PPARD_1, as well as CBY5A_1 and HSP70_4, only segregate in ZU and at intermediate frequencies. Since the ZU pigs are found in the Atlantic coastal area, where the weather is especially hot, it is worth exploring whether these three variants might be related to heat resistance, as it has been described before. For instance, the missense mutation PPARD_1 (A allele; Table 1) increases ear size in pigs (Ren et al., 2011), with implications on skin homeostasis and fat deposition. The A allele is found in Asian but not in European breeds. The fact that the A allele segregated in ZU at a frequency of 50% adds evidence of Asian introgression into this breed, which, on the other hand, is characterized by having large and droppy ears (FAO, 1992). The T allele at CYB5A_1, which is only present in ZU, was associated with low fat androstenone levels (Peacock et al., 2008; Lin et al., 2005). While this may be desirable to reduce the risk of boar taint in carcasses from entire ZU males, it may jeopardize reproduction success. The HSP70 variants have also been related to male reproduction. In particular, the T allele at HSP70_4, which is the one not fixed in ZU, has been associated to larger ejaculates and semen quality. In a previous research we showed that CR boars produced less normal and motile sperm per ejaculate than CP boars (Suárez-Mesa et al., 2021). There is no pattern in the allele distribution of the four HSP70 variants across breeds that provides further evidence for an association of these markers with male fertility.
The rest of fixed variants in a single CR breed were mostly found in SP. These variants were either fixed or at very high frequency in IT (AHR_1 to AHR_3, LEPR_1, MC4R_1, PCK1_1 and SCD_1). The fatty nature of these breeds is consistent with the presence of the A allele at LEPR_1 (Table 2), which has been documented to increase feed intake and fatness and to impair reproductive and maternal abilities (Ros-Freixedes et al., 2016; Solé et al., 2021). This allele co-segregated with the T allele at AHR_1, which has a negative impact on prolificacy (Bosse et al., 2014). The joint presence of these two fixed alleles in SP can compromise the reproductive outcome of this breed. Given the sample size per breed, no clear-cut pattern across breeds can be inferred from allele frequency distribution of MC4R_1, PCK1_1 and SCD_1, except that, of CR, SP was the closest to IT, while CM was the most differentiated, with CM showing higher frequencies of the alleles associated with increased fatness (A allele at MC4R_1; Kim et al., 2004)d allele at PCK1_1; Latorre et al., 2016) and saturated fatty acids (C allele at SCD_1; Estany et al., 2014).
A total of 27 of the studied genes are annotated in the Sscrofa11.1 assembly of the pig genome. Therefore, in a second step, we went further to search for new variants into the coding region of these genes using whole-genome sequence data. In CR, we found 18 variants of high impact on mRNA sequence and protein translation. Only one of them was specific to CR. This is a splice-donor variant in LEPR that consists of a 7-bp deletion extending upstream on intron between exons 15 and 16 (LEPR_2: SSC6:146,829,573 − 146,829,580 bp) that affects the three transcripts of the gene. This deletion was only observed in CM and ZU (frequency of 7.1% and 28.6%, respectively) and was fully linked to the G (non-fatty) allele in LEPR_1, but not vice versa. Since these two variants are separated by only 9 bp, we can hypothesize that the 7-bp deletion appeared later from a haplotype with the G allele at LEPR_1. No individuals that were homozygous for the deletion allele were found, despite the probability of sampling at least one in ZU was around 45%. Apart from LEPR_1, other seven variants of moderate impact were detected in the LEPR gene of CR (Table 5; Supporting Table S3). In line with results in LEPR_2, the alternative allele in these variants did not segregate in SP (LEPR_3: SSC6:146,831,558 bp, frequency of 57.1%, only in CM; LEPR_4: SSC6:146,838,276 bp, frequency of 7.1%, in CM, and 50.0%, in ZU; LEPR_5: SSC6:146,838,380 bp, frequency of 21.4.0%, only in ZU; LEPR_6:SSC6:146,847,237 bp, frequency of 21.4%, only in CM; LEPR_7:SSC6:146,861,093 bp, frequency of 42.9%, only in CM; LEPR_8:SSC6:146,861,094 bp, frequency of 42.9%, only in CM; and LEPR_9: SSC6:146,861,105 bp, frequency of 64,3%, in CM, and 14.3%, in ZU). Interestingly, LEPR_1 to LEPR _9 had the same fixed allele in SP and IB. Considering all LEPR variants in Table 5, we can infer that, in SP, all of these variants reside in a single haplotype of 35,019 bp (from SSC6:146,826,086 bp to SSC6:146,861,105 bp), while, in IB they are inherited in two haplotypes due to a breed-specific in-between missense variant at SSC6:146,830,356 bp (frequency of 54.2%). More detailed studies are needed to decipher the connection between these LEPR variants and their effects on phenotypes. Nevertheless, findings so far provide further evidence on the IB origin of SP and hint at the role that LEPR, as a key element of the endocrine control of energy balance (Friedman, 2019), may have played in the adaptation of CR breeds to different geographical locations and dietary regimes.
Besides those in LEPR, we identified 12 more missense mutations. Three of them were in more than one CR breed and affected genes involved in coat color (KIT, in the three CR breeds, and MC1R, in SP and ZU). The remaining 9 were observed either in CM and ZU (FASN) or only in CM (ADIPOQ, CAST, and PPARGC1A) or ZU (FADS2, FUT1, ACTN1, and PRKAG3). No breed-specific missense mutations were found in SP, a result that would confirm that molecular variability is lower in this breed as compared to CM and ZU. As a whole, the findings presented here support that genome-wide characterization is a useful tool to identify patterns of genetic variation between and within CR breeds.