Phylogeny of Northeast Asian wild boars
We sequenced the hypervariable region (161 bp-564 bp, Fig. 2a) within the D-loop in 23 wild boars from Northeast China (primarily Heilongjiang Province, Fig. 1b), and sequence of the same region were collected from the NCBI PopSet database (Heilongjiang Province, Inner Mongolia, Jilin, Liaoning and Korea) (Fig. 2b). A phylogenetic tree of the Northeast Asian wild boar D-loop was constructed based on these data. From the tree, we can infer that there are eight different haplotypes distributed in the Northeast Asian wild boar population, which can be classified into three main clades: Clade 1 contains a large portion of the wild boars from Heilongjiang and Inner Mongolia (indicated with a blue line in Fig. 2b); Clade 2 is composed of the wild boars from Jilin, Liaoning Province (indicated with a red line); and Clade 3 consist of the wild boars from Liaoning Province and Korea and the remainder of those from Heilongjiang and Inner Mongolia (indicated with a yellow line). In the tree, we can observe that the genotype from Heilongjiang Province is the main haplotype throughout Northeast Asia, suggesting that Heilongjiang may be the centre from which the diffusion migration of Northeast Asian wild boars began. The results indicate the possibility that Heilongjiang may be the main region of immigration or site of origin for the local wild boars in Northeast Asia.
Haplotype of the mtDNA D-loop and nucleotide variation in wild boars
For the purpose of exploring the origin of Northeast Asian wild boars, we further investigated the data for wild boars from Europe, West Asia, Southeast Asia and areas of China other than the northeast. A Eurasian wild boar dataset consisting of 480 samples was established. Finally, 21 haplotypes distributed across 27 SNP sites were obtained, including 8 singleton variable sites and 19 parsimony-informative sites. The SNP sites in the sequence are presented in Figure 3, and the base substitutions are listed in Table 1.
Table 1 SNP information for the 21 haplotypes screened from 480 wild boar samples
SNP Pos
|
Ref
|
Count
|
A Cnt
|
C Cnt
|
G Cnt
|
T Cnt
|
181
|
c
|
21
|
0
|
0
|
0
|
8
|
182
|
t
|
21
|
0
|
2
|
0
|
0
|
214
|
t
|
21
|
0
|
5
|
0
|
0
|
241
|
t
|
21
|
0
|
4
|
0
|
0
|
260
|
c
|
21
|
0
|
0
|
0
|
1
|
279
|
c
|
21
|
0
|
0
|
0
|
8
|
288
|
a
|
21
|
0
|
0
|
1
|
0
|
294
|
g
|
21
|
4
|
0
|
0
|
0
|
300
|
c
|
21
|
0
|
0
|
0
|
1
|
301
|
c
|
21
|
0
|
0
|
·XD0
|
1
|
302
|
a
|
21
|
0
|
0
|
1
|
0
|
306
|
t
|
21
|
0
|
7
|
0
|
0
|
322
|
a
|
21
|
0
|
0
|
2
|
0
|
323
|
t
|
21
|
0
|
2
|
0
|
0
|
324
|
t
|
21
|
0
|
5
|
0
|
0
|
376
|
t
|
21
|
0
|
1
|
0
|
0
|
387
|
a
|
21
|
0
|
0
|
7
|
0
|
388
|
a
|
21
|
0
|
0
|
5
|
0
|
390
|
t
|
21
|
0
|
2
|
0
|
0
|
391
|
c
|
21
|
0
|
0
|
0
|
5
|
405
|
t
|
21
|
0
|
1
|
0
|
0
|
406
|
t
|
21
|
0
|
4
|
0
|
0
|
443
|
a
|
21
|
0
|
0
|
6
|
0
|
452
|
c
|
21
|
0
|
0
|
0
|
9
|
474
|
c
|
21
|
0
|
0
|
0
|
1
|
501
|
a
|
21
|
0
|
0
|
5
|
0
|
560
|
t
|
21
|
0
|
6
|
0
|
0
|
Phylogeny and geographic distribution of the haplotypes
To clearly describe the origin and differentiation of wild boars in Northeast Asia, we first built a median-joining network (Fig. 4a) to obtain the ratios of different regions among 21 wild boar haplotypes and two Suina outgroup members, peccary (Tayassuidae) and warthog (Suidae), by using Network 5.0 and integrated the pie chart information for each haplotype into the neighbour-joining tree constructed with MEGA X. Additionally, we calculated the ratios of the seven geographic areas in which every haplotype was distributed (Fig. 4c). According to the phylogenetic relationships, we found that all the wild boar haplotypes could be classified into group A and group B (Fig. 4b). Hap1, 2, 4, 8, 11, 13 and 14 were in group B, and the remaining haplotypes were in group A. Group A included the unique 21T and 147C alleles. Hap2, 11, 13, and 14 were dispersed mainly in Southeast China, and they shared 120T, which was in accord with the outgroups. Hap1 and Hap4 were distributed in West Asia and Europe, and they shared 120C with group B. Hap3, 5, 6, 7, 15, 16, 17, 19, 20 and 21 formed small subgroup in group B in which 293T was unique; these haplotypes came from Southeast Asia, West Asia and Northeast Asia.
We further analysed the haplotype ratios of the wild boars of every group from every area of the Old World (Fig. 4c). We observed that Hap4 occurred only once in the European group, while Hap4 was one of six haplotypes that occurred simultaneously in the West Asian group. Hap4 and Hap15 accounted for largest proportions of the haplotypes in West Asia. The East Asia group exhibited the maximum number of haplotype varieties. The primary haplotype among the six haplotypes in the Northeast Asia group was Hap5, and the two secondary haplotypes were Hap3 and Hap10. However, consistent with the above results, the haplotype heatmap showed that more haplotypes were present in East Asia than in other areas, and the wild boar population in Northeast Asia showed a relatively similar haplotype composition to that of the Southeast Asia group, which differs from the haplotype clusters in Southern Asia, West Asia and Europe (Fig. 4d).