Selection of isolates for re-sequencing
Cluster analysis was conducted using phenotypic data from bioassays on a set of differential hosts (Table S1) inoculated individually with 1590 L. maculans isolates from western Canada. Six clusters were identified, containing 125, 325, 179, 143, 195 and 623 isolates, respectively (S Fig. 1). A total of 162 isolates (Table S2), with 27, 31, 19, 23, 14 and 48 isolates from each of the six cluster (S Fig. 1) were selected for DNA sequencing. Temporally 98 isolates were from years 2007-2008 and 54 from 2012-2014, or spatially 31 isolates from four locations in Alberta (AB), 63 from 10 locations in Manitoba (MB), and 68 from nine locations in Saskatchewan (SK).
The NGS short reads were assembled on the reference genome of L. maculans 'brassicae' isolate v23.1.3. Breadth and depth of coverage were examined to evaluate the quality of sequencing. Both coverages varied with isolates and supercontigs (SCs); four isolates with low depth and breadth of coverage were removed from the analysis. The remaining 158 isolates showed breadth of coverage of >80% (S Fig. 2 A), and five SCs (SC 27, 28, 33, 34, 36) had lower breadth of coverage ranging from 49.6 to 73.8% (S Fig. 2 B). For depth of coverage, 88% of the isolates displayed values >20. The coverage for SCs ranged from 12 to 26 except for SC30, which was as high as 621. In summary, the average breadth of coverage was 91% and the average depth of coverage was 36.
Variant discovery
Compared with the reference genome of L. maculans, about 920,000 DNA variants (SNPs and InDels) were identified; among them were 104,744 polymorphic biallelic variants with MAF ≥1 and miss call rates≤50%. Considering cluster analysis and PCA were to be performed in this study, miss call rates >5% would be considered statistically consequential [25], therefore, additional eight isolates were removed. After filtering against the criteria of phred quality (Q call ≥25), minimum allele frequency (MAF ≥5), and miss call rate (≤5%), 31,870 polymorphic variants were selected for subsequent analysis. Variant numbers in each isolate ranged from 8,591 to 14,997 with an average of 9,575, as compared with the reference genome.
To understand DNA variation among the isolates, the sequences of 150 retained isolates were paired to each other to find polymorphic variant numbers between isolates. The numbers varied dramatically with 11,175 isolate pairs in the range 508 to 17,708. About 4,000 isolate pairs showed 4,000~5,000 polymorphic variants, and 610 pairs displayed 15,000~18,000 polymorphic variants (S Fig. 3). This would suggest that approximately 56% of the 31,870 total polymorphic variants identified may be found in a single isolate pair.
Uneven variant distribution among genomic regions
As expected, variant numbers were highly correlated with SC size (R2=0.95, data not shown). There were 2,884 variants detected in the longest supercontig SC0, but only 14 variants in the
shortest SC40 (Table 1). The L. maculans genome has an isochore-like structure, characterized by alternating AT- and GC-blocks [20]. Variant density in the two blocks was compared along with coding and non-coding regions. Variant density in the AT blocks and the non-coding regions was 0.88 and 0.78 variants/Kb, higher than the average for the whole genome (0.71 variants/Kb) (Table 1). In contrast, GC blocks and coding regions had lower variant density (0.62 and 0.61 variants/Kb). This suggested uneven variant distribution in the contrasting regions, which becamemore evident when individual supercontigs were examined. For instance,
Table 1. Variant distribution and density in supercontigs and different genomic regions of L. maculans.
|
|
SCsa
|
Size (base)
|
Gene numbers
|
Variant numbers
|
AT-blocks coverageb (%)
|
Coding regions coveragec (%)
|
Variant densityd (variants per kb)
|
Whole SC
|
AT blocks
|
GC Blocks b
|
Non-coding regions
|
Coding regions
|
SC0
|
4258568
|
1263
|
2884
|
31.4
|
45.67
|
0.68
|
0.95
|
0.55
|
0.79
|
0.54
|
SC1
|
3378610
|
1080
|
2712
|
26.3
|
45.96
|
0.80
|
1.03
|
0.72
|
0.84
|
0.76
|
SC2
|
2939989
|
916
|
1928
|
25.0
|
46.09
|
0.66
|
1.10
|
0.51
|
0.79
|
0.50
|
SC3
|
2348246
|
565
|
2039
|
44.6
|
34.61
|
0.87
|
0.91
|
0.83
|
0.88
|
0.84
|
SC4
|
1918205
|
662
|
1171
|
28.7
|
46.87
|
0.61
|
0.97
|
0.47
|
0.77
|
0.43
|
SC5
|
1869450
|
633
|
1516
|
25.9
|
50.65
|
0.68
|
1.01
|
0.57
|
1.01
|
0.62
|
SC6
|
1888674
|
510
|
1274
|
41.9
|
33.79
|
0.80
|
0.68
|
0.89
|
0.61
|
0.81
|
SC7
|
1769547
|
567
|
1267
|
32.4
|
44.69
|
0.86
|
1.18
|
0.70
|
0.80
|
0.61
|
SC8
|
1809296
|
618
|
1019
|
25.8
|
45.84
|
0.70
|
1.04
|
0.58
|
0.67
|
0.44
|
SC9
|
1772623
|
474
|
1514
|
34.8
|
39.70
|
0.57
|
0.59
|
0.57
|
1.02
|
0.60
|
SC10
|
1758670
|
434
|
1052
|
33.9
|
35.57
|
0.60
|
0.85
|
0.47
|
0.67
|
0.47
|
SC11
|
1590160
|
285
|
954
|
47.5
|
25.52
|
0.77
|
0.67
|
0.86
|
0.72
|
0.25
|
SC12
|
1631710
|
439
|
1161
|
35.0
|
38.01
|
0.71
|
1.03
|
0.54
|
0.81
|
0.55
|
SC13
|
1634580
|
492
|
1225
|
30.4
|
42.29
|
0.58
|
1.21
|
0.31
|
0.76
|
0.73
|
SC14
|
1533332
|
512
|
1018
|
26.0
|
51.78
|
0.69
|
1.10
|
0.54
|
0.86
|
0.48
|
SC15
|
1560629
|
442
|
1053
|
36.5
|
39.64
|
0.65
|
0.76
|
0.59
|
0.72
|
0.60
|
SC16
|
1397653
|
353
|
1428
|
41.2
|
39.83
|
0.52
|
0.61
|
0.46
|
1.11
|
0.89
|
SC17
|
1445693
|
415
|
723
|
29.7
|
41.99
|
0.99
|
1.39
|
0.82
|
0.54
|
0.44
|
SC18
|
1351976
|
366
|
1148
|
37.8
|
40.36
|
0.85
|
0.85
|
0.85
|
0.88
|
0.80
|
SC19
|
1186800
|
288
|
716
|
41.2
|
36.34
|
0.60
|
0.75
|
0.50
|
0.66
|
0.51
|
SC20
|
1087932
|
292
|
1088
|
39.3
|
40.47
|
1.00
|
0.87
|
1.08
|
0.92
|
1.12
|
SC21
|
1020521
|
296
|
700
|
33.2
|
40.20
|
0.69
|
0.85
|
0.60
|
0.73
|
0.62
|
SC22
|
731443
|
35
|
309
|
84.4
|
5.53
|
0.42
|
0.40
|
0.56
|
0.44
|
0.15
|
SC23
|
521426
|
157
|
473
|
32.1
|
42.97
|
0.91
|
1.10
|
0.81
|
0.98
|
0.81
|
SC24
|
475869
|
116
|
334
|
39.5
|
40.12
|
0.70
|
0.82
|
0.63
|
0.84
|
0.50
|
SC25
|
318058
|
99
|
249
|
28.7
|
48.58
|
0.78
|
1.16
|
0.63
|
0.92
|
0.64
|
SC26
|
261540
|
74
|
172
|
34.1
|
43.28
|
0.66
|
0.69
|
0.64
|
0.71
|
0.59
|
SC27
|
250629
|
3
|
186
|
77.1
|
0.38
|
0.47
|
0.52
|
0.28
|
0.62
|
31.50
|
SC28
|
236098
|
1
|
8
|
96.1
|
0.45
|
0.45
|
0.42
|
1.08
|
0.02
|
2.82
|
SC29
|
200940
|
7
|
72
|
41.0
|
3.22
|
0.93
|
1.58
|
0.47
|
0.28
|
2.78
|
SC30
|
154863
|
35
|
117
|
63.8
|
12.49
|
0.05
|
0.00
|
0.14
|
0.86
|
0.00
|
SC31
|
143268
|
37
|
106
|
69.7
|
40.50
|
0.50
|
0.24
|
1.10
|
1.24
|
0.00
|
SC32
|
87679
|
2
|
56
|
75.7
|
3.16
|
0.64
|
0.75
|
0.28
|
0.66
|
0.00
|
SC33
|
65326
|
0
|
39
|
84.2
|
0.00
|
0.60
|
0.51
|
1.06
|
0.60
|
nae
|
SC34
|
58596
|
1
|
35
|
55.6
|
0.69
|
0.60
|
0.77
|
0.38
|
0.60
|
0.00
|
SC35
|
79158
|
0
|
24
|
83.9
|
0.00
|
0.30
|
0.35
|
0.08
|
0.30
|
na
|
SC36
|
35372
|
0
|
19
|
40.4
|
0.00
|
0.54
|
0.84
|
0.33
|
0.54
|
na
|
SC37
|
52193
|
0
|
37
|
88.7
|
0.00
|
0.71
|
0.52
|
2.20
|
0.71
|
na
|
SC38
|
23239
|
0
|
10
|
94.0
|
0.00
|
0.43
|
0.46
|
0.00
|
0.43
|
na
|
SC39
|
22454
|
0
|
19
|
82.7
|
0.00
|
0.85
|
0.75
|
1.29
|
0.85
|
na
|
SC40
|
21590
|
0
|
15
|
92.0
|
0.00
|
0.69
|
0.70
|
0.58
|
0.69
|
na
|
whole genome
|
44892605
|
12469
|
31870
|
35.2
|
40.20
|
0.71
|
0.88
|
0.62
|
0.78
|
0.61
|
a SCs, Supercontigs
|
b AT-blocks and GC-blocks denote L. maculans genome regions with GC percentage less or greater than 33.9% (Rouxel et al., 2011). AT-blocks and GC-blocks coverage were calculated as the percentage of accumulative AT- or GC-blocks size (bases) in each supercontig.
|
|
c Coding regions denotes genome regions covered by annotated gene sequences, and the rest part of genome was assigned as non-coding regions. Coding regions coverage was calculated as the percentage of accumulative coding sequences (bases) in each supercontig.
|
|
d Variant density indicated number of variants in one kilometer of sequences.
|
|
e na: not available due to illegal denominator (0) for division operation
|
|
|
|
|
|
|
|
|
|
|
|
|
|
for SC2, SC4, SC13 and SC35, variant density in AT blocks were 1.10, 0.97, 1.21 and 0.35 variants/Kb, respectively, nearly 2-4 times of that in GC blocks; similarly, in SC11 and SC22, non-coding regions had variant intensity (0.75 and 0.44 variants/kb) almost twice as high as in coding regions (Table 1).
To understand if genes in AT-blocks and GC-blocks mutate differently, we calculated variant density in genes in the two bipartite regions. There were 12,469 genes annotated in the L. maculans genome. We sorted genes in AT-rich regions into three classes: class A genes resided entirely in an AT-rich block, class B genes were those in which only part of their sequences overlapped with an AT-block, and class C genes were located solely within an GC-equilibrated region with one of its ends falling within 2,000 bases from an AT-block (Table S3). In the end, 1,158 genes were assigned to AT-blocks, leaving 11,311 genes assigned to GC-blocks. There were 10,042 variants detected in 3,971 genes in the GC-blocks, and 757 variants in 341 genes in the AT-blocks, with an average of 2.5 variants/gene in the GC-blocks and 2.2 variants/gene in the AT-blocks. Class A had 28 genes, with 25 located in SC30. Despite the higher variant density in AT blocks, none of the DNA variants was found in genes with full sequences residing in an AT-block. Class B was comprised of 248 genes; 80 of them had 274 variants distributed in 28 SCs. Class C were made up of 881 genes, including avirulence genes previously characterized, such as AvrLm4-7[26], AvrLm3 [27], AvrLm2 [28] and AvrLm6 [29], suggesting class C might serve as a pool of effectors. Class C displayed 746 variants in 259 genes.
L. maculans genes could be classified into small, secreted protein-encoding genes (SSP-genes) and non-SSP genes. We inspected coding regions to determine if DNA mutation was biased towards SSP genes or non-SSP-genes. Rouxel et al. [20] identified 652 SSPs, from which we selected 587 annotated genes, carrying gene names beginning with egn4_Lema, for variant density comparison. In this study, 327 variants were found in 153 SSP genes, and 10,668 variants were detected in 4364 non-SSP genes. On average, SSP genes had 2.1 variants/gene and non-SSP genes had 2.4 variants/gene. Variant numbers fluctuated considerably among genes (Fig. 1). The top 2 SSP genes, egn4_Lema_T086290.1(AvrLm4-7) and egn4_Lema_T015100.1 had only 37 and 23 variants, respectively, while the top 2 non-SSP genes, egn4_Lema_T015070.1 and egn4_Lema_T015110.1, had 101 and 100 variants, respectively. Next to AvrLm4-7, AvrLm3 was detected with nine SNPs (six nonsynonymous SNPs and three synonymous), and the other avirulence genes carried three SNPs at most, indicating that some non-SSP genes might have mutated more frequently than SSP genes.
Dominance of transition variants
Variants are classified into transition, transversion and InDels (insertions and deletions). The variant composition of these four types was examined. Transition variants were dominant in the L. maculans genome, accounting for 68.0% of all variants, followed by transversion variants at 24.5%. Variant composition in different genomic regions was also investigated. When compared at the genome level, AT blocks contained more transition variants (75.0%) and fewer transversion (19.9%) and InDel variants (5.1%). In contrast, GC blocks had a reduced proportion of transition (63.2%) but higher percentage of transversion (28%) variants and Indels (8.8%) (S Fig. 4). In other words, when compared with GC blocks, AT blocks had 19% higher transition variants, but 29% fewer transversion variants and 42% fewer InDels.
Comparatively, the difference in variant composition between coding and non-coding regions, and SSP and non-SSP genes showed was less significant that between AT- and GC- blocks (S Fig. 4). Of course, the ratio varied considerably depending on the genes considered. For an example, gene egn4_Lema_T086290.1 (AvrLm4-7) had 36 transitions but only one transversion and no InDels; on the other hand, gene egn4_Lema_T007850.1 had five transversions but only one transition.
Phylogeny of L. maculans isolates
This study involved 26 isolates from AB, 59 from SK and 65 from MB. Considering the abundance of variants identified, genetic diversity among these isolates was investigated first.
The variants were discovered in different genomic regions or genes of interest, such as AT-blocks (13,851 variants), GC-blocks (18,019), coding regions (10,994), non-coding regions (20,876), SSP-genes (952), non-SSP genes (10,042), genes in AT-blocks (326), genes in GC blocks (10,668) and avirulence genes (48). Given the difference in genetic diversity as measured by variants in different regions, hierarchical cluster analysis was performed using the variants from different sources and their dendrograms were compared. Visual side-by-side comparison of overcrowded dendrograms was inconvenient due to the number of isolates involved, dendrograms were then compared to each other by their cophenetic matrices instead since a phylogenetic dendrogram is the graphical representation of a cophenetic matrix [30]; a coefficient value close to 1 would represent a significant similarity between the two dendrograms in the comparison (Fig. 2). The similarity of dendrograms between genome-wide variants and variants from different genomic regions could be classified into three tiers, the first tier included AT-blocks, GC-blocks, coding-regions, non-coding regions and non SSP genes, with correlation coefficients of >0.94. The second tier encompassed SSP genes, and genes in AT and GC blocks, from which the correlation coefficient ranged from 0.60 to 0.87. The third tier was Avr genes, with a correlation coefficient of only 0.14. It was found, however, that the similarity between any of these dendrograms with whole-genome dendrogram was only partly correlated with the number of variants involved. For instance, the genes in AT-blocks had almost 14 times as many variants as SSP genes, but its correlation coefficient (0.60) was less than the latter (0.73) when the two dendrograms were compared with that of genome-wide variants (Fig. 2).
We then chose to describe three dendrograms based on our interest in pathogen-host interaction. the first was created by 48 SNPs in avirulence genes, the second constructed by the 952 variants in SSP genes, and the third was from genome-wide variants, to illustrate their difference in measuring genetic diversity. The three dendrograms are hereafter referred to as the Avr dendrogram, the SSP dendrogram and the whole-genome dendrogram.
The Avr dendrogram consisted of two main groups (I, II), each with two subgroups (Fig. 3 A). Subgroups A and B in Group I were made up of 72 and 22 isolates, respectively. For group II, There were 18 isolates in subgroup C and 38 in subgroup D. Each subgroup contained isolates collected from the three provinces, but it was noticeable that subgroups C and D were composed of isolates only from years 2007-2008, while isolates from both 2007-2008 and 2012-2014 were found in subgroups A and B. While the Avr dendrogram showed avirulence gene diversity, it also revealed that some isolates shared an identical set of variants in multiple avirulence genes because the Euclidean distance among them was 0. For example, 33 isolates were clustered as leaf A in subgroup A (Fig. 3 A); they were literally the same in terms of avirulence gene sequences. So were the 25 isolates as leaf B in subgroup D. It was noticed, however, that isolates collected from the same location were observed in different clads, for instance, 19 isolates from Melfort, SK, were found in subgroup A, C and D, which denoted avirulence gene diversity for a specific location.
The SSP dendrogram had two main groups (I, II) (Fig.3 B). Group I contained 127 isolates divided into two subgroups (A, B), and group II had only 23 isolates further divided into two subgroups (C, D). Group II consisted of isolates from MB and AB, but not SK. Subgroup C contained 10 isolates, all collected from five locations (Holland, Cartwright, Lowe Farm, Carberry, and Carman) in MB in 2012-2014, except that one isolate CB07-100 was sampled in 2007-2008. Subgroup D had 13 isolates from five locations (Holland, Cartwright, Lowe Farm, Winker, and Plum Coulee) in MB, and two locations (Olds and Camrose) in AB. Group I could be divided into two subgroups: A and B. Subgroup A had 55 isolates collected exclusively in 2007-2008, while subgroup B had 72 isolates from 19 locations collected from both time periods. It was noticeable that isolates collected in 2007-2008 appeared in all four subgroups, but isolates from 2012-2014 were observed only in three subgroups (B, C, D), so isolates collected in 2007-2008 were more genotypically diverse in terms of SSP gene variations than those collected in 2012-2014. With respect to geographic locations, MB isolates were of the highest diversity in SSP genes as compared with SK and AB isolates, because these isolates appeared in the four subgroups (A, B, C, D); followed by AB isolates, which were found in three subgroups (A, B, D). Saskatchewan isolates from nine locations had the least variability, and were clustered in only two subgroups (A, B).
The whole-genome dendrogram formed two main groups (I and II) (Fig. 3C). Group I was comprised of 143 isolates partitioned into four subgroups: A, B, C and D. Subgroups A, B and D each contained 22 isolates, with 19 isolates originating from seven locations in MB, two isolates from AB, and one from SK. The major subgroup C was comprised of 118 isolates, accounting for 79% of the isolates from the three provinces, which represented the majority of the L. maculans population gathered in western Canada. Group II contained only seven isolates, which were all collected from MB except for CR07-97 from Camrose, AB. Group II were divided into two subgroups (E,F). The two isolates in subgroup E were collected from AB and MB, and the five isolates in subgroup F were all from MB. All SK isolates, regardless of the year of collection, were all clustered in this subgroup.
The Avr, SSP and whole-genome dendrograms resulted in different isolate groupings. This suggested that the level and complexity of genetic diversity varied with the set of variants used to examine these isolates. Regardless, the three dendrograms indicated high genetic diversity among these isolates. As shown by SSP and whole-genome dendrograms, the level of genetic diversity varied with the source of the isolates, with MB isolates being most diverse, followed by AB isolates. SK isolates appeared least diverse. Considering that use of variants in Avr or SSP genes alone would only reflect genomic diversity with respect to the L. maculans interaction with the differential hosts, and that genome-wide variants would be more informative and inclusive, we used genome-wide variants for further analysis.
PCA analysis of L. maculans isolates
To further illustrate the relationships of the isolates collected at different locations and times, PCA was performed using genome-wide variants. PC1, accounting for 17.8% of variation, and PC2 for 6.1%, were used to map the isolates (Fig. 4). Except for five outliers (CR07-48, CR07-58, CR07-97, 12CC-481 and 12CC-419) from AB, all SK and AB isolates clustered together close to the origin on the negative side along with approximately half of the MB isolates. The other isolates from MB, however, segregated widely on the right side of the PCA map (Fig. 4). This indicated that isolates from SK and AB were of similar genetic composition, while MB isolates were highly diverse, which was consistent with the previously described cluster analysis. The genetic diversity among isolates seemed to increase with years, as exemplified by SK and MB isolates. For examples, some isolates collected in 2012-2014, such as 12CC-387, 13CCSK19-07 and 12CC-329 from SK, and 12CC-481 and 12CC-419 from AB, were dispersed from the main cluster. The genetic variation also became greater with time for MB isolates, as over half of the isolates shifted away from the isolates sampled earlier in 2007-2008 (Fig. 4).
The difference in genetic diversity between years and locations may be reflected in variant changes. In order to find DNA variants associated with years and regions, we carried out a GWAS study using TASSEL 5.2.31 [31] with the variants as genotype source and collection years, locations and provinces as traits. TASSEL offered two different models, the generalized linear model (GLM) and the multiple linear model (MLM) for marker-trait association. We tested the validity of GLM (PCA) and MLM (PCA+K) by evaluating their quantile-quantile plots (QQ-plots). A QQ-plot deviating from the null hypothesis across the entire standard distribution reflects undetected population stratification or cryptic relatedness. Consequently, seemingly significant variants could be statistically unreliable, whereas truly significant variants would generate deviations only at the end of plot range [32]. While GLM (PCA) employs principal components only as covariates to control the population structure, MLM (PCA+K) uses both principle component and kinship to eliminate or reduce the confounding effect of a real association. The MLM (PCA+K) test proved to have higher correction efficiency because the base of its QQ-plot was consistent with the reference line (Fig. 5B), as compared with the GLM (PCA) approach (Fig. 5A). Therefore, Manhattan plots of MLM (PCA+K) tests were examined to find variants associated with years, locations and provinces, and the variants were mapped to SCs with significant p values for potentially associated traits. There were the variants associated with the year (Fig. 5C), province (Fig. 5D) and location of the field (Fig. 5E) of collections. To reduce false positives, the false discovery rate (FDR) approach [33] was adopted for selection of variants. As a result, none of the variants was associated with the province or field site of collection, while 143 variants in SC12 were associated with the year of collection significantly. Of the 143 variants, 20 were located in egn4_Lema_T086290 (AvrLm4-7), 29 in egn4_Lema_T086300 and six in egn4_Lema_T086310. The rest of the variants were located in non-coding regions in proximity to these three genes. The other four significant variants were found in SC20, SC8, SC10 and SC11.
Population structure of L. maculans
Initially, the popular Bayesian-based STRUCTURE software was tested for population structure analysis; it imposed tremendous computational burden due to the large number of variants involved, making implementation extremely inefficient. Landscape and ecological association (LEA), an R package [22], was used to perform STRUCTURE-like analysis and estimate admixture coefficients efficiently with improved algorithms [21, 34]. LEA was run first to establish a curve of cross-entropy versus number of ancestral populations, the optimal number K=3 indicated three ancestral populations in western Canada (Fig. 6A), which are hereafter referred to as sub-pop1, sub-pop2 and sub-pop3. The function snmf of LEA differentiated the 150 isolates by partitioning their genome into the three ancestral populations, with probability denoted by an admixture coefficient (Fig. 6B). Sub-pop1 appeared to be the dominant sub-population, consisting of 76% of the 150 isolates with a correlation coefficient of >0.8 . Four isolates, one from AB (CR07-97) and three from MB (PC07-33, PC07-29 and 13CCMB02-06), constituted subpop2 with the coefficient at also >0.8. In addition, five isolates from MB (PC07-24, PC07-30, PC07-67, PC07-97 and MT07-31) were members of sub-pop3 with a correlation coefficient of 0.9998. All the other isolates were mixes of these sub-populations to a greater or lesser extent.
For a better understanding of the temporal and spatial distribution of the three sub-populations, the means of the admixture coefficients for isolates collected from the same field locations were calculated and mapped to these locations for 2007-2008 (Fig. 6C) and 2012-2014 (Fig. 6D), separately. The L. maculans population admixture varied among provinces; all SK isolates from 10 localities, regardless of the year of collection, were mainly composed of subpop1. The AB isolates were made up of two subpopulations (subpop1 and subpop2), exemplified by isolates from Camrose in 2007-2008 and Olds in 2012-2014, with admixture proportions of subpop1/subpop2/subpop3 being 85/13/1% and 42/51/7%, respectively (Table 2). This indicates the expansion of subpop2 between 2007-2008 and 2012-2014. The MB isolates turned out to have the greatest admix at both collection times. In 2007-2008, subpop1 was comprised of about 90% of the L. maculans population in Brandon and Carberry, but the admixture proportion (subpop1/subpop2/subpop3) in Plum Coulee, collected in 2007-2008, was 23/29/48%, indicating three major subpopulations. In 2012-2014, subpop2 in four locations (Carman, Winkler, Low Farm and Cartwright) increased noticeably, ranging between 39.2 and 55.7%, similar to that observed at Olds, AB. At the same time, subpop3 accounted for 5.5-11.6% of the isolates at all seven locations in MB, but subpop1 was significantly lower as compared with Brandon and Carberry in 2007-2008.
Table 2. Admixture coefficient of isolates
|
years
|
Provinces
|
Locations
|
number of isolates
|
Admixture coefficient
|
subpop1
|
subpop2
|
subpop3
|
2007-2008
|
AB
|
Vegreville
|
4
|
99.2
|
0.3
|
0.5
|
2007-2008
|
AB
|
Vermilion
|
8
|
95.7
|
0.3
|
4.0
|
2007-2008
|
AB
|
Camrose
|
12
|
85.8
|
13.1
|
1.1
|
2007-2008
|
MB
|
Plum Coulee
|
9
|
23.3
|
28.7
|
48.0
|
2007-2008
|
MB
|
Brandon
|
10
|
89.4
|
8.7
|
1.9
|
2007-2008
|
MB
|
Carberry
|
11
|
84.1
|
11.1
|
4.8
|
2007-2008
|
SK
|
Indian head
|
8
|
96.1
|
0.8
|
3.0
|
2007-2008
|
SK
|
Scott
|
9
|
98.5
|
0.9
|
0.6
|
2007-2008
|
SK
|
Melfort
|
19
|
93.6
|
0.2
|
6.2
|
2012-2014
|
AB
|
Olds
|
2
|
42.3
|
51.0
|
6.6
|
2012-2014
|
MB
|
Carman
|
1
|
43.7
|
40.8
|
15.5
|
2012-2014
|
MB
|
Winkler
|
2
|
41.8
|
46.2
|
11.9
|
2012-2014
|
MB
|
Killarney
|
2
|
75.7
|
14.0
|
10.3
|
2012-2014
|
MB
|
Lowe Farm
|
3
|
27.7
|
55.7
|
16.6
|
2012-2014
|
MB
|
Franklin
|
3
|
76.3
|
18.1
|
5.5
|
2012-2014
|
MB
|
Holland
|
8
|
67.9
|
25.4
|
6.6
|
2012-2014
|
MB
|
Cartwright
|
10
|
49.2
|
39.2
|
11.6
|
2012-2014
|
SK
|
Scott
|
1
|
97.5
|
0.0
|
2.5
|
2012-2014
|
SK
|
Speers
|
3
|
91.2
|
8.2
|
0.7
|
2012-2014
|
SK
|
Biggar
|
3
|
91.7
|
5.4
|
2.9
|
2012-2014
|
SK
|
Watrous
|
4
|
96.8
|
0.1
|
3.1
|
2012-2014
|
SK
|
Langenburg
|
4
|
89.5
|
7.3
|
3.2
|
2012-2014
|
SK
|
Meota
|
5
|
97.0
|
0.2
|
2.8
|
2012-2014
|
SK
|
Goodeve
|
9
|
95.8
|
2.8
|
1.4
|