GBS has been used successfully in insect pests to reveal insights about gene flow and coancestry29, spatial and temporal genetic structure30,31, and incursions of invasive pests1,32. Here we associated the informative data set provided by GBS with molecular markers used for host strain identification to better explain the patterns of FAW population structure in Brazil and Argentina and to identify candidate genes under selection, either because of differentiation among populations or between host strains.
By assessing a large number of samples in Brazil, we confirmed that FAW collected in corn fields were predominantly C-strain, with less than 4% of samples featuring both the COI-RS and TpiR diagnostic markers, and the few specimens featuring discordant genotypes likely represent vestiges of interbreeding events that occurred in the past. Based solely on diagnostic polymorphisms in COI and Tpi regions, MA02 and AR03 populations showed increased levels of strain hybridization, and we were able to describe the level of gene flow of these locations by relying also on SNP data set. Furthermore, our mtCOI sequencing detected no COI-h3 haplotype, which confirms that populations in South America are not significantly interbreeding with those from Florida and Caribbean33,34.
GBS data revealed high levels of gene flow and low genetic differentiation between MA02 and RS population, which was composed by pure R-strain samples. Since these two populations were the most geographically distant locations sampled in Brazil, apart over 3200 km, we presumed that MA02 was composed by R-strain specimens with recent events of hybridization. Altogether, both COI haplotype network and pairwise FST distance analysis confirmed that the Brazilian populations are structured by host strains, rather than by geographical ecoregions.
Long distance migration enables FAW populations to travel from Southern Texas and Florida up to Canada, a distance of nearly 2500 km, in less than three months11,35. Therefore given its strong flight performance, we can hypothesize that FAW is also performing long distance migration within Brazil sufficient to keep populations homogeneous within each host strain.
As for Argentinian populations, a novel pattern of genetic structure was revealed based on GBS data, but not with COI haplotype analysis. Several studies comparing populations from Brazil and Argentina in the past showed strong similarities between the two countries7,22,36,37, which would be expected if the great majority of Argentina FAW are derived from seasonal migrations from Brazil. However, our data indicated significant genetic structure between countries as well as among provinces inside Argentina. This result is consistent with observations of mating incompatibility between populations collected in northern Argentina compared to those from the Pampas region, indicating pre-reproductive isolating barriers between geographically separated populations38. GBS data for another important Noctuidae pest in the Americas, the sugarcane borer (SCB), showed a similar pattern of genetic structure between Argentina and Brazil populations, and among populations within Argentina6,39. We hypothesize that Argentina likely contains one or more endemic FAW populations that exhibit significant geographical isolation and that additional studies are required to better investigate the fine scale genetic structure of FAW as well as identify locations capable of supporting permanent FAW populations in this country.
In order to explain how selection pressures might be affecting FAW populations in South America, we examined the putative annotations of genes containing outlier SNPs. Host strain outliers were mostly concentrated on the sex chromosome Z, suggesting that the selection pressures are acting upon specific regions of the FAW genome. This result corroborates with previous study where the preponderance of strain specific SNPs were Z-linked40, and is consistent with the proposal that strain divergence is being driven primarily by Z-chromosome functions41. We believe that by reducing complexity of the genome, the GBS method was able to capture a fairly large number of polymorphisms in the Z-chromosome, and thereby discriminate between the R- and C-strains. It is possible that previous research based on nuclear SSR markers21 that did not differentiate the host strains lacked sufficient coverage of the sex chromosome.
Other functional annotations revealed proteins that were likely involved in binding activities and that were present in or related to the cell membrane. Mutations in Cry receptor genes have been reported in numerous lepidopteran species to be the most common mechanism of resistance against Bt toxins42, and here we found outlier SNPs in genes likely coding for Cry receptors such as GPI-anchored glycoprotein, cadherin and zinc carboxypeptidase. We also found outlier SNPs in genes possibly coding many important enzymes such as cytochrome P450 CYP314A143, esterase FE4-like44, JHAMT45, and also proteins related to cuticle and chitin, which may be an indication of response to management with insecticides46. Two noteworthy outlier SNPs associated with host strains were in genes possibly encoding an odorant receptor and a UDP-glucuronosyltransferase. Odorant receptors function in insects olfaction process, which is indispensable for host selection for feeding and oviposition45. UDP-glucuronosyltransferase, in turn, appears to be associated with C-strain ability to detoxify DIMBOA47, a toxic compound produced by corn plants but not rice. In conclusion, our work strongly suggest that positive selection is affecting allele frequencies at the level of populations and host strains.
From the Insect Resistance Management (IRM) perspective, resistance evolution is one of the most challenging problems in the sustainable control of FAW48. Therefore understanding patterns of gene flow and consequent risks for spread of field-evolved resistance alleles are crucial for effective management. Our SNP data set poses a challenging scenario in Brazil, where locations presented high levels of gene flow across all ecoregions and low genetic structure within host strains. Positive selection was also related to a higher degree of FAW strains differentiation in Brazil, and this is plausible given that C-strain is less susceptible to insecticides and Bt crops49. Moreover, both pairwise FST distances and DAPC showed genetic structure between FAW populations of Brazil and Argentina, and the most important IRM implication of this result is that a plan can be anticipated and designed after a resistant population is reported in either country.
In conclusion, by combining classic molecular markers for FAW host strain identification, and genome-wide SNPs identified with GBS, we obtained more resolution of population structure than previously reported. The genetic structure and pattern of FAW in Argentina and Brazil reinforces the importance of phytosanitary barriers between countries for effective FAW management in each location. In agreement with this issue, outlier analysis suggested that positive selection is associated with field management and host strain divergences. Taking all this into consideration, current GBS data proved to be useful for population genomics research in South America and it may be applied to other geographies where the species has been recently introduced.