High genetic diversity and strong genetic structure of Strongyllodes variegatus (Coleoptera: Nitidulidae) demonstrate the population history of its distribution in oilseed rape production areas in China

doi:10.21203/rs.3.rs-19659/v2

Background: Strongyllodes variegatus (Fairmaire) is a major insect pest of oilseed rape in China. Despite its economic importance, the contribution of its population genetics to the development of suitable protection and control strategies for oilseed rape crops is poorly studied. Using sequences of the mitochondrial DNA cytochrome c oxidase subunit I (COI) and cytochrome b (Cytb) genes as genetic markers, we analyzed the population genetic diversity and structure of 437 individuals collected from 15 S. variegatus populations located in different oilseed rape production areas in China. In addition, we estimated the demographic history using neutrality tests and mismatch distribution analysis.

Results: A high level of genetic diversity was detected among the COI and Cytb sequences of S. variegatus. The population structure analysis strongly suggested three distinct genetic and geographical regions in China with limited gene flow. The Mantel test showed that genetic distance was significantly correlated with geographical distance. The demographic analyses showed that S. variegatus experienced population fluctuation during the Pleistocene Epoch, which was likely related to climatic changes.

Conclusion: Overall, these results demonstrate that the strong population genetic structure of S. variegatus in China may be attributed to isolation by geographical distance among populations, their weak flight capacity and subsequent adaptation to regional ecological conditions.

Evolutionary Biology

Gene flow

Genetic differentiation

Haplotype

Oilseed rape

Population genetic pattern

Strongyllodes variegatus

The brown beetle, Strongyllodes variegatus (Fairmaire) (Coleoptera: Nitidulidae), feeds on brassicaceous plant species [1, 2] and often co-occurs with the pollen beetle, Meligethes aeneus [3]. S. variegatus adults chew flowers, buds and leaves, forming crescent-shaped bites in which the mature females lay eggs. After hatching, the larvae feed on the mesophyll, producing irregular bubble-shaped wounds, before pupating in the soil. The wounded leaves become necrotic and abscise prematurely [4, 5]. The life cycle of S. variegatus varies with the temperature and photoperiod of the geographical region. In the spring oilseed rape areas, this beetle species reproduces once or twice a year [2]. In the winter oilseed rape areas, however, only two generations occur [6]. In Anhui, the overwintering adults begin to appear in March. When the temperature exceeds 30℃, the adults begin to oversummer in the soil, and some of them are mixed into the harvested rapeseed, where they oversummer. The adults reappear on cruciferous vegetables in September and then move to rape fields, where they cause damage in October. When the temperature drops in November, the adults begin to overwinter in the soil [4, 6]. In addition, S. variegatus has been reported to have a high reproductive ability [4] and to fly 30~40 m in 2 min [2].

Recently, leaf damage to oilseed rape crops by this beetle has become increasingly serious, and the beetle is now a major insect pest of oilseed rape. In spring 2013, an S. variegatus outbreak in Hanshan, Anhui province, destroyed 97% of oilseed rape leaves [6]. This pest was found for the first time on spring oilseed rape plants in Ningxia, Gansu province, China in 1993 [2]. It was detected in Hanshan, Anhui province on winter oilseed rape crops in 2008 [4]. Over the past few years, we surveyed oilseed rape production areas in China and found that this pest has spread to Chongqing municipality and Qinghai, Gansu, Sichuan, Shaanxi, Hubei, Anhui and Jiangsu provinces (unpublished). S. variegatus is generally distributed in the middle and lower reaches of the Yangtze River valley. Currently, this pest is widespread in China but has not been reported anywhere else in the world. However, the phylogeography and population genetics of S. variegatus remain unstudied. Consequently, genetic studies are urgently needed to understand the population genetic diversity and structure of S. variegatus for the management and control of this beetle species.

Population genetic studies on crop pests can provide information on the spatial scales at which population structure and gene flow occur. Such information can help to define spatially relevant strategies for pest control [7]. In addition, genetic diversity contains information on past and present demography that can be used to characterize the demographic history of crop pests [8]. In recent years, a growing number of molecular markers have been used to study insect population genetics, demonstrating the importance of phylogeographical approaches [9]. Insect molecular markers mainly include nuclear DNA and mitochondrial DNA (mtDNA) sequences, among which mtDNA genes are widely used. Mitochondrial genes evolve faster than nuclear genes and are effective for studying phylogenetic evolution, especially the degree of inter- and intra-specific population differentiation and the level of gene flow [9, 10]. Among insect molecular markers, fragments of the mtDNA cytochrome c oxidase subunit I (COI) and cytochrome b (Cytb) genes have been widely used to study population genetic variation and differentiation, for example in Dendrolimus kikuchii, Chilo suppressalis and Agriosphodrus dohrni [11-15]. COI and Cytb were also used to track the colonization routes of Halyomorpha halys and to identify its places of origin [16-18].

In this study, we used the COI and Cytb genes to elucidate, for the first time, the genetic diversity and structure of 15 S. variegatus populations occurring in oilseed rape production areas in China. We hypothesized that the populations would show a high level of genetic diversity and a clear genetic structure. The molecular data also allow us to assess the hypothesis that historical geographic events and associated ecological adaptation have played an important part in shaping the observed genetic and geographic patterns of this beetle in China.

Genetic variation of S. variegatus populations

Seventy haplotypes of the COI gene and 67 haplotypes of the Cytb gene were identified from the 15 populations. The S. variegatus COI fragment (652 bp) and Cytb fragment (421 bp) have 45 (6.9%) and 40 (9.5%) variable sites, with 28 and 23 parsimony informative sites, respectively (Table 1). The base composition of both genes is biased towards adenine (A) and thymine (T) (67.5% and 73.3%, respectively), which is common for insect mitochondrial genes. Haplotype diversity (Hd) ranges from 0.424 to 0.913 (mean = 0.865) and nucleotide diversity (π) ranges from 0.00072 to 0.00462 (mean = 0.00427) for the COI gene (Table 1). Similarly, the Hd ranges from 0.464 to 0.833 (mean = 0.834) and π ranges from 0.00119 to 0.00539 (mean = 0.00479) for the Cytb gene (Table 1).
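The two diversity indices reported above can be illustrated with a short sketch. The code below is not the DnaSP implementation; it assumes aligned, equal-length sequences without gaps, uses Nei's sample-size-corrected form of Hd, and runs on a hypothetical toy alignment rather than on data from this study.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Hd = n/(n-1) * (1 - sum(p_i^2)), with p_i the frequency of haplotype i."""
    n = len(seqs)
    counts = Counter(seqs)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def nucleotide_diversity(seqs):
    """pi: mean number of pairwise differences per site."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    total = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total / len(pairs) / length


# Hypothetical toy alignment: 4 individuals, 3 haplotypes, 10 sites.
toy_alignment = ["ACGTACGTAC", "ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAT"]
print("Hd =", round(haplotype_diversity(toy_alignment), 3))
print("pi =", round(nucleotide_diversity(toy_alignment), 4))
```

In this toy case Hd is high because most individuals carry different haplotypes, while π depends on how many sites separate those haplotypes, which is why the two indices can behave differently across populations.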

Haplotype analyses of the COI and Cytb genes

The distribution of the haplotypes of the two genes across the populations studied is shown in Table S1. The rarefaction analyses showed that the curves converged on an asymptote (Fig. S1). The COI haplotypes (H1-H70) include 34 (48.6%) unique haplotypes (Table S2). The four most frequent haplotypes (H1-H4) were found in 132 (30.2%), 59 (13.5%), 29 (6.6%), and 60 (13.7%) individuals, respectively (Table S2; Fig. 2a). Haplotype 1 (H1) was found in all populations except the GDQH, FJCQ and ESHB populations, whereas haplotype 2 (H2) was found only in the GYSC, HZSX, AKSX, FJCQ, ESHB and LCHB populations (Table S2). The Cytb haplotypes (H1-H67) include 35 (52.2%) unique haplotypes, among which 32 were observed in more than one individual (Table S2). The three most frequent haplotypes (H1-H3) were found in 158 (36.2%), 61 (14.0%) and 48 (10.9%) individuals, respectively (Table S2; Fig. 2b). Haplotype 1 (H1) was found in all populations except the ESHB population, whereas haplotype 3 (H3) was found only in the AQAH, LAAH, HFAH, CHAH, NJJS and ZJJS populations (Table S1).

The haplotype distribution and haplotype network analyses (see below) of both the COI and Cytb genes revealed that S. variegatus populations could be divided into three major geographical distribution regions or haplogroups: the northwestern China (NW) haplogroup (GDQH, HZGS and ZYGS populations), the central China (CC) haplogroup (GYSC, HZSX, AKSX, FJCQ, ESHB and LCHB populations) and the central and eastern China (CE) haplogroup (AQAH, LAAH, HFAH, CHAH, NJJS and ZJJS populations) (Fig. 1).

For the haplotype network of the COI gene, only one haplotype (H1) was shared by all three haplogroups. Haplotype 2 (H2) was abundant and detected only in the CC haplogroup. Haplotype 3 (H3) was found only in the CE haplogroup. Six haplotypes (H4-H9) were shared between the NW and CC haplogroups. A total of five missing haplotypes were observed in all populations (Fig. 2a). Similarly, for the haplotype network of the Cytb gene, two haplotypes (H1, H4) were shared by all three haplogroups. Haplotype 2 (H2), the most abundant, was detected only in the CC haplogroup. Haplotype 3 (H3) was found only in the CE haplogroup. Haplotypes H5-H6, H7 and H8-H9 were shared between the NW and CC haplogroups, the NW and CE haplogroups, and the CC and CE haplogroups, respectively. A total of four missing haplotypes were observed in the CC haplogroup (Fig. 2b).

Population genetic differentiation

To further assess whether the three inferred clusters of S. variegatus populations are genetically distinct, a Bayesian clustering analysis was performed using STRUCTURE. The analysis showed that the most likely value of K chosen with Evanno’s ΔK method was 3, indicating that the genetic variation is likewise divided into three clusters. The proportions of each population assigned to each of the three clusters are shown in Figure 3. Clusters 1 (red) and 2 (yellow) were composed mainly of individuals from the NW and CC populations, respectively. The CE populations were assigned mainly to cluster 3 (green).
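For readers unfamiliar with the ΔK criterion, the sketch below shows one common way to compute it from replicate STRUCTURE runs: the absolute second-order difference of the mean ln P(X|K), divided by the standard deviation of ln P(X|K) across replicates at that K. The exact variant used by the authors is not stated, so this form is an assumption, and the log-likelihood values are hypothetical, not the values from this study.

```python
from statistics import mean, stdev


def evanno_delta_k(lnp_by_k):
    """Return {K: delta K} for every K that has both neighbours K-1 and K+1.

    lnp_by_k maps K to a list of ln P(X|K) values from replicate runs.
    """
    delta = {}
    for k in sorted(lnp_by_k):
        if k - 1 not in lnp_by_k or k + 1 not in lnp_by_k:
            continue  # delta K is undefined at the ends of the K range
        second_diff = abs(
            mean(lnp_by_k[k + 1]) - 2 * mean(lnp_by_k[k]) + mean(lnp_by_k[k - 1])
        )
        delta[k] = second_diff / stdev(lnp_by_k[k])
    return delta


# Hypothetical ln P(X|K) values for three replicate runs per K.
toy_lnp = {
    1: [-5000, -5002, -4998],
    2: [-4600, -4605, -4595],
    3: [-4200, -4201, -4199],
    4: [-4180, -4190, -4170],
    5: [-4170, -4150, -4190],
}
toy_delta = evanno_delta_k(toy_lnp)
best_k = max(toy_delta, key=toy_delta.get)
print(toy_delta, "best K =", best_k)
```

The criterion favours the K at which the likelihood curve bends most sharply relative to run-to-run noise, which is why it cannot evaluate K = 1 or the largest K tested.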

A strong genetic divergence was observed across populations (F_ST = 0.425, P < 0.0001, Table 2). The F_CT value among the three regions (NW, CC and CE) was highly significant (F_CT = 0.470, P < 0.0001, Table 2), further demonstrating that S. variegatus populations in China are divided into three regions. Significant genetic differentiation was also observed among populations within regions (F_SC = 0.072, P < 0.0001, Table 2) and within populations (F_ST = 0.508, P < 0.0001, Table 2) based on the combined data of the COI and Cytb genes. The percentages of genetic variation within populations (60.16% in the comparison between the NW and CC regions, and 56.00% in the comparison between the NW and CE regions) were markedly higher than those between regions (33.89% between the NW and CC regions, 33.88% between the NW and CE regions) (Table 2). However, the percentage of genetic variation between the CC and CE regions (54.95%) was higher than that within populations (42.82%) (Table 2), indicating limited gene flow between the CC and CE regions.

The pairwise F_ST values based on the combined data of the COI and Cytb genes among populations ranged from -0.015 to 0.811 (Table 3). Of the 105 pairwise comparisons, 88 showed significant genetic differentiation. The pairwise F_ST values among populations within the CC and CE regions were less than 0.159, while the pairwise F_ST values between populations from the CC and CE regions were above 0.409. In addition, the pairwise F_ST values among regions were high and significant (F_ST > 0.25, P < 0.001, Table 4), and gene flow among regions was estimated to be extremely low (Nm < 1, Table 4), suggesting limited gene flow among regions. These results were highly consistent with those obtained by the analysis of molecular variance (AMOVA) described above.
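The link between F_ST and the number of migrants per generation (Nm) can be made concrete with a small sketch. The text does not state which conversion was used, so the island-model formula below is an assumption: Nm = (1/F_ST - 1)/2 is the form often applied to haploid, maternally inherited mtDNA, and a divisor of 4 gives the classic diploid nuclear form. The three F_ST values are the thresholds and maximum quoted in the paragraph above.

```python
def nm_from_fst(fst, factor=2):
    """Island-model estimate of migrants per generation from F_ST.

    factor=2: haploid, maternally inherited markers (assumed here for mtDNA).
    factor=4: diploid nuclear markers.
    """
    if not 0 < fst < 1:
        raise ValueError("F_ST must lie strictly between 0 and 1")
    return (1.0 / fst - 1.0) / factor


for fst in (0.159, 0.409, 0.811):
    print(f"F_ST = {fst:.3f} -> Nm = {nm_from_fst(fst):.3f}")
```

Under this assumed formula, F_ST values above one third correspond to Nm below 1, the level conventionally read as too little migration to prevent divergence by drift.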

The Mantel test based on the combined data of the COI and Cytb genes revealed a significant correlation between the genetic distance (F_ST/(1-F_ST)) and the geographical distance among all populations (r = 0.500, P < 0.0001, Fig. 4).

Demographic analyses

The Tajima’s D values obtained with the single and combined gene data in the NW region were negative but not significant (P > 0.05, Table 1). The Tajima’s D and Fu’s Fs values in the CC and CE regions were negative and significant (P < 0.05, Table 1), whereas the CE region showed significant sum of squares deviation (SSD) values (P < 0.05, Fig. 5, S2). Thus, the sudden expansion hypothesis was rejected for the NW and CE regions. However, the distributions of pairwise differences obtained with the single and combined gene data in the CC region were unimodal, with non-significant SSD and Harpending’s raggedness index (Rag) values (Fig. 5, S2), suggesting an expansion event in the CC region. The tau values (τ), a rough estimate of the time since population expansion, were approximately 3.842 (COI data), 2.016 (Cytb data), and 1.595 (COI+Cytb data) mutation units for the CC region. For the NW and CE regions, τ was 1.344 and 0.766 for the COI gene, 3.693 and 0.875 for the Cytb gene, and 2.628 and 1.875 for the combined COI and Cytb data, respectively (Fig. 5, S2).
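To show why an excess of rare variants yields a negative Tajima’s D, the following sketch computes the statistic from an alignment using the standard formulas of Tajima (1989). It is an illustration only, not the Arlequin implementation: it assumes aligned sequences without gaps or ambiguity codes, omits the significance test, and uses a hypothetical star-like toy sample.

```python
import math
from itertools import combinations


def tajimas_d(seqs):
    """Tajima's D for aligned, equal-length sequences."""
    n = len(seqs)
    # S: number of segregating (variable) sites.
    s = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if s == 0:
        raise ValueError("no segregating sites; D is undefined")
    # k: mean number of pairwise differences.
    pairs = list(combinations(seqs, 2))
    k = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    return (k - s / a1) / math.sqrt(e1 * s + e2 * s * (s - 1))


# Hypothetical star-like sample: one central haplotype plus four singletons.
star_sample = ["AAAAA", "TAAAA", "ATAAA", "AATAA", "AAATA"]
print("D =", round(tajimas_d(star_sample), 3))
```

Every variable site in the toy sample is a singleton, so the number of segregating sites is large relative to the mean pairwise difference and D is negative, the pattern expected after a recent expansion.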

Using two mitochondrial genes, we investigated the genetic diversity and structure of 437 individuals collected from 15 S. variegatus populations located in different oilseed rape production areas in China. The results revealed a high genetic diversity and a clear genetic structure of S. variegatus in the sampled areas.

Based on the analyses of the mtDNA sequences, haplotype distribution, haplotype networks, Bayesian clustering and AMOVA, three genetically diverse and geographically distinct regions of S. variegatus distribution in China were identified, namely the NW, CC and CE regions. A high proportion of the total genetic variance was attributed to variation within populations (49.18%) and among regions (47.01%). This shows that the largest source of variation might be not the geographical barriers among regions but the variation among individuals within populations. It was previously reported that variation among individuals within populations had a significant effect on the genetic structure of Chilo suppressalis [19]. This contrasts with studies of Myotis myotis and Plecotus austriacus [20, 21], which showed that geographical barriers had the most important effect. Other factors could also play a significant role in shaping genetic structure. Chen and Dorn analyzed the genetic variation of Cydia pomonella populations in Switzerland and found that host specificity, geographic isolation, intrinsic flight capacity and anthropogenic measures could shape the population structure [22].

A limited gene flow (Nm < 1) among regions was revealed by the current study. Once populations have become genetically differentiated, their genetic divergence can be maintained if they have adapted differentially to regional ecological conditions, since geographic variation in selection can act as a strong barrier to gene flow [23]. Our analysis also suggested a large gene flow among populations within the CC and CE regions. This may be due to the geographical isolation. The Mantel test results showed that gene flow between populations was strongly influenced by geographical distance. This strong isolation-by-distance relationship may also be due to the limited flight capacity of S. variegatus, which has been reported to fly 30~40 m in 2 min [2]. The flight range of S. variegatus is less than tens of kilometres, which would not be enough to weaken the isolation-by-distance relationship and could increase the potential for allopatric or parapatric speciation [24, 25]. On the other hand, the three regions shared common haplotypes, suggesting small amounts of gene flow among regions. This may be because some adults are mixed into the harvested rapeseed over the summer [4, 6]. Human intervention, in the form of alternating oilseed rape seed breeding between different locations, could also play an important role in mixing populations from distant geographic regions and provide the conditions for gene flow among regions [6].

Gene flow in insects has been reported to increase with mobility, which is more pronounced on herbaceous plants, and this feature is strong especially in agricultural pests [26]. The large genetic variation within populations was also found for the pollen beetle, Meligethes aeneus, another oilseed rape pest [9, 27-29]. However, no population structure of the pollen beetle could be found in five provinces of Sweden [28]. M. aeneus is found to have high altitude flights (up to ca 200 m) at specific points during the year and low-altitude flights at multiple periods [29], which could help to disperse over large distances with the assistance of prevailing wind currents [30], resulting in the high gene flow similar to the diamondback moths, Plutella xylostella [31].

Both the neutrality test and the mismatch distribution analysis indicated a population expansion in the CC region. Furthermore, the phylogeographic patterns of the COI and Cytb haplotype networks were roughly composed of three “star-like” clusters. Based on a substitution rate of 2.3% per site per million years [32], the expansion time of the CC region was estimated at about 128 ka for COI and 104 ka for Cytb, within the interglacial time of the Pleistocene. Vast glaciers developed at that time in the Tibetan Plateau, the Qinling Mountains and even the Yangtze River valley [33, 34], which could have triggered episodes of range contraction and expansion in many plant and animal species [35-37].
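The expansion dates can be checked by rearranging the relation τ = 2μkt given in the Methods. The sketch below assumes that the 2.3% figure is applied directly as μ = 0.023 substitutions per site per million years and that k is the fragment length (652 bp for COI, 421 bp for Cytb); the τ values are those reported for the CC region in the Results.

```python
def expansion_time_years(tau, rate_per_site_per_myr, n_sites):
    """Solve tau = 2 * mu * k * t for t, returned in years."""
    mu_per_year = rate_per_site_per_myr / 1e6
    return tau / (2.0 * mu_per_year * n_sites)


coi_years = expansion_time_years(tau=3.842, rate_per_site_per_myr=0.023, n_sites=652)
cytb_years = expansion_time_years(tau=2.016, rate_per_site_per_myr=0.023, n_sites=421)
print(f"COI:  {coi_years / 1000:.0f} ka")
print(f"Cytb: {cytb_years / 1000:.0f} ka")
```

Under these assumptions the calculation gives roughly 128 ka for COI and 104 ka for Cytb, the same two magnitudes as the estimates quoted above. If the 2.3% rate were instead read as a pairwise divergence rate (half that value per lineage), both dates would double.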

In China, management practices against S. variegatus have primarily focused on chemical control. Investigating the genetic diversity of S. variegatus populations can provide a useful guide for controlling this pest. Localized populations with similar genetic structure should be treated as a single management unit for the most effective control [38]. For isolated populations, various management methods should be used, in particular a variety of chemical pesticides with different properties. Additional research will be carried out using other molecular markers, such as nuclear genes, or faster-evolving markers, such as microsatellites, to obtain a better understanding of the population genetic structure and evolutionary history of S. variegatus in China, and in the rest of the world should the pest occur there in the future.

The current study provides the first population genetic analysis of S. variegatus, a serious pest of oilseed rape crops. The high variability observed in the COI and Cytb molecular markers indicates that the markers are useful for measuring genetic patterns in S. variegatus populations. We confirmed the strong genetic structure of S. variegatus populations in China, which could be divided into three genetic haplogroups and geographical regions with limited gene flow among them. The distribution of this species in oilseed rape production areas in China is mainly structured by isolation by geographical distance among populations and by their weak flight capacity. We also found a population expansion signature in the CC region, which might be related to climatic changes during the Pleistocene. These results suggest that phylogenetic information could help to guide the development of suitable protection and control strategies for oilseed rape crops.

Sampling

A total of 437 S. variegatus individuals were collected from 15 populations in China (Fig. 1). Sample sizes ranged from 24 to 37 individuals per population, except for the ESHB population, for which eight individuals were collected (Table S2). All S. variegatus individuals were freshly collected from the fields and immediately stored in absolute ethanol at -20℃ until molecular analysis.

DNA extraction, amplification, and sequencing

Total genomic DNA was extracted from each S. variegatus specimen following the DNeasy Blood & Tissue Kit protocol (QIAGEN, Germany). The primers LCO-1490 (5’-GGTCAACAAATCATAAAGATATTGG-3’) and HCO-2198 (5’-TAAACTTCAGGGTGACCAAAAAATCA-3’) were used for polymerase chain reaction (PCR) amplification of the COI gene region, and CB1 (5’-TATGTACTACCATGAGGACAAATATC-3’) and CB2 (5’-ATTACACCTCCTAATTTATTAGGAAT-3’) for the Cytb gene region [39].

PCRs were performed using an Applied Biosystems ABI 3730 (Applied Biosystems, USA) in a 25 μL reaction mixture containing 12.5 μL of 2 × Taq PCR Master Mix (BBI), 1 μL each of the 10 μM forward and reverse primers, 9.5 μL of ddH₂O, and 1 μL of template DNA. The PCR amplification programme was 4 min at 94℃, followed by 35 cycles of 30 s at 94℃, 30 s at 48℃, and 1 min at 72℃, and a final extension of 10 min at 72℃. A reaction mixture without DNA template was included as a negative control for each set of PCRs.

The PCR products were subjected to electrophoresis on a 1.5 % agarose gel (UltraPure Agarose, Invitrogen) containing 10,000× stock GelRed (Biotium) diluted at 1:10,000, visualized on a BioDoc-it imaging system (UVP) and purified using ExoSAP-IT (USB, USA). The PCR products were bidirectionally sequenced (using the above primers) on an ABI 3730XL Automated Sequencer using the BigDye Terminator Cycle Sequencing 3.1 Ready Reaction Kit (Applied Biosystems, USA).

Data analysis

Forward and reverse sequences were assembled and aligned using the ClustalW algorithm [40]. The chromatograms were checked for the presence of ambiguous bases. The sequences were also translated to amino acids using the invertebrate mitochondrial code implemented in MEGA7 to check for the presence of stop codons, which would indicate pseudogenes [41]. Population genetic diversity was estimated using the program DnaSP 5.0 [42], as indexed by the number of variable sites (S), parsimony informative sites, number of haplotypes (Hn), percentage of haplotypes unique to a given geographical area, haplotype diversity (Hd), nucleotide diversity (π), and average number of nucleotide differences (k). To estimate haplotype completeness, a Coleman rarefaction curve was calculated with the haploAccum function of the spider package implemented in R [43]. The Templeton, Crandall, and Sing (TCS) haplotype network was constructed using POPART [44, 45].
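The stop-codon screen described above can be sketched in a few lines. This is not the MEGA7 procedure; it only assumes that, under the invertebrate mitochondrial code (NCBI translation table 5), TAA and TAG are the stop codons while TGA encodes tryptophan. The sequences are hypothetical.

```python
# Stop codons under the invertebrate mitochondrial code (NCBI table 5).
INVERT_MITO_STOPS = {"TAA", "TAG"}


def in_frame_stops(seq, frame=0):
    """Return 0-based codon indices that hold a stop codon in the given frame."""
    seq = seq.upper()
    stops = []
    for i in range(frame, len(seq) - 2, 3):
        if seq[i:i + 3] in INVERT_MITO_STOPS:
            stops.append((i - frame) // 3)
    return stops


def open_frames(seq):
    """Reading frames (0, 1, 2) of the forward strand that contain no stop codon."""
    return [frame for frame in range(3) if not in_frame_stops(seq, frame)]


# Hypothetical fragments: the first carries TGA (Trp here), the second TAA (stop).
print(in_frame_stops("ATTTGAGGAGCA"))
print(in_frame_stops("ATTTAAGGA"))
print(open_frames("ATTTAAGGA"))
```

A genuine mitochondrial protein-coding fragment should have at least one frame free of stop codons, whereas a nuclear pseudogene copy often has none.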

An admixture model based on the Bayesian clustering method was run in STRUCTURE 2.3 [46] to determine whether the populations were genetically distinct. A total of 6 independent simulations were run for each K-value from 1 to 10, with 500,000 burn-in steps followed by 500,000 steps. Population genetic structure was assessed with AMOVA in Arlequin 3.5 according to the degree of differentiation between regions (F_CT), between populations within regions (F_SC), and between all populations (F_ST). Pairwise F_ST analyses among populations and regions were carried out with significance tests based on 1,000 permutations using Arlequin 3.5 [47]. To test for isolation by distance, the matrices of genetic distance, F_ST/(1-F_ST), and ln-transformed geographic distance between all 15 populations were compared using the Mantel test with 10,000 permutations [48]. The analysis was carried out using the zt software package [49].
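The isolation-by-distance test can be illustrated with a minimal permutation-based Mantel test. This sketch is a stand-in for the zt package, not a reproduction of it: it uses a one-sided test on the Pearson correlation, and the population coordinates and F_ST values are hypothetical, constructed so that differentiation increases with distance.

```python
import math
import random
from itertools import combinations


def _pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mantel(gen, geo, permutations=10000, seed=0):
    """One-sided Mantel test; returns (r, p) for two symmetric distance matrices."""
    n = len(gen)
    pairs = list(combinations(range(n), 2))
    geo_vec = [geo[i][j] for i, j in pairs]
    r_obs = _pearson([gen[i][j] for i, j in pairs], geo_vec)
    rng = random.Random(seed)
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)  # relabel populations in the genetic matrix only
        r_perm = _pearson([gen[order[i]][order[j]] for i, j in pairs], geo_vec)
        if r_perm >= r_obs:
            hits += 1
    return r_obs, (hits + 1) / (permutations + 1)


# Hypothetical populations along a transect (km) with F_ST rising with distance.
positions = [0, 10, 50, 200, 800]
size = len(positions)
geo_toy = [[0.0] * size for _ in range(size)]
gen_toy = [[0.0] * size for _ in range(size)]
for i, j in combinations(range(size), 2):
    ln_dist = math.log(abs(positions[i] - positions[j]))
    fst = 0.05 * ln_dist
    geo_toy[i][j] = geo_toy[j][i] = ln_dist
    gen_toy[i][j] = gen_toy[j][i] = fst / (1.0 - fst)  # linearised F_ST

r_toy, p_toy = mantel(gen_toy, geo_toy, permutations=2000, seed=1)
print(f"r = {r_toy:.3f}, p = {p_toy:.4f}")
```

Permuting population labels rather than individual matrix cells preserves the dependence among pairwise distances that share a population, which is the reason a Mantel test is used instead of an ordinary correlation test.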

We examined historical demographic expansion with Tajima’s D and Fu’s Fs neutrality tests and pairwise mismatch distributions [50-53], as implemented in Arlequin 3.5 [47]. Tajima’s D and Fu’s Fs values are sensitive to demographic expansion, which usually leads to large negative values. Pairwise mismatch distributions were used to test whether a population had experienced expansion events. A goodness-of-fit test was used to determine the smoothness of the observed mismatch distribution (using Harpending’s raggedness index, Rag) and the degree of fit between the observed and simulated data (using the sum of squares deviation, SSD) [54, 55]. An expansion signal for a population was indicated by a smooth and unimodal distribution pattern with non-significant p-values for the SSD. The time of expansion was estimated with the formula τ = 2μkt [53], where τ is the crest of the mismatch distribution, μ is the nucleotide substitution rate per site, k is the number of nucleotides in the fragment, and t is the time since expansion.
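The mismatch distribution and the raggedness index can be sketched as follows. The raggedness index is taken here as the sum of squared differences between neighbouring frequency classes, including the drop to zero after the largest observed class. The bootstrap used by Arlequin to obtain p-values is omitted, and the sequences are hypothetical.

```python
from collections import Counter
from itertools import combinations


def mismatch_distribution(seqs):
    """Relative frequency of sequence pairs differing at 0, 1, 2, ... sites."""
    diffs = [
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in combinations(seqs, 2)
    ]
    counts = Counter(diffs)
    return [counts.get(i, 0) / len(diffs) for i in range(max(diffs) + 1)]


def raggedness(freqs):
    """Harpending's raggedness index of a mismatch distribution."""
    padded = list(freqs) + [0.0]  # class d+1 has frequency zero
    return sum((padded[i] - padded[i - 1]) ** 2 for i in range(1, len(padded)))


# Hypothetical star-like sample: one central haplotype plus four singletons.
toy_sample = ["AAAAA", "TAAAA", "ATAAA", "AATAA", "AAATA"]
toy_freqs = mismatch_distribution(toy_sample)
print(toy_freqs, round(raggedness(toy_freqs), 3))
```

With real data the distribution spans many more difference classes, and smooth unimodal curves give low raggedness values while multimodal curves from stable or subdivided populations give higher ones.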

mtDNA: mitochondrial DNA; COI: cytochrome c oxidase subunit I; Cytb: cytochrome b; Hd: Haplotype diversity; π: nucleotide diversity; F_ST: genetic differentiation; PCR: Polymerase chain reaction; AMOVA: analysis of molecular variance; Rag: Harpending’s raggedness index; SSD: the sum of squares deviation.

Acknowledgments

We thank Ling-Ling Gao of Agriculture & Food Business Unit, the Commonwealth Scientific and Industrial Research Organization for critical suggestions and manuscript editing.

Authors’ contributions

SMH and HXZ conceived and designed the experiments. SMH, HXZ, and ZPH collected the data. HXZ, RT and LNZ analyzed the data. HXZ wrote the first draft of the manuscript. JJZ critically edited and proofread the manuscript. All authors contributed substantially to revisions.

Funding

This work was jointly supported by the Earmarked Fund for China Agriculture Research System (Grant No. CARS-13) and the National Key Research and Development Program of China (Grant No. 2018YFD0200905). JJZ was financially supported by the Short-term Recruitment Program of Foreign Experts of Anhui Province, awarded to SMH (Grant No. 558340119600901014) by the Anhui Foreign Experts Bureau. The funding bodies played no role in the design of the study, the collection, analysis, and interpretation of data, or the writing of the manuscript.

Availability of data and materials

All mitochondrial and sample location data are available. DNA sequences are deposited at GenBank under the accession numbers MN935027-MN935096 for COI haplotypes and MN935097-MN935163 for Cytb haplotypes.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Borror DJ, De Long DM, Triplehorn CA. An introduction to the study of insects (5th edition). New York: Saunders College Publishing; 1981.
He CG, Wang GL, Fan YH, Zou YX, Deng HY. Study on a new pest Strongyllodes variegatus (Fairmaire 1891) attacking rape. Acta Agr Boreali-occidentalis Sin. 1998;7(4):18-23.
He CG. Occurrence law and pesticide control of Meligethes aeneus (Coleoptera: Nitidulidae). Plant Protect. 2001;27(1):15-17.
Hu BJ, Hou SM, Li CC, Zhou ZY, Zhang HS. Preliminary report of a new insect pest Strongyllodes variegatus of oilseed rape in Anhui province. Proceedings of the 2012 Annual Conference of the Anhui Insect and Pathology Society. 2012;161-165.
He CG, Pan F, Yang ZM, Han E, Fan YH, Zou YX. Strategies and measures for integrated pest control in spring oilseed rape. Gansu Agr Sci and Tech. 1996;(1):31-32.
Hou SM, Hu BC, Hu BJ, Li QS, Fei WX, Jiang YF, Fan ZX, Rong SB. New pest Strongyllodes variegatus on winter oilseed rape in Anhui province. Chinese J Oil Crop Sci. 2013;35(6):692-696.
Zimmer CT, Maiwald F, Schorn C, Bass C, Ott MC, Nauen R. A de novo transcriptome of European pollen beetle populations and its analysis with special reference to insecticide action and resistance: next-generation sequencing of the pollen beetle. Insect Mol Biol. 2014;23(4):511-526.
Ouvrard P, Hicks DM, Mouland M, Nicholls JA, Baldock KC, Goddard MA, Kunin WE, Potts SG, Thieme T, Veromann E, Stone GN. Molecular taxonomic analysis of the plant associations of adult pollen beetles (Nitidulidae: Meligethinae) and the population structure of Brassicogethes aeneus. 2016;59(12):1101-1116.
Behura SK. Molecular marker systems in insects: current trends and future avenues. Mol Ecol. 2006;15(11):3087-3113.
Hurst GD, Jiggins FM. Problems with mitochondrial DNA as a marker in population phylogeographic and phylogenetic studies: the effects of inherited symbionts. P Roy Soc Lond B Bio. 2005;272(1572):1525-1534.
Birungi J, Arctander P. Large sequence divergence of mitochondrial DNA genotypes of the control region within populations of the African antelope kob (Kobus kob). Mol Ecol. 2000;9(12):1997-2008.
Zanol J, Halanych KM, Struck TH, Fauchald K. Phylogeny of the bristle worm family Eunicidae (Eunicida Annelida) and the phylogenetic utility of noncongruent 16S COI and 18S in combined analyses. Mol Phylogenet Evol. 2010;55(2):660-676.
Men Q, Xue G, Mu D, Hu Q, Huang M. Mitochondrial DNA markers reveal high genetic diversity and strong genetic differentiation in populations of Dendrolimus kikuchii Matsumura (Lepidoptera: Lasiocampidae). PloS One. 2017;12(6):e0179706.
Meng XF, Shi M, Chen XX. Population genetic structure of Chilo suppressalis (Walker) (Lepidoptera: Crambidae): strong subdivision in China inferred from microsatellite markers and mtDNA gene sequences. Mol Ecol. 2008;17(12):2880-2897.
Du Z, Liu H, Li H, Ishikawa T, Su ZH, Cai WZ, Kamitani S, Tadauchi O. Invasion of the assassin bug Agriosphodrus dohrni (Hemiptera: Reduviidae) to Japan: Source estimation inferred from mitochondrial and nuclear gene sequences. Int J Biol Macromol. 2018;118(Pt B):1565-1573.
Cesari M, Maistrello L, Ganzerli F, Dioli P, Rebecchi L, Guidetti R. A pest alien invasion in progress: potential pathways of origin of the brown marmorated stink bug Halyomorpha halys populations in Italy. J Pest Sci. 2015;88(1):1-7.
Gariepy TD, Haye T, Fraser H, Zhang J. Occurrence genetic diversity and potential pathways of entry of Halyomorpha halys in newly invaded areas of Canada and Switzerland. J Pest Sci. 2014;87(1):17-28.
Valentin RE, Nielsen AL, Wiman NG, Lee DH, Fonseca DM. Global invasion network of the brown marmorated stink bug Halyomorpha halys. Sci Rep. 2017;7(1):9866.
Meng XF, Shi M, Chen XX. Population genetic structure of Chilo suppressalis (Walker) (Lepidoptera: Crambidae): strong subdivision in China inferred from microsatellite markers and mtDNA gene sequences. Mol Ecol. 2008;17:2880-2897.
Ruedi M, Walter S, Fischer MC, Scaravelli D, Excoffier L, Heckel G. Italy as a major Ice Age refuge area for the bat Myotis myotis (Chiroptera: Vespertilionidae) in Europe. Mol Ecol. 2008;17(7):1801-1814.
Razgour O, Juste J, Ibáñez C, Kiefer A, Rebelo H, Puechmaille SJ, Arlettaz R, Burke T, Dawson DA, Beaumont M, Jones. The shaping of genetic variation in edge-of-range populations under past and future climate change. Ecol Lett. 2013;16(10):1258-1266.
Chen MH, Dorn S. Microsatellites reveal genetic differentiation among populations in an insect species with high genetic variability in dispersal, the codling moth, Cydia pomonella (L.) (Lepidoptera: Tortricidae). Bull Entomol Res. 2009;100:75-85.
Krantz DE, Williams DF, Jones DS. Ecological and paleoenvironmental information using stable isotope profiles from living and fossil molluscs. Palaeogeogr Palae Ocl. 1987;58(3-4):249-266.
Peterson MA, Denno RF. The influence of dispersal and diet breadth on patterns of genetic isolation by distance in phytophagous insects. Am Nat. 1998;152(3):428-446.
Roff DA. The evolution of flightlessness: is history important? Evol Ecol. 1994;8(6):639-657.
Juhel AS, Barbu CM, Valantin-Morison M, Gauffre B, Leblois R, Olivares J, Franck P. Limited genetic structure and demographic expansion of the Brassicogethes aeneus populations in France and in Europe. Pest Manag Sci. 2019;75:667-675.
Kazachkova N, Meijer J, Ekbom B. Genetic diversity in pollen beetles (Meligethes aeneus) in Sweden: role of spatial temporal and insecticide resistance factors. Agr Forest Entomol. 2007;9(4):259-269.
Kazachkova N, Meijer J, Ekbom B. Genetic diversity in European pollen beetle Meligethes aeneus (Coleoptera: Nitidulidae) populations assessed using AFLP analysis. Eur J Entomol. 2008;105(5):807-814.
Mauchline AL, Cook SM, Powell W, Chapman JW, Osborne JL. Migratory flight behaviour of the pollen beetle Meligethes aeneus. Pest Manag Sci. 2017;73(6):1076-1082.
Williams IH. The major insect pests of oilseed rape in Europe and their management: an overview. Biocontrol-based integrated management of oilseed rape pests. Dordrecht: Springer; 2010:1-43.
Endersby NM, McKechnie SW, Ridland PM, Weeks AR. Microsatellites reveal a lack of structure in Australian populations of the diamondback moth Plutella xylostella (L.). Mol Ecol. 2006;15(1):107-118.
Brower AVZ. Rapid morphological radiation and convergence among races of the butterfly Heliconius erato inferred from patterns of mitochondrial DNA evolution. P Natl Acad Sci USA. 1994;91(14):6491-6495.
Shi YF, Cui ZJ, Su Z. The Quaternary glaciations and environmental variations in China. Shijiazhuang: Hebei Science and Technology Press; 2006.
Deng LL. The Pleistocene climate of Changjiang valley. Journal of Guizhou Normal University (Natural Sciences). 2006;24(3):29-32.
Hewitt G. The genetic legacy of the Quaternary ice ages. Nature. 2000;405(6789):907-913.
Smith CI, Farrell BD. Range expansions in the flightless longhorn cactus beetles Moneilema gigas and Moneilema armatum in response to Pleistocene climate changes. Mol Ecol. 2005;14(4):1025-1044.
Tzedakis PC, Roucoux KH, de Abreu L, Shackleton NJ. The duration of forest stages in southern Europe and interglacial climate variability. Science. 2004;306(5705):2231-2235.
Ayres RM, Pettigrove VJ, Hoffmann AA. Low diversity and high levels of population genetic structuring in introduced eastern mosquitofish (Gambusia holbrooki) in the greater Melbourne area, Australia. Biol Invasions. 2010;12:3727-3744.
Simon C, Frati F, Beckenbach A, Crespi B, Liu H, Flook P. Evolution, weighting, and phylogenetic utility of mitochondrial gene sequences and a compilation of conserved polymerase chain reaction primers. Ann Entomol Soc Am. 1994;87(6):651-701.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22(22):4673-4680.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870-1874.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451-1452.
Gotelli NJ, Colwell RK. Quantifying biodiversity: procedures and pitfalls in the measurement and comparison of species richness. Ecol Lett. 2001;4(4):379-391.
Templeton AR, Crandall KA, Sing CF. A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping and DNA sequence data. III. Cladogram estimation. Genetics. 1992;132(2):619-633.
Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6(9):1110-1116.
Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945-959.
Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10(3):564-567.
Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Res. 1967;27(2):209-220.
Bonnet E, Van de Peer Y. zt: a software tool for simple and partial Mantel tests. J Stat Softw. 2002;7(10):1.
Tajima F. The effect of change in population size on DNA polymorphism. Genetics. 1989;123(3):597-601.
Fu YX. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics. 1997;147(2):915-925.
Rogers AR, Harpending H. Population growth makes waves in the distribution of pairwise genetic differences. Mol Biol Evol. 1992;9(3):552-569.
Slatkin M, Hudson RR. Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics. 1991;129(2):555-562.
Harpending HC. Signature of ancient population growth in a low-resolution mitochondrial DNA mismatch distribution. Hum Biol. 1994;66(4):591-600.
Schneider S, Excoffier L. Estimation of past demographic parameters from the distribution of pairwise differences when the mutation rates vary among sites: application to human mitochondrial DNA. Genetics. 1999;152(3):1079-1089.

Table 1 Genetic diversity indices and neutrality tests for the mitochondrial COI and Cytb markers in all analyzed Strongyllodes variegatus populations

Marker	Population code	Region^a	S	Hn	Hd	π	k	Tajima's D	P	Fu's Fs	P
COI	GDQH		4	5	0.702	0.00166	1.082	0.263	NS	-0.286	NS
	HZGS		8	9	0.702	0.00192	1.255	-1.059	NS	-3.893	**
	ZYGS		5	6	0.649	0.00121	0.790	-1.188	NS	-2.707	*
	GYSC		14	14	0.855	0.00378	2.467	-1.008	NS	-6.799	***
	HZSX		14	15	0.913	0.00462	3.018	-0.487	NS	-6.672	***
	AKSX		13	13	0.852	0.00431	2.810	-0.351	NS	-3.961	*
	FJCQ		11	14	0.857	0.00328	2.138	-0.740	NS	-7.898	***
	ESHB		6	6	0.893	0.00257	1.679	-1.280	NS	-3.114	**
	LCHB		9	11	0.764	0.00320	2.085	-0.206	NS	-3.819	*
	AQAH		13	11	0.580	0.00146	0.949	-2.201	***	-8.187	***
	LAAH		4	5	0.424	0.00072	0.467	-1.654	*	-3.127	***
	HFAH		6	6	0.574	0.00119	0.775	-1.306	NS	-2.271	NS
	CHAH		5	6	0.520	0.00092	0.597	-1.543	*	-3.524	***
	NJJS		5	4	0.458	0.00095	0.619	-1.367	NS	-0.697	NS
	ZJJS		5	6	0.628	0.00118	0.770	-1.041	NS	-2.417	*
		NW	14	15	0.713	0.00183	1.193	-1.565	*	-9.255	***
		CC	30	43	0.856	0.00397	2.587	-1.471	*	-26.732	***
		CE	20	20	0.544	0.00113	0.736	-2.132	**	-21.274	***
	Total		45	70	0.865	0.00427	2.786	-1.628	*	-25.887	***
Cytb	GDQH		6	6	0.708	0.00417	1.758	0.547	NS	0.186	NS
	HZGS		4	4	0.469	0.00285	1.198	0.558	NS	1.002	NS
	ZYGS		3	4	0.583	0.00161	0.678	-0.394	NS	-0.714	NS
	GYSC		14	14	0.833	0.00539	2.271	-1.192	NS	-7.424	***
	HZSX		11	11	0.832	0.00472	1.986	-0.916	NS	-4.300	*
	AKSX		13	13	0.810	0.00455	1.916	-1.258	NS	-6.437	**
	FJCQ		9	10	0.791	0.00300	1.262	-1.381	NS	-5.530	***
	ESHB		2	3	0.464	0.00119	0.500	-1.310	NS	-0.999	NS
	LCHB		18	15	0.752	0.00359	1.51	-2.252	***	-12.320	***
	AQAH		8	9	0.718	0.00255	1.075	-1.273	NS	-4.442	**
	LAAH		3	4	0.71	0.00215	0.905	0.223	NS	-0.187	NS
	HFAH		9	9	0.784	0.00251	1.239	-1.322	NS	-3.954	**
	CHAH		7	9	0.726	0.00272	1.145	-1.151	NS	-5.076	***
	NJJS		9	9	0.776	0.00347	1.462	-1.082	NS	-3.413	*
	ZJJS		8	9	0.697	0.00215	0.903	-1.655	*	-5.812	**
		NW	9	9	0.638	0.00354	1.492	-0.393	NS	-1.395	NS
		CC	33	43	0.826	0.00436	1.837	-1.992	**	-27.537	***
		CE	20	24	0.741	0.00276	1.162	-1.799	*	-21.480	***
	Total		40	67	0.834	0.00479	2.015	-1.819	**	-26.759	***
COI + Cytb		NW	23	23	0.800	0.00250		-1.208	NS	-11.042	***
		CC	63	82	0.957	0.00412		-1.847	**	-25.523	***
		CE	40	48	0.881	0.00177		-2.139	**	-27.540	***

For each population, the number of variable sites (S), number of haplotypes (Hn), haplotype diversity (Hd), nucleotide diversity (π), average number of nucleotide differences (k) and Tajima's D and Fu's Fs test statistics for selective neutrality are given.

^a Regions as defined in Fig. 1.

Values are significant at * P ≤ 0.05; ** P ≤ 0.01; *** P ≤ 0.001; NS, not significant
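
As a cross-check on the Hd column, the following is a minimal sketch of haplotype diversity computed from haplotype counts, assuming the usual unbiased estimator Hd = n/(n - 1) × (1 - Σ p_i²), where p_i is the frequency of haplotype i among n sequences. The function name and the choice of populations are illustrative only; the COI counts for GDQH and ESHB are read from Additional file 1: Table S1, and this is not necessarily the exact routine used by the software cited in the Methods.

```python
from typing import Sequence


def haplotype_diversity(counts: Sequence[int]) -> float:
    """Unbiased haplotype diversity: Hd = n/(n-1) * (1 - sum(p_i^2))."""
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two sequences are required")
    # Sum of squared haplotype frequencies (probability that two
    # sequences drawn with replacement carry the same haplotype).
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq)


# COI haplotype counts per population, read from Table S1 (A).
gdqh_coi = [14, 7, 1, 11, 1]   # H4, H5, H6, H10, H11
eshb_coi = [3, 1, 1, 1, 1, 1]  # H2, H30, H31, H48, H49, H50

print(round(haplotype_diversity(gdqh_coi), 3))
print(round(haplotype_diversity(eshb_coi), 3))
```

Under these assumptions the two values round to the Hd entries reported for GDQH (0.702) and ESHB (0.893) in the COI block of Table 1.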

Table 2 Hierarchical analysis of molecular variance (AMOVA) of Strongyllodes variegatus collected from 15 populations

Source of variation	df	Sum of squares	% of variation	Fixation indices
all populations
Among populations	14	446.669	42.50
Within populations	422	599.926	57.50	F_ST = 0.425***
three regions
Among regions	2	391.765	47.01	F_CT = 0.470***
Among populations within regions	12	54.904	3.80	F_SC = 0.072***
Within populations	422	599.926	49.18	F_ST = 0.508***
NW vs. CC
Among regions	1	124.847	33.89	F_CT = 0.339**
Among populations within regions	7	46.483	5.95	F_SC = 0.090***
Within populations	248	438.452	60.16	F_ST = 0.398***
NW vs. CE
Among regions	1	89.300	38.88	F_CT = 0.389**
Among populations within regions	7	26.418	5.11	F_SC = 0.084***
Within populations	263	265.672	56.00	F_ST = 0.440***
CC vs. CE
Among regions	1	332.830	54.95	F_CT = 0.550***
Among populations within regions	10	36.907	2.23	F_SC = 0.050***
Within populations	333	495.727	42.82	F_ST = 0.572***

AMOVA partitioned among all populations and three regions: NW region (GDQH, HZGS, ZYGS), CC region (GYSC, HZSX, AKSX, FJCQ, ESHB, LCHB) and CE region (AQAH, LAAH, HFAH, CHAH, NJJS, ZJJS).

** P ≤ 0.001; *** P ≤ 0.0001 after 1,023 permutations
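
The fixation indices in Table 2 can be related to the "% of variation" column with a short sketch. It assumes the standard three-level AMOVA definitions: F_CT is the among-region share of total variance, F_SC is the among-population share of the variance found within regions, and F_ST is the combined share of both among-group levels. The function name and argument names are hypothetical, and negative variance components are not handled.

```python
def fixation_indices(among_regions: float,
                     among_pops_within_regions: float,
                     within_pops: float) -> dict:
    """Fixation indices from three-level AMOVA variance components.

    Inputs may be raw variance components or percentages of variation;
    only their ratios matter.
    """
    total = among_regions + among_pops_within_regions + within_pops
    within_regions = among_pops_within_regions + within_pops
    if total <= 0 or within_regions <= 0:
        raise ValueError("variance components must sum to a positive value")
    return {
        "F_CT": among_regions / total,
        "F_SC": among_pops_within_regions / within_regions,
        "F_ST": (among_regions + among_pops_within_regions) / total,
    }


# Percentages of variation for the three-region grouping in Table 2.
three_regions = fixation_indices(47.01, 3.80, 49.18)
print({name: round(value, 3) for name, value in three_regions.items()})
```

With the three-region percentages this gives values close to the reported F_CT = 0.470, F_SC = 0.072 and F_ST = 0.508; small differences are expected because the percentages in the table are already rounded.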

Table 3 Pairwise F_ST values among populations of Strongyllodes variegatus based on the combined data of the COI and Cytb genes

Population	GDQH	ZYGS	HZGS	GYSC	HZSX	AKSX	FJCQ	ESHB	LCHB	AQAH	LAAH	HFAH	CHAH	NJJS	ZJJS
GDQH
ZYGS	0.086
HZGS	0.330	0.124
GYSC	0.400	0.438	0.533
HZSX	0.249	0.230	0.326	0.066
AKSX	0.240	0.226	0.325	0.078	-0.015
FJCQ	0.454	0.488	0.597	0.025	0.089	0.077
ESHB	0.573	0.627	0.767	0.024	0.159	0.161	0.009
LCHB	0.443	0.489	0.598	0.008	0.103	0.096	0.006	0.004
AQAH	0.477	0.369	0.336	0.583	0.431	0.434	0.644	0.756	0.642
LAAH	0.534	0.458	0.489	0.593	0.454	0.460	0.669	0.811	0.661	0.074
HFAH	0.478	0.378	0.363	0.574	0.423	0.427	0.639	0.752	0.635	-0.002	0.071
CHAH	0.482	0.377	0.366	0.566	0.409	0.414	0.638	0.771	0.633	-0.014	0.088	0.000
NJJS	0.492	0.399	0.387	0.575	0.432	0.436	0.643	0.754	0.638	0.019	0.040	-0.001	0.027
ZJJS	0.495	0.395	0.388	0.583	0.430	0.434	0.651	0.779	0.646	0.015	0.152	-0.003	0.019	0.026

Significant F_ST values are shown in bold (P = 0.05)

Table 4 Pairwise F_ST values (below diagonal) and gene flow (above diagonal) between and within geographical regions based on the combined data of the COI and Cytb genes

Regions^a	Within region	NW	CC	CE
Within region	-	1.131^b	3.917	9.009
NW	0.181		0.457	0.373
CC	0.060	0.354 ***		0.202
CE	0.027	0.401 ***	0.553 ***

^a Regions as defined in Fig. 1 and Table 2

^b Gene flow (Nm) was calculated from F_ST as: Nm = (1 - F_ST) / (4 F_ST)

*** P < 0.001
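
The conversion in footnote b can be sketched in a few lines. The function name is hypothetical, and the inputs are the rounded F_ST values printed in Table 4, so the results may differ from the tabulated Nm in the last decimal where the original calculation presumably used unrounded F_ST.

```python
def gene_flow_nm(fst: float) -> float:
    """Gene flow estimate from F_ST: Nm = (1 - F_ST) / (4 * F_ST)."""
    if not 0.0 < fst < 1.0:
        raise ValueError("F_ST must lie strictly between 0 and 1")
    return (1.0 - fst) / (4.0 * fst)


# Rounded F_ST values taken from Table 4.
table4_fst = {
    "NW (within)": 0.181,
    "CC (within)": 0.060,
    "CE (within)": 0.027,
    "NW vs. CC": 0.354,
    "NW vs. CE": 0.401,
    "CC vs. CE": 0.553,
}

for label, fst in table4_fst.items():
    print(f"{label}: Nm = {gene_flow_nm(fst):.3f}")
```

Because Nm falls steeply as F_ST rises, the between-region estimates (all below 1) are much smaller than the within-region ones, which is the pattern the table reports.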

Additional file 1: Table S1 Geographical distribution of (A) COI and (B) Cytb haplotypes of Strongyllodes variegatus (Hap. = haplotype; N = total number)

(A) Hap.	GDQH	HZGS	ZYGS	GYSC	HZSX	AKSX	FJCQ	ESHB	LCHB	AQAH	LAAH	HFAH	CHAH	NJJS	ZJJS	N
H1		1	3	3	3	3			1	24	16	21	18	22	17	132
H2				11	8	12	10	3	15							59
H3										3		8	2	7	9	29
H4	14	18	14		3	6	2		3							60
H5	7	2							1							10
H6	1				1											2
H7		2			2	2										6
H8			1			2										3
H9			2	1		1										4
H10	11	5														16
H11	1															1
H12		2														2
H13		1														1
H14		1														1
H15		2														2
H16			2													2
H17			2													2
H18				1			1									2
H19				1	1											2
H20				1	2											3
H21				2	2		1									5
H22				1												1
H23				3	1				3							7
H24				2												2
H25				1												1
H26				1					4							5
H27				1												1
H28				1												1
H29					1	1										2
H30					2	2		1								5
H31					1		1	1	1							4
H32					1											1
H33					1											1
H34					1											1
H35						1										1
H36						1										1
H37						2	2		1							5
H38						1										1
H39						1										1
H40							1									1
H41							6									6
H42							1									1
H43							1									1
H44							1									1
H45							1									1
H46							1									1
H47							1		1							2
H48								1								1
H49								1								1
H50								1								1
H51									1							1
H52									1							1
H53										1	1		2	1	2	7
H54										1	2	1	1			5
H55										1	1					2
H56										1						1
H57										1						1
H58										1						1
H59										1						1
H60										1						1
H61										2						2
H62											1					1
H63												2				2
H64												1				1
H65												1				1
H66													1		1	2
H67													2			2
H68														1		1
H69															1	1
H70															1	1
(B) Hap.	GDQH	HZGS	ZYGS	GYSC	HZSX	AKSX	FJCQ	ESHB	LCHB	AQAH	LAAH	HFAH	CHAH	NJJS	ZJJS	N
H1	12	24	15	3	11	10	3		3	18	6	13	13	12	15	158
H2				9	5	12	13	6	16							61
H3										8	9	9	5	8	9	48
H4		1			2	2				1		3				9
H5	1									1			1			3
H6			4							1						5
H7			2	1												3
H8				1								1	1			3
H9				1	1		4					2			1	9
H10	14	7														21
H11	4	2														6
H12	2															2
H13	1		3													4
H14				1		1										2
H15				2	1		1									4
H16				5	4											9
H17				2												2
H18				1												1
H19				1												1
H20				1					1							2
H21				1												1
H22				1												1
H23					1		1									2
H24					1											1
H25					1											1
H26					2											2
H27					1											1
H28						1										1
H29						1										1
H30						1	2									3
H31						2	3									5
H32						1										1
H33						1										1
H34						1	1									2
H35						1										1
H36						1										1
H37							1	1	1							3
H38							1									1
H39								1								1
H40									1							1
H41									1							1
H42									1							1
H43									1							1
H44									1							1
H45									1							1
H46									1							1
H47									1							1
H48									1							1
H49									1							1
H50									1							1
H51										2			1	1	1	5
H52										4	5	3	1			13
H53										1						1
H54										1						1
H55											1	1				2
H56												1		1		2
H57												1				1
H58													2	1		3
H59													1			1
H60													1			1
H61														5		5
H62														1	1	2
H63														1	1	2
H64														1		1
H65															1	1
H66															1	1
H67															1	1

Additional file 2: Figure S1 Individual-based rarefaction curves of haplotype diversity of S. variegatus in China.

Additional file 3: Figure S2 Pairwise mismatch distributions of (a) COI and (b) Cytb genes for the three derived regions. The x-axis represents the number of pairwise differences among sequences, and the y-axis represents the frequency of pairwise differences in each region. The significance values (P) of the parameters were evaluated with 1,000 simulations. P_SSD: P value for SSD (sum of squared deviations); P_R: P value for Rag (Harpending's raggedness index); τ: the index of population expansion.

Additional file 4: Table S2 Sample information of Strongyllodes variegatus (Fairmaire) specimens collected for the present study

Province	Location	Abbreviation	Longitude	Latitude	Years	Sample size
Qinghai	Guide	GDQH	101.43	36.05	2012	34
Gansu	Hezheng	HZGS	103.35	35.43	2013, 2015	34
	Zhenyuan	ZYGS	107.32	35.53	2019	24
Sichuan	Guangyuan	GYSC	105.79	32.59	2015	30
Shaanxi	Hanzhong	HZSX	106.67	33.16	2015	30
	Ankang	AKSX	108.25	32.05	2015, 2017	35
Chongqing	Fengjie	FJCQ	109.44	31.01	2015	30
Hubei	Enshi	ESHB	109.72	30.61	2015	8
	Lichuan	LCHB	108.77	30.48	2019	32
Anhui	Anqing	AQAH	116.58	30.63	2016, 2017	37
	Liu'an	LAAH	116.70	31.79	2016, 2017	21
	Hefei	HFAH	117.23	31.88	2015, 2016, 2017	34
	Chaohu	CHAH	117.89	31.62	2012, 2015	26
Jiangsu	Nanjing	NJJS	118.47	32.05	2019	31
	Zhenjiang	ZJJS	119.18	31.94	2015	31

Status: Journal Publication (Version 2)