Genetic variability among P. gossypiella populations using mitochondrial Cytochrome C Oxidase gene (COI)
Fifty one pink bollworm individuals from 16 populations were sequenced for partial mitochondrial COI region to workout the population genetic structure, genetic diversity
and distribution in three cotton-growing zones of India. The nucleotide sequences of 748 bp were aligned using CLUSTEW and edited using MEGA7 to ensure no missing data for all samples. The insertions or deletions were checked and corrected in all the sequences. Then, the sequences were BLAST analyzed in NCBI and found consensus with the PBW sequences from various parts of the world.
Phylogenetic tree was constructed with sixteen populations of P. gossypiella applying Maximum likelihood algorithm to JTT matrix-based model using MEGAX. Cluster diagram grouped all 16 populations in to three major clades (Fig. 1). Clade I includes total of 10 populations, of which 9 are from south India (Rangareedy, Karimnagar, Warangal, Nalgonda, Guntur, Kadapa, Kurnool, Chennai and Bangalore) and one from north India (Faridkot). Clade II includes 5 populations of which three from central India (Buldhana, Akola, Junaghar) and two from south India (Raichur and Adilabad) whereas clade III includes only one population from Mahabubnagar.
Pair wise distances
Genetic distance among the populations of P. gossypiella was calculated using pair-wise distances analyzed by poisson correction model and gave values ranging from 0.000 to 0.01066 (Table 2). This suggested that there is no much variation among the population of P. gossypiella in India. The highest genetic distance (0.01066) was observed between populations of Mahabubnagar with Raichur. The genetic distance was nil among the populations viz., Bangalore, kadapa, Karimnagar, Guntur, punjab, Ranga Rededy, Warangal, Kurnool and Tamil Nadu of clade I and Buldhana, Adilabad, Akola and Gujarat of clade II. The present study revealed very low level of genetic variation among P. gossypiella populations as the genetic distance varied from 0.000 to 0.01066 across populations. On a larger scale, low genetic differences among populations appeared to result from low dispersal rates between populations.
Table 1
Details of Pectinophora gossypiella sampled for genetic variability studies during 2019–2020.
S.No | Zone | Location | Date of Sampling | Geographical Coordinates | Population Code |
| | State | Place | | | |
1. | South India | Telangana | Adilabad | 24- Nov-2019 | 19.6808 N 78.5359 E | ADB |
2. | | | Warangal | 30-Nov-2019 | 17.7919 N 79.7974 E | WGL |
3. | | | Rangareddy | 2-Dec-2019 | 17.2326 N 77.5048 E | RNG |
4. | | | Mahaboobnagar | 21-Nov-2019 | 16.4547 N 78.8365 E | MHB |
5. | | | Karimnagar | 30-Nov-2019 | 18.2618N 79.7434 E | KRM |
6. | | | Nalgonda | 29-Dec- 2019 | 17.323 N 79.1656 E | NAL |
7. | | Andhra Pradesh | Kurnool | 28- Nov-2019 | 15.7313 N 77.4356 E | KUN |
8. | | | Kadapa | 22- Nov-2019 | 14.416 N 78.23333 E | KAD |
9. | | | Guntur | 20-Jan-2020 | 16.3067 N 80.4365 E | GNT |
10. | | Karnataka | Raichur | 29-Nov-2019 | 16.2120 N 77.3439 E | RAI |
11. | | | Bangalore | 7-Jan-2020 | 13.0270N 77.5843E | BAN |
12. | | Tamil Nadu | Chennai | 10-Jan-2020 | 11.0152N 76.9326E | CHE |
13. | Central India | Maharashtra | Akola | 26- Nov-2019 | 20.6962 N 77.0589 E | AKL |
14. | | | Buldhana | 7-Jan-2020 | 19.2644 N 76.6413 E | PRB |
15. | | Gujarat | Anand | 22-Nov − 2019 | 22.5608 N 72.9547 E | GJT |
16. | North India | Punjab | Faridkot | 30-Nov-2019 | 29.9038 N 73.8771 E | PUN |
Table 2
Pair wise distance of P. gossypiella populations.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
RAI | 0.00000 | | | | | | | | | | | | | | | |
NAL | 0.00370 | 0.00000 | | | | | | | | | | | | | | |
KRL | 0.00492 | 0.00205 | 0.00000 | | | | | | | | | | | | | |
BUL | 0.00164 | 0.00204 | 0.00328 | 0.00000 | | | | | | | | | | | | |
BNG | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | | | | | | | | | | | |
KDP | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | | | | | | | | | | |
MBNR | 0.01066 | 0.00780 | 0.00900 | 0.00898 | 0.00737 | 0.00737 | 0.00000 | | | | | | | | | |
KRM | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | 0.00737 | 0.00000 | | | | | | | | |
CHE | 0.00492 | 0.00205 | 0.00000 | 0.00328 | 0.00164 | 0.00164 | 0.00900 | 0.00164 | 0.00000 | | | | | | | |
ADB | 0.00164 | 0.00204 | 0.00328 | 0.00000 | 0.00164 | 0.00164 | 0.00898 | 0.00164 | 0.00328 | 0.00000 | | | | | | |
AKL | 0.00164 | 0.00204 | 0.00328 | 0.00000 | 0.00164 | 0.00164 | 0.00898 | 0.00164 | 0.00328 | 0.00000 | 0.00000 | | | | | |
GNL | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | 0.00737 | 0.00000 | 0.00164 | 0.00164 | 0.00164 | 0.00000 | | | | |
JUN | 0.00164 | 0.00204 | 0.00328 | 0.00000 | 0.00164 | 0.00164 | 0.00898 | 0.00164 | 0.00328 | 0.00000 | 0.00000 | 0.00164 | 0.00000 | | | |
FAR | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | 0.00737 | 0.00000 | 0.00164 | 0.00164 | 0.00164 | 0.00000 | 0.00164 | 0 | | |
R.R | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | 0.00737 | 0.00000 | 0.00164 | 0.00164 | 0.00164 | 0.00000 | 0.00164 | 0 | 0 | |
WGL | 0.00328 | 0.00041 | 0.00164 | 0.00164 | 0.00000 | 0.00000 | 0.00737 | 0.00000 | 0.00164 | 0.00164 | 0.00164 | 0.00000 | 0.00164 | 0 | 0 | 0 |
Table 3. Descriptive analysis for polymorphisms based on COI gene in Pectinophora gossypiella populations of India
Population
|
No .of Sequences
|
H
|
Haplotype diversity(Hd)
|
Nucleotide diversity(pi)
|
K
|
S
|
Telangana
|
18
|
6
|
0.725
|
0.002
|
1.510
|
5
|
Andhra Pradesh
|
9
|
2
|
0.500
|
0.0006
|
0.500
|
1
|
Karnataka
|
6
|
4
|
0.800
|
0.002
|
1.667
|
3
|
Maharashtra
|
6
|
1
|
0.000
|
0.000
|
0.000
|
0
|
Gujarat
|
4
|
1
|
0.000
|
0.000
|
0.000
|
0
|
Punjab
|
4
|
1
|
0.000
|
0.000
|
0.000
|
0
|
Tamil Nadu
|
4
|
1
|
0.000
|
0.000
|
0.000
|
0
|
South India
|
37
|
9
|
0.764
|
0.0019
|
1.477
|
8
|
Central India
|
10
|
2
|
0.181
|
0.0002
|
0.181
|
1
|
North India
|
4
|
1
|
0.000
|
0.000
|
0.000
|
0
|
Total
|
51
|
9
|
0.763
|
0.0017
|
1.340
|
8
|
H= Number of haplotypes, K= Average number of nucleotide differences,
S= Polymorphic sites
Nucleotide polymorphism
The nucleotide composition analysis of 51 sequences showed nucleotide frequencies as A = 31.56%, T = 41.16%, C = 14.42%, G = 12.83% and was found to be A + T rich (72.72%). A + T rich nature of nucleotide sequences of COI in P. gossypiella was similar to many reported species. The sequence region showed high AT rich nucleotide composition, which is typical characteristic of mitochondrial genes. The estimated transition to transversion ratio bias (R) was 1.50. Monomorphic sites (Invariable) were 740 and total of 8 polymorphic sites were observed, with number of mutations 8 which including parsimony informative sites (two variants): 8(site positions: 125, 261, 500, 659, 673, 678, 696, 726).
Diversity indices
Eight polymorphic sites were found among 51 individuals of P. gossypiella from 16 populations. Demographic statistics like number of haplotypes (n), haplotype diversity (Hd) and nucleotide diversity (n) are given in (Table 3). haplotype diversity and nucleotide diversity among the populations ranged from 0.000 to 0.800 and 0.000 to 0.002, respectively. The average haplotype diversity (Hd) and nucleotide diversity (n) of COI gene were 0.763 and 0.0017 respectively. Average number of nucleotide differences (k) found to be 1.340. Haplotype diversity and nucleotide diversity indices were high in Karnataka (0.800 ± 0.029; 0.002) followed by Telangana (0.725 ± 0.27; 0.0088) and lowest (0.000 ± 0.00; 0.000) in Maharashtra, Gujarat, Punjab and Tamil Nadu populations. When these populations were pooled zone wise, South Indian population showed high haplotype diversity (0.764) and nucleotide diversity (0.0019) followed by central India (0.181; 0.0002).
Haplotype, Network and Demographic analysis
Sequence analysis of 51 individuals of P.gossypiella from 16 locations yielded 9 haplotypes (Table 4). Among all haplotypes, Hap5 was observed to be dominant one shared by 20 sequences represented by samples from 7 locations of three southern states of India and one northern state. Hap4 sharing 13 sequences represented from 4 locations (3 from central India and 1 of south India). Whereas Hap3 is shared by 8 locations however, Hap1 and Hap2 shared by 3 locations each and Hap6 to Hap9 shared by one location each. Out of 9 haplotypes, six were found in Telangana region, two in Andhra Pradesh, Karnataka and one in each of other states.
Table 4
Mitochondrial haplotype distribution of Pectinopora gossypiella in different populations.
| Telangana | Andhra Pradesh | Tamil Nadu | Karnataka | Punjab | Gujarat | Maharashtra |
Haplotypes | ADB | NAL | MBNR | RR | WGL | KRM | KNL | KDP | GNL | CHE | BNG | RAI | FAR | JUN | AKL | BUL |
Hap1 (3) | | | | | | | | | | | | 3 | | | | |
Hap2 (3) | | 3 | | | | | | | | | | | | | | |
Hap3 (8) | | | | | | | 3 | | | 4 | 1 | | | | | |
Hap4 (13) | 3 | | | | | | | | | | | | | 4 | 3 | 3 |
Hap5 (20) | | | | 3 | 3 | 3 | | 3 | 3 | | 1 | | 4 | | | |
Hap6 (1) | | | 1 | | | | | | | | | | | | | |
Hap7 (1) | | | | | | | | | | | 1 | | | | | |
Hap8 (1) | | | 1 | | | | | | | | | | | | | |
Hap9 (1) | | | 1 | | | | | | | | | | | | | |
Overall | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 3 | 4 | 3 | 3 | 4 | 4 | 3 | 3 |
Median joining network among haplotype.
Median joining haplotype Network constructed based on the 51 sequences of P. gossypiella populations of India identified 9 haplotypes (Fig. 2). Network analysis indicated that Hap5 (39.21%) is present predominantly in southern states (Telangana, Andhra Pradesh, Karnataka) where 50% of analyzed populations of Telangana comes under Hap5. Hap4 (25.49%) present in 2 states is shared by 3 populations of central India i.e., Maharashtra (2) and Gujarat (1) and one population of Telangana (Adilabad) as it is adjoining district to Maharashtra state. MJ network suggests that most common, Hap5 could be proposed as ancestral haplotype for the analyzed populations which occupied the central region of haplotype network connected with all other 8 haplotypes.
Tajima’s D, test and Fu’s Fs test
Neutrality indices like Tajima’s D test and Fu’s Fs test were calculated and has shown non significant negative value for overall average of all populations (Table 5). A negative Tajima’s D signifies an excess of low frequency polymorphisms in the population consistent with the recent population expansion after bottleneck or selection sweep. A positive Tajima’s D signifies low levels of both low and high frequency polymorphisms. A negative value of Fu’s Fs is evidence for an excess number of alleles, as would be expected from a recent population expansion or from genetic hitchhiking. A positive value of Fu’s Fs is evidence for a deficiency of alleles, as would be expected from a recent population bottleneck.. This shows that despite the distance between localities sampled there is no genetic structure among the P. gossypiella populations in India. Present results supports hypothesis of severe bottlenecks or founder effects in some of the taxa leading to low mitochondrial variations.
Table 5
Tajima’s D and Fu’s F values of COI gene of Pectinophora gossypiella
Population | Tajimas’D | Significance | Fu’s Fs |
South India | -0.663 | NS, P > 0.10 | -2.958 |
Central India | -1.128 | NS, P > 0.10 | -0.410 |
North India | - | - | - |
TOTAL | -0.670 | NS, P > 0.10 | -2.748 |
Gene flow (Nm) between different populations of Pectinophora gossypiella
Nm values were calculated for 16 populations based on COI gene sequences. The results revealed that there is high, intermediate and low gene flow among the populations. High gene flow was recorded among populations of South India with North and Central India whereas low gene flow was recorded between North and Central India (Table 6). High rate of gene flow reduce genetic differentiation between the two groups with increasing homogeneity. This indicates that low genetic differentiation among the populations of P.gossypiella as depicted in the dendrogram.
Table 6
Gene flow (Nm) between different populations of Pectinophora gossypiella based on COI region.
Population 1 | Population2 | Nm* | Gene flow |
KN | TS | 0.981 | Intermediate |
KN | AP | 3.455 | High |
KN | MH | 0.667 | Intermediate |
KN | GJ | 0.828 | Intermediate |
KN | PUN | 1.143 | High |
KN | TN | 1.143 | High |
TS | AP | 12.053 | High |
TS | MH | 1.478 | High |
TS | GJ | 1.874 | High |
TS | PUN | 5.013 | High |
TS | TN | 1.427 | High |
AP | MH | 0.416 | Intermediate |
AP | GJ | 0.534 | Intermediate |
AP | PUN | 4.313 | High |
AP | TN | 0.950 | Intermediate |
MH | GJ | 0.000 | Low |
MH | PUN | 0.000 | Low |
MH | TN | 0.000 | Low |
GJ | PUN | 0.000 | Low |
GJ | TN | 0.000 | Low |
PUN | TN | 0.000 | Low |
South India | North India | 5.423 | High |
South India | Central India | 1.616 | High |
North India | Central India | 0.000 | Low |
*Nm > 1 (High gene flow); Nm = 0.25–0.99 (Intermediate gene flow); Nm = < 0.25 (Low gene flow). |