Molecular identification of vivax malaria relapse cases in Yunnan Province based on the homologyical analysis of Plasmodium vivax circumsporozoite protein gene

DOI: https://doi.org/10.21203/rs.3.rs-1861513/v1

Abstract

Background

More than 85% of the malaria burden is caused by imported vivax malaria in Yunnan Province and Yunnan is also where the majority of vivax malaria cases are diagnosed across China. Timely removal of the source of Plasmodium vivax and its breeding environment remains the key to eliminating the secondary transmission of imported malaria. To compensate for the uncertainty of epidemiological surveys in tracing vivax malaria recurrence, this study attempted to use molecular markers for identification.

Materials and methods

To do so, blood samples were collected from cases diagnosed and revalidated as single infections of Plasmodium vivax in Yunnan Province from 2013 to 2020. Specifically, samples from suspected relapses with recurrent episodes were subjected to PCR amplification, product sequencing, and analysis of the Plasmodium vivax circumsporozoite protein (pvcsp) gene.

Results

Seventy-eight suspected recurrent cases were retrieved from 2484 vivax malaria cases, with a total of 81 recurrent episodes. A total of 159 blood samples from primary infection Plasmodium vivax and recurrences were subjected to PCR amplification and sequencing to obtain 156 CDS sequences of pvcsp gene, 121 of which can be matched into the paired sequences of 59 cases. There were 475 polymorphic loci and 84 haplotypes (H01-H84) in the 121 sequences. Also, there were 79 and 5 haplotypes with CRR repeat units (PRM) of VK210 and VK247 structure, respectively. Of the 59 pairs of pvcsp gene sequences, every one of 31 pairs showed only one haplotype and no variant sites, meaning the every paired sequences were completely homologous and the paired Plasmodium vivax strains were homologous single clone. Every one of the remaining 28 paired sequences had two haplotypes but no length polymorphism, and except for 2 polymorphic loci (39 and 1027), all single nucleotide polymorphisms were double-equivalent bases differentially transferred between paired sequences, indicating that the paired sequences are "weakly heterologous" with no fragment insertions (or deletions) and only individual site polymorphisms. All 59 vivax malaria recurrences were respectively caused by the activation of Plasmodium vivax hypnozoites from the same population as the primary infection.

Conclusions

The paired analysis of the similarity of Plasmodium high variant genes allowed the identification of recurrent episodes caused by Plasmodium vivax homologous hypnozoites, and also demonstrated pvcsp gene as a good molecular marker. Moreover, the study showed most of the hypnozoites causing vivax malaria recurrence in Yunnan Province belonged to homologous single clone or sibling strains comparison with the original infection strains.

Background

In recent years, as efforts to control malaria have increased, the proportion of Plasmodium vivax infections in many traditionally highly endemic areas, such as Sri Lanka, Thailand and Brazil, has shown a counter-intuitive trend. The cause may be related to the fact that previous prevention and control measures ignored the characteristics of Plasmodium vivax and indirectly contributed to the accumulation of infection sources of Plasmodium vivax [12]. It has been observed that mature gametophytes (stage V gametophytes) of Plasmodium vivax appear almost simultaneously with its asexual bodies in the early stages and continue to produce gametophytes throughout the infection period [35]. The high efficiency of transmission therefore also means that the most effective measures to remove the infection sources of Plasmodium falciparum, such as early diagnosis and timely treatment, may not be able to contain the danger of an already infectious early episode of Plasmodium vivax. In addition, Plasmodium vivax develops more rapidly inside the Anopheles mosquitoes and can easily circumvent interventions such as insecticide netting and indoor residual spraying. Thus, the maintenance of Plasmodium vivax populations is easier than those applicable to Plasmodium falciparum under the same vector control pressure [6]. Moreover, due to the existence of a parasitic mechanism of hypnozoites, Plasmodium vivax can break through the local and seasonal limits of mosquito vector transmission [78]. The covert accumulation of infection sources plays a negative role in increasing the complexity associated with Plasmodium vivxa transmission.

Indigenous cases of malarial infection in China ceased in 2016. However, Yunnan remains the province with the largest number of imported vivax malaria cases, with instances primarily originating in Southeast Asian countries. To wit, in 2018, such cases accounted for 80.8% (172/213) malaria cases caused by Plasmodium vivax infection throughout the province [9], of which there was no shortage of groups with multiple episodes and suspected recurrences of Plasmodium vivax infection [10]. During the period spanning 2013-19, suspected recurrent events constituted approximately 3.5% (83 in total) of the 2364 vivax malaria cases diagnosed in Yunnan Province. This was based on a minimum interval of 45 days between the first recurrence and the original episode [11] (this interval has an average duration of 41 days across those Southeast Asian countries where Plasmodium vivax is widely prevalent [7]), and 7% of vivax malaria recurrence within 6 months in Thailand [11], suggesting the burden of vivax malaria recurrence in Yunnan Province is no less than that of neighboring countries. Of course, other regions of the world are also facing the challenge posed by the Plasmodium vivax recurrence, with the recurrence rate in Papua New Guinea being as high as 80% [12]. 23% of pregnant women in Brazil had a recurrence of due to failure to receive primaquine eradication [13], making its management unavoidable source of difficulty along the process towards the global elimination of malaria [1415]. One of the three major technical bottlenecks in the control of Plasmodium vivax is the accurate identification of the resurgent infection caused by the activation of Plasmodium vivax hypnozoites [14].

The WHO recommends analyzing the homology of single-copy antigenic genes of Plasmodium vivax as a method for the molecular identification of 'new infection' and 'recrudescence' events [16]. Lin et al. [17] identified 37.9% (11/29) of re-emergence vivax malaria episodes caused by the activation of Plasmodium vivax homologous hypnozoites on the basis of paired analysis of genetic similarity between initial infection and the re-emergent strains. Craig et al [18], Imwong et al [19] and Chen et al [20] also assessed the differentiation characteristics of pvama1, pvcsp, pvmsp1, and other genes which revealed the first vivax malaria recurrence was mostly caused by the activation of Plasmodium vivax homologous hypnozoites. The feasibility of molecular identification of recurrent events in Plasmodium vivax has been demonstrated from different perspectives. To establish a practical and reasonable method for evaluating the recurrence resulted from Plasmodium vivax, this study used selective whole genome amplification (SWGA) [21] and deep sequencing of hypervariable genes [15] to investigate the vivax malaria recurrence events in Yunnan Province. The results of the molecular characterization of the pvcsp gene and its marker role in identification of the recurrent strains of Plasmodium vivax are reported below.

Materials And Methods

Study subjects and blood samples

The study was conducted on cases drawn from throughout Yunnan Province from January 2013 to December 2020 and confirmed at the Yunnan Province Malaria Diagnosis Referent Laboratory (YPMDRL) using both examination by light microscopy of blood smears and genetic testing (Additional file 1). All cases were officially registered and counted in the "China Disease Surveillance Information Reporting System", from which suspected recurrent instances of Plasmodium vivax with a history of re-emergence were further screened. The following inclusion criteria were applied for the vivax malaria relapse: (1) patients who were clinically cured after an eight-day course with chloroquine/primaquine, had a second episode of appearing Plasmodium vivax in his peripheral blood 28 days later; (2) no further history of exposure to malaria endemic areas was reported after the original episode (Additional file 2). The determining of infection source of Plasmodium vivax is confirmed by epidemiological surveys. The criteria is as follows: those with no history of movement outside Yunnan Province within 30 d prior to the onset of Plasmodium vivax were classified as indigenous Yunnan cases, while those with a history of exposure to Plasmodium vivax in endemic areas such as Myanmar and Africa were classified as imported cases of Myanmar and African infections (Additional file 2).

Venous blood was collected from all vivax malaria cases during the primoinfection and before treatment for recurrent episodes (0 d). The samples were prepared as filter paper dry blood drops for Plasmodium genetic analysis.

Extraction of Plasmodium genomic DNA and PCR amplification of pvcsp gene

Three filter paper dry blood drops with a diameter of 5 mm were used to extract Plasmodium genomic DNA. This was carried out in accordance with the instructions of the QIAgen Mini Kit (QIAamp, Germany), and subsequently stored at -20°C.

The reference sequence (ID: NC_009913.1) was used as the template to design the primers and reaction conditions for the nested PCR amplification of the pvcsp gene. The first round of PCR amplified the region of 1537513–1539033 containing a total of approximately 1521 base pairs (bp), with the upstream and downstream primers as 5'-CCGTTCGAACAAGTTCTGTTC-3' and 5'-GCGCATAATGTGTAAGAGGTGT-

3', respectively. The second round amplified a region of 1341 or so bp from 1537625–1538965, for which the upstream and downstream primers were 5'-GCTTAAG TTAAGCAAGCAAAACAGC-3' and 5'-GCAGGGAACATTCATAAG

AAAGGG-3', respectively. The system for both PCR reactions was the following: 2.6 µl of DNA template; 14.0 µl of PCR mixed with 2× Taq; and 0.7 µl of each of the upstream and downstream primers (10umol/L), then to make up the volume to 25.0 µl with ddH2O. It was conducted under the following sets of PCR reaction conditions: 94°C for 3 min; 94°C for 30 sec, 49°C for 90 sec, 72°C for 2 min, 35 cycles; or 72°C for 7 min. The second round of amplification was carried out using 1.5% agarose gel electrophoresis, and the amplified products were sent to Shanghai Meiji Biomedical Technology Co. Ltd. for Sanger sequencing.

Gene polymorphism and homology analysis of pvcsp genes in paired blood samples

The sequencing results were collated in DNAStar 11.0 and BioEdit 7.2.5. The obtained DNA sequences were compared with the reference sequences of the pvcsp gene. When the ‘query cover’ and ‘identifies’ were greater than 98%, it indicated that the sequences collated were the intended targets. DnaSP 6.11.01 software was used to confirm the locus mutations and haplotypes of the pvcsp gene, from which the expected heterozygosity (He) and nucleic acid diversity index (π) of the DNA sequences were calculated. The pvcsp gene sequences of paired primary and relapse isolates from the same individual patients with several episodes of Plasmodium vivax were identified [18, 20] separately in MEGA 5.04 software for similarity matching and central repetitive region (CRR) [2325]. And these regions including 5 'end, coding sequence of KLKQP five amino acids, 3’ end of pvcsp gene were named asα-N、Region I(RI)and Thrombospondin repeat༈TSR༉, respectively [24];When the main peptide repeat motifs (PRMs) of the CRR were "GDRA[D/A]GQPA" and "ANGAGNQPG", they were identified as VK210 and VK247 type sequences, respectively [26, 23, 25], while "APGANQ[E/G]GAA" was confirmed as the PRM of the P. simiovale type sequences [27].

Confirmation of the paired Plasmodium vivax strains and the nature of recurrent episode

When the DNA sequence of the pvcsp gene and the CRR repeat motifs of each group paired samples showed the following characteristics, the Plasmodium vivax strains of both original infection and the recurrent episodes were determined to be from the same time mosquito biting inoculation. Subsequent vivax malaria episodes of the same case were caused by the activation of Plasmodium vivax hypnozoites in this population and were termed as relapse episodes [28]: (1) only one haplotype was shown, and both the expected heterozygosity (He) and quantity of variable sites (V) were equal to 0. This indicated the Plasmodium vivax strains from a paired samples were completely homologous single-clone strains [16, 1920, 22, 29]; (2) the number of haplotypes was greater than one, and both He > 0 and V > 0. V was confirmed by checking peak plots (Additional file 3), from which it was seen the paired DNA sequences were of the same length and with no fragment deletions (or insertions), suggesting the paired Plasmodium vivax strains were sibling strains which pvcsp gene were weakly heterologous but did not undergo intra-helical recombination events [30, 25].

Results

Sample information and PCR amplification product sequencing

Seventy-eight reports (3.1%) with recurrent episodes were retrieved from 2484 vivax malaria cases, drawn from the period 2013-19 for patients, all of whom were living in Yunnan Province (97°31′ E to 106°11′ E; 21°8′ N to 29°15′ N). The majority of cases could be traced to origins in Myanmar (98.7%, 77/78), and the male-to-female ratio was 1:5 for all within study’s sample. The majority of cases had one relapse (97.4%, 76/78), while one case had two episodes, and one case had three episodes. General demographic information and original place of infection for the 78 cases are shown in Table 1 (Additional file 2).

Table 1

Demographic and clinical characteristics of the study cohort

Variable

Total

2013

2014

2015

2016

2017

2018

2019

2020

Total

78

0

14

20

17

9

9

7

2

1. Gender

                 

Male

65

0

12

17

13

7

9

5

2

Female

13

0

2

3

4

2

0

2

0

2. Age (in years)

               

0–20

8

0

1

0

4

3

0

0

0

21–60

67

0

12

20

13

5

9

6

2

above 60

3

0

1

0

0

1

0

1

0

3. Malaria recurrence

               

1 episode

76

0

13

20

17

9

8

7

2

2 episodes

1

0

0

0

0

0

1

0

0

3 episodes

1

0

1

0

0

0

0

0

0

4. Infection source

               

Myanmar

77

0

13

20

17

9

9

7

2

Africa

1

0

1

0

0

0

0

0

0

Yunnan indigenous

0

0

0

0

0

0

0

0

0

5. Interval time of recurrence(days)

           

Longest

1561

0

939

882

1561

426

319

367

268

Shortest

28

0

54

46

62

80

43

28

264

Average

299

0

271

291

292

277

275

285

264

NOTE. Data are no. (%) of paired samples, unless otherwise indicated.

A total of 81 relapses occurred across these 78 cases infected with Plasmodium vivax, allowing a total of 159 blood samples which were obtained from all reported original infection and recurrent episodes. From these, 156 PCR amplification products (> 1,000 bp in length) of the pvcsp gene were successfully obtained, with a product acquisition rate of 98.1% (156/159). Of them, paired CDS full strands (807–1179 bp in length) of the pvcsp gene were obtained from blood samples in 75 cases (96.2%, 75/78), but only those from 59 cases could be used for homological analysis of the gene sequences (see Fig. 1).

The structural regions of the amino acid chains derived from the 78 CDS strand conversions were completed, including the conserved region near the 5'-end (1st ~ 90th aa) (α-N region), the R I (KLKQP) region, the CRR (96th ~ 275th aa), and the conserved region near the 3'-end (276th ~ 393th aa) (TSR), etc. (see Fig. 2)

Diversity of pvcsp gene and CRR array

The 121 CDS strands of pvcsp gene obtained from paired blood samples of 59 cases showed 475 variable (polymorphic) loci, comprised of 20 singleton variable sites and 455 parsimony informative sites (Additional file 3), with a nucleotide diversity index (π) equal to 0.1384 (± 0.0056). Among them, 32 alleles were double alleles at positions 112, 113, 233, 234, 240, 261, 264, 270, 274, 282, 295, 309, 327, 347, 354, 426, 491, 507, 511, 534, 545, 552, 572, 579, 615, 684, 742, 761, 769, 805, 892, and 999 (bimodal chart). The sequences from the original infection and relapse strains both call only one of the biallelic bases respectively, usually the type with a strong sequencing signal (Additional file 3). The 32 double alleles were distributed in all seats of the pvcsp gene, but were predominantly concentrated in the CRR region (62.5%, 20/32) (Table 2). Furthermore, 56.3% (18/32) of the polymorphic sites

Table 2

The polymorphism of single nucleotide loci in pvcsp gene CDS chains

Regions

Loci

Alleles of call

(Major/Minora)

Codings

Amino acid variation

No. of CDS

(n = 127)

Frequency

α-N b

c.112

A/G

AAC/GGC

N38G

2

0.0157

c.113

A/G

2

0.0157

c.233

A/G

GAG/GGG

E78G

2

0.0157

c.234

G/T

GAG/GAT

E78D

20

0.1575

c.240

A/G

AAA/AAG

K80K

18

0.1417

c.261

A/C

CCA/CCC

P87P

14

0.1102

c.264

T/G

CGT/CGG

R88R

4

0.0315

c.270

A/T

AAA/AAT

K90N

6

0.0472

RIc (91th -95th aa)

c.274

T/C

TTG/CTG

L92L

16

0.1260

c.282

A/G

CAA/CAG

Q94Q

2

0.0157

 

c.295

C/A

CGA/AGA

R99R

2

0.0157

 

c.309

G/A

CAG/CAA

Q103Q

2

0.0157

CRR (96th -290th aa)

c.327

A/C

GGA/GGC

G109G

2

0.0157

c.354

A/C

GGA/GGC

G118G

4

0.0315

c.426

C/A

GGC/GGA

G142G

2

0.0157

c.491

G/C

GGT/GCT

G164A

2

0.0157

c.507

A/C

GGA/GGC

G169G

2

0.0157

c.511

G/A

GGA/AGA

G171R

2

0.0157

c.518

C/A

GCT/GAT

A173D

2

0.0157

c.534

C/A

GGC/GGA

G178G

2

0.0157

c.545

A/C

GAT/GCT

D182A

2

0.0157

c.552

T/A

CAT/CAA

Q184R

2

0.0157

c.572

G/C

AGG/AGC

R286S

2

0.0157

c.579

G/A

CAG/CAA

Q193Q

2

0.0157

c.615

A/C

GGA/GGC

G205G

2

0.0157

c.684

A/C

GGA/GGC

G228G

6

0.0472

c.742

C/G

CCA/GCA

P248A

2

0.0157

c.761

C/G

AGC/AGG

G254R

2

0.0157

c.769

C/G

CCA/GCA

P257A

2

0.0157

c.805

A/C

ACC/CCC

P269T

2

0.0157

TSRd (291th -393th aa)

c.892

C/T

CTT/TTT

L298F

2

0.0157

c.999

A/G

AAA/AAG

K333K

2

0.0157

a: At the double allelic base in the DNA sequencing peak map, the base with higher wave peak is Major allele and the another base with lower wave peak is Minor allele; b: Named the near N-terminal of pvcsp amino acid chain; c: The coding region of KLKQP five amino acids; d: The C-terminal of pvcsp amino acid chain.

belonged to the third base of the amino acid codon, and only 27.8% (5/18) of these resulted in amino acid variants. The percentages of the second base and first base were 17.6% (6/34) and 26.5% (9/34) (Table 2), while the highest frequency of the double allele was 0.1575 for 234, the minor allele frequency (MAF) was 0.1417 for 240, and 75.0% (24/32) of the double alleles were present in only one set of paired sequences (Table 2).

Further to the aforementioned, 121 CDS strands were defined as 84 haplotypes (H01 to H84) with a He of 0.9940 (± 0.0040). Of these, only haplotypes H08 and H13 had similar other paired sequences. Haplotypes H05, H50, H51, H63 and H64 had CRR repeat units (PRMs) of the VK247 genotype (Fig. 3B), while the remaining 79 had PRMs characteristic of the VK210 genotype (Fig. 3A).

Among the haplotypes of the VK210 type, there were 39 CRR forms consisting of peptide repeat motifs (PRMs) (Fig. 3A). Of these, there were 15 PRM unit types, with GDRAAGQPA and GDRADGQPA occurring most often, with frequencies of 0.470 (987/2100) and 0.3833 (805/2100), respectively. The remaining 13 PRMs, included the five newly detected PRMs GNRANGQPA (0.0033, 7/2100), GNRANGQAA (0.0001, 1/2100), GDRADGQTA (0.0001, 1/2100), GDRADGHPA (0.0001, 1/2100), and GNGAAGQPA (0.0001, 1/2100) (Fig. 3A). Generally, the CRRs of VK210 type consisted of 14–20 PRMs with 18 being the most common, and 96.8% (38/39) ended with GNGAGGQAA units (Fig. 3A).

Of the five haplotypes of type VK247, three types of CRR consisted of 17–21 PRMs (Fig. 3B) in which there were eight unit types of PRMs. Those with the highest frequency of occurrence were ANGAGNQPG (0.7414, 86/116), ANGAGGQAA (0.0517, 6/116), and ANGDDQPGA (0.0172, 2/116) while the remaining two were newly detected PRMs (Fig. 3B).

Comparison of paired blood samples of the pvcsp gene and confirmation of relapse episode

Results from the comparison of the pvcsp gene CDS chains of the 59 paired blood samples showed the paired CDS chains of 31 groups (52.5%, 31/59) had only one haplotype and no variant sites, and the He and V values were both 0. This indicated each of the 31 pairs was homologous and the source of the paired Plasmodium vivax was a single clone with complete genetic homology, belonging to the same mosquito-bite inoculated population (Table 3). Subsequent episodes of Plasmodium vivax were caused by the activation of hypnozoites from the same population as primary infection strains. The paired blood samples of CDS chains of the other 28 groups (47.5%) had varying numbers of polymorphic sites (1⁓6 loci) between the paired sequences. However, there were two exceptions, at 39 (0.0082, 1/121) and 1027 (0.0082, 1/121), which were true base substitutions (Table 3, Additional file 3), while the remaining sites were all double allelic bases (Table 2). These 28 sequences showed no evidence of DNA fragment insertion (or deletion), suggesting they were weakly heterologous for each other, but did not experience intra-helical recombination events. Also, their source pairs were "weakly heterologous" parallel sibling strains

Table 3

Identification of the homologous between Plasmodium vivax strains from every group paired samples based on alignment the pvcsp gene sequence

No. of paired samples

Length of aa chains

Nucleotide diversity (Pi)

Ins or Del in CRR

(bp)

Combined variable (polymorphic) sites

Paired strains

Single infection

No.

α-N

(1th -90th aa)

RI

(91th -95th aa)

CRR

(96th -290th aa)

TSP

(291th -393th aa)

Ⅰ. Paired (groups = 59)

31

364–393

0

0

0

0

0

0

0

com-Homo

Yes

8

340–376

0.0009

0

1a

234, 240, 261, 264

274

295

0

non-re-Sibl

Yes

9

340–376

0.0018

0

2b

234, 240, 261, 270

274

327, 354, 507, 552, 572, 615, 742

999, 1027*

non-re-Sibl

Yes

4

333–386

0.0030

0

3c

234, 240, 261, 264

274, 282

491, 511, 545

0

non-re-Sibl

Yes

3

372–376

0.0036

0

4d

234, 240, 261

274

426, 518, 534, 742, 761, 769

0

non-re-Sibl

Yes

3

336–375

0.0049

0

5e

112, 113, 233, 234, 240, 261, 270

274

309, 579, 684, 805

892

non-re-Sibl

Yes

1

358–368

0.0054

0

6f

39*, 234, 240, 261, 270

274

0

0

non-re-Sibl

Yes

Ⅱ. non-Paired

 

69–109

372 and 368

0.0263

+ 12g

29f

240, 247*, 259*, 264

0

291*,294*,318*,321*,345*,

348*,363*,444*,453*,552, 561*,572,579,588*,599*, 606*,615,642*,783*,798*, 800*,803*,805

987*, 1015*

dif-Popu

No

156 − 110

393 and 368

0.0245

+ 54g

27f

261

0

291**,294*,318*,321*,345, 348*,363*,364*,372*,373*, 376*,390*,444*,453*,552, 561*,572,579,588*,599*,669*,687*,696*,707*,714*,796*

0

dif-Popu

No

49–125

275 and 249

0.0204

+(51 + 27)h

22f

240, 261, 264

274

404*,435*,567*,570*,591*, 594*,597*,598*,621*,645*, 648*,650*,651*,652*,702*, 705*,729*,780*

0

dif-Popu

No

a-e: Existing one locus polymorphic and any two, three, four, five loci polymorphic; f: Existing all loci polymorphic at the same time; g: “+” Including one inserted peptide composed of many amino acids; h: Including two inserted peptides composed of many amino acids; *༚No showing the double peaks at the base site in sequencing peak map.

which were not genetically homologous and belonged to the latter generations bred from the common ancestor inoculated by a time mosquito-biting population (Table 3). Subsequent episodes of Plasmodium vivax are still caused by the activation of hypnozoites in the same population as primary infection strains.

However, similarity matching of two randomly selected sequences from the 121 CDS strands of the unpaired samples showed there were significantly more base substitutions between the two sequences. Furthermore, the whole strand showed length polymorphism due to the presence of oligonucleotide strand insertions (or deletions) of different lengths (Table 3). This indicated the two sequences of the unpaired samples were more heterologous and their corresponding genomic donors, Plasmodium vivax, were more likely to belong to different populations (Table 3).

Discussion

The single-copy pvcsp gene commences at the tip of Plasmodium vivax chromosome 8 [30, 25, 31], extends for 807–1179 bp, and does not possess introns, but has a coding DNA sequence (CDS) which encodes polypeptides between 269 and 393 aa long. The structural diversity of the pvcsp gene is concentrated in the mid-segment CRR repeat region encoding the 90th to 275th aa and its flanks (Fig. 1) [2325, 32]. It is generally characterized by insertions and deletions of repetitive units, mostly due to sexual recombination during meiosis or intra-helical strand-slippage events during mitotic DNA replication [3335], as well as frequent base substitutions within the locus. Using the repeat units of the CRR as markers, Plasmodium vivax can be defined as three genotypes: VK210; VK247; and the P. semiovale which is Plasmodium. vivax-like [27, 3637]. Therefore, pvcsp gene is commonly used as a molecular marker for the evolutionary description of populations of Plasmodium vivax [30, 3840].

This study provides preliminary evidence on the pathogenic nature of recurrent episodes of Plasmodium vivax which occurred in Yunnan Province based on the analysis of pvcsp gene differentiation in the sample group. Although the Plasmodium vivax population included in this study was not truly a natural population, 97.4% of the strains in the sample were nonetheless harvested from cases infected in Myanmar. Consequently, the pvcsp genes of the entire sample group showed similar findings to those reported by Thanapongpichat et al. [41] and Võ et al. [32], which related to the Myanmarese population. These included, the CRR region, the same significant polymorphism in length and composition of PRMs, the same predominance of VK210 types (95.0%, 115/121), and the absence of Plasmodium vivax-like variants. Furthermore, there was the same extremely low DNA sequence homology, with up to 84 haplotypes in 121 pvcsp gene sequences, though the expected heterozygosity (He, 0.9940 ± 0.0040) was greater than that observed by Võ et al. (He, 0.096 ± 0.034) [32]. However, at the same time, the RI of the pvcsp gene in this set of samples had only one arrangement of KLKOP, far fewer than the seven described by Võ et al. [32]. In addition to this, there was a reduction in variety of PRMs units constituting the CRR region (VK210: 15, VK247: 8), and less complex arrangements and length polymorphisms than those reported in the aforementioned research. These may be justified by this study having involved a relatively homogeneous population of primarily Myanmarese strains of Plasmodium vivax, unlike the wider range and a more complex population composition of samples taken by Võ et al. However, the displayed high variability of the pvcsp gene was largely consistent with the results obtained in current study. Furthermore, it is agreed the CRR is suitable as a molecular marker for separating genetic differences in Plasmodium vivax populations due to its poor conservation, whereas the α-N and TSR regions on both flanks are suitable as candidate antigen genes for polyclonal antibody serum preparation because of their highly conserved sequences [2425].

In addition to confirming the high degree of polymorphism in the pvcsp gene structure at the population level, the similarity comparison performed with paired blood samples (n = 59) from the vivax malaria relapse cases showed it was not an accident that there were the Plasmodium vivax strains with highly convergent gene sequences. In 52.5% (31/59) of the cases, the paired pvcsp gene sequences of Plasmodium vivax strains from the original infection and relapse were not only highly consistent in length but also completely similar in sequence structure, with both He and Variable sites equal to 0, indicating complete homology of the paired sequences. Even among the 28 sets of paired sequences with double allelic expression (Table 2, Table 3), the pvcsp gene sequences were not only highly concordant in length. There were notable oligonucleotide fragment insertions (or deletions) in any two of the pvcsp gene sequences from non-paired blood samples (Table 3). There was also a significant increase in the number of base substitutions, suggesting length polymorphism in the pvcsp gene may be an important point of differentiation between populations of Plasmodium vivax, before the degree of single nucleotide polymorphism,

Besides the possible causes of reinfection and relapse [4243], validation of the activation of the intrahepatic Plasmodium vivax hypnozoites is imperative due to the frequent recurrent episodes after treatment for vivax malaria [28, 4445]. In the 78 vivax malaria cases with recurrent episodes included in this study, history of exposure in malaria endemic areas outside the country was before the original infection episode only. Therefore, the possibility of recurrent caused by reinfection is basically excluded. The molecular identification of relapse in only 59 (75.6%, 59/78) cases were conducted by alignment of paired pvcsp gene sequences. In 31 of these cases, including those with two or three times relapses, the paired pvcsp gene sequences of Plasmodium vivax strains from the original infection and the relapse episode were identical as the same length and had no variant sites within the sequences. This observation suggested the single clone of a genetically identical strain existed in every paired blood samples, and the vivax malaria relapse could be attributed to the activation of a intrahepatic hypnozoites which gene structure was identical to the original infection strain [1718, 20, 28, 46]. In the other 28 cases, there was still a large degree of reproducibility in gene structure between the paired pvcsp gene sequences, which were identical in length and the single nucleotide polymorphic sites mostly were also double alleles (Tables 2 and 3). Such an appeared any locus polymorphism may have been due to differential calling of double allelic bases in paired sequences by generation sequencing [45], suggesting the sibling strains [25, 30, 47] with identical genetic structure existed in every group of paired blood samples, and the two batches of Plasmodium vivax populations from primary infection and relapse actually reflects parallel homologs of polyclonal strains. Therefore, the authors suggested that the relapse of these 28 cases with Plasmodium vivax still belonged to the activation event of intrahepatic hypnozoites coming from common ancestor strains inoculated by the same time mosquito-biting [17, 4749]. It is worth examining whether the polymorphism form resulting from the differential mobilization of the double alleles is similar to the previously considered "weakly heterologous" [5053].

In summary, this paper analyze the possibility of relapse episodes caused after one time Plasmodium vivax infection using the principle of identifying the strong similarity of highly variable genes in closely related Plasmodium vivax. Thus, methodological implications are provided for the identification of relapse episodes in malaria-free transmission areas, although there are shortcomings. First, the study may still be incomplete in excluding recrudescence events caused by chloroquine-resistant strains or continued proliferation of myeloplasmosis [5455] due to the lack of feasible experimental verification means, although the chloroquine resistance associated molecular marker have been detected in some Plasmodium vivax strains from individual paired samples (Additional 4). Secondly, the relapse event was confirmed by the similarity of Plasmodium vivax genes between the paired samples from the original infection and the reactivation. It included two results of 100% homology and "weakly heterologous" with only a few base substitutions between paired gene sequences and it remains to be confirmed whether or not the latter was part of the activation of heterologous hypnozoites populations that scholars previously studied. Thirdly, no definitive confirmation could be provided concerning the nature of the paired Plasmodium vivax strains in 24.4% (19/78). Although the authors are continuing to mitigate the limitations of single genetic markers to identify homologous strains, as well as for other aetiological investigations of these relapse episodes, the length of this article does not allow for further elaboration, which may somewhat reduce its apparent completeness of this paper.

Conclusion

In conclusion, this study identified the vivax malaria relapse episodes caused by homologous hypnozoites by deep sequencing and analyzing the similarities between pvcsp genes of paired Plasmodium vivax strains from primary infection and relapse. The pvcsp gene was found to be simple in structure, while the traces of recombined gene are easy to observe, and it is suitable as a molecular marker to identify the Plasmodium vivax homologous strains. This study was also demonstrated that, because most of the blood-stage Plasmodium vivax resulting in relapse among the cases reported in Yunnan Province were single clone or sibling strains with homologous to the original infection, the using gene similarity rules is suitable for identifying homologous hypnozoites activation.

Abbreviations

pvcsp

Plasmodium vivax circumsporozoite protein

SWGA

Selective whole genome amplification

YPMDRL

Yunnan Province Malaria Diagnosis Referent Laboratory

CRR

Central repetitive region

PRMs

Peptide repeat motifs

RI

Region I

TSR

Thrombospondin repeat

MAF

Minor allele frequency

CDS

Coding DNA sequence

Com-Homo

Completely Homologous

non-re-Sibl

non-recombinated sibling

dif-Popu

Different population

pvcrt-o

Plasmodium vivax chloroquine resistance proteinortholog

Declarations

Acknowledgements

Our sincere gratitude got to the provided initial diagnosis information on malaria cases and their blood samples for the Centers for Disease Control and Prevention in prefectures and counties: Dehong, Baoshan, Kunming, Pu’er, Lincang, Dali, Nujiang, Lijiang, Xishuangbanna, Yuxi, Chuxiong, Honghe, Zhaotong, Diqing, Qujing, and Whenshan.

Authors’ contributions

YX was responsible for the study design, carried out genetic testing and wrote the contents of microscopy examination in the manuscript; YD wrote the manuscript, and was responsible for the study design, coordinated the project, and statistical analysis; Yan Deng, HH, MC, YL, JW, CZ and ZW performed the collection of blood samples and microscopy examination. All authors reviewed the manuscript.

Funding

The current study was funded by National Science Foundation, China (Nos. 81660559, 82160637).

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Ethics approval and consent to participate

The design and protocol of the study was approved by the Yunnan Institute of Parasitic Diseases and Ethical Committee (Ethical Approval No.: 2019 Yunnan Ethics Auditing No. 5). The purpose of the study was explained to the subjects or their relatives, genetic testing was performed on stored blood samples obtained as part of the routine diagnostic work-up of patients with fever who were suspected to have malaria. The patients were fully informed on the aims of the study and signed an informed consent agreement after confirmation Plasmodium infection.

Consent for publication

All authors provided their consent for the publication of this report.

Competing interests

The authors declare no financial and/or non-financial conflicted interest regarding the publication of the present manuscript.

Author details

1 Yunnan Institute of Parasitic Diseases Control, Yunnan Provincial Key Laboratory of Vector-Borne Diseases Control and Research, Yunnan Centre of Malaria Research, Pu’er 665000, China. 2 Department of Basic Medical Sciences, Clinical College of Anhui Medical University, Hefei, 230031, China. 3 Center for Disease Control and Prevention, Baoshan, 678000, China.

References

  1. Baird JK. Eliminating malaria--all of them. Lancet. 2010; 376:1883-1885
  2. Battle KE, Guerra CA, Golding N, Duda KA, Cameron E, Howes RE, et al. Global database of matched Plasmodium falciparum and P. vivax incidence and prevalence records from 1985 to 2013. Sci Data. 2015;2:150012.
  3. Bockarie MJ, Dagoro H. Are insecticide-treated bednets more protective against Plasmodium falciparum than Plasmodium vivax-infected mosquitoes? Malar J. 2006;5:15.
  4. Dong Y, Deng Y, Xu YC, Lui Y, Chen MN, Wu J, et al. [Homology analysis of Plasmodium vivax strains isolated from the sporadic vivax malaria cases found near the Sino-Burmese international border during the short term in Yunnan Province, China]. Chin J Parasitol Parasit Dis. 2021(publishing) (in Chinese).
  5. Ghanchi NK, Khan MH, Arain MA, Zubairi MBA, Raheem A, Khan MA, et al. Hematological profile and gametocyte carriage in malaria patients from Southern Pakistan. Cureus. 2019;11: e4256.
  6. Golding N, Wilson AL, Moyes CL, Cano J, Pigott DM, Velayudhan R, et al. Integrating vector control across diseases. BMC Med. 2015;13:249.
  7. Battle KE, Karhunen MS, Bhatt S, Gething PW, Howes RE, Golding N, et al, Geographical variation in Plasmodium vivax relapse. Malar J. 2014;13:144.
  8. White NJ, Imwong M, Relapse. Adv Parasitol, 2012;80:113-150.
  9. Li Zhang, Jun Feng, Shaosen Zhang, Zhigui Xia, Shuisen Zhou. Epidemiological characteristics of malaria and the progress towards its elimination in China in 2018. Chin J Parasitol Parasit Dis. 2019;37:241-247 (in Chinese).
  10. Huang H, Dong Y, Deng Y, Xu Y, Deng Y, Zhang C, et al. The association of CYP2D6 gene polymorphisms in the full-length coding region with higher recurrence rate of vivax malaria in Yunnan Province, China. Malar J. 2021;20:154.
  11. Llanos-Cuentas A, Lacerda MV, Rueangweerayut R, Krudsood S, Gupta SK, Kochar SK, et al. Tafenoquine plus chloroquine for the treatment and relapse prevention of Plasmodium vivax malaria (DETECTIVE): a multicenter, double-blind, randomised, phase 2b dose-selection study. Lancet. 2014;383:1049-1058.
  12. Betuela I, Rosanas-Urgell A, Kiniboro B, Stanisic DI, Samol L, de Lazzari E, et al. Relapses contribute significantly to the risk of Plasmodium vivax infection and disease in Papua New Guinean children 1-5 years of age. J Infect Dis. 2012;206: 1771-1780.
  13. Corder RM, de Lima ACP, Khoury DS, Docken SS, Davenport MP, Ferreira MU. Quantifying and preventing Plasmodium vivax recurrences in primaquine-untreated pregnant women: An observational and modeling study in Brazil. PLoS Negl Trop Dis. 2020;14: e0008526.
  14. WHO. Confronting Plasmodium vivax malaria. World Health Organization, Geneva, Switzerland. 2015.
  15. White MT, Shirreff G, Karl S, Ghani AC, Mueller I. Variation in relapse frequency and the transmission potential of Plasmodium vivax malaria. Proc Biol Sci. 2016;283:20160048.
  16. WHO. Methods and Techniques for Clinical Trials on Antimalarial Drug Efficacy: Genotyping to Identify Parasite Populations; World Health Organization: Geneva, Switzerland, 2008,13-16.
  17. Lin JT, Hathaway NJ, Saunders DL, Lon C, Balasubramanian S, Kharabora O, et al. Using amplicon deep sequencing to detect genetic signatures of Plasmodium vivax relapse. J Infect Dis. 2015;212: 999-1008.
  18. Craig AA, Kain KC. Molecular analysis of strains of Plasmodium vivax from paired primary and relapse infections. J Infect Dis. 1996;174:373-379.
  19. Imwong M, Boel ME, Pagornrat W, Pimanpanarak M, McGready R, et al. The first Plasmodium vivax relapses of life are usually genetically homologous. J Infect Dis. 2012;205:680-683.
  20. Chen N, Auliff A, Rieckmann K, Gatton M, Cheng Q. Relapses of Pladmodium vivax infection result from clonal hypnozoites activated at predetermined intervals. J Infect Dis. 2007;195: 934-941.
  21. Cowell AN, Valdivia HO, Bishop DK, Winzeler EA. Exploration of Plasmodium vivax transmission dynamics and recurrent infections in the Peruvian Amazon using whole genome sequencing. Genome Med. 2018;10:52.
  22. Dong Y, Deng Y, Xu Y, Chen M, Wei C, Zhang C, et al. Analysis of initial laboratory diagnosis of malaria and its accuracy compared with re-testing from 2013 to 2018 in Yunnan Province, China. Malar J. 2020;19(1):409.
  23. Imwong M, Pukrittayakamee S, Grüner AC, Rénia L, Letourneur F, Looareesuwan S, et al. Practical PCR genotyping protocols for Plasmodium vivax using pvcs and pvmsp1. Malar J. 2005;4:20.
  24. Coppi A, Natarajan R, Pradel G, Bennett BL, James ER, Roggero MA, et al. The malaria circumsporozoite protein has two functional domains, each with distinct roles as sporozoites journey from mosquito to mammalian host. J Exp Med. 2011;208:341-356.
  25. Dias S, Wickramarachchi T, Sahabandu I, Escalante AA, Udagama PV. Population genetic structure of the Plasmodium vivax circumsporozoite protein (Pvcsp) in Sri Lanka. Gene. 2013;518:381-387.
  26. Rosenberg R, Wirtz RA, Lanar DE, Sattabongkot J, Hall T, Waters AP, et al. Circumsporozoite protein heterogeneity in the human malaria parasite Plasmodium vivax. Science. 1989;245:973-976.
  27. Qari SH, Shi YP, Goldman IF, Udhayakumar V, Alpers MP, Collins WE, et al. Identification of Plasmodium vivax-like human malaria parasite. Lancet. 1993;341:780-783.
  28. Imwong M, Snounou G, Pukrittayakamee S, Tanomsing N, Kim JR, Nandy A, et al. Relapses of Plasmodium vivax infection usually result from activation of heterologous hypnozoites. J Infect Dis. 2007;195:927-933.
  29. Khusmith S, Tharavanij S, Bunnag D. Antigenic disparity of Plasmodium vivax causing initial symptoms and causing relapse. Southeast Asian J Trop Med Public Health. 1998;29:519-524.
  30. Brito CF, Ferreira MU. Molecular markers and genetic diversity of Plasmodium vivax. Mem Inst Oswaldo Cruz. 2011;106:12-26.
  31. https://www.ncbi.nlm.nih.gov/genome/gdv/browser/gene/?id=5472322
  32. Võ TC, Lê HG, Kang JM, Moe M, Naw H, Myint MK, et al. Genetic polymorphism and natural selection of circumsporozoite protein in Myanmar Plasmodium vivax. Malar J. 2020;19:303.
  33. McConkey GA, Waters AP, McCutchan TF. The generation of genetic diversity in malaria parasites. Annu Rev Microbiol. 1990;44:479-498.
  34. Kim JR, Imwong M, Nandy A, Chotivanich K, Nontprasert A, Tonomsing N, et al. Genetic diversity of Plasmodium vivax in Kolkata, India. Malar J. 2006;5:71.
  35. Bousema T, Kreuels B, Gosling R. Adjusting for heterogeneity of malaria transmission in longitudinal studies. J Infect Dis. 2011;204:1-3.
  36. Tripathi V, Gupta D. Evolutionary analysis of circumsporozoite surface protein and merozoite surface protein-1 (CSP and MSP-1) sequences of malaria parasites. Bioinformation. 2011;6:320-323.
  37. Xu YC, Dong Y, Deng Y, Mao XH, Chen MN, Zhang CL, et al. Polymorphism of circumsporozoite protein gene and population structure analysis of Plasmodium vivax with different infection sources in Yunnan Province. Chin J Parasitol Parasit Dis. 2020;38:67-73(in Chinese).
  38. di Giovanni L, Cochrane AH, Enea V. On the evolutionary history of the circumsporozoite protein in plasmodia. Exp Parasitol. 1990;70:373-381.
  39. Kosaisavee V, Hastings I, Craig A, Lek-Uthai U. The genetic polymorphism of Plasmodium vivax genes in endemic regions of Thailand. Asian Pac J Trop Med. 2011;4:931-936.
  40. Etemadi S, Nateghpour M, Motevalli Haghi A, Eslami H, Mohebali M, Setayesh N, et al. Genotyping and phylogenetic analysis of Plasmodium vivax circumsporozoite protein (PvCSP) gene of clinical isolates in South-Eastern Iran. Iran J Public Health. 2020;49:981-988.
  41. Thanapongpichat S, McGready R, Luxemburger C, Day NP, White NJ, Nosten F, et al. Microsatellite genotyping of Plasmodium vivax infections and their relapses in pregnant and non-pregnant patients on the Thai-Myanmar border. Malar J. 2013;12:275.
  42. De Niz M, Meibalan E, Mejia P, Ma S, Brancucci NMB, Agop-Nersesian C, et al. Plasmodium gametocytes display homing and vascular transmigration in the host bone marrow. Sci Adv. 2018;4:eaat3775.
  43. Brito MAM, Baro B, Raiol TC, Ayllon-Hermida A, Safe IP, Deroost K, et al. Morphological and transcriptional changes in human bone marrow during natural Plasmodium vivax malaria infections. J Infect Dis. 2020:jiaa177.
  44. Looareesuwan S, White NJ, Chittamas S, Bunnag D, Harinasuta T. High rate of Plasmodium vivax relapse following treatment of falciparum malaria in Thailand. Lancet. 1987;2:1052-5.
  45. Orjuela-Sánchez P, da Silva NS, da Silva-Nunes M, Ferreira MU. Recurrent parasitemias and population dynamics of Plasmodium vivax polymorphisms in rural Amazonia. Am J Trop Med Hyg. 2009;81:961-968.
  46. Kirchgatter K, del Portillo HA. Molecular analysis of Plasmodium vivax relapses using the MSP1 molecule as a genetic marker. J Infect Dis. 1998;177:511-515.
  47. Bright AT, Manary MJ, Tewhey R, Arango EM, Wang T, Schork NJ, et al. A high resolution case study of a patient with recurrent Plasmodium vivax infections shows that relapses were caused by meiotic siblings. PLoS Negl Trop Dis. 2014;8:e2882.
  48. Conway DJ, Greenwood BM, McBride JS. The epidemiology of multiple-clone Plasmodium falciparum infections in Gambian patients. Parasitology. 1991;103:1-6.
  49. Sutton PL, Neyra V, Hernandez JN, Branch OH. Plasmodium falciparum and Plasmodium vivax infections in the Peruvian Amazon: propagation of complex, multiple allele-type infections without super-infection. Am J Trop Med Hyg. 2009;81:950-960.
  50. Chan ER, Menard D, David PH, Ratsimbasoa A, Kim S, Chim P, et al. Whole genome sequencing of field isolates provides robust characterization of genetic diversity in Plasmodium vivax. PLoS Negl Trop Dis. 2012;6:e1811.
  51. Nair S, Nkhoma SC, Serre D, Zimmerman PA, Gorena K, Daniel BJ, et al. Single-cell genomics for dissection of complex malaria infections. Genome Res. 2014;24:1028-1038.
  52. Friedrich LR, Popovici J, Kim S, Dysoley L, Zimmerman PA, Menard D, et al. Complexity of infection and genetic diversity in Cambodian Plasmodium vivax. PLoS Negl Trop Dis. 2016;10:e0004526.
  53. Popovici J, Friedrich LR, Kim S, Bin S, Run V, Lek D, et al. Genomic analyses reveal the common occurrence and complexity of Plasmodium vivax relapses in Cambodia. mBio. 2018;9:e01888-17.
  54. Obaldia N, Meibalan E, Sa JM, Ma S, Clark MA, Mejia P, et al. Bone marrow is a major parasite reservoir in Plasmodium vivax infection. mBio. 2018;9(3):e00625-18.
  55. Silva-Filho JL, Lacerda MVG, Recker M, Wassmer SC, Marti M, Costa FTM. Plasmodium vivax in hematopoietic niches: Hidden and Dangerous. Trends Parasitol. 2020;36(5):447-458.