Overview of demographic data
Between 2004 and 2020, 40.5% (n = 6,864/16,952) of Shigella spp. diagnoses among adults processed by the Gastrointestinal Bacteria Reference Unit (GBRU) at the UKHSA were identified as S. flexneri. Following an increase in the proportion of S. flexneri cases that were serotype 3a between 2004 and 2013, and a decline between 2013 and 2018, the proportion of 3a more than tripled between 2018 and 2020 (from 7.8% [n = 7/90] to 30.0% [n = 85/287], Fig. 1a). Epidemiological comparisons of S. flexneri 3a cases among presumptive MSM in the emergence phase (2012–2013) and re-emergence phase (2019–2020) (Table 1), showed that the age and geographical distribution remained largely unchanged, with the greatest proportion of cases reported from London and the Southeast of England (Table 1). Hospitalisation data was not available for the cases identified during the emergence phase; however, enhanced surveillance questionnaires from the re-emergence phase indicate that 35% (n = 18/51) of presumptive MSM were hospitalised. WGS has revealed that the re-emergence of S. flexneri 3a has been driven mainly by a single 10 SNP cluster t10.1189, (n = 27/118, 22.8%) (Table 1).
Table 1
Summary of characteristics of the recent re-emergence of S. flexneri 3a cases among presumptive MSM and comparison to original emergence, England.
Characteristic | 2012–2013 n (%) | 2019–2020 n (%) |
Age | | |
16–34 | 95 (39.7) | 54 (42.2) |
35–64 | 136 (56.9) | 67 (52.3) |
65+ | 5 (2.1) | 7 (5.5) |
Unknown | 3 (1.3) | 0 (0.0) |
Median age [IQR]a | 38.5 [30-46.5] | 36 [29-45.5] |
Region | | |
London | 140 (58.6) | 91 (71.1) |
South East | 36 (15.1) | 20 (15.6) |
North West | 28 (11.7) | 2 (1.6) |
West Midlands | 8 (3.4) | 1 (0.8) |
East of England | 13 (5.4) | 8 (6.3) |
South West | 6 (2.5) | 4 (3.1) |
Yorkshire & Humber | 2 (0.8) | 1 (0.8) |
North East | 4 (1.7) | 1 (0.8) |
East Midlands | 2 (0.8) | 0 (0.0) |
Unknown | 0 (0.0) | 0 (0.0) |
Hospitalised | | |
Yes | ND | 18 (35.3) |
No | ND | 33 (64.7) |
Median nights in hospital [IQR]a | | 2 [2–4] |
10 SNPb cluster | | |
t10.1177 | 27 (11.3) | 0 (0.0) |
t10.43 | 18 (7.5) | 0 (0.0) |
t10.138 | 2 (0.8) | 0 (0.0) |
t10.116 | 1 (0.4) | 0 (0.0) |
t10.1182 | 1 (0.4) | 0 (0.0) |
t10.1197 | 1 (0.4) | 0 (0.0) |
t10.1211 | 1 (0.4) | 0 (0.0) |
t10.1227 | 1 (0.4) | 0 (0.0) |
t10.1356 | 1 (0.4) | 0 (0.0) |
t10.1189 | 0 (0.0) | 108 (84.4) |
t10.1305 | 0 (0.0) | 2 (1.6) |
t10.1627 | 0 (0.0) | 1 (0.8) |
t10.1648 | 0 (0.0) | 1 (0.8) |
t10.1684 | 0 (0.0) | 13 (10.2) |
t10.1852 | 0 (0.0) | 1 (0.8) |
t10.1969 | 0 (0.0) | 1 (0.8) |
Unknown | 186 (77.8) | 1 (0.8) |
a Interquartile range (IQR) | | |
b Single nucleotide polymorphism (SNP) | | |
S. flexneri 3a re-emergence featured clonal replacement
To determine the genomic epidemiology among 502 isolates of S. flexneri 3a from the emergent and re-emergent periods (2008–2013 n = 205; 2016–2020 n = 298), a phylogenetic tree was constructed and matched to epidemiological metadata (Fig. 2). This was complemented by Bayesian Analysis of Population Structure (BAPS) groups being computed to assign genetically similar clusters, which we called Global-BAPS clusters. This revealed that isolates clustered by patient travel destination, with isolates from cases reporting recent travel to Asia mostly (71%, n = 25/35) belonging to the Global-BAPS-3 cluster, while African-travel associated isolates typically (74%, n = 26/35) belonged to Global-BAPS-2 (Table 2, Fig. 2). Notably, more than half (52%, n = 257/502) of the isolates have no associated travel data. Global-BAPS-1 cluster was mostly (84%, n = 286/339) composed of males, aged 16–60, who were without a known history of international travel and was thus defined as MSM-associated. The majority (78%, n = 159/205) of the 2008–2014 S. flexneri 3a outbreak belonged to the Global-BAPS-1 cluster, as did the majority (58%, n = 180/312) of S. flexneri 3a isolated between 2016 and 2020.
Table 2
Temporal and demographic features of isolates in this study by Global-BAPS groups.
| | | | Year (%) | | Sex (%) | | Age Group in Years (%) | | Travel (%) |
Global-BAPS Group | | Total (n) | | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2016 | 2017 | 2018 | 2019 | 2020 | | Male | Female | | < 16 | 16–60 | > 60 | | Africa | Asia | Europe | N-America | Oceania | S-America |
1 | | 339 | | 4 | 4 | 8 | 6 | 5 | 19 | 10 | 5 | 2 | 14 | 22 | | 96 | 4 | | 1 | 94 | 5 | | 0 | 1 | 4 | 0 | 0 | 1 |
2 | | 45 | | 2 | 16 | 0 | 2 | 2 | 11 | 9 | 16 | 7 | 22 | 13 | | 76 | 24 | | 11 | 69 | 20 | | 58 | 2 | 0 | 0 | 0 | 0 |
3 | | 67 | | 3 | 7 | 6 | 9 | 4 | 4 | 16 | 15 | 16 | 15 | 3 | | 64 | 36 | | 24 | 63 | 13 | | 0 | 37 | 6 | 0 | 0 | 0 |
4 | | 35 | | 3 | 3 | 3 | 6 | 3 | 3 | 9 | 11 | 14 | 23 | 23 | | 71 | 29 | | 9 | 80 | 11 | | 23 | 3 | 0 | 0 | 0 | 11 |
5 | | 6 | | 0 | 0 | 17 | 0 | 0 | 0 | 0 | 0 | 17 | 50 | 17 | | 33 | 67 | | 17 | 67 | 17 | | 0 | 67 | 0 | 0 | 0 | 0 |
6 | | 3 | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 67 | 0 | 33 | | 100 | 0 | | 0 | 100 | 0 | | 33 | 0 | 0 | 0 | 0 | 0 |
7 | | 7 | | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 43 | 14 | 29 | 0 | | 71 | 29 | | 57 | 29 | 14 | | 0 | 0 | 0 | 14 | 0 | 0 |
As we were focused on pathogen changes between the emergence and re-emergence of MSM-associated Global-BAPs-1 isolates, these isolates (n = 339) underwent a further round of BAPS grouping to facilitate study at a finer resolution. These more highly resolved groups are referred to as MSMA-BAPS groups (Table 3, Fig. 3). Correlation with the nomenclature from the previous study of the emergent event revealed that lineages, previously designated as sublineages B and C,30 were associated with MSMA-BAPS-1, and sublineage A with MSMA-BAPS-3, while cases from 2019 onwards were now MSMA-BAPS-5. This revealed that the first outbreak was majorly associated with MSMA-BAPS-1 (23%) and BAPS-3 (39%), and the second outbreak was associated with MSM-BAPS-5 (42%), (Table 3). Thus, this subsequent analysis revealed further distinction between the two time periods, where the recent re-emergence (2016–2020) was associated with a single clonal expansion that replaced the earlier outbreak (Fig. 3). Thus, the clonal replacement for the re-emergence of S. flexneri 3a in England has evolved from MSMA-BAPS-1 to MSMA-BAPS 5.30
Table 3
Temporal features of the Global-BAPS-1 cluster broken down into MSMA-BAPS groups.
| | | | Year (%) | |
MSMA- BAPS Group | | Total (n) | | 2008 | 2009 | 2010 | 2011 | 2012 | 2013 | 2016 | 2017 | 2018 | 2019 | 2020 | |
1 | | 79 | | 3 | 1 | 3 | 4 | 8 | 42 | 27 | 8 | 4 | 1 | 1 | |
2 | | 21 | | 24 | 33 | 38 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | |
3 | | 90 | | 2 | 4 | 17 | 18 | 12 | 36 | 8 | 2 | 0 | 1 | 0 | |
4 | | 17 | | 29 | 12 | 12 | 12 | 6 | 0 | 18 | 6 | 0 | 0 | 6 | |
5 | | 132 | | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 | 3 | 35 | 56 | |
Re-emergent S. flexneri 3a is less antimicrobial resistant
Owing to the importance of shifting resistance profiles in previous MSM-associated shigellosis outbreaks, we explored whether there were differences in the emergent and re-emergent subtypes, with respect to AMR. We identified the antimicrobial resistance genes (ARGs) across the isolates and correlated this with genomic groupings (Fig. 3, Table 4). This revealed that emergent MSMA-BAPS-1 isolates had highly conserved erm(B) (73%, n = 58/79), mph(A) (77%, n = 61/79) and blaTEM−1 (80% n = 63/79) genes, whereas a loss of, first, blaTEM−1 and then erm(B) ARGs in the re-emergent MSMA-BAPS-5 isolates was observed. Although blaTEM−1 and then erm(B) genes were lost, mph(A) has been maintained in MSMA-BAPS-5 isolates (92%, n = 121/132). Previous work on this lineage revealed that most ARGs were carried on four main mobile genetic elements (MGEs); the Shigella resistance locus (blaOXA−1, catA1, aadA1, and tet(B), pKSR100 (blaTEM−1, erm(B), and mph(A)), pKSR100 integron (dfrA17, sul1 and aadA5) or pCERC1 (dfrA14 and sul2).30 Grouping ARGs by these MGEs suggested that, in contrast to the emergent event associated with MSMA-BAPS-4 where 24% (n = 4/17) of isolates contain dfrA14 and sul2, indicative of pCERC1 presence, only a single MSMA-BAPS-5 isolate potentially contains the pKSR100 integron (indicated by presence of dfrA17, though absence of sul1 and aadA5, 0.75%, n = 1/133), and none contain genes associated with pCERC1 (indicated by genes dfrA14 and sul2, 0%, n = 0/133).
Table 4
Proportion of S. flexneri 3a MSMA-BAPS isolates containing various ARGs which demonstrated change between clusters.
| Proportion of MSMA-BAPS cluster containing gene (%) |
Antimicrobial class | Gene | 1 (n = 79) | 2 (n = 21) | 3 (n = 90) | 4 (n = 17) | 5 (n = 132) |
Aminoglycosides | aph(3")-lb | 4 | 0 | 37 | 24 | 0 |
aadA5 | 3 | 0 | 17 | 0 | 0 |
aph(6)-ld | 4 | 0 | 39 | 24 | 0 |
B-lactams | blaTEM−1 | 80 | 0 | 57 | 0 | 0 |
Trimethoprim | dfrA14 | 3 | 0 | 38 | 24 | 0 |
dfrA17 | 3 | 0 | 17 | 0 | 1 |
Macrolides | erm(B) | 73 | 0 | 53 | 0 | 26 |
mph(A) | 77 | 0 | 57 | 18 | 92 |
Sulphonamides | sul1 | 6 | 0 | 20 | 0 | 0 |
sul2 | 8 | 0 | 40 | 24 | 0 |
Antiseptics | qacE | 1 | 0 | 17 | 0 | 0 |
Phenotypic testing (minimum inhibitory concentrations [MICs]) of the antimicrobials corresponding to the genotypic resistance profiles was performed to investigate whether changes in the genotype of the bacteria influenced phenotypic antimicrobial resistance. The findings largely corresponded (90%, n = 18/20 concordance statistic) with genotypic resistance (Supplementary Table 1). Presence of at least one of erm(B) or mph(A) and the isolate being resistant to azithromycin had a 94% (n = 16/17) concordance statistic. Presence of at least one of sul1, sul2, dfrA14, or dfrA1, and the isolate being resistant to trimethoprim-sulfamethoxazole had a 67% (n = 4/6) concordance statistic. This revealed that Global-BAPS-1 S. flexneri 3a in the UK has acquired resistance to azithromycin and trimethoprim-sulfamethoxazole, primarily through acquisition of mph(A), and sul1 and sul2. None of the isolates were resistant to ciprofloxacin, consistent with there being no relevant quinolone resistance determining region (QRDR) mutations (i.e. ParC S80I, nor GyrA S83L or D87G) present. Also, none of the isolates were resistant to ceftriaxone. While resistance to azithromycin correlated with the presence of either erm(B) or mph(A), higher levels of resistance (> 256mg/L) were typically associated with the presence of both genes (Supplementary Table 1), where the presence of both mph(A) and erm(B) genes and the isolate having a > 256mg/L resistance to azithromycin had a concordance statistic of 82% (n = 9/11). All isolates were sensitive to fosfomycin or mecillinam.
AMR gene loss on pKSR100 is facilitated by transposases
To investigate the structural basis for the changes in the ARG content of S. flexneri 3a over time, we performed long-read (Nanopore) sequencing on three representative isolates based on their AMR gene profiles, year of isolation, and BAPS-group assignment. We then investigated similarity among contiguous sequences containing mph(A), which revealed that the mph(A) genes were located on plasmids. We found that the pKSR100-like plasmids from: the pKSR100 reference plasmid from S. flexneri 3a, and 2016 and 2020 UKHSA isolates of S. flexneri 3a were overall genetically similar (Fig. 4a). However, the 2016 and 2020 pKSR100-like plasmids did have some notable differences, particularly around the region carrying AMR genes (Fig. 4b). The pKSR100-like plasmid from the 2020 isolate was shorter (69,055 bp) than the 2016 version (73,462 bp) and had lost the erm(B) and blaTEM−1 genes, which were originally flanked by transposase genes. A qacE Δ-1-containing plasmid from a Global-BAPS-4 S. flexneri 3a isolate from 2020 was genetically distinct from the pKSR100 plasmids (Fig. 2), but contained a similar mph(A)-containing region also flanked by transposase genes. To explore the resolution more fully across the isolate collection (i.e. also in those isolates that were not long-read sequenced), we mapped isolates' sequencing reads against pKSR100 which showed that the more recent isolates mapped against a lower proportion of pKSR100, likely reflecting the loss of erm(B) and blaTEM−1 captured in the individual isolate long-read comparisons (Supplementary Fig. 1), the decision to display percentage mapping of ≥ 60 was informed by class interval frequencies (Supplementary Fig. 1b) which allowed for a distinguishable colour gradient. This revealed that a newer, shorter plasmid with fewer AMR genes is now circulating in this MSM-associated S. flexneri 3a lineage.