Drain biofilm and P-trap water were sampled contemporaneously across six sinks in four ICU rooms at least three times annually within the five-year study period (286 sampling instances) with a KPCO-positivity rate of 20–40% for all rooms across all samplings. The longitudinal sampling resulted in 87 KPCO isolates across six sinks (52 drain/35 P-trap) from four rooms (See Supplementary Table 1). Eighty-two of the KPCO isolates (94%) were successfully assembled and included in the analysis (Supplementary Fig. 2).
Distribution of species and strains
Ten unique blaKPC-positive species were identified in the drains and P-traps. Serratia marcescens was the most frequently isolated species, accounting for 24/82 sequenced isolates (29%), and Citrobacter freundii and Raoultella ornithinolytica the next most common, with 19 isolates each (23%) (Fig. 2, Supplementary Figure). Across all species, we identified 14 distinct clades (based on a threshold to define a clade of > 400 SNVs) including four C. freundii clades and two S. marcescens clades (Fig. 2). Given how genetically distinct clades were from each other, blaKPC presence across species/clades was most consistent with multiple horizontal transfer events15,16.
We observed distinct species/clade compositions in the unique sinks across different units. Four of the ten species were exclusively confined to one of the sinks, namely Enterobacter hormaechei in Sink-1PS, Aeromonas hydrophila in Sink-1BS, and Enterobacter asburiae and K. pneumoniae in Sink-2BS (Fig. 2).
The remaining six species were shared across multiple locations, but certain clades were specific to particular sinks (e.g. S. marcescens clade 2 in Sink-4PS, C. freundii clade 4 in Sink-1PS, and C. freundii clade 3 in Sink-2BS). Additionally, species/clade composition within some sinks either changed over time as well as observing several clades persisting for the five-year duration in a single sink (Supplementary Fig. 2). These findings suggest that KPCO dynamics within sinks vary substantially from sink-to-sink with each sink fixture undergoing distinct evolutionary processes with some degree of independence.
Complex, nested variation in the genetic context of blaKPC
To assess the dynamics of horizontal transfer of blaKPC within environmental isolates, we examined variation in blaKPC and its genetic environment. blaKPC is typically carried in a highly conserved replicative 10 kb transposon, Tn440117, which can also be mobilized as part of a complex nested, Russian doll-like system by plasmids and other mobile genetic elements such as insertion sequences and other transposons3. Within Tn4401, ISKpn7 is typically located upstream of blaKPC with the same orientation, while ISKpn6 is positioned downstream of blaKPC in the opposite orientation (Fig. 3). Both ISKpn6 and ISKpn7 can function as mobile genetic elements19,22.
In this study, two blaKPC alleles were identified in three known Tn4401 variants, with additional variation observed in the insertion sequence, downstream of blaKPC (i.e. ISKpn6). These composite transposons were seen across 113 different plasmids, some of which were shared across species/clades (Fig. 3). Overall, with the many varied genetic contexts of blaKPC within this data set demonstrates a complex history of gene mobilization across species and plasmids at several trackable genetic levels; allele, Tn4401 structural variant with and without Tn5403 insertion, replicon typed plasmid and species (Fig. 4).
The distribution of the two blaKPC alleles identified (blaKPC−2, blaKPC−3; differ by one SNV) could not clearly be attributed to any single factor such as unit/sink location, species/clade, or mutation given the different wider genetic contexts (Supplementary Table 2). None of the 82 isolates carried both blaKPC−2 and blaKPC−3 simultaneously, as observed previously18. Different isolates of the same clade did however harbour different blaKPC alleles. For example, one clade of K. michiganensis had blaKPC−2 on the pUVA01 and IncFII(pBK30683) plasmids in one isolate sampled from Sink-3PS, and blaKPC−3 on the pUVA01 and IncN plasmids in two other isolates from the same sink.
The robust association between the blaKPC gene and ISKpn6/ISKpn7 within Tn4401 is well-documented in the literature19–21. Among the 82 isolates, we identified two structural isoforms of ISKpn6: ISKpn6a and ISKpn6b. The genomic characterization of these two isoforms revealed more intricate rearrangement structures within our collection ISKpn6a closely aligns with the wild-type ISKpn6 sequence in the ISFinder database. In contrast, ISKpn6b exhibits a novel insertion of Tn5403 at nucleotide position 406 of ISKpn6 (Fig. 3). ISKpn6a was the most prevalent isoform in environmental samples, found in 62 isolates across all 10 bacterial species, 14 clades, and 12 of the plasmid Inc types. It was observed in all 6 units Dec 2013-Sep 2014. In contrast, ISKpn6b was confined mainly to specific species (R. ornithinolytica, K. oxytoca, K. michiganensis), and plasmid Inc types (IncFIA(HI1)__IncFII(K), IncN, IncM1, Col440II), predominantly observed in units Sink-4PS and Sink-3PS, and from Mar 2014 to Sep 2018. The average copy number of blaKPC when associated with ISKpn6b was noted to be significantly higher than when associated with ISKpn6a (1.39, stdev [0.57] versus 1.82 [stdev: 0.59]; p = 0.003, Wilcoxon rank sum test). This observation is consistent with the hypothesis that Tn5403 enhances the transposition capability of blaKPC, as has been demonstrated in vitro23.
As blaKPC is always contained within Tn4401 in our dataset, we identified structural variation, single-nucleotide variation (SNV), and flanking sequences of Tn4401 across the environmental samples to track and validate the mobilization events of blaKPC. Among 82 isolates, there were three structural variants of Tn4401, namely, Tn4401b, Tn4401a, and Tn4401h in 79 (96.3 %), 1 (1.2%), 2 (2.4%) isolates, respectively. All structural variants carried blaKPC, but they exhibited different deletions in the promoter region upstream of blaKPC as previously described24. Tn4401b was the most prevalent variant, being found in 8/10 species, widely spreading across all sampled sinks.
Using replicon typing as a crude metric of plasmid diversity, there were 113 blaKPC-carrying plasmids across 15 distinct known replicon types [as well as the non-typeable but characterized plasmid pKPC_UVA01 (repA_CP013325)]16,25. The top 3 Inc types associated with blaKPC carriage were pKPC_UVA01 (in 33 samples), IncN (in 21 samples), and hybrid IncU__IncX5 (in 18 samples). These major KPC-carrying Inc types were shared across different units supporting the concept that blaKPC was acquired by the plasmid outside of a specific study sink, with pKPC_UVA01, IncN, and IncU__IncX5 observed in 5, 4, and 4 sinks, respectively. The IncU/IncX KPC hybrid plasmid was only identified in one clade of S. marcescens (across multiple sinks), suggesting vertical transmission and persistence of a blaKPC plasmid within this S. marcescens clade over time. In contrast, pKPC_UVA01 was shared across 5 species, and in 25/33 of isolates blaKPC had more than one genetic context consistent with blaKPC dissemination via horizontal gene transfer through transposition and plasmid transfer in this genetic context.
Given the diversity of genetic contexts for blaKPC that were observed, we applied our framework to characterize the putative mobilization events that contributed to this complex diversity.
Characterizing blaKPC transfers by Building a Composite-Sample Complex Model
To capture the complex dynamics of blaKPC movement and evolution within a relatively stable biofilm we took a novel approach. We treated each isolate as a collection of co-occurring circularized genomic elements (either chromosomes or replicon type labeled plasmids), and as an evolutionary snapshot where the genomic element could be a source or a target of blaKPC. We then developed a mathematical framework of plasmid chromosome co-occurrence called the "Composite-Sample Complex" (CSC) (See Method Section and Supplementary Fig. 4). This framework enables us to deduce gene mobility and directionality patterns from a collection of isolates. The primary principles are that only one plasmid replicon type or chromosome can exist in any instance and, blaKPC can be integrated into or from any genomic context. Each CSC constitutes the union of all isolates in the collection which is being investigated (e.g. within a single clade, or a single sink), the vertices are the individually labeled plasmids/chromosome, vertex weights are determined by the number of isolates that contain that specific element in the collection, and the weight of the number of vertices with blaKPC indicates origin versus source. By integrating replicon typing information and treating all closed genomic elements within a bacterial cell as a potential target for resistance integration, the Composite-Sample Complex ensures reliable identification of these mobility patterns. Moreover, this framework assists in accurately pinpointing the source and target of gene mobility.
Transposition: Plasmid to chromosome
We observed the integration of blaKPC into the chromosome in Citrobacter freundii (in 4 samples), Raoultella ornithinolytica (in 9 samples), and Enterobacter asburiae (in 1 sample). Interestingly, while Citrobacter freundii and Raoultella ornithinolytica were shared across different units, the genomic KPC-carrying integration site in the chromosomes was specific to each sink, with Citrobacter freundii found in Sink-1PS and Raoultella ornithinolytica in Sink-4PS, suggesting that these integrations into chromosome took place in sink specific biofilm environment. This is an important biological validation as chromosomes, by definition, will not mobilize into another species or strain like a mobile plasmid. The CSC is then further supported in the finding that the genomic context of chromosomal integration is unique to the specific drain biofilm.
We can successfully hypothesize the source of blaKPC transposition for each of these integrations, as depicted in Fig. 5. To elucidate our approach, we use the KPC integration into the R. ornithinolytica chromosome in unit Sink-4PS as an illustrative example of the model. Within unit Sink-4PS, 9 out of 12 R. ornithinolytica isolates harbor KPC in their chromosomes, all of which are associated with KPC-carrying IncN plasmids (Fig. 5a, Fig. 5b). Even among the remaining R. ornithinolytica isolates lacking KPC in their chromosomes, they still possess KPC-carrying IncN plasmids. Furthermore, no other KPC-carrying plasmid displays such a high degree of co-occurrence with KPC-carrying R. ornithinolytica chromosomes. Hence, we posit that the IncN type is the origin of KPC transposition into R. ornithinolytica chromosomes (Fig. 5c).
To validate the hypothesized transposition pattern, we examined the variations of blaKPC and its surrounding genetic elements in both the source and target. We then checked for consistency between them (Table 2). Specifically, the co-occurring IncN plasmids and R. ornithinolytica chromosomes both possess blaKPC−3, which is flanked by ISKpn6b and further surrounded by Tn4401b. This arrangement is in agreement with the notion that the IncN plasmid type serves as the origin of KPC transposition onto the chromosomes of R. ornithinolytica. Additionally, we examined the 5bp flanking sequence of Tn4401. The variations in the flanking sequence of Tn4401 indicate that the mobilization of the KPC occurred through the transposition of Tn4401, see Table 2 line 1.
Using a methodology analogous to our earlier approach, we analyzed the CSCs and KPC-induced CSCs constructed from isolates of the same bacterial clade from a particular sink across all possible strain-sink combinations. We successfully identified two other KPC plasmid-chromosome transposition pathways: from pKPC_UVA01 to C. freundii in unit Sink-1PS (Fig. 5d), and from pKPC_UVA01 to E. asburiae in unit Sink-2BS (Fig. 5e). Again, these transposition pathways are supported by subsequent validation (Table 2) from the genomic context. Notably, all identified KPC plasmid to chromosome transposition patterns are sink-specific, consistent with transpositions within the specific sink drain/ptrap biofilm, highlighting that this mobilization is potentially occurring while in the environmental biofilm.