Direct co-extraction improves total DNA and protein yields, but not prokaryotic protein identification rates
To compare the effect of indirect (n = 3) and direct (n = 3) plastisphere co-extraction, DNA and protein yields were realised from the optimal biofilm detachment and cell lysis protocol. Although the results were variable, direct extraction resulted in mean total DNA concentrations ~ four-times greater than indirect extraction, at 6.2 ± 5.1 µg and 1.7 ± 1.1 µg, respectively (Table 1). Consistent with the DNA isolation results, direct extraction resulted in mean total protein concentrations that were ~ five times greater than indirect extraction, at 3350 ± 1991 µg and 674 ± 228 µg respectively (Table 1). However, the opposite trend was observed in the protein identification results, whereby the numbers of proteins and distinct peptides identified using gel-free proteomics were generally higher in the indirect extraction samples, but these differences were not statistically significant (T-test, P > 0.05; Table 1; Fig. S1). The mean coverages of identified peptide spectra, although relatively low, were significantly higher for the indirect extraction samples for three of the protein search databases (16S-TaxDB, 16S-TaxDB-2nd, MG-DB; T-test, P < 0.05; Fig. S1).
Table 1
Yields from indirect and direct plastisphere co-extraction, including mean total DNA and protein (± standard deviation; S.D.), and mean protein and peptide identification rates from individual gel-free metaproteomes (n = 3 per treatment) across four search databases (16S-TaxDB, 18S-TaxDB, MG-DB, and 16S-TaxDB-2nd).
Variable (Mean ± S.D.) | Indirect | Direct |
Total DNA (µg) | 1.7 ± 1.1 | 6.2 ± 5.1 |
Total Protein (µg) | 674 ± 228 | 3350 ± 1991 |
No. Proteins (all DBs) | 308 ± 88 | 175 ± 100 |
No. Peptides (all DBs) | 685 ± 327 | 356 ± 313 |
Consistent taxonomy and function between indirect and direct extraction
Despite the observed differences in DNA concentrations, sequencing of the 16S and 18S rRNA genes revealed no significant differences in the alpha diversity of the prokaryotic and eukaryotic plastisphere between indirect and direct extraction. This was evident for Faith’s Phylogenetic Diversity, Shannon’s diversity, and Pielou’s community evenness of both the 16S and 18S ASVs (Kruskal Wallis, P > 0.05; n = 3 per treatment, per gene). Moreover, no significant differences were observed in the composition of 16S and 18S rRNA ASVs between the indirect and direct extraction samples (Fig. 2), for both compositional (ANCOM, P > 0.05; n = 3 per treatment, per gene) and relative abundances (PERMANOVA, P > 0.05; n = 3 per treatment, per gene). For both extraction types, the prokaryotic plastisphere was dominated by the phyla Proteobacteria and Bacteroidota, with ASVs taxonomically identified as Pseudoalteromonas sp., Psychrobacter sp., Gillisia sp., and Dokdonia sp., respectively, displaying prevalence across all samples (Fig. 2A). Although not statistically significant, eukaryotic ASVs displayed variability in their prevalence and relative abundances across replicates (Fig. 2B). Nevertheless, the dominant eukaryotic lineages belonged to the phyla Basidiomycota, Ciliophora, and Nematozoa, with ASVs identified as Tremellomycetes sp., unclassified Oligohymenophorea, and Rhabditida Pellioditis marina more consistent across samples (Fig. 2B).
To determine the impact of the extraction type on plastisphere function, the up- and down-regulation of proteins was explored using fold change and revealed few significant differences (Fig. S2). Specifically, only three proteins were significantly upregulated in the indirect extraction samples (identified using 16S-TaxDB). These included an unannotated protein, a Pseudoalteromonas sp. 50S ribosomal protein, and a Flavobacterium sp. BFFFF2 Bacterial Ig-like domain (Big_12 domain-containing protein). Conversely, only one protein annotated as Psychrobacter sp. Elongation factor G was significantly upregulated in direct extraction. No statistically significant differences in the up- or down-regulation of proteins identified as Eukaryotic (identified using 18S-TaxDB) were observed between the extraction types.
Mechanical detachment and cell lysis approaches impact plastisphere recovery and protein yields
To standardise biofilm detachment across different plastic properties and polymer compositions, four mechanical approaches were tested on 2 g mixed plastic debris. All approaches resulted in significant detachment relative to the Control, with the highest optical densities of recovered cells observed in the Vortex, Bead-beating, and Combination treatments (Fig. 3A). Within these, Bead-beating was the most reproducible and displayed the greatest difference relative to the control (T-test, P = 0.0003; Fig. 3A), and the greatest decrease in plastic-bound biofilm (reduction in the range 0.163–0.171 OD595nm; Fig. 3B). Three rounds of Bead-beating further improved cell recovery relative to the Control (data not shown). For all methods, plastisphere cells detached in ASW remained viable, demonstrating growth after 24 hr in nutrient-rich media (Fig. S3A).
Two common mechanical cell lysis approaches, Bead-beating and Sonication were tested on cells detached from the plastisphere. Significantly greater mean total protein yields were observed following Sonication compared to Bead-beating, at 378 µg and 93 µg, respectively (Fig. 3C; T-test, P < 0.05). Subsequently, four chemical cell lysis buffers were tested in combination with Sonication revealing differences in cell lysis efficiency and protein recovery, such that TE < Urea-Thiourea < Guanidine HCl < SDS (Fig. 3D). The TE-based buffer was the least efficient, with normalised median yields of 55 µg g− 1 plastic debris, compared to median yields of 192 µg g− 1 plastic using the most efficient SDS-based lysis buffer (Fig. 3D). Cells lysed by Sonication in TE buffer remained viable after 24 hr growth in nutrient rich media, while those lysed in the SDS-, Urea-Thiourea-, and Guanidine HCl-based buffers were no longer viable (Fig. S3B).
Complementary gel-free and gel-based metaproteomics
To evaluate gel-free and gel-based protein fractionation, protein identification rates and their taxonomic and functional annotations were qualitatively compared between six combined gel-free and 20 combined gel-based metaproteomes. Overall, protein identification rates were higher in the combined gel-free metaproteome, with 967 distinct proteins and 1506 distinct peptides, compared to 382 proteins and 391 distinct peptides in the gel-based approach (16S-TaxDB; including peptides ≥ 1). Following taxonomic annotation of the identified proteins, 10 unique genera were found to be shared between the gel-free and gel-based metaproteomes, including those attributed to the dominant lineages, Pseudoalteromonas, Pseudomonas, and Psychrobacter, and the less prevalent, Marinobacter, Sphingomonas, Streptomyces, and Vibrio. While 9 genera were specific to the gel-free, only 4 were unique to the gel-based approach (Fig. S4), namely actinobacterial Kitasatospora and Nocardioides, and gammaproteobacterial Lysobacter and Shewanella. Interestingly, the functional annotations reflected divergence between the gel-free and gel-based approaches (Fig. 4A-B), with only 4 shared unique protein functions between them (Fig. S4). The shared proteins were associated with bacterial translation, ribosomal structure and biogenesis (3 unique functions) and a periplasmic substrate-binding protein involved in inorganic ion transport and metabolism. A total of 78 unique protein functions were identified specifically using the gel-free approach, with the majority of these spanning the functional categories of translation, ribosomal structure and biogenesis (29 unique functions), energy production and conversion (15 unique functions), and, posttranslational modification, protein turnover, and chaperones (7 unique functions) (Fig. 4A). Conversely, 21 unique protein functions were specific to the gel-based proteomes, with the most common functional categories encompassing amino acid transport and metabolism (3 unique functions), signal transduction mechanisms (3 unique functions), carbohydrate transport and metabolism (2 unique functions), cell wall biogenesis (2 unique functions), and transcription (2 unique functions) (Fig. 4B).
The role of protein search database in determining plastisphere structure and function
Comparison of the high confidence protein annotations revealed similarities and differences in the diversity of the active taxa identified using the 16S-TaxDB, MG-DB, and the second search database, 16S-TaxDB-2nd (Fig. 5A). MG-DB identified the greatest diversity of organisms spanning 7 phyla (3 Bacteria, 3 Eukaryota, and 1 viral) and 19 genera. 16S-TaxDB recovered proteins from 5 phyla (4 Bacteria and 1 Eukaryota) and 18 genera, while 16S-TaxDB-2nd detected 6 phyla (4 Bacteria and 2 Eukaryota) across 19 genera. In total, 11 genera were shared between 16S-TaxDB, MG-DB, and 16S-TaxDB-2nd (Fig. 5A), while 8 genera were unique to MG-DB, including Bradyrhizobium, Cobetia, Gillisia, Granulosicoccus, Polaribacter, Phyllobacterium, Planococcus, and the virus, Prymnesiovirus. In contrast, only 2 and 3 genera were specific to 16S-TaxDB and 16S-TaxDB-2nd, respectively (Fig. 5A). These included, Devosia and Marinobacter for 16S-TaxDB and, Cylindrotheca, Halomonas, and Moraxella for 16S-TaxDB-2nd. Additional active eukaryotic organisms were highlighted through protein identification using 18S-TaxDB, however the taxonomic identities were largely unresolved by LCA, with only a few proteins of the phyla Bacillariophyta and Oomycota annotated. Taxonomic annotations from the DIAMOND BLAST results revealed 5 additional phyla spanning the Arthropoda, Ascomycota, Chordata, Nematoda, and Streptophyta. Although these classifications should be interpreted with caution due to the limitations of short peptide sequences, these contained 16 genera (Fig. 5A). Within the functional annotations, the numbers of total functional categories were greatest for proteins identified using the 16S-TaxDB-2nd, and MG-DB, followed by 16S-TaxDB, and 18S-TaxDB, at 18, 18, 17, and 6 unique categories, respectively. At the protein level, eukaryotic functional annotations were limited, but 7 unique protein functions were shared between all four search strategies, including ATP synthase subunits, 50S ribosomal proteins, elongation factors, and chaperonin proteins. 16S-TaxDB, MG-DB, and 16S-TaxDB-2nd shared 57 of the annotated unique protein functions. Across these individual search databases, 5, 12, and 15, unique protein functions were found to be specific to 16S-TaxDB, MG-DB, and 16S-TaxDB-2nd, respectively (Fig. 5B).
Multi-omic insights into a heterotrophic plastisphere
Annotation of the metabolic potential within the representative metagenome (n = 1), combined with the expressed proteins identified using the most comprehensive gel-free metaproteome annotations (n = 6; 16S-TaxDB, MG-DB and 16S-TaxDB− 2nd; Fig. 5), provided insights into the functioning of the heterotrophic plastisphere. The predicted metabolism, guided by the metagenome, revealed genes encoding energy acquisition, carbon metabolism, and amino acid metabolism, which were complemented by a range of membrane transporters (Fig. 6A-D). The metaproteomes revealed the specific expression of proteins involved in oxidative phosphorylation, the citrate cycle, glutamine and proline biosynthesis, and carbohydrate metabolism, providing confirmation of the importance of these pathways (Fig. 6E; Table S1). Interestingly, proteins associated with stress responses were a prominent feature of the metaproteomes, including the specific expression of TerD (Psychrobacter), superoxide dismutase (Psychrobacter, Planococcus), alkyl hydroperoxide reductase (Pseudoalteromonas), ferritin (Psychrobacter), and cold-shock and heat-shock proteins in several lineages (Fig. 6E). Both the metagenome and metaproteomes included mechanisms associated with biofilm formation, such as motility, chemotaxis, and adhesion, with several expressed proteins also associated with virulence factors, such as Pseudoalteromonas lipase, Lysobacter mycobactin siderophore (Phenyloxazoline synthase MbtB), elongation factors, Pseudomonas Type II secretion, and Streptomyces toxins (papain fold toxin domain, Ntox27 domain-containing protein; Fig. 6E; Table S1). Furthermore, proteins indicative of inter-community interactions were also expressed, including those mediating protein-protein interactions in Flavobacterium (Big_12 domain), quorum sensing in Streptomyces (acyl-homoserine lactone synthase), and others suggestive of viral infection in Arthrobacter, Vibrio, and Pseudoalteromonas, such as a phage-related minor tail protein, an integration host factor, and HflK (Fig. 6E; Table S1). Notably, evidence of genes and expressed proteins from pathways relevant to marine pollutants, such as aromatic compound degradation in Psychrobacter (e.g., homoprotocatechuate and chlorocatechol degradation) and fatty acid beta oxidation in Psychrobacter and Pseudoalteromonas (alcohol dehydrogenase, 3-ketoacyl-CoA thiolase, acyl-coenzyme A dehydrogenase), were provided by both approaches (Fig. 6; Table S1). Unsurprisingly, genes and proteins associated with information systems were also highly prevalent, such as those associated with ribosomes, chaperonins, transcription, and translation (Table S1).
The taxonomic annotations of the multi-omic datasets revealed the dominance of Proteobacteria, in particular, Pseudoalteromonas (Fig. 2A; Fig. S4; Table S2; Fig. 6E). Metagenomic binning resulted in a partial MAG identified as Pseudoalteromonas sp. (77% completeness, < 10% contamination, including 13 total rRNAs and 97 tRNAs) and the predicted proteome was used as a reference database to further resolve this microorganism’s activity. This resulted in the identification of 98 expressed proteins and 431 high confidence peptides for relative quantification (Table S3). The most abundant proteins expressed by Pseudoalteromonas sp. were involved in general metabolism and energy acquisition, including a peptidylprolyl isomerase (FkpA) involved in protein folding, large and small subunit ribosomal proteins, translation factors, RNA polymerases, and energy acquisition via oxidative phosphorylation, which were present at between 1-2.5% mean relative abundances. Several proteins involved in carbohydrate metabolism were also expressed, with proteins of the citrate cycle representing 4.5% of total mean relative abundance and two proteins involved in glycine metabolism also present. Besides these metabolic processes, proteins involved in membrane transport, such as the OmpF porin, TonB dependent receptor and the ExbB biopolymer transport protein, a carboxypeptidase regulatory-like domain, and a cation/acetate symporter, collectively represented a total of 3.6% mean relative abundance. Elongation factor Tu, which is integral to prokaryotic translation, and plays potential roles in cell-surface adhesion and virulence [51], and chemotaxis proteins (cheW, and, methyl-accepting chemotaxis protein) were also detected at mean relative abundances of 2.3 and 0.3%, respectively (Table S3). Interestingly, proteins associated with DNA damage/repair, including an excision nuclease, and oxidative stress response, such as thioredoxin and cold-shock protein, were also expressed (Table S1; Table S3; Fig. 6E).
A targeted approach to detect polymer degradation and pathogenicity
Targeted protein searches with public databases confirmed the presence and expression of two key plastisphere phenotypes indicated within the multi-omic data: polymer degradation and pathogenicity (virulence and antimicrobial resistance). Alignment of the metagenomic reads to PlasticDB revealed the presence of 5 polymer degradation genes, which included a laccase, depolymerases, an esterase, and dehydrogenase from a range of taxa present within the plastisphere (Table 2). Of these, 36% were annotated as the cold-adapted Psychrobacter sp. laccase, which mediates polyethylene degradation [52]. Annotation and relative quantification of the metaproteomes using PlasticDB identified 3 expressed proteins associated with polymer degradation from the Actinomycetia and Betaproteobacteria (Table 2). These included nylon-degrading polyamidase (Nocardia sp.) and hydrolase (Paenarthrobacter sp.), which represented ~ 60% of the PlasticDB quantified proteins, and a poly lactic acid depolymerase (Paucimonas lemoignei) (Table 2).
Table 2
Multi-omic investigation of polymer degradation within the plastisphere. Annotations and relative abundances are based on the number of aligned metagenomic reads, or relative quantification of proteins, as a proportion of the total identified using PlasticDB as a reference. PLA = poly lactic acid, PE = polyethylene, PHA = polyhydroxyalkanoate, PHB = polyhydroxybutyrate, P3HV/PHBV = poly(3-hydroxybutyrate-co-3-hydroxyvalerate).
Metagenome |
PlasticDB Enzyme | Substrate | PlasticDB Species | PlasticDB Phyla | Average ID (%) | Rel. ab. (%) |
Laccase | PE | Psychrobacter sp. | Proteobacteria | 80.3 | 36.4 |
Depolymerase | PLA | Marinobacter sp. | Proteobacteria | 79.8 | 18.2 |
Dehydrogenase | PHA, P3HV/PHBV | Paracoccus denitrificans | Proteobacteria | 80.8 | 18.2 |
Esterase | PLA | Uncultured bacterium | Uncultured bacterium | 83.3 | 18.2 |
Depolymerase | PLA | Pseudomonas putida | Proteobacteria | 78.7 | 9.1 |
Metaproteome |
PlasticDB Enzyme | Substrate | PlasticDB Taxonomy | Protein | Consensus Taxonomy | Rel. ab. (%) |
Depolymerase | PLA | Paucimonas lemoignei | Poly(3-hydroxyalkanoate) depolymerase C | Paucimonas lemoignei | 12.4 |
Hydrolase | Nylon | Paenarthrobacter ureafaciens | 6-aminohexanoate-dimer hydrolase | Paenarthrobacter sp. | 28.3 |
Polyamidase | Nylon | Nocardia farcinica | 6-aminohexanoate-cyclic-dimer hydrolase | Nocardia sp. | 30.9 |
Pathogenicity was explored through the targeted databases, VFDB and CARD. Alignment of metagenomic reads to VFDB resulted in 388.0 Kbp of sequence across 649 alignments. The most abundant virulence factors within the metagenome were associated with cell adherence and motility mechanisms, in Francisella tularensis, Pseudomonas aeruginosa, and Mycoplasma hyopneumoniae (Fig. S5). Besides the virulence factors identified using the MG-DB and 16S-TaxDB, protein identification using VFDB confirmed the expression of a Francisella sp. TufA protein and a B-type flagellin of Pseudomonas aeruginosa. Furthermore, antimicrobial resistance genes were identified through alignment of the metagenomic reads to CARD, resulting in the identification of 123 genes across 7,723 alignments. Taxonomically, these genes were associated with an array of pathogenic organisms present within the metagenome (Table S2), including Escherichia coli, Salmonella enterica serovar Typhimurium, and Mycobacterium sp. (Fig. 7A). Of the 123 CARD genes identified, 6 were designated as clinically relevant, including identification of Mycobacterium smegmatis tetracycline resistance (tetV), E. coli fluoroquinolone resistance (oqxB), and Enterobacter aerogenes erythromycin resistance (qepA). Use of the CARD protein variant model as a protein search database, identified 11 expressed proteins for relative quantification, including those conferring resistance to rifampicin, beta-lactam, and fluoroquinolones in the bacterial genera Escherichia, Helicobacter, Neisseria, and Mycoplasmoides (Fig. 7B). In addition to resistance to kirromycin, pulvomycin, and enacyloxin IIa, within the Enterobacteriaceae (Fig. 7B).