Novel hydrocarbon-degradation pathways in uncultured bacteria in industrial-impacted ocean waters

doi:10.21203/rs.3.rs-2060586/v1

Download PDF

Research Article

Novel hydrocarbon-degradation pathways in uncultured bacteria in industrial-impacted ocean waters

https://doi.org/10.21203/rs.3.rs-2060586/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background

Microbes play an active role in oil spill remediation, but little is known about the baseline hydrocarbon-degrading communities that exist before a spill occurs, or the diversity of metabolic mechanisms responsible for degradation. The Faroe Shetland Channel (FSC) is a region of the North Atlantic Ocean with prominent oil production and a diverse microbial community associated with the degradation of petroleum compounds. We characterized the baseline hydrocarbon-degrading communities of the FSC and identified potential novel molecular mechanisms for petrochemical degradation.

Results

We obtained 42 metagenome assembled genomes (MAGs) from bacteria actively utilizing a major compound in oil, n-hexadecane, via stable isotope probing (SIP) from the FSC. Phylogenomics revealed that they belong to 19 genera, including two not previously shown to degrade hydrocarbons: Lentibacter(Alphaproteobacteria) and Dokdonia(Bacteroidetes). Diversity surveys indicated Lentibacter were dominant members of the FSC, constituting up to 17% of these communities. 42% of the SIP-enriched MAGs encoded a complete alkane oxidation pathway containing alkane monooxygenase (AlkB), rubredoxin reductase (AlkT), and rubredoxin-2 (AlkG). However, 40% of the Alphaproteobacteria lacked AlkG for electron transfer in alkane hydroxylation. Instead, they encoded novel disulfide isomerases with iron-binding cysteine motifs conserved across rubredoxins. Dokdonialacked AlkT and AlkG, however, their central alkane-degradation catabolic pathways were complete.

Conclusion

This study describes new bacteria capable of hydrocarbon degradation including the dominant genera Lentibacter, along with novel putative hydrocarbon degradation enzymes. These bacteria may be continuously purging hydrocarbons released from industrial activities in the FSC. This study advances our understanding of the diversity and physiologies of alkane degradation in the North Atlantic and provides evidence of new mechanisms used to metabolize alkanes.

oil spills

stable isotope probing (SIP)

metagenomics

assembled genomes

hydrocarbon degradation

Oil spills at sea are one of the most harmful anthropogenic pollution events. Oil can spread for many miles in sea water and its impact on marine ecosystems far exceeds spills on land. For example, the Deepwater Horizon oil spill (April 20, 2010), resulted in the release of ca. 3.19 million barrels of crude oil into the Gulf of Mexico, which contaminated an area of 62,159 km² [1]. While oil-spill response measures may help to recover and clean up some spilled oil, the ultimate protagonists who contribute to this process are hydrocarbon-degrading bacteria. These microorganisms are ubiquitous in the world’s oceans and are enriched following an oil spill [2, 3]. They underpin bioremediation and are critical to restoring oil-impacted ecosystems to their natural state.

Extensive research has focused on aerobic hydrocarbon degradation pathways in cultured bacteria. In the canonical alkane degradation pathway (alkBGTHJK), octane is first oxidized to alcohol by a complex of three enzymes: alkane monooxygenase (AlkB), rubredoxin reductase (AlkT), and rubredoxin-2 (AlkG). Then these products are converted to fatty acids and channeled into beta-oxidation by alcohol dehydrogenase (AlkJ), aldehyde dehydrogenase (AlkH), and medium-chain-fatty-acid-CoA (AlkK) [4–11]. The key enzyme AlkB is a non-heme diiron integral membrane protein and is a marker for bacteria capable of growing on alkanes as their sole source of carbon and energy [12]. In Pseudomonas putida GPo1, the alkBGTHJK genes are found on the octane (OCT) plasmid [14, 15]. In other hydrocarbon-degrading organisms these genes were identified within the genome as opposed to on a plasmid, and with varying organizations [13]. Multiple copies of alkB have also been observed within a genome, as well as alkB fused with rubredoxin domains [14]. This suggests that there are alternate genetic mechanisms for hydrocarbon degradation yet to be described.

In natural environments hydrocarbon-degrading organisms exist as part of a complex microbial community. Most studies of oil-degrading organisms are undertaken after major spill events and employ single-gene diversity surveys [15, 16]. These studies lack genome-level characterization of bacteria actively utilizing oil compounds like n-hexadecane, a major component of crude oil and many of its refined petrochemical liquid fuels. Stable isotope probing (SIP) is an approach used to obtain DNA of microbial populations that uptake substrates of interest [17]. For example, isotopically-labeled compounds, like ¹³C-n-hexadecane can be used to enrich hydrocarbon-degraders from natural communities [18]. The ¹³C-enriched DNA can then be used to reconstruct genomes using metagenomics (SIP-metagenomics). This approach has been used to characterize the physiologies of bacteria in the Deepwater Horizon spill [19]. However, pre-spill baseline communities have not been studied in this manner.

Here, we investigate the active hydrocarbon-degrading bacteria from the Faroe-Shetland Channel (FSC), an area of the northeast Atlantic Ocean that has over a 20-year history of oil exploration and production. Recently, new potential hydrocarbon-degrading bacteria were described in these waters [20]. To understand the metabolic mechanisms of baseline FSC communities, we used SIP-metagenomics. From these enrichments, we obtained 42 metagenome-assembled genomes (MAGs), including two bacterial lineages for which the hydrocarbon-degradation pathway was not previously described.

Field sampling

During a research cruise on the MRV Scotia in the spring of April 24 to May 9 of 2014, water samples were collected from location FIM6a (60° 38’N, 4° 54’W) at depths 5 m and 700 m. This sampling site lies on the Fair Isle-Munken line [21] near the Foinaven oil field development area, approximately 3 and 9.3 nautical miles from the Petrojarl Foinaven and Glen Lion production facilities, respectively. Collection of seawater samples (3 L volumes) was performed using 10 L Niskin water bottles mounted on a CTD (conductivity, temperature, depth) carousel, based on the sampling procedures of MRV Scotia [21]. CTD casts confirmed these samples at these water depths were from two specific water masses respectively, the Modified North Atlantic Water (MNAW) and the Norwegian Sea Arctic Intermediate Water (NSAIW) masses [22]. Immediately after recovery, some of the collected seawater was used to rinse, at least three times, two Nalgene carboys (10L each; acid-washed, acetone-rinsed, and dried) prior to filling, and immediately stored at 10°C onboard the vessel until return to the laboratory at Heriot-Watt University for immediate use within SIP experiments.

SIP incubations

Prior to the preparation of the SIP incubations, each of the two seawater samples (collected at 5 m and 700 m depths) were processed to remove, as far as possible, dissolved organic carbon/matter that could potentially act as an alternative carbon source and potentially redirect microbial activity away from the labeled substrates during SIP. For this, 800 ml of each sample was filtered through 0.22 μm MCE filters (47 mm diameter; Millipore Sigma). The bacterial biomass collected on the filters was washed with a few milliliters of sterile synthetic seawater medium ONR7a [23], and then the biomass was re-suspended in sterile 40 ml of the ONR7a to act as the inoculum for SIP.

SIP incubations were performed using 125-ml sterilized glass screw-top Erlenmeyer flasks with caps that were lined with aluminum foil to prevent sorption of hydrocarbons. ONR7a medium was used in these incubations because, as explained above, we wanted to prevent the introduction of exogenous and potentially bioavailable sources of carbon. For incubations utilizing either the surface 5m depth water inoculum or the deep water (700 m depth) inoculum, each flask contained 16 ml of ONR7a medium, 1 mg of labeled (¹³C) and/or unlabeled n-hexadecane and 4 ml of inoculum. Uniformly [U-¹³C] and unlabeled (¹²C) n-hexadecane of >99% purity was obtained from Sigma-Aldrich. Each of the two inocula, from the 5 m and 700 m depth, were used to set up SIP experiments with both hydrocarbon substrates. For SIP, duplicate flasks were prepared with 1 mg of U-¹³C-labeled n-hexadecane, and a second set of duplicates was prepared with 1 mg of the respective unlabeled hydrocarbon. An additional set of triplicate flasks was prepared containing unlabeled n-hexadecane to monitor its disappearance by gas chromatography–mass spectrometry (GC–MS). Samples were periodically taken from these flasks for DNA extraction and subsequent measurement by quantitative PCR (described below) to determine the abundance of target organisms identified through SIP. For each SIP experiment, an additional set of triplicate flasks was prepared to act as acid-killed controls (pH<1) containing unlabeled hydrocarbons and amended with 750 μl of 85% phosphoric acid. All flasks were incubated on an orbital shaker (150 rpm) in the dark at 21°C. The endpoint when each SIP experiment was terminated was determined by tracking the disappearance of the hydrocarbon in the triplicate unlabeled flasks by GC-MS. At this point, whole DNA from the total volume in the paired flasks amended with the [U-¹³C] hydrocarbon and the corresponding paired set with unlabeled hydrocarbons was extracted using a standard protocol [24].

DNA gradient ultracentrifugation and identification of labeled 16S rRNA genes

Isopycnic ultracentrifugation of DNA from each of the SIP experiments utilizing the 5m or 700m seawater samples amended with ¹³C-labeled n-hexadecane resulted in the visual separation of two bands (~1 cm apart from each other) that were in the lower half of the polyallomer tubes and which was consistent with the expected location of the ‘heavy’ and ‘light’ DNA bands. With the respective acid-killed controls, it later became apparent that instead of concentrated acid, a very dilute concentration had been added to these control incubations and which was apparent from the degradation data (Fig. 1). This was insufficient to completely suppress microbial activity which explains why some degradation of the hydrocarbon was observed in these controls. Nonetheless, this did not affect our assessment in monitoring degradation of the n-hexadecane in the “live” incubations and did not preclude our selection of the endpoint for when to terminate these experiments for extraction of DNA from the ¹³C incubations. We selected an endpoint of 5 days, at which point DNA extractions were performed on each of the duplicate ¹³C incubations for subsequent isopycnic ultracentrifugation to isolate the ¹³C-enriched ‘heavy’ DNA for analysis. Subsequent denaturing gradient gel electrophoresis (DGGE) analysis of the fractions derived from the labeled incubations performed for each of the two water samples showed clear evidence of isotopic enrichment of DNA. This is evident from the observed clear separation of, and different banding patterns between, the ¹³C-enriched and unenriched DNA fractions by DGGE (Suppl. Fig. S1, Additional File 1) and by the distribution of qPCR-quantified 16S rRNA gene sequences (Suppl. Fig. S2, Additional File 2). Combined fractions containing ¹³C-enriched DNA from each of these ¹³C incubations were used to construct 16S rRNA gene clone libraries. Fractions from the duplicate ultracentrifuge tube from each of the ¹³C incubations were similarly analyzed to confirm our results (data not shown).

Caesium chloride (CsCl) gradient ultracentrifugation and identification of ¹³C-enriched DNA

Total extracted DNA from each of the duplicate unlabeled and ¹³C-labeled incubations was added to caesium chloride (CsCl) solutions (1.68 g/ml) and the ¹³C-enriched and unenriched DNA separated by isopycnic ultracentrifugation and gradient fractionation, following the method of [25] with the following modifications. About 3 ml of mineral oil was added to the top of each CsCl solution until the polyallomer tubes were almost full prior to then heat-sealing them. The tubes were then ultra-centrifuged for 40 hours using a fixed-angle rotor 70.1Ti (Beckman Coulter) at 187,000 ×g at 20°C with a BC Optima L-100 XP ultracentrifuge (Beckman Coulter).

Following isopycnic ultracentrifugation, denaturing gradient gel electrophoresis (DGGE) was performed on each fraction from the SIP tubes to visualize and confirm the separation of DNA. For this, amplification of each fraction was carried out using PCR as described by [26] with bacterial primers 341f (5’-CCTACGGGAGGCAGCAG-3’) and 534r (5’-ATTACCGCGGCTGCTGG-3’), the forward of which contained a 40 nucleotide GC clamp (5’-CGCCCGCCGCGCGCGGCGGGCGGGGCGGGGGCACGGGGGG-3’) [27]. PCR products were confirmed on a 1.5% (w/v) agarose gel alongside a HindIII DNA ladder (Invitrogen, Carlsbad, CA, USA). DGGE was performed using 6.5% acrylamide gels containing a denaturant range of 30–70% (100% denaturant contains 7.0 M urea and 40% molecular-grade formamide). After electrophoresis for 16 h at 60ºC and 60 V, gels were stained with ethidium bromide (1:25,000 dilution) for 15 min, and then imaged with a InGenius3 gel imaging system (Syngene) and accompanying software to crop the gel images to only the regions displaying bands. The ¹³C-enriched heavy DNA fractions were selected based on the DGGE evidence, which is discussed below.

16S rRNA gene libraries of ¹³C-enriched DNA

16S rRNA clone libraries, each comprising 48 clones, were prepared from combined fractions containing the ¹³C-enriched DNA from each of the two SIP experiments using general bacterial primers 27f and 1492r [27]. Cloning of PCR products was performed using the TOPO-TA cloning kit for sequencing (ThermoFisher Scientific). Clones were partially sequenced by GeneWiz (UK) using primer 27f. After excluding vector sequences, poor-quality reads and chimeras, the clone sequences were grouped into operational taxonomic units (OTUs) based on applying a 97% sequence identity cut-off. A representative clone sequence was selected from each dominant OTU identified in each of the libraries and used to obtain a near-complete 16S rRNA gene sequence. Sequences were edited and assembled using Consed/Phred/Phrap [28]. BLASTn searches and RDP-II were used to check for close relatives and phylogenetic affiliation. The near-complete 16S rRNA gene sequence (>1400 bp) representing each of the major SIP-identified OTUs were compared with related GenBank sequences (Suppl. Fig. S3, Additional File 3).

Primers for qPCR targeting the 16S rRNA genes of these major SIP-identified OTUs were developed (Table 1) and used to determine the abundance of each group over time during incubation with the n-hexadecane. During incubations of the 5m water sample with unlabeled n-hexadecane, the 16S rRNA gene copy number for the Thalassolitus OTU-2.6, Lentibacter OTU-1.1, Oleibacter OTU-2.4 and Alcanivorax OTU-2.14 increased, respectively, by approximately 9, 16, 17 and 18 orders of magnitude after 5 days (Fig. 2A). Similarly, during incubations of the 700m water sample with unlabeled n-hexadecane, the 16S rRNA gene copy number for the Dokdonia OTU-4.12 (6 orders), Glaciecola OTU-3.32 (10 orders), Phaeobacter OTU-4.3 (12 orders), Marinobacter OTU-3.15 (13 orders), Alcanivorax OTU-4.22 (18 orders) and Oleibacter OTU-3.27 (20 orders) increased, respectively, by approximately 6, 10, 12, 13, 18 and 20 orders of magnitude after 5 days (Fig. 2B). An increase in the 16S rRNA gene copy numbers of these OTUs in these two SIP experiments provides further confirmation of their enrichment on the n-hexadecane as growth substrate. These increases also coincided with an increase in the total concentration of DNA (Fig. 2) as a proxy for cell growth. This coupled with the disappearance (biodegradation) of the hydrocarbon and the appearance of the 16S rRNA genes of these organisms in the most heavily ¹³C-enriched DNA fractions suggests that these organisms performed a primary role in the degradation of the n-hexadecane.

Real-time quantitative PCR

To quantify sequences in the dominant OTUs, primers for real-time quantitative PCR (qPCR) were developed using AliView [29] and the NCBI Primer Blast online tool [30]. Primer specificity was confirmed with the NCBI Primer BLAST tool. The optimal annealing temperature of each primer pair was determined using an Applied Biosystems (Foster City, CA. USA) Mastercycler gradient thermal cycler. The template for these reactions, and for the construction of respective standard curves for quantitative PCR, was a plasmid containing a representative sequence that had been linearized using an appropriate restriction endonuclease and purified using the QIAquick nucleotide removal kit (Qiagen, Valencia, CA, USA).

Purified DNA from time-series incubations with unlabeled hydrocarbon was quantified using a NanoDrop ND-3300 fluorospectrometer (Thermo Scientific) and the Quant-iT Picogreen double-stranded DNA (dsDNA) kit (Invitrogen). From each of the two SIP experiments, just one replicate of the duplicate ¹²C- and ¹³C-labeled incubations was selected for further analysis based on fractions from it containing the highest total amount of DNA. SIP-identified sequences were quantified in each separated SIP fraction by qPCR. Single reactions were performed on each triplicate DNA extraction (from triplicate samples) from the time series containing unlabelled hydrocarbons.

Phylogenetic tree of the ¹³C-enriched community

The 16S rRNA genes of the SIP-identified sequences were assembled by using the program Sequencher 5.3 (GeneCodes Corp., Ann Arbor, MI). The consensus sequences were submitted to GenBank and checked for close relatives and phylogenetic affiliation using BLASTn. Search results were used as a guide for tree construction, and additional related 16S rRNA sequences identified from the BLASTn search were retrieved from GenBank. The software package MEGAX (version 10.2.4) was used to align the sequences using MUSCLE and to construct a neighbor-joining tree with Jukes–Cantor correction. The tree was bootstrapped 1000 times and gaps in the alignment were ignored. Roseibacillus ishigakijimensis strain MN1-741 (NR041621), Verrucomicrobium spinosum strain DSM 4136 (NR026266) and Haloferula chungangensis strain CAU 1074 (NR109435) were used as an outgroup.

Metagenomic sequencing and assembly of ¹³C-enriched DNA from SIP

Illumina library preparation, sequencing and assembly of four samples were completed by the Joint Genome Institute (JGI [31, 32]; data are available under project IDs 3300039448, 3300039456, 3300039449, and 3300040958). The four samples represent the heavy (¹³C-enriched) DNA from each of the duplicate SIP incubations using the seawater inoculum collected at 5 m and 700 m depths. Paired-end sequencing was performed on an Illumina NovaSeq 6000 platform with an average insert size of 241 and fragments of 300 bps. Raw reads were quality filtered following BBtools pipeline [33] and assembled with metaSPAdes version 3.14.1 [34]; Suppl. Table 1,2, Additional File 9,10). Coverage information was obtained by mapping all high-quality reads of each sample against the assembly using the BWA-MEM algorithm in paired-end mode (bwa-0.7.12-r1034[35]).

Genome binning

Assembled metagenomic data (contigs >2000 bp) was binned using MetaBAT v2.12.1 [36] and CONCOCT v1.1.0 [37], and resulting MAGs were combined using DAS Tool v1.1.2 [38]. First, each of the mapping files were summarized using jgi_summarize_bam_contig_depths and then MetaBAT was run using the following settings: --minCVSum 0 --saveCls -d -v --minCV 0.1 -m 2000 and CONCOCT as follows: --clusters 400 --kmer_length 4 --length_threshold 3000 --seed 4 --iterations 500. For each of the two binning tools, a scaffold-to-bin list was prepared, and the DAS Tool run on each of the eight scaffold files as follows: DAS_Tool -i Concoct.scaffolds.tsv, Metabat.scaffolds.tsv -l concoct,metabat -c assembly.contigs.fasta –debug -t –write_bins 1 –search_engine blast. The accuracy of all the MAGs was evaluated by calculating the percentage of completeness and gene duplication using CheckM lineage_wf v1.0.5[39] (Suppl. Table 3, Additional File 11). All MAGs were greater than or equal to 50% of completeness and <10% gene duplications (according to checkM). MAGs can be found at NCBI within Bioproject PRJNA816150. MAG relative abundance was calculated as previously described in De Anda et al., 2021 using the bin_abundance.py script from MetaGaia (https://github.com/valdeanda/MetaGaia) (Suppl. Table 4, Additional File 12).

Phylogenetic reconstruction and taxonomy

GTDB-Tk classify_wf [40] was used for preliminary taxonomic identification of the 42 individual genomes (Suppl. Table 5, Additional File 13), and then 37 conserved marker proteins were extracted using phylosift [41]. We used the 30S ribosomal protein S2 protein of the 37 marker proteins identified to perform a BLASTp search against the RefSeq database, to obtain the closest publicly available genomes. Based on GTDB-tk v1.5.0 results of predicted taxonomy we also downloaded from RefSeq 30 genomes from Actinobacteria used as an outgroup. For these reference genomes, we also extracted the 37 marker genes using phylosift and added them for analyses. An alignment of the extracted assembled MAGs and reference genomes were generated using MAFFT [42] as follows: –globalpair –maxiterate 16 –reorder. The alignment was trimmed using trimAL [43] -automated1. The phylogeny was constructed with RAxML[44] as follows: raxmlHPC-PTHREADS-AVX -f a -m PROTGAMMAAUTO -N autoMRE -p 12345 -x 12345.

Metabolism reconstruction

Gene prediction for individual MAGs was performed using Prodigal v2.6.3 [45]. Predicted genes of individual MAGs were further characterized using several databases: KofamScan [46], and InterProScan v5.31-70.0 [47]. Searches over these databases were performed using default parameters. For KofamKOALA, only hits above the predefined threshold for individual KOs were selected. We then implemented, rbims https://github.com/mirnavazquez/RbiMs, an open software that reads, evaluates and visualizes the annotation profile output derived from KofamScan. We used the function “read_ko” to calculate the abundance of each KO within each MAG and the function “mapping_ko” to link each KO to other KEGG database features, and we also linked each KO to the rbims database, which includes a manually curated definition of the aerobic hexadecane degradation pathway [19](Suppl. Table 6, Additional File 14).

Phylogenetic reconstruction of Alk proteins

The metagenome entropy-based score MEBS [48] was used to search the Pfams associated to each of the alk genes in the assembled MAGs; alkB (PF00487), alkT (PF07992, PF18113), alkG (PF00301), alkJ (PF00732, PF05199), and alkH (PF00171). We performed a BLASTp search against the non-redundant database from NCBI and queried all the proteins previously identified for each gene and used the first hits as references for phylogenetic reconstructions. We also downloaded from the UniProt database (The UniProt Consortium) the sequences from well-characterized AlkB proteins: Q0VTH3, Q0VKZ3, O31250 and P12691; AlkG: Q9HTK7, Q9HTK8, P00272, Q9WWW4 and Q0VKZ2; AlkT: P17052, P42454, Q0VTB0, Q9HTK9, and Q9L4M8; AlkJ: Q00593, and Q9WWW2; AlkH: P12693. The coding sequences that contained the Pfam and the references were concatenated and aligned using MAFFT [42] as follows: –globalpair –maxiterate 16 –reorder, and phylogeny was generated using iqtree [49] with the following parameters: iqtree -alrt 1000 -bb 1000 -bnni (Suppl. Fig. 4-7, Additional File 4-7)).

The genomic neighborhood rubredoxin and alkane monooxygenase system and alkG-like alignment

We used the operon mapper web server [50] to identify the operons in the assembled MAGs. We focused on the operons where AlkB was present. We extracted the sequences that belonged to the COG1194 found next to the AlkB based on the operon mapper analysis. We also downloaded from UniProt (The UniProt Consortium) and NCBI well-characterized AlkG proteins: P00271, Q9WWW4, Q0VKZ2, WP_138436252.1, WP_161463810.1, WP_089423380.1, WP_084394766.1, WP_015486580.1, Q9HTK8, and Q9HTK7. The reference sequences and the COG1194 sequences were aligned using MAFFT [42] (–globalpair –maxiterate 16 –reorder).

SIP enrichment experiments

To characterize bacteria actively utilizing hydrocarbons in the FSC, we performed SIP enrichments using ¹³C n-hexadecane on water collected at the surface (5m) and subsurface (700m) (Fig. 1). These two depths differ in water sources. At 5m, the warmer currents from the North Atlantic Water (NAW) and the Modified North Atlantic Water (MNAW) are prominent. At 700m, the most prominent current comes from the cold Norwegian Sea Arctic Intermediate Water (NSAW) [14]. Over 90% of the ¹³C n-hexadecane was degraded in the SIP enrichments at both depths by day 3, with complete degradation occurring after that, which coincided with an observed increase in the turbidity of the cultures. We extracted DNA on day 5 of the SIP enrichment experiment (Fig. 1), as experience with this and other hydrocarbon compounds has informed us that 1-2 days after complete degradation is adequate for sufficient incorporation of the ¹³C-label for SIP.

Microbial diversity and abundance of clone libraries

To determine the taxonomy of the members of the SIP-enriched community we sequenced clone libraries to obtain full length 16S rRNA gene sequences (>1400 bp; Table 1). We found that a small number of OTUs comprised most of the SIP-enriched communities: 4 OTUs comprised 77.6% of the 5 m community and 6 OTUs comprised 61.7 % of the 700 m community (Table 1). All other OTUs at each depth represented < 5% of total sequences and were not further analyzed. The abundant OTUs were distributed among three phyla: Gammaproteobacteria, Alphaproteobacteria, and Bacteriodetes. Gammaproteobacteria was comprised of 5 genera: Alcalinivorax, Thalassolituus, Oleibacter, Marinobacter and Glaciecola; Alphaproteobacteria was comprised of 2 genera: Lentibacter and Phaeobacter; and Bacteriodetes was comprised of genus Dokdonia (Table 1).

To determine the abundance of the members of the enriched community, we performed qPCR experiments (Fig. 2). qPCR showed that at 5 m depth, Alcanivorax OTU-2.14 was the most abundant genus during all three days of the experiment. At 700 m depth, Dokdonia OTU-4.12 was the most abundant during day one, and Alcanivorax OTU-4.22 was the most abundant for the remaining two days (Fig 2). This suggests that Alcanivorax is key to the alkane degradation process, but that distinct Alcanivorax OTUs carry out alkane degradation at different depths.

Microbial diversity and abundance of MAGs

To explore the genomic diversity and metabolic pathways of the hydrocarbon-degrading bacteria from the FSC, we de novo assembled four metagenomes from the 5 m and 700 m SIP enrichments (two from each depth totaling ~400 Gb) and reconstructed 42 metagenome-assembled genomes (MAGs) (Fig. 3). We obtained 24 MAGs from 5 m and 18 MAGs from 700 m with a size range from 1.76 to 4.88 Mb, average completeness 94%, lowest completeness 51%, and a maximum gene redundancy of 10%. Phylogenomic analysis of 37 concatenated marker proteins (Fig. 3) revealed that these genomes belong to Bacteroidetes, Alphaproteobacteria, and Gammaproteobacteria, which is consistent with our 16S rRNA gene phylogenies of the SIP enrichments (Suppl. Fig. 1). However, 21 of our reconstructed MAGs were distributed in 11 genera not recovered by the 16S rRNA gene clone libraries (stars in Fig. 3). This suggests that approaches reliant on the 16S rRNA gene surveys may overlook important hydrocarbon-degrading bacteria.

Of the recovered MAGs, representatives of Alcanivorax, Glaciecola, Marinobacter, Oleibacter, and Pseudophaeobacter were obtained from both depths (blue dots in Fig. 3). At 5 m we additionally recovered MAGs belonging to Flavobacterium, Henricella, Hyphomonas, Celeribacter, Planktomarina, Lentibacter, Teteyamaria, and Pseudomonadales, while at 700 m we additionally recovered MAGs belonging to Alteromonas, Olleya, Dokdonia, Paracoccus, and Sulfitobacter. Of note, these are the first assembled genomes of Dokdonia and Lentibacter obtained from hydrocarbon SIP enrichment experiments.

Abundance estimations based on genomic coverage of the MAGs indicated that the most abundant genera at 5 m were Alcanivorax, Thalassolitus, Oleibacter, and Lentibacter (bar plots in Fig. 3). This is consistent with the qPCR 16S rRNA analysis performed in parallel (Fig. 2). Interestingly, the most dominant MAG in the SIP enrichment from 700 m -Oleibacter- has not been previously reported in FSC waters [20], suggesting that is likely present at low abundance in the baseline communities. Lentibacter is a predominant genus in the baseline FSC water column [20, 51–53] and becomes enriched in the presence of crude oil [54, 55], however, alkane degradation pathways have not been previously confirmed. To confirm the alkane degradation capabilities of these bacteria we looked for genes that compose the canonical alkane degradation pathway.

Distribution of Alk enzymes in FSC bacteria

To understand the metabolic pathways involved in hydrocarbon utilization, we searched for genes predicted to encode proteins involved in aerobic alkane utilization. We searched our SIP enrichment MAGs for homologs of AlkB and performed a phylogenetic reconstruction of these proteins (Fig. 4). This revealed six distinct phylogenetic clusters that we named clades I-VI. All Gammaproteobacteria and Alphaproteobacteria had multiple copies of the AlkB. The Gammaproteobacteria Alcanivorax had two copies (Clade IV and VI), the Alphaproteobacteria Lentibacter, Teteyamaria, and Celeribacter had two copies (Clade I and III), and the Gammaproteobacteria Thalassolituus and Oleibacter had three copies (Clade II, IV, and VI). The presence of multiple copies of AlkB suggests high metabolic potential for alkane degradation.

In order to incorporate the alkane molecule, AlkB requires rubredoxin (AlkG) and rubredoxin-reductase (AlkT) [56], therefore, we also searched the MAGs for genes encoding these proteins (outer blue rings in Fig. 4). Sixteen Gammaproteobacteria MAGs had genes predicted to encode AlkGT (Clade II, IV, and VI), suggesting that they perform alkane degradation using the canonical pathway. Seven MAGs belonging to the Gammaproteobacteria Alteromonas, the Bacteriodetes Flavobacterium and Olleya, and the Alphaproteobacteria Sulfitobacter all lacked the AlkBGT, in addition to also lacking AlkJK. Two Alphaproteobacteria Henriciella MAGs also lacked AlkBGT. These bacteria were some of the lowest in abundance in the SIP enrichments (Fig. 2), suggesting they lack the potential for hydrocarbon degradation.

However, some MAGs with AlkB were abundant in the SIP enrichment but lacked either AlkG or AlkT, suggesting that they may degrade alkanes with a novel pathway. Out of the Gammaproteobacteria MAGs, three Glaciecola MAGs in clade II lacked AlkT. The Alphaproteobacteria MAGs lacked AlkG except for Celeribacter MAGs (Clades I and III), and the Bacteroidetes Dokdonia MAGs lacked both AlkG and AlkT (Clade V).

Finally, the aerobic alkane degradation pathway is comprised of multiple enzymes, AlkBGTHJK. The key enzyme AlkB was present in 35 of the 42 MAGs. Within these 35 MAGs, we identified the enzymes AlkJHK in most of the MAGs (Fig.5). Dokdonia has all the enzymes except for AlkGT. However, the high abundance of this bacteria in the qPCR analysis on day one supports that it is an active degrader (Fig. 2). The absence of certain genes in these bacteria could be due to incompleteness of these MAGs, or the degradation of alkane in these communities may be a cooperative process, as previously proposed [19].

Fusion domains and putative AlkG in Alphaproteobacteria.

We identified two AlkB from Alcanivorax in clade VI with a fusion rubredoxin domain (stars in Fig. 4). This suggests that these genes encode a protein with a transmembrane domain that in its cytoplasmic site already contains the AlkG rubredoxin domain (PF00301). Similarly, we found that Celeribacter AlkG sequences (SIP_5_Bin0_scaffold_9_c1_22 and SIP2_5_Bin10_scaffold_4_c1_43) have the AlkG and AlkT domains fused (PF00301 and PF07992), which suggests that they are new proteins that may be able to interact with AlkB without AlkT (Suppl. Fig. S5, Additional File 5).

The FSC Lentibacter MAGs also had two copies of AlkB and lacked AlkG, but they did not have any evidence of fused domains like the Alcanivorax and Celeribacter. Therefore, we searched for other potential electron carriers in these genomes. We examined the genes surrounding alkB in all the clade III Alphaproteobacteria, which includes Lentibacter. We found that there are some genes with shared functional annotations in that genomic region (Fig. 6). We searched the proteins encoded by these genes for the conserved AlkG motif (Cys-X-X-Cys-Gly), which is the motif that interacts with AlkB [57]. We identified this motif in a protein present in all 15 MAGs in Clade III annotated as a disulfide isomerase (COG1651; Suppl. Fig. S8, Additional File 8). This suggests that it could be involved in electron transfer in place of rubredoxin reductase in these bacteria.

Alternative energy metabolisms and adaptations to hydrocarbon degradation

Since the genomes of these bacteria were obtained from waters not associated with a documented occurrence of an oil spill, it is likely that they are versatile and able to do other processes to obtain energy in the absence of alkanes. Also, it is possible that the degradation of hydrocarbons may be enhanced by other cellular processes. Therefore, we searched for other energy generating processes in the MAGs. We identified denitrification genes (napAB; nitrate reductase, norBC; nitric oxide reductase, nosZ; nitrous-oxide reductase) in the Dokdonia genomes. The Alphaproteobacteria genomes contained genes that are predicted to encode the Sox enzyme complex involved in sulfur oxidation [58]. The presence of key sulfur and nitrogen genes suggests that these bacteria are metabolically versatile and have other mechanisms for energy generation in the absence of alkanes. Furthermore, we identified genes related to flagella biosynthesis, secretion systems, and chemotaxis, primarily present in Gammaproteobacteria, indicating a potential community response towards n-hexadecane as previously observed for Marinobacter [59] (Fig. 7).

Here we examined the microbial diversity and genetic mechanisms of hydrocarbon-degrading bacteria from baseline water column microbial communities in a region with an active oil industry presence, but no obvious oil spillage. The alkane-degrading microbial communities from our enrichments varied by depth. Based on 16S rRNA data and MAG abundances, Alcanivorax dominated the shallow waters, while Oleibacter was the most abundant at 700 m. Similarly, a recent study analyzing the microbial communities in the FSC over a period of two years showed Alcanivorax were significantly increased in abundance in the upper water column (< 175 m), and this was more prominent during the spring compared to the fall (from undetectable to 0.1% relative abundance) [20]. The microbial community reconstructed from 5 m is more diverse than the deeper community. However, at 700 m we recovered two Dokdonia MAGs, which were dominant at the beginning of the SIP enrichment. Dokdonia species have been previously described in hydrocarbon-contaminated areas [60], but this is the first time members of this genus capable of degrading hydrocarbons have been identified and had their genetic oil-degradation mechanisms described. Likewise, Lentibacter is a predominant lineage in the FSC water column [20, 51–53] and has been shown to become enriched in the presence of crude oil [54, 55]. Here we found they were predominant at the 5 m SIP enrichments, comprising 8% of the metagenome, and we found evidence of their direct involvement in oil degradation. In other diversity surveys of pyrene-enrichments of sediment samples collected at 500 m and 1000 m depth from the FSC, representatives of Glaciecola were identified, but they were not shown capable of hydrocarbon degradation [61]. We showed that Glaciecola has all the enzymes needed for alkane degradation except for AlkT. Moreover, they represented 5% of the metagenome and were found in the SIP enrichments at 5 m and 700 m, suggesting that they are capable of hydrocarbon degradation. Other genera like Alteromonas and Sulfitobacter were also previously found in the FSC during the spring [20], however, we could not reconstruct the complete alkane degradation pathway.

Previous hexadecane SIP enrichment experiments from the Deepwater Horizon Spill recovered MAGs affiliated to Marinobacter, a generalist hydrocarbon-degrader [19]. Marinobacter has been found throughout the water column of the FSC, with up to 0.2% relative abundance [20, 62]. We recovered 4 MAGs from these genera evenly distributed among depths but more abundant at 700 m. However, in this study, Alcanivorax and Oleibacter were the most abundant lineages, suggesting that the distribution of the hydrocarbon-degrading bacteria is not homogeneous in the oceans. The MAGs belonging to Flavobacterium, Henriciella, Alteromonas, Olleya, and Sulfitobacter were the least abundant and the ones where we did not recover any AlkB enzyme. However, these lineages have been found in oil-impacted ecosystems or implicated in oil degradation [63–66]. Flavobacterium, among other methylotrophic organisms, is dominant in the oil-impacted ecosystems after other bacteria like Oceanospirillales perform the first degradation process [16]. Furthermore, Sulfitobacter, Alteromonas, and some Flavobacterium have been implicated in aromatic hydrocarbon degradation [16, 67, 68], suggesting that these organisms may be involved in hydrocarbon degradation, either indirectly or in concert with other degraders, hence supporting the idea of a community reliant response [19].

The alkane degradation pathway includes the AlkBGTHJK enzymes, and AlkB is often used as a biomarker to explore the diversity of bacteria capable of degrading hydrocarbons using this metabolism. We found six clades of AlkB distributed among Gammaproteobacteria, Alphaproteobacteria, and Bacteroidetes. Except for Dokdonia, most MAGs showed more than one version of AlkB. The origin of these multiple copies could be gene duplication or acquisition via HGT [69, 70]. The AlkB has a history of HGT, for instance the OCT plasmid of P. putida was obtained from Alcanivorax and has been found distributed in different lineages [14, 71]. We believe that the most likely explanation for the multiple copies of the AlkB in our MAGs is HGT because the clades are composed of genomes from multiple genera. Gene duplication has been described before for AlkB, mainly associated with multidomain versions and the capacity to use large alkanes of chain-length from C14 to C30 [14][72][73]. In our experiment, we recovered a fusion version of AlkB with rubredoxins in two Alcanivorax MAGs that cluster among other Alcanivorax jadensis. The diversity within AlkB suggests that there must be other electron donors than AlkGT. We identified a new protein that could interact with AlkB in Alphaproteobacteria; this protein shares the motif of AlkG that interacts with AlkB. In Celeribacter MAGs, we also found a new protein with the fusion domains of AlkGT, suggesting a new possibility of interaction with AlkB in Alphaproteobacteria.

We also found other genes related to the adaptation of these bacteria towards hydrocarbon degradation. For example, in all Gammaproteobacteria MAGs, we found genes related to motility, biofilm formation and chemotaxis. Hydrocarbons are weakly soluble in water and, consequently, poorly available for assimilation by heterotrophic bacteria, therefore, biofilms have been proposed as a strategy that favors the access to hydrocarbon droplets [59, 74, 75]. Chemotaxis might also result in an advantage as it can promote movement towards the hydrocarbon molecules when the concentration of the chemical is low, or even scape when toxic hydrocarbon molecules are around [76]. For instance, the type VI secretion system and genes related to chemotaxis have been observed to be produced by Marinobacter when exposed to hexadecane [77].

Our results show that hydrocarbon-degrading bacteria might be involved in the nitrogen and sulfur cycle in the FSC. In Dokdonia MAGs, we found genes associated with denitrification. Denitrification is associated with oxygen deficient zones (ODZ), where some organisms can use alternative respiratory pathways when O2 is depleted [78]. Dokdonia was only identified at 700 m, and one possibility is that the presence of these genes indicates that there are probably ODZs in the FSC.

Here we examined the diversity and metabolic potential of the Arctic and Subarctic communities of the FSC. Our SIP enrichments were dominated by bacteria commonly associated with oil degradation as well as novel genotypes with distinct alkane degradation strategies. These include several uncultured Alphaproteobacteria lineages related to Hyphomonas, Celeribacter, Lentibacter, Teteyamarina, and Pseudophaeobacter, which are capable of alkane degradation in the absence of rubredoxins, essential components for electron transfer. Many of these bacteria code for an uncharacterized protein containing a conserved site with confirmed AlkG rubredoxins, suggesting a previously undescribed mechanisms for alkane hydroxylation in oil-degrading bacteria. The predominance of Lentibacter in the water column and our enrichments suggests they play an active and potentially continuous role in purging the FSC waters of hydrocarbons. We also demonstrated that Dokdonia assimilate and have pathways for hexadecane utilization in the absence of rubredoxins. This study highlights that the oceans' metabolic capability and diversity of hydrocarbon cycling is encoded by a network of genes and incomplete pathways spread across a complex bacterial community that, via their collective whole, would be capable of coordinating the complete biodegradation of aliphatic and aromatic hydrocarbons in the event of an oil spill.

FSC - Faroe Shetland Channel

MAGs - Metagenome assembled genomes

SIP - Stable isotope probing

AlkB - Alkane monooxygenase

AlkT - Rubredoxin reductase

AlkG - Rubredoxin-2

AlkJ - Alcohol dehydrogenase (AlkJ)

AlkH - Aldehyde dehydrogenase (AlkH)

AlkK - Medium-chain-fatty-acid-CoA

OCT - The octane plasmid

NAW - North Atlantic Water

MNAW - Modified North Atlantic Water

NSAIW - Norwegian Sea Arctic Intermediate Water

GC–MS - Gas chromatography–mass spectrometry

DGGE - Denaturing gradient gel electrophoresis

CsCl - Caesium chloride

HGT - Horizontal gene transfer

AAP - Aerobic anoxygenic photosynthesis

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and material

The following accession numbers were submitted to GenBank for ¹³C-enriched DNA in SIP experiments with n-hexadecane (KY515280, KY515282, KY515284, KY515286–KY515288, KY515291–KY515294). The MAGs are submitted under the project ID PRJNA816150.

Competing interests

The authors declare no competing financial interests.

Funding

This work was funded by the Simons Foundation (Award number 687165) awarded to BJB. This work was also funded under the NERC Scottish Universities Partnership for Environmental Research (SUPER) Doctoral Training Partnership (DTP) (Grant reference number NE/S007342/1 and website https://superdtp.st-andrews.ac.uk/), a DOE-JGI grant (Project CSP 503341) to TG, with support also from Heriot-Watt University via their James-Watt Scholarship Scheme to AA. Partial support was also provided through a Royal Society Research Grant (project NEAMO), a Society for Applied Microbiology (SfAM) grant and a MASTS PECRE grant (project NEADMICRO) to TG.

Authors contributions

T.G, V.D.A., and B.J.B. conceived this study. T.G., V.D.A. and B.J.B. advised experiments and analyses. M.V.R.L, V.D.A., G.W., A.A., T.G. and B.J.B performed experiments and/or analyses. M.V.R.L, V.D.A., G.W., R.R.R., T.G. and B.J.B wrote the paper with contributions from all authors.

Acknowledgements

We thank Alejandro Gallego and the captain and crew of MRV Scotia for their support on the research cruises to the FSC and for accommodating all our research needs.

Sammarco PW, Kolian SR, Warby RAF, Bouldin JL, Subra WA, Porter SA. Distribution and concentrations of petroleum hydrocarbons associated with the BP/Deepwater Horizon Oil Spill, Gulf of Mexico. Mar Pollut Bull. 2013;73:129–43.
Head IM, Jones DM, Röling WFM. Marine microorganisms make a meal of oil. Nat Rev Microbiol. 2006;4:173–82.
Yakimov MM, Timmis KN, Golyshin PN. Obligate oil-degrading marine bacteria. Curr Opin Biotechnol. 2007;18:257–66.
Chakrabarty AM, Chou G, Gunsalus IC. Genetic regulation of octane dissimilation plasmid in Pseudomonas. Proc Natl Acad Sci U S A. 1973;70:1137–40.
Owen DJ, Eggink G, Hauer B, Kok M, McBeth DL, Yang YL, et al. Physical structure, genetic content and expression of the alkBAC operon. Mol Gen Genet. 1984;197:373–83.
Kok M, Oldenhuis R, van der Linden MP, Raatjes P, Kingma J, van Lelyveld PH, et al. The Pseudomonas oleovorans alkane hydroxylase gene. Sequence and expression. J Biol Chem. 1989;264:5435–41.
van Beilen JB, Penninga D, Witholt B. Topology of the membrane-bound alkane hydroxylase of Pseudomonas oleovorans. J Biol Chem. 1992;267:9194–201.
Rojo F. Degradation of alkanes by bacteria. Environ Microbiol. 2009;11:2477–90.
Geissdörfer W, Kok RG, Ratajczak A, Hellingwerf KJ, Hillen W. The genes rubA and rubB for alkane degradation in Acinetobacter sp. strain ADP1 are in an operon with estB, encoding an esterase, and oxyR. J Bacteriol. 1999;181:4292–8.
Panke S, de Lorenzo V, Kaiser A, Witholt B, Wubbolts MG. Engineering of a stable whole-cell biocatalyst capable of (S)-styrene oxide formation for continuous two-liquid-phase applications. Appl Environ Microbiol. 1999;65:5619–23.
van Beilen JB, Smits THM, Whyte LG, Schorcht S, Röthlisberger M, Plaggemeier T, et al. Alkane hydroxylase homologues in Gram-positive strains. Environ Microbiol. 2002;4:676–82.
Williams SC, Austin RN. An Overview of the Electron-Transfer Proteins That Activate Alkane Monooxygenase (AlkB). Front Microbiol. 2022;13:845551.
Smits THM, Balada SB, Witholt B, van Beilen JB. Functional analysis of alkane hydroxylases from gram-negative and gram-positive bacteria. J Bacteriol. 2002;184:1733–42.
Nie Y, Chi C-Q, Fang H, Liang J-L, Lu S-L, Lai G-L, et al. Diverse alkane hydroxylase genes in microorganisms and environments. Sci Rep. 2014;4:4968.
Hazen TC, Dubinsky EA, DeSantis TZ, Andersen GL, Piceno YM, Singh N, et al. Deep-sea oil plume enriches indigenous oil-degrading bacteria. Science. 2010;330:204–8.
Redmond MC, Valentine DL. Natural gas and temperature structured a microbial community response to the Deepwater Horizon oil spill. Proceedings of the National Academy of Sciences. 2012;109:20292–7.
Dumont MG, Colin Murrell J. Stable isotope probing — linking microbial identity to function. Nature Reviews Microbiology. 2005;3:499–504.
Gutierrez T, Singleton DR, Berry D, Yang T, Aitken MD, Teske A. Hydrocarbon-degrading bacteria enriched by the Deepwater Horizon oil spill identified by cultivation and DNA-SIP. ISME J. 2013;7:2091–104.
Dombrowski N, Donaho JA, Gutierrez T, Seitz KW, Teske AP, Baker BJ. Reconstructing metabolic pathways of hydrocarbon-degrading bacteria from the Deepwater Horizon oil spill. Nature Microbiology. 2016;1 May:1–8.
Angelova AG, Berx B, Bresnan E, Joye SB, Free A, Gutierrez T. Inter- and Intra-Annual Bacterioplankton Community Patterns in a Deepwater Sub-Arctic Region: Persistent High Background Abundance of Putative Oil Degraders. MBio. 2021;12.
Turrell WR, Slesser G, Adams RD, Payne R. Decadal variability in the composition of Faroe Shetland Channel bottom water. Deep Sea Res Part I. 1999.
Berx B. The hydrography and circulation of the Faroe-Shetland Channel. Ocean Challenge. 2012;19:15–29.
Dyksterhouse SE, Gray JP, Herwig RP, Lara JC, Staley JT. Cycloclasticus pugetii gen. nov., sp. nov., an aromatic hydrocarbon-degrading bacterium from marine sediments. Int J Syst Bacteriol. 1995;45:116–23.
Tillett D, Neilan BA. Xanthogenate nucleic acid isolation from cultured and environmental cyanobacteria. Journal of Phycology. 2000;36:251–8.
Jones MD, Singleton DR, Sun W, Aitken MD. Multiple DNA extractions coupled with stable-isotope probing of anthracene-degrading bacteria in contaminated soil. Appl Environ Microbiol. 2011;77:2984–91.
Yu Z, Morrison M. Improved extraction of PCR-quality community DNA from digesta and fecal samples. Biotechniques. 2004;36:808–12.
Muyzer G, de Waal EC, Uitterlinden AG. Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16S rRNA. Appl Environ Microbiol. 1993;59:695–700.
Gordon D. Viewing and editing assembled sequences using Consed. Curr Protoc Bioinformatics. 2003;Chap. 11:Unit11.2.
Larsson A. AliView: a fast and lightweight alignment viewer and editor for large datasets. Bioinformatics. 2014;30:3276–8.
Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, Madden TL. Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics. 2012;13:134.
Grigoriev IV, Nordberg H, Shabalov I, Aerts A, Cantor M, Goodstein D, et al. The genome portal of the Department of Energy Joint Genome Institute. Nucleic Acids Res. 2012;40 Database issue:D26–32.
The Genome Portal of the Department of Energy Joint Genome Institute: 2014 Updates. United States. Department of Energy. Office of Science; 2013.
Bushnell B, Rood J, Singer E. BBMerge - Accurate paired shotgun read merging via overlap. PLoS One. 2017;12:e0185056.
Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 2017;27:824–34.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165.
Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, et al. Binning metagenomic contigs by coverage and composition. Nat Methods. 2014;11:1144–6.
Sieber CMK, Probst AJ, Sharrar A, Thomas BC, Hess M, Tringe SG, et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat Microbiol. 2018;3:836–43.
Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55.
Chaumeil P-A, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2019. https://doi.org/10.1093/bioinformatics/btz848.
Darling AE, Jospin G, Lowe E, Matsen FA 4th, Bik HM, Eisen JA. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ. 2014;2:e243.
Katoh K, Misawa K, Kuma K-I, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002;30:3059–66.
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.
Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90.
Hyatt D, Chen G-L, Locascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11:119.
Aramaki T, Blanc-Mathieu R, Endo H, Ohkubo K, Kanehisa M, Goto S, et al. KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold. Bioinformatics. 2020;36:2251–2.
Blum M, Chang H-Y, Chuguransky S, Grego T, Kandasaamy S, Mitchell A, et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 2021;49:D344–54.
De Anda V, Zapata-Peñasco I, Poot-Hernandez AC, Eguiarte LE, Contreras-Moreira B, Souza V. MEBS, a software platform to evaluate large (meta)genomic collections according to their metabolic machinery: unraveling the sulfur cycle. Gigascience. 2017;6:1–17.
Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32:268–74.
Taboada B, Estrada K, Ciria R, Merino E. Operon-mapper: a web server for precise operon identification in bacterial and archaeal genomes. Bioinformatics. 2018;34:4118–20.
Sunagawa S, Coelho LP, Chaffron S, Kultima JR, Labadie K, Salazar G, et al. Ocean plankton. Structure and function of the global ocean microbiome. Science. 2015;348:1261359.
Agogué H, Lamy D, Neal PR, Sogin ML, Herndl GJ. Water mass-specificity of bacterial communities in the North Atlantic revealed by massively parallel sequencing. Mol Ecol. 2011;20:258–74.
Zhang Y, Yao P, Sun C, Li S, Shi X, Zhang X-H, et al. Vertical diversity and association pattern of total, abundant and rare microbial communities in deep-sea sediments. Mol Ecol. 2021;30:2800–16.
Bacosa HP, Erdner DL, Rosenheim BE, Shetty P, Seitz KW, Baker BJ, et al. Hydrocarbon degradation and response of seafloor sediment bacterial community in the northern Gulf of Mexico to light Louisiana sweet crude oil. ISME J. 2018;12:2532–43.
Liu J, Bacosa HP, Liu Z. Potential Environmental Factors Affecting Oil-Degrading Bacterial Populations in Deep and Surface Waters of the Northern Gulf of Mexico. Front Microbiol. 2016;7:2131.
van Beilen JB, Neuenschwander M, Smits THM, Roth C, Balada SB, Witholt B. Rubredoxins involved in alkane oxidation. J Bacteriol. 2002;184:1722–32.
van Beilen JB, Wubbolts MG, Witholt B. Genetics of alkane oxidation by Pseudomonas oleovorans. Biodegradation. 1994;5:161–74.
Friedrich CG, Rother D, Bardischewsky F, Quentmeier A, Fischer J. Oxidation of reduced inorganic sulfur compounds by bacteria: emergence of a common mechanism? Appl Environ Microbiol. 2001;67:2873–82.
Vaysse P-J, Sivadon P, Goulas P, Grimaud R. Cells dispersed from Marinobacter hydrocarbonoclasticus SP17 biofilm exhibit a specific protein profile associated with a higher ability to reinitiate biofilm development at the hexadecane-water interface. Environ Microbiol. 2011;13:737–46.
Alonso-Gutiérrez J, Costa MM, Figueras A, Albaigés J, Viñas M, Solanas AM, et al. Alcanivorax strain detected among the cultured bacterial community from sediments affected by the “Prestige” oil spill. Mar Ecol Prog Ser. 2008;362:25–36.
Gontikaki E, Potts LD, Anderson JA, Witte U. Hydrocarbon-degrading bacteria in deep-water subarctic sediments (Faroe-Shetland Channel). J Appl Microbiol. 2018;125:1040–53.
Suja LD, Summers S, Gutierrez T. Role of EPS, Dispersant and Nutrients on the Microbial Response and MOS Formation in the Subarctic Northeast Atlantic. Front Microbiol. 2017;8:676.
Kostka JE, Prakash O, Overholt WA, Green SJ, Freyer G, Canion A, et al. Hydrocarbon-degrading bacteria and the bacterial community response in gulf of Mexico beach sands impacted by the deepwater horizon oil spill. Appl Environ Microbiol. 2011;77:7962–74.
Thompson H, Angelova A, Bowler B, Jones M, Gutierrez T. Enhanced crude oil biodegradative potential of natural phytoplankton-associated hydrocarbonoclastic bacteria. Environ Microbiol. 2017;19:2843–61.
Ma M, Gao W, Li Q, Han B, Zhu A, Yang H, et al. Biodiversity and oil degradation capacity of oil-degrading bacteria isolated from deep-sea hydrothermal sediments of the South Mid-Atlantic Ridge. Marine Pollution Bulletin. 2021;171:112770.
Chaudhary DK, Kim D-U, Kim D, Kim J. Flavobacterium petrolei sp. nov., a novel psychrophilic, diesel-degrading bacterium isolated from oil-contaminated Arctic soil. Scientific Reports. 2019;9.
Mas-Lladó M, Piña-Villalonga JM, Brunet-Galmés I, Nogales B, Bosch R. Draft Genome Sequences of Two Isolates of the Roseobacter Group, Sulfitobacter sp. Strains 3SOLIMAR09 and 1FIGIMAR09, from Harbors of Mallorca Island (Mediterranean Sea). Genome Announc. 2014;2.
Jin HM, Kim JM, Lee HJ, Madsen EL, Jeon CO. Alteromonas as a key agent of polycyclic aromatic hydrocarbon biodegradation in crude oil-contaminated coastal sediment. Environ Sci Technol. 2012;46:7731–40.
Gogarten JP, Peter Gogarten J, Townsend JP. Horizontal gene transfer, genome innovation and evolution. Nature Reviews Microbiology. 2005;3:679–87.
Hooper SD, Berg OG. On the nature of gene innovation: duplication patterns in microbial genomes. Mol Biol Evol. 2003;20:945–54.
Giebler J, Wick LY, Schloter M, Harms H, Chatzinotas A. Evaluating the assignment of alkB terminal restriction fragments and sequence types to distinct bacterial taxa. Appl Environ Microbiol. 2013;79:3129–32.
Nie Y, Liang J, Fang H, Tang Y-Q, Wu X-L. Two novel alkane hydroxylase-rubredoxin fusion genes isolated from a Dietzia bacterium and the functions of fused rubredoxin domains in long-chain n-alkane degradation. Appl Environ Microbiol. 2011;77:7279–88.
Williams SC, Forsberg AP, Lee J, Vizcarra CL, Lopatkin AJ, Austin RN. Investigation of the prevalence and catalytic activity of rubredoxin-fused alkane monooxygenases (AlkBs). J Inorg Biochem. 2021;219:111409.
Ennouri H, d’Abzac P, Hakil F, Branchu P, Naïtali M, Lomenech A-M, et al. The extracellular matrix of the oleolytic biofilms of Marinobacter hydrocarbonoclasticus comprises cytoplasmic proteins and T2SS effectors that promote growth on hydrocarbons and lipids. Environ Microbiol. 2017;19:159–73.
Grimaud R, Ghiglione J-F, Cagnon C, Lauga B, Vaysse P-J, Rodriguez-Blanco A, et al. Genome sequence of the marine bacterium Marinobacter hydrocarbonoclasticus SP17, which forms biofilms on hydrophobic organic compounds. J Bacteriol. 2012;194:3539–40.
Parales RE, Ditty JL. Chemotaxis to Hydrocarbons. In: Krell T, editor. Cellular Ecophysiology of Microbe: Hydrocarbon and Lipid Interactions. Cham: Springer International Publishing; 2018. p. 221–39.
Vaysse P-J, Prat L, Mangenot S, Cruveiller S, Goulas P, Grimaud R. Proteomic analysis of Marinobacter hydrocarbonoclasticus SP17 biofilm formation at the alkane-water interface reveals novel proteins and cellular processes involved in hexadecane assimilation. Res Microbiol. 2009;160:829–37.
Hutchins DA, Capone DG. The marine nitrogen cycle: new developments and global change. Nature Reviews Microbiology. 2022;20:401–14.

Table 1 is available in the Supplemental Files section.

No competing interests reported.

Table1.docx
AdditionalFile1.tiff
● Additional File 1: Supplementary Figure S1. Distribution of the ‘heavy’ and ‘light’ DNA in separated SIP fractions from the incubations using surface (5 m depth; top panel) and subsurface (700 m depth, bottom panel) water, as analyzed by DGGE of bacterial PCR products with decreasing densities from left to right.
AdditionalFile2.tif
● Additional File 2: Supplementary Figure S2. DNA concentration quantified in fractions from labeled [13C]-hexadecane and unlabelled [12C]-hexadecane incubations using the sea surface (5 m depth) (A) and subsurface (700 m depth) (B) water. The duplicate 13C SIP incubations are represented by the curves with solid symbols, whereas the respective unlabelled incubations are represented by the curves with open symbols. The density of each fraction (represented with cross symbols) is also shown to demonstrate the successful formation of a density gradient during isopycnic ultracentrifugation. For the 13C-incubation using the surface water, fractions 7-10 were combined (see also Suppl. Fig. S1A), and for the 13C-incubation with the subsurface water, fractions 7-9 were combined (see also Suppl. Fig. S1B).
AdditionalFile3.png
● Additional File 3: Supplementary Figure S3. 16S rRNA phylogenetic tree of the 13C-enriched community.
AdditionalFile4.pdf
● Additional File 4: Supplementary Figure S4. Maximum-likelihood-based phylogenetic tree of AlkT. Black circles within the branches indicate bootstrap values >80%, where the smallest circle equals 80% and the biggest 100%. Clades are color-coded according to the taxonomic group to which each sequence belongs: yellow, Gammaproteobacteria; pink, Alphaproteobacteria; purple, Bacteroidetes. Label colors represent the genera to which each sequence belongs.
AdditionalFile5.png
● Additional File 5: Supplementary Figure S5. Maximum-likelihood-based phylogenetic tree of AlkG. Black circles within the branches indicate bootstrap values >80%, where the smallest circle equals 80% and the biggest 100%. Clades are color-coded according to the taxonomic group to which each sequence belongs: yellow, Gammaproteobacteria; pink, Alphaproteobacteria; purple, Bacteroidetes. Label colors represent the genera to which each sequence belongs. The stars show the sequences with fusion domains.
AdditionalFile6.pdf
● Additional File 6: Supplementary Figure S6. Maximum-likelihood-based phylogenetic tree of AlkJ. Grey circles indicate bootstrap values >80%, where the smallest circle equals 80% and the biggest 100%. Label colors represent genera.
AdditionalFile7.pdf
● Additional File 7: Supplementary Figure S7. Maximum-likelihood-based phylogenetic tree of AlkH. Grey circles within the branches indicate bootstrap values >80%, where the smallest circle equals 80% and the biggest 100%. Label colors represent the genera to which each sequence belongs.
AdditionalFile8.pdf
● Additional File 8: Supplementary Figure S8. The AlkG-like and AlkG protein alignment, letters highlighted in green show the shared motif.
AdditionalFile9.xlsx
● Additional File 9: Supplementary Table 1. Overview of the metagenome assembly. The first column indicates sample IDs, followed by the program name used for the assembly. The following columns show characteristics such as N50 and minimal contig length.
AdditionalFile10.xlsx
● Additional File 10: Supplementary Table 2. Sequencing data and reports from JGI. The first column indicates the site where the samples were taken, followed by the SIP experiment characteristics and each metagenome's sample name. The following columns indicate the type of file and its size. For some types of files, the number of sequences is also shown.
AdditionalFile11.xlsx
● Additional File 11: Supplementary Table 3. MAGs statistics. The first panel shows genome characteristics, such as genome size, GC content, the number of coding sequences predicted, and the depth from where the genome was obtained. In the second panel, we showed the CheckM results for each MAG. The first column indicates MAG ids; the rest represent the number of marker genes identified and the number of copies of these marker genes. Finally, the last columns represent each MAG's completeness, contamination, and heterogeny values.
AdditionalFile12.xlsx
● Additional File 12: Supplementary Table 4. Relative abundance of MAGs in the metagenomes. Fist columns indicate the data used to calculate the abundance; the genome size, contig length, sequencing depth, number of reads, and coverage. The final columns show the relative abundance values.
AdditionalFile13.xlsx
● Additional File 13: Supplementary Table 5. GTDB-tk taxonomy. The first column indicates the taxonomy inferred by the program. The following columns show each bin relationship with the closest reference genomes, including the classification method.
AdditionalFile14.xlsx
● Additional File 14: Supplementary Table 6. KEGG-based functional annotation of the forty-two MAGs from the n-hexadecane SIP enrichments. The first four columns represent the KEGG modules and pathways. The following three represent the main biogeochemical cycle categories described in the DiTings pipeline (https://github.com/xuechunxu/DiTing/blob/master/table/KO_affilated_to_biogeochemical_cycle.tab). The following four columns represent gene name, description, enzyme number (EC), and KO. The rbims database is in the following two columns. The last columns represent the abundance of each KO in the bin.

Download PDF

Version 1

posted

You are reading this latest preprint version

Novel hydrocarbon-degradation pathways in uncultured bacteria in industrial-impacted ocean waters

Status:

Version 1

Abstract

Figures

Background

Methodology

DNA gradient ultracentrifugation and identification of labeled 16S rRNA genes

Results

Discussion

Conclusions

Abbreviations

Declarations

Authors contributions

References

Tables

Additional Declarations

Supplementary Files

Status:

Version 1