Biogenic hydrocarbon producing genes.
Lake A metagenomes from eight depths (2 to 65 m) were sequenced and then analyzed for genes coding for fatty aldehyde decarbonylase (FAD), aldehyde deformylating oxygenase (ADO) and olefin beta-lactone synthetase (OleC), which confer the capability to produce long-chain alkanes and olefinic hydrocarbons (alkenes) respectively 16. Overall, up to 3200 genes of hydrocarbon-producing proteins were identified in the dataset. FAD genes, phylogenetically related to Flavobacteriaceae were identified at 2m (Fig. 2). Numerous ADO genes, affiliated to the phylum Cyanobacteria were detected in the freshwater aerobic layer and the chemocline, with up to 1463 genes at 14m depth, corresponding to the bottom of the euphotic zone (Fig. 2). By contrast, OleC genes were more abundant in the saline anoxic (< 1 µM oxygen) water with a maximum of 263 genes at 40m. Phylogenetic analysis of OleC genes indicated that most of the detected sequences were related to Deltaproteobacteria (Desulfobacteraceae, Desulfuromonadales and Myxoccocales) and the PVC superphylum (Planctomycetes, Verrucomicrobia, Lentispheraceae and Gemmataceae). Below 40m depth, the number of hydrocarbon-producing genes declined with depth, down to 23 ADO and FAD genes and 81 OleC genes at 65m (Fig. 2).
Hydrocarbon Degrading Genes
Overall, around 4670 genes coding for proteins involved in hydrocarbon degradation pathways were detected (Fig. 3). The total number of known genes for hydrocarbon degrading proteins identified in the metagenomes exceeded those of hydrocarbon production at each analysed depth, except at 14m where the number of genes of both degrading and producing pathways were maximal and similar (1595 vs 1592). Consistent with the oxygen profile of the water column, aerobic alkane degradation genes (alkB, CYP153, ladA, prmA) were abundant from the surface to 22m, with a maximum of 1451 genes at 14m (Fig. 3). By contrast, anaerobic alkane degradation genes (assA, bssA) predominated in the anoxic saline waters with up to 380 genes at 34m before slowly declining with depth, supporting the absence of deep natural seepages at the bottom of Lake A. The number of aromatic hydrocarbon degradation genes increased with depth until 14m for aerobic pathways (tmoABE, cymA, and MAHαβ), and throughout the water column for anaerobic pathways (ebdA, nmsA), reaching 109 genes at 65m (Fig. 3).
Hydrocarbon Cycling Microbial Populations
A total of 250 MAGs with > 50% completeness and < 5% contamination levels were recovered from the combined metagenomic dataset. Among them, 89 MAGs (35.6%) harbored genes for hydrocarbon cycling (Fig. 4 and Supplementary Fig. 1). ADO genes were identified in only one MAG affiliated with marine Cyanobacteria recovered from the lower chemocline zone. Nonetheless, ADO genes with high coverage were also identified at 6 and 14m samples in unbinned contigs taxonomically affiliated to the cyanobacterial taxon Synechoccocus. In addition, one contig from the 2m metagenomic dataset and related to Flavobacteraceae (Bacteroidia) included a long-chain fatty aldehyde decarbonylase gene (Fig. 4). Olefinic hydrocarbon production genes were identified in 26 MAGs, of which 14 were assigned to PVC lineages (Planctomycetota, Pirellula, Lentisphaerae, Gemmataceae and Opitutaceae) and 8 were related to Deltaproteobacteria (Fig. 4 and Supplementary Fig. 1). Olefinic hydrocarbon producers in the oxic freshwater were related to PVC and Nitrospinaceae, whereas highly abundant Deltaproteobacteria MAGs were predominant in anoxic saline waters. Six low abundance PVC MAGs with OleC were also detected at the deepest depth (Supplementary Fig. 1).
Hydrocarbon degradation pathways were identified in 63 taxonomically diverse MAGs (Fig. 4 and Supplementary Fig. 1). Aerobic alkane degradation genes (alkB, CYP153) were detected in Actinobacteria (Nanopelagicales), Bacteroidetes (Sediminibacterium, Schleiferiaceae) and Alpha-(Rhodobacteraceae), Beta (Polaromonas)- and Gamma-proteobacteria (SAR86, Porticoccaceae, Woeseiaceae, Pseudohongiellaceae). Long chain alkane monooxygenase genes (ladA) were also identified in Actinobacteria (Microbacteriaceae, Alpinimonas) recovered at 2m. Aerobic aromatic hydrocarbon degradation pathways were detected in Alphaproteobacteria (Rhodospirillales) and in Rhodothermales MAGs. Based on the average coverage of the MAGs, Nanopelagicales and Schleiferiaceae were the most abundant aerobic hydrocarbon degrading lineages of the system (Fig. 4 and Supplementary Fig. 1). In addition, anaerobic alkane degradation pathways (assA and bssA) genes were identified in Marinimicrobia, Deltaproteobacteria (Desulfobacteraceae, Desulfatiglans, Syntrophales), Bacteroidetes, Chloroflexi as well as in poorly characterized lineages (candidate divisionKSB1, Aminicenantes) and Abyssubacteria that also possessed aromatic hydrocarbon degradation genes. Coverage of the MAGs indicated that populations of Marinimicrobia were the most abundant anaerobic hydrocarbon degraders in low oxygen waters (Fig. 4 and Supplementary Fig. 1).
Sulfur And Nitrogen Cycle Genes In Hydrocarbon Short Cycle Populations
Genome analysis of the hydrocarbon short cycle populations indicated that 86.5% of the MAGs harbored nitrogen cycling (70% of the MAGs) or sulfur cycling (71% of the MAGs) genes, and 54% included genes from both cycles (Supplementary Fig. 1). Nitrite and nitrate reduction pathways were the most represented pathways in hydrocarbon producers, notably in Planctomycetes, Nitrospina and Deltaproteobacteria (Myxococcota, Desulfatibia and Desulfobia). The Cyanobacteria MAG also included the potential for nitrate and urea assimilation. By contrast, genes coding for enzymes involved in the oxidation of sulfide (SQR) and thiosulfate (doxD, tsdA) and sulfate reduction (aprAB, dsrAB) were the most detected in hydrocarbon degraders. Potential sulfide oxidizers included members of the Marinimicrobia, Bacteroidales and Alphaproteobacteria lineages, whereas thiosulfate oxidizers were related to other Bacteroidetes lineages (Flavobacteriales including Scheiferiaceae, Cytophagales and Cryomorphaceae). Sulfate reduction pathway genes were identified in Desulfatiglandales, Syntrophales, Desulfobacteraceae Chloroflexi and Abyssubacteria. In addition, dissimilatory nitrate reductase and sulfur oxidation genes were also identified in the highly dominant Nanopelagicales MAG with hydrocarbon degradation genes, whereas in the highly dominant Marinimicrobia MAGs sulfur oxidation genes were identified along nitrous-oxide reductase genes (Supplementary Fig. 1).