A gricultural Exudate- R esponsive Metagenomic (ARM) Database Expands Microbiome Knowledge in Colorado Soils
To understand how microbial genomic content influences soil health and the promotion of plant growth in response to exudate stimulation, we established the Agricultural exudate-Responsive Metagenomic (ARM) microbial genomic catalog. In total, we collected 41 metagenomes from two exudate stimulation soil microcosm experiments (Fig. 1C), resulting in 621 Gbp of total metagenomic sequencing. The prior proof of concept study evaluated sorghum genotype exudate profiles using microcosms operated similarly except exudate profiles guided the formulation of synthetic root exudates [28], while here we directly added the cover crop produced exudates to the soil microcosms. From both experiments we reconstructed 441 medium- and high-quality metagenome-assembled genomes (MAGs) that were dereplicated at 99% identity into 326 MAG clusters, representative of distinct microorganisms (Fig. 2A). ARM contains MAGs assigned to 21 bacterial phyla and 1 archaeal phylum, with the database composed mainly of members of Pseudomonadota (29.1%), Bacteroidota (14.1%), Actinomycetota (13.1%), and Bacillota_A (12.2%). The MAGs of ARM span 43 different genera (Additional File 9, Fig. 1).
Highlighting the genomic novelty of these soils, the ARM database contains MAGs that represent previously unidentified (i.e., lacking a taxonomic assignment) family (n = 10 MAGs, 3%) and genera (n = 39 MAGs, 11.9%) in the Genome Taxonomy Database (GTDB) (Fig. 2B, Additional File 3). Additionally, a large proportion of our MAG database belonged to lineages only recognized by alphanumeric identifiers (e.g., Draft Genome Sequence) in GTDB at the class (n = 7), order (n = 46), and family (n = 104) levels (Fig. 2B). Together these findings emphasize the phylogenetic novelty yet to be genomically captured from agricultural soils. Ultimately, accessible ARM resources provide new genomic information for ecologically relevant taxa, with the goal towards enabling taxonomic analyses and metabolic reconstruction of microorganisms from agricultural soils.
Cover crop exudates results in distinct microbial chemical landscapes
We first examined how soil microbial metabolite pools changed in response to the addition of cover crop exudates over time. Combining non-targeted and targeted LC-MS/MS data resulted in the detection of 641 molecular features to resolve community metabolic changes across the enrichment timeseries in each treatment (Additional File 2; Additional File 9, Fig. 2). The metabolite data showed restructuring by cover crop exudate treatment and time (Fig. 3 & Additional File 9, Fig. 3 & Table 1). Additionally, the cereal rye and hairy vetch treatments exhibited significantly different metabolomes from the control at each matched timepoint following the first day of exudate addition, indicating adding exudates not only rapidly influenced microbial outputs during the time they were added, but had an impact on restructuring of metabolic outputs after the amendment period. In the sorghum and rapeseed exudate treatment, the exudate amendments only differed from control on days 3 and 5, respectively, of the amendment period, yet despite these slight chemical changes early on, they did result in a temporally significant trajectory different from the control later in the timeseries (Additional File 2). These results suggest that that cereal rye and hairy vetch root exudates may have more distinct chemical influence over the soil microbiome than sorghum and rapeseed treatments.
The 641 detected metabolites were then classified into 14 superclasses across all samples: lipids and lipid-like molecules (n = 179), organoheterocyclic compounds (n = 106), organic acids and derivatives (n = 97), phenylpropanoids and polyketides (n = 64), organic oxygen compounds (n = 52), benzenoids (n = 38), nucleosides, nucleotides, and analogues (n = 20), alkaloids and derivatives (n = 18), organic nitrogen compounds (n = 15), lignans, neolignans and related compounds (n = 7), organosulfur compounds (n = 3), organic 1,3-dipolar compounds (n = 1), organic polymers (n = 1) (Additional File 9, Fig. 4; Additional File 2). We found a significant difference between treatment and control metabolomes at this metabolite classification level (Additional File 9, Fig. 5 & Table 2). Specifically, all exudate-treated microcosms were significantly enriched for lipids, benzenoids, alkaloids and derivatives, and organoheterocyclic compounds. Water amended soil controls were enriched for organic acids and derivatives, nucleosides, nucleotides, and analogues, and organic polymers (Additional File 9, Table 3). Interestingly, cereal rye was significantly different from the control and other exudate treatments being enriched for metabolites from the phenylpropanoids and polyketides superclass (Additional File 9, Table 3). Taken together our broad metabolite profiling indicates that cover crop exudates can result in distinct chemical trajectories, possibly through altered microbial metabolism.
Microbial Community Function is Altered by Exudate Addition
Next, we wanted to examine the microbial community response to exudates at the genome membership and gene expression levels. We used the ARM database to resolve metatranscriptome expression profiles across the enrichment timeseries (Additional File 7). Unlike the metabolomes that had a strong individual cover crop effect, the overall composition of metatranscriptionally active MAGs did not change by cover crop treatment, but rather time (Additional File 9, Fig. 6 & Tables 4–5). Yet we did observe that the active genera changed over the course of the experiment. For example, genera in the family Nitrososphaeraceae (phylum Thermoproteota, previously Thaumarchaeota) contributed roughly 50% of exudate-addition phase metatranscriptome expression (Fig. 4A, green shading). But by day 21 of enrichment, control and cover crop microcosm metatranscriptomes were dominated by MAGs from the Bacteroidota, including unclassified genera in the Crocinitomicaceae (genus M2408) and JADKCL01 (genus JADKCL01) families (Fig. 4A). Comparing these membership and metabolite findings, the chemical data showed much stronger restructuring by treatment and time than the genus level active members.
To better identify specific organismal responses to individual cover crop treatments we next focused on genome (not genus) resolved responses within the microbial community. During exudate amendment, only a single MAG from an undescribed species in the Actinomycetota increased in activity in response to hairy vetch amendment (JAJTCL01 sp021323255), while a single MAG from a novel genus in the Patescibacteria (family UBA1547) was enriched in response to sorghum exudates (Additional File 8). Next, we identified MAGs that broadly responded to the cover crop exudates compared to the control. We found 23 MAGs were enriched in the control metatranscriptomes and 9 MAGs were enriched across the cover crop exudate treatment metatranscriptomes (Additional File 9, Fig. 7). The latter group included three MAGs in the Nitrospiraceae (including two in the Nitrospira_C), two MAGs in the Gammaproteobacterial genus Hydrogenophaga, one MAG classified as Nitrobacter vulgaris, and MAGs representing novel species of Arthrobacter, Lacipirellula, and Paucimonas. Collectively, this suggests cover crops produced root exudate compounds that stimulated specific microorganisms in the broader community.
Next, the impact of exudate addition on gene expression was evaluated by aggregating genes at the functional level (by annotation protein type). We identified 114 microbial functions that were enriched in the control metatranscriptome, and 145 functions that were enriched in the cover crop exudate amended microcosms, at day 5 (Additional File 9, Fig. 8). We observed distinct nitrogen cycling gene expression between control and exudate metatranscriptomes at day 5 (Fig. 4B; Additional File 9, Figs. 8 & 9). Functions associated with nitrogen transport, mineralization and assimilation were discriminant to the control microcosm metatranscriptomes. In support of the microorganisms in the control microcosms being nitrogen limited, we confirmed the exudate metabolites contained organic nitrogen in the form of urea and several amino and organic acids while the controls received only water amendments (Additional File 2). Additionally, ammonia oxidation gene expression (amoB) was more enriched in the control, with contributions from four genera of Thaumarchaeota (Additional File 8). Nitrite reduction (nirK) was also enriched in the control, with an average of 97% control gene expression from the four Thaumarchaeota genera. The increased expression of amoB further supports the idea that the control soil microcosms are nitrogen limited, as genera from archaeal ammonia oxidizers were detected and active in all cover crop and control treatments (Fig. 4A), but this increased gene expression in controls may be due to the low ammonia environment [56].
The only nitrogen function enriched in the exudate microcosms was nitrite oxidation (nxrB), or the second step of nitrification resulting in nitrite being oxidized to nitrate. Gene expression was derived from Nitrospira_C MAGs and a MAG from an undescribed genus WHTF01 in the Binatia (Additional File 8). Concomitant with this gene result, as mentioned above, a third of the discriminant cover crop transcriptionally active MAGs were bacterial nitrite oxidizers (Nitrospira_C, Nitrobacter). The enrichment of bacterial nitrite oxidizers was unique to the cover crop treatments, suggesting these nitrifiers selectively responded to exudate components. Despite similar redox conditions between exudate and control reactors, in the control soils gene expression data suggest that nitrite is not oxidized but reduced in a closed loop (Fig. 4B). Instead in the controls, nitrite reduction was mediated by Thaumarchaeota (nirK) possibly to detoxify ammonification produced nitrite due to the absence of active nitrite oxidizers or was assimilated to ammonia (nirBD). Finally, 8 genera contributed to nirD expression in the control, but 83% of the expression comes from 2 MAGs from the Cellvibrio and Pseudomonas_E genera. An additional 6 genera contribute to nirB expression in the control, with 45% of the expression from the same Pseudomonas_E MAG. These findings reflect an increase in control microorganisms to utilize specific nitrogen cycling functions due to the lack of available organic nitrogen (Fig. 4B). While we acknowledge limitations in translating these laboratory findings to the field scale, our multi-omics results do highlight how small-scale nutrient landscape changes caused by exudation could alter microbially expressed metabolic regimes (e.g., nitrification versus nitrite reduction) to result in disparate ecosystem manifested outcomes.
Metabolite Evidence for Cover Crop and Microbially Produced Phytohormones
Given the recognized importance of microbial phytohormones in plant growth promotion, we next considered whether cover crop exudation could stimulate soil microbial phytohormone metabolism. We performed targeted metabolomics to quantify phytohormones in the microcosms, identifying 8 phytohormones that fluctuated over time: indole-3-acetic acid (IAA), indole-3-butyric acid (IBA), gibberellic acid 4 (GA4), 1-aminocyclopropane carboxylic acid (ACC), salicylic acid (SA), methyl salicylate (mSA), benzoic acid (BA), jasmonic acid (JA) (Fig. 5). Of these phytohormones, IAA, IBA, BA, GA4, and ACC had the strongest effect across the treatments (Fig. 5A; Additional File 9, Tables 6–9). Notably, only IAA and ACC were detected in cover crop exudate additions and were thus not detected in the water controls (Figs. 5B-C). Of the other phytohormones, IBA and GA4 were stimulated to be produced by the microbial community in response to added root exudates (Figs. 5D-E), while BA and mSA production was unique to the non-exudate stimulated controls (Additional File 9, Fig. 10). Our findings show that cover crop roots exude unique phytohormone profiles, but also that the soil microbiome likely encodes diversity of phytohormone producing and consuming metabolisms.
Of the different phytohormones we were particularly interested in ACC, IAA, IBA, and GA4 dynamics because they had clear cover crop signal. Our exudate metabolomics data confirmed ACC was detected as a component of the endogenous root exudate across all cover crops. As a result, ACC increased during the exudate addition phase (up to day 5). After day 7, ACC abundance stabilized (e.g., hairy vetch) or decreased (e.g., rapeseed, sorghum, cereal rye), the later indicating a possible microbial consumption (Fig. 5C). Like ACC, IAA was a product of cover crop direct addition in the hairy vetch but was not a component of other exudate treatments. However, the IAA accumulation signal was not as clear as ACC during the exudate addition phase (up to day 5) (Fig. 5B). After exudate stimulation stopped, IAA generally decreased in 3 cover crop treatments (hairy vetch, cereal rye, rapeseed), except for sorghum where IAA increased overtime (Fig. 5B; Additional File 9, Table 10). Similarly, IBA, an indole related to IAA, accumulated significantly only in the cereal rye at the post exudation phase (Fig. 5E; Additional File 9, Table 6). These latter two findings suggest the exudate chemistry stimulates members of the microbiome for indole production.
Based on these interesting indole cover crop responses, we mined our untargeted LC-MS data for additional indoles, identifying 2 unknown compounds that shared structural similarity to IAA (see Methods). Unknown indoles were from the chemical subclasses (i) indoles and (ii) indolyl carboxylic acids and derivatives and could be further categorized as a 3-alkylindole compound and an IAA derivative, respectively (Additional File 9, Table 11). The unknown IAA derivative was significantly more abundant in the rapeseed metabolomes compared to the control at each timepoint following day 1 (Additional File 9, Fig. 11A & Table 12). Exhibiting a broader response, the unknown 3-alkylindole was significantly enriched at each timepoint following the first day of exudate addition in rapeseed, sorghum, and hairy vetch compared to the control (Additional File 9, Fig. 11B & Table 13).
While there are more than 100 gibberellic acids (GA) known, only four (GA1, GA3, GA4, and GA7) have characterized biological activity [57]. Of these, GA4, a C20-gibberellin, was detected exclusively, and increasingly over time, in the cereal rye treated microcosms (Fig. 5D; Additional File 9, Table 7). Further supporting broader microbial GA production, we found two GA features that shared structural similarity with GA4 that were further characterized as type C20- and C19-GA (Additional File 9, Table 11). Given that GA compounds were not detected in the exudates, and they collectively increased after exudate stimulation in the cereal rye (Additional File 9, Fig. 12A-B & Tables 14–15), it is likely that cereal rye exudates contained compounds that stimulated microbial production of these GA. These findings show the power of combining untargeted metabolomics with targeted phytohormone quantification to enable tractability of microbial phytohormone biosynthesis. Our metabolite analyses highlight how much yet to be discovered biochemistry is likely residing in agricultural soils.
Metatranscriptome Evidence for a Microbial Role in Phytohormone Transformations
Based on the possible ACC microbial consumption observed in our metabolite dynamics (Fig. 5C) and the critical role this compound plays as a recruitment cue for plant growth promoting bacteria [58], we mined ARM MAGs for the ACC deaminase (acdS) gene. Many bacteria use acdS to metabolize ACC via deamination, curbing the abiotic stress induced ethylene production and its adverse effect on plants [58]. In total, 66 genes from 63 MAGs were annotated as acdS (Additional File 3). As ACC deaminase genes are commonly misannotated as D-cysteine sulfhydrase [45, 46], we curated these putative hits for key active site residues to identify 13 high confidence acdS genes from distinct MAGs in the Actinomycetota (n = 9) and the Pseudomonadota (formerly Proteobacteriota, n = 4) (Fig. 6A).
Two of these Actinomycetota MAGs (Pseudonocardia, Ornithinibacter) expressed ACC deaminase across treatments and timepoints. These strains represent the most likely biological culprits contributing to ACC removal in the treatment microcosms. Analysis of the broader gene expression of these two MAGs suggests the α-ketobutyric acid generated from ACC deaminase activity could be used to support isoleucine synthesis for these microorganisms (Fig. 6B; Additional File 3). Additionally, the Ornithinibacter MAG expressed genes for participation in arabinan utilization, a plant polysaccharide found in high concentrations in roots [59], as well as the genes for nitrate reduction (Nap). These paired metabolite and genome-resolved expression data uncover how cover crop ACC exudation can enrich for microorganisms with the capacity to modulate soil carbon and nitrogen cycles that benefit plant health.
Microbial biosynthesis of indole derivatives, like auxins, can impact plant health directing proper plant growth and development [60, 61]. Given that IAA was only a component of the hairy vetch exudate pool, the IAA detected in sorghum, cereal rye, and rapeseed was likely microbially generated. This compound and other closely related indole intermediates can be synthesized via five microbial pathways derived from tryptophan (Trp), all converging on the final oxidation step converting indole acetaldehyde (IAAld) to IAA by aldehyde dehydrogenase (ALDH) [61, 62]. We profiled genes for IAA pathways in the ARM database and found 4 pathways were represented in our MAGs (Fig. 7A, Additional File 3).
Of the 4 pathways, the indole-3-pyruvate (IPA) pathway was likely not contributing to the metabolite signal. We found 10 MAGs carrying IPA decarboxylase, yet we found no expression of either genes in the pathway (Fig. 6A). For the second pathway, the indole-3-acetamide (IAM) pathway, we failed to identify any MAGs that encoded both genes in this pathway. However, it has been suggested that this could be community metabolism [63], thus we still examined the gene expression patterns in our data. One MAG from the unnamed genera belonging to the Geminicoccaceae family expressed iaaM and a different MAG from the Burkholderiales order expressed iaaH. Expression of these two genes was highest in the rapeseed, hairy vetch, and cereal rye exudate treatments at day 5 (Fig. 7B). Our combined metabolite and metatranscriptome data suggest that this pathway was likely contributed to the IAA-like metabolite signal in the cereal rye and rapeseed.
The third pathway, the tryptamine (TAM) pathway, best explains the sorghum metabolite IAA production in later time points (Fig. 7B). This pathway is composed of two genes that were encoded across 116 MAGs, but 1 MAG from the Woeseiaceae family in the Pseudomonadota expressed both genes in the sorghum microcosms making this organism the most likely source for the increased IAA detected in sorghum metabolites at later time points. The fourth pathway, the indole-3-acetonitrile (IAN), is identified by the nitrilase gene. This gene was encoded by 55 MAGs, with expression detected in 5 MAGs from three genera in Actinomycetota, one in Pseudomonadota, and one in Chloroflexota. Expression was greatest in rapeseed at timepoints 5 and 21 (Fig. 7B). Holistically, our multi-omics indicates higher functional diversity expressed in the rapeseed treatment, where IAA synthesis genes were expressed by diverse organisms using two different pathways (IAN, IAM). Lastly, 66% of the MAGs in ARM had the potential for catalyzing the final step in IAA synthesis, 15 of which expressed this gene only in exudate microcosms during (day 5) or after exudate stimulation (day 21; Fig. 7B). Our findings clearly validate a role for cover crops in producing a chemical environment that promotes microbial IAA production.
Gibberellic acids (GA) are important phytohormones that promote plant root and stem elongation [64]. Because the cereal rye metabolites showed a clear microbial production signal, we next wanted to identify microbial taxa with this capacity. We found 176 MAGs that encoded at least one gene in the GA operon (Additional File 9, Fig. 12C), with 63 encoding CYP115, the gene responsible for converting GA9 into bioactive GA4 (Fig. 7C-D; Additional File 3). Supporting our GA4 metabolite data, there were 5 MAGs that expressed CYP115 only in cereal rye metatranscriptomes at days 5 and 21 (Fig. 6A and E). A novel MAG in the Thermomicrobiales order (Chloroflexota) encoded 6 of 9 GA operon genes and actively expressed 5 (CYP112, CYP114, CYP115, GGPS, SDR) of the transcripts at day 21 in the cereal rye treatment and not in the control making this MAG the most likely culprit for GA4 production (Fig. 7D). CYP115 expression by this MAG increased over time showing a significant positive correlation with GA4 metabolite production (Fig. 7F; Additional File 9, Table 16). Our findings showing Thermomicrobiales order may be important in the production of GA4, demonstrated the power of these laboratory experiments for uncovering new microbial physiologies with relevance to plant growth promotion.