Statistics of pathogenic Azolla plants
Based on the observations, it is evident that the Azolla plants cultivated for two weeks have grown rapidly and fully covered the entire pond. The majority of the plants in both the Api and Aim groups appeared healthy. However, there seems to be an issue with Azolla plants infected with an unknown pathogenic fungus in both groups, which were cultivated for one month. These infected plants showed symptoms of disease, such as yellowing of the leaves, partial browning, or complete withering of certain plant parts, and loss of structural integrity ( as shown in Fig. 1A, B). To further investigate, we conducted a count of pathogenic plants and the findings indicated that there were significantly more pathogenic plants in the Aim group compared to the Api group ( as shown in Fig. 1C). Additionally, the disease incidence in the Aim group was significantly higher than in the Api group ( as shown in Fig. 1D).
Sequence data summary
High-quality sequencing data were obtained from all six samples after confirming a clean data rate of > 97% through rigorous quality control analysis. The average Q20 and Q30 values were 97.8% and 93.59%, respectively, indicating excellent data quality. The average GC content was 52.46%, which felled within the expected range for the samples ( as shown in Supplementary table 1). To assess the consistency between biological replicates, correlation coefficient analysis was performed, and the results revealed that all samples were grouped individually within their respective replicates, indicating a high level of correlation (Fig. 2A). This was further supported by the principal component analysis (PCA) results (Fig. 2B). These findings indicate that the quality of the sequencing data met the necessary requirements for subsequent analysis, ensuring reliable results.
Taxonomic composition of the microbial communities
The taxonomic composition of the two groups was determined by blasting against the NCBI-NT database. In the annotation results, 195 OTUs were identified as archaea, 10857 as bacteria, 986 as eukaryotes, and 147 as viruses. Bacteria accounted for the highest proportion, followed by eukaryotes. A total of 142 bacterial species were found to be present in both Azolla samples when analyzed at the phylum level. Cyanobacteria and Proteobacteria were the most abundant phyla in both Api and Aim, together comprising more than 70% of the overall population ( as shown in Fig. 3A). Proteobacteria was the most abundant phylum in both samples, comprising 42.39% of all phyla in Api and 51.35% in Aim. At the order level, the most abundant orders within Proteobacteria were Burkholderiales (21.54% and 27.98% in Api and Aim, respectively), Hyphomicrobiales (2.72% and 5.01% in Api and Aim, respectively), and Cellvibrionales (0.62% and 1.28% in Api and Aim, respectively) ( as shown in Fig. 3B). Cyanobacteria was the second most abundant phylum in both samples, comprising 42.19% of all phyla in Api and 22.40% in Aim. The most abundant orders within Cyanobacteria were Nostocales (43.92% and 22.73% in Api and Aim, respectively) and Pseudanabaenales (0.75% and 0.86% in Api and Aim, respectively) ( as shown in Fig. 3B). At the genus level, a total of 2,271 genera were identified from all samples. Trichormus, Calothrix, Methyloversatilis, and Haliscomenobacter were the dominant bacteria. The relative abundance of Trichormus was as high as 27.49% in Api and 12.06% in Aim. The relative abundance of other genera is less than 10% ( as shown in Fig. 3C). The relative abundance of Haliscomenobacter showed the largest difference between Api (1.25%) and Aim (4.06%). LEfSe analysis was performed based on the bacterial data at the phylum level. The results showed different biomarkers for the two groups: the Proteobacteria phylum, Burkholderiales order, Betaproteobacteria class, and others had a higher proportion in the Aim group ( as shown in as shown in Fig. 3D and E), while Cyanobacteria phylum, Nostocales order, Nostocaceae family, Trichormus genus, and others had a higher proportion in the Api group ( as shown in Fig. 3D and E).
We have also identified numerous OTUs annotated as eukaryotes. The Streptophyta phylum accounted for over 90% of the relative abundance of all the identified fungi (as shown in Fig. 4A). Within the Streptophyta phylum, we observed the presence of Vitis, Glycine, Selaginella, Cajanus, Marchantia, and several another fungi with low abundance ( as shown in Fig. 4B). Further analysis using LEfSe revealed that Magnoliopsida class, Bacillariophyta phylum, Bacillariophyceae class, and others had a higher proportion in the Aim group ( as shown in Fig. 4C and D), while the Streptophyta phylum, M. polymorphic species, Marchantia genus, and others had a higher proportion in the Api group ( as shown in Fig. 4C and D).
Pathogenic microorganism
Based on the information from the Pathogenic Bacteria Database (https://globalrph.com/bacteria/), we have identified 70 pathogenic bacteria in our results (Fig. 5A). Most of these pathogenic bacteria showed a higher relative abundance in the Aim group. Furthermore, we have performed LEfSe analysis, which revealed that 13 bacteria were identified as significantly different between the two groups (Fig. 5B). Among these, Actinomyces, Porphyromonas, Salmonella, Achromobacter, Stenotrophomonas maltophilia, Clostridium, Nocardia, and Enterobacter cloacae complex were found to have higher abundance in the Aim group compared to the Api group. On the other hand, Enterobacter cloacae, Enterobacter, Pseudomonas aeruginosa, and Nocardia showed higher abundance in the Api group compared to the Aim group.
KEGG functional analysis
Functional capabilities of the microbial community were assessed through a metagenomic study for KEGG functional analysis. Base on our criteria, pathways accounting for more than 1% of all paths were considered dominant. Subsequently, 20 predominant pathways were identified in the two groups (Fig. 6A). Among them, ribosome, oxidative phosphorylation, photosynthesis, quorum sensing, and two-component system were the top five enriched KEGG pathways. Further analysis was conducted to evaluate KEGG pathways that showed significant differences between the two groups. The results revealed that pathways such as Staphylococcus aureus infection, melanogenesis, chloroalkane and chloroalkene degradation, sphingolipid metabolism, biosynthesis of various secondary metabolites, Vibrio cholera infection, apoptosis, RAS signaling pathway, stilbenoid, diarylheptanoid and gingerol biosynthesis, and butanoate metabolism were among the top 10 significantly enriched KEGG pathways (Fig. 6B).
Resistance gene analysis
122 contigs were identified, out of which 19 were found to be associated with Antibiotic Resistance Ontology (ARO). The analysis of resistance gene abundance revealed that the most prevalent resistance gene was adeF ( as shown in Fig. 7A). Furthermore, when comparing the different resistance genes between Api and Aim, we conducted Metastats analysis using the CARD database. The results revealed the identification of five distinct genes: AAC6-IIa, cmlA9, qacEdelta1, tetD, and ANT3-Ii-AAC6-IId_fusion_protein ( as shown in Fig. 7B). Among these genes, AAC6-IIa and qacEdelta1 were both found to be present in the Proteobacteria phylum.