Phylogenetic analysis based on 16S rRNA gene sequence
The nearly complete 16S rRNA gene sequence of strain YIM B05605T contains 1545 nucleotides. Strain YIM B05605T was closely related to C. candidum 18JY8-7 T with 96.61 % 16S rRNA gene sequence similarity. The neighbor-joining phylogenetic tree showed that strain YIM B05605T formed a distinct and stable clade within the genus Cohnella with C. fontinalis YT-1101T as the closest relative, and supported by 100 % bootstrap value (Fig. 1). The similar topological results were also obtained from the maximum-likelihood and maximum-parsimony phylogenetic trees (Figs. S1 and S2). It was obvious that strain YIM B05605T were phylogenetically affiliated to the genus Cohnella.
Genomic analysis and annotation of gene functions
The dDDH, ANI values between YIM B05605T and C. fontinalis YT-1101T were 15.7 %, 73.42 %, respectively. These values were significantly lower than the currently established boundaries for genomic species definition (70 % for dDDH, 95-96 % for ANI) (Wayne 1988; Richter and Rosselló-Móra 2009), which demonstrated that strains YIM B05605T and C. fontinalis YT-1101T were not belong to the same species. According to the genome-based phylogenetic tree (Fig. 3), strain YIM B05605T formed a distinct phylogenetic cluster within the genus Cohnella with a high bootstrap value, confirming the topology determined by 16S rRNA gene sequences. These results proved that strain YIM B05605T should be assigned to the genus Cohnella and represented novel species.
The genome of strain YIM B05605T was assembled into 55 contigs with a total length of 5,510,416 bp. The G + C content was 58.76 mol% with N50 value of 243,336 bp. Strain YIM B05605T was predicted to harbor 4,957 protein-coding genes (CDS), The numbers of tRNA and rRNA were 45 and 4, respectively. The genome of strain C. fontinalis YT-1101T was assembled into 161 contigs with a total length of 6,658,403 bp. The G + C content was 57.78 mol% with N50 value of 161,101 bp. Strain C. fontinalis YT-1101T was predicted to harbor 5,690 protein-coding genes (CDS), The numbers of tRNA and rRNA were 56 and 7, respectively. Sequences matching antibiotic resistance genes and virulence factors were retrieved from the genomes of strains YIM B05605T and C. fontinalis YT-1101T in accordance with the analysis carried out using CARD, PHI and VFDB database. 314 and 350 antibiotic resistance genes (annotated in CARD database) were respectively predicted in genomes of strains YIM B05605T and C. fontinalis YT-1101T. More detailed general features of the genome of strains YIM B05605T and C. fontinalis YT-1101T were summarized in Table 1 and the distribution of specific genomic regions was displayed in Fig. 2.
Functional analysis by COG database revealed that strains YIM B05605T and C. fontinalis YT-1101T had 4,131 and 5,019 genes, respectively. Both assigned to 21 and 23 categories, respectively. The most abundant category of strain YIM B05605T was carbohydrate transport and metabolism (G, 16.2 %). In this, gene 4225 encodes a chitinase (table S1). Followed by transcription (K, 12.5 %), signal transduction mechanisms (T, 10.7 %), general function prediction only (R, 9.4 %), amino acid transport and metabolism (E, 7.4 %), cell wall/membrane/envelope biogenesis (M, 6.2 %). The most abundant category of strain C. fontinalis YT-1101T was function unknown (S, 30.5 %), carbohydrate transport and metabolism (G, 14.3 %). In this, gene 0645, gene 0970 and gene 1012 encode three chitinases (table S2). signal transduction mechanisms (T, 8.7 %), transcription (K, 8.4 %), inorganic ion transport and metabolism (P, 6.4 %), amino acid transport and metabolism (E, 4.9 %). The detailed distributions of genes associated with general COG functional categories in the genome of strains YIM B05605T and C. fontinalis YT-1101T were presented in Fig. S3. Functional analysis by CAZy database found that 215 genes in genome of strain YIM B05605T and 285 genes in strain C. fontinalis YT-1101T were assigned to CAZy families. The strain YIM B05605T have 5 functional CAZy classes, the largest class was glycoside hydrolases (GHs, 122 domain), followed by carbohydrate esterases (CEs, 46 domain), glycosyl transferases (GTs, 39 domain), auxiliary activities (AAs, 17 domain) and carbohydrate‐binding modules (CBMs, 1 domain). In addition to the above 5 functional CAZy classes, there is polysaccharide lyases (PLs, 8 domain) in strain C. fontinalis YT-1101T. The detailed distribution of genes associated with general CAZy functional categories in the genome of strains YIM B05605T and C. fontinalis YT-1101T were presented in Fig. S4. Functional analysis by KEGG database revealed that strains YIM B05605T and C. fontinalis YT-1101T had 2,606 and 2,763 genes, respectively. In strain YIM B05605T of carbohydrate metabolism, gene 4010 encodes a chitinase, both gene 3531 and gene 4979 encode starch synthase (table S3). In strain C. fontinalis YT-1101T of carbohydrate metabolism, gene 0604 encodes a chitinase (table S4).The detailed distribution of genes associated with general KEGG functional categories in the genome of strains YIM B05605T and C. fontinalis YT-1101T were presented in Fig. S5.
The prediction of antiSMASH revealed that strain YIM B05605T contained six biosynthetic gene clusters encoded for secondary metabolites, including redox-cofactor, LAP, terpene, T3PKS, cyclic-lactone-autoinducer and RiPP-like. Among them, lankacidin C , as a microtubule stabilizer by binding to the paclitaxel binding site, by 17-membered polyketide, exhibits considerable antitumor activity (Ayoub et al. 2022). Lankacidin C and lankamycin, a 14-membered macrolide produced in the same strain, can synergistically inhibit protein synthesis by binding to the neighboring sites in the bacterial large ribosomal subunit (Auerbach et al. 2010; Belousoff et al. 2011). Lankacidins exert considerable antitumor activity against various cell lines including L1210 leukemia, B16 melanoma, HeLa, and the breast cancer T47D cells (Ootsu and Matsumoto 1973; Oostu et al. 1975; Ayoub et al. 2016).
Morphology, physiological and biochemical characteristics
Strain YIM B05605T was Gram-stain positive, strictly aerobic and endospore-forming. Under transmission electron micrographs, the shape of the cells was rod-shaped (approximately 0.37-0.93 × 1.97-3.21 µm) with flagella and the endospores (Fig. 4). Colonies of strain YIM B05605T was punctiform, circular and white-cream on TSA when incubated at 45 °C for 5 days. Strain YIM B05605T grew at pH ranging from pH 6.0 to 8.0 with optimum growth at pH 7.0which was slightly different from its experimental control (C. fontinalis YT-1101T; Table 2). Strain YIM B05605T grew at temperatures ranging from 37 to 50 °C with optimum growth 45 °C, NaCl tolerance ranging from 0 to 3 % with optimum growth 0.5 %. Like strain C. fontinalis YT-1101T, strain YIM B05605T was positive for oxidase and catalase. Negative for nitrate reduction , while C. fontinalis YT-1101T was positive. Unlike C. fontinalis YT-1101T, strain YIM B05605T was positive for hydrolysis of starch and gelatin. Strains YIM B05605T and C. fontinalis YT-1101T were negative for hydrolyzed of tweens 20, 40, 60 and 80, H2S production, cellulose, milk peptonization and coagulatio. In API 20NE and API ZYM strips, strain YIM B05605T was negative for tryptophan, glucose, D-mannitol, gluconate, esterase lipase C8, leucine arylamidase and α-fucosidase, while C. fontinalis YT-1101T was positive. Strain YIM B05605T was positive for acid phosphatase, naphthol-AS-BI-phosphohydrolase and β-glucosidase, while C. fontinalis YT-1101T was negative. The data of API 50CH, Biolog GEN III MicroPlate tests and antibiotic susceptibility for strains YIM B05605T and C. fontinalis YT-1101T were shown in Table S5, Table S6 and Table S7, respectively.
Chemotaxonomic characterization
The polar lipid of strain YIM B05605T was comprised of diphosphatidylglycerol (DPG), phosphatidylglycerol (PG), phosphatidylethanolamine (PE), phosphatidylmonomethylethanolamine (PME), unidentified glycolipid (GL), three unknown aminophospholipids (APLs) and unidentified polarlipid (PL) (Fig. S6). MK-7 were the isoprenoid quinone present in strain YIM B05605T, which was consistent with members of the genus Cohnella. The cell wall peptidoglycan contained glutamic acid, serine, threonine and alanine, the diagnostic diamino acid was meso-diaminopimelic acid. The predominant fatty acid (>10 %) of strain YIM B05605T were identified as anteiso-C15:0 (41.34 %) and iso-C16:0 (27.60 %) and that of C. fontinalis YT-1101T were anteiso-C15:0 (44.81 %) and iso-C16:0 (28.37 %). The fatty acid profiles of strains YIM B05605T and C. fontinalis YT-1101T showed disparities from the reference strains were presented in Supplementary Table S8.