The phylogenetic analysis and the in silico DNA-DNA hybridization correlated the ANT_B131 to Streptomyces albidoflavus.
The Streptomyces albidoflavus ANT_B131 was isolated from Salp sp. collected in King George Island, Antarctica, during the summer of 2010. The phylogenetic identification using the sequencing of the 16S RNAr gene indicated that ANT_B131 belongs to the Streptomyces genera (See Supplementary Fig. 1), and was closest related to a clade with Streptomyces albidoflavus DSM 40455, Streptomyces hydrogenans NBRC 13475, Streptomyces daghestanicus NRRL B-5418, and Streptomyces violascens ISP 5183. According to the List of Prokaryotic names with Standing in Nomenclature (LPSN)40 there are 660 species of Streptomyces with a validly published name. The 16S rRNA gene sequencing has an important role in the taxonomy of Streptomyces, however, it does not always have the power to delimit species41. To improve the taxonomic analysis, we choose to perform the Multilocus Sequence Analysis (MLSA) of house-keeping genes and the in silico DNA-DNA hybridization (isDDH). MLSA provide the improvement of taxonomic resolution among species within the family Streptomycetaceae 42 and DDH is the gold standard for species delineation34.
The studied Streptomyces strain had its whole genome sequenced (accession number JADILI000000000) and, based on the genome annotation, five house-keeping genes were selected (see Supplementary Table 1) to the MLSA. The whole genome sequencing enables the comparation of the total genome of Streptomyces ANT_B131 with correlated strains, performing the isDDH. The DDH is a laborious technique, with which G + C content and melting profiles have been estimated, and could be done by HPLC and real-time PCR. After the advantage of genome sequence techniques, the G + C content analysis and the DDH between two bacterial genomes could be done in silico34.
The whole genome was sequenced with Illumina and assembled with SeqMan NGen version 14.0 software, resulting in a total genome length of 6,968,465 bp in 49 contigs, with an N50 of 236,213 bp and a GC content of 73.2%. The genome was annotated using Prokka that identified 6,110 coding sequences (CDSs), PGAP identified 6,055 protein-coding genes, Patric identified 6,459 CDS and RAST identified 6,550 CDSs and 413 subsystems (Table 1). We provide a high-quality draft genome with 97.2% completeness, 0.6% duplicated and 0.3% fragmented.
Table 1
Genome annotation features of Streptomyces albidoflavus ANT_B131
Feature
|
Prokka
|
PGAP
|
Patric
|
RAST
|
CDS
|
6,110
|
6,055
|
6,459
|
6,550
|
rRNA
|
7
|
16
|
65
|
16
|
tRNA
|
75
|
66
|
11
|
65
|
tmRNA
|
1
|
-
|
-
|
-
|
misc RNA
|
50
|
-
|
-
|
-
|
Subsystems
|
-
|
-
|
-
|
413
|
The MLSA included closest strains related to ANT_B131 and indicated that the Antarctic strain was closest related to Streptomyces albidoflavus NRRL B-1271 (Fig. 1). The average nucleotide identity based on BLAST + (ANIb) and in silico DNA–DNA hybridization (isDDH) values were calculated between ANT_B131 and its phylogenetically closest species S. albidoflavus J1074 (ANIb 97.02%; isDDH 79.60%; Difference in % G + C: 0.15) and S. albidoflavus SM254 (ANIb 97.41; isDDH 79.50%; Difference in % G + C: 0.17) (see Supplementary Table 2). The ANI and isDDH were above the cutoff of > 95% and > = 70%, respectively, and the difference in %G + C was < 1.0, these results indicates that ANT_B131 belongs to the same species of the tested strains34,36. The DDH values (%) are currently used as the gold standard for species delineation in bacteriology. The evaluation of the dDDH (digital DNA-DNA hybridization) enabled the delineation of the novel species Micromonospora zhangzhouensis, the productor of a diterpenoid with cytotoxic activity43. In this study, attending to taxonomic criteria applied to the genome sequence, it was possible to correlate the bacterium ANT_B131 to the species Streptomyces albidoflavus.
In general, the genomic results are in agreement with the characteristics of Streptomyces albidoflavus genomes previously reported on NCBI. There are 46 genomes related to Streptomyces albidoflavus deposited in GenBank (https://www.ncbi.nlm.nih.gov/genome/genomes/32384). The GC content ranging from 72.30 to 73.60% and the genome size ranging from 6.52 Mb to 7.45 Mb. The median total length of the genomes is 7,053,130 bp, the median protein count is 5,964 and the median GC% is 73.4.
Streptomyces albidoflavus ANT_B131 produces potential antimicrobials and antiproliferative.
Organic crude extracts obtained by ethyl acetate solvent extraction showed antimicrobial and antiproliferative activities. The antimicrobial activity was observed against Gram-positive and Gram-negative pathogenic bacteria, and against the fungus Candida albicans (Table 2). The MIC ranged from 0.25 to 2.0 mg.mL− 1. Antimicrobial activity related to crude extracts of Streptomyces isolated from Antarctica is documented against Gram-positive and Gram-negative bacteria, but little information is available about the antifungal activity of those crude extracts2,3.
The cytostatic antiproliferative activity of crude extracts were observed against cell lines: glioblastoma U251 (GI50 17.2 µL.ml− 1), human breast adenocarcinoma MCF-7 (GI50 195.3 µL.ml− 1), large cell carcinoma of lung NCI-H460 (GI50 0.35 µL.ml− 1), adenocarcinoma of kidney 786-0 (GI50 27.7 µL.ml− 1), and prostate PC-3 (GI50 24.8 µL.ml− 1) (Fig. 2).
Table 2
Minimum inhibitory concentration (MIC) and Mininimum Bactericidal Concentration (MBC) of crude extracts of Streptomyces sp. ANTB_131.
Strain
|
Origin
|
MIC (mg/mL) / MBC (mg/mL)
|
Gram-positive strains
|
|
|
Bacillus subtilis ATCC 6051
|
DM/UNICAMP
|
0,5 / 1,5
|
Micrococcus luteus ATCC 4698
|
DM/UNICAMP
|
0,25 / -
|
Staphylococcus aureus ATCC 6538
|
DM/UNICAMP
|
2,0 / -
|
Gram-negative strains
|
|
|
Neisseria meningitidis Y USA (Clinical isolation)
|
INCQS/FIOCRUZ
|
2,0 / -
|
Neisseria meningitidis C 2135
|
INCQS/FIOCRUZ
|
2,0 / -
|
Neisseria meningitidis B4 (Clinical isolation)
|
INCQS/FIOCRUZ
|
2,0 / -
|
Neisseria meningitidis W135 ATCC35559
|
INCQS/FIOCRUZ
|
2,0 / -
|
Fungi strain
|
|
|
Candida albicans ATCC10231
|
DM/UNICAMP
|
1,5 / 1,5
|
INCQS/FIOCRUZ: Special Laboratory of Bacteriology and Molecular Epidemiology, Department of Clinical, Toxicological and Bromatological Analysis, São Paulo State University (USP), Ribeirao Preto, SP, Brazil. DM/UNICAMP: Division of Microbiology, University of Campinas (UNICAMP), Paulínia, SP, Brazil. LABIOTEC: Biotechnology Laboratory, Faculty of Pharmaceutical Sciences, Campinas State University (UNICAMP), Campinas, SP, Brazil. ATCC: American Type Culture Collection.
|
Genome mining uncovers the genomic potential for Streptomyces albidoflavus ANT_B131 to produce natural products.
The genome mining study was performed using antiSMASH database and the manual curation revealed 24 putative Biosynthetic Gene Clusters (BGCs). The BGC encoding polyketides type I (5 of 24) and type III (1 of 24), non-ribosomal peptides (NRPS, 8 of 24), bacteriocins (2 of 24), terpenes (4 of 24), siderophores (2 of 24), hybrid compounds (4 of 24), ectoine, thiopeptide, lanthipeptide, and lassopeptide (Fig. 3, Table 3).
Despite the antiSMASH analysis, the manual curation validated the identification of three fragments of genes clusters related to surugamide A/D (cluster 13) and three related to candicidin (cluster 19). The surugamide A/D BGC had 82,441 bp and encodes a non-ribosomal peptide described as a antibacterial and antifungal compound44. The candicidin BGC had 138,203 bp and the correspondent metabolite showed antimicrobial activity against Candida albicans45.
The bioinformatic analysis by antiSMASH predicted terpenes with high BGC similarity: hopene (cluster 4, 100% of similarity), the volatile compound geosmin (cluster 8, 100% of similarity), the sesquiterpene antibiotic albaflavenone (cluster 9, 100% of similarity), the carotenoid Isorenieratene (cluster 17, 75% of similarity), naringenin (cluster 18, 100% of similarity). Albaflavenone and geosmin are responsible for the strong odor in streptomycete and are related to antimicrobial activity46. Interestingly, antiSMASH identified the gene cluster of SGR PTMs (cluster 3, 100% of similarity) related to a polycyclic tetramate macrolactams first identified in Streptomyces griseus. The Desferrioxamine B/E (cluster 15) and Ectoine (cluster 16) BGCs showed 100% of gene similarity. The Desferrioxamine B is a Fe(III) chelator, and is associated to iron deliver to microbes47. Ectoine provides protection against osmotic stress48. The antimycin BGC (cluster 23, 80% of similarity) is a secondary metabolite of Streptomyces sp. with cytotoxic, antifungal, antibiotic, anticancer activities49. The BGCs relate to kanamycin (cluster 8), thiopeptides fluostatins M-Q (cluster 10) and type I PKS Pacificanone A (cluster 20) showed less than 20% of similarity.
Table 3
Biosynthetic gene clusters identified in the genome of Streptomyces albidoflavus ANT_B131.
Number
|
Cluster location
|
Cluster size (kb)
|
Type
|
Predicted product
|
1
|
Contig 79 (164714 to 210406)
|
45693
|
NRPS
|
Diisonitrile antibiotic SF2768
|
2
|
Contig 79 (229633 to 304621)
|
74989
|
NRPS / Terpene
|
Orphan
|
3
|
Contig 79 (423354 to 472764)
|
49411
|
NRPS / T1PKS
|
SGR PTMs
|
4
|
Contig 79 (503123 to 529681)
|
26559
|
Terpene
|
Hopene
|
5
|
Contig 79 (608808 to 619023)
|
10216
|
Bacteriocin
|
Orphan
|
6
|
Contig 42 (6276 to 17604)
|
11329
|
Bacteriocin
|
Orphan
|
7
|
Contig 66 (20876 to 35889)
|
15013
|
Siderophore
|
Orphan
|
8
|
Contig 104 (6303 to 28603)
|
22301
|
Terpene / Other
|
Geosmin/Kanamycin
|
9
|
Contig 52 (386568 to 407620)
|
21053
|
Terpene
|
Albaflavenone
|
10
|
Contig 2 (123277 to 155794)
|
32473
|
Thiopeptide
|
Fluostatins M-Q
|
11
|
Contig 10 (163019 to 224651)
|
48914
|
NRPS
|
Orphan
|
12
|
Contig 10 (263300 to 313589)
|
62236
|
NRPS
|
Orphan
|
13
|
Contig 80 (1 to 29890)
|
29890
|
NRPS
|
Surugamide A / Surugamide D
|
|
Contig 33 (1 to 41377)
|
41377
|
NRPS
|
Surugamide A / Surugamide D
|
|
Contig 102 (46186 to 77679)
|
30893
|
NRPS
|
Surugamide A / Surugamide D
|
14
|
Contig 71 (189212 to 233561)
|
44395
|
NRPS
|
Orphan
|
15
|
Contig 83 (59655 to 71475)
|
11820
|
Siderophore
|
Desferrioxamine B / Desferrioxamine E
|
16
|
Contig 81 (106501 to 116899)
|
10399
|
Ectoine
|
Ectoine
|
17
|
Contig 55 (39767 to 65616)
|
25849
|
Terpene
|
Isorenieratene
|
18
|
Contig 85 (1860 to 42957)
|
41097
|
T3PKS
|
Naringenin
|
19
|
Contig 85 (47893 to 101940)
|
54047
|
T1PKS
|
Candicidin
|
|
Contig 3 (1 to 22674)
|
22674
|
T1PKS
|
Candicidin
|
|
Contig 25 (11507 to 70522)
|
59015
|
T1PKS
|
Candicidin
|
20
|
Contig 21 (1 to 31408)
|
31408
|
T1PKS
|
Pacificanone A
|
21
|
Contig 27 (1 to 5181)
|
5181
|
T1PKS
|
Orphan
|
22
|
Contig 40 (1 to 4749)
|
4749
|
T1PKS
|
Orphan
|
23
|
Contig 47 (1 to 74671)
|
74671
|
NRPS-T1PKS
|
Antimycin
|
24
|
Contig 22 (74379 to 96762)
|
22384
|
Lassopeptide
|
Orphan
|
In the antiSMASH analysis of the genomes of the two strains closest related to the Streptomyces albidoflavus ANT_B131, we found similarities in the BGCs number and type. The BGCs identification (Fig. 4) indicated that are common types of core biosynthetic genes for secondary metabolites production between the three strains, such as, the NRPS, PKS, Hybrid PKS-NRPS, Terpene and Siderophore. The metabolites hopene, geosmin, surugamide, desferrioxamine B, candicidin, ectoine, isorenieratene and SGR-PTM were identified in all the genomes with similarity above 75%. Biosynthetic clusters related to RiPPs, such as lanthipeptides, thiopeptides and bacteriocins were also found. However, a blast analysis did not show similarity between the RiPPs BGCs of Streptomyces albidoflavus or even other bacteria. This result indicated that these orphan clusters (clusters 6, 10, 28, and 30) have a potential to be new molecules with unknow biological activities.
The three strains showed closest percentage of the genome attributed to biosynthetic genes involved in secondary metabolite production. Streptomyces albidoflavus ANT_B131 had 13.3% (925,376 bp) of the genome related to the secondary metabolism, S. albidoflavus SM254, genome size 7,170,505 bp, attributed 13.58% and S. albidoflavus J1074, genome size 6,841,649 bp, attributed 13.0% of the genome.
Five BGCs were expressed
After the genome sequencing and the BGC identification by antiSMASH, we performed an experiment to improve the expression of the biosynthetic gene clusters related to the secondary metabolites production. The Classical Molecular Networking analysis by GNPS enables the identification of compounds related to BGCs. GNPS provides the processes, analysis or identification of tandem mass (MS/MS) spectrometry data of a data set and compare the data to all publicly available data39. To generate networks, Molecular Networking utilizes the MS/MS data. The nodes, in the network, represent molecules and edges represent spectral similarity between pairs of molecules12.
MS/MS spectra of S. albidoflavus ANT_B131 extracts were identified as a hit in GNPS database (see Supplementary Fig. 2–5) and four metabolites were putatively identified: surugamide A (1), surugamide D (2), desferrioxamine B + Al (3) and desferrioxamine E (4). Also, based on the MS/MS spectra and comparison with other databases (METLIN and MassBank of North America), the metabolite ectoine (5) was manually identified in S. albidoflavus extracts (see Supplementary Fig. 6). The identified compounds were detected only in the bacterial extracts and absent in the controls (culture media extracts) (see Supplementary Fig. 6–10), and all their structures are represented in Fig. 5. The MS data of the detected metabolites have mass errors below 5 ppm and are shown in Table 4.
Table 4
Genome annotation features of Streptomyces albidoflavus ANT_B131
Compound
|
Ion
|
Ion Formula
|
Calculated m/z
|
Experimental m/z
|
Error (ppm)
|
Surugamide A
|
[M + H]+
|
C48H82N9O8
|
912.6286
|
912.6294
|
0.9
|
Surugamide D
|
[M + H]+
|
C47H80N9O8
|
898.6129
|
898.6116
|
-1.4
|
Desferrioxamine B + Al
|
[M-2H + Al]+
|
C25H46N6O8Al
|
585.3192
|
585.3202
|
1.7
|
Desferrioxamine E
|
[M + Na]+
|
C27H48N6O9Na
|
623.3380
|
623.3404
|
3.8
|
Ectoine
|
[M + H]+
|
C6H11N2O2
|
143.0820
|
143.0818
|
-1.4
|
Ion [M + H]+ at m/z 912.6294 was identified as compound 1. MS/MS spectrum of this ion yielded to fragments at m/z 261.1602, 298.2084, 374.2498, 539.3926, 652.4693 and 884.6316, typical fragments for the structure of 1 (see Supplementary Fig. 7E) and already reported in the literature50. Also, surugamide 2 was identified as ion [M + H]+ at m/z 898.6116. Fragmentation of 2 was close to fragmentation of 1 due structure similarity (see Supplementary Fig. 8E) and yielded to fragments 261.1600, 374.2493, 638.4719 and 785.5196. Surugamide 1 was identified as intracellular and extracellular metabolite. Compound 1 was found in the mycelium and the media of two media, NB-ASW and GYE-ASW, when applied methanol as extract solvent (Fig. 6). However, compound 2 was identified only as intracellular metabolite of all culture media evaluated, also recovered by methanol extraction. Surugamide 1 was reported with antifungal activity against Saccharomyces cerevisae and was expressed in Streptomyces albidoflavus J107444, antimicrobial activity against Staphylococcus aureus (MIC of 10 µM)51, and inhibit bovine cathepsin B (IC50 of 21 µM)52, a cysteine protease that implicated in invasion of metastatic tumor cells. Surugamide 2 showed the inhibitory activity against bovine cathepsin B (IC50 of 18 µM)52.
GNPS database identified ions [M-2H + Al]+ at m/z 585.3202 and [M + Na]+ at m/z 623.3404 as desferrioxamines B and E, respectively. MS/MS spectrum of 3 yielded to fragments at m/z 385.2015, 425.1973, 467.2067 and 568.2982, typical fragmentation pattern for the structure of desferrioxamine B complexed with Al(III) (see Supplementary Fig. 9E). Close fragmentation pattern was reported for desferrioxamine B complexed with Fe(III) (difference of 29 Da in the mass fragments due the difference of mass between Al and Fe)53. For compound 4, mass fragmentation yielded to fragments at m/z 423.2225 and 223.1101, this fragmentation pattern corresponds to the typical neutral losses of molecules of succinylcadaverine (200.1160 Da)54. Compounds 3 and 4 were recovery using all types of extraction methods when GYE-ASW medium was used to bacterial growth. The compounds were also identified as intra and extracellular metabolite of ISP2-ASW medium, recovered by methanol extraction. Compound 3 was identified as in NB-ASW medium using methanol as extraction solvent (see Supplementary Fig. 7). Desferrioxamine B is an iron-chelating agent used clinically to treat pathological iron deposition in human and iron poisoning, as Al(III)55. The antimicrobial activity of desferrioxamine B against E. coli is related to inhibition of bacterial motility56. Desferrioxamine E or nocardamine belongs to a group of microbial siderophores and has weak antimicrobial activity against E. faecium and B. subtilis and showed inhibition of colony formation of melanoma cancer cells (IC50 from 12 to 18 µM)57.
Ion [M + H]+ at m/z 143.0818 were identified as compound 5, and yielded to fragments at m/z 101.0706 and 113.5968. MS/MS spectra of ectoine at METLIN and MassBank of North America databases showed close fragmentation pattern, with fragment at m/z 101.0709 as one of the main fragments for compound 5 (see Supplementary Fig. 6E). Compound 5 was identified as intracellular metabolite in methanolic extract of all culture media. Also, the compound could be identified in the NB-ASW culture medium as an extracellular metabolite recovered by methanol extraction (Fig. 6). Ectoine has an important role as osmoprotectant, helps to prevent whole cell damage and loss of viability58. The crude extract of NB-ASW medium extracted using ethyl acetate was the extract used for the biological assays, antiproliferative and antimicrobial. However, the GNPS database did not correlate any secondary metabolite to this crude extract.
These results revealed that the use of different culture media enabled the condition to Streptomyces albidoflavus ANT_B131 express the BGCs related to the secondary metabolites. Also, the different extraction methods and solvents enable the recovery of these compounds. In this study, we could express 20% of the BGCs identified by antiSMASH (5 of 25). Studies conducted on Streptomyces fildesensis S013.3 and Pseudoalteromonas luteoviolacea showed the expression of zero and 10% of the BGCs identified, respectively2,59. We showed that the diversity of culture media and extraction methods could enhance the BGCs expression and recovery of natural products. This approach could be a new strategy to intensify the BGC expression of natural products.
Genome mining made possible the identification of BGCs related to RiPPs, such as bacteriocins, thiopeptide and lassopeptide. These clusters did not presented similarity with none BGC described in literature and Molecular Networking analysis did not show results for any class of RiPPs, but further work is required to detect their production in these organisms. The heterologous expression is a potential tool to discover these natural products.
Besides the efforts to get the compounds that showed similarities with biosynthetic gene clusters, we could not correlate any of the compounds identified in this study to the crude extract that showed antimicrobial and antiproliferative activity, the growth in NB-ASW media and extracted by ethyl acetate. Additional genome and biological assays should be done to confirm the presence and novelty of secondary metabolites for this condition and activity. Also, heterologous expression will be applicable, in future study, to get the orphan BGCs undefined associated to S. albidoflavus ANT_B131 genome.