Identification of Angucycline Biosynthetic Gene Cluster spi1 in HDN155000.
Genome mining on the marine derived actinomyces strain Streptomyces sp. HDN155000 assisted by online tools of antiSMASH17 and 2ndfind identified a type II BGC named spi1 (GenBank accession no. OP009365) which contains a unique set of FMO genes (spiH1, spiH2 and spiH3) with spiH1 showing only 46% identity to its nearest homologue of BexI, a proposed monooxygenase in the anthrone biosynthetic pathway (Fig. 1a, Supplementary Table 5).18–19 Cluster alignment revealed the spi1 BGC spans on a 29 kb of continuous DNA region and contains 26 open reading frames (ORFs, Fig. 1a, Supplementary Table 5) with putative genes that encode seven minimal type II polyketide synthase enzymes (spiABCDEFG), ten sugar biosynthesis related enzymes (spiLNPQRSTUVW), one drug resistance transporter (spiO), and five tailoring enzymes including three monooxygenases (spiH1, spiH2 and spiH3) plus two putative methyltransferases (spiM and spiI). Apart from the above designated genes, two putative primary metabolic-related genes (spiJ and spiK) and one gene with unpredictable function (orf1) were also presented in spi1.20 Detailed protein phylogenetic analysis showed SpiH1 is evolutionarily close to ABM superfamily FMOs but clearly locates in a clade separated from AlpJ/JadG/GiloII/ChaZ/GrhO5, the post-PKS redox enzymes responsible for the biosynthesis of kinamycins (alp)11, jadomycin (jad)14–15, gilvocarcin (Gil)13, chartreusin (Cha)6, and griseorhodin (grh)21 (Fig. 1b and Supplementary Fig. 8), which indicated the metabolic capacity of spiH1 in diversifying angucycline structures. The spi1 BGC was therefore selected for direct cloning and heterologous expression in Streptomyces hosts.
Direct Cloning and Heterologous Expression of spi1 BGC
The spi1 BGC was cloned into the p15A vector in Escherichia coli using Red/ET recombineering technology via the linear plus linear homologous recombination strategy (LLHR)22–23 to generate the recombinant plasmid p15A1 (Fig. 2a and Supplementary Fig. 9). To facilitate site-specific integration and heterologous expression, p15A1 was modified by inserting the oriT-attP-phiC31 cassette PCR amplified from pR6K-oriT-phiC3122–23 using the Red/ET linear plus circular homologous recombination method (LCHR), thus yielding the plasmid p15A2 (Fig. 2a, Supplementary Table 6 and Fig. 9). Plasmid p15A2 was then introduced into S. coelicolor A3(2) to generate strain S. coelicolor A3(2)/p15A2. The strain harboring the whole spi1 BGC was cultured in M1 medium using a shake flask under 30 ℃, 200 rpm. However, compared with the background of wild type S. coelicolor A3(2), no new peak was detected from the crude extract of S. coelicolor A3(2)/p15A2 as analyzed by HPLC (Fig. 2b, trace ii). To activate the expression of the spi1 BGC, one strong promoter, kasOp*24, was then inserted downstream to spiH3 to form p15A3 (Supplementary Fig. 10). The plasmid p15A3 was retransformed into S. coelicolor A3(2) and the resulting strain S. coelicolor A3(2)/p15A3 (Supplementary Fig. 10c) successfully produced a series of new peaks detected by HPLC at a UV absorption of 210 nm (Fig. 2b, trace iii).
Scaled culture (60 L) of S. coelicolor A3(2)/p15A3 followed by purification with column chromatography of silica gel, ODS, sephadex LH-20, and semi-HPLC led to the identification of four new angucycline structures named spirocycliones A-B (1–2) and angumycinones E-F (3–4) along with two known compounds rabelomycin (5)25 and DHR (6)25 from the crude extract (Figs. 2b-c). The planar structures of compounds 1–4 were elucidated based on 1D and 2D NMR data (Fig. 2c and Supplementary Tables 2–3). The relative configurations were assigned based on NOESY signals and the calculated iJ/dJ-DP4 probability. By applying the ECD calculation method using the time dependent density functional theory (TDDFT) at the B3LYP/6–31 + G(d) level, the absolute configurations of 1–2 were defined as (8aS, 13R)-1 and 8aS-2, respectively (Supplementary Table 1 and Figs. 1–3). Single-crystal X-ray diffraction analysis by Cu Kα radiation confirmed the absolute configuration of 3 (6aS, 7S, 8aR, 12aR, 12bR) (Fig. 4, CCDC 2181459). Considering the same biosynthetic origin, the absolute configuration of 4 was presumed to be consistent with 3. Hence, the absolute configuration of 4 was determined as 6aS, 8aR, 12aR, 12bR.
Among those structures, compound 1 represents the first example of angucycline derivatives with an oxaspiro[benzochromene-2,1'-cyclohexane] architecture, and compound 2 possesses an intriguing benzochromene scaffold appended with two carboxylic acid terminals which probably derived from the A ring cleavage of compound 1. These obtained structures exhibited cytotoxicity against a panel of cancer cell lines with compound 4 showing promissing activity to the multidrug-resistant human small-cell lung carcinoma cell line of H69AR with an IC50 value of 0.97 µM, which was comparable to the positive control of adriamycin (ADM, Table 1).
Table 1
In Vitro Antitumor Activities of the spirocycliones (IC50, µM).
Compd | MDA-MB-231 | K562 | ASPC-1 | H69AR | H69 |
1 | 12.65 | > 50 | > 50 | > 50 | > 50 |
2 | > 50 | > 50 | > 50 | 23.39 | > 50 |
3 | 25 | 4.38 | > 50 | 25 | > 25 |
4 | 2.16 | 2.05 | 8.23 | 0.97 | 10.48 |
ADM | 0.31 | 0.24 | 0.22 | 0.52 | 0.36 |
FMOs of SpiH1, SpiH2 and SpiH3 Involved in the Biosynthesis of Atypical Angucyclines
The structural novelty and bioactivity presented by these angucyclines increased our interest in studying their biosynthetic routes, especially the formation of the oxaspiro structure in rings A and B of 1 and the cleavage of ring A in 2. Initially, to reconfirm the correlation of the spi1-cluster with the production of these angucyclines, spiA, the key gene which encodes the type II polyketide synthase was inactivated from the spi1 BGC using LCHR in E. coli GB08-red and the mutant plasmid was re-transformed into S. coelicolor A3(2); as a result, the production of new peaks were abolished (Supplementary Fig. 11a). To rule out the heterologous host redox modification effects on the formation of novel structures 1–2, the p15A3 plasmid was transformed into different host strains of S. venezuelae (ISP 5230) and S. lividans K4-114, and the productions of 1 and 2 were detected from both cultures (Supplementary Fig. 11b), reconfirming the involvement of spi1 BGC in the biosynthesis of the above atypical angucycline structures.
It has been reported that the biosynthesis of angucycline structures share an initial biosynthetic pathway, which includes the formation of the polyketide backbone and early processing steps on it to provide the common intermediates such as UWM61. The highly active intermediate UWM6 could undergo complex oxidative modifications and carbon skeleton rearrangements to give diversified tetracyclic or tricyclic frameworks. During this process, redox tailoring enzymes, especially the FMOs, are revealed to play the pivotal roles in modifying the early angular intermediates and generating structural complexity and diversity (Fig. 3).
To gain a better understanding of the interesting rearrangement cascade from the early tetracyclic intermediate into to the final products including 1–2, we focused on the three FMOs of SpiH1, SpiH2 and SpiH3, which were thought to be responsible for the post-PKS modification of early angular skeletons. At first, we inactivated spiH1, spiH2, and spiH3 individually from p15A3 using the Red/ET LCHR method to give p15A3KOH1, p15A3KOH2 and p15A3KOH3, which were then transformed and expressed in S. coelicolor A3(2) (Supplementary Fig. 12).
As a result, the mutant plasmid p15A3KOH1 failed to produce compound 2 while maintaining the ability to produce compound 1, however, the production of 1–2 were both abolished within the p15A3KOH2 and p15A3KOH3 mutants (Fig. 2a, trace v-vii). Chemical analysis of the S. coelicolor A3(2)/p15A3KOH1 strain led us to isolate a new tetracyclic angular structure gephyromycin E (8) together with a known compound gephyromycin (7), which have been reported as products from early oxidative modification steps26 catalyzed by SpiH2 and SpiH3 homologues (Fig. 2a, trace v and Supplementary Figs. 5–6). And two extra known compounds rabelomycin (5) together with RM20C (9), whose skeleton was reported to be generated from the spontaneous cyclization of the linear decaketide skeleton27 were accumulated and obtained from the strains of S. coelicolor A3(2)/p15A3KOH2 and S. coelicolor A3(2)/p15A3KOH3, respectively (Fig. 2a, trace vi and trace vii). Those results indicated that SpiH2 and SpiH3 probably worked upstream in the biosynthetic process relative to SpiH1 and converted early angular intermediates such as UWM6 into oxidized structures, including the oxaspiro product 1 and other oxidative modified structures such as 3–8, while SpiH1 worked downstream to convert 1 into the ring-opened structure of 2 (Fig. 3).
SpiH2 and SpiH3 Are Responsible for the Formation of Spirocyclione A (1)
Phylogenetic analysis showed SpiH2 clustered with PgaM28, BexM19 and UrdM9, which are two-domain flavoproteins that contain an N-terminal domain homologous to FAD-dependent monooxygenases fused to a C-terminal domain homologous to SDRs via a short linker region (Supplementary Figs. 13–14). To test the in vitro function of SpiH2 in the formation of compound 1, we tried to purify SpiH2 expressed in E. coli BL21(DE3), however, as in similar cases reported for its homologues, the protein was mostly produced as inclusion bodies29, which prevented the in vitro attempts. We then tested the in vivo activity of SpiH2 within an alternative host. Plasmid p15A5 was constructed by inserting spiH2 into the p15A backbone (Supplementary Fig. 15b) and transformed into S. coelicolor A3(2). The resulting strain S. coelicolor A3(2)/p15A5 was fed with extracts from S. coelicolor A3(2)/p15A3KOH2 (extract-1) and BAP1/pGro7/pXY-1/pXY-3/pXY-6 (extract-2)30, both of which have the capability to supply early angular intermediates such as UWM6 and SF2315A31, which are presumed precursors of 1.
The cultures were then subjected to LC-MS detection, but compound 1 was not detected from the fermentation products (Fig. 4a), which revealed SpiH2 was not capable of converting early angular intermediates to compound 1. However, interestingly one new peak, characterized as 7-hydroxyfridamycin E (10)32 was identified in the mixture (Fig. 4A). To further test the activity of SpiH2 in correlation with the production of 10, we fed rabelomycin (5), a proposed precursor of 10 to S. coelicolor A3(2)/p15A5. The resulting HPLC profile indicated one extra peak, which was further identified as urdamycin L (12), the proposed BV oxidation product of 10 catalyzed by the SpiH2 homologue UrdM9–10, which experimentally confirmed the BV oxidative C-C bond cleavage ability of SpiH2 on an agucycline skeleton.
To continue interrogating the biosynthesis of 1, we focused on SpiH3, which was also predicted to be an NADPH-dependent flavoprotein hydroxylase based on phylogenetic analysis and clustered with PgaE28, BexE19, and UrdE9 (Supplementary Fig. 13). The gene spiH3 was inserted into the pColdI backbone to construct plasmid pColdIH3. An induced culture of BL21(DE3) containing pColdIH3 fed with extract 2 successfully produced 1 (Fig. 4b), which preliminarily confirmed SpiH3 was able to convert early angular intermediates to the oxaspiro[5.5]undecane featured compound 1. We then planned to test the function of SpiH3 in vitro using UWM6 and SF2315A as substrates, the most possible precursors leading to 1. However, efforts were obstructed by failing in preparation of both UWM6 and SF2315A due to their highly active and unstable properties. Instead, an analogue of SF2315A, characterized as a new compound named kanglemycin D (11) was obtained from the S. coelicolor A3(2)/p15A3KOH1 culture. We then purified N-His6-tagged SpiH3 from E. coli BL21(DE3)/pColdIH3 (Supplementary Fig. 18a) and tested its enzymatic activity with compound 11 as a substrate instead of UWM6 and SF2315A. As a result, a new peak was detected from the reaction mixture. Due to the low catalytic efficacy and resulting trace amount of output, we failed to purify it. However, the MS spectrum showed an ion peak at m/z 356.89 [M + H]+, which suggested SpiH3 was able to introduce an oxygen atom into the skeleton of 11 (Supplementary Fig. 18b), supporting the hypothesis that SpiH3 could catalyze oxidation of early angular intermediates to give compound 1.
Based on the above results, we illustrated the conversion steps from early angular intermediates to the novel oxaspiro architecture of 1, which involved SpiH2 and SpiH3 working cooperatively for sequential oxidations. UWM6, the early common intermediate, was transformed into SF2315A probably catalyzed by SpiH2 and SpiH3 through successive redox reactions. The Δ12b double bond of SF2315A was then opened by SpiH3 through another flavin-dependent oxidation step to give the proposed intermediate 1′, which further underwent C12a-C12b carbon bond cleavage and ring rearrangement to generate the final product of 1 (Fig. 4c). SpiH3 homologues have been well investigated for their capabilities of hydroxylating early angular intermediates at the C1 position9,25, while the production of compound 1 represents the first example of a flavin-dependent C12a-C12b carbon bond cleavage and hydroxylation at the C12a position, thus resulting in the formation of the oxaspiro[5.5]undecane structure in rings A and B.
To consolidate the proposed pathway from SF2315A to 1 and better understand the flavin-dependent SpiH3 catalyzed ring rearrangement process, we conducted density functional theory (DFT) calculations on the reaction between SF2315A and the simplified coenzyme factor Fl-OO−. As shown in Fig. 5, the processes started with the 1,4-michael addition of Fl-OO− to the naphthoquinone part of SF 2315A to easily form the conjugate 1′ via transition state TS1 (21.0 kcal mol-1), then tautomerized into enol form 1′a. The TS2 is calculated as high as 31.3 kcal mol− 1, indicating that the epoxidation of the enol double bond cannot smoothly occur at room temperature without the catalysis of SpiH3. Once the departure of Fl-O−, the process is hugely exothermic and to obtain the epoxy compound 1′b, which could hardly go through the bond cleavage reaction with the energy barrier TS3′ is 64.1 kcal mol− 1 (Supplementary Fig. 16). However, with the aid of imidazole and its conjugated acid, which built up proton shuttles, the latter processes could undergo rapidly at room temperature. This provided the other evidence that the enzyme SpiH3 participates in the whole processes of rearrangement. The epoxy ring expanded into the seven-membered compound 1′′ through a bond cleavage reaction, then the intramolecular attack of the hydroxyl group to give the bridged-ring 1′′′. Finally, through sequential reactions including the ether bond break and the capture of proton to afford the spiral compound 1.
SpiH1 Is the Key Enzyme for the Biosynthesis of Spirocyclione B (2)
Phylogenetic analysis identified SpiH1 as an BVMO within the ABM superfamily, but clustered in a different clade from other characterized BVMOs in type II PKS biosynthesis like, for example, JadG and AlpJ, the key enzymes responsible for ring B cleavage of benz[a]anthracene backbones (Fig. 1b). To reconfirm SpiH1 was directly related to the formation of 2, in trans complementation of the SpiH1 deficiency was achieved by transforming S. coelicolor A3(2) with p15A6, a plasmid with insertion of spiH1 into p15A3KOH1 backbone (Fig. 2b, trace viii, Supplementary Fig. 17). Results showed the production of 2 was successfully recovered in the S. coelicolor A3(2)/p15A6 strain, verifying the involvement of SpiH1 in 2 formation. Next, compound 1, the postulated intermediate leading to 2 was added to the culture of the spiH1 over-expressed strain, which was achieved by transforming S. coelicolor A3(2) with p15A4, a plasmid carrying only spiH1within the p15A backbone (Supplementary Fig. 15a). As shown in Fig. 6a, LC-MS analysis revealed 2 was successfully produced in the culture, which further confirmed the function of SpiH1 in converting 1 to 2. To further investigate the BV oxidation activity of SpiH1, N-His6-tagged SpiH1 was purified from E. coli BL21(DE3)/pET28a-H1 to near homogeneity (Supplementary Fig. 20a). Experiments were processed in vitro by incubating 1 with SpiH1, FAD, NADPH and E. coli flavin reductase (Fre), which is known to regenerate FADH2 from FAD using NADPH11. As shown in Fig. 6b, the production of 2 in the reaction mixture was detected by HPLC trace. To further test the substrate promiscuity of SpiH1, compounds 3–5, whose structures containing C12 carbonyl groups were incubated in vitro with SpiH1, but no obvious changes for all the tested compounds were observed (data not shown), showing the relatively low promiscuity for SpiH1.
Based on above results, the conversion steps from the oxaspiro structure of 1 to the ring A-opened product of 2 was proposed: BV reaction on 1 catalyzed by SpiH1 introduced an oxygen atom adjacent to the active 12-carbonyl group, thus generating the ester intermediate 2′, which subsequently underwent a spontaneous rearrangement reaction and oxidation steps to give the final ring A opened product of 2 (Fig. 6a)
Docking and Mutagenesis Studies Revealing Key Residues for the Function of SpiH1
Sequence alignment of SpiH1 with AlpJ and other BVMOs in the ABM superfamily revealed the conserved residues in N-terminal (GxSxxxG) and C-terminal (YxQW), which was consistent with the BV oxidation activity of SpiH1 (Supplementary Fig. 19). To further understand the role of SpiH1 during the intriguing transformation from 1 to 2, we then performed AlphaFold2 assisted protein analysis on SpiH133. The overall structure of SpiH1 consists of six α helices and nine antiparallel β sheets, which make two hydrophobic pockets within the C-terminal pocket in the “open” conformation accessible to substrate while the N-terminal is buried and not easy for reactants to enter (Supplementary Figs. 20b-c). In addition, both pockets showed no FAD-like cofactors binding sites, which is the typical characteristic for FMOs in the ABM superfamily.1 Above features highly resembled to the three-dimensional conformation of the well characterized AlpJ. Detailed analysis of the sequence also provided the conserved residues of His54 and Trp68 in the N-terminal, and Tyr170, Gln172 and Trp173 in the C-terminal. SpiH1 and compound 1 were then subjected to docking analysis using AutoDock Vina34. The best binding conformations revealed most conserved residues are positioned toward the catalytic cavity, except for Tyr170, which was reported to be the key residue for stabilization of domain interface, being stretched outside. Importantly, the size of the SpiH1 active site was predicted to be much bigger than that of any previously reported AlpJ family of oxygenases with five proposed catalytic residues of Arg125, Asn169, Tyr179, Tyr182 and Asn188 folded into the cavity, which suggested a different mechanism of substrate binding or catalysis for SpiH1 (Figs. 6d-e).
On the basis of the above information, the conserved sites including residues at the intramolecular domain interface (His54 and Tyr170) and residues reported responsible for substrate binding (Val64, Trp68, Trp173), together with residues specifically presented in the SpiH1 catalytic pocket (Arg125, Asn169, Tyr179, Tyr182 and Asn188) and a conservative site of Gln172 in the C-terminal were mutated to alanine. The activities of SpiH1 and its mutants were evaluated by monitoring the consumption of the substrate 1, and the reaction mixture with boiled SpiH1 was used as a negative control.
As shown in Fig. 6c, the activities of H54A, W68A, R125A, Y170A, W173A, Y182A and N188A were drastically decreased by more than 70% compared with wild-type SpiH1. For other mutants like N169A, Q172A, and Y179A, activity was only slightly affected. The above results support both the N- and C-terminal putative active sites as being vital for the activity of SpiH1, with the C-terminal “open” to the reactants and the N-terminal helping to stabilize the active conformation, where Arg125 and Asn188 together with Tyr182 form strong hydrogen bonds with the C1 carbonyl and C13 hydroxyl groups of 1, thus exposing the C12 carbonyl to the cofactor of peroxyflavin and facilitating the nucleophilic attack initiated BV oxidation. Compared with AlpJ, the bigger active sites of SpiH1 may allow the accommodation of substrate 1, which needs more space than DHR, the nearly planar structured substrate of AlpJ.
In conclusion, heterologous expression and mutational biosynthetic studies of a type II PKS BGC led us to characterize six new angucycline variant structures (1–4, 8, 11). Interestingly, compound 1 represents the first example of angucycline derivatives with an oxaspiro[5.5]undecane ring system and compound 2 possesses a benzochromene scaffold bearing two carboxylic acid substitutions derived from the oxidative cleavage of compound 1 in ring-A. Meanwhile, compound 1 exhibited promising cytotoxicity against triple-negative breast cancer (TNBC) cell line of MDA-MB-231, and compound 2 showed cytotoxicity to multidrug-resistant human small-cell lung carcinoma cell line of H69AR. Biosynthesis studies based on in vivo biotransformation experiments revealed the formation of 1 resulted from sequential flavin dependent oxidative reactions on the structure of the early tetracyclic angular intermediates such as UWM6 catalyzed by two FMOs of SpiH2 and SpiH3, while SpiH1, identified as a new member of ABM super family FMOs, was unambiguously confirmed to catalyze the transformation from 1 to 2 via BV oxidation adjacent to the C12 carbonyl followed by hydrolysis steps, which represented the first example of BV oxidative cleavage of ring A on oxaspiro angucycline derivatives. AlphaFold assisted analysis and site-directed mutagenesis experiments revealed the structure of SpiH1 was conformationally similar to the AlpJ family of oxygenases but possessed a more relaxed space in the catalytic cavity with a triad of Arg125, Tyr182 and Asn188 presented as active sites, which may facilitate the loading of the more stereoscopic stretched substrate of 1 and the peroxyflavin dependent nucleophilic attack at the of C12 carbonyl position. The elucidation of new compounds in this study enriches the structural diversity of angucycline-like chemical entities and the discovery of the new function of the above enzymes expands the catalytic repertoire of naturally occurring flavoproteins in aromatic polyketide biosynthesis, and enlightens ongoing chemical engineering and molecular design work based on FMOs catalyzed oxidative reaction cascades.