Construction of genome-reduced B. amyloliquefaciens strains
To adapt to the adverse environmental conditions, there is a common mechanism horizontal gene transfer (HGT) among microorganisms, enabling host bacteria to acquire larger DNA segments, i.e., GIs, the G+C contents of which are significantly different from that of the core genome [26]. GIs usually carry some functional genes related to pathogenicity and antibiotic resistance, leading to the emergence of multiple resistant bacteria by HGT [27]. In addition, there are latent secondary metabolic biosynthesis gene clusters scattered across the LL3 genome, which may increase the metabolic burden on cells and the purification cost of target products [28]. Consequently, to streamline the genome of LL3, the GIs containing putative protein genes, antibiotic biosynthesis genes and prophage protein genes, where the G+C contents deviate significantly from 45.7%, were selected as the knockout targets. Besides, the gene clusters eps, bae and pgsBCA responsible for the biosynthesis of extracellular polysaccharides, bacillaene and γ-PGA, respectively, which consume more energy and substrates, were also deleted from the LL3 genome. The detailed information on the deleted regions is summarized in Tables S1 and S2. The schematic diagram for deletion of large genomic segments in LL3 is presented in Figure S1. Overall, a genome-reduced strain GR167 lacking ~4.18% of the LL3 genome was generated from NK-1 via a markerless deletion method [24]. The exact coordinates (G1 to G6) of the deleted regions on chromosome and the physical map of the endogenous plasmid pMC1 are shown in Figs. 2a and b, respectively.
Deleting redundant genes from a bacterial genome is expected to create superior chassis cells for the industrial production of valuable bio-based chemicals. Due to the existence of unannotated genes in the LL3 genome and lack of insight into the interactions among known genes, several industrially-relevant physiological traits were evaluated in GR167 to determine whether a chassis cell with excellent characteristics can be produced by genome reduction.
Genome reduction can improve the growth rate of LL3
To evaluate the effect of non-essential genomic sequences on cell growth, the growth profiles of GR167 and the parental NK-1 strain were detected by following the optical density (OD600) of cells cultured in both poor (M9 medium) and rich (LB medium) conditions. As shown in Fig. 3a, obviously, whether incubated in LB or M9 medium, GR167 grew faster and yielded higher biomass with approximately 1.5 and 1.2-fold higher at the plateau phase than that of NK-1, respectively. The maximum specific growth rate (μmax) of the strains was further determined during exponential growth. When compared with NK-1, the GR167 showed a 23.7% and 67% increase in μmax when cultured in LB and M9 medium, respectively (Fig. 3b).
During the evolution of the bacteria, various behaviors (e.g., horizontal gene transfer, HGT) enlarge the genome capacity, which may be unfavorable to cell growth because of the extra consumption for the expression of redundant metabolic pathways [29]. In current study, there was a positive correlation between cell growth and cumulative deletions, and deleting ~4.18% of the LL3 genome did not affect cellular viability of GR167. Moreover, the growth rate of GR167 outcompeted the parental strain under the tested culture conditions, making it a candidate chassis cell for further genetic engineering.
Genome reduction can enhance transformation efficiency
An ideal chassis cell is expected to possess the excellent capacity to take up exogenous plasmids. To eliminate the growth-rate bias of different strains, transformation efficiency was calculated by normalizing the colony number of bacteria transformed with plasmid pHT01 against the colony number of bacteria transformed without plasmid DNA. As shown in Fig. 3c, GR167 surpassed the transformation efficiency of the parental strain NK-1 by about 133%, indicating that the GR167 presented a better DNA uptake state during electroporation. For the transformation efficiency was a synergistic effect caused by many factors [30], it is difficult enough to pinpoint individual removed genes that are exactly affect the observed results.
Genome reduction can increase intracellular reducing power and the expression capacity of heterologous protein
The intracellular reducing power (NADPH/NADP+), which is indispensable for basic anabolic processes [31], was measured in this study. The intracellular NADPH/NADP+ ratio of GR167 increased by 21.4% compared to the parental strain, (Fig. 3d), which may be attributed to the deletion of some NADPH-consuming biosynthesis pathways such as γ-PGA biosynthesis [32]. The improvement of intracellular reducing power level may be beneficial for GR167 to enhance the production of secondary metabolites.
Besides, an ideal chassis is expected to possess high heterologous protein expression capacity. In previous studies, green fluorescent protein (GFP) was selected as a model heterologous protein in genome-reduced P. putida KT2440 mutants, and the expression capacity of heterologous protein was characterized by the GFP fluorescence per biomass unit [9, 33]. In this study, the production capability of GFP was evaluated in GR167 by transcriptional level and the fluorescence intensity. As shown in Fig. 3e, when transformed with plasmid pHT-P43-gfp, the relative fluorescence intensity of GR167 was about 50.4% higher than that of NK-1 and the increase in the transcriptional level of gfp was also observed in GR167. Overall, the reduced genome in GR167 has a positive effect on the expression capacity of heterologous protein.
Genome reduction can improve the metabolic phenotype
To further evaluate the difference in the metabolic potential of B. amyloliquefaciens, the GEN III MicroPlateTM was used to test and analyze the overall utilization of the substrates by both strains NK-1 and GR167. There are 71 carbon sources in the MicroPlates, 23 of which could be better utilized by both analyzed strains, especially L-Aspartic Acid, Citric Acid, L-Malic Acid, Glycerol, L-Glutamic Acid, and L-Lactic Acid, and among which, GR167 showed a better metabolic capacity than NK-1 except Glycerol (Table 1). These results indicated that these substrates were the preferred carbon sources for B. amyloliquefaciens LL3 and its derivatives, and that genome reduction could improve the capacity of LL3 to utilize certain substrates.
Use of genome-reduced strain GR167 as a starting strain for surfactin production
Surfactin is synthesized by the non-ribosomal peptide synthetase (NRPS) encoded by srfA operon (srfAA, srfAB, srfAC, srfAD) in microbes, which uses four amino acids (L-glutamate, L-leucine, L-valine, and L-aspartate) and fatty acids as precursors to form cyclic lipopeptide surfactin via a complex mechanism [34] (Figure S4). For surfactin, it can hardly achieve a significant breakthrough in production only through traditional fermentation optimization because of its low yield in wild strains [16, 35]. Engineering and modifying microbial chassis could maximize its practical application ranges and obtain maximum theoretical yields of bio-based products of interests. Such as B. subtilis BSK814, a genome-reduced strain, was endowed with the ability to hyperproduce guanosine as well as acetoin by modifying different metabolic pathways [4, 19].
In this study, the chassis GR167 with the intact srfA operon and superior physiological traits was used as a starting strain for surfactin production. Because the fermentation broth of the NK-1 strain was too viscous to obtain a relatively purer surfactin product, the quantification of surfactin produced by NK-1 was very difficult. The γ-PGA production leads to the high viscosity and the limitation of dissolved oxygen of the culture broth [36] and competes with surfactin production for the common substrate glutamate (Glu) (Figure S4). Moreover, both γ-PGA and surfactin are extracellular secretion products. Therefore, an extremely low yield of surfactin was detected with NK-1. Consequently, the mutant strain NK-ΔLP (NK-1 derivative, ΔpgsBCA) [37] without γ-PGA production was used as a control in the case of surfactin production. Surfactin produced by GR167 and NK-ΔLP was demonstrated by high-performance liquid chromatography-mass spectrometry (HPLC-MS). A slight increase (approximately 9.7%) in the surfactin titer was observed with GR167 (Figure S2). Genome reduction seems to have little positive effects on the surfactin yield, however, the chassis GR167 constructed in this study shows superior genetic operability, e.g., higher transformation efficiency. In addition, higher growth rate of GR167 is also a critical factor for ensuring that further genetic modifications are successfully performed. Therefore, it is interesting and necessary to explore whether microbial cell factories with high surfactin production capabilities can be constructed by further modification of GR167.
Characterization of surfactin produced by GR167
It was reported that surfactin produced by microorganisms is a mixture of several surfactin homologs [38]. In current study, by comparing the HPLC spectrogram of the produced surfactin by GR167 with that of the surfactin standard, there were four peaks to be detected within a retention time range of 6.4 to 7.2 min (Fig. 4a). To determine precisely the surfactin purity produced by GR167, each peak product of GR167 was purified from the culture supernatant and subjected to mass spectra (MS) analysis. The mass spectra of the product peaks 1, 2, 3, and 4 had the molecular ion peaks at m/z 995, 1009, 1023, and 1037, which were attributed to [C13 + 2H]2+, [C14 + 2H]2+, [C15 + 2H]2+, and [C16 + 2H]2+, respectively (Figs. 4b and c). These compounds are four homologs of surfactin, which differ in their β-hydroxy fatty acid chain length by a CH2 group of 14 Da.
Enhancing surfactin production by blocking the potential competitive pathways
A transcriptional comparison between B. amyloliquefaciens LL3 and NK-ΔLP using RNA-seq revealed that the transcriptional levels of the gene clusters srfA, itu and fen, responsible for surfactin, iturin A and fengycin biosynthesis were all up-regulated when pgsBCA was removed (Figure S3). Iturin A and fengycin belonging to CLP antibiotics are structural analogues of surfactin [39], possibly leading to the reduction of the purity of the extracted surfactin from the culture supernatant. Iturin A and fengycin are synthesized by NRPSs like surfactin [13]; thus, they may share similar biosynthesis mechanisms with surfactin and their biosynthesis may compete for NADPH, energy and direct precursors with surfactin biosynthesis. In this study, the gene clusters itu (37.2 kb) and fen (11.5 kb) were deleted to enhance surfactin production. The resulting mutants were designated as GR167I (Δitu), GR167D (Δfen) and GR167ID (Δitu, Δfen). The titer of surfactin was increased to 32.88 mg/L in GR167ID, with a 10% and 56% improvement in the titer and specific productivity of surfactin compared to GR167, respectively (Figure S2). The synthesis pathway of surfactin indicated that blocking the potential competitive pathways can eliminate the competition for the same amino acid precursors, allowing for the redistribution of substrates and precursors towards surfactin biosynthesis (Figure S4).
Construction of endogenous promoter library of B. amyloliquefaciens LL3
Promoter engineering is considered as a promising approach for enhanced production of bacterial secondary metabolites [21, 22]. FPKM (fragments per kilobase million) value is positively correlated with the transcriptional activity of a gene [40], which therefore can be regarded as an indicator for initial screening of promoters. Through RNA-seq analysis of LL3, all genes were ranked and classified into three groups based on their FPKM values, i.e., lower than 1,250, 1,250-4,000 and higher than 4,000. Then, the first six genes with higher FPKM values in each group were selected, and their upstream regions were predicted and cloned as described in Methods, named PRx [x: the name of various related genes; PR: the sequences of predicted promoters with their ribosomal binding sites (RBSs)] and represented weak, moderate and strong promoters, respectively (Table S3). Subsequently, various reporter gene vectors derived from pHT01 containing fused fragments of the predicted promoters and gfp gene were used to assess the strengths of the tested promoters in LL3 (Figure S5).
Characterization of the selected promoters via RT-qPCR and GFP fluorescence measurement
As shown in Fig. 5a, the relative transcriptional levels of the candidate promoters measured with reporter gene vectors were PRldh, PRahp, PRhem, PRtpxi, PRclp, PRsuc, PRaccD, PRgltA, PRrpsu, PRnfrA, PRgltX, PRydh, PRugt, PRarg, PRnad, PRlac, PRalsD, PRhom and PRpgmi in a descending order, which were inconsistent with the strengths of the promoters shown by the FPKM values (Table S3), with similar results reported in a previous study [23]. We speculate that the transcription of a gene on chromosome may be affected and regulated by flanking genes and regulatory sequences. However, this interference could be eliminated if a promoter is inserted into a plasmid.
To better evaluate these endogenous promoters, the relative fluorescence intensities of GFP was measured. Among the 18 endogenous promoters, PRahp, PRsuc and PRtpxi showed superior production capacity of GFP, followed by PRrpsU, PRhem and PRydh (Fig. 5b). However, the first six promoters were PRldh, PRahp, PRhem, PRtpxi, PRclp and PRsuc from high to low at the transcriptional levels (Fig. 5a). The different RBSs located upstream of the promoters may affect the translational initiation efficiencies of mRNA corresponding to GFP, leading to the different trends between the transcriptional level and production capacity of GFP.
Substitution of the native srfA promoter further enhanced surfactin production
Considering the heterologous expression of srfA is challenging for which large genetic sequence (over 25 kb), substitution of the native srfA promoter by strong promoters is considered more beneficial for enhanced transcription of srfA operon [15, 16, 20]. In this study, two promoters PRsuc and PRtpxi with better transcription and expression levels were integrated into upstream of the srfA operon in GR167ID, generating mutant strains GR167IDS and GR167IDT, respectively. The nucleotide sequences of the two selected promoters are shown in supplementary material. As expected, both the surfactin production and specific productivity exhibited a significant elevation (Figs. 6a and b). In particular, the PRsuc promoter-substituted strain GR167IDS produced 311.35 mg/L surfactin, which was about 9.5-fold higher than that of GR167ID (Fig. 6a). Meanwhile, the transcriptional level of srfA in GR167IDS was 678-fold higher than that in GR167ID (Fig. 6c). The melting curves of srfA and its internal standard gene rpsU indicated the absence of non-specific products (Figure S6).
The chassis strain GR167 does not differ in the surfactin yield compared to NK-ΔLP, however, the replacement of srfA promoter with PRsuc promoter in GR167ID significantly increased the surfactin yield of the promoter-substituted strain GR167IDS compared with other modifications (genome reduction and blocking of competitive pathways). Interestingly, PRsuc was substituted for srfA promoter in NK-ΔLP to generate the mutant strain NK-ΔLPS, with a surfactin yield of 180.88 ± 2.87 mg/L (approximately 1.7-fold lower than that of GR167IDS). Therefore, genome reduction may contribute to the improvement of overall cellular metabolic activity, resulting in high-performance production of surfactin in GR167IDS.