DHCR7 is rate-limiting step for complete biosynthesis of Cho in S. cerevisiae
First, we constructed the biosynthetic pathway to produce 7-dehydrocholesterol (Dhc), the key precursor of Cho. Considering the endogenous biosynthetic pathway for ergosterol in S. cerevisiae competes for carbon flux with the pathway for Cho de novo synthesis (Fig. 1A)15, DHCR24 from Gallus with the promoter PTEF was inserted into ERG5 and ERG6 sites to achieve two copies, thereby facilitating the accumulation of Dhc and reducing carbon flux toward ergosterol (Fig. 1A). The resulting strain D7-1 produced 203 mg/L of Dhc (Fig. 1B). Subsequently, to convert Dhc into Cho, we screened DHCR7 from four species (Bos taurus, Pangasianodon hypophthalmus, Sus scrofa, and Danio rerio). The resulting strain D7-1 with pY26-BtDhcr7 could produce a maximum Cho production of 31 mg/L (Fig. 1B) but 146 mg/L Dhc was still accumulated (Fig. 1C). Obviously, DHCR7 was the rate-limiting heterologous enzyme and needed for engineering.
ETE of BtDHCR7 for efficient complete biosynthesis of Cho
Elucidating the catalytic mechanism of DHCR7 and its electron transfer mechanism was crucial for the ETE process. First, BtDHCR7 was modeled by AlphaFold2 and docked with both Dhc and NADPH to determine the substrate-binding domain and NADPH-binding domain. (Fig. 2A). Dhc could both bind to the substrate-binding domain with Y317 and the surface at the entrance of the catalytic pocket with F430 and F434 (Fig. 2A). The nicotinamide group of NADPH could bind to Y54, which was formed the NADPH-binding domain of DHCR7. However, how electrons were transferred from the NADPH-binding domain to the active center still remained unknown. Aromatic residues, such as phenylalanine and tyrosine, are excellent carriers of electron transfer and could facilitate electron transfer through the resonance stabilization of radical intermediates6, 16. We happened to discover that there is a series of aromatic residues (Y54, F56, F430, F434, and Y317) between the NADPH binding domain and the catalytic center, which might form the electron transfer chain of DHCR7 (Fig. 2A). To verify our above hypothesis, Y54, F56, F430 and F434 were mutated to alanine while Y317 was mutated into phenylalanine for disturbing electron transfer (Fig. 2B). It showed that a sharp decline in Cho production was observed when one of residues involved in electron transfer was disrupted, and DHCR7 even completely lost its activity with no Cho production when two residues involved in electron transfer were disrupted (Fig. 2B). Moreover, Y317F-DHCR7 mutation completely lost its activity (Fig. 2A and 2B), suggesting its essential role in finally transferring the electron to the substrate. In conclusion, free Dhc in the cytoplasm was captured by DHCR7 through π-π stacking interactions with F434 and F430, subsequently entering the active center composed of Y317. Simultaneously, electrons from NADPH are captured by Y55 and transferred through the electron transfer chain (measured as 29.4 Å) with F56, F430, and F434 to the active center Y317, facilitating the electrophilic addition at the C7 = C8 of Dhc to synthesize Cho (Fig. 2A).
Based on the catalytic and electron transfer mechanism, substrate-recognizing domain (SRD), substrate-binding domain (SBD), and electron transfer chain (ETC) of BtDHCR7 were gradually engineered. First, for engineering SRD, we noticed that compared to the conformation of Dhc in the catalytic pocket, the side chain of Dhc in the substrate recognition domain was in a twisted state (Fig. 2C). This resulted in higher binding energy (-4.17 kcal/mol) than that of catalytic pocket (-9.01 kcal/mol), which prevented DHCR7 effectively capture the substrate. Thus, L67 and L426 of BtDHCR7 closed to the side chain of Dhc (Fig. 2C) were mutated into hydrophobic residue of small conformations (G/A/V/I). It showed that the BtDHCR7 -L67V-L436A (M1) variant increased the production of Cho to 76.6 mg/L (Fig. 2D). Subsequently, based on M1, SBD was further engineered. Considering Dhc only formed a π-π stacking interaction with Y317 in SBD of M1, and its 2C-OH did not form stronger hydrogen bonds with other residues. Thus, W280 and Y290, located within 4 Å of the 2C-OH, were subjected to saturation mutagenesis. It showed that M1- W281Y-Y291W (M2) variant increased the production of Cho to 92.3 mg/L (Fig. 2D). Finally, after iteratively engineering the DHCR7's SRD and SBD, resulting in M2, its electron transfer chain needed to be engineered. The ETE of BtDHCR7 involved two steps: shortening the distance for NADPH electron capture and engineering the electron transfer residues. Starting from M2, A49 and I53 of BtDHCR7 were gradually engineered into polar residues for accelerating obtaining the electrons of NADPH. It showed that only M2- I53T (M3) variant increased the production of Cho to 124.3 mg/L (Fig. 2D). The mechanistic analysis of BtDHCR7 has confirmed that F430 and F434 in the electron transfer chain not only transferred electrons but also captured the Dhc from the medium. Additionally, we also found that the binding position of F430 with Dhc was similar to that of Y317 with Dhc in the SBD. However, F430 can only facilitate electron transfer and not act as an electron donor, so it was further mutated to tyrosine. It showed that only M3- F430Y variant increased the production of Cho to 179.0 mg/L (Fig. 2D). Tyrosine could also increase the π-electron cloud density of the conjugated system, thereby further enhancing electron transfer efficiency6. Thus, F434 of electron transfer chain was further mutated into tyrosine, which resulted in M3-F430Y-F434Y (M4, MuDHCR7) variant could increase the production of Cho to 191.7 mg/L, representing a 5.93-fold improvement over wild-type BtDHCR7.
Computational simulation analysis for ETE of BtDHCR7 process
ETE of BtDHCR7 has achieved remarkable results and still needed to employ computational simulations to deeply analyze the specific electron transfer mechanisms. The electron transfer chains of wild-type BtDHCR7 and MuDHCR7 are composed of Y55-F56-F434-F430-Y317 and T53-Y430-Y434-Y317, respectively. Firstly, we used 100-ns molecular dynamics simulations to analyze the dynamic changes of distances between the electron transfer residues in BtDHCR7 and M4 (Fig. 3A), respectively. Starting from the capture of electrons from NADPH by the electron transfer chain, the distances from the nicotinamide group of NADPH to T53 (wild-type BtDHCR7) and Y55 (MuBtDHCR7) were 5.8 ± 0.3 Å and 3.9 ± 0.4 Å, respectively (Fig. 3B). Subsequently, the distances between residues in the electron transfer chain of wild-type BtDHCR7 remain within 5–10 Å, whereas that of M4 can be stably maintained within 2–5 Å (Fig. 3A). As a result, the electron transfer chain length of WT-DHCR7 and MuDHCR7 was 29.4 and 21.3 Å, respectively (Fig. 3A).
Enhancing the efficiency of electron transfer from the terminal residues of the electron transfer chain to the substrate was also the crucial step in ETE process. We have confirmed that Y317 was the key electron-donating residue in the active pocket. MD simulation results showed that the distance between Y317 of MuBtDHCR7 and the C7 = C8 double bond of the substrate is 3.9 ± 0.14 Å, with the length and fluctuation range of this distance being lower than that of wild-type BtDHCR7 (4.1 ± 0.52 Å) (Fig. 3B). This indirectly indicated that the Y317 of M4 could provide electrons to the substrate more effectively. Notably, the ETE of BtDHCR7 also involved engineering the SRD. F430 of wild type BtDHCR7 SRD only facilitated electron transfer without providing electrons to the substrate and was further mutated into tyrosine for possessing both electron transfer and electron donation capabilities to the substrate, which significantly enhanced the activity of DHCR7. To verify our conclusion as Y430 was the novel catalytic center, Y317 and Y430 were further mutated to alanine, respectively (Fig. S20). Consistent with our conclusion, only M4- Y317A-Y430A variant lost activity. We noticed that the two active pockets respectively formed by Y317 and Y430 both have the similar binding conformation with Dhc. Thus, we further extracted the catalytic pockets involving Y317-Y280 and Y430-Y434, along with the substrate, to create a cluster model for QM calculations (Fig. 3C). In step 1, the C8-Dhc toke a proton from Y317 or Y430 to generate intermediate (Int1) via the transition state (TS1), which requires an activation-free energy of 21.2 and 26.5 kcal/mol (Fig. 3C). In step 2, Y317 or Y430 respectively attracted protons from Y280 or Y434 and transfer them to C7-Dhc to generate intermediate (Int2) via the transition state (TS2), which requires an activation-free energy of 20.5 and 23.0 kcal/mol (Fig. 3C). QM calculations confirmed that the catalytic center composed of dual tyrosine residues effectively facilitates the double bond reduction process. Due to the introduction of Y430 to form a new active center, the new electron transfer chain was further shortened from 21.3 Å to 9.5 Å. As a result, ETE process of BtDHCR7 brought novel catalytic site, which further shortened the electron transfer chain by 68%, which significantly increased the activity.
Optimizing the elements of P450 scc systems for complete biosynthesis of Prn.
Constructing an efficient de novo biosynthesis pathway for Prn requires obtaining effective P450scc catalytic components (Fig. 4A)17. Firstly, we screened CYP11A1 within the P450scc system. CYP11A1 sequences from 28 different species were retrieved from the National Center for Biotechnology Information (NCBI) database and used to conduct a phylogenetic tree analysis (Fig. S15). However, analyzing 28 CYP11A1 enzymes represented a substantial workload. Thus, the deep learning technique18 was employed to calculate their turnover numbers (kcat values). Based on their DLkcat values, the top ten CYP11A1 enzymes were selected to introduce into the D7-1 with pY26-MuDHCR7 strain for further catalytic validation (Table S4). Moreover, ADR and ADX from H. sapiens and ATR from Arabidopsis thaliana were selected to serve as redox partners. It showed that the actual catalytic abilities of different forms of CYP11A1 closely matched with the calculated kcat values derived from DLkcat (Table S4), and CYP11A1 from R. glutinosa (RgCYP11A1) give the best Prn production, reaching 93.5 mg/L. (Fig. 4B). Thus, RgCYP11A1 was selected for further engineered.
ETE of RgCYP11A1 for efficient complete biosynthesis of Prn and mechanism analysis
CYP11A1 can utilize its heme center to hydroxylate substrates and in combination with residues near the heme achieved side chain cleavage14. However, the side chain cleavage residues of RgCYP11A1 remain unknown. First, residues of RgCYP11A1 within 4 Å region of Cho were selected for alanine scanning to determining the side chain cleavage residues (Fig. 5A). RgCYP11A1- Y105A variant completely lost the ability of side chain cleavage and was identified as a side chain cleavage residue. Moreover, RgCYP11A1-V118A variant could increase the Prn production from 161 to 197.0 mg/L by adding 300 mg/L Cho as the substrate (Fig. 5B) and was selected for next generation of engineering process. Based on the docking results, we found that Y105 is close to the C22-OH group, allowing it to abstract the proton and facilitate the subsequent side chain cleavage. Subsequently, QM calculations determined that the C22-OH toke a proton from Y105 to generate intermediate (Int1) via the transition state (TS1), which requires an activation-free energy of 28.2 kcal/mol (Fig. 3C). The high energy barrier limits the efficiency of side chain cleavage by CYP11A1. We noticed that the naturally alkaline microenvironment in R. glutinosa benefited CYP11A1 for facilitating the final step of Prn synthesis by capturing the proton from the hydroxyl group of 20α,22(R)-dihydroxycholesterol to form an ionic bond19, thereby promoting electron transfer and enabling side-chain cleavage. Thus, starting from RgCYP11A1-V118A variant, A281 and T293 within 4 Å of C22-OH-Cho were further engineered into acidic residues for accelerating PECT process, which was the second-generation modification of CYP11A1 and belong to the ETE process. To this end, we found that the A281D-T293E variant of V118A-RgCYP11A1 (MuRgCYP11A1) could produce 243 mg/L of Prn (Fig. 5C).
The biosynthesis of Prn by cutting the side chain of Cho with P450scc was a complex process involving two hydroxylation as the first step, followed by the loss of two protons from the two hydroxyl groups and ultimately the formation of a ketone group (Fig. 5D) as the second step14. For the first step, ensuring that the two carbon positions requiring hydroxylation were stably positioned above the heme center and in similar distances were the key to efficient dihydroxylation20. 100-ns MD simulation of RgCYP11A1 and Cho complexes indicated that C-20 and C22 were more stably exposed above the center of the heme (Fig. 5E and 5F), which further accelerated dihydroxylation process. Additionally, free energy calculations revealed that the D281 and E285 of MuRgCYP11A1 could contribute intermolecular forces of -48 kJ/mol and − 51 kJ/mol, respectively (Fig. S21). The stronger binding forces allow Cho to bind more stably in the substrate pocket of CYP11A1 for higher catalytic efficiency. For the second step, QM calculation of wild type RgCYP11A1 suggests that the lack of acidic residues within 4-Å distance around 20α,22(R)-dihydroxycholesterol to capture its proton limited PCET process. Thus, we further carried out QM calculation for examining the first step of proton abstraction from the 22-hydroxyl group in MuRgCYP11A1, which was engineered by ETE (Fig. 5E). Introducing T293E into the catalytic pocket of CYP11A1, compared to the original Y105, allows for closer proximity to the 22-OH proton (reduced from 2.0 Å to 1.6 Å), which further lowered the reaction energy barrier (from 28.2 kcal/mol to 14.6 kcal/mol) (Fig. 5G). In fact, ETE process effectively promoted the PCET process and accelerating electron transfer efficiency, which significantly improved the catalytic efficiency of CYP11A1.
ETE of S. cerevisiae for efficient complete biosynthesis of Cho and Prn
Last, the above obtained BtDHCR7 and RgCYP11A1 variants with the highest catalytic efficiency (MuDHCR7 and MuRgCYP11A1) were used to construct de novo biosynthesis pathway of Cho and Prn in S. cerevisiae. Before this, the precursor metabolic pathway of Cho synthesis was strengthened (Fig. 6A). First, the mevalonate (MVA) pathway was strengthened by integrating tHMG1 and ID11 into the TY1 multiple copy sites of D7-1, resulting in D7-2. D7-2 with pY26-MuDHCR7 produced 228 mg/L Cho (Fig. 6B). ERG2 and ERG3 were further integrated into TY3 multiple copy sites of D7-2 for increasing the conversion rate of squalene to Cho, resulting in D7-3. D7-3 with pY26-MuDHCR7 produced 267 mg/L Cho (Fig. 6B). Subsequently, MuDHCR7 was integrated into the D7-3 genome in single, double, and multiple copy numbers, respectively (Fig. S15). As a result, D7-3 with two copies of MuDHCR7 (D7-4) could produce 303 mg/L Cho (Fig. 6B).
After optimizing the carbon flux through metabolic engineering, D7-4 was continuously engineered by ETE strategy (Fig. 6B). Given that the animal-derived electron transfer system supplied electrons to the endogenous ERG11 and ERG25 in S. cerevisiae21–24, thereby directing more carbon flux toward the subcellular structures involved in animal sterol synthesis, two electron transfer systems (adrenodoxin reductase (ADR)-adrenodoxin (ADX) from H. sapiens and human cytochrome b5, respectively) were integrated into D7-4, resulting in D7-5 and D7-6, respectively. D7-5 with b5 could produce 282 mg/L Cho, while D7-6 with both ADR and ADX produced 382 mg/L Cho (Fig. 6B). Then, a glucose 6-phosphate dehydrogenase (GDH) from Bacillus Subtilis was integrated into D7-6 for recycling NADPH and enhancing the electron transfer efficiency, resulting in D7-7. This strain could produce 425 mg/L Cho (Fig. 6B). Besides enhancing the cofactor regeneration efficiency, FPK and PTA were introduced into the PPP for NADPH regeneration (D7-8). Consequently, D7-8 could synthesize 460 mg/L of Cho (Fig. 6B). In order to test whether the ETE process improves the efficiency of yeast NADPH regeneration, the ratio of NADPH/NADP+ of D7-6, D7-7, and D7-8 were measured, respectively. It showed that the ratio of NADPH/NADP+ from D7-6 to D7-8 increased from 0.3 to 0.8 using the electron transfer strategy (Fig. 6C). The strains D7-8 were used to produce Cho through fermentation in a 5-L bioreactor (Fig. 6D). After optimization, the titers of Cho reached 1.78 g/L (Fig. 6E).
For Prn production, pY26-MuRgCYP11A1-ATR were introduced into D7-8, resulting in D7-9 could produce 161.84 mg/L Prn. Subsequently, wild type RgCYP11A1-ATR and MuRgCYP11A1-ATR were integrated into the genome of strain D7-8 to achieve a more stable fermentation synthesis, resulting in D7-10-1 and D7-10-2. D7-10-2 with MuRgCYP11A1 could produce 193.77 mg/L of Prn, an increase of 138% compared to the wild type. Reducing the physical distance between redox partners and CYP450 enzymes is a strategy to enhance the electron transfer efficiency (ETE) of P450 enzymes1. Thus, to enhance electron transfer efficiency, MuRgCYP11A1 and ATR were linked using the flexible peptide GGGGS (Fig. 6E), resulting in D7-10-3. As a result, D7-10-3 could produce 261 mg/L Prn (Fig. 6E), an increase of 37% compared to the D7-10-2. Finally, the strains D7-10-3 were used to produce Prn through fermentation in a 5-L bioreactor (Fig. 6F). After optimization, the titers of Prn reached 0.83 g/L (Fig. 6F).