Synthesis and characterization of the lignin models
In recent work, we synthesized the unlabeled β-O-4 lignin oligomer model compound 4 (Fig. 2) and fully characterized it by performing 1D 1H NMR, 1D 13C NMR, 2D 1H-13C HSQC, 2D 1H-13C HMBC, 2D 1H-13C long-range heteronuclear single quantum multiple bond correlation (LR-HSQMBC), 2D 1H-1H TOCSY, 2D 1H-1H EXSY and 2D 1H-1H ROESY [18]. We applied the protocol to synthesize three 13C-labeled β-O-4 lignin oligomer model compounds. 13C-labels were placed in the aromatic rings and β positions of compound 4(Arβ), the α positions of compound 4(α), and methoxy groups in compound 4(m) using 13C-labeled vanillins and t-butyl-2-bromoacetate. The 13C-vanillins 1(Ar), 1(α), and 1(m) were individually refluxed in acetone with unlabeled or 13C-labeled t-butyl-2-bromoacetate to afford 13C-t-butoxycarbonylmethyl vanillins 2(Arβ), 2(α), and 2(m), respectively. The 13C-t-butoxycarbonylmethyl vanillins were dissolved in anhydrous tetrahydrofuran (THF) and polymerized using lithium diisopropylamide (LDA). The ester groups in polymerized 13C-oligomers 3(Arβ), 3(α), and 3(m) were reduced with NaBH4 to obtain 13C-labeled β-O-4 lignin oligomer model compounds 4(Arβ), 4(α), and 4(m), respectively.
The 2D 1H-13C HSQC spectra of the 13C-labeled lignin models contained signals that corresponded to the 1H-13C correlations (Fig. 3), which was evidence that 13C was incorporated at the designated positions. No HSQC signals attributable to side products were observed in the NMR spectra, which indicated that high-purity lignin models were prepared. HSQC signals at δC/δH 85.9/4.52–4.64, 72.6/4.06–4.08, and 113.2–123.6/6.81–7.06 ppm in the HSQC spectra of lignin model 4(Arβ) with 13C-labeling in the β positions and aromatic rings (Fig. 3a) were assigned to the β positions, Terminal C1, and aromatic regions, respectively. The HSQC spectra of lignin model 4(α) (Fig. 3b) contained three signals at δC/δH 74.7–75.0/4.92–4.98, 74.8–75.0/4.72–4.76, and 66.2/4.43–4.49 ppm, which were assigned to α positions in the A, B, and C rings, respectively. A single signal was observed at 58.1–58.3/3.57–3.85 ppm in the HSQC spectra of lignin model 4(m) with 13C-labeled methoxy groups (Fig. 3c).
The long- and short-chain lignin models were separated via silica gel chromatography and subjected to NMR and binding analysis. Size exclusion chromatography (SEC) revealed that the 13C-labeled lignin models had narrow molecular weight distributions (Figure S2, Additional file). The degree of polymerization (DP) was calculated for each model compound from its weight-average molecular weight (Mw). The DPs of the long-chain lignin models ranged from 4.16 to 4.70, whereas those of the short-chain models ranged from 2.64 to 3.12 (Table 1). Although differences between the Mws of the long- and short-chain models were small, their NMR spectra (Fig. 3d) and SEC profiles (Figure S2, Additional file) were clearly distinct. Therefore, the long- and short-chain lignin models could be used to evaluate the effects of Mw on molecular interactions with TrCBM1.
Table 1
Molecular weight parameters of 13C-labeled and unlabeled β-O-4 lignin oligomer model compounds.
Lignin models | Chain length | Mn | Mw | Mw/Mn | DPa |
Compound 4(Arβ) (13C-labeled aromatic rings and β positions) | Long | 843 | 923 | 1.095 | 4.70 |
Short | 480 | 520 | 1.083 | 2.64 |
Compound 4(α) (13C-labeled α positions) | Long | 750 | 918 | 1.224 | 4.67 |
Short | 531 | 614 | 1.156 | 3.12 |
Compound 4(m) (13C-labeled methoxy groups) | Long | 629 | 818 | 1.301 | 4.16 |
Short | 482 | 587 | 1.218 | 2.98 |
Compound 4 (unlabeled) | Long | 882 | 964 | 1.092 | 4.91 |
Short | 552 | 560 | 1.014 | 2.85 |
a DP: Degree of polymerization calculated from the Mw and theoretical molecular mass of the lignin model. |
Analysis of interactions between lignin models and TrCBM1
Interactions between the lignin models and TrCBM1 were evaluated via CSP analysis using 2D 1H-13C HSQC NMR. To analyze the binding positions in lignin model compounds 4(Arβ), 4(α), and 4(m) with a high degree of sensitivity, the compounds were separated into high and low molecular mass fractions. Assignment of the 13C-lignin model HSQC signals was accomplished for all of the carbon atoms and non-exchangeable protons. Almost all of the signals from the A, B, and C rings were assigned separately, because the tops of the peaks were distinguishable [18]. We previously found that the lignin models have two diastereomers. Although these A and B rings have similar alignments, the C ring are differently aligned and designated as C(I) and C(II) [18]. Full assignment of the peaks in the lignin model spectra enabled us to analyze the interactions of carbon atoms and protons at the atomic level.
The 2D 1H-13C HSQC spectra of the lignin models (50 µM) in the presence and absence of TrCBM1 were superimposed (Fig. 4). Several HSQC signals acquired in the presence of 200 µM TrCBM1 showed obvious perturbation, particularly those of the aromatic regions in the long-chain lignin spectra. Most of these signals exhibited larger perturbations when the concentration of TrCBM1 was increased to 350 µM (Fig. 4b). In contrast, there was no significant perturbation of NMR signals from the aliphatic regions and methoxy groups in the presence of TrCBM1 (Fig. 4c). The results clearly showed that the aromatic regions in the long-chain lignin models were the primary sites of interaction with TrCBM1. However, no distinct perturbations were observed in signals from the methoxy groups or any of the aromatic and aliphatic regions when the short-chain lignin models were used for CSP analysis (Figures S3, Additional file). This indicated that the length of the lignin chains was an important factor in TrCBM1 binding.
The change in the chemical shift (Δδ) of each signal was calculated by subtracting the chemical shift value in the spectrum of the lignin model from that of the model recorded in the presence of TrCBM1. The Δδ values obtained in the 1H and 13C-axis are plotted in Fig. 5. Δδ values were large only for the signals from the aromatic regions in the long-chain lignin models, whereas the Δδ values of signals from the aliphatic regions and methoxy groups in both the long- and short-chain lignin models were small. The Δδ values of the A 2, C 2(I), B 5, C 5(I), C 5(II), A 6, C 6(I), and C 6(II) signals from the aromatic regions of the long-chain lignin models were all larger than 0.006 ppm on the 1H-axis, whereas the Δδ values of the A 2 and C 5(I) signals on the 13C-axis were greater than 0.05 ppm (Fig. 5a). For the short-chain lignin models, only the Δδ values of the B 5 signals on the 1H-axis exceeded 0.006 ppm (Fig. 5b). Line broadening was observed in the B 5 signals of the long-chain lignin models in the presence of TrCBM1 (350 µM). The observed line broadening could be attributed to both the on and off rates of complex formation and the multiple binding states of lignin due to non-specific binding [17].
Adsorption experiments with the lignin models
The unlabeled long- and short-chain lignin models of compound 4 were used to evaluate TrCBM1 binding affinity according to the Langmuir adsorption model. The Mws and molecular weight distributions of the unlabeled lignin models (Table 1) were nearly identical to these of the 13C-labeled lignin models (Figure S2, Additional file). The TrCBM1–His tag fusion protein was used instead of TrCBM1 without a His tag to conduct the adsorption experiments, because the soluble lignin models could not be separated from the TrCBM1 protein via centrifugation. After incubating the sample solutions containing the lignin models and TrCBM1–His tag at 50 °C for 1 h, cOmplete His Tag Purification Resin (Roche, Basel, Switzerland) was added to the solutions. The unbound lignin models in the supernatant could then be separated from the bound lignin models, which were adsorbed to the precipitated TrCBM1–His tag bound to the His tag resin. We confirmed that all of the TrCBM1–His tag would adsorb to the His tag resin by performing control experiments. We did not observe unwanted binding between the His tag resin and lignin models (Figures S6 and S7, Additional file). The concentrations of the unbound lignin models in the supernatants were determined to calculate the adsorption parameters summarized in Table 2. Although the long- and short-chain lignin models had similar Γmax values, the KL of the long-chain lignin model was eight times higher than the KL of the short-chain lignin model. This result was consistent with the results of the NMR interaction analysis. The percentages of the lignin models that bound to TrCBM1 were calculated using the KL values. In the CSP analysis, 84.6% and 91.4% of the long-chain lignin models were bound to TrCBM1 at TrCBM1 concentrations of 200 and 350 µM, respectively. At TrCBM1 concentrations of 200 and 350 µM, 33.0% and 46.6% of the short-chain lignin models, respectively, were bound to TrCBM1. The chain length was thus an essential factor in binding between TrCBM1 and the lignin chains, which contained β-O-4 linkages exclusively. We also found that strong adsorption required a DP above 4.
Table 2
TrCBM1–His tag adsorption parameters of the unlabeled lignin models according to Langmuir adsorption model.
Lignin model | Langmuir affinity constant, KL (mL/mg) | Amount of adsorption at saturation, Γmax (µg/mg) |
Long-chain | 36.26 | 8.77 |
Short-chain | 4.47 | 9.15 |