Strain construction
We performed adaptation experiments varying several conditions, including genotype, growth medium, and adaptation strategy (Table 1). To understand the effect of genotype on ethanol tolerance, we started with a strain engineered to produce ethanol at high titer by introduction of a heterologous ethanol production pathway (LL1570 (Hon et al. 2018)). Previously, we have observed that strains engineered for increased ethanol tolerance (Biswas et al. 2014) or decreased acetate production accumulate lactate (Argyros et al. 2011). We deleted the ldh gene to prevent lactate accumulation (Lo et al. 2015). In some strains, we also expressed genes for alternative glycolytic enzymes (gapDH or gapN). We have previously shown that expression of the gapDH gene from T. saccharolyticum can reduce ethanol inhibition in C. thermocellum (Tian et al. 2017). We have performed thermodynamic analysis that suggested that the non-phosphorylating GapN enzyme might allow increased ethanol titer (Dash et al. 2019).
To understand the effect of growth medium on ethanol tolerance, we performed experiments in either chemically defined (MTC-5 (Cui et al. 2020)) or rich (CTFUD (Olson and Lynd 2012)) medium. For the strains expressing alternative glycolytic enzymes, we performed adaptation in a rich medium to give the cells more metabolic flexibility to use the heterologous genes. For the other strains, we performed adaptation in a defined medium to restrict metabolic flexibility and prevent the accumulation of auxotrophic mutations (Fig. 1).
To understand the effect of ethanol adaptation strategy on ethanol tolerance, we performed adaptation under two different strategies: continuous increase or alternating high and low concentrations (Table 1). In the continuously increasing approach, the concentration of added ethanol was increased once cells were growing well. As an alternative strategy, we grew cells on alternating high and low concentrations of ethanol. This strategy has been proposed as a way to select for constitutive expression of a desired trait, and has been used successfully in prior C. thermocellum ethanol adaptation experiments (Shao et al. 2011).
Table 1
Lineage group | Genotype | Medium | Ethanol addition strategy |
A | T. sacch ethanol production pathway, and altered glycolytic genes | Rich | Alternating high and low |
B | T. sacch ethanol production pathway | Defined | Alternating high and low |
C | Continuously increasing |
Adaptation
Un-adapted strains were able to grow in the presence of 20 g/L ethanol in all conditions, and we therefore used this as a starting ethanol concentration for almost all of our adaptation work (the LL1790, LL1791, and LL1792 lineages were started at lower ethanol concentrations, but the ethanol concentration was increased to 20 g/L after just a few transfers) (Fig. 2). Strains grown in rich medium were able to grow in the presence of 60 g/L ethanol after a relatively brief period (40 days) of adaptation. In defined medium, the alternating strategy for ethanol adaptation caused very long lag phases and was abandoned. Using the continuously-increasing strategy for ethanol adaptation in defined medium, the maximum ethanol tolerance achieved was 40 g/L. The experiment was stopped due to logistical limitations of the COVID 19 pandemic, so it is not known whether additional transfers would have further increased ethanol tolerance.
Genome resequencing
To identify genetic modifications that had occurred during adaptation, we performed whole-genome sequencing on the adapted strains. A complete table of mutations is included in the supplement (Supplementary Table S1). To identify signatures of convergent evolution, we looked for genes that had accumulated mutations across several different strains (Fig. 3).
AdhE (Clo1313_1798). The most frequently mutated gene was adhE. All 11 of the lineages sequenced had mutations at this locus. Most of the mutations in the adhE gene were frameshift (fs) or premature stop codons (stop) that would be expected to eliminate activity. One mutation, D844Y, appeared in several parallel lineages from the same adaptation strategy (strategy C).
The multiple occurrences of the D844Y mutation could be explained either by its presence in the parent strain (LL1592) or by convergent evolution. The D844Y mutation is caused by a C→A mutation at position 2,096,168 of the genome. In the parent strain, 0 of 76 reads have an A nucleotide, suggesting that if this mutation was present in the parent strain culture, it was present at a frequency of < 1%. Furthermore, we can see that in the LL1792 and LL1790 lineages, the D844Y mutation was not present in either population after 9 transfers (Fig. 4), providing additional evidence that this mutation appeared independently in each of the lineages.
It is also noteworthy that native adhE was the only ADH gene that was targeted for mutation. Strain LL1592 lineage carries two ADH genes from T. saccharolyticum (adhEG544D and adhA), mutations were not observed in these genes in any of the adapted strains. C. thermocellum has five other genes annotated as alcohol dehydrogenases (Clo1313_0076, Clo1313_0166, Clo1313_1827, Clo1313_1833, and Clo1313_2130). In this set of genes, there was only a single mutation, A151V in Clo1313_1827. This mutation only appeared in a single lineage (LL1732 population and LL1806 isolate).
Phospholipase D (also known as cardiolipin synthase, Clo1313_0853) The Clo1313_0853 gene appears to be a target of convergent evolution for strains of C. thermocellum adapted to grow in the presence of added ethanol, although the signature is weaker than for adhE. In this gene, five different alleles were identified across seven lineages (out of 11 total lineages studied), including at least one from each of the lineage groups A, B, and C (Fig. 1). In all five cases, the mutation was a loss-of-function mutation (frameshift or stop codon). Mutations in this gene have also been found in other strains of C. thermocellum adapted to ethanol (Shao et al. 2011) and n-butanol (Tian et al. 2019). Despite several attempts, we were unable to create a targeted disruption of the Clo1313_0853 locus in the LL1592 parent strain.
Other genes. Signatures for convergent evolution were also found in RNA polymerase rpoC (Clo1313_0314), histone-family DNA binding protein (Clo1313_0638), a GntR transcriptional regulator protein (Clo1313_0710), a VTC domain protein (Clo1313_1989), and an ABC-transporter related protein (Clo1313_2323), however these signals are generally weaker (fewer strains with the mutation, fewer lineage groups with the mutation, no examples in other C. thermocellum ethanol adaptation literature) than what was observed for adhE or Clo1313_0853.
Effects on fermentation products
To understand the effect of adaptation for increased ethanol tolerance on ethanol production, we performed batch fermentations in the presence and absence of 10 g/L ethanol. We only performed fermentation experiments on strains from lineage groups A and C. We use high concentrations of substrate (50 g/L (146 mM) cellobiose), to maintain consistency with our adaptation conditions, and to allow observation of ethanol production in the presence of added ethanol. A complete table of fermentation data is presented in the supplement (Supplementary Table S2).
The primary fermentation products were glucose, ethanol, acetate, and pyruvate (Fig. 5). In most cases, the majority of cellobiose (50–80%) was converted to glucose. This is commonly observed in C. thermocellum batch fermentations with high concentrations of substrate. It is not known whether this conversion takes place intracellularly or extracellularly. Carbon recovery was 90–98% on defined medium (MTC-5), and 78–94% on rich medium (CTFUD).
We observed three general trends with respect to ethanol production. (1) Increased ethanol tolerance did not result in increased ethanol production, and in some cases, even decreased production. (2) Addition of ethanol reduced ethanol production. (3) Strains grown in rich medium (CTFUD) produced less ethanol compared to strains grown in defined medium (MTC-5).
Effect of adaptation on enzyme activity
To study the effect of adaptation on enzyme activity, we focused on the strains from lineage group C (LL1790, LL1791, LL1792, and LL1805), all of which had the D844Y mutation. Since AdhE was the most common target of mutations, we measured ALDH and ADH activity. Since we have previously observed mutations in AdhE that affect its cofactor specificity (Zheng et al. 2015), we measured both activities with both NADH and NADPH cofactors. In WT C. thermocellum, ADH activity is > 99% NADH-linked. In the LL1592 parent strain, expression of the T. saccharolyticum adhA gene results in NADPH-linked ADH activity (although levels are relatively low). The primary effect of adaptation appears to be a loss of NADH-linked ADH activity. NADH-linked ALDH activity also decreased (Fig. 6).
Characterizing the D844Y mutation
To understand the effect of the D844Y mutation, the adhE gene from C. thermocellum carrying the D844Y mutation was cloned and expressed in E. coli. Activity was measured for both the ALDH and ADH reactions with both NADH and NADPH cofactors. No activity was detected with the NADPH cofactor for either the ALDH or ADH reaction. The mutation significantly reduced ADH activity, and slightly reduced ALDH activity (Fig. 7). Since AdhE is a bifunctional enzyme, the apparent decrease in ALDH activity may actually represent a measurement artifact. For the WT enzyme, each molecule of acetyl-CoA that is consumed can result in the consumption of either one or two molecules of NADH, depending on whether or not the acetaldehyde is further converted to ethanol, and the exact number is not known. If the ADH reaction is blocked, however, the reaction stoichiometry is fixed at one NADH per acetyl-CoA.
Effects of adaptation on ethanol tolerance
To confirm that our adaptation increased ethanol tolerance, we measured the growth of strains in the presence of different concentrations of added ethanol (Fig. 8). Adaptation increased both the growth rate for a given ethanol concentration, and the maximum ethanol concentration at which growth could be initiated. All of the adapted strains showed an increase in ethanol tolerance from the 20 g/L of the parent strain (LL1592) to 35–40 g/L for the adapted strains. This closely matches the ethanol tolerance observed during the adaptation work (Fig. 2).
To confirm the genetic basis for this increased ethanol tolerance we focused on understanding the effect of adhE mutations. Initially, we tried to reintroduce the D844Y mutation using recently-developed CRISPR-based tools (Walker et al. 2020). Despite several attempts, we did not succeed. Since many of the adhE mutations were expected to completely inactivate the enzyme, we instead performed a targeted deletion of the C. thermocellum adhE in the parent strain (LL1592). Deletion of adhE could explain about half of the observed ethanol adaptation phenotype.
Effects of rich medium on ethanol tolerance
In our initial adaptation experiments, we observed that strains grown in rich medium exhibited increased ethanol tolerance. Our genome resequencing work suggested that inactivation of adhE also increased ethanol tolerance. To study the interaction between the two effects, we measured ethanol tolerance of both the WT and adhE deletion strains in both rich and defined medium. Ethanol tolerance of the WT strain was substantially improved by growth in rich medium. By contrast, in the adhE deletion strain, rich medium had very little effect on ethanol tolerance (Fig. 9).