Strains and culture media
Kluyveromyces marxiuanus strain DU3 (KmDU3) was isolated from mezcal fermentation of Oaxaca state, Mexico. This strain belongs to the strain collection of Research Center and Assistant in Technology and Design of Jalisco State (CIATEJ). KmDU3 was growth in liquid and solid YPD media (10 g L-1 yeast extract, 20 g L-1 peptone, and 20 g L-1 glucose). YPD medium supplemented with 300 µg mL-1 G418 (Sigma-Aldrich) was used for KmDU3 transformation selection. Defined mineral medium reported for Flores-Cosio et al. 2018 was used for the cultivation of the CRISPRi strains with 50 g L-1 glucose and 300 µg m L-1 of geneticin (G418, Sigma Aldrich, A1720). The medium was adjusted to pH 4.5.
Escherichia coli strain DH10β was used as a host for cloning and plasmid propagation. E. coli was grown at 37 °C in Luria Bertani (LB) medium (10 g L-1 tryptone, 5 g L-1 yeast extract, and 5 g L-1 sodium chloride) containing 100 mg L-1 ampicillin.
Genome sequencing, assembly and annotation
Genomic DNA was extracted from an overnight culture of KmDU3 using the Wizard Genomic DNA Purification Kit (Promega, USA). The genome of KmDU3 was sequenced by illumina and Oxford Nanopore sequencing technologies. For Illumina, a library of 100-pb paired-end reads was generated and sequenced on an HiSeq 2000 platform (Macrogen, Inc.). Data integrity and read quality distribution was evaluated with FastQC v0.11.9 software (Andrews et al. 2010). Then, low-quality bases and regions were removed by using Trimmomatic v0.38 (Bolger et al. 2014). For Oxford Nanopore Technologies (Oxford, UK), the library was constructed using the Rapid Barcoding Sequencing kit (SQK-RBK004) and using a FLOMIN 106 flow cell. The raw FAST5 data were basecalled using guppy basecalling program v6.3.8+d9e0f64, GPU-enabled (Oxford Nanopore Technologies) and the adaptor sequences were removed by guppy barcoding script. Low quality reads and contaminant reads were filtered out by Nanofilt v2.8.0 (De Coster et al. 2018) and Filtlong v0.2.1 (Wick, 2017) softwares.
Long clean reads generated by the nanopore sequencing platform were assembled by Flye v2.9.1 software (Kolmogorov et al. 2019) using default parameters. The resulting assembly was polished by mapping the clean reads back against the Flye draft assembly using minimap2 v2.24 (Li, 2018) to self-correct, and using the mapping files as input to Racon v1.5.0 (4X corrections, Vaser et al. 2017) and Medaka v1.6.0 (https://github.com/nanoporetech/medaka). We performed a second sequence correction of the assembly with the Illumina data using Pilon v1.23 software (Walker et al. 2014), resulting in 359x of coverage evidence. The sequence alignment between genome assembly and Illumina reads was performed with Bowtie 2 (Langmead & Salzberg, 2012). The final assembly was used to construct scaffolds with RagTag (Alonge et al. 2022) using K. marxianus DMKU3-1042 genome as a reference.
The assembled genome was annotated with Funannotate v.1.8.14 (Palmer & Stajich, 2022), using the RNAseq data as evidence of transcription, and adding functional information with InterproScan5 v.5.22.61.0 (Jones et al. 2014), and EggNog (Huerta-Cepas et al. 2014).
Plasmid construction
The CRISPRi vector and the primers for the subcloning processes were designed with the NEBuilder Assembly Tool v1.12.19 from New England BioLabs (NEB). Cloning process was performed using restriction enzymes (NEB, Sigma, abm) and Gibson-type assembly (Gibson et al. 2009) with Gibson HiFi DNA Assembly Master Mix (NEB). The assembled plasmids were transformed into E. coli DH10β by electroporation using the MicroPulser kit (Bio-Rad) 0.2 cm cells. Plasmid purification from E. coli was performed using the PureYieldTM Plasmid Miniprep System kit (Promega). All plasmids and primers used are listed in Supplementary Table 1 and 2. The oligonucleotides were synthesized by Integrated DNA Technologies, Inc. (IDT; www.idtdna.com) and ADN ARTIFICIAL S. de R.L. de C.V. (https://t4oligo.com). All the amplifications of regions of interest were performed by endpoint PCR with the SimpliAmpTM thermal cycler (Applied Biosystems), using the Q5 High fidelity DNA polymerase (NEB).
The CRISPR-dCas9 plasmid was constructed using pRS410 (Addgene #11258) containing the KanMX selection marker used for cloning selection in prototrophic yeast (Vickers et al. 2013; Walker et al. 2003). This plasmid was digested with the restriction enzymes Psp5II and PfoI. The CEN6/ARS1 cassette was amplified from pIW601-KmCRISPR (Addgene, #98907) using primers ARS1_F and ARS1_R, generating the plasmid named pMZ18-ARS. To express the catalytically inactive dCas9 fused with the repressor factor domain Mxi1, the Tef1p-dCas9-Mxi1-Cyct cassette was amplified from pRS416-dCas9-Mxi1 + TetR + pRPR1(TetO)-NotI-gRNA plasmid (Addgene #73796) using primers FW_4 and RV_4. The structural guide RNA containing a KmRPR1-tRNAGly promoter, structural guide RNA and Supt was amplified from pIW601-KmCRISPR using primer FW_1-3, FW_1-4, FW_2-3, FW_2-4 and RV_2, which contain the two different IAH1 target sequences (Supplementary Fig. 1). The reverse primer of the promoter fragment and the forward primer of the sgRNA fragment were exchanged with the two different IAH1 target sequences (Supplementary Fig. 2). The pMZ18-ARS was cut with XhoI and XbaI, and the dCas9-Mxi1 and sgRNA fragments were inserted by Gibson assembly, generating two plasmids with different sgRNAs (pMZ18-KmdCas9-g1 y pMZ18-KmdCas9-g2). Verification of plasmid construction was performed by PCR using the primers listed in Supplementary Table 2. A detailed plasmid map is presented in Fig. 1a.
Fig. 1
Design of sgRNAs of the IAH1 target gene
The nucleotide sequence of IAH1 from the reference strain K. marxianus DMKU3-1042 was retrieved from NCBI ''Nucleotide'' database, to search for its orthologs into the assembled KmDU3 genome by local alignment. Once located, sequences of ~300 nucleotides upstream of the open reading frame (ORF) of the gene were recovered to analyze the promoter sequences using YeastTSS program (McMillan et al. 2019) and using Kluyveromyces lactis as a reference. The Clustal Omega tool (Sievers and Higgins, 2014) was used to detect regulatory elements in the IAH1 promoter of KmDU3.
The specific 20-bp targeting sequences with 3’-adjacent PAM sequences were selected from the K. marxianus genome using CRISPRdirect software (Naito et al. 2015). The two sgRNA were designed near the start codon of IAH1 gene to achieve maximum silencing efficiency, following Smith et al. 2016 recommendations. The target sequences were checked for uniqueness within the KmDU3 genome by local blast. The secondary structure of the sgRNAs were analyzed with RNAfold WebServer (Lorenz et al. 2011).
Transformation of K. marxianus
Plasmid transformations were performed using a previously reported protocol (Löbs et al. 2017) with the following modifications (McMillan, 2019). KmDU3 was grown in YPD broth overnight. 1 mL KmDU3 cell was harvested by centrifugation at 5000 rpm for 1 min. After washing twice with 1 mL of Tris-EDTA 1X and 0.1 M of lithium acetate (TE/LiAc), cells were resuspended in 100 µg carrier DNA (salmon sperm DNA) and 500 ng of plasmid. 500 µL of transformation mix (40 % polyethylene glycol 3350, 0.1 M lithium acetate, 10 mM Tris-HCl (pH 7.5), 1 mM EDTA, and 70 mM DTT) was added, and the solution was incubated at room temperature for 15 min. Subsequently, the transformation mix was heating shock at 47 °C for 15 min. The mix was centrifuged at 5000 rpm for 1 min, and the cells were resuspended in 1 mL of YPD broth and incubated for 2 h. Then, the 1 mL of resuspended cells were diluted with 9 mL of YPD medium (1:10 dilution) and incubated at 30 °C for 16 h with shaking. Finally, the cells were then centrifuged and resuspended in 1 mL of YPD broth. 100 µL of this resuspended cell was inoculated in solid YPD medium supplemented with 300 μg mL-1 geneticin G418.
Fermentation system and volatile compounds quantification
Both yeast cells, each transformed with a different plasmid, were inoculated into defined mineral (DM) medium supplemented with 300 µg mL-1 G418. Wild-type yeast was inoculated in the same medium without G418. These cultures were carried out at 30 °C for 24h in constant stirring (200 rpm). Samples of these cultures were taken during fermentation at 6 and 24 h in triplicate to measure cell growth, sugar consumption, volatile compounds and RNA extraction.
The cell density was analyzed by dry weight (g). Sugar consumption was determined by DNS (dinitrosalicylic acid) method (Miller, 1959). Gas chromatography was used for volatile compounds quantification using a Head-Space Sampler (HSS Model 7694 E, Hewlett Packard, Agilent Technologies, Palo Alto, CA, USA) coupled to a gas chromatograph (GC Hewlett Packard 6890, Agilent Technologies, Palo Alto, CA, USA), using a flame ionization detector (FID) and HP-INNOWAX column (60m x 0.32mm x 0.25 mm). Program conditions were used as reported by Méndez-Zamora 2020.
RNA extractions and cDNA synthesis
RNA extraction for both the control and the two transformed cultures was performed with TRIzol reagent (Invitrogen, cat. 15596026 and 15596018), according to the protocol described by Chomczynski, 1993. The concentration and purity of the RNA was confirmed by spectrophotometry in a NanoDrop One (Thermo Scientific). The integrity and quality of the total RNA was verified by electrophoresis on 1 % agarose gel stained with Viva SybrGreen Nucleic Acid Stain (Vivantis). Genomic DNA was digested with DNase I Amplification Grade (Sigma-Aldrich, Cat. AMPD1) using 1000 ng of RNA from each sample. Subsequently, it was retrotranscribed to cDNA with the GoScript™ Reverse Transcriptase (Promega, Cat. A5000) using 250 ng of RNA from each sample. Upon completion of retrotranscription, all samples were quantified and diluted to the same concentration.
RT-qPCR and expression level quantification
Gene regulation of the IAH1 gene was evaluated by Real-time quantitative PCR (RT-qPCR), as well as the expression of other genes related to ester production (ATF1, EAT1) and targeting of metabolic pathways (ZWF1 and ADH1). These genes were searched for homology in our annotated genome.
RT-qPCR analysis was conducted with the Applied Biosystems Step One Real-Time PCR System using Maxima SYBR Green/ ROX qPCR Master Mix (Thermo Scientific, K0221) according to the manufacturer’s instructions. Each sample was analyzed twice, and a non-template control for each primer set was included. For the qPCR reaction, all genes were amplified in 25 µL reactions that contained 6.5 µL Maxima SYBR Green/ ROX qPCR Master Mix, 1 µL of template (1:4 diluted cDNA), 0.25 µL of each primer (10 µM) and 4.5 µL of sterile distilled water. The thermal cycling conditions were as follows: cDNA was heated to 95 °C for 10 min for enzyme activation followed by 40 cycles of 30 s at 95 °C, and 60 s at 60 °C. The actine 1 (ACT1) gene was used as the reference gene. The relative expression levels of each gene were calculated using the 2− ΔΔCt method (Livak & Schmittgen 2001). All primers used for RT-qPCR are listed in Supplementary Table 2.
Statistical analysis
A two-tailed unpaired T-test was performed for comparison between two samples, with a difference at P < 0.05 considered significant. One-way ANOVA with a Tukey's post hoc test was performed for analysis of groups of samples and was considered significant at P < 0.05. Samples of the wild type and the two mutant strains of KmDU3 were further analyzed using Principal Component Analysis (PCA) using the statistically significant data at P < 0.05. Statistical analysis of the data was performed with GraphPad Prism software. Graphical representations were depicted in R v3.4.4 (R Core Team 2017) using the ggplot2 package (Wickham, 2016), and edited in Inkscape v0.91 software (www.inskcape.org).