Strains used in this study
Unless otherwise stated, the yeast Saccharomyces cerevisiae CEN.PK 113-7D (MATα, MAL2-8c, SUC2) was used. For quantitative proteome analysis, S. cerevisiae CEN.PK113-7D lys1::kanMX strain was used.
Media and culturing methods
Minimal mineral medium was used, which contained 10 g of glucose, 5 g of (NH4)2SO4, 3 g of KH2PO4 and 0.5 g of MgSO4 per liter, with 1 mL of trace metal solution and 1 mL of vitamin solution. The trace metal solution contained per liter: FeSO4•7H2O, 3 g; ZnSO4•7H2O, 4.5 g; CaCl2•2H2O, 4.5 g; MnCl2•4H2O, 1 g; CoCl2•6H2O, 300 mg; CuSO4•5H2O, 300 mg; Na2MoO4•2H2O, 400 mg; H3BO3, 1 g; KI, 100 mg; Na2EDTA•2H20, 19 g (pH=4). The vitamin solution contained per liter: d-biotin, 50 mg; 4-aminobenzoic acid, 0.2 g; Ca pantothenate, 1 g; pyridoxine-HCl, 1 g; thiamine-HCl, 1 g; nicotinic acid, 1 g; and myoinositol, 25 g (pH=6.5). The chemostat feeding medium was the same as the minimal mineral medium except for glucose, wherein 7.5 g/L was used instead of 10 g/L.
To generate the internal standard for quantitative proteomics, the lysine auxotrophic strain (CEN.PK113-7D lys1::kanMX) was cultured with heavy labeled 15N, 13C-lysine (Cambridge Isotope Laboratories). Fully labeled biomass (> 95% incorporation) was produced and harvested under four different stages of the culture process: Fed-batch cultures of the auxotrophic strain were carried out in three 1L bioreactors with three exponential feeding rates : 0.1 h-1, 0.2 h-1, 0.35 h-1, and the feeding continued for at least one dilution volume before cells harvest. The harvest biomass samples were mixed together thus comprised of cells in the batch phase (Sample S1) and in each of the three exponential feeding phases (Sample S2 stands for sample of 0.1 h-1, Sample S3 for 0.2 h-1 and Sample S4 for 0.35 h-1). This was performed in order to collect biomass with varying proteome compositions, which would enable a broad spectrum of heavy labeled proteins so that we can got as more quantifiable proteins as possible.
All remaining cultures were carried out under glucose limited chemostat conditions with nine different dilution rates, ranging from 0.025 h-1 to ~0.4 h-1, covering both respiratory (<0.28 h-1) and respiro-fermentative metabolism (>0.28 h-1). Experiments were carried out in 1 L bioreactors (Dasgip, Julich, Germany) equipped with an on-line off-gas analysis system alongside pH, temperature and dissolved oxygen sensors. An initial batch culture was carried out with inoculation of 10 % seed cultures. The chemostat cultures were performed in 1 L bioreactors with a working volume of 0.5 L under aerobic conditions (DO>40%) at 30 ℃ and pH 4.5. The continuous operation of chemostat cultures were performed at the end of the batch culture. To ensure cells were growing at a steady-state, chemostat cultures were run for at least five residence times before sampling.
Sampling
For extracellular metabolome measurements, broth was sampled and filtered immediately into 1.5 mL Eppendorf tubes and stored at -20 ℃ until high-performance liquid chromatography (HPLC) analysis was performed.
For the transcriptome sample collection, 10 mL broth was sampled and injected into a 50 mL falcon tube filled with ~3/4 ice. Cells were pelleted by centrifugation (4000 rpm, 5 min, Centrifuge 5702R, Eppendorf, Germany), and biomass pellets were snap frozen in liquid nitrogen (N2), and then transferred to a 1.5 mL Eppendorf tube and stored at -80 ℃ until further analysis.
For proteome sample collection, ~5 mL culture broth was injected into a 50 mL pre-weighed falcon tube (prechilled on ice), and the tube was reweighed after sampling to determine exact amount of broth collected. Samples were then pelleted by centrifugation (14000 rpm, 20 seconds, 4 ℃, Refrigerated Centrifuge 4K15, Sigma, Germany), and biomass pellet was washed in 1 mL chilled PBS followed by recentrifugation. Pellets were snap frozen in liquid N2, transferred to a 1.5 mL Eppendorf tube then stored at -80 ℃ until further analysis.
For the intracellular metabolome sample collection, ~7 mL culture broth was injected into a pre-weighed 50 mL falcon tube containing 35 mL -40 ℃ 100% methanol. Tube was then reweighed after sampling to determine exact amount of broth collected. Cells were pelleted in a precooled centrifuge (4000 g, 3 min, -20°C, Refrigerated Centrifuge 4K15, Sigma, Germany), supernatant was discarded and samples stored immediately at -80 ℃ until further analysis.
mRNA sequencing
Total RNA was extracted and purified using a Qiagen RNeasy Mini Kit, according to the user manual, with DNase step included (Qiagen, Hilden, Germany). RNA integrity was verified using a 2100 Bioanalyzer (Agilent Technologies, Santa Clara, USA) and its concentration determined using a NanoDrop 2000 (Thermo Scientific, Wilmington, USA).
To prepare RNA for sequencing, the Illumina TruSeq sample preparation kit v2 was used with poly-A selection. cDNA libraries were then loaded onto a high-output flow cell and sequenced on a NextSeq 500 platform (Illumina Inc., San Diego) with paired-end 2 × 75 nt length reads .
Raw data of reads generated by NextSeq 500 was processed using TopHat version 2.1.1 40 to map paired end reads to the CEN.PK 113-7D reference genome ( http://cenpk.tudelft.nl/cgi-bin/gbrowse/cenpk/ ). 8-15 million reads were mapped to the reference genome with an average map rate of 95%. Cufflinks version 2.1.1 41 was then used to calculate the FPKM values for each sample. Mapped read counts were generated from SAM files using bedtools version 2.26.0 42. Differential expression analysis was performed with the Bioconductor R package DESeq2 43.
Absolute mRNA quantification
To quantify RNA Seq read counts, 18 mRNAs were selected with FPKM values ranging 3.4 × 101 – 1.4 × 104, covering 80 % of the dynamic range of mRNA expression at reference conditions (D= 0.1 h-1). Absolute concentrations for these 18 mRNAs were then measured using QuantiGene assay (Affymetrix, Santa Clara, CA, United States). Further details of this measurement can be found in our previous publication 24. A positive linear correlation with a Pearson R2 value of 0.8 was achieved among these 18 selected mRNA absolute concentrations and their corresponding FPKM values (Table S2). The same correlation was then applied to all remaining mRNAs generated from RNA Sequencing, to quantify their respective absolute mRNA levels. Absolute values of mRNA under other dilution rates were then calculated based on the fold change obtained from differential expression analysis relative to the reference chemostat (D= 0.1 h-1 ). Assume the weight of yeast cells does not change under different specific growth rates, so a cell weight of 13 pg measured under reference condition (D= 0.1 h-1) was applied to all other chemostat conditions. The same assumption with a constant cell weight was applied for proteome absolute quantification. The calculated absolute mRNA concentrations were subsequently presented in the unit of [molecules/cell].
Total and phospho-proteome sample preparation
Cell pellets were resuspended in 10 volumes (relative to cell pellet) of 6 M guanidine HCl, 100 mM Tris-HCl pH 8.0, 20 mM DTT, heated at 95℃ for 10 min and sonicated with Bioruptor (Diagenonde, Denville, NJ, United States) sonication (15 min, “High” setting). Samples were further processed with FastPrep24 (MP Biomedicals, Santa Ana, CA, United States) twice at 4 m/s for 30 s with cooling between cycles. After removal of beads, the samples were precleared with centrifugation at 17,000 g for 10 min at 4℃. After protein concentration measurement with Micro-BCA assay (Thermo Fisher Scientific, Wilmington, USA) samples were spiked at 1:1 ratio with the heavy lysine labelled standard. For absolute quantification, 6 µg of heavy standard was spiked separately with 1.1 µg of UPS2 protein mix (Sigma Aldrich). Overall, 50 µg of protein was precipitated with a 2:1:3 (v/v/v) methanol:chloroform:water extraction. The precipitates were suspended in 7:2 M urea:thiourea and 100 mM ammonium bicarbonate. After disulfide reduction with 2.5 mM DTT and alkylation with 5 mM iodoacetamide, proteins were digested with 1:50 (enzyme to protein) Lys-C (Wako Pure Chemical Industries, Osaka, Japan) overnight at room temperature. The peptides were desalted using C18 material (3M Empore) tips and reconstituted in 0.5% trifluoroacetic acid (TFA).
For the phosphoproteome analysis cells were lysed as described above, except samples were not mixed with the heavy standard and proteins were digested with dimethylated porcine trypsin (Sigma Aldrich, St. Louis, MO, United States) instead of Lys-C. Sample preparation was carried out as described by the EasyPhos protocol 44. 500 µg of cellular protein was used as input for the phosphopeptide enrichment. Final samples were reconstituted in 0.5% TFA.
Nano-LC/MS/MS analysis for protein quantification
2 µg of peptides (for phosphoenriched samples, the entire sample) were injected to an Ultimate 3000 RSLC nano system (Dionex, Sunnyvale, CA, United States) using a C18 cartridge trap-column in a backflush configuration and an in-house packed (3 µm C18 particles, Dr Maisch, Ammerbuch, Germany) analytical 50 cm × 75 µm emitter-column (New Objective, Woburn, MA, United States). Peptides were separated at 200 nL/min (for phosphopeptides: 250 nL/min) with a 5-40% B 240 and 480 min gradient for spiked and heavy standard samples, respectively. For phosphopeptides a 90 min two-step separating gradient was used, consisting of 5-15% B 60 min and 15-30% B 30 min. Buffer B was 80% acetonitrile + 0.1% formic acid and buffer A was 0.1% formic acid in water. Eluted peptides were sprayed to a quadrupole-orbitrap Q Exactive Plus (Thermo Fisher Scientific, Waltham, MA, United States) tandem mass spectrometer (MS) using a nano-electrospray ionization source and a spray voltage of 2.5 kV (liquid junction connection). The MS instrument was operated with a top-10 data dependent MS/MS acquisition strategy. One 350-1400 m/z MS scan (at a resolution setting of 70,000 at 200 m/z) was followed by MS/MS (R=17,500 at 200 m/z) of the 10 most-intense ions using higher-energy collisional dissociation fragmentation (normalized collision energies of 26 and 27 for regular and phosphopeptides, respectively). For total proteome analysis, the MS and MS/MS ion target and injection time values were 3 × 106 (50 ms) and 5 × 104 (50 ms), respectively. For phosphopeptides, the MS and MS/MS ion target and injection time values were 1 × 106 (60 ms) and 2 × 104 (60 ms), respectively. Dynamic exclusion time was limited to 45 s, 70 s and 110 s for phosphopeptide, spiked samples and heavy standard runs, respectively. Only charge states +2 to +6 were subjected to MS/MS, and, for phosphopeptides fixed first mass was set to 95 m/z. The heavy standard was analyzed with three technical replicates, all other samples with a single technical replicate.
Mass spectrometric raw data analysis and proteome quantification
Raw data were identified and quantified with the MaxQuant 1.4.0.8 software package 45. For heavy-spiked samples the labelling state (multiplicity) was set to 2, and Lys8 was defined as the heavy label. Methionine oxidation, asparagine/glutamine deamidation and protein N-terminal acetylation were set as variable modifications, cysteine carbamidomethylation was defined as a fixed modification. For phospho-analysis serine/threonine phosphorylation was used as an additional variable modification. Search was performed against the UniProt (www.uniprot.org) S. cerevisiae S288C reference proteome database (version from July 2016) using the LysC/P (trypsin/P for phosphoproteomics) digestion rule. Only protein identifications with a minimum of 1 peptide with 7 amino acids long were accepted, and transfer of peptide identifications between runs was enabled. Peptide-spectrum match and protein false discovery rate (FDR) was kept below 1% using a target-decoy approach with reversed sequences as decoys.
In heavy-spiked samples, to account for any H/L 1:1 mixing deviations normalized H/L ratios (by shifting median peptide log H/L ratio to zero) were used in all down-stream quantitative analyses. Protein H/L values themselves were derived by using the median of a protein’s peptide H/L ratios, and requiring at least one peptide ratio measurement for reporting quantitative values. Signal integration of missing label channels was enabled. For enriched phosphoproteome samples, an in-house written R script based on median phosphopeptide intensity was used to normalize the phosphopeptide intensities.
The heavy spike-in standard used for deriving the copy numbers was quantified using the iBAQ method as described by Scwanhausser et al 46. Essentially, UPS2 protein intensities were divided by the number of theoretically observable peptides, log-transformed and plotted against log-transformed known protein amounts of the UPS2 proteins. This regression was then applied to derive all other protein absolute quantities using each protein’s iBAQ intensity. The relative ratios of individual proteins to total protein were then converted to protein concentration in cell by multiplying the total protein content in cell for each condition. The total protein content in cell under each condition was measured using the modified Lowery method.
Phosphorylation regulation analysis
A recently developed method47 for functional phosphorylation event identification was used for phosphorylation regulation analysis in this work. For details of the model and method the readers are recommended to the original publication. Briefly, it has been shown that correlation between changes in fluxes and phosphorylation levels suggests the contribution of the phosphorylation events to the fluxes. A phosphorylation event is inferred to activate enzyme activity if the correlation is positive while to inhibit enzyme activity if negative. Therefore, correlation analysis was performed in this study for the fold-change values of fluxes and phosphopeptide intensities by comparing with a reference dilution rate.
Relative metabolome quantification48
For intracellular metabolomic analysis, frozen biomass pellets were delivered to Metabolon, Inc. (Durham, NC, USA) wherein non-targeted MS was performed. Briefly, metabolites were identified by matching their ion chromatographic retention index and mass spectral fragmentation signatures to the Metabolon reference library of chemical standards. Relative quantification of metabolite concentrations was then performed via peak area integration.
Metabolite quantification and data normalization. Peaks were quantified using area-under-the-curve. For studies spanning multiple days, a data normalization step was performed to correct variation resulting from instrument inter-day tuning differences. Essentially, each compound was corrected in run-day blocks by registering the medians to equal one (1.00) and normalizing each data point proportionately (termed the “block correction”). For studies that did not require more than one day of analysis, no normalization is necessary, other than for purposes of data visualization.
GECKO modelling details
The enzyme-constrained model ecYeast7, version 1.4, was used from release 1.1.1 of GECKO: https://github.com/SysBioChalmers/GECKO/releases/tag/v1.1.1. A fraction of metabolic enzymes of f = 0.4461 g/g was assumed based on Pax-DB data, and an average saturation of σ = 0.49 was assumed for any non-measured enzyme. Measured protein content was rescaled to be proportional to previous measurements of 0.46 g/gDW at 0.1h-1. Manual curation was performed to some kcat values, exchange fluxes of pyruvate, acetaldehyde and (R,R)-2,3-butanediol were blocked, and a non-growth associated maintenance (NGAM) of 0.7 was assumed for all growth conditions.
For each condition, the corresponding rescaled proteomic data was overlaid as constraints to any protein that had a match, taking out zero values. An additional standard deviation was added for each protein to avoid over-constrained models, and all 4 complexes from Ox.Pho. were scaled to be proportional to the average measured subunit. For all undetected enzymes, an overall “pool” constraint was used, equal to the difference between the protein content and the sum of all measured proteins, multiplied by the previously mentioned f and σ. Additional details plus the full implementation of this process is available in the script limitModel.m.
For each condition, the previously resulting model was used to fit the chemostat data by modifying the growth-associated maintenance and optimizing for biomass growth. The implementation of this is available in dilutionStudy.m. Finally, as several proteins showed to be a big limitation due to an extremely low detected measurement, the mentioned models were flexibilized so they could at least grow at the desired biomass growth rate with the available glucose. This implementation is available in flexibilizeModels.m.
Total Protein Content Measurement
Total protein contents of each condition were measured using the modified Lowry method 49. The total protein content measured at all dilution rates (n=3, ± standard deviation) are as follows:
Specific growth rate
(h-1, AVG±SD)
|
0.027
±0.002
|
0.044
±0.001
|
0.102
±0.001
|
0.152
±0.003
|
0.214
±0.011
|
0.254
±0.003
|
0.284
±0.013
|
0.334
±0.009
|
0.379
±0.011
|
Total protein content
(%, AVG±SD)
|
29.9
±0.3
|
31.2
±0.2
|
37.1
±0.7
|
41.9
±2.4
|
32.0
±4.8
|
37.8
±0.8
|
39.1
±0.9
|
41.9
±0.7
|
39.1
±6.5
|