Identifying secreted biomarkers of dopaminergic ventral midbrain progenitor cells

doi:10.21203/rs.3.rs-2588191/v1

Download PDF

Research Article

Identifying secreted biomarkers of dopaminergic ventral midbrain progenitor cells

https://doi.org/10.21203/rs.3.rs-2588191/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 10 Dec, 2023

Read the published version in Stem Cell Research & Therapy →

You are reading this latest preprint version

Background: Ventral midbrain (VM) dopaminergic progenitor cells derived from human pluripotent stem cells have the potential to replace endogenously lost dopamine neurons and are currently in preclinical and clinical development for treatment of Parkinson’s Disease (PD). However, one main challenge in the quality control of the cells is that rostral and caudal VM progenitors are extremely similar transcriptionally though only the caudal VM cells give rise to dopaminergic (DA) neurons with functionality relevant for cell replacement in PD. Therefore, it is critical to develop assays which can rapidly and reliably discriminate rostral from caudal VM cells during clinical manufacturing.

Methods: We performed shotgun proteomics on cell culture supernatants from rostral and caudal VM progenitor cells to search for novel secreted biomarkers specific to DA progenitors from the caudal VM. Key hits were validated by qRT-PCR and ELISA.

Results: We identified and validated several novel secreted markers significantly enriched in caudal VM progenitor cultures (CPE, LGI1 and PDGFC), and found that these markers correlated strongly with the intracellular expression of EN1, which is a predictive marker for successful graft outcome in DA cell transplantation products. Other markers (CNTN2 and CORIN) were found to be significantly enriched in the non-dopaminergic rostral VM cultures. Key novel ELISA markers were further validated on supernatant samples from GMP-manufactured caudal VM batches.

Conclusion: We propose a panel of coupled ELISA assays that can be applied as non-invasive quality control tests for validating correct patterning of caudal VM DA cells during clinical manufacturing.

Mass spectrometry

Dopaminergic progenitors

ELISA

Quality Control

Biomarkers

cell replacement therapy

Parkinson’s Disease

Parkinson’s disease (PD) is a common neurodegenerative movement disorder with a prevalence of 1% in the population above 60 years. Dopamine-modifying medications such as Levodopa are the most common treatment strategy for ameliorating the motor symptoms of the disease, but these symptomatic treatments are associated with serious side effects and a gradual loss of efficacy as the disease progresses [1]. PD involves the relatively selective loss of dopaminergic (DA) neurons within the substantia nigra, and it is the loss of this particular neuronal subtype which is the underlying cause of the main motoric symptoms in PD patients. Based on this, DA cell replacement is a promising treatment strategy with the prospect of long-term symptomatic amelioration mediated by physiological DA release from transplanted DA neurons in the striatum. The feasibility and clinical efficacy of this approach has been demonstrated in studies using transplantation of fetal ventral midbrain (VM) tissue to the brains of PD patients [2–7]. However, due to ethical and logistical hurdles of using fetal tissue, a new generation of cellular therapies derived through directed differentiation of human pluripotent stem cells (hPSCs) has emerged [8]. In this case, hPSCs are differentiated in vitro specifically towards VM fates and then transplanted to the brain while still at the neural progenitor stage. The transplanted cells subsequently mature in the host brain to form functional DA neurons which can integrate and secrete dopamine to the surrounding host parenchyma.

To ensure safe, efficacious, and reproducible outcomes of stem cell-derived DA products, reliable and predictive quality control (QC) assays for correct DA progenitor fate must be applied. Intracellular proteins known to be expressed in VM DA progenitors, such as the transcription factors LMX1A, FOXA2 and OTX2 [9], are commonly used as surrogate markers for assessing the presence of DA precursor cells in transplanted cell populations. However, assessing intracellular marker expression by staining or RNA expression is invasive as it requires cellular fixation or lysis, and it is associated with significant sample processing time. Developing rapid and non-invasive QC measures which can identify correctly patterned VM DA fate from neural progenitor cells of non-DA fate is therefore of high value for producing cells for clinical use under Good Manufacturing Practice (GMP). Currently used QC assays further present the challenge that only caudal VM (cVM)-derived LMX1A/FOXA2/OTX2 triple-positive progenitors give rise to VM DA neurons whereas triple-positive cells of the rostral VM (rVM) produces other types of neurons, including glutamatergic neurons of the subthalamic nucleus [10, 11]. Hence, QC assays using LMX1A/FOXA2/OTX2 are unable to distinguish between cVM and rVM fates, and these markers alone – although necessary – are not sufficient to predict successful graft outcome upon transplantation [11].

In this study, we searched for novel biomarkers secreted by hPSC-derived VM progenitor cells, with the specific quality of being able to distinguish correctly patterned cVM DA progenitors from the closely related non-DA rVM progenitors, as well as from neural progenitors of other brain regions. To identify secreted markers, we applied shotgun mass spectrometry-based (MS) proteomics on harvested medium from rVM and cVM cultures around day 16 of differentiation, which is the day at which progenitor cells are harvested for the purpose of clinical transplantation [12]. Top candidates from the MS analysis were validated by qRT-PCR and ELISA assay, and from this we identified several secreted markers which were present at significantly different levels in medium from rVM and cVM cultures. We further combined two of these ELISA markers to generate a dual ELISA panel which could robustly discriminate correctly patterned cVM cells for use in clinical transplantation therapy.

Regionalized neural differentiation of hESCs

RC17 hESCs from Roslin Cells (Edinburgh, UK), normally karyotyped and mycoplasma-free, were maintained on Laminin 521 (Biolamina) coated culture dishes (Sarstedt) in StemMACS iPS Brew XF medium (Miltenyi Biotec) and passaged with EDTA (0.5 mM) once weekly. The RC17 cell line used for this work is deposited in the UK Stem Cell Bank (https://nibsc.org/ukstemcellbank), and is registered in the online registry for human pluripotent stem cells hPSCreg (https://hpscreg.eu/, number RCe021-A).

The cells were differentiated towards progenitors of dorsal forebrain (dFB), ventral forebrain (vFB), dorsal midbrain (dMB), rostral ventral midbrain (rVM), caudal ventral midbrain (cVM), dorsal hindbrain (dHB), and ventral hindbrain (vHB) fates. For all conditions, media composition, coating, seeding densities and replating steps were followed until day 16 as previously described [11, 12]. All conditions received dual SMAD inhibition (SB431542 10 µM and Noggin 100 ng/ml) from day 0–9 of differentiation. Patterning into each of the different regions was obtained by differential addition of patterning factors CHIR99021 (referred to as CHIR), SHH-C24II (referred to as SHH) and FGF8b, all from Miltenyi Biotec, as follows: dFB (no additional factors added), vFB (SHH 300 ng/ml day 0–9), dMB (CHIR 0.7 µM day 0–9 + FGF8b 100 ng/ml day 4–16), rVM (CHIR 0.7 µM day 0–9 + SHH 300 ng/ml day 0–9), cVM (CHIR 0.7 µM day 0–9 + SHH 300 ng/ml day 0–9 + FGF8b 100 ng/ml day 9–16), dHB (CHIR 2 µM day 0–9) and vHB (CHIR 2 µM day 0–9 + SHH 300 ng/ml day 0–9). The basal medium used during differentiation of all regional fates consisted of DMEM/F12 (Invitrogen) mixed 1:1 with NeuroMedium, (Miltenyi), supplemented with 1% N2 supplement from day 0–11. From day 11–16, cells were kept in NeuroMedium (Miltenyi) supplemented with 2% NeuroBrew-21 (Miltenyi) as well as BDNF (20 ng/ml) and Ascorbic acid (0.2 mM). The cell culture medium was harvested from the cells on day 11, 14 and 16, and medium from all these three timepoints was pooled for vesicle preparation by centrifugation (Experiment 3b). For global secretome analysis (Experiments 1, 2 and 3a), bovine serum albumin originating from the B27 medium was first removed from the cultures by washing the cells three times in PBS on day 16. Subsequently, the cells were cultured in NeuroMedium with 0.2% N2 supplement for 24 hours until medium harvest for MS analysis on day 17. This procedure allowed to remove BSA from the input medium for MS, thereby significantly lowering the background signal on the global secretome MS analysis.

mRNA extraction and qRT-PCR

Samples were homogenized using a QiaShredder column and RNA was isolated using RNeasy Micro kit (both from Qiagen), running on a QiaCube instrument, according to the manufacturer’s procedures. Reverse transcription was performed with random hexamer primers and Maxima First Strand cDNA Synthesis Kit (Thermo Scientific) using up to 1 µg of RNA from each sample. The complementary DNA was pipetted onto a 384-well plate, together with SYBR green Mastermix (Roche Life Sciences) and primers using an automated liquid handler (I.DOT One, Dispendix). Samples were analyzed by real-time quantitative PCR on a LightCycler 480 instrument (Roche Life Sciences) using a two-step protocol with a 60°C annealing/elongation step, for 40 cycles (Ct calculations capped at 35). All qRT-PCR samples were run in technical duplicates, and the averaged Ct values were used for calculations. Data are represented using the ΔΔCt method. For each gene and samples, the fold change was calculated as the average fold change relative to undifferentiated hESCs, based on two different housekeeping genes (ACTB and GAPDH). List of primers used, and respective sequence is provided in Table 1.

Sample preparation for whole supernatant (Global Secretome) for MS

Media samples from VM cultures harvested at day 17 (Fig. 1a, Experiment 1, n = 3 biological replicates, Experiment 2, n = 5 biological replicates, Experiment 3a, n = 6 biological replicates) were prepared for mass spectrometry using in-solution digestion. Proteins were denatured with 8M Urea (50mM Ambic) and reduced with 10 mM (50mM AmBic) Dithiothreitol (DTT) at 56^oC for 1h with 900 rpm shaking. Subsequently, samples were alkylated with 20mM (50mM AmBic) Iodoacetamide (IAA) in darkness for 30 min at room temperature. Ethanol was added to all samples with a ratio 1:9 (v/v, sample:ethanol) for protein precipitation and incubated over night at -20^oC. After precipitation, samples were centrifuged at 12000 rpm x 15 minutes at 4^oC and ethanol was removed with a pipette. Protein pellets were dried in a concentrator to remove any remaining trace of ethanol, followed by pellet dissolution in 100 µl 50 mM AmBic. For protein digestion, 2 µg Trypsin with a ratio 1:50 (w/w, Trypsin:sample) was added to each sample followed by incubation at 37^oC for 17h with shaking (350 rpm). Protein digestion was stopped by reducing pH to 4 with Formic acid (v/v 10% in AmBic). iRT peptides (Biognosys AG) were added to each sample in a ratio 1:10 (v/v iRT:sample). Samples were then dried in a concentrator and stored at -80^oC.

Table 1

List of human primers, by gene, full name, and forward and reverse sequence
Primer	Gene (full name)	Forward Primer	Reverse Primer
ACTB	actin beta	CCTTGCACATGCCGGAG	GCACAGAGCCTCGCCTT
CNTN2	contactin 2	GTCACGGGAGTACCAGAACG	TGTAGACAAAGTACTGGGCATCG
CORIN	corin, serine peptidase	CATATCTCCATCGCCTCAGTTG	GGCAGGAGTCCATGACTGT
CPE	carboxypeptidase E	CTCTGAAGACCTACTGGGAGGA	GCATTCGCAATTGGGTTACCTT
EN1	engrailed homeobox 1	CGTGGCTTACTCCCCATTTA	TCTCGCTGTCTCTCCCTCTC
EN2	engrailed homeobox 2	CCTCCTGCTCCTCCTTTCTT	GACGCAGACGATGTATGCAC
FEZF1	FEZ family zinc finger 1	GGTACATTCCACATTCGTGAGC	TCACGTGCAATAATCAAAACCA
FGF8	fibroblast growth factor 8	ACAGCGCTGCAGAATGCCAAGT	GAAGTGGACCTCACGCTGGTGC
FOXA1	forkhead box A1	GGGCAGGGTGGCTCCAGGAT	TGCTGACCGGGACGGAGGAG
FOXA2	forkhead box A2	CCGTTCTCCATCAACAACCT	GGGGTAGTGCATCACCTGTT
FOXG1	forkhead box G1	TGGCCCATGTCGCCCTTCCT	GCCGACGTGGTGCCGTTGTA
FST	follistatin	GATGGGAAAACCTACCGCAATG	CATCTGCCTTGGTACTGGACTT
GAPDH	glyceraldehyde-3-phosphate dehydrogenase	TTGAGGTCAATGAAGGGGTC	GAAGGTGAAGGTCGGAGTCA
GBX2	gastrulation brain homeobox 2	GTTCCCGCCGTCGCTGATGAT	GCCGGTGTAGACGAAATGGCCG
GDF7	growth differentiation factor 7	GACGCTGCTCAACTCCATGGCA	TTGGCGGCGTCGATGTAGAGGA
HOXA1	homeobox A1	GTACGGCTACCTGGGTCAAC	ACTTGGGTCTCGTTGAGCTG
HOXA2	homeobox A2	CGTCGCTCGCTGAGTGCCTG	TGTCGAGTGTGAAAGCGTCGAGG
HOXA3	homeobox A3	GGCCAATCTGCTGAACCTCA	GAGTTCAGATAGCCACCGGC
HOXB1	homeobox B1	GGCCTTCTCAGTACTACCCTCT	CCGTAGCTCGAGGGATGAAAAT
IRX3	iroquois homeobox 3	GGCTTGCGCCCCGTAGAAATGT	AGGAGCCAGGTCAGGTCCGAAC
LGI1	leucine rich glioma inactivated 1	CAACAATCTCCAGACACTCCCA	CCCCTCAGGTCCACATTTGTTA
LHX2	LIM homeobox 2	GGGCGACCACTTCGGCATGAA	CGTCGGCATGGTTGAAGTGTGC
LMX1A	LIM homeobox transcription factor1a	CGCATCGTTTCTTCTCCTCT	CAGACAGACTTGGGGCTCAC
NKX2-1	NK2 homeobox 1	AGGGCGGGGCACAGATTGGA	GCTGGCAGAGTGTGCCCAGA
NKX6-1	NK6 homeobox 1	GGATCCCAACTCGGACGACGAGA	AGGATGAGCTCTCCGGCTCGG
OTX2	orthodenticle homeobox 2	ACAAGTGGCCAATTCACTCC	GAGGTGGACAAGGGATCTGA
PAX5	paired box 5	CCCCATTGTGACAGGCCGTGAC	TCAGCGTCGGTGCTGAGTAGCT
PAX6	paired box 6	TGGTATTCTCTCCCCCTCCT	TAAGGATGTTGAACGGGCAG
PAX7	paired box7	CTTCAGTGGGAGGTCAGGTT	CAAACACAGCATCGACGG
PAX8	paired box 8	ATAGCTGCCGACTAAGCATTGA	ATCCGTGCGAAGGTGCTTT
PDGFC	platelet derived growth factor C	ACAAGGAACAGAACGGAGTACA	GTATGAGGAAACCTTGGGCTGT
SERPINF1	serpin family F member 1	TCGGACCCTAAGGCTGTTTTAC	CTTTCAGGGGCAGGAAGAAGAT
SHH	sonic hedgehog	CCAATTACAACCCCGACATC	AGTTTCACTCCTGGCCACTG
SIX3	SIX homeobox 3	ACCGGCCTCACTCCCACACA	CGCTCGGTCCAATGGCCTGG
SIX6	SIX homeobox 6	CTCAACAAGAATGAGTCGGTGC	ACTCCTTGGTGAACTTGTGGTT
SOX10	SRY-box 10	CTTTCTTGTGCTGCATACGG	AGCTCAGCAAGACGCTGG
TBR1	T-box, brain 1	TCGTCCCCGCTCAAGAGCGA	CCTTGGCGCAGTTCTTCTCGCA
TFF3	trefoil factor 3	TCTGGAGCCTGATGTCTTAACG	GACGCAGCAGAAATAAAGCACA
WNT1	Wnt family member 1	GAGCCACGAGTTTGGATGTT	TGCAGGGAGAAAGGAGAGAA
WNT3A	Wnt family member 3A	GCGATGGCCCCACTCGGATACT	TAGCTGCCCAGAGCCTGCTTCA

Preparation of vesicle-enriched samples for MS

To enrich for secreted vesicles, media samples harvested at day 11, day 14 and day 16 (see Fig. 1a, Experiment 3b, n = 6 biological replicates) were run in a differential centrifugation protocol in the following order: 300 g x 10 min at 4^oC, 2000 g x 10 min at 4^oC and 10 000 g x 30 min at 4^oC. In between each centrifugation step, the supernatant was transferred to new tubes. Media samples from the same cultures were pooled and transferred to ultracentrifugation tubes. Samples were ultra-centrifuged at 100 000 g x 70 minutes at 4^oC. The supernatant was discarded and 12 ml 50 mM AmBic was added to the top of each tube to wash the pellet, followed by another ultra-centrifugation step at 100 000 g x 70 minutes at 4^oC. After centrifugation, the top 11 ml of media was discarded while the remaining 1 ml volume was mixed with a pipette to dissolve the vesicle pellet. The 1 ml sample was then transferred to new tubes for MS sample preparation. Sample volumes were reduced to 100 µl using a concentrator, followed by the addition of 50 µl RIPA buffer for vesicle lysis and protein denaturation. To further improve lysis, samples were placed in a Bioruptor 300 sonication system (Diagenode) and run for 50 cycles (High Power 15s/ OFF 15 s) at 4^oC. After lysis, proteins in the samples were reduced, alkylated and precipitated according to the method for the whole supernatant samples as described above. After precipitation, samples were centrifuged at 14 000 rpm x 15 min at 4^oC and the supernatant was discarded. Samples where further dried in a concentrator to remove any trace of ethanol. To dissolve the pellet, 50 µl AmBic (100 mM) was added to each sample. In order to remove glycosylations on Asparagine residues, 1.5 µl PNGase F (Promega) was added to each sample and incubated for 18h with little shaking. For protein digestion 1.4 µg Trypsin was added to each sample with a ratio 1:50 (w/w, Trypsin:sample) and incubated at 37^oC for 22h with shaking (350 rpm). Protein digestion was stopped with 10 µl Formic acid (v/v 10% in AmBic). Samples were dried in a concentrator and stored in -80^oC.

Data-dependent acquisition MS runs (DDA)

Supernatant samples from cVM and rVM (Experiment 1) were run in DDA mode on a Q Exactive Plus (Thermo Fisher Scientific) to be used for subsequent global DDA analysis. An EASY-nLC 1000 ultra-high-performance liquid chromatography system (Thermo Fisher Scientific) was connected to the MS instrument. Peptide separation was performed on an EASY-Spray column (ES802, Thermo Fisher Scientific) by running a linear acetonitrile gradient going from 5–30% solvent B (0.1% formic acid in acetonitrile) for 90 minutes. As solvent A, 0.1% formic acid was used. MS1 spectra were acquired in profile mode with a resolution of 70 000. In each cycle, the top 15 most intense precursor were selected in MS1 for fragmentation, but with a dynamic exclusion time of 20 s. Acquired MS2 spectra were centroided, with a resolution of 17 500. Normalized collision energy for fragmentation (NCE) was set to 30. Scan range in MS1 and MS2 was set to 400–1600 m/z and 200–2000 m/z respectively. Automatic gain control (AGC) target was set to 1e6 in both MS1 and MS2. Maximum ion injection time (IT) was set to 100 ms in MS1, and 60 ms in MS2.

In order to build sample-specific spectral libraries for later DIA analyses (Experiment 3), supernatant samples from cVM and rVM (global DIA and vesicles DIA dataset), were run on a Q Exactive HF-X (Thermo Fisher Scientific) in DDA mode. Connected to the MS instrument was an EASY-nLC 1200 ultrahigh-performance liquid chromatography system (Thermo Fisher Scientific). An EASY-Spray column (ES803, Thermo Fisher Scientific) separated peptides in a non-linear acetonitrile gradient for 2h (solvent B | 1–7%:8 min, 7–12%:15 min, 12–27%:65 minutes, 27–32%:15 min, 32–37%:9 min, 37–52%:8 min, 52–90%: 2 min). MS1 spectra recorded in profile mode had a resolution of 120 000. The top 20 most abundant precursors were chosen for fragmentation in each cycle, and the dynamic exclusion time was set to 15 s. Centroided MS2 spectra were acquired at a resolution of 15 000, with NCE = 27. Scan ranges were set to 350–1650 m/z in MS1, and 200–2000 m/z in MS2 respectively. The AGC target was set to 3e6 in MS1, and 1e5 in MS2. The maximum IT was set to 20 ms in MS1, while it was set to 20 ms in MS2.

Data-independent MS acquisition (DIA)

Samples for all DIA analyses were acquired on a Q Exactive HF-X mass spectrometer (Thermo Fisher Scientific), using the same liquid chromatography (LC) system and gradient settings as for the global DDA runs to build spectral libraries. For data-independent acquisition (DIA), the instrument method was set to acquire a full MS1 scan (resolution 120 000, scan range: 350–1650 m/z) in profile mode, followed by 44 variable MS2 windows (resolution 30 000) with the following ranges: 350–371, 370–387, 386–403, 402–416, 415–427, 426–439, 438–451, 450–462, 461–472, 471–483, 482–494, 493–505, 504–515, 514–525, 524–537, 536–548, 547–557, 556–568, 567–580, 579–591, 590–603, 602–614, 613–626, 625–638, 637–651, 650–664, 663–677, 676–690, 689–704, 703–719, 718–735, 734–753, 752–771, 770–790, 789–811, 810–832, 831–857, 856–884, 883–916, 915–955, 954–997, 996–1057, 1056–1135 and 1134–1650 m/z. A stepped NCE was used for fragmentation (NCE = 25.5, 27, 30). AGC targets were set to 3e6 in both MS1 and MS2. Maximum IT was set to 60 ms in MS1 and ‘auto’ in MS2.

For later spectral library building, pooled supernatant samples (Global) and vesicle samples respectively, were run with gas-phase fractionated (GPF) DIA methods. For the pooled supernatant samples, there were 6 methods with DIA windows covering different MS1 ranges (400–500 m/z, 500–600 m/z, 600–700 m/z, 700–800 m/z, 800–900 m/z, 900–1000 m/z). Centroided MS1 and MS2 spectra were recorded with a resolution of 30 000. For the pooled vesicles samples, there were 10 GPF-DIA methods with DIA windows covering 10 different MS1 ranges respectively (300–400 m/z, 400–500 m/z, 500–600 m/z, 600–700 m/z, 700–800 m/z, 800–900 m/z, 900–1000 m/z, 1000–1100 m/z, 1100–1200 m/z, 1200–1650 m/z).

For each GPF-DIA method, a set of 51 overlapping DIA windows with a fixed window size of 4 m/z were acquired to cover the full MS1 ranges. The only exception was the GPF-DIA method for the 1200–1650 m/z range, having a fixed window size of 18 m/z. The AGC target was set to 3e6 in MS1, and 1e6 in MS2.

DDA-based spectral library generation

DDA MS raw files belonging to Experiment 3 (Global and Vesicles) were imported into Fragpipe v.16.1-build5 (https://github.com/Nesvilab/FragPipe). As database, the human proteome FASTA file was used (UP000005640, Uniprot/Swissprot release 21_03) with decoys appended (reversed target sequences). To build the spectral library, the default ‘SpecLib’ workflow was loaded and the default settings for all tools were used. In this workflow, the database search engine MSFragger v3.3 [13] was employed to identify MS/MS spectra, followed by Percolator [14] for confidence estimation. Protein grouping and post processing was performed using ProteinProphet [15] and Philosopher [16] followed by spectral library building with EasyPQP (https://github.com/grosenberger/easypqp).

DIA-based spectral library generation

DIA raw files were loaded into DIA-NN v.1.8 [17] to build a wide-window DIA spectral library for the global dataset and the vesicle dataset respectively. Confidently identified spectra (q-value < = 0.01) were extracted from each DIA file to be included in the final library. Narrow-window libraries were also built in DIA-NN for both datasets, using acquired GPF-DIA runs. Similarly, wide-window DIA spectral libraries were built for both datasets in Fragpipe v.16-build5 using the existing workflow ‘MSFragger-DIA-wide-window-SpecLib’. Also, narrow-window spectral libraries were built with the workflow option: ‘MSFragger-DIA-narrow-window-SpecLib’ using default settings. For all spectral libraries the canonical human proteome FASTA database was used (UP000005640, Uniprot/Swissprot release 21_03).

Super spectral library generation

In total, ten different spectral libraries were built for Experiment 3, five for each of the analyses, Global and Vesicles. As different library building strategies resulted in slightly different targets, the libraries were imported into R (v.4.2.1) and combined into non-redundant super spectral libraries, one for each dataset, using a custom R script.

Data analysis of global DDA runs

Raw DDA files acquired by DDA on the Q Exactive Plus were loaded into MaxQuant v.1.6.1.0 [18–20] for label-free quantification of proteins. DDA MS files were put in different parameter groups based on their Experiment (1 or 2) to ensure batch-specific normalization and quantification with the MaxLFQ algorithm [21]. Identification settings used the default false-discovery rate of 1% on protein, peptide and peptide-spectral-match level. As FASTA database, the human canonical proteome was used (UP000005640, Uniprot/Swissprot release 21_03). Match-between-runs to transfer identifications between runs was enabled. Carbamidomethylation on Cystein (UniMod:4) was set as fixed modification and variable modifications were oxidation on Methionine (Unimod:35) and acetylation on protein N-terminal (UniMod:1). For label-free quantification, it was required that at least one peptide was identified from MS/MS for pairwise comparisons. The minimum LFQ peptide ratio was set to 1, in order to allow more low-abundant proteins to be quantified.

Data analysis of DIA runs

Acquired DIA raw files acquired on the Q Exactive HF-X were searched against their respective super spectral library in DIA-NN v.1.8 [17]. The quantification strategy was set to ‘Robust LC (high accuracy)’ while cross run normalization was set to RT-dependent (default). Based on the median recommended MS1 accuracies reported by DIA-NN for each run, the MS1 accuracy was set to 7.96 ppm for the Global DIA dataset (Experiment 3a) while being set to 8.48 ppm for the Vesicles dataset (Experiment 3b). MS2 accuracies were automatically set by DIA-NN to 20 ppm for both analyses. Relaxed protein inference was enabled in DIA-NN to avoid the assignment of the same protein to more than one group during protein inference. The human proteome FASTA file (UP000005640, Uniprot/Swissprot release 21_03) was used for annotations in DIA-NN.

ELISA

Supernatant samples were collected from the differentiating cells at the day 11 and 16 and immediately frozen. ELISA kits for the targets proteins were used in according to the manufacturer’s instructions: CNTN2, CORIN, FST, PDGFC, SERPINF1, TFF3 (all from R&D Systems), CPE (Nordic Biosite), LGI1 (Cusabio) (see Table 2). Before analysis, each supernatants sample was centrifuged at > 10.000 rpm for 10 minutes to remove cell debris. Initial tests were performed to ascertain dilution factors for the various proteins and samples, although some measurements were above or below the detection limit. Sample measurements above detection limit were excluded. Samples assayed at 1:1 dilution and with measurements below the detection limit were attributed the Minimum Detectable Dose according to the manufacturer’s information, or, in the absence, the minimum calculatable value using the respective dilution curve and 4-PL curve fit. The measured protein concentration values were then normalized to the cell count in the respective well, yielding pg.ml^− 1.10^–6 cells.

Table 2

List of ELISA kits
Protein	Kit Manufacturer	Kit name	Kit target	Catalog #
CNTN2	R & D Systems	DuoSet	human Contactin-2/TAG1	DY1714-05
CORIN	R & D Systems	Quantikine	human Corin	DCRN00
CPE	Nordic Biosite	n.a.	human CPE	KBB-B314QW-96
FST	R & D Systems	Quantikine	human Follistatin	DFN00
LGI1	Cusabio	n.a.	human LGI1	CSB-EL012898HU
PDGFC	R & D Systems	Quantikine	human PDGF-CC	DCC00
PDGFC	R & D Systems	DuoSet	human PDGF-CC	DY1687-05
SERPINF1	R & D Systems	DuoSet	human SerpinF1/PEDF	DY1177-05
TFF3	R & D Systems	Quantikine	human TFF3	DTFF30

Table 2: List of ELISA kits, by target protein, manufacturer and kits respective name, described target and catalog number.

Statistical Analysis of ELISA and qRT-PCR data

All ELISA and qRT-PCR data was managed in Excel and statistically analyzed using GraphPad Prism 9 software, P < 0.05 was considered significant. For multi-regional comparisons, one-way analysis of variance (ANOVA) was performed followed by a Sidak multiple comparison test between the rVM and cVM and remaining regions. All datasets were tested for their normal and Log-Normal distribution (Shapiro– Wilk and Kolmogorov-Smirnov) and homoscedasticity (Brown–Forsythe) before ANOVA. Alternatively, a non-parametric Kruskal–Wallis analysis was conducted instead, followed by a Dunn’s multiple comparison test. All multiple comparison tests were corrected using statistical hypothesis testing.

For pairwise comparison between rVM and cVM, a two-tailed unpaired t-Test was performed, or in case the datasets and the Log-transformed datasets lacked a Gaussian distribution or showed significantly different variances, a Mann-Whitney test was performed instead.

For calculating the correlation between the EN1 mRNA expression and the ELISA-assayed Protein levels, a two-tailed Spearman correlation was performed on the Log-Log data. A straight, non-linear, least squares regression was fitted to the Log-Log data, computing the 95% confidence interval.

Statistical Analysis of DDA and DIA analyses

Result files from the Global analysis and the Vesicles analysis were imported into R for processing and differential expression analysis. The protein groups table (proteinGroups.txt) from the MaxQuant search was filtered to not contain decoys nor entries only identified by site. A quantitative matrix was extracted by selecting the ‘LFQ intensity’-columns from the table, and the quantitative values were subsequently log2-transformed. Imputation was applied to the matrices using the R package imputeLCMD [22] v.2.0, where the K-nearest neighbors algorithm impute values missing at random, while the ‘MinProb’-algorithm was used to impute values missing not at random. Differential expression analysis was performed by running a moderated t-test using the R package DEqMS [23] v.1.8.0 to compare samples belonging to cVM with those in rVM.

Output reports from DIA-NN, for the Global analysis and v´Vesicles analysis, were imported into R for downstream processing. Reports were filtered to only contain confidently identified entries (Global precursor q-value < = 0.01, Global protein group q-value < = 0.01). Quantitative protein groups matrices were computed with the MaxLFQ [21] algorithm, implemented in the R package ‘diann’ v.1.0.1 (https://github.com/vdemichev/diann-rpackage). Following log2-transformation, the matrices were filtered to only contain protein groups having at least 60% quantitative values evenly distributed among samples in both conditions (cVM or rVM), or at least 50% quantitative values given that all were present in one group only. Retained protein groups were then imputed using the ‘MinProb’ algorithm described above (see global DDA analysis). Similarly to the global DDA analysis, DEqMS [23] v.1.8.0 was used to perform differential expression analysis between samples in the cVM condition and the rVM condition.

GO-term enrichment analysis

A GO-term enrichment analysis for cellular components between the Global DIA dataset and the Vesicles DIA dataset was performed in R (v.4.2.1) with the package Clusterprofiler [24] v.3.18.1. To find enriched GO-terms for cellular components in the Global DIA dataset, the enrichGo function was used to query gene names for identified proteins in the Global DIA dataset against all identified gene names (Global DIA + Vesicles DIA). Inversely, all gene names in the Vesicles DIA dataset were queried against all identified gene names to find enriched cellular component GO-terms in the Vesicles dataset. Only significant results were considered (q-value < = 0.05, Benjamini-Hochberg FDR estimation [25]).

Identifying secreted biomarkers from ventral midbrain progenitor cells through shotgun proteomics

In order to identify relevant secreted protein candidates from cVM DA progenitor cultures, shotgun proteomics was used to analyze whole supernatant collected from cultures of hESC-derived VM cells. For this purpose, we applied a clinical grade cell line (RC17) and a differentiation protocol adapted to Good Manufacturing Practice (GMP) for producing rostral and caudal VM progenitor cells (rVM and cVM, respectively) [12], thus performing the analysis on clinically relevant cell populations. Thereby, the differentiated cVM cell populations used in this study are equivalent to cells in the STEM-PD product which has been approved for clinical trial in Parkinson’s Disease patients [8, 26]. The global secretome was analyzed in medium which was harvested from the cells around the time of transplantation (i.e., collected from day 16 to day 17 of differentiation). However, to reduce background signals of Albumin, Serotransferrin and Insulin from the basic B27-supplement-containing cell medium, cell cultures were washed three times in PBS on day 16 of differentiation, and medium was changed to a low-protein content media with 0.2% N2 supplement, which was harvested 24 hours later (day 17, see Materials and Methods; see Fig. 1a). In two initial experiments (Experiment 1, n=3 biological replicates and Experiment 2, n=5 biological replicates), Data-dependent acquisition (DDA) with label-free quantification (LFQ) was used to measure the relative protein abundances between rVM and cVM culture supernatants (Fig. 1b, 1c). To allow for a deeper protein quantification, screening less abundant targets, a third experiment (Experiment 3a, n=6 biological replicates) was carried out where quantification was obtained through Data-independent acquisition (DIA) followed by LFQ (Fig. 1d). Differential expression analysis showed several upregulated proteins in the cVM supernatant that were shared in at least 2 out of 3 experiments, such as LGI1 (Leucine-rich glioma-inactivated protein 1), FREM1 (FRAS1-related extracellular matrix protein 1), CPE (Carboxypeptidase E) and SERPINF1 (Serpin family F member 1). Likewise, several protein candidates were found to be enriched in the rVM condition, such as CNTN2 (Contactin-2), PCSK1N (Proprotein convertase subtilisin/kexin type 1 inhibitor) and NCAN (Neurocan) (Fig. 1e).

Comparing the global secretome with the proteome of vesicles

In the latest years there has been a rise in awareness to the role of extracellular vesicles in intercellular communication [27] as well as their potential as an accessible biological source to identify biomarkers by proteomic analysis [28, 29]. Therefore, to ensure the detection of differentially expressed vesicle-associated proteins, supernatant samples collected from rVM and cVM cultures between day 11 to day 16 were enriched for their vesicle content by ultra-centrifugation and analyzed using DIA (Experiment 3b, hereafter termed “Vesicles”, Fig. 1a). Similarly, LFQ followed by differential expression analysis between the rVM and the cVM samples was performed (Fig. 1f), adding a dataset of 74 differentially enriched protein targets, including STC1 (Stanniocalcin 1), OLFML3 (Olfactomedin-like protein 3) and PDGFC (Platelet-derived growth factor C), which were found to be upregulated in cVM vesicle samples. While the majority of the protein targets were unique to our Vesicle enriched samples, i.e. not found in any of the whole supernatant datasets, 4 cVM enriched targets proteins were shared with at least 2 other datasets: CPE, FST (Follistatin), LGI1 and PDGFC (Fig. 1g).

Gene ontology analysis confirmed the differential origin of the analyzed samples, with the global samples showing an enrichment for proteins of extracellular matrix as well as proteins of the secretory lumen of the endoplasmic reticulum, while the vesicle samples were enriched for membrane and ribosomal proteins (Fig. S1a). Furthermore, several protein markers characteristic of extracellular vesicles, such as ALIX (PDCD612P), TSG101, CD63, CD81, CD47 and VPS4B [30, 31] were almost exclusively detected in our vesicle samples (Fig. S1b), in accordance with the guideline for minimal information for studies in extracellular vesicles [32]. None of these baseline proteins were differentially enriched in either rostral or caudal VM samples (Table 3).

Table 3. Differential detection (cVM vs rVM) of extracellular vesicle markers in Experiment 3

	Experiment 3a: Global DIA			Experiment 3b: Vesicles DIA
EV markers	fold change	p-value	q-value	fold change	p-value	q-value
PDCD6IP (ALIX)	0,10	0,837	0,887	-1,28	1,27E-06	2,46E-05
TSG101	ND	ND	ND	-1,52	2,40E-06	3,69E-05
CD63	ND	ND	ND	-0,59	0,013	0,033
CD81	ND	ND	ND	-0,62	0,015	0,038
CD47	ND	ND	ND	-1,52	2,04E-09	4,80E-07
VPS4B	0,40	0,235	0,325	-1,00	9,39E-05	6,05E-04

Table 3: List of non-tissue specific extracellular vesicle associated proteins identified by MS-DIA in Experiment 3, in the Global Secretome samples and ultracentrifuged Vesicle-enriched samples, comparing cVM and rVM cultures. Red values indicate fold change <2 and significance >0.05. ND: not detected

From the resulting MS datasets, we next selected a list of potential candidate proteins for validation by qRT-PCR and ELISA, choosing the candidates from the following characteristics: a) confirmed identification as a differentially expressed target in two or more datasets, b) high fold-change difference between the two VM regions and c) the availability of a reliable commercial source of ELISA assays for detection of the proteins. Based on these parameters, we selected 6 differentially expressed secreted protein candidates for ELISA validation: FST, LGI1 (present in all 4 datasets), CPE, PDGFC (present in 3 datasets) and SERPINF1 (present in 2 datasets) as candidates enriched in cVM samples, and CNTN2, the most enriched rVM marker present in more than one dataset. CORIN was also added for validation not only because it was found to be enriched in the DIA global secretome analysis (Table 2), but also because this protein was previously found to be enriched on the cell surface of rVM progenitor cells compared to cVM progenitors [11]. All 7 candidates showed robust peptide detection as assessed by profile plots (Fig S1c, d). Furthermore, we included TFF3 (Trefoil factor 3) on the validation list, as this factor was previously identified as an enriched marker in DA VM progenitor cells by another group [33], though it was only barely identified in our first DDA analysis (see Table 4).

Table 4. Differential detection of candidate protein markers in all datasets

Experiment 1, Global DDA

Experiment 2, Global DDA

Experiment 3a: Global DIA

Experiment 3b: Vesicles DIA

Protein

fold change

p-value

q-value

fold change

p-value

q-value

fold change

p-value

q-value

fold change

p-value

q-value

CNTN2

-4,25

2,08E-03

0,025

-1,29

1,66E-06

6,33E-05

-3,25

2,88E-11

6,19E-09

-0,98

2,30E-04

1,23E-03

CPE

3,05

6,00E-06

4,85E-04

0,69

3,21E-03

0,019

3,80

1,65E-09

8,62E-08

3,74

7,66E-06

8,96E-05

FST

2,10

2,86E-03

0,033

2,39

1,45E-07

1,81E-05

2,93

4,06E-10

2,91E-08

2,44

4,17E-05

3,15E-04

LGI1

7,91

2,99E-08

2,41E-05

1,69

1,26E-03

9,90E-03

5,43

1,87E-12

1,27E-09

5,10

9,51E-13

7,84E-10

PDGFC

2,48

7,39E-05

3,14E-03

0,48

0,051

0,141

2,02

2,09E-04

8,53E-04

2,73

1,15E-07

5,90E-06

SERPINF1

3,12

2,07E-05

1,19E-03

2,21

6,27E-08

1,81E-05

0,40

0,049

0,088

1,49

0,062

0,121

CORIN

-2,71

4,91E-07

5,76E-06

-1,65

3,10E-03

0,011

TFF3

2,07

6,52E-04

0,011

Table 4: Differential detection of the supernatant target proteins selected for further validation, from all MS analyses and experiments. Red values indicate fold change <2 (<1.5 for experiment 2 – Global DDA) and significance >0.05. ND: not detected

Validating expression profiles of secreted cVM candidates in a new set of samples

To further assess the discriminative potential of the selected candidate markers in a new set of differentiated neural progenitor samples, we performed quantitative reverse-transcription PCR (qRT-PCR) analysis for the expression of the respective genes in rVM and cVM cultures on day 16 of differentiation. To this aim, we created a new set of samples obtained from hESC differentiated towards rVM and cVM as well as towards other neural tube regions for comparison (see Fig. 2a). In line with the MS data, we observed that transcription of CORIN was significantly upregulated in the rVM samples, while CPE, LGI1 and PDGFC expression was increased in the cVM samples (Fig. 2b). Though not significantly, CNTN2 and SERPINF1 appeared to be increased in rVM and FST elevated in cVM. On the other hand, TFF3 expression was indistinguishable between the two regional VM samples. We then performed ELISA on the supernatant of the rVM and cVM samples, confirming that CNTN2 and CORIN were elevated in the supernatant from the rVM cultures, while CPE, LGI1 and PDGFC were enriched in the cVM cultures (Fig. 2c). In line with the transcriptional data, FST tended to an increase in cVM samples and TFF3 depicted no difference between the VM regions. SERPINF1 ELISA analysis was fraught by a high spread in protein concentration, despite preliminary dilution testing, resulting frequently (over 30%) in values above the detection limit.

We next proceeded to assess the specificity of these markers in VM cultures compared to neural progenitors of other regional fates. To this aim, hESCs were differentiated to other neural tube regions (dorsal Forebrain, dFB; ventral Forebrain, vFB, dorsal Midbrain, dMB, dorsal Hindbrain, dHB; and ventral Hindbrain, vHB), and the cultures were verified for correct regional fates by qRT-PCR using a panel of regional neural tube markers [34] (Fig. S2a). By performing ELISA on the supernatant samples, we could generally observe elevated protein levels of the selected markers in the VM samples in comparison with the other neural regions. In particular, CORIN showed a clear specificity to the rVM whereas PDGFC was specific to the cVM, compared to all other neural regions tested. TFF3 depicted a strong enrichment for both VM regions in comparison to all other neural regions (Fig. 2d). Our data however showed that although TFF3 was a highly specific secreted marker of the VM, it could not discriminate between rostral and caudal VM samples.

Designing a dual ELISA panel for discriminating rVM and cVM samples

Given the observations above, we next asked if we could apply some of the identified markers as a potential non-invasive, quality control method to distinguish a successful hESC differentiation towards bona fide cVM DA-progenitors from an unsuccessful differentiation towards the non-dopaminergic rVM. We first sought to investigate whether supernatant harvested at an earlier time point could predict the outcome of the VM cell fates on day 16. However, the ELISA analysis on day 11 supernatants showed lower or equally low levels for all selected proteins, and consequently showed no significant difference between rVM and cVM samples (Fig. S2b). We therefore focused on developing a QC assay for assessment of the cultures at day 16 and hypothesized that combining the measurements of two secreted markers could provide an optimized non-invasive QC assay with higher reliability and without the need of a normalizing to cell count. Based on our day 16 results, we calculated the ratio between our positive markers for cVM (CPE, FST, LGI1, PDGFC) and either a VM specific marker (TFF3) or a marker enriched in rVM samples (CNTN2 or CORIN). We observed that TFF3 worked poorly as a counterbalance marker due to its variable results, including analyses over the detection limit, despite previous dilution testing (Fig. S3a). On the other hand, the ratios with both positive markers of rVM, CNTN2 and CORIN, evidenced a significant discrimination between rVM and cVM (Fig. S3b, c). In particular, ratios of the positive cVM markers FST, LGI1 and PDGFC levels over CORIN or CNTN2 values showed the clearest contrast between the two VM samples (Fig. S3b, 3c). To substantiate our findings in a clinically relevant context, we subjected 4 supernatant samples from clinical batches of day 16 cVM-DA progenitor cells (STEM-PD product, manufactured under GMP conditions) to the same ELISA panel. We confirmed that the protein ratios of these GMP-produced samples fell in line with the other correctly specified research-grade cVM samples (GMP samples marked in red in Fig. 3a).

We further investigated the relationship between the ELISA assayed proteins and the transcriptional expression of EN1, which is a highly relevant progenitor cell marker predictive of successful graft outcome with bona fide midbrain DA neurons required for PD cell therapy [11, 12, 35]. By performing a Spearman correlation analysis between the EN1 mRNA expression levels in VM cells at day 16 and the secreted QC candidate proteins in the supernatant, we found that CPE, FST, LGI1 and PDGFC correlated positively with EN1 expression levels, whereas CNTN2 and CORIN correlated negatively, as expected from the rVM versus cVM enrichment profile for these markers, respectively (Fig. 3b). In contrast, SERPINF1 and TFF3 levels showed no correlation to EN1 expression on day 16 (data not shown). Importantly, the combined protein ratios were also positively correlated with high EN1 expression (Fig. 3c), thereby further emphasizing the predictive value of this proposed dual ELISA QC assay for GMP manufacturing. Specifically, the LGI1 ratios presented the most stringent and strongest positively correlation with EN1 expression. Altogether, our results show that majority of the proteins selected from the initial unbiased MS-based discovery study could be used to predict the outcome of ongoing cell differentiations. In particular, the ratios between positive and negative markers of cVM showed high specificity towards correctly patterned cVM cultures with high EN1 expression.

As several stem cell-derived products for cell replacement therapies are approaching clinical trials, there is an increasing need to implement GMP compatible assays to provide quality control screening of the in vitro differentiated products during manufacturing. Implementation of current GMP compatible procedures to the developing protocols allows not only for a more expedited regulatory surveillance of batch manufacturing, but also provides a bridge between the conventional established assays and newer quality-control procedures more adequate to the assessment of a live cell product during GMP manufacturing. For decades, shotgun proteomics has allowed for unbiased identification and quantification of thousands of proteins, aiding in the elucidation of their biological roles or in the discovery biological markers for a variety of purposes [36–38]. We show here that by applying this approach on the supernatant of cell preparations with clinical relevance to the treatment of Parkinson’s Disease, we could identify novel QC markers which could readily be measured and validated by ELISA.

The implementation of both DDA and DIA methods, as well as enrichment of extracellular vesicles in the comparative analysis between rVM and cVM cultures allowed the unbiased identification of secreted target proteins consistently contrasting between these two neighboring regions. This is an important feature of the identified markers, as the rVM and cVM are normally difficult to discriminate due to their extreme similarity in gene and protein expression pattern [10, 11]. These markers could also distinguish at a transcriptional level the two VM regions, as they showed a statistically significant correlation between their mRNA and respective secreted protein levels (data not shown). Our vesicle-enrichment procedure yielded samples abundant with vesicle markers, and it confirmed several key candidates from the global secretome data, which we pursued as ELISA QC markers (LGI1, FST, CPE and PDGFC). In addition, the vesicle MS detected differential expression of more lowly abundant growth factors involved in developmental patterning (i.e. GDF11 and FGFBP3), which could be relevant to pursue as potential patterning factors in the cVM differentiation protocol. Interestingly, the WNT protein WNT5A, which has previously been proposed as a midbrain patterning factor in vivo in the mouse and in vitro in human stem cells [39] was enriched in the rVM non-dopaminergic progenitor cultures, and not in the cVM cells.

TFF3, which was previously identified as a VM-specific marker through an unbiased transcriptomic comparison to dorsal forebrain (dFB)-patterned ESCs [33], also in our study showed significantly increased protein and mRNA levels in VM cultures in comparison to dFB cultures in our study. However, TFF3 might not be useful for monitoring DA progenitor patterning, as its expression was indistinguishable between rVM and cVM cultures. Our data also showed that the floor plate maker CORIN [40] was markedly elevated in rVM supernatant in comparison to the cVM, and in inverse correlation with the expression levels of EN1, a bona fide indicator of authentic DA VM progenitors for PD therapy [11, 12, 35]. This is of interest given that an ongoing clinical trial in Japan applies flow cytometric purification of CORIN-positive progenitor cells for transplantation to the brains of PD patients [41, 42]. Similarly, CNTN2, has also previously been associated to LMX1A-GFP-sorted VM ESC-derived cells [43], but we show here that this marker is mainly enriched in the non-DA rVM progenitors. Thus, our current data suggest that other markers might be superior for enriching for the bona fide cVM DA progenitor populations.

For rapid QC of manufactured cVM DA progenitors, we propose to survey a conjugation of two (or more) reliable protein markers in the supernatant. Our data shows that the ratio between two secreted proteins, one rVM marker and one cVM maker, can readily discriminate rVM from cVM cultures and could be applied to predict the quality of clinical-grade quality-controlled batches of DA cVM progenitors for the STEM-PD trial [44]. Both positive marker of rVM cultures, CNTN2 and CORIN, could be used to clearly discriminate the two culture supernatants, and were strongly negatively correlated with EN1 expression. CNTN2, however, unlike CORIN, did not so discriminative at a transcriptional level, required sample dilution testing, an extra procedure prior to the QC assay. The novel cVM marker FST, positively correlated with EN1 expression, and could also discriminate rVM from cVM cultures as an FST/CORIN or FST/CNTN2 ratio, even though FST on its own was unable to distinguish the two VM cultures. The surprising disagreement between the FST hits on all MS analyses and the poor discriminative power of FST ELISA results serves as a stark reminder of the need for thorough marker validation. The additional novel positive markers for cVM, LGI1 and PDGFC, were found to be of highly predictive of cVM cultures, elevated both transcriptionally and in the respective supernatants, and with high correlation to EN1 expression. The use of these markers in a ratio configuration with either CORIN or CNTN2 yielded the most stringent distinction between the two cell cultures. Altogether, our data points to LGI1/CORIN ratio as the most promising QC assay, as both proteins are particularly elevated in their respective VM region, while being very lowly present or even absent in other non-VM neural progenitor populations and in undifferentiated hESCs.

Overall, our results showed that the identification of novel cell therapy QC markers through proteomic exploration can aid in the establishment of GMP compliant assays critical for the regulatory assessment of these cell products on their way towards the clinic.

As hPSC-based cell replacement therapies for PD reach a clinical setting, it is essential to establish stringent multi-factorial QC parameters for the clinically relevant cell products, capable of provide contrast against undesired outcomes during manufacturing. Here, we presented a non-invasive, coupled ELISA assay, capable of qualifying GMP-grade DA VM progenitors during differentiation.

AGC: Automatic Gain Control; BSA: Bovine Serum Albumin; cVM: caudal Ventral Midbrain; DA: Dopaminergic; DDA: Data-Dependent Acquisition; dFB: dorsal Forebrain; DIA: Data-Independent Acquisition; dHB: dorsal Hindbrain; dMB: dorsal Midbrain; DTT: Dithiothreitol; ELISA: Enzyme-Linked Immunosorbent Assay; GPF: Gas-Phase Fractionation; GMP: Good Manufacturing Practices; hPSC: human Pluripotent Stem Cell; IAA: Iodoacetamide; IT: Ion Injection Time; LC: Liquid Chromatography; LFQ: Label-Free Quantification; MS: Mass spectrometry; MS1: Precursor Ion Spectra; MS2: Fragment Ion Spectra; NCE: Normalized Collision Energy; PD: Parkinson’s Disease; QC: Quality Control; qRT-PCR: quantitative Reverse Transcription Polymerase Chain Reaction; rVM: rostral Ventral Midbrain; SHH: Sonic Hedgehog; vFB: ventral Forebrain; vHB: ventral Hindbrain; VM: Ventral Midbrain

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and materials

The mass spectrometry proteomics data has been deposited to the ProteomeXchange Consortium via the PRIDE partner repository [45], with the dataset identifier PXD039510 (Access for reviewers before public availability: Username: [email protected], Password: Go0EI0FM)

Competing interests

The authors declare that they have no competing interests.

Funding

This study has been supported by funding from the Novo Nordisk Foundation (NNF18OC0030286), Innovation Fund Denmark (BrainStem: 4108-00008A), EU H2020 (grant no, 874758), the Knut and Alice Wallenberg Foundation, the Strong Research Environment at Lund University (Multipark), the Swedish Research Council (70862601/Bagadilico), The Crafoord Foundation, The Segerfalk Foundation, The Tore Nilsson Foundation, The Sven-Olof Janson Foundation and the Swedish Fund for Research Without Animal Experiments. The Novo Nordisk Foundation Center for Stem Cell Medicine is supported by a Novo Nordisk Foundation grant number NNF21CC0073729. The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript

Authors’ Contributions

PR, MI, TL and AK designed the research. PR, MI, CR, AS, JNW, AK carried out the laboratory experiments. PR, CR, AS, JNW, AK performed in vitro differentiations and sample collection. MI performed supernatant processing for mass spectrometry and downstream experimental processes. MI performed mass spectrometry analysis with guidance form TL and AK. PR, CR and AS performed ELISA assays. PR performed qRT-PCR, and downstream statistical analysis in combination with ELISA data. PR, MI, and AK contributed to the writing. PR and AK prepared the manuscript. All authors have read, provided feedback, and approved the final manuscript.

Acknowledgements

We thank Amalie Holm and Alison Salvador for providing supernatant samples, and Alrik Schörling for the help with the RNA samples. We also thank the Royal Free Hospital (UK) and Novo Nordisk A/S for providing supernatant samples from clinical grade STEM-PD batches.

Author details

¹ Novo Nordisk Foundation Center for Stem Cell Medicine – reNEW, University of Copenhagen, Belgdamsvej 3B, 2200 CopenhagenN , Denmark, ² Department of Neuroscience, University of Copenhagen, Belgdamsvej 3B, 2200 CopenhagenN , Denmark,³ Department of Biomedical Engineering, Ole Römer väg 1, SE-223 63, Lund Lund University, Lund, Sweden, ⁴ Department of Experimental Medical Science, Lund University, Sölvegatan 17, BMC-B11, S-221 84, Lund, Sweden, ⁵ Wallenberg Center for Molecular Medicine, Lund University, Sölvegatan 17, BMC-B11, S-221 84, Lund, Sweden

Connolly BS, Lang AE: Pharmacological treatment of Parkinson disease: a review. JAMA 2014, 311(16):1670-1683.
Barker RA, consortium T: Designing stem-cell-based dopamine cell replacement trials for Parkinson's disease. Nat Med 2019, 25(7):1045-1053.
Bjorklund A, Dunnett SB, Stenevi U, Lewis ME, Iversen SD: Reinnervation of the denervated striatum by substantia nigra transplants: functional consequences as revealed by pharmacological and sensorimotor testing. Brain Res 1980, 199(2):307-333.
Bolam JP, Freund TF, Bjorklund A, Dunnett SB, Smith AD: Synaptic input and local output of dopaminergic neurons in grafts that functionally reinnervate the host neostriatum. Exp Brain Res 1987, 68(1):131-146.
Dunnett SB, Bjorklund A, Schmidt RH, Stenevi U, Iversen SD: Intracerebral grafting of neuronal cell suspensions. V. Behavioural recovery in rats with bilateral 6-OHDA lesions following implantation of nigral cell suspensions. Acta Physiol Scand Suppl 1983, 522:39-47.
Freund TF, Bolam JP, Bjorklund A, Stenevi U, Dunnett SB, Powell JF, Smith AD: Efferent synaptic connections of grafted dopaminergic neurons reinnervating the host neostriatum: a tyrosine hydroxylase immunocytochemical study. J Neurosci 1985, 5(3):603-616.
Strecker RE, Sharp T, Brundin P, Zetterstrom T, Ungerstedt U, Bjorklund A: Autoregulation of dopamine release and metabolism by intrastriatal nigral grafts as revealed by intracerebral dialysis. Neuroscience 1987, 22(1):169-178.
Tomishima M, Kirkeby A: Bringing Advanced Therapies for Parkinson's Disease to the Clinic: The Scientist's Perspective. J Parkinsons Dis 2021, 11(s2):S135-S140.
Arenas E, Denham M, Villaescusa JC: How to make a midbrain dopaminergic neuron. Development 2015, 142(11):1918-1936.
Kee N, Volakakis N, Kirkeby A, Dahl L, Storvall H, Nolbrant S, Lahti L, Bjorklund AK, Gillberg L, Joodmardi E et al: Single-Cell Analysis Reveals a Close Relationship between Differentiating Dopamine and Subthalamic Nucleus Neuronal Lineages. Cell Stem Cell 2017, 20(1):29-40.
Kirkeby A, Nolbrant S, Tiklova K, Heuer A, Kee N, Cardoso T, Ottosson DR, Lelos MJ, Rifes P, Dunnett SB et al: Predictive Markers Guide Differentiation to Improve Graft Outcome in Clinical Translation of hESC-Based Therapy for Parkinson's Disease. Cell Stem Cell 2017, 20(1):135-148.
Nolbrant S, Heuer A, Parmar M, Kirkeby A: Generation of high-purity human ventral midbrain dopaminergic progenitors for in vitro maturation and intracerebral transplantation. Nat Protoc 2017, 12(9):1962-1979.
Kong AT, Leprevost FV, Avtonomov DM, Mellacheruvu D, Nesvizhskii AI: MSFragger: ultrafast and comprehensive peptide identification in mass spectrometry-based proteomics. Nat Methods 2017, 14(5):513-520.
Kall L, Canterbury JD, Weston J, Noble WS, MacCoss MJ: Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat Methods 2007, 4(11):923-925.
Nesvizhskii AI, Keller A, Kolker E, Aebersold R: A statistical model for identifying proteins by tandem mass spectrometry. Anal Chem 2003, 75(17):4646-4658.
da Veiga Leprevost F, Haynes SE, Avtonomov DM, Chang HY, Shanmugam AK, Mellacheruvu D, Kong AT, Nesvizhskii AI: Philosopher: a versatile toolkit for shotgun proteomics data analysis. Nat Methods 2020, 17(9):869-870.
Demichev V, Messner CB, Vernardis SI, Lilley KS, Ralser M: DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat Methods 2020, 17(1):41-44.
Cox J, Mann M: MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol 2008, 26(12):1367-1372.
Cox J, Neuhauser N, Michalski A, Scheltema RA, Olsen JV, Mann M: Andromeda: a peptide search engine integrated into the MaxQuant environment. J Proteome Res 2011, 10(4):1794-1805.
Tyanova S, Temu T, Cox J: The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat Protoc 2016, 11(12):2301-2319.
Cox J, Hein MY, Luber CA, Paron I, Nagaraj N, Mann M: Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol Cell Proteomics 2014, 13(9):2513-2526.
Lazar C, Gatto L, Ferro M, Bruley C, Burger T: Accounting for the Multiple Natures of Missing Values in Label-Free Quantitative Proteomics Data Sets to Compare Imputation Strategies. J Proteome Res 2016, 15(4):1116-1125.
Zhu Y, Orre LM, Zhou Tran Y, Mermelekas G, Johansson HJ, Malyutina A, Anders S, Lehtio J: DEqMS: A Method for Accurate Variance Estimation in Differential Protein Expression Analysis. Mol Cell Proteomics 2020, 19(6):1047-1057.
Yu G, Wang LG, Han Y, He QY: clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 2012, 16(5):284-287.
Benjamini Y, Hochberg Y: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological) 1995, 57(1):289-300.
Swedish Medical Products Agency grants approval for clinical study of new stem cell based Parkinson’s Disease treatment [https://www.lunduniversity.lu.se/article/swedish-medical-products-agency-grants-approval-clinical-study-new-stem-cell-based-parkinsons]
Cruz L, Romero JAA, Iglesia RP, Lopes MH: Extracellular Vesicles: Decoding a New Language for Cellular Communication in Early Embryonic Development. Front Cell Dev Biol 2018, 6:94.
Rosa-Fernandes L, Rocha VB, Carregari VC, Urbani A, Palmisano G: A Perspective on Extracellular Vesicles Proteomics. Front Chem 2017, 5:102.
Pocsfalvi G, Stanly C, Vilasi A, Fiume I, Capasso G, Turiak L, Buzas EI, Vekey K: Mass spectrometry of extracellular vesicles. Mass Spectrom Rev 2016, 35(1):3-21.
Burton JB, Carruthers NJ, Stemmer PM: Enriching extracellular vesicles for mass spectrometry. Mass Spectrom Rev 2021.
Jalaludin I, Lubman DM, Kim J: A guide to mass spectrometric analysis of extracellular vesicle proteins for biomarker discovery. Mass Spectrom Rev 2021:e21749.
Thery C, Witwer KW, Aikawa E, Alcaraz MJ, Anderson JD, Andriantsitohaina R, Antoniou A, Arab T, Archer F, Atkin-Smith GK et al: Minimal information for studies of extracellular vesicles 2018 (MISEV2018): a position statement of the International Society for Extracellular Vesicles and update of the MISEV2014 guidelines. J Extracell Vesicles 2018, 7(1):1535750.
Kriks S, Shim JW, Piao J, Ganat YM, Wakeman DR, Xie Z, Carrillo-Reid L, Auyeung G, Antonacci C, Buch A et al: Dopamine neurons derived from human ES cells efficiently engraft in animal models of Parkinson's disease. Nature 2011, 480(7378):547-551.
Rifes P, Isaksson M, Rathore GS, Aldrin-Kirk P, Moller OK, Barzaghi G, Lee J, Egerod KL, Rausch DM, Parmar M et al: Modeling neural tube development by differentiation of human embryonic stem cells in a microfluidic WNT gradient. Nat Biotechnol 2020.
Kim TW, Piao J, Koo SY, Kriks S, Chung SY, Betel D, Socci ND, Choi SJ, Zabierowski S, Dubose BN et al: Biphasic Activation of WNT Signaling Facilitates the Derivation of Midbrain Dopamine Neurons from hESCs for Translational Use. Cell Stem Cell 2021, 28(2):343-355 e345.
Aebersold R, Mann M: Mass spectrometry-based proteomics. Nature 2003, 422(6928):198-207.
Sobsey CA, Ibrahim S, Richard VR, Gaspar V, Mitsa G, Lacasse V, Zahedi RP, Batist G, Borchers CH: Targeted and Untargeted Proteomics Approaches in Biomarker Development. Proteomics 2020, 20(9):e1900029.
Yates JR, 3rd: The revolution and evolution of shotgun proteomics for large-scale proteome analysis. J Am Chem Soc 2013, 135(5):1629-1640.
Andersson ER, Salto C, Villaescusa JC, Cajanek L, Yang S, Bryjova L, Nagy, II, Vainio SJ, Ramirez C, Bryja V et al: Wnt5a cooperates with canonical Wnts to generate midbrain dopaminergic neurons in vivo and in stem cells. Proc Natl Acad Sci U S A 2013, 110(7):E602-610.
Ono Y, Nakatani T, Sakamoto Y, Mizuhara E, Minaki Y, Kumai M, Hamaguchi A, Nishimura M, Inoue Y, Hayashi H et al: Differences in neurogenic potential in floor plate cells along an anteroposterior location: midbrain dopaminergic neurons originate from mesencephalic floor plate cells. Development 2007, 134(17):3213-3225.
Doi D, Samata B, Katsukawa M, Kikuchi T, Morizane A, Ono Y, Sekiguchi K, Nakagawa M, Parmar M, Takahashi J: Isolation of human induced pluripotent stem cell-derived dopaminergic progenitors by cell sorting for successful transplantation. Stem Cell Reports 2014, 2(3):337-350.
Takahashi J: iPS cell-based therapy for Parkinson's disease: A Kyoto trial. Regen Ther 2020, 13:18-22.
Fathi A, Mirzaei M, Dolatyar B, Sharifitabar M, Bayat M, Shahbazi E, Lee J, Javan M, Zhang SC, Gupta V et al: Discovery of Novel Cell Surface Markers for Purification of Embryonic Dopamine Progenitors for Transplantation in Parkinson's Disease Animal Models. Mol Cell Proteomics 2018, 17(9):1670-1684.
Kirkeby A, Parmar M, Barker RA: Strategies for bringing stem cell-derived dopamine neurons to the clinic: A European approach (STEM-PD). Prog Brain Res 2017, 230:165-190.
Perez-Riverol Y, Bai J, Bandla C, Garcia-Seisdedos D, Hewapathirana S, Kamatchinathan S, Kundu DJ, Prakash A, Frericks-Zipper A, Eisenacher M et al: The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res 2022, 50(D1):D543-D552.

Download PDF

Journal Publication

published 10 Dec, 2023

Read the published version in Stem Cell Research & Therapy →

Reviewers agreed at journal
13 May, 2023
Reviewers invited by journal
06 May, 2023
Editor assigned by journal
24 Apr, 2023
First submitted to journal
21 Apr, 2023
Editorial decision: Major Revision
12 Apr, 2023

You are reading this latest preprint version

Identifying secreted biomarkers of dopaminergic ventral midbrain progenitor cells

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Materials and Methods

Regionalized neural differentiation of hESCs

mRNA extraction and qRT-PCR

Sample preparation for whole supernatant (Global Secretome) for MS

Preparation of vesicle-enriched samples for MS

Data-dependent acquisition MS runs (DDA)

Data-independent MS acquisition (DIA)

DDA-based spectral library generation

DIA-based spectral library generation

Super spectral library generation

Data analysis of global DDA runs

Data analysis of DIA runs

ELISA

Statistical Analysis of ELISA and qRT-PCR data

Statistical Analysis of DDA and DIA analyses

GO-term enrichment analysis

Results

Discussion

Conclusions

Abbreviations

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 1