Molecular cloning
DNA fragments encoding ASCC3HR (wt, D611A, D1453A or D611A-D1453A) or ASCC3NC were cloned into a pFL vector for expression as N-terminally His10-tagged, TEV-cleavable proteins via recombinant baculoviruses in insect cells as described previously.20 A DNA fragment encoding full-length (FL) ASC1 was PCR-amplified from a synthetic gene (IDT; Supplementary Table 3) and inserted into the pETM-11 or pIDS vectors (EMBL, Heidelberg) for expression as an N-terminally His6-tagged, TEV-cleavable protein. See Supplementary Table 4 for PCR primers used. The pIDS-asc1FL construct was Cre-recombined with pFL-ascc3HR for co-expression via a recombinant baculovirus in insect cells. DNA fragments encoding ASC11–80, ASC11–230, ASC1152–230, ASC1152–581, ASC1281–403, ASC-1281–581 or ASC-1403–581 were amplified via PCR from the pETM-11-asc1FL, and re-cloned into the pETM-11 vector. A DNA fragment encoding full-length ALKBH3 was PCR-amplified from a cDNA library of human HeLa cells and inserted into the pETM-11 vector for expression as an N-terminally His6-tagged, TEV-cleavable protein. All constructs were verified by Sanger sequencing.
For the preparation of an ASC1 sgRNA vector, we followed a previously established method40, cloning the target sequence into the pLenti-CRISPRV2 vector41. Primers used for generating the DNA fragment containing the target sequence is shown in Supplementary Table 4.
For expression of HA-tagged ASC1 variants, DNA fragments encoding wt or ΔZnF ASC1 were cloned into pENTR-3C using a synthetic gene (IDT; Supplementary Table 3). Vectors encoding HA-tagged variants ASC1L174A − L180A−I190A or ASC1C171A − C184A were created using the In-Fusion Snap Assembly mutagenesis kit (Takara Bio #683945). Each construct was then cloned into pHAGE-HA-Blast vector14 via Gateway recombination. All constructs were verified by Sanger sequencing.
Generation of cell lines
Stably transfected Flp-In™ T-REx™ 293 cell lines for the tetracycline-inducible expression of ASC1 variants with N-terminal 2xFlag-His6 or C-terminal His6-2xFlag tags were generated according to the manufacturer’s guidelines.20 Transfection of the parental cell line was done using X-tremeGENE HP DNA Transfection Reagent (Sigma Aldrich). After hygromycin-based selection of cells that had genomically integrated the expression cassette, tetracycline-induced expression of the tagged proteins was confirmed by western blotting using a monoclonal α-Flag M2 antibody (Sigma Aldrich #F3165; 1:7500). For expression of HA-tagged ASC1 variants, the pHAGE-HA-ASC1 vectors encodin HA-tagged ASC1wt, ASC1ΔZnF, ASC1L174A − L180A−I190A or ASC1C171A − C184A, were transfected into 293T cells using Transit293 transfection reagent (Mirus Bio).
CRISPR/Cas9-based genome editing
The ASC1 sgRNA expression vector was transfected into the Lenti-X 293T cell line (Takara Bio) together with psPAX2 and pCMV-VSVG (Addgene) for lentivirus production. The virus-containing culture medium was collected 72 h post-transfection. Human PC-3 cells were infected with the viral medium and individual clones were selected in 96-well plates. The single KO colonies were analyzed by western blot using an α-ASC1 antibody (sc-376916, Santa Cruz).
Recombinant protein production and purification
ASCC3HR variants and ASCC3NC were produced in High Five cells as described previously.20 Cell pellets were re-suspended in 20 mM HEPES-NaOH, pH 7.5, 500 mM NaCl, 10 mM imidazole, 1 mM DTT, 8.6% (v/v) glycerol (lysis buffer 1), supplemented with cOmplete™ protease inhibitors (Roche) and lysed by sonication using a Sonopuls Ultrasonic Homogenizer HD (Bandelin). The lysate was cleared by centrifugation and filtration. The protein of interest (POI) was captured on Ni2+-NTA resin in a gravity flow column, washed with lysis buffer 1 and eluted with lysis buffer 1 containing 400 mM imidazole. Fractions enriched for the POI were supplemented with 1/10 (w/w) TEV protease and dialyzed against 20 mM HEPES-NaOH, pH 7.5, 500 mM NaCl, 1 mM DTT, 8.6% (v/v) glycerol (dialysis buffer) overnight. The sample was then diluted to 100 mM NaCl and loaded onto a HiTrap Heparin HP column (Cytiva), pre-equilibrated with lysis buffer 1 containing 100 mM NaCl. After washing with lysis buffer 1 containing 100 mM NaCl, the POI was eluted with a linear gradient to lysis buffer 1 containing 1.5 M NaCl. The fractions containing the POI were pooled and concentrated with a centrifugal concentrator (100 kDa molecular mass cut-off). The concentrated sample was further purified by SEC on a Superdex 200 10/300 GL column (Cytiva) in 20 mM HEPES-NaOH, pH 7.5, 250 mM NaCl, 5% (v/v) glycerol, 1 mM DTT (SEC buffer). Fractions containing the POI were combined, concentrated, aliquoted, flash-frozen in liquid nitrogen and stored at -80°C.
For preparation of the ASCC3HR-ASC1FL complex, ASC1FL was co-produced with ASCC3HR in High Five cells. Cell pellets were re-suspended in lysis buffer 1 supplemented with cOmplete™ protease inhibitors, 1 mM DTT and 20 mM imidazole. The samples were lysed by sonication, then the suspension was centrifuged at 56,000 x g for 1 h, the soluble extract was further filtered through 0.8 µM pore size membrane filters (Millipore). The filtered fractions were collected and incubated with Ni2+-NTA resin pre-equilibrate with lysis buffer 1 for 2 h with gentle rotation at 4°C. POI-bound resin was loaded on a gravity flow column, washed with lysis buffer 1 and the POI was eluted with lysis buffer 1 containing 400 mM imidazole. To remove the His6/10-tags, 1/10 (w/w) of TEV protease was added and the sample and dialyzed against dialysis buffer overnight. Subsequently, the sample was diluted to 50 mM NaCl and loaded on a 5 ml StrepTrap HP column (Cytiva) pre-equilibrated with lysis buffer 1 containing 50 mM NaCl. After washing with lysis buffer 1 containing 50 mM NaCl, the POI was eluted in a linear gradient to lysis buffer 1 containing 1.5 M NaCl. Fractions containing the POI were combined, diluted to 50 mM NaCl, loaded on a 5 ml HiTrap Heparin HP column, washed and eluted in a linear gradient with lysis buffer 1 containing 1.5 M NaCl. Fractions containing the POI were pooled, concentrated and further purified by SEC on a Superdex 200 10/600 GL column (Cytiva) in 20 mM HEPES-NaOH, pH 7.5, 300 mM NaCl, 1 mM DTT. Fractions containing the POI were combined, concentrated, aliquoted, flash-frozen in liquid nitrogen and stored at -80°C.
For production of isolated ASC1 variants, the corresponding pETM-11 vectors were transformed into Escherichia coli BL21 (DE3) cells by electroporation for protein production via auto-induction at 18°C.42 Cells were harvested when cultures reached an optical density (600 nm) of 10. Cell pellets were re-suspended in lysis buffer 1 and supplemented with cOmplete™ protease inhibitors. After sonication, the lysate was cleared by centrifugation. The POI was captured on Ni2+-NTA resin in a gravity flow column, washed with lysis buffer 1 and eluted with lysis buffer 1 containing 400 mM imidazole. Fractions enriched for the POI were supplemented with 1/10 (w/w) TEV protease and dialyzed against dialysis buffer overnight. Dialyzed samples were passed through a Ni2+-NTA gravity flow column to remove the cleaved His6-tag and TEV. For ASC1FL, ASC1152–581, ASC1281–581 and ASC1403–581 fragments, the samples were diluted to 100 mM NaCl, loaded on a HiTrap Heparin HP column, washed and eluted in a linear gradient to lysis buffer 1 containing 1.5 M NaCl. Fractions containing the POI were combined, concentrated and further purified on a Superdex 200 16/600 GL column in SEC buffer.
For purification of the ASC11–80, ASC11–230, ASC1152–230 and ASC1281–403 fragments, the Heparin column step was omitted and the final gel filtration was conducted in 20 mM HEPES-NaOH, pH 7.5, 150 mM NaCl, 1 mM DTT on a HiLoad 16/60 Superdex 75 pg column (Cytiva).
For production of ALKBH3, the corresponding pETM-11 vector was transformed into E. coli C2566 cells by electroporation for protein production via IPTG induction at 37°C. Cell pellets were re-suspended in 20 mM TRIS-HCl, pH 7.5, 500 mM NaCl, 10 mM imidazole, 1 mM DTT, 0.1 mM PMSF (lysis buffer 2), and lysed by sonication. The lysate was cleared by centrifugation. The supernatant was loaded onto a Ni2+-NTA column, washed with lysis buffer 2 and the POI was eluted with a linear gradient to lysis buffer 2 containing 400 mM imidazole. Fractions enriched for the POI were combined, supplemented with 1/20 (w/w) TEV protease and dialyzed against dialysis buffer overnight. The sample was then diluted to 100 mM NaCl and loaded onto a HiTrap Heparin HP 5 ml column (Cytiva), pre-equilibrated with dialysis buffer containing 100 mM NaCl. After washing with dialysis buffer containing 100 mM NaCl, the POI was eluted with a linear gradient to dialysis buffer containing 1.5 M NaCl. The fractions containing the POI were pooled and concentrated with a centrifugal concentrator (10 kDa molecular mass cut-off). The concentrated sample was further purified by SEC on a Superdex 75 10/60 GL column (Cytiva) in 20 mM TRIS-HCl, pH 7.5, 250 mM NaCl, 1 mM DTT. Fractions containing the POI were combined, concentrated, aliquoted, flash-frozen in liquid nitrogen and stored at -80°C.
Analytical size exclusion chromatography
Analytical SEC-based interaction tests were conducted in 20 mM HEPES-NaOH, pH 7.5, 250 mM NaCl, 5% (v/v) glycerol, 1 mM DTT. 100 pmol of ASCC3HR were mixed with other proteins in a two to ten-fold molar excess in a final reaction volume of 80 µl. After incubation of the mixtures on ice for 30 min, the samples were loaded on a Superdex 200 3.2/300 analytical size exclusion column (Cytiva). 50 µl fractions were collected and subjected to SDS-PAGE analysis. Protein bands were visualized by Coomassie staining except for gels containing ASC11–80 or ASC1152–230, which were imaged by silver staining.
For testing competitive binding of ASC1 and ALKBH3 to ASCC3HR, 120 pmol of ASCC3HR (or of pre-formed ASCC3HR-ASC1 complex) were mixed with 360 pmol each of ASC1 and ALKBH3 (or of ALKBH3) in a volume of 100 µl. After 30 min of incubation on ice, the samples were loaded on a Superdex 200 3.2/300 analytical size exclusion column. 50 µl fractions were collected and subjected to SDS-PAGE analysis. The proteins were visualized by Coomassie staining.
DNA unwinding assays
DNA duplex unwinding activity was assessed in fluorescence-based stopped-flow experiments on a SX-20MV spectrometer (Applied Photophysics).26,27 The DNA substrate contained a 12-base pair duplex region and a 31-nucleotide 3’-ss overhangs, with an Alexa 488 fluorophore on the short strand and an Atto 540 Q quencher on the complementary strand ([(Atto 540 Q]5’-GGCCGCGAGCCGGAAATTTAATTATAAACCAGACCGTCTCCTC-3’; 5’-CGGCTCGCGGCC-3’[Alexa 488]; duplex region in bold). Reactions were carried out in 40 mM HEPES-NaOH, pH 7.5, 80 mM NaCl, 0.5 mM MgCl2 at 30°C. 250 nM protein or protein complex were pre-incubated with 50 nM DNA duplex for 5 min. 60 µl of the protein-DNA mixture were rapidly mixed with 60 µl of 4 mM ATP/MgCl2, and the excited Alexa 488 fluorescence signal was recorded for 20 min using a 495 nm cutoff filter (KV 495, Schott). For each experiment, at least two individual traces were averaged, baseline-corrected by the fluorescence immediately after addition of ATP and normalized to the baseline-corrected maximum fluorescence. Data for ASCC3HR,D611A-based unwinding had been reported previously20 and are reproduced here to facilitate direct comparison. Data were plotted using GraphPad Prism 6.0 and fitted to a double exponential equation (fraction unwound = Afast*(1 – exp(–kfast * t)) + Aslow * (1 – exp(–kslow * t)); A, total unwinding amplitude; k, unwinding rate constants [s− 1]; t, time [s]).25 Amplitude-weighted unwinding rate constants were calculated as kuaw = (Afast * kfast2 + Aslow * kslow2) / (Afast * kfast + Aslow * kslow).25
ATPase assays
Thin layer chromatography (TLC)-based ATPase assays were performed using [α-32P]ATP (Hartmann Analytic).26,27 To quantify DNA-stimulated ATPase activity, 0.5 µM protein or protein complex were combined with 1 mM of a 43-nt ssDNA (5’-GGCCGCGAGCCGGAAATTTAATTATAAACCAGACCGTCTCCTC-3’). 0.5 µM protein or protein complex or equivalent protein-DNA mixtures were incubated with 1 mM [α-32P]ATP in 50 mM HEPES-NaOH, pH 7.5, 80 mM NaCl, 5 mM MgCl2, 2 mM DTT at 30°C for up to 60 min. 5 µl of sample were withdrawn at selected time points and reactions were quenched with 5 µl of 100 mM EDTA. 0.8 µl of the samples were spotted on a PEI-cellulose TLC plate and chromatographed with 1 M acetic acid, 0.5 M LiCl, 20% (v/v) ethanol. The corresponding ADP and ATP spots were visualized using a Storm 860 phosphorimager (GMI, USA) and quantified using ImageQuant software (version 5.2; Cytiva). Data were plotted and analyzed using Prism software (Graphpad, version 5), the ATPase activity was calculated as the number of hydrolyzed ATP molecules per protein molecule per minute, by fitting quantified data to the equation V = (Afast * Vfast2 + Aslow * Vslow2) / (Afast * Vfast + Aslow * Vslow); Afast and Aslow, amplitudes of ATP hydrolyzed in the rapid and slow phase, respectively; Vfast and Vslow, rates of the rapid and slow hydrolysis phases [min− 1]; V, ATP hydrolyzed as a function of time [min− 1].
Fluorescence microscopy
The sub-cellular localizations of the Flag/His-tagged versions of ASC1 were determined by immuno-fluorescence.43 293 cell lines expressing Flag-tagged ASC1 variants were grown on coverslips and fixed using 4% (v/v) paraformaldehyde for 20 min before permeabilization using 0.1% (v/v) Triton-X-100 in PBS for 20 min. Cells were blocked using PBS supplemented with 10% (v/v) fetal bovine serum (FBS) and 0.1% (v/v) Triton-X-100 for 1 h, then treated for 2 h with an FITC-conjugated α-Flag M2 antibody (Sigma Aldrich F4049; 1:200) diluted in PBS containing 10% FBS and 0.1% Triton-X-100. Cells were washed, and coverslips were mounted using mounting media containing DAPI. Cells were imaged using a Nikon Ti2 2-E inverted microscope.
Immuno-precipitation and western blotting
293 cells expressing N- or C-terminally Flag/His-tagged versions of full-length or truncated ASC1 or the Flag tag were lysed by sonication in IP buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, 0.5 mM EDTA, 0.1% (v/v) Triton-X-100, 10% (v/v) glycerol and cOmplete™ protease inhibitors. Lysates were cleared of debris by centrifugation at 20,000 x g for 10 min, then the cleared lysates were incubated with α-Flag M2 magnetic beads (Sigma-Aldrich #M8823) for 2 h. The matrix was washed five times with IP buffer and complexes were eluted using 3xFlag peptide (Sigma Aldrich #SAE0194). Proteins were precipitated using 20% (w/v) trichloroacetic acid (TCA) and separated by SDS-PAGE. Western blotting was performed using antibodies against the Flag tag (Sigma-Aldrich F3165; 1:7500), ASCC1 (Proteintech #12301-1-AP; 1:500), ASCC2 (Proteintech #11529-1-AP; 1:1000) and ASCC3 (Proteintech #17627-1-AP; 1:1000).
For immuno-precipitation of HA-tagged ASC1 variants (ASC1wt, ASC1ΔZnF, ASC1L174A − L180A−I190A or ASC1C171A − C184A), the transfected 293T cells were resuspended in ice cold, high salt co-IP buffer (50 mM Tris-HCl, pH 7.9, 300 mM KCl, 10% [v/v] glycerol, 1% [w/v] Triton X-100, 1 mM DTT) supplemented with protease inhibitors. The cells were then lysed by sonication and allowed to rotate at -4°C to complete lysis. Lysates were cleared by centrifugation and diluted to 150 mM KCl using co-IP buffer without KCl. Anti-HA beads (Santa Cruz Biotechnology, sc-7392 AC) were then added to the samples, and after incubation at 4°C for 3.5 h, the beads were centrifuged and washed multiple times with 150 mM KCl co-IP buffer. Bound proteins were eluted with SDS-PAGE loading buffer and boiled before analysis via SDS-PAGE/western blot using antibodies against the HA-tag (Abcam EPR22819-101, 1:4000) and ASCC3 as described previously13.
MMS sensitivity assays
The wt and ASC1 KO PC-3 cells were plated on a 96-well plate with 3,500 cells per well. Cells were exposed to media containing variable concentrations of MMS for 24 h at 37°C. Then, cells were recovered with fresh culture medium for an additional 48 h at 37°C. Cell viability was measured by using the MTS assay (Promega).
Cryogenic electron microscopy
The ASCC3HR-ASC1 complex was prepared freshly in buffer 20 mM HEPES-NaOH, pH 7.5, 300 mM NaCl, 1 mM DTT, and concentrated to 4.15 mg/ml using a 50k ultra centrifugal filter (Merck). The sample was supplemented with 0.01% (w/v) n-dodecyl β-maltoside promptly before vitrification. 3.8 µl of the sample were applied to glow-discharged holey carbon R1.2/1.3 copper grids (Quantifoil Microtools, Germany) and plunge-frozen in liquid ethane using a Vitrobot Mark IV (Thermo Fisher) equilibrated at 10°C and 100% humidity.
Data acquisition was conducted on a FEI Titan Krios G3i TEM operated at 300 kV equipped with a Falcon 3EC detector. Movies were taken for 40.57 s accumulating a total electron flux of ~ 40 el/Å2 in counting mode at a calibrated pixel size of 0.832 Å/px distributed over 33 fractions.
CryoEM data analysis
All image analysis steps were done with cryoSPARC (version 3.2.2)44. Movie alignment was done with patch motion correction generating Fourier-cropped micrographs (pixel size 1.664 Å/px), CTF estimation was conducted by Patch CTF. Class averages of manually selected particle images were used to generate an initial template for reference-based particle picking from 6,022 micrographs. 2,818,857 particle images were extracted with a box size of 160 px and Fourier-cropped to 80 px for initial analysis. Reference-free 2D classification was used to select 1,590,881 particle images for further analysis. Ab initio reconstruction using a small subset of particles was conducted to generate an initial 3D reference for consecutive iterations of 3D heterogeneous refinement. 597,971 particle images were re-extracted with a box of 160 px and subjected to non-uniform refinement followed by CTF refinement. Another heterogeneous refinement round was applied to select 473,863 particle images for re-extraction at full spatial resolution after local motion correction (box size 320 px, 0.832 Å/px). A final heterogeneous refinement run was conducted to select 244,064 particle images for non-uniform refinement and generate the final reconstruction at a global resolution of 3.4 Å, locally extending down to 2.5 Å.
Model building, refinement and analysis
AlphaFold-predicted models24 of ASCC3HR and of regions of ASC1 were manually placed in the cryoEM reconstruction and adjusted by rigid body fitting and segmental real-space refinement using Coot (version 0.8.9.1)45. The model was refined by iterative rounds of real space refinement in PHENIX (version 1.17.1)46 and manual adjustment in Coot. Manual adjustments also took advantage of locally refined, focused cryoEM reconstructions. The structural model was evaluated with Molprobity (version 4.5.1)47. Interface areas were analyzed via the PISA server (version 1.52)48. Structure figures were prepared using ChimeraX (version 1.4)49 and PyMOL (version 1.8; Schrödinger, LLC).
DNA-protein cross-linking/mass spectrometry
UV cross-linking was employed to generated zero length cross-links between protein and bound ssDNA oligos (T12, T24, T36, T48). DNA oligos were 5’-end labeled using [γ-32P]ATP and T4 polynucleotide kinase using a standard protocol. 10 µl reaction mixtures containing 100 nM (“1” in Fig. 5b) or 200 nM (“2” in Fig. 5b) protein or protein complex and 4.3 nM radio-labeled DNA probe were incubated in a 72-well microbatch plate (Greiner) in 50 mM HEPES-NaOH, pH 7.5, 80 mM NaCl, 5 mM MgCl2, 2 mM DTT on ice for 5 min, then the samples were exposed to 254 nm UV irradiation for 10 min (Ultra-violet cross-linker, Amersham Life Science). Cross-linked samples were separated by SDS-PAGE and visualized by autoradiography using a Storm 860 phosphorimager.
For identifying cross-linked peptides and residues, 6.7 nM unlabeled T48 ssDNA were cross-linked to 200 nM ASCC3HR or ASCC3HR-ASC1 in 48 x 10 µl reactions as above and ethanol precipitated. Subsequent analyses were conducted in duplicates. The pellets were dissolved in 50 µl 4 M urea and diluted to 1 M Urea with 50 mM Tris-HCl, pH 7.5. To digest the DNA, 1 µl Universal nuclease (Pierce) and 1 µl Nuclease P1 (New England Biolabs) were added to the samples, followed by incubation at 37°C for 3 h. Protein digestion was performed with 1 µg of trypsin (Promega) overnight at 37°C. The samples were acidified with formic acid (FA; final concentration 0.1% [v/v]), and acetonitrile (ACN) was added to 5% (v/v) final concentration. Non cross-linked nucleotides were depleted by C18 reversed-phase chromatography with Harvard Apparatus MicroSpin columns. Sample was eluted by stepwise application of 50% (v/v) and 80% (v/v) ACN. Cross-linked peptides were enriched over linear peptides by TiO2 self-packed tip columns with 5% (v/v) glycerol as a competitor as described previously50. The samples were dried under vacuum and resuspended in 10 to 15 µl of 2% (v/v) ACN, 0.05% (v/v) trifluoroacetic acid. 7 or 8 µl (first or second analysis) were used for LC-MS analysis.
Chromatographic separation was achieved with Dionex Ultimate 3000 UHPLC (Thermo Fischer Scientific) coupled with a C18 column packed in-house (ReproSil-Pur 120 C18-AQ, 1.9/3 µm pore size, 75 µm inner diameter, 30 cm length, Dr. Maisch GmbH). The flow rate was set to 300 nl/min, and a 44 min linear gradient was formed with mobile phase A (0.1% [v/v] FA) and B (80% [v/v] ACN, 0.08% [v/v] FA) from 8% or 10% (first or second analysis) to 45% mobile phase B. Data acquisition of eluting peptides was performed with Orbitrap Exploris 480 (Thermo Fischer Scientific). The resolution for survey scans was set to 120,000, the maximum injection time to 60 ms, the automatic gain control target to 100% or 250% (first or second analysis) and the dynamic exclusion to 9 s. Analytes selected for fragmentation were isolated with a 1.6 m/z window and fragmented with a normalized collision energy of 28. MS/MS spectra were acquired with a resolution of 30,000, a maximum injection time of 120 ms and an automatic gain control target of 100%.
Cross-link data analysis of the resulting raw files was performed with the OpenNuXL node of OpenMS (version 3.0.0)51. Default general settings were used and the preset DNA-UV Extended was selected. The sequences of the proteins in the sample were provided as a database. The maximum length of DNA adducts was set to 3 and poly-T was used as sequence. The resulting .idxml files were used for annotation, and spectra were manually validated.
Data availability
The cryoEM reconstruction of the ASCC3HR-ASC1 complex has been deposited in the Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb) under accession code EMD-15521 (https://www.ebi.ac.uk/pdbe/entry/emdb/EMD-15521). Structure coordinates have been deposited in the RCSB Protein Data Bank (https://www.rcsb.org) with accession code 8ALZ (https://www.rcsb.org/structure/8ALZ).52 The DNA-protein CLMS data have been deposited in the ProteomeXchange Consortium (http://www.proteomexchange.org) via the PRIDE53 partner repository (https://www.ebi.ac.uk/pride/) under dataset identifier PXD036106 (https://www.ebi.ac.uk/pride/archive/projects/PXD036106). All other data are contained in the manuscript or the Supplementary Information. Source data are provided with this paper.