General Chemical synthesis. Reagents were obtained from Sigma-Aldrich or Fisher Scientific. 1H-NMR was recorded on a Bruker DPX spectrometer at 400 MHz. Chemical shifts are reported as parts per million (ppm) downfield from an internal tetramethylsilane standard or solvent references. High-resolution mass spectra were acquired on a ThermoElectron LTQ-Orbitrap Discovery high resolution mass spectrometer with a dedicated Accela HPLC system by Andrea DeBarber at the Bioanalytical MS facility, Portland State University. For air- and water-sensitive reactions, glassware was oven-dried prior to use and reactions were performed under argon. Dichloromethane, N,N-dimethylformamide, and tetrahydrofuran were dried using a solvent purification system manufactured by Glass Contour, Inc. (Laguna Beach, CA). All other solvents were of ACS chemical grade (Fisher Scientific) and used without further purification unless otherwise indicated. Analytical thin-layer chromatography was performed with silica gel 60 F254 glass plates (SiliCycle). Flash column chromatography was conducted with pre-packed normal or reversed phase columns (Biotage). High performance liquid chromatography (HPLC) was performed on an Agilent 1260 Infinity system with a flow rate of 1.0 mL/min using a Sunfire C18-A column 150 x 4.6 mm, 5 micron analytical column or a Sunfire 30 x 50 mm, 5 micron preparative column. HPLC analytical conditions: mobile phase (MP) A: 0.1% formic acid in water, B: 0.1% formic acid in acetonitrile (ACN), flow rate = 1.0 mL/min, gradient: 0% B for 2 min, 0-100% B over 13 min, 100% B for 2 min, UV-Vis detection at λ1 = 254 nm and λ2 = 220 nm. All final products were ≥95% purity as assessed by this method. Retention time (tR) and purity refer to UV detection at 220 nm. Preparative HPLC conditions: mobile phase (MP) A: 0.1% formic acid in water, B: 0.1% formic acid in acetonitrile (ACN), flow rate = 10.0 mL/min, gradient A: 0-30% B over 7 min, 30-50% B over 2 min, 100% B for 1 min; gradient B: 30% B for 6 min, 30-50% B over 8 min, 100% B for 4 min, UV-Vis detection at λ1 = 254 nm and λ2 = 220 nm.
6-(((2S, 3S, 4R)-2,3,4,5-Tetrahydroxypentyl)amino)pyrimidine-2,4-(1H,3H)-dione (2a). Commercially available ribitylamine[36] (150 mg, 1.0 mmol) and chlorouracil (42 mg, 0.28 mmol) were dissolved in 15 mL water in a 20 mL microwave vial and microwaved at 150°C for 2 h. After the reaction was complete, the desired ribitylaminouracil product (24 mg, 0.092 mmol, 33% yield) was isolated by preparative HPLC using gradient A. 1H-NMR was consistent with reference spectra [2].
6-(((3S,4R)-3,4,5-Trihydroxypentyl)amino)pyrimidine-2,4-(1H,3H)-dione (2b). Commercially available 1-amino-1,2-dideoxy D-erythro-pentitol [37] (135 mg, 1 mmol) and chlorouracil (42 mg, 0.28 mmol) were dissolved in 15 mL water in a 20 mL microwave vial and microwaved at 150°C for 2 h. After the reaction was complete, the desired aminouracil product (26 mg, 0.11 mmol, 39% yield) was isolated by preparative HPLC using gradient A and used without further purification.
6,7-Dimethyl-8-D-ribityldeazalumazine (DZ). Ribitylaminouracil 2a (26 mg, 0.10 mmol) and the sodium salt of commercially available 2-methyl butan-3-one-ol [38] (26 mg, 0.20 mmol) were refluxed in 0.5 M HCl (1.0 mL) at 100°C for 2 h. After the reaction was complete, DZ (8.0 mg, 0.025 mmol, 25 % yield) was isolated by preparative HPLC using gradient B. 1H-NMR was consistent with reference spectra [20]. 1H-NMR (400 MHz, D2O) δ 2.35 (s, 3H), 2.74 (s, 3H), 3.64 (m, 2H), 3.68 (m, 2H), 3.84 (m, 2H), 4.31 (m, 2H), 4.92 (m, 2H), 8.40(s, 1H); HRMS (ESI-TOF) Calculated for C14H20N3O6 [M +H+] 326.1352, Found 326.1347.
6,7-Dimethyl-8-D-(2’-deoxyribityl)deazalumazine (2’-deoxy-DZ). Compound 2b (26 mg, 0.10 mmol) and the commercially available sodium salt of 2-methyl butan-3-one-ol (26 mg, 0.20 mmol) were refluxed in 0.5 M HCl (1.0 mL) at 100°C for 2 h. After the reaction was complete, 2’-deoxy-DZ (8.0 mg, 0.025 mmol, 25 % yield) was isolated by preparative HPLC using gradient B. 1H-NMR (400 MHz, D2O/CD3OD) δ 1.93 (m, 2H), 2.19 (m, 2H), 2.41 (s, 3H), 2.74 (s, 3H), 3.34- 3.74 (m, 5H), 8.31(s, 1H); HRMS (ESI-TOF) Calculated for C14H20N3O5 [M +H+] 310.1397, Found 310.1347.
3-(2,4,7-Trioxo-8-((2S,3S,4R)-2,3,4,5-tetrahydroxypentyl)-1,2,3,4,7,8-hexahydropyrido[2,3-d]pyrimidin-6-yl)propanoic acid (DZPLI). Ribitylaminouracil 2a (26 mg, 0.1 mmol) and commercially available 1,5-diethyl-2-formylpentanedioate (75 mg, 0.40 mmol) were refluxed in 0.5 M HCl (1.0 mL) at 100°C for 2 h. The crude mixture was then hydrolyzed in 2 M LiOH at 40°C overnight. After the reaction was complete, DZPLI (4.2 mg, 0.011 mmol, 11% yield) was isolated by preparative HPLC using gradient B. 1H-NMR (400 MHz, CD3OD) δ 2.61 (t, 2H), 2.82 (t, 2H), 3.67 (m, 2H), 3.83 (m, 2H), 4.14 (m, 1H), 4.35 (m,1H), 4.64 (d, 1H), 7.84(s, 1H); HRMS (ESI-TOF) Calculated for C15H20O9N3 [M + H+] 386.1205, Found 386.1194.
4-Chloro-2, 6-dimethoxypyrimidine-5-carbaldehyde (4). An oven-dried flask was charged with commercially available 4-chloro-2,6-dimethoxypyrimidine (0.95 g, 5.5 mmol) and then evacuated and backfilled with argon. Anhydrous THF (5 mL) was added through a rubber septum. The mixture was cooled to -78°C and a 1.6 M solution of n-butyllithium (n-BuLi) in hexanes (3.8 mL, 6.0 mmol) was added dropwise. The mixture was stirred for an additional 0.5 h, and DMF (1 mL, 13 mmol) was added and stirring was continued for 2 h at the same temperature. The reaction was quenched by addition of aqueous HCl (1.6 M, 25 mL), and the mixture was extracted with ether (3 x 40 mL). The combined organic layers were washed with aqueous HCl (1.6 M, 25 mL) and water (40 mL), dried (Na2SO4) and evaporated to dryness. The residue was purified by column chromatography on silica gel (ethyl acetate-toluene, 1:6) to afford 4-chloro-2,6-dimethoxypyrimidine-5-carbaldehyde 4 (0.80 g, 4.0 mmol, 73%). 1H-NMR was consistent with reference spectra[39]. 1H-NMR (400 MHz, CDCl3) δ 4.11 (s, 3H), 4.15 (s, 3H), 10.34 (s, 1H).
(E)-4-(4-Chloro-2,6-dimethoxypyrimidin-5-yl)but-3-en-2-one (5). To a solution of 4 (120 mg, 0.60 mmol) in toluene (4 mL) was added commercially available 1-(triphenylphosphoranylidene)-2-propanone (189 mg, 0.60 mmol). The reaction mixture was refluxed for 6 hours. After the reaction was complete, the organic solvent was removed in vacuo. The resulting residue was purified by column chromatography on silica gel (ethyl acetate-hexane, 1:3) to afford the desired product 5. 1H-NMR (400 MHz, CDCl3) δ2.37 (s, 3H), 4.01 (s, 3H), 4.09 (s, 3H), 7.03 (d, 1H, J=18Hz), 7.67 (d, 1H, J=18Hz).
(E)-4-(2,4-Dimethoxy-6-(((2S,3S,4R)-2,3,4,5-tetrahydroxypentyl)amino)pyrimidin-5-yl)but-3-en-2-one (6). To a stirred solution of 5 (63 mg, 0.26 mmol) in DMF (4 mL) was added ribitylamine (120 mg, 0.80 mmol). The reaction mixture was refluxed for 12 h. After the reaction was complete, the organic solvent was removed in vacuo. The resulting residue was purified by preparative HPLC to deliver the desired product 6 (21 mg, 0.060 mmol, 23% over two steps). 1H-NMR (400 MHz, CD3OD) δ2.33 (s, 3H), 3.95-3.51 (m, 9H), 4.01 (s, 3H), 4.18 (s, 3H), 6.83 (d, 1H, J=16.4), 7.48 (d, 1H, J= 16.4).
7-Methyl-8-((2S,3S,4R)-2,3,4,5-tetrahydroxypentyl)pyrido[2,3-d]pyrimidine-2,4(3H,8H)-dione (monomethyl-DZ). Toa solution of 6 (20 mg, 0.050 mmol) in 6 M HCl (1 mL) was refluxed at 70°C overnight. After the reaction was complete, monomethyl-DZ (6.0 mg, 0.020 mmol, 39%) was purified by preparative HPLC. 1H-NMR was consistent with reference spectra [20]. 1H-NMR (400 MHz, D2O) δ2.78 (s, 3H), 3.41-3.88 (m, 5H), 4.29 (m, 1H), 4.44-4.65 (m, 5H) 7.18 (d, 1H, J=10.3), 7.48 (d, 1H, J= 12.2)
Human subjects. This study was conducted according to the principles expressed in the Declaration of Helsinki. Study participants, protocols and consent forms were approved by the Institutional Review Board at Oregon Health & Science University (OHSU IRB00000186). Informed consent was obtained from all human subjects included in the study.
Reagents and cells. Dendritic cells (DC) were derived from human peripheral blood monocytes as previously described [40, 41]. The bronchial epithelial cell line BEAS-2B (CRL-9609) was originally obtained from ATCC and was cultured in DMEM+10% heat inactivated fetal bovine serum (FBS). The BEAS-2B:ΔMR1 cell line was derived by CRISPR/Cas9 disruption of the MR1 gene, and MR1 expression was reconstituted in these cells [42]. Wild-type BEAS-2B cells overexpressing MR1 fused to GFP were previously described [22]. MAIT cell clones were derived, expanded, and maintained as previously described [11, 43].
ELISPOT assay. DC or BEAS-2B cells were harvested, counted and used in equivalent numbers, as indicated in the figure legends, as antigen presenting cells in an ELISPOT assay with IFN-γ production by MAIT cell clones as the readout as previously described [43]. Synthetic compounds or positive controls (M. smegmatis supernatant or PHA) were added to the cells at concentrations indicated for one hour prior to addition of the MAIT cell clones, and the ELISPOT plates were incubated for 18 hours prior to development. Phytohemagglutinin (PHA) was used at 10 mg/ml. Supernatant from M. smegmatis was prepared by collecting the <3 kDa fraction of supernatant from logarithmically-growing bacteria using a size exclusion column (Millipore). The volume of supernatant required for maximal response in the assay was determined empirically following preparation. Blocking was performed using the a-MR1 26.5 clone (Biolegend) and an IgG2a isotype control, added at 2.5 mg/ml for 1 hour prior to the addition of ligand.
Flow cytometry. BEAS-2B:MR1-GFP cells were grown in a 6-well tissue culture plate to ~70% confluency, and then incubated with synthetic compounds or vehicle at the indicated concentrations for 16 hours. Cells were harvested on ice and surface stained with the anti-MR1 26.5 antibody (1:100) conjugated to APC (Biolegend) for 40 minutes on ice in the presence of 2% human serum, 2% goat serum, and 0.5% FBS. Cells were washed and fixed, and subsequently analyzed with a BD FACS Symphony flow cytometer and FACS Diva software (BD). All analyses were performed using FlowJo software (TreeStar).
MR1 ligand docking. The crystal structures of MR1 chosen for this analysis are contained within Protein Data Bank entry 4GUP [2], the crystal structure of the heterodimer of human MR1 C262S and human β2m bound to the MR1 ligand 6-formylpterin (6-FP), and 4L4V, which contains the same protein species, except they are in complex with 6-methyl-7-hydroxyl-8-D-ribityllumazine (HMRL) and a MAIT TCR [8]. For PDB 4GUP, chains A (MR1) and B (β2m), which together compose one of the two conformations of MR1/β2m in the asymmetric unit, were selected due to this conformation’s similarity to that found in structures of MR1:ligand-TCR complexes (PDB ID: 4L4V, 4IIQ, 6PUF [5, 8, 9]). For PDB 4L4V, chains A (MR1) and B (β2m) were also chosen, though there was only minor structural heterogeneity between the two copies in the asymmetric unit. For 4LCC, a structure of chimeric human-bovine MR1 in complex with rRL-6-CH2OH and the same MAIT TCR as that of 4L4V, chain A (single chain β2m-MR1) was chosen. The PDBs were stripped of the remaining polypeptide chains, the ligands, all waters, and crystallographically-resolved ions. Separately, HMRL, DMRL, DZ, 2’-deoxy-DZ, monomethyl-DZ, rRL-6-CH2OH, PLI, and DZPLI ligands were sketched with ChemDraw 18.2 and copied to BIOVIA Discovery Studio [26] for exporting as Mol2 files with 3D information. Ligand files were subsequently converted to PDB format using Pymol [44] and visually inspected for appropriate geometry and bond angles. PDBQT files were prepared for both protein and ligand files using AutoDockTools suite [45, 46], which provides additional information regarding partial charge, atom type, and rotatable bonds. In order to restrict docking to the A’ pocket, Vina [25] was run using an x, y, z box size of 18, 20, 14 Å centered at x, y, z coordinates -6.19, -9.44, -11.53. All other Vina parameters were set to the default. Each ligand’s top binding mode was selected as the representative structure, and a single PDB was prepared for each dock in Pymol by combining MR1 and ligand output files. Alignments between the two donor structures were performed by aligning the Cα (alpha carbons) in the β sheet of the α1/α2 domains.
CHARMM input generation and molecular dynamics. The β2m/MR1:ligand complexes were solvated with the TIP3P water model using CHARMM-GUI Solution Builder [47-49], and neutralized with K+ and Cl– ions at physiological concentrations (0.15 M). To obtain missing parameters, ligands were parametrized by PDB coordinates using CGenFF [50]. Generated inputs were uploaded to the Midway compute cluster of the University of Chicago Research Computing Center to execute MD simulations. Each simulation was allowed 0.5 ns of equilibration followed by 80 ns of production, and each ligand was simulated in triplicate for a total of 15 simulations. Equilibrated systems use an NVT ensemble and production runs use an NPT ensemble, with the temperature kept constant at 300.15° K using Langevin dynamics [51]. The simulations were kept at constant pressure at one bar with the Nosé–Hoover Langevin piston by allowing the cell box size to change semi-isotropically [52]. van der Waals interactions were computed using a Lennard-Jones force-switching function over 10–12 Å while long-range electrostatics used particle mesh Ewald [53]. Production runs used a 2-fs time step and the SHAKE algorithm to constrain the bonds having hydrogen atoms [54].
TCR contact modeling. TCRs were selected with help from the TCR3D database [55], sampling from a diverse repertoire of TRBV genes, including 6-1, 6-2, 6-4, and 20-1 (PDB ID: 4L4V, 4IIQ, 4L9L, 4PJ7, 4PJ8 [5-8]). The 25th frame of the DZ and 2’-deoxy-DZ simulations were isolated and aligned to the 4GUP donor structure by the Cα (alpha carbons) in the β sheet of the α1/α2 domains of the heavy chain. Each TCR was then independently coordinated to the interface in the same fashion and CDR loops identified by IMGT V-Quest [56] were subsequently isolated for visualization.
Simulation analysis. Raw simulation data was processed using Bio3D [57], an R library with the ability to read, write and process biomolecular structure and trajectory data. Root mean square fluctuation (RMSF) was calculated using included functions to determine the conformational variance of each atom with respect to their mean position. Structural visualizations and alignments were performed using VMD [58], and renders were generated with Tachyon internal-memory processes. Ribityl time lapse was accomplished by aligning dynamics by aromatic core, and displaying the initial ring structure while selecting the ribityl pose every 8 ns starting at 3 ns (to allow for equilibration). RMSD and RMSF were plotted using the ggplot2 library in R [59].
Data analysis. Unless otherwise indicated, experimental data were plotted and analyzed for statistical significance using Prism 9 (GraphPad Software).