Previously39, we assembled a library of glycan-coated phages using strain-promoted azide–alkyne cycloaddition (SPAAC) to ligate oligosaccharides with alkyl-azido linkers to dibenzocyclooctyne (DBCO)-modified M13 phage, each containing a distinct DNA barcode. We chose to employ this approach again to prepare an N-glycan library starting using a heterogenous sialylglycopeptide (SGP, 1, Fig. 1a and Fig.S1-S7) from egg yolk44,45, a commonly employed starting material for chemoenzymatic N-glycan preparation46.
Initially, we used a published route45 to trim 1 to a homogenous N-glycan and ligate it to phage. To do this, 1 was treated with pronase to provide a mixture of asparagine (Asn)-linked biantennary oligosaccharides 2 (Fig. 1a and Fig. S2). Subsequent treatment of 2 with neuraminidase and then β-galactosidase resulted in homogeneous GlcNAc-terminating biantennary structure, 4 (Fig. S3-4), which was N-acylated on the amine of Asn with 8-azido- octanoic acid NHS-ester 555 to yield 6 (Fig. S5). Biantennary glycan 6 retained its natural
N-linkage to Asn, whereas the azido-linker allowed ligation to DBCO-modified M13 phage by SPAAC. Monitoring the SPAAC reaction by MALDI-TOF MS (Fig. 1c) showed that ligation of N-glycan 6 required longer times (24 h) compared to the 1–2 hour reaction times needed for smaller glycans39. We then used similar steps to install a heterogeneous glycosyl asparagine derivative. Thus, Asn-linked N-glycans 2 were acylated with 5 to yield a mixture of N-glycans 7, which was ligated to DBCO-modified M13 phage (Fig. 2a). MALDI-TOF MS confirmed the modification (Fig. 2c, d and see Fig. S8 for optimization of MALDI conditions): The peaks S1 and S2 correspond to natural symmetric and asymmetric biantennary structures. The peak S2’ represents the cleavage of one sialic acid from a symmetric glycan during MALDI-TOF MS detection as confirmed by enzymatic treatment described below.
Phages decorated by either homogeneous or heterogeneous glycans can be used for chemoenzymatic glycan modification. Such on-phage trimming or elongation of glycans facilitates the preparation of glycoconjugates with consistent densities across a range of structures. MALDI-TOF MS confirmed that β-galactosidase treatment quantitatively removes terminal galactose residues from glycans on phage (Extended Data Fig. 1). Similarly, neuraminidase trimming of sialic acids in phage-displayed SGP yielded N-glycans with either one or two terminal galactose residues (peaks P1 and P2, Fig. 2e). Subsequent β-galactosidase treatment (Fig. 3a) revealed progressive cleavage of both galactose residues with complete disappearance of the symmetric structure after ~ 2 hours (Fig. 3c), transient accumulation of mono-galactosylated glycans I1, and then their disappearance after ~ 4 hours (Fig. 3c). Tandem neuraminidase and β-galactosidase treatment of heterogeneous 7 on phage, thus, quantitatively gave a homogeneous glycosylated product (Fig. 3a). In contrast, direct β-galactosidase treatment of 7 on phage yielded no observable changes (Fig. S9), confirming that the glycan contains no species with terminal galactose residues and that the P2’ peak observed by MALDI-TOF MS are indeed “ghost” species generated by sialic acid cleavage during analysis (Fig. S9). Further evidence comes from model 6’SLN (a-Neu5Ac-(2→6)-LacNAc) glycans ligated to phage, which can be cleaved by β-galactosidase; such cleavage was blocked by sialylation of the galactose residues (Extended Data Fig. 2). The homogenous
biantennary glycan with terminal N-acetylglucosamine (GlcNAc) was further treated with β-Ν-acetylglucosaminidase to cleave the GlcNAc residues yielding a homogenous N-glycan structure with terminal mannose (paucimannose) (Extended Data Fig. 3). These results confirm that efficient multi-step enzymatic trimming of N-glycans on phage is possible.
We also explored on-phage N-glycan synthesis using glycosyltransferases. In model studies, phages with N-glycans terminating with GlcNAc on phage (Extended Data Fig. 4) or in solution (Fig. S7) were treated with β-(1→4)-galactosyltransferase (B4GalT1) and uridine 5’-diphosphogalactose (UDP-Gal)46 to give, after ~ 40 hours, phages with lactosamine (LacNAc)-
terminating structures (Extended Data Fig. 4b). We also applied these conditions to transfer galactose to heterogeneous asialo-SGP on phage (Extended Data Fig. 5), to provide a homogenous biantennary N-glycan with terminal galactose residues. MALDI-TOF MS confirmed a time-resolved conversion of peak P1 (asymmetric glycan) to species P2 (symmetric glycan) over the course of eight hours (Extended Data Fig. 5b). In model studies of sialylation, quantitative addition of sialic acid to LacNAc- and lactose-phages was achieved using recombinant a-(2→6)-sialyltransferase from Photobacterium damselae (Pd26ST) and cytidine-5’-monophosphate-N-acetylneuraminic acid (CMP-Neu5Ac). After nine hours, a-Neu5Ac-(2→6)-LacNAc and a-Neu5Ac-(2→6)-Lac-phages were formed quantitatively (Extended Data Fig. 2b and 6b). Pd26ST also transferred a sialic acid derivative bearing a 3-butynamide group at C-556,57 to Lac-phage (Extended Data Fig. 6c). This alkyne handle can be exploited for further chemical derivatization of phage-displayed glycans.
The loss of sialic acid during the MS analysis made it difficult to confirm reaction completion by this technique alone (Extended Data Fig. 2b and 6b). After the sialylation, the phage was therefore treated with β-galactosidase to cleave the terminal galactose in any unreacted LacNAc or Lac moieties (Extended Data Fig. 2b and 6g). Using this approach, we confirmed that sialylation proceeds to completion and that asialoglycans observed in MALDI-TOF MS are “ghost” peaks. If necessary, tandem cleavage of Gal and then GlcNAc can also distinguish partially and fully sialylated glycans (Extended Data Fig. 7). We also observed a reduction of MALDI-TOF MS signal intensity upon sialylation. To ensure that this decrease is not due to glycan degradation during the enzymatic reaction, we performed a tandem Pd26ST-catalyzed sialylation and neuraminidase de-sialylation. The intensity for the LacNAc-pVIII conjugate in the mass spectrum was similar before and after the sialylation/de-sialylation cycle (Extended Data Fig. 8) confirming that the decrease in intensity is due to decreased ionization capacity58. Using these optimized synthesis and monitoring procedures, we performed a two-step on-phage enzymatic extension using B4GalT1 and Pd26ST to yield a symmetric biantennary sialylated N-glycan (Fig. 4). Phage- bound 7 was first extended by B4GalT1 and UDP-Gal (Fig. 4b and 4d). After purification by PEG-precipitation, Pd26ST-catalyzed transfer of Neu5Ac yielded a homogeneous product P2 on phage (Fig. 4b and 4d). These results confirm that multi-step enzymatic glycan extension can be used to create diverse N-glycans directly on phage.
Developing a robust method to synthesize phage-displayed N-glycans made it possible to study the effect of N-glycan structure and density on GBP binding. To this end, we synthesized
a library of six N-glycans displayed at five different densities (50, 150, 500, 750 and 1000 glycans/phage) (Fig. S10–S15). An example of our ability to control the glycan density is shown in Extended Data Fig. 9. The density was set by installing a range of 50–1000 copies of DBCO per phage (confirmed by MALDI-TOF MS), followed by complete conjugation with 7 and, finally, quantitative chemoenzymatic conversion of phage-SGP to the desired structures (again confirmed by MS). The resulting library, dubbed “LiGA6×5”, was used to analyze binding to nine lectins: Concanavalin A (ConA), Sambucus nigra-I (SNA-I), Ricinus communis agglutinin (RCA-I), Lens culinaris hemagglutinin (LCA), Pisum sativum agglutinin (PSA), Galanthus nivalis (GNL), Erythrina cristagalli agglutinin (ECL), Wheat Germ agglutinin (WGA) and CD22 (Fig. 5 and Fig. S16–S24) as well as cells overexpressing CD22 and DC-SIGN (dentritic cell-specific intercellular adhesion molecule-3-grabbing non-integrin) (Fig. 5). In each experiment, LiGA6×5 was incubated in wells coated with each lectin before unbound phages were removed by washing. Bound particles were eluted with 1M HCl and then analyzed by next generation sequencing (NGS). As a metric, we used the fold change (FC) difference39 in copy number of each phage with respect to its copy in LiGA6×5 incubated in BSA-coated wells, which underwent analogous processing (wash, elution, NGS).
SNA-I lectin59 and CD22 (Siglec-2) both recognize a-(2 →6)-sialylated N-glycans and, as expected, we observed predominant interaction of a-(2 →6)-sialylated components of LiGA either with purified SNA-I and CD22 proteins or CD22 proteins expressed on the surface of CHO cells (Fig. 5d-f and Supplementary Fig. S16, S17). SNA-I bound strongly to a medium density of glycans (~ 150 glycans per phage), whereas phage with ≤50 or ≥500 glycans bound significantly lower. In contrast, CD22 required ≥500 glycans per phage for significant binding (Fig. 5e). The same density dependence was reproduced in a more biologically-relevant environment: CD22 expressed on the surface of cells (Fig. 5f). We are not aware of any reports describing non-overlapping density dependence of SNA-I and CD22 proteins, which may stem from differences in the accessibility of protein clusters on the plate or cell surface as well as 50–100x weaker affinity of interaction of sialylated glycans with CD22 when compared to SNA-I (75 µM and 0.77 µM respectively).60,61
An interplay of structure and density was also observed in a mannose binding lectin family (ConA, LCA, PSA, GNL and DC-SIGN). ConA recognizes a wide range of biantennary N-glycans and tolerates multiple extensions,59 and LiGA6×5 detected binding of ConA to nearly all biantennary N-glycans on phage with the exception of paucimannose with terminal GlcNAc (Fig. 5 and Supplementary Fig. S18) and lack of ConA binding to this N-glycan was in agreement with earlier observations (Supplementary Fig. S18).28 Binding occurred at medium density, 150–750, depending on the glycan, but at 1000 glycans per phage there was no detectable binding to many N-glycans. Unlike ConA, the mannose binding lectins LCA and PSA bound to the core Man3 epitope in all six N-glycans (Fig. 5h and Supplementary Fig. S19, S20). Some glycan array studies have suggested that core fucosylation is required for LCA or PSA to recognize the Man3 epitope28 (Supplementary Fig. S19d, S20d) whereas other investigations59,62 observed binding to Man3 without fucosylation (Supplementary Fig. S19e, S20e). Notably, PSA recognized a-(2 →6)-sialylated glycans at 1000 glycans/phage whereas binding to asialoglycans at 1000 glycan/phage density was significantly decreased (n = 5 independent experiments). A possible explanation for this observation is a change the conformation or accessibility of Man3 with and without the negatively charged sialic acid epitopes. Finally, the GNL lectin, which is known to recognize paucimannose, bound only to phages that display 150 and 750 copies of this structure and not to any other N-glycan at any density (Fig. 5h, Supplementary Fig. S21). The double bimodal binding profile could suggest that GNL lectin can bind paucimannose in two different ways, but such hypothesis would have to be confirmed by mode detailed investigations.
Recognition of LiGA6×5 by DC-SIGN was dramatically different from ConA, LCA, PSA or GNL. Using DC-SIGN+ rat fibroblasts, we observed that DC-SIGN does not tolerate terminal LacNAc or sialyl-LacNAc on the core Man3, regardless of glycan density (Fig. 5g and 5h). In line with our previous report,39 a narrow range of density – 500 Man3 epitopes/phage – was optimal for interaction with DC-SIGN+ cells. This preference shifted to higher density for GlcNAc-terminated Man3, corroborating an earlier study63 that showed that DC-SIGN recognition requires a high density of GlcNAc-terminated Man3 epitopes.63 All glycan binding was ablated at ≥1000 glycans/phage, presumably due to steric occlusion of tightly packed epitopes.39
In the Lac/LacNAc binding lectin family, RCA-I lectin59 bound to LiGA6×5 components decorated with galactose-terminated N-glycans and those with a-(2 →6)-linked sialic acid (Fig. 5h and Supplementary Fig. S23). LiGA measurements matched prior observations28 and uncovered a previously unknown bimodal density dependence of RCA-I binding (Supplementary Fig. S23). ECL, which is known to recognize terminal β-(1→4)-linked galactose,59 bound selectively at low concentration of lectin ([ECL] = 10 µg/mL) to phage displaying terminal galactose. At high concentration ([ECL] = 20 µg/mL), this lectin also recognized phages with a-(2 →6)-sialylated galactose (Fig. 5h and Supplementary Fig. S22). WGA, which binds to various terminal59 and internal GlcNAc64, exhibited binding to all LiGA components including Man3GlcNAc2 with internal GlcNAc (Fig. 6c, Supplementary Fig. S24). Binding to specific N-glycans was dictated by density of the glycan on phage and attenuated by the concentration of WGA used in the experiment (Supplementary Fig. S24). The recognition preferences of ECL and WGA aligned with some earlier glycan array experiments59 (Supplementary Fig. S22c, S22e, and S24c, S4e) but diverged from others (Supplementary Fig. S22d, S24d).28 64 In these latter cases, the binding of ECL or WGA to sialylated-N-glycans was not observed and we note that these lectins bound to sialylated structures in LiGA6×5 only at high density (750–1000 glycans/phage). In contrast, binding of asialoglycans to ECL and WGA was bimodal; binding was optimal at intermediate glycan densities. The complex interplay of glycan density and glycan structure might explain the inconsistent binding preferences between prior glycan array experiments. These LiGA experiments thus emphasize the importance of testing multiple glycan densities as GBP–glycan binding depends on not only on lectin concentration but also on the spatial arrangement (density) of glycans.
Previously we employed LiGA to measure interactions between glycans and receptors on the surface of B cells in live animals.39 Here, we injected LiGA into the mouse tail vein (n = 3 animals), recovered organs after 1 hour, amplified phages from the organs and plasma and employed NGS of phage populations to determine the structure and density of glycans associated with each organ (Fig. 6 and Supplementary Fig. S25–S30). We compared the Fold Change (FC) difference in copy number and its significance (False Discovery Rate, FDR < 0.05) for each glycophage in each organ with respect to the same glycophage recovered from plasma. The injected LiGA contained N-glycan glycophages (components of LiGA6x5 in Fig. 5), previously described glycophages that displayed synthetic glycans39, 10 phage clones in which the DBCO handle was capped by azidoethanol and 16 unmodified phage clones. The latter 16 + 10 “blank” clones served as important baseline of the in vivo homing experiment and exhibited only a minor fluctuation in FC across all organs (Fig. 6b). A significant (FDR < 0.05) spleen enrichment of diverse N-glycans (Fig. 6b) and synthetic glycans (Fig. S25) was anticipated because lymphocytes, macrophages, dendritic cells, and plasma cells residing in spleen express the most diverse array of cell-surface lectins. In contrast, few N-glycans enriched in kidneys, heart and lungs; high FC ratio of paucimannose-conjugated phage was detected in kidneys and heart but its significance could not be inferred; no significant enrichment of any N-glycan at any density was observed in the lungs. In liver homing, phage particles that displayed various densities of de-sialylated N-glycans with terminal galactose exhibited a significant (P = 0.02) accumulation in the liver when compared to N-glycans in which galactose was fully or partially capped by sialic acid (Fig. 6b-c).
Removal of terminal galactose to expose terminal GlcNAc abrogated liver targeting (Fig. 6b-c). This observation resembled natural clearance of desialylated red blood cells and platelets by liver. Specifically, aged, desialylated platelets, are cleared by the hepatic Ashwell Morell Receptor (AMR) complex composed of two asialoglycoprotein receptors (ASGPR) 1 and 265,66. From 12 glycans significantly enriched in liver when compared to plasma (FDR < 0.05), three synthetic glycans—P1 tetra / Gb4 (GalNAc(β1–3)Gal(α1–4)Gal(β1–4)GlcNAc(β-Sp, Globoside P (GalNAc(β1–3)Gal(α1–4)Gal(β1–4)Glc(β-Sp and GD2 (GalNAc(β1–4)[Neu5Ac(α2–8)Neu5Ac (α2–3)]Gal(β1–4)Glc(β-Sp—enriched in liver significantly more than in any other tested organs (p < 0.05). All three contained terminal beta-linked GalNAc residue. This observation mirrors well-known delivery of β-GalNAc-conjugates to ASGR receptors in liver used in FDA-approved drugs (Givlaari) and 28 other GalNAc-conjugated oligonucleotides tested in phases I-III of clinical trials.67 Biodistribution of glycophages—components of LiGA—thus mirrors a number of well-known biological mechanisms. These results highlight the possibility of using LiGA to identify both the structure and density of glycans necessary for homing of glycoconjugates to a specific organ paving the route to discovery of new strategies for delivery of therapeutics and uncovering mechanisms that govern glycan-driven biodistribution in vivo.