Analysis of Fc crystal structures for Fc engineering
The co-crystallisation of IgG Fc with enzymes is notoriously difficult, due to the inherent ability of the Fc fragment to crystallise on its own. We therefore sought to identify favourable contacts present in typical Fc crystals, in order to devise a strategy for the elimination of its selective self-crystallisation.
We have observed, from looking at structures currently present in the PDB, that IgG Fc commonly crystallises in the P212121 space group (52% of 125 structures, as of August 2022). We studied the crystal lattice contacts present in a typical, wild-type Fc structure (PDB code 3AVE), in order to identify amino acid residues which are important in this favourable packing arrangement. As calculated in PISA34, model 3AVE forms thirteen salt bridges and fifteen hydrogen bonds with neighbouring molecules within its crystal lattice (Fig. 1a). In addition, contacts are largely conserved across both Fc chains, resulting in a tight packing arrangement (Fig. 1a/b). We identified residue E382, which forms salt bridges with R255 in a neighbouring Fc molecule (and vice versa), in both Fc chains (Fig. 1b). We hypothesised that replacement of this residue would hinder the self-association of the Fc into this preferred crystal lattice, and therefore designed three IgG1 Fc variants: E382R, E382S and E382A, which we collectively term as “Fx” variants.
We obtained crystals of the Fc E382S variant, which were found to have grown in an atypical space group P3221. The structure was determined by molecular replacement using 3AVE as a search model and refined to a resolution of 3.04 Å (Supplementary Table 1). Analysis of the crystal contacts revealed that this variant makes fewer interactions with symmetry-related molecules in the crystal (four salt bridges and sixteen hydrogen bonds; Fig. 1a), which are asymmetrical across the two Fc chains, resulting in altered crystal packing (Fig. 1b). Furthermore, as calculated within the ccp4i2 interface35, the E382S variant had a higher solvent content and Matthews coefficient compared to the wild-type Fc (Fig. 1a), indicating that the molecules are less tightly packed in the variant crystal form. We conclude that these Fx variants are uniquely suited for screening attempts, as crystallisation of Fc fragments has been rendered less favourable; we subsequently used these variants for screening of enzyme-Fc complexes.
The IdeS-IgG1 Fc complex
Using our panel of IgG1 Fx mutants, IdeS from Streptococcus pyogenes (strain MGAS15252), containing a C94A mutation to abolish catalytic activity, was crystallised in complex with IgG1 Fc (E382A variant), in space group C121 (Supplementary Table 2). The structure was determined by molecular replacement with 1Y08 and 3AVE search models. Electron density resolves amino acids 43–339 in IdeS, as well as 228–445 and 229–444 for chains A and B in IgG1 Fc, respectively. We additionally observe density for seven/eight monosaccharide residues at the N-linked glycosylation site (at N297) on Fc chains A and B, comprising a fucosylated biantennary glycan with a single β-1,2-linked GlcNAc on the mannose 6-arm (chain A) and the equivalent glycan with terminal β-1,2-linked GlcNAc on both arms (chain B). The final structure was refined to 2.34 Å (Supplementary Table 2) and is depicted in Fig. 2.
Overall structure of IdeS-IgG1 Fc complex
The crystal structure shows asymmetric binding of IdeS across the Cγ2 domains of the Fc (Fig. 2a). Although the crystalline state of a molecule doesn’t necessarily represent its biologically-relevant form, the combination of evidence from crystallography (Fig. 2a), analytical size exclusion chromatography (Supplementary Fig. 1) and previous kinetic analyses36 allows us to conclude that IdeS functions predominantly in a monomeric form. The enzyme appears to clamp down over the lower hinge region of one Fc chain (Fig. 2a), creating a cavity in which the catalytic residues are brought into close proximity with the cleavage site. Consequently, the Cγ2 domain in chain A is pulled away from chain B; this is reflected in a greater root mean squared deviation between Cαs in the Cγ2 domains (1.347 Å compared to 0.675 Å in wild-type Fc 3AVE, calculated in ChimeraX37 for residues 237–341) and higher atomic B factors in this domain (Supplementary Fig. 2a).
Role of prosegment binding loop in complex formation
IdeS crystallised in complex with IgG1 Fc here is the Mac-2 variant, and thus displays sequence diversity against the three published apo structures of IdeS (all of which are the Mac-1 variant; Supplementary Fig. 3a). Despite this, a structural alignment shows very few deviations (Fig. 2b). Complexed IdeS contains ten alpha helices and twelve beta strands, as calculated by DSSP38,39 (Supplementary Fig. 2b); we note that the loop located between beta strands seven and eight is modelled in distinct conformations for each of the apo structures and is not included within 1Y0840 (Fig. 2b), signifying its inherent flexibility in the apo form. This loop is equivalent to the “prosegment binding loop” present in other papain superfamily cysteine proteases; in these enzymes, which are synthesised as inactive zymogens, this loop packs against the prosegment as a mode of inhibition41–43. In complexed IdeS, the loop curls upwards to accommodate the Fc hinge within the active site cavity (Fig. 2a).
Alanine substitution mutations within this loop were previously found to have little effect on IdeS binding to IgG, or its catalytic activity44. Our structure shows, however, that the majority of interactions present here involve the IdeS backbone, whose conformation won’t be significantly altered by alanine mutations. The inability of IdeS to cleave IgG hinge-mimicking peptides30 also indicates an occlusion of the active site in the absence of substrate, especially given the strong potential of hydrogen bonding and hydrophobic interactions observed at the Fc hinge (discussed in the following section). We therefore conclude that this loop is important for IdeS function, specifically in mediating substrate access to the active site.
Interaction of Fc hinge at active site
We observe clear density for the Fc hinge region bound within the active site cavity (Fig. 2c): the carbonyl oxygen of G236 is hydrogen bonded to the amide nitrogen of the catalytic cysteine (mutated to alanine here) and the side chain of K84, which collectively form the oxyanion hole, as predicted40,44. Binding of the hinge distorts the peptide backbone at G236 in order to promote scissile bond cleavage (Fig. 2c); this residue is thus identified in Molprobity45,46 as a Ramachandran outlier. Superposition of wild-type IdeS (PDB code 2AU1) gives an indication for placement of the catalytic cysteine side chain (Supplementary Fig. 3b): in this conformation, the cysteine sulphur is ideally poised for nucleophilic attack on the carbonyl carbon within the scissile peptide bond. A water molecule observed within the active site (Fig. 2c), held in position via hydrogen bonds to L92, G95 and V171 backbone atoms (within IdeS) and the carbonyl oxygen of L235 in the Fc hinge, is well-placed to act as a base catalyst of the emerging covalent tetrahedral intermediate.
Extended exosite binding to the Fc Cγ2 domains
It has long been suspected that IdeS must recognise its sole substrate IgG with exosite binding30,40,44. Our structure now reveals that IdeS binds across both chains of the Fc (Fig. 3a). Unsurprisingly, the most extensive interface is formed with the Fc chain being cleaved (chain A in our structure) (Fig. 3b), with an interface area of 1392 Å2 and a solvation free energy gain upon interface formation of -15.9 kcal/mol, as calculated by PISA34. The interface extends across the entire hinge region (P228-S239; Fig. 3b), with hydrogen bonds formed with A231, L234, G236 and G237 backbone atoms and the E233 side chain, and favourable hydrophobic interactions predicted here (inferred by positive solvation energies of hinge residues). Within the Fc Cγ2 domain, IdeS interacts with residues in proximity of the Fc BC loop, which aids in stabilising an “open” conformation of the prosegment binding loop (as discussed above), and additionally the FG loop (Fig. 3b).
A secondary interface is formed across chain B of the Fc (Fig. 3c), with an interface area of 802.3 Å2 and a solvation free energy gain of -7.7 kcal/mol. A smaller proportion of the Fc hinge contributes (A231-G237), but PISA predicts favourable hydrophobic interactions here, albeit not to the same extent as chain A. Subsequent recognition of this Fc chain is driven by interactions with the BC loop, and, in contrast to chain A, the C′E loop containing the N-linked glycan (Fig. 3c). PISA additionally predicts a small number of interactions between the enzyme and the Fc N-linked glycans; the lack of electron density for any monosaccharides past β-1,2-linked GlcNAc suggests that any further glycan processing doesn’t affect complex formation, and that IdeS can accommodate IgG with heterogenous glycosylation.
Although IdeS interacts with both chains in the Fc hinge simultaneously, following cleavage of the first chain, the complex would need to dissociate before the second cleavage could occur. This observation is also evidenced by detection of single-cleaved Fc in enzymatic assays and in clinical studies8,36,47,48. We suspect that the binding interface is altered for single-cleaved Fc and that this explains its slower rate of cleavage8,9,36. It is also interesting to note that, aside from the hinge region, IdeS binds Fc regions also implicit in FcγR binding, an observation also inferred by its ability to counteract Fc-mediated effector functions by competitive binding inhibition6. Moreover, we observe that IdeS residues interacting with the Fc are largely conserved across the two IdeS isoforms, and any substitutions are mostly to similar amino acids, which aids in explaining their near identical activity7.
The EndoS-IgG1 Fc complex
To date, there are several known structures of endoglycosidases in complex with their glycan substrates26,33,49−51. Here, we present the structure of truncated EndoS (residues 98–995, as described previously31) in complex with its IgG1 Fc substrate (Fc E382R mutant). A catalytically inactive version of EndoS was generated by the inclusion of D233A/E235L substitutions, as described previously33. The complex crystallised in space group P212121 and was refined to a resolution of 3.2 Å (Supplementary Table 3); the final structure is depicted in Fig. 4.
Overall structure of EndoS-IgG1 Fc complex
Our structure of EndoS shows the same “V” shape as observed in its previously solved structures31,33, comprising, from the N- to the C terminus: a proline-rich loop (residues 98–112) a glycosidase domain (residues 113–445), a leucine-rich repeat domain (residues 446–631), a hybrid Ig domain (residues 632–764), a carbohydrate-binding module (CBM; residues 765–923) and a C-terminal three-helix bundle domain (C-3HB; residues 924–995) (Fig. 4a). One Cγ2 domain in IgG1 Fc (chain A in our structure) binds across the termini of the “V”, in-between the glycosidase domain and CBM, with the rest of the antibody remaining exposed to the surrounding solvent. The N-linked glycan on this chain is “flipped-out” from its usually-observed position between the two Fc Cγ2 domains52 and is bound within the previously-identified glycosidase domain cavity33 (Fig. 4b). A structural overlay with full-length EndoS in complex with its G2 oligosaccharide substrate (PDB code 6EN3) shows that the overall morphology and domain organisation of EndoS is approximately maintained (Supplementary Fig. 4a), apart from a slight shift of the CBM and C-3HB, likely due to a pinching of the CBM around the Fc as it binds.
Role of CBM in governing specificity for IgG
Our structure of the EndoS-Fc complex reveals how one Cγ2 domain of the Fc binds across the glycosidase domain and CBM (Fig. 4). As calculated by PISA34, the interface between chain A of the Fc and EndoS comprises an area of 1356.8 Å2 and yields a solvation free energy gain of -9.5 kcal/mol. The glycosidase domain of EndoS is observed forming contacts with the glycan-containing C′E loop, while the CBM forms additional interactions at the Fc Cγ2-Cγ3 interface (Fig. 4b). We note that residue W803 within the CBM, whose substitution to an alanine has previously been shown to abolish hydrolytic activity against all human IgG subclasses31, appears to act as a hydrophobic “plug”: it binds within a cavity at the Cγ2-Cγ3 interface containing Fc residues I253, H310, L314 and H435 (Fig. 4b), and has the highest solvation energy (of 2.02 kcal/M) of all EndoS residues calculated by PISA, indicating that strong hydrophobic interactions are present here. A small number of contacts is also predicted between EndoS and the second Cγ2 domain, although these are unlikely to be necessary for complex formation, given that EndoS can cleave the Fc Cγ2 lacking the hinge region (likely monomeric)32.
The complex structure presented here corroborates previous findings that both the glycosidase domain and the CBM are important for IgG Fc binding31 and glycan hydrolysis32, and that EndoS can cleave the Cγ2 homodimer fragment of IgG Fc32. The related enzyme EndoS225 likely binds IgG in a similar manner: hydrogen-deuterium exchange mass spectrometry on this complex has similarly indicated strong binding of IgG to the glycosidase domain and the CBM26. While mutation of residues within the glycan binding site of both enzymes completely abolishes their hydrolytic activity26,33, EndoS lacking the CBM can still hydrolyse IgG, albeit at greatly reduced capacity31,32. Therefore, the CBM appears to drive additional specificity of EndoS for the Fc peptide surface.
Interestingly, although previous work has indicated that it can bind galactose (albeit with low affinity)32, the CBM doesn’t bind carbohydrate, and the N- and C-terminal 3 helix bundles, which are homologous to IgG-binding protein A from Staphylococcus aureus33,53, don’t bind protein. A structural overlay of complexed EndoS with full-length EndoS (PDB code 6EN3) indicates that the N-terminal bundle would not contact the Fc (Supplementary Fig. 4a), thus its contribution to EndoS-IgG binding and glycan hydrolysis is likely solely due to stabilisation of the glycosidase domain, as suggested previously33. Indeed, existence of the crystal structure is evidence in itself that EndoS forms a stable complex with IgG in its absence.
Stoichiometry of EndoS-Fc complex
Within the crystal, we observe a 2:1 stoichiometry of EndoS binding to IgG Fc in the complex (Fig. 5a). The N297 glycan within chain A of the Fc binds one EndoS molecule, while its counterpart in chain B, although not fully visible in the electron density, appears to be bound to a second EndoS molecule present in the asymmetric unit of the crystal (Fig. 5a). We observe clear electron density for the chain A glycan binding within the EndoS glycosidase domain cavity previously identified33, and for the C′E loop which covalently links the glycan to the Fc peptide (Fig. 5). This observation of an Fc glycan in this conformation is in strong contrast to typical crystal structures of IgG Fc, whose N-linked glycans are found between the Cγ2 domains52 (Fig. 5b).
It is fascinating to observe the glycan trapped in this “flipped-out” conformation, and this substantiates several recent studies documenting the existence of IgG Fc glycan conformational heterogeneity54–58. Superposition of this complexed IgG with a wild-type Fc (PDB code 3AVE) illustrates that movement of the glycan into this position is governed by movement of the C′E loop only (Supplementary Fig. 4b), although it is possible that the lower resolution of the data is masking small chain shifts. Moreover, it appears that the capture of Fc N-linked glycans in this state allows space for two enzymes to bind simultaneously; however, there is no evidence to suggest that this 2:1 assembly is required for activity, especially given previous work showing that EndoS is largely monomeric in solution31,33. Although EndoS crystallised here is lacking the N-terminal 3-helix bundle, a structural superposition with full-length EndoS (Supplementary Fig. 4a) suggests 2:1 binding would be able to occur in its presence.
Perspectives
The crystal structures presented here provide a structural rationale for the unique properties of these two enzymes, particularly their exquisite substrate specificity towards human IgG. Understanding the molecular basis of this activity is critical for expanding their clinical and biotechnological use. For example, the deactivation of serum IgG using both IdeS and EndoS can strengthen the potency of therapeutic antibodies21,22; this strategy could be applied to potentiate any therapeutic antibody, in theory, if the antibody were designed to be resistant to cleavage by these enzymes, a venture which can be aided greatly with structural information. This will also be invaluable in the synthesis of immunologically-distinct enzyme variants which retain identical activity, for their long-term therapeutic use. While EndoS variants have already been designed to expand the ability to engineer antibody glycosylation27–29, the structural information presented here will allow this to be extended further. To conclude, this work will assist in the continued development of IdeS and EndoS as enzymatic tools with wide clinical and biotechnological applications.