Glycosylation is a common post-translational modification (PTM) in proteins, influencing protein folding [1] and targeting (both intracellular and extracellular targeting [2]. This has a profound influence on the quality and titer of IgGs manufactured in the pharmaceutical industry. Additionally, glycosylation affects peptide-MHC binding [3], and Fc receptor interactions [4]. Therefore, the glycosylation of biopharmaceutical drugs affect its safety and efficacy – by affecting it’s in vivo half-life [5], immunogenicity [6] and effector functions [7]. A recent survey of the top 20 best-selling IgG drugs shows that more than half of them were glycoproteins (either monoclonal antibodies (mAbs) or Fc-based fusion proteins [8].
Most therapeutic antibodies belong to the IgG1 subclass. The mechanism of action often comprises antibody-dependent cell-mediated cytotoxicity (ADCC) which requires interactions mediated by the Fc region of the IgG1. While the Fab domain of different monoclonal antibodies are structurally diverse in order to recognize different antigens, the Fc region is relatively well conserved and lacks the structural diversity of the Fab domain. After the Fab domain has bound to the antigen, the Fc domain determines the function/mechanism by which the monoclonal antibody will act. For example, it may recruit molecules of innate immune system (complements) or bind to antigen presenting cells (APCs) by binding to Fcγ receptors [12]. Interestingly, the Fc region of IgG1 has two conserved N-glycosylation sites at Asn297 (one each in the two CH2 domains of each heavy chains) [12, 13]. Therefore, variability in glycan architecture affects how IgG interacts with the immune system, and therefore the downstream function/efficacy as well.
One of the major reasons for glycan variability is that in contrast to proteins, the biosynthesis of glycans is not solely derived from the plasmid template, but also depends on other factors like genetic make-up of the cells (cell-lines) in which the glycoproteins are expressed [9] including epigenetics [10] and the extracellular environment (cell culture) [11].
As a result, glyco-biologics (biopharmaceutical drugs which are glycosylated) often have a heterogeneous profile for glycans. This profile is heavily dependent on specific host cell line (HEK vs CHO) and the upstream cell culture process. Additionally, the downstream purification process (more specifically, the modes of chromatography used in the purification process) may alter the glycan profile of the final enriched, purified species.
Therefore, regulatory agencies require glycan profiles to be consistent for different batches of the drug. This is critical as glycan profiles affect the safety and efficacy of a drug. Hence, FDA has suggested a QbD (Quality by Design) approach to monitor glycosylation as far as the Critical Process Parameters (CPP) for Drug Substance Manufacture and Target Product Profile (of Drug Product) is concerned. Thus, the process design (upstream/downstream) needs to identify the CPPs for maintaining the optimal glycan profile. QbD requires an appreciation of historical data, design of experiments (DoE), process analytical technology (PAT) and concomitant risk assessments.
All IgGs including glycoproteins need to be purified; additionally, glycoproteins need the glycan architecture to be monitored through the process as that has a profound effect on potency. Generally, the proteins are captured by a Protein A step followed by other polishing steps [14]. Such additional polishing steps consists of ion-exchange and hydrophobic interaction chromatography – these modalities are used to remove process impurities like endotoxins, host cell proteins (HCP) and residual DNA [15]. This is followed by non-chromatographic steps to concentrate the IgG – such as ultrafiltration and diafiltration (UFDF). A population of glycans will behave bind to an anion-exchange column or hydrophobic column (based on charge or hydrophobicity), and the heterogeneity of the purified pool will depend on the specific elution condition. As a result, pharmaceutical manufacturers need to analyze the glycan profile throughout process development and manufacturing.
This diversity coupled with the nonlinear structures of glycans gives their physicochemical characterization an added layer of complexity. Additionally, an oligosaccharide can be “N-linked” or “O-linked” depending on whether the carbohydrate is attached through the amide group of an asparagine residue (“N-linked”) or the hydroxyl group of a serine/threonine residue in an IgG “O-linked”. Therefore, multiple analytical techniques are used to understand IgG glycosylation. Generally speaking, the analysis can be carried out at three distinct levels [16], depending on the specific question to be answered. These levels include the intact protein level, glycopeptide level, or with released glycans.
At the intact protein level, one can use chromatography, gel electrophoresis or mass-spectrometry. At an intact level, a top-down approach can be used to analyze intact protein samples with minimal sample preparation in order to obtain molecular weight and PTMs like glycosylation. It can be used to gauge variability between different batches. However, an intact top-down approach is limited in detecting the nuances of minor glycoforms. Therefore, middle-down strategies, coupled with chemical and enzymatic approaches can be used to analyze different antibody subunits. This approach using reducing agents to reduce inter-chain disulfide bonds and specific protease (like IdeS) is often utilized to detect and identify multiple glycoforms and other PTMs [17].
Additionally, glycopeptide analysis is carried out to comprehend glycosylation microheterogeneity using a bottom-up approach. Here, the therapeutic biologic is treated with proteolytic enzymes (trypsin) and then analyzed with LC/MS or CE [18]. Such methods are used for monitoring variability between different manufacturing batches or process development. Finally, the identity of the monosaccharide composition, the specific glycosidic linkage – of which is critical to safety/efficacy of the therapeutic can be characterized in-depth by HPLC/UPLC coupled with MS [16]. A major technical challenge is the lack of a chromophore in the monosaccharide augmented by the different configurations of the glycan. To circumvent this, released glycans are fluorescently labelled for optimal detection [19]. Capillary electrophoresis with laser-induced fluorescence (CE-LIF) has also been used for glycans analysis (where the released N-glycans are labelled with 8-aminopyrene-1,3,6-trisulfonate). In this case, the labelled glycans are resolved using charge/hydrodynamic-volume ratio [20]. CE-LIF can also be coupled with HPLC/UPLC in a 2D format.
The gold standard to interrogate glycosylation is MS as it has the best resolution. The most common ionization methods for analyzing glycosylation are ESI and MALDI. However, a common challenge to most methods is the complexity introduced by the enzymatic digestion of glycosylated biologics. This may lead to competitive ion suppression (analyzing glycopeptides vs peptides) during mass-spectrometric analysis. Additionally, it may not be practically feasible to engage the MS team during different phases of downstream purification. Engaging a MS team may be time consuming and often a MS team may be geographically distant from the downstream team, requiring samples to be shipped which may lead to temperature excursions, etc.
A highly desired kit in a downstream purification team’s toolbox is a seamless method to detect, monitor and quantitate glycans in real-time. This will be critical to guide both downstream purification and upstream expression studies. This is especially important as glycans are increasingly being shown to be critical for a growing number of therapeutic assets – and understanding glycan content is central to interrogating batch-to-batch variability. Additionally, a common downstream purification chromatographic technique is ion-exchange, and this is known to influence glycan content – thus, a quick tool to monitor glycan activity is helpful in a downstream team.
Currently, most downstream teams analyze glycan content following enzymatic digestion with PNGase (which removes N-linked glycosylation) – the proteins are subsequently resolved on an SDS-PAGE gel and the mobility shift (faster migrating band upon glycan truncation) confirms the presence of a glycosylated species. This is the assay used as purification guide to quickly identify glycosylated species. However, the assay is not quantitative. Additionally, this is an indirect enzymatic measurement and assumes that PNGase will be active for all immunoglobulins.
It is easy to appreciate that an ideal glycosylation-staining method should selectively stain the sugar, without staining the protein on an SDS-PAGE gel. The greatest challenge in the field has been the inability to have a dye that selectively stains sugar – a “Coomassie for carbohydrates”. The primary reason for this is that amino acids (serine/threonine/tyrosine) have primary-hydroxyl groups just like sugars; thus, selectively staining sugar moieties in their native hydroxyl form is challenging.
We have developed a method (henceforth referred to as ‘GlycoIg-stain’) where we can selectively oxidize the primary-alcohol groups in sugars to their aldehyde form using periodic acid. Our method leaves the primary alcohol in the amino acids intact. The reason behind this selectivity is that we leverage the stereochemistry of the alcohols by selectively oxidizing the cis-diol in glycoproteins to aldehyde in situ on the SDS-PAGE gel. The oxidation process we use is selective and requires two proximal hydroxyl groups to be in a ‘cis’ position. The aldehyde is then detected by Schiff’s reaction and the glycosylated IgG appears as a magenta band on the SDS-PAGE gel (graphical abstract). IgGs which are not glycosylated will not be stained by this method. The SDS-PAGE gel can then be counterstained with Coomassie (per regular protocol) to stain both glycosylated and non-glycosylated IgG.
The mechanism for the ‘GlycoIg-stain’ is shown in Fig. 1A. The carbohydrates in the CH2 domain are selectively oxidized to aldehydes in situ on an SDS-PAGE gel. These aldehydes can then be visualized using Schiff reaction. A pictorial representation of the expected magenta bands for the glycosylated IgGs is depicted in Fig. 1B (the magenta bands disappear upon treatment with PNGase (enzyme that removes N-linked glycosylation).
Our protocol is rapid and semi-quantitative. It can detect both N-linked and O-linked glycosylation. Finally, the ability to stain a single SDS-PAGE gel with the ‘GlycoIg-stain’ followed by the Coomassie (may also serve as protein loading control) enables streamlining with existing protocol to enable facile adoption in any laboratory.
The immunoglobulins, their sizes, isoelectric point, cell lines where they were expressed along with their modes of expression and glycosylation-linkage is designated in Table 1. “G-IgG” refers to “glycosylated imynoglobulin”.
Table 1
Glycoproteins used in this study
Name | Size (kDa) | pI | Cell line | Transfection type | Glycosylation |
G-IgG1 | 133.95 | 6.3 | HEK/CHO | Transient/Stable | N-linked |
G-IgG2 | 101.82 | 6.2 | CHO | Stable | N-linked |
G-IgG3 | 111.28 | 8.7 | HEK/CHO | Transient/Stable | N-linked + O-linked |
G-IgG4 | 77.63 | 6.1 | CHO | Stable | N-linked |