Identification of potential B- and T-cell epitopes in the SARS-CoV-2 S protein structural region involved in infectivity:
The first step of our approach to designing a SARS-CoV-2 precision vaccine consisted of delimiting solvent-exposed sequences in the S protein S1/S2 subunit junction that potentially contain immunogenic epitopes. We extracted the structure of the SARS-CoV-2 spike protein monomer from the structural 3D model PDB ID: 6VXX using PyMOL software and visualized it using the Bepipred.2 server. We observed that the S1/S2 junction is mostly solvent exposed (Figure 1). Moreover, the server revealed potential B-cell epitopes in this structural region (Supplementary Table 1). This analysis also showed that furin and TMPRSS2 proteolytic motif sequences lie in two distant solvent-exposed loops (Figure 1) and are part of the predicted potential epitopes, as indicated by the estimated epitope probabilities (Supplementary Table 1). These observations suggest that this important region for virus infectivity can elicit an antibody response. We also submitted the SARS-CoV-2 S protein S1/S2 junction to the improved immunoproteasome cleavage-prediction server to search for putative T-cell epitopes. We identified fourteen immunogenic epitopes that the immunoproteasome of infected cells might generate upon processing of the SARS-CoV-2 S protein (Figure 2). To check whether these predicted epitope sequences are conserved across SAR-CoV-2 variants, we aligned them using BioEdit software 31. Thus, we ran the aligned sequences on the Epitope Conservancy Analysis tool of the freely accessible Immune-Epitope Database (IEDB). The percentage of protein sequence matches showed that these peptides are conserved among the twenty-four SARS-CoV-2 variants that circulated and/or are still circulating (Supplementary Table 2). These epitopes are not conserved in SARS-CoV or MERS-CoV (Figure 2). Peptides 3 and 4 contain residues of the FCS sequence (PRRAR), peptide 11 contains the TMPRSS2 cleavage motif, and the sequence of peptide 12 overlaps with the fusion peptide (FP) sequence (Supplementary Table 2).
For confirmation of the T-cell nature of these epitopes, we used the IFNepitope prediction server 29 to determine which epitopes potentially induce interferon g (IFN-g) . The results of this analysis identified peptides 3 and 11 as IFN-g inducing epitopes (Supplementary Table 3).
To further investigate the immunogenic epitopes at SARS-CoV-2 for presentation to the immune system by MHC molecules, we used the Net MHC pan, ProPred I, NetMHC II and ProPred II servers to run algorithms that perform epitope prediction with evaluation of antigenic peptide binding to defined MHC I and II molecules. This study showed that numerous predicted peptides from the SARS-CoV-2 S protein S1/S2 junction bind MHC class 1 and class 2 molecules of known sequence (Supplementary Tables 4, 5 and 6). Several of the predicted peptides contain or overlap amino acids from the FCS or the TMPRSS2 proteolytic site or lie near these sites. Some of these peptides were predicted to have strong binding affinity for specific MHC alleles, while others were predicted to have weak binding affinity (Supplementary Tables 4, 5, and 6). This finding implies that the predicted epitopes can be presented to the immune system through numerous common MHC 1 and class 2 molecules.
Taken together, the prediction data show that the SARS-CoV-2 S protein S1/S2 junction contains epitopes that can potentially elicit both T- and B-cell immune responses, including epitopes that overlap with the furin and TMPRSS2 proteolytic sites.
Design of synthetic polypeptides and study of antibody response to the S protein sequences associated with infectivity in patients with COVID-19.
To check if the predicted epitopes overlapping with the sequence involved in SARS-CoV-2 infectivity generate an immune response in patients with COVID-19, we used two synthetic peptides, P3/FUR/x3 and P4/TMP/x3. We designed these peptides by triplicating a sixteen-residue core sequence (Q662-I679) that spans the FCS in P3/FUR/x3 and a nine-residue core sequence (P794-R802) that spans the TMPRSS2 motif in P4/TMP/x3. In each polypeptide, we introduced the five-residue-long flexible sequence GGGGS as a molecular linker between each repeated core sequence (Table 2). This linker brings stability and maintains distance between epitopes for stable and correct folding of the synthetized polypeptide 30. We used these polypeptides to investigate the specific antibody response in patients with COVID-19.
To study this antibody response, we developed an indirect enzyme-linked immunosorbent assay (ELISA) using the P3/FUR/x3 and P4/TMP/x3 polypeptides as antigens and examined the cohort of 500 patients with COVID as described in Table 1, as well as 100 negative control serum specimens collected before the pandemic. The data showed that none of the control sera reacted with P3/FUR/x3 (Figure 3-A1) and P4/TMP/x3 (Figure 3-A3), while 80% and 86% of the COVID-19 patients’ sera specifically recognized P3/FUR/x3 (Figure 3-A2) and P4/TMP/x3 (Figure 3-A4), respectively. We detected this specific antibody response in numerous patients’ sera up to 50 weeks after infection. These data show that the S protein S1/S2 junction involved in SARS-CoV-2 infectivity elicits a natural robust and persistent antibody response.
To investigate the nature of this antibody response, we analysed the ELISA data in the context of disease status. We observed a highly significant difference (p value = 0.0001) between patients’ antibody response to P3/FUR/x3 based on their disease status. All patients with the asymptomatic or mild form of the disease developed antibodies to epitopes in or around the furin cleavage site sequence (Figure 3B). Therefore, the antibody response to the S protein epitopes covering the furin proteolytic cleavage site in patients with COVID-19 significantly correlates with protective immunity to SARS-CoV-2.
Furthermore, the antibody response we observed did not cause antibody-dependent enhancement (ADE) of infection 31,32. Indeed, the patients who developed this type of immunity experienced rapid and full recovery, and no COVID-19 was recorded among them ten months after this study.
Engineering and production of recombinant multiepitope vaccinating protein subunit candidates:
Based on the data from the patient-centred study that showed strong and persistent natural immunity against the epitopes of the S protein covering the furin and TMPRSS2 cleavage sites, we developed two protein subunits, P3-L and SJ/FT, that encompassed the epitopes identified from the S protein S1/S2 junction. P3-L is 107 amino acids long, 11.1 kDa protein engineered by sequentially adding 3 core sequences of 32 amino acids that comprise the furin proteolytic site separated by a GGGGS molecular linker (Figure 4A). To design the SJ/FT protein subunit, we selected a sequence of 230 amino acids (24.8 kDa) from the S1/S2 junction that comprises both the furin and TMPRSS2 proteolytic motifs and includes the epitopes predicted earlier by computational analysis.
We selected these two subunits based on their predicted good antigenicity, nonallergenicity and stability. Indeed, we used the VaxiJen server to predict the immunogenicity of S1/S2 junction-derived antigens P3/FUR/x3, P4/TMP/x3, P3-L and SJ/FT. The predicted immunogenicity data suggest that these polypeptides and protein subunits are immunogenic (Supplementary Table 6). In addition, the prediction of allergenicity using the AlgPred and AllerTOP computational tools showed that P3/FUR/x3, P4/TMP/x3 and subunits P3-L and SJ/FT are nonallergenic (Table 3), which makes them suitable for vaccination. Furthermore, we determined the physicochemical properties of P3-L and SJ/FT using the ProtParam tool of the EXPASY database server. The relatively low predicted instability index, good solubility values, and estimated half-life (Supplementary Table 7) suggest that the physicochemical properties of the engineered antigens are suitable for production as recombinant vaccine subunits. We produced P3-L and SJ/FT as recombinant proteins in E. coli (Figure 4B) and purified them as endotoxin-free proteins for use in immunization studies.
Antibody and cellular immune responses to engineered vaccinating antigen candidates of patients with COVID-19.
To further investigate the vaccinating potential of the engineered recombinant subunits, we analysed patients with COVID-19 antibodies and T-cell responses to these antigens. ELISA experiments using P3-L and SJ/FT protein subunits as antigens and the serum of the 500 patients in the COVID-19 cohort showed specific antibody responses in 84.7% and 97%, respectively (Figure 5A). To investigate the T-cell response, we studied IFN-g production upon stimulation of the patients’ immune cells using P3-L and SJ/FT. We used PBMCs from COVID-19 patients with mild disease (MD) or severe disease (SD) and healthy controls (HCs) as defined by the WHO33 to perform ELISpot experiments. The results showed that the number of cells producing IFN-g in response to stimulation with the P3-L and SJ/FT proteins, compared to unstimulated cells (NS), was highest for patients with MD (p<0.001) (Figure 5B). These observations show that the P3-L and SJ/FT protein subunits contain epitopes that generate antibody and T-cell responses in patients with COVID-19.
Modelling of the engineered candidate vaccinating antigen structures
To further assess the vaccinating potential of the P3-L and SJ/FT protein subunits and the P3/FUR/x3 polypeptide, we carried out a computational study to check whether these antigens exhibit a conserved or mimicked native 3D structure of the SARS-CoV-2 S protein S1/S2 junction. We constructed 3D models of their structures and compared them with the native 3D structure of the S protein monomer. The predicted structural models show an alpha coil/helix fold that is consistent with the structure of immunogenic epitopes and solvent-accessible protein sequences. A repetitive 3-fold display of the alpha coil/helix fold is observed in the P3/FUR/x3 peptide and protein subunit P3-L (Figure 6A), which shows that the unit peptide sequence containing the PRRAR epitope folds independent of the GGGGS linker. Superimposition of the SJ/FT and S protein 3D models (Figure 6B) shows that the overall conformation of the sequence containing the immunogenic epitopes is very well conserved with an observed RMSD of 0.912 angstrom and 171 amino acids out of 230 superimposing perfectly on the native S protein corresponding sequence. Taken together, the data indicate that antigen-presenting cells process the SARS-CoV-2 native S protein and P3-L, SJ/FT and P3/FUR/x 3 in the same way. This observation supports their vaccinating potential.
BALB/c mouse IgG response to synthetic polypeptides P3/FUR/x3 and P4/TMP/x3 and protein subunits P3-L and SJ/FT:
To assess the vaccination potential of the engineered antigens in vivo, we immunized BALB/c mice with different concentrations of P3-L, SJ/FT, P3/FUR/x3 and P4/TMP/x3 ranging from 25 µg to 200 mg. We also used various adjuvants alone or in combination to determine the best adjuvant to use in vaccination. We immunized different groups of 5 mice each by administering 2 doses of the antigens within an interval of two weeks (Supplementary Table 8). We collected sera from week 4 to week 14 following immunization. The results of ELISA performed with the sera collected following immunization of mice with 200 µg of the polypeptide P3/FUR/x3 showed the highest antibody response in the group of mice administered the CpG TLR9 ligand adjuvant (Figure 7A). In mice immunized using the P3-L and SJ/FT protein subunits, we observed the highest IgG titres with a combination of CpG and either one of the adjuvants alhydrogel or MPLA (Figure 7B). P4/TMP/x3 did not induce a significant IgG response repeatedly.
Through the development of this new approach for designing precision viral vaccines, we used computational tools to generate a data set that identified SARS-CoV-2 immunogenic epitopes in the virus structural region associated with infectivity. We experimentally linked these epitopes to protective immunity in patients with COVID-19. Based on these findings, we engineered vaccinating subunit antigens that we thoroughly tested in vivo by immunizing mice. Through this vaccination study, we determined the best antigen concentration and adjuvants that elicited a robust antibody response.