Prediction and selection of epitopes
44 CTL, 158HTL, and 36 LBL epitopes were obtained. These epitopes were further screened through numerous immune filters. For example, the epitopes used to construct vaccines should be nonallergenic, nontoxic should have high antigenicity and high affinity with HLA alleles. To make the multiepitope vaccine have higher antigenicity, only epitopes with high antigenicity scores are used to construct the vaccine. Finally, 7 CTL, 6 HTL, and 8 LBL epitopes were obtained to construct the vaccine. Complete epitope screening data can be found in the supplemental materials (Table S1, S2 and S3). Table 1 here, Table 2 here, Table 3 here.
Table 1
Final CTL epitopes selected for vaccine construction
Protein
|
Supertype
|
Start Position
|
Epitope Sequence
|
Score
|
Immunogenicity
|
Antigenicity
|
Allergic
|
Toxicity
|
Similarity with human protein
|
OmpA
|
A1
|
269
|
HTDNTGPRK
|
1.046
|
0.09127
|
1.6471
|
-
|
-
|
0
|
B7
|
18
|
AANAGVTV
|
0.9721
|
0.13371
|
1.1461
|
-
|
-
|
Omp33-36
|
A2
|
16
|
GAHAYQFEV
|
1.0009
|
0.09453
|
1.7241
|
-
|
-
|
0
|
100
|
YVPTPYLPV
|
0.9688
|
0.00534
|
1.5876
|
-
|
-
|
0
|
140
|
AMLLPNFLM
|
0.8631
|
0.06067
|
1.7425
|
-
|
-
|
0
|
OmpW
|
A1
|
308
|
FTAGFTYDF
|
1.5173
|
0.20718
|
1.267
|
-
|
-
|
0
|
A2
|
160
|
FIGGIPPKV
|
1.1644
|
0.0289
|
1.4031
|
-
|
-
|
0
|
Table 2
Final HTL epitopes selected for vaccine construction
Protein
|
MHC Allele
|
Start Position
|
Epitope Sequence
|
Antigenicity
|
Allergic
|
Toxicity
|
IFN-γpred
|
IL-4pred
|
Similarity with human protein
|
OmpA
|
DRB1_0401
|
130
|
YKYDFDGVNRGTRGT
|
1.3463
|
-
|
-
|
+
|
+
|
0
|
DRB3_0101
|
DRB1_1501
|
66
|
GIELTPWLGFEAEYN
|
0.9127
|
-
|
-
|
+
|
+
|
0
|
Omp33-36
|
DRB3_0202
|
233
|
VGATFVGNDGEADIK
|
0.9694
|
-
|
-
|
+
|
+
|
0
|
DRB3_0202
|
235
|
ATFVGNDGEADIKGN
|
1.0696
|
-
|
-
|
+
|
+
|
0
|
OmpW
|
DRB1_0401
|
114
|
KPEVAGEATIQGLEQ
|
0.7934
|
-
|
-
|
+
|
+
|
0
|
DRB1_0404
|
DRB1_1501
|
234
|
KSGVNKFRPYLGVGL
|
0.7925
|
-
|
-
|
+
|
+
|
0
|
Table 3
Final LBL epitopes selected for vaccine construction
Protein
|
Start Position
|
Epitope Sequence
|
Score
|
Antigenicity
|
Allergic
|
Toxicity
|
Similarity with human protein
|
OmpA
|
142
|
RGTSEEGTLGNAGVGA
|
0.94
|
1.8663
|
-
|
-
|
0
|
83
|
KGDVDGASAGAEYKQK
|
0.86
|
1.9423
|
-
|
-
|
0
|
42
|
NGGKDGNLTNAPELQD
|
0.82
|
1.5997
|
-
|
-
|
0
|
Omp33-36
|
259
|
YPNATARIEGHTDNTG
|
0.8
|
1.1391
|
-
|
-
|
0
|
110
|
ASATYNHTDVDGKNNF
|
0.89
|
1.1803
|
-
|
-
|
0
|
116
|
HTDVDGKNNFSKDDNG
|
0.88
|
1.6333
|
-
|
-
|
0
|
87
|
NYHIGTYGVKGEAYVP
|
0.83
|
1.0537
|
-
|
-
|
0
|
OmpW
|
278
|
DGKAGAALDRKESSGN
|
0.87
|
1.8311
|
-
|
-
|
0
|
234
|
KSGVNKFRPYLGVGL
|
0.84
|
0.7925
|
-
|
-
|
0
|
Multiepitope vaccine construction
The final multiepitope vaccines are composed of four parts: LT-IIb, CTL epitopes, HTL epitopes, and LBL epitopes. Through the analysis of the ProtParam server, vaccine 1 consisted of 372 amino acids S, and its molecular weight was 38707.91 g/mol. the theoretical pI of vaccine I was 7.81. Vaccine 2 contained 318 amino acids, its molecular weight was 33206.34 g/mol, and the theoretical pI was 9.41. These results describe the basic properties of vaccine constructors. Vaccine 1’s instability index was 13.09, while that of vaccine 2 was 21.69, which suggests that both vaccines are stable. The aliphatic indices of vaccine 1 and vaccine 2 were 53.82 and 58.33, respectively, which revealed that both vaccines were thermostable. The GRAVY (GR and AVerage of hydropathy) index of vaccine 1 and vaccine 2 were -0.692 and -0.612, respectively, which demonstrated the vaccines’ hydrophilic nature. Both vaccines’ half-life in vitro of mammalian reticulocytes cytes is 30 hours while in vivo of yeast is more than 20 hours and 10 hours in vivo of E. coli. The antigenicity of vaccine 1 was 1.0738 and 0.946426, respectively, as predicted by Vaxijen2.0 and AITIGENpro. The antigenicity of vaccine 2 predicted by Vaxijen2.0 and ANTIGENpro was 0.9932 and 0.942819, respectively. The SQLpro predicts that the dissolution probability of vaccine 1 and vaccine 2 is 0.906464 and 0.892241, respectively, when overexpressed in E. coli. Furthermore, the AllerTOP 2.0 and Algpred servers predicted that neither vaccine was allergic.
Secondary structure prediction
The Prabi server reported secondary structure prediction, in which vaccine 1 contained 21.51% alpha-helices, 18.01% extended strands, and 60.48% random coils. For vaccine 2, the secondary structure included 21.38% alpha-helices, 19.18% extended strands, and 59.43% random coils.
Modeling and validation of vaccine
First, the RaptorX server was used to predict a primary 3D structure for the vaccine sequence. Coarse models were updated to the Galaxy server for refinement. GDT-HA, MolProbity, and RMSD plots were used to assess the models. The best models are exhibited in the figures. Vaccine 1 gets an ERRAT score of 80.000. The ERRAT score of the vaccine2 is 90.400. The Ramachandran plot analysis revealed that 83.2% of residues lied in the most favorable areas, and 14.0% of residues lied in additional permitted areas. Moreover, the Ramachandran plot of vaccine 1 showed that. For vaccine 2, 81.5% of residues were in the most favorable areas, and 16.0% of residues were in gerenally permitted areas. The Z scores of vaccines 1 and 2 were -4.93 and -4.62, respectively. Figure 2 here, Figure 3 here.
Immune Simulations
Immune stimulation was programmed using the C-IMMSIM server. The results of the host immune system's response to the designed vaccines 1 and 2 are shown in Figure 3. The primary, secondary and final immune responses have an important contribution to vaccine immunity. In particular, high antibody titers were observed, first with high titers IgG and IgM, followed by IgG1 and IgG2. The antibody titer induced by vaccine 1 was higher than that induced by vaccine 2. Furthermore, in the results of two vaccine simulations, B cells were stimulated and proliferated, which resulted in memory cell development. Meanwhile, the upregulation of the population of cytotoxic and helper T cells indicates that the immune response develops after vaccine injection. At the same time, high IFN-γ, IL-2, and IL-10 levels were induced by the designed vaccine. These results indicated that the host produced an effective immune response to the vaccine through immune stimulation. The immune cell population stimulated can be found in Figure S2. Figure 4 here.
Molecular docking
To assess the binding ability of the designed vaccine and antigenic receptors, we used HADDOCK server 2.4 to stimulate docking progress. The A chains of TLR2 and TLR4 were used for molecular docking. In the results of TLR2 and vaccine 1 docking, the best HADDOCK score of the cluster was -148.5 +/- 1.4. Similarly, the results of TLR2 and vaccine 2 docking revealed that the HADDOCK score of the cluster was -146.5 +/- 1.9. In the condition of vaccine 1 and TLR4 docking, the highest score of the structure was -222.1 +/- 3.8. For vaccine2 and TLR4, the great HADDOCK score of the cluster was -163.2 +/- 3.1.
Moreover, the α chains of MHCI and MHC II molecules were used for molecular docking with vaccine constructors. For vaccine 1 and the MHCI molecule, the best structure had a HADDOCK score of -193.9 +/- 5.3. Similarly, the vaccine2 and MHCI molecular results indicated that the best structure had a HADDOCK score of -137.4 +/- 1.6. For the vaccine1 and MHC II molecular, the best structure had a HADDOCK score of -172.2 +/- 1.5 and a while for vaccine2 and MHC II molecular, the best structure had a HADDOCK score of -184.2 +/- 1.6 and a. BCR is also tested for docking with vaccines, for vaccine 1 and BCR, the best structure had a HADDOCK score of -217.3 +/- 1.4 and a z score of -2.2. The results of vaccine 2 and BCR molecules indicated that the best structure had a HADDOCK score of -149.6 +/- 3.6.
The models with better scores were selected, and these complexes were further refined by HADDOCK. Then, the server generated 20 refined models into one cluster, which symbolized 100% water-refined models. The data of the interactions between the designed vaccines and receptors from their refined structures are shown in the figure, and complexes are shown in Table S4 and S5.
Docking analysis
Then, the vaccine-receptor complexes were submitted to PDBsum to analyze the interaction interface. In the results, 4 salt bridges and 15 hydrogen bonds were revealed between vaccine 1 and TLR2, while the same data for vaccine 2 were 4 and 15. For TLR4, vaccine 1 seemed to establish 3 salt bridges and 23 hydrogen bonds, while vaccine 2 formed 4 and 13. The number of salt bridges and hydrogen bonds built between vaccine 1 and HLAA1101 was 6 and 16 hydrogen bonds, which were 3 and 14 for vaccine 2, respectively. Two salt bridges and 11 hydrogen bonds seemed to be built between vaccine 1 and HLADRB0401, while vaccine 2 formed 4 and 10 bonds. Finally, between BCR and vaccine 1, there were 3 salt bridges and 16 hydrogen bonds, while between BCR and vaccine 2, there were 4 and 13.
We further analyzed interaction residues between vaccine 1 and TLR2, which were 33 and 30, covering 1327 Ų and 1356 Ų, respectively. Similarly, the interaction residues between vaccine 1 and TLR4 were 27 and 28, covering 1593 Ų and 1629 Ų, the interaction residues between vaccine 1 and HLA1101 were 30 and 25, covering 1656 Ų and 1579 Ų, and the interaction residues between vaccine 1 and HLADRB0401 were 28 and 26, covering 1458 Ų and 1519 Ų, respectively. The interaction residues between vaccine1 and BCR were 27 and 35, covering 1790 Ų and 1630 Ų, respectively.
In addition, the interaction residues for vaccine 2 and TLR2 were 33 and 30, covering 1327 Ų and 1356 Ų, respectively. Similarly, vaccine 2 and TLR4 each had 19 and 24 residues to interact with, covering 1106 Ų and 1006 Ų areas, respectively. Finally, the interacting residues between vaccine 2 and HLA-1101 were 16 and 21, covering 1276 Ų and 1166 Ų, and the interacting residues between vaccine 2 and MHCII were 24 and 27, covering 1419 Ų and 1372 Ų, respectively. while the interaction residues between vaccine2 and BCR were 20 and 21, covering 1226 Ų and 1162 Ų. The docked complex between vaccines and HLA-1101 and TLR2 is exhibited below in Figure 5, and the remaining receptors and vaccine complex can be found in Figure S3. Figure 5 here
Population Coverage Analysis
MHC allele distribution varies with geographic region and ethnicity. Therefore, population coverage analysis needs to be considered when an effective vaccine is designed. The screened CTL and HTL epitopes were predicted for their coverage in the global population; the global population coverage for Vaccine 1 was 84.38%, while that of Vaccine 2 was 92.28%. The results of population coverage analysis could be found in Figure S4.
Molecular Dynamic simulation
To test the stability of vaccine-receptor complexes by molecular dynamics simulation, the iMods server was chosen. This server showed the collective motion of biological macromolecules through normal mode analysis (NMA) in internal (dihedral) coordinates. The vaccine1-HLA-A11:01 complex had an elgenvalue of 2.334594e-5 (figure 6b). The deformability of the omposite was determined by the deformation of a single residue, and the result of the vaccine1-HLA-A11:01 complex is shown in Figure 6b. The value of factor B was proportional to RMS normal mode analysis (figure 6a). The covariance matrix represented the coupling between residue pairs, and different colors indicated that different residue pairs experienced correlated (red), uncorrelated (white) or anti-correlated (blue) movements (figure 6d). The elastic network model showed which atoms were connected by springs, as shown in figure 6e. Each dot represents a spring, and the stiffness of the spring is proportional to the color depth of the dot. The gray area in the figure indicates that the connection between atoms was relativelystable. Other results are shown in Supplementary materials Figure S5. Figure 6 here
In silico cloning
To fuse the vaccine with the vector and ensure maximal protein expression in E. coli (strain K12), we used the Java Codon Adaptation Tool (JCat) to optimize the concentration of vaccine protein. Codon-optimized sequence 1 was 1000 nucleotides with a CAI value of 0.990 and a GC content of 0.51. Codon-optimized sequence 2 had 1065 nucleotides, with a 0.988 CAI value and a GC content of 0.51. Two groups of data indicated that the proteins could be expressed well in E. coli. Then, the BamHI and XhoI restriction sites were added to the codon sequence, BamHI to the 5’ end and EcoRI to the 3’ end. Finally, the whole sequence inside the Pet-28a (+) plasmid was cloned to ensure the designed expression in E. coli. The nucleotide sequences of vaccines are shown in figure S6. Figure 7 here