T cell epitopes prediction
The antigenicity screening of the retrieved NiV protein sequences showed a score greater than the threshold value of 0.4 for glycoprotein G (0.5095), fusion protein F (0.5395), nucleoprotein N (0.5594), and phosphoprotein P (0.5767), suggesting probable antigenicity, while the antigenicity score of the matrix protein M (0.3969) was indicative of non-antigenic nature of the protein, therefore it was excluded from further analysis. The 4 antigenic proteins were then submitted to the NetCTL server for CTL epitope prediction, resulting in 12 potential epitopes from protein G, out of which 11 were nontoxic and showed a positive immunogenicity score, all were included in the final vaccine construct. For protein F, 21 CTL potential epitopes were predicted, of which 4 were nontoxic, of these, 2 with a positive immunogenicity score were included in the vaccine construct. Eight CTL potential epitopes were predicted from the nucleoprotein N, all were nontoxic, and the 2 included in the final vaccine construct were the only ones with a positive immunogenicity score. For phosphoprotein P, 10 CTL potential epitopes were predicted, 5 of which were nontoxic and only 3 epitopes were selected for the final vaccine construct for their positive immunogenicity score. All the ultimately selected epitopes for inclusion in the vaccine construct showed antigenicity scores greater than the threshold value (Table 1).
Table 1: Characteristics of predicted CTL epitopes
Protein
|
Peptide
|
Prediction score
|
VaxJen
Score
|
SVM score
|
Toxicity
|
Immunogenicity score
|
G
|
STDNQAVIK
|
1.4497
|
0.4021
|
-1.07
|
Non-toxin
|
0.03732
|
|
STASINENV
|
0.9814
|
0.5714
|
-1.01
|
Non-toxin
|
0.05046
|
|
AMDEGYFAY
|
3.007
|
0.6169
|
-0.83
|
Non-toxin
|
0.25913
|
|
CSAVYNNEF
|
1.8928
|
0.4829
|
-0.85
|
Non-toxin
|
0.09759
|
|
SAVYNNEFY
|
1.4353
|
0.5888
|
-1.23
|
Non-toxin
|
0.17688
|
|
AVYNNEFYY
|
0.7584
|
0.6013
|
-1.21
|
Non-toxin
|
0.17688
|
|
DSLGQPVFY
|
1.0398
|
0.621
|
-1.31
|
Non-toxin
|
0.0105
|
F
|
ECSVGILHY
|
0.9645
|
0.4284
|
-0.44
|
Non-toxin
|
0.15566
|
|
QITAGVALY
|
1.2334
|
0.4052
|
-1.22
|
Non-toxin
|
0.15037
|
N
|
QTKFAPGGY
|
1.3023
|
1.0211
|
-0.99
|
Non-toxin
|
0.12386
|
|
SSDQVAELA
|
1.2561
|
0.6207
|
-1.09
|
Non-toxin
|
0.04569
|
P
|
QLDPVVTDV
|
0.8144
|
0.6201
|
-1.24
|
Non-toxin
|
0.12082
|
|
ETDLVHLEN
|
0.9896
|
0.5078
|
-0.82
|
Non-toxin
|
0.11583
|
|
YHADHLGDY
|
0.763
|
0.4871
|
-0.61
|
Non-toxin
|
0.09764
|
Percentile rank less than 10 was used as the cut-off value for the selection of predicted HTL epitopes which resulted in 36 peptides from protein G, one of which was selected for inclusion in the final construct, despite 5 were non-allergenic, 3 were nontoxic and 2 were antigenic, but only one showed IFN- inducibility. For protein F, none of the 18 predicted HTL epitopes showed IFN- inducibility, while 2 epitopes were selected from the 20 predicted HTL epitopes from nucleoprotein N, and one epitope from protein P, that fulfilled the selection criteria (Table 2).
Table 2: Characteristics of predicted HTL epitopes
Protein
|
Peptide
|
Percentile rank
|
Toxicity
|
Antigenicity score
|
IFN-gamma
|
G
|
LGQPVFYQASFSWDT
|
9.8
|
Non-toxin
|
0.8008
|
Positive
|
N
|
GKTPFVDSRAYGLRI
|
1.4
|
Non-toxin
|
0.4445
|
Positive
|
|
QSDLNTIKSLMLLYR
|
0.64
|
Non-toxin
|
0.5368
|
Positive
|
P
|
VIKTLIRTHIKDREL
|
9.5
|
Non-toxin
|
0.6214
|
Positive
|
Population coverage
The analysis of world population coverage of the selected epitopes for MHC-I and MHC-II alleles. The coverage of HLA class I restricted epitopes was 98.59% of the world population, and the coverage of HLA class II restricted epitopes was 69.81% of the world population, while the coverage of the combined HLA class I and class II was 99.57% of the world population, this suggests a remarkably high population coverage of the given multisubunit vaccine (Fig. 1).
B cell epitope prediction
The B cell epitopes derive their importance from being key players in the adaptive humoral immune response. Upon their recognition by B lymphocytes, they trigger antibody production which is an important defense process against viral infection. Three methods were employed to ensure proper epitopes prediction, epitopes predicted by the Bepipred prediction method were assessed for surface accessibility, flexibility, and antigenicity by Emini and Kolaskar & Tongaonkar methods, respectively. Only epitopes predicted by the three methods were included in the final vaccine construct. Ultimately, 7 epitopes of variable sequence length were predicted by the three methods, one from proteins G, F, M, and P, each, and three from protein N. All predicted epitopes induce IgG antibodies.
Multi-subunit vaccine construction
The final vaccine construct comprised of 7 B cell epitopes, 14 CTL epitopes, 4 HTL epitopes, 2 linkers, EPEA tag at the C-terminal, and a cysteine residue, resulting in a sequence of 501 amino acids:
CAAYVRFENTTSDKGKIPSKVIKSYYGTMDIKKINEGLLDAAYNTYSRLEDRRVRPTSSGDLYYAAYIKSISSESMEGVSDFSPSSWEHGGYLDKVEPEIDENGAAYKTARDSSKGKTPFVDSRAYGLRITDMAAYDTAEESETRRWAKYVQQKRVNPFFALTQQWLTEAAYAAKEAASSNATDDPAISNRTQGESEKKNNQDLKPAQNDLDFVAAYGGECTGYGFTSSPERGWSDYTSGANNGAAYSTDNQAVIKAAYSTASINENVAAYAMDEGYFAYAAYCSAVYNNEFAAYSAVYNNEFYAAYAVYNNEFYYAAYDSLGQPVFYAAYECSVGILHYAAYQITAGVALYAAYQTKFAPGGYAAYSSDQVAELAAAYQLDPVVTDVAAYETDLVHLENAAYYHADHLGDYGPGPGLGQPVFYQASFSWDTGPGPGVIKTLIRTHIKDRELGPGPGGKTPFVDSRAYGLRIGPGPGQSDLNTIKSLMLLYRGPGPGEPEA
Physiochemical properties of the vaccine construct
The properties of the novel vaccine assessed by the ProtParam server showed a molecular weight of 54.6 KDa and a theoretical isoelectric point (PI) of 4.78, suggesting an acidic nature, a total of 61 negatively charged residues and 41 positively charged residues, which make it a readily soluble protein in blood pH. The estimated half-life is 1.2 hours in mammalian reticulocytes in vitro, >20 hours in yeast in vivo, and >10 hours in E. coli in vivo. The instability index was computed to be 38.30, which classifies the protein as a stable molecule. The vaccine construct also showed an aliphatic index of 62.89, indicating a thermostable protein. The grand average of hydropathicity (GRAVY) was -0.476, a negative value means a hydrophilic molecule.
Tertiary structure modeling and analysis
The results of secondary structure prediction by PSIPRED sever showed predominant helix and coil structures (Fig 2), homology modeling was then carried out using the SWISS Model tool to predict the tertiary structure of the multi-subunit construct which resulted in 29 modeling templates. The top three templates with the best sequence coverage and larger identity 62.14% each, were selected for building the 3D models of the protein. The quality of these models was then assessed and validated using Ramachandran plot, ERRAT tool to calculate the overall quality factor, VERIFY3D tool, and ProSA-web z-score based on x-ray crystallography and NMR analysis. The best-predicted model showed 93.1% of the residues fall within the favorable region in the Ramachandran plot (Fig 3), and overall quality factor of 90.426% calculated by blotting the statistics of non-bonded interactions between different atom types and the error function value, against a position of a 9-residue sliding window, compared with statistics from a highly refined structure. The value is less than the 91% threshold for an average quality (Fig 4). The model also showed a Z-score of -3.84 determined by both x-ray crystallography and NMR analysis (Fig 5)
Molecular docking and dynamics analysis
The docking of the novel vaccine construct with Toll-like receptor 3 (PDB ID: 1ziw) was performed with the PatchDock server, the generated docking solutions were sorted based on the geometric shape complementarity score, the approximate interface area of the complex, and the atomic contact energy. The top 10 protein-receptor docking solutions were submitted to the FireDock server for interaction refinement. The selected docking solution had a globale energy of -1.58 Kcal/mol, attractive van der Waals force of -9.13 Kcal/mol, repulsive van der Waals force of 1.83, Kcal/mol, and atomic contact energy of 2.98 Kcal/mol (Fig.6). The molecular dynamics simulation results showed a root mean square deviation (RMSD) of 0.65nm and the radius of gyration value with a relatively minor fluctuation throughout the 1000 ps period of simulation (Fig. 7,8).
Immune response simulation
Assessing the immune response to the designed vaccine is an important step towards ensuring vaccine efficiency. The simulation results showed increased and sustained levels of memory and active B cell population. A similar result was shown by T cell population, the memory and active T helper cells maintained a high level for the entire simulation period. The immunoglobulins level, on the other hand, peaked by the second week of simulation and then regressed to relatively acceptable levels nonetheless. The concentration of cytokines and interleukins showed for more than half the simulation period followed by a gradual decline (Fig. 9), suggesting sustained humoral and cell-mediated immune response.
In silico molecular cloning
The Java codon adaptation tool was used to optimize the novel vaccine sequence prior to the insertion of the vaccine DNA sequence into the pET28 (+) plasmid of E. coli (strain K12). The GC content of the vaccine DNA sequence was 51.8 % suggesting good expression of the vaccine in the E. coli host (Fig. 10):
TGCGCTGCTTACGTTCGTTTCGAAAACACCACCTCTGACAAAGGTAAAATCCCGTCTAAAGTTATCAAATCTTACTACGGTACCATGGACATCAAAAAAATCAACGAAGGTCTGCTGGACGCTGCTTACAACACCTACTCTCGTCTGGAAGACCGTCGTGTTCGTCCGACCTCTTCTGGTGACCTGTACTACGCTGCTTACATCAAATCTATCTCTTCTGAATCTATGGAAGGTGTTTCTGACTTCTCTCCGTCTTCTTGGGAACACGGTGGTTACCTGGACAAAGTTGAACCGGAAATCGACGAAAACGGTGCTGCTTACAAAACCGCTCGTGACTCTTCTAAAGGTAAAACCCCGTTCGTTGACTCTCGTGCTTACGGTCTGCGTATCACCGACATGGCTGCTTACGACACCGCTGAAGAATCTGAAACCCGTCGTTGGGCTAAATACGTTCAGCAGAAACGTGTTAACCCGTTCTTCGCTCTGACCCAGCAGTGGCTGACCGAAGCTGCTTACGCTGCTAAAGAAGCTGCTTCTTCTAACGCTACCGACGACCCGGCTATCTCTAACCGTACCCAGGGTGAATCTGAAAAAAAAAACAACCAGGACCTGAAACCGGCTCAGAACGACCTGGACTTCGTTGCTGCTTACGGTGGTGAATGCACCGGTTACGGTTTCACCTCTTCTCCGGAACGTGGTTGGTCTGACTACACCTCTGGTGCTAACAACGGTGCTGCTTACTCTACCGACAACCAGGCTGTTATCAAAGCTGCTTACTCTACCGCTTCTATCAACGAAAACGTTGCTGCTTACGCTATGGACGAAGGTTACTTCGCTTACGCTGCTTACTGTTCGGCTGTGTACAACAACGAATTCGCTGCTTACTCTGCTGTTTACAACAACGAATTCTACGCTGCTTACGCTGTTTACAACAACGAATTCTACTACGCTGCTTACGACTCTCTCGGTCAGCCGGTATTCTACGCGGCTTACGAATGCTCTGTTGGTATCCTGCACTACGCTGCTTACCAGATCACCGCTGGTGTTGCTCTGTACGCTGCTTACCAGACCAAATTCGCTCCGGGTGGTTACGCTGCTTACTCTTCTGACCAGGTTGCTGAACTGGCTGCTGCTTACCAGCTGGACCCGGTTGTTACCGACGTTGCTGCTTACGAAACCGACCTGGTTCACCTGGAAAACGCTGCTTACTACCACGCTGACCACCTGGGTGACTACGGTCCGGGTCCGGGTCTGGGCCAGCCGGTATTCTACCAGGCAAGCTTCTCTTGGGACACTGGTCCGGGTCCGGGTGTTATCAAAACCCTGATCCGTACCCACATCAAAGACCGTGAACTGGGTCCGGGTCCGGGTGGTAAAACCCCGTTCGTTGACTCTCGTGCTTACGGTCTGCGTATCGGTCCGGGTCCGGGTCAGTCTGACCTGAACACCATCAAATCTCTGATGCTGCTGTACCGTGGTCCGGGTCCGGGTGAACCGGAAGCT