Construction and biological characteristic of influenza B virus-based vaccine candidates with H1N1 hemagglutinin
We constructed a series of chimeric A/B HA expressing the HA ectodomain of various seasonal influenza H1N1 viruses. We then harbored the chimeric H1N1 HA molecules with the influenza B HAs’ TM and CT domains and segment-specific packaging signals presented in the 3′- and 5′-terminal regions of influenza B virus as detailed in Fig. 1a. Cloning the chimeras into the HA plasmid of B/Yamagata/16/1988 using the In-Fusion technology. Chimeric H1-HA plasmids were transfected jointly with seven other genomic segments from B/Yamagata/16/1988 (with a truncated NS1) into 293T cells.
By applying plasmid-based reverse genetics methods, we have generated the recombinant viruses with the chimeric A/Hawaii/70/2019(H1N1)-HA and the chimeric A/Victoria/2570/2019(H1N1)-HA. Notably, this chimeric vaccine candidate could grow to 103.75 TCID50/mL and 105.25EID50/mL. We also produced a chimeric A/California/07/2009 CVV and named it rA/B-California/07/2009 (R_07). Additionally, the nucleotide sequences of rescued CVVs were confirmed by sanger sequencing, and the chimeric candidates' virus particles were identified using a Transmission electron microscope (TEM).
Adaptive mutations on H1N1 influenza viruses near the receptor site may influence the growth of virus as previously described[22]. To determine the viral growth capability of the chimeric CVV, MDCK cells were infected with the virus with a multiplicities of infection(MOI)of 0.01(Fig. 1c). The growth curve indicates that the replication capability of rA/B-California/07/2009 was significantly lower than wild type A or wild type B viruses.
Alternation Of The 212 Site Between Two Chimeric Cvvs
We present the capacity of producing chimeric A/California/07/2009 CVV. The chimeric A/California/04/2009 CVV, named rA/B-California /04/2009 (R_04), could hardly be generated. We deemed two chimeras, R_07 HA protein and R_04 HA protein, similar. However, the production of two chimeric CVVs strikingly diverged. We then conducted pair-wise alignment of two chimeras to expound the divergence. By aligning the sequence of R_07 HA protein and R_04 HA protein in pairs, a A212T substitution was observed (Fig. 2).
To confirm the A212T substitution, we designed a rescue scheme for an alternation at site 212 between two chimeric CVVs to verify the significance of a generated H1N1 chimeric CVV of site 212. Mutant virus of rA/B-California/07/2009, carrying 212T, failed to generate chimeric H1N1 CVVs as before. These results indicated that the 212T substitution had a deleterious effect on the rA/B-California/04/ 2009 vaccine strain. All results are summarized in Table 1.
Table 1
Rescue scheme for an alternation at site 212
| Chimeric virus | Mutant chimeric virus with altered site 212 |
rA/B-California/07 | √ | - |
rA/B-California/04 | - | √ |
* “√” indicates the rescued virus generated successfully; |
“-” indicates the rescued virus failed to be generated.
With a revelation that the generation of chimeric A/California /07/2009 and A/California/04/2009 CVVs related to a substitution A212T, we attempted to undertake single nucleotide polymorphism analysis to calculate the scope of variation belonging to amino acid sequences of influenza H1N1 HA. Single Nucleotide Polymorphism(SNP) data for the H1N1 HA proteins of human and avian influenza viruses were selected from Influenza Research Database (IRD). In both human and avian H1N1 AIVs, alanine (A) was more prevalent than threonine (T) in residue 212. About 94% of human H1N1 IAVs had A at residue 212, followed by 5% having T. Serine (S), glycine (G), valine (V), and glutamic acid (E) at residue 212 were far lower than 1% (Fig. 3a). Furthermore, about 99% of avian H1N1 IAVs had A at residue 212, followed by T at 1% (Fig. 3b). Therefore, A212 was overwhelmingly dominant in the HA of the H1N1 subtypes.
Structural Characterization Of Chimeric Has And Site 212
To predict the structural features of CVV chimeric HA proteins and properties of substitution A212T, we submitted the sequences of R_07 and R_04 proteins to Alphafold2 (AF2) to predict the 3D structures of chimeric proteins (Fig. 4). The theoretical R_07 structure was generated based on multiple sequence alignments and available experimental protein structures [23].
As the 3D predictions were established, phylogenetic analysis describing sequence conservation grades could be conducted to demonstrate the evolutionary conservation at residues of chimeric HAs. To investigate whether residue 212 was conserved, we calculated the evolutionary degree of all sites in chimeric HA proteins using the ConSurf web server. Residue color values, a conservation score the ConSurf provided for each residue in 3D models of proteins, assigned a score of 6 to the residue 212 in the R_07 structure (Fig. 5). Although the key functional positions had a bias towards being highly conserved, residue 212 of chimeric H1N1 HAs was intermediate.
212t Substitution Induces A Reduction In The Stability Of Chimeric H1 Has
To examine how the substitution A212T affects the generation of recombinant influenza viruses, we performed a range of mutations on each amino acid in predicted chimeric HA models. FoldX, a frequently used algorithm for protein stabilization experiments, can be used as a plugin in the protein structure visualizer YASARA[24]. We used the predicted 3D structure of chimeric HA proteins to generate possible single amino acid substitutions in the R_04 sequence and calculate the energy difference between the single amino acid 212T and 212A of H1N1 chimeric HAs. The point of the predictions was to evaluate the change in thermodynamic stability regarding the R_04 protein. The corresponding free Gibbs energy change of mutating all non-A residues of R_04 protein to an A residue (ΔΔG) was obtained using FoldX. Mutations with negative ΔΔG values of AlaScan indicated that they were more stable than the wild-type R_04 protein, yet those with positive values indicated that they were less stable than the wild-type. The ΔΔG values of residue T212 were − 1.64 kcal/mol, which suggested the T212A substitution stabilized the chimeric H1N1 HA proteins. This value corroborated the residue 212T destabilizing the chimera R_04. Following the PositionScan operations of the software FoldX for mutating every amino acid in the R_07 sequence to the other 5 amino acids that might appear at this site, we excluded other variants (all ΔΔG༞0 kcal/mol, listed in Table2). Furthermore, we confirmed A was the optimum amino acid for the chimeric CVVs at position 212.
Table 2
The free Gibbs energy change (G) of mutating residue 212A of R_07 protein to the other 5 amino acids
ΔΔ |
Residue 212 | Mutants | Free Gibbs energy change (kcal/mol) |
212A | 212T | 1.83067 |
212S | 0.99211 |
212G | 1.05901 |
212V | 0.98729 |
212E | 11.1011 |
212t Substitution Induces A Decreased Sa-binding Avidity
The chimeric H1N1 HAs shared a large component of California HA ectodomains that made up 89% of the size of the H1N1 HA protein. Thus, the globular domain of the R_07 chimeric HA protein was deemed similar to that of the H1N1 HA protein. It was demonstrated previously that the residue 212 of the chimeric HA adjoined to the 190-helix, a component of the receptor binding site, in both protein primary sequence and tertiary structure, as it appeared in California HAs. Human influenza viruses were attached to α2,3-linked sialylated glycans, and avian influenza viruses had a partiality for α2,6-linked sialylated glycans[25]. To further test the contribution of the substitution A212T to the receptor-binding avidity of chimeric H1N1 HAs, we used molecular docking technology to perform the interactions between receptors–chimeric hemagglutinins and ligands–SA analogs. Sialylneolacto-N-tetraose c (LSTc, PubChem CID: 53477861) was a sialooligosaccharide, and has been suggested for a human glycan receptor analog. Additionally, sialyllacto-N-tetraose (LSTa, PubChem CID: 53477859) has been suggested as an avian glycan receptor analog [26]. AutoDock was used to dock the receptor analogs LSTc and LSTa with R_07 and R_04 HA proteins and predict the optimal conformations. Optimal conformations of ligands were selected from different poses according to the docking scores. Considering that docking in the water environment could significantly improve the estimation accuracy of binding energy[27], water molecules were added to the ligand during the docking process[28]. Among the predicted R_07 HA-LSTa (Fig. 6a) and R_04 HA-LSTa (Fig. 6b) complexes, the best poses of LSTa had SIA directed to the 130-loop. Instead, the optimal pose of the R_04 HA-LSTc (Fig. 6c) complex had a SIA toward 130-loop. In contrast, the R_07 HA-LSTc (Fig. 6d) complex had SIA turn toward 220-loop. Furthermore, the two-dimensional interaction diagrams of ligand-receptor interaction were drawn using MOE.
It has been reported that the CA04 H1 HA showed a receptor-binding property of preferential recognition to α2-6-linked glycans. As anticipated, the binding affinities of chimeric H1 HAs showed a partiality for LSTc, but the ability to bind both analogs LSTc and LSTa declined. We recorded 4 binding free energy data of each optimal structure after docking (Table 3). Detection using AutoDock algorithm about the binding energy showed whether a T212 substitution increased the binding free energy between R_04 HA protein and analogs. The R_04- analogs complexes displayed higher binding energy values than R_07 complexes, and thus T in site 212 appeared to decrease the chimeric H1 HAs’ SA-binding avidity.
Table 3
Binding free energy data (G) of receptor analogs and chimeric HAs
Δ |
Ligands | Complexes | Binding Free Energy (kJ/mol) |
LSTc | R_07 HA-LSTc | -8.9408 |
R_04 HA-LSTc | -8.6752 |
LSTa | R_07 HA-LSTa | -7.5533 |
R_04 HA-LSTa | -6.8546 |
To further analyze the receptor binding characterization of the substitution A212T, we visualized the receptor-ligand interactions in the optimal complexes with their binding pockets (Fig. 7). Root-mean-square deviation (RMSD) was calculated for measuring the conformational change of ligands and related pockets through pair-wise comparison by PyMol software. The RMSD value for R_07-LSTa-binding pockets and R_04-LSTa-binding pockets indicated the convergence of structure (RMSD value: 0.265, Fig. 7a). While comparing R_07-LSTc-binding pockets and R_04-LSTc-binding pockets, the RMSD value still showed a slight distinction (RMSD value:0.816, Fig. 7b). In comparison, the rA/B-California viruses with substitutions in 212 could affect the conformation of LSTc-chimeric H1N1 HAs.