Selection of robust Pharmacophore hypothesis
For the ligand-based pharmacophore modeling, a set of four known HPV16 E6 inhibitors were selected. The pharmacophore hypothesis was constructed by employing the PHASE module of the Schrödinger Suite. To find the common pharmacophore hypothesis, energy minimized structures of the four known HPV16 E6 inhibitors were imported into PHASE and the compounds that are considered as active pharmacophore sites were generated for all compounds.Eventually, 45 hypotheses were engendered as HPV16 E6 inhibitors and categorized according to the survival score. The most robust pharmacophore model sorted out was associated with the four-point pharmacophore hypotheses containing three hydrogen bond acceptors (A), one aromatic ring (R), and the best pharmacophore hypothesis was used as a query for database screening. The distance between the pharmacophoric and their sites were shown in figure 2.
Pharmacophore based virtual screening
Pharmacophore based virtual screening and docking-based virtual screening is one of the most reliable, rapid, low cost, and efficient approaches for the identification of effective inhibitors from huge chemical libraries. Initially, the well validated pharmacophore model (AAAR) of HPV 16 E6 inhibitors were used as a 3D structural query for identification of potential compound with desired properties from the ASINEX database with fitness value ranges >1. Initially, 2000 hits were obtained as a base on fitness score. Then, the selected hit compounds were further taken into molecular docking studies. In this investigation, we used Glide docking for the identification of the hits. Glide docking protocol comprises of three filtering process steps: high throughput virtual screening (HTVS), standard precision (SP), and extra precision (XP). Initially, a total of 2000 compounds were docked into the active site of HPV16 E6. 1000 compounds were obtained with HTVS, then these ligands were applied in the SP mode, resulting roughly 500 compounds. Finally four potential ligands with binding free energy lower than -8.547kcal/mol were obtained from the Glide XP mode. Then, the four compounds were further taken into the ADME properties evaluation. All of the four hits possessed desired predicted ADME properties. The chemical names of the four lead molecules are shown in Figure 3.
Docking study
The docking poses of the top four lead molecules in the active site of the HPV16E6 are shown in Figure 4 and the XP results for the four lead molecules which are listed in Table 1.
Docking simulation of Lead 1 into HPV16 E6
Upon binding, four hydrogen bonding interactions were observed between Lead 1 and HPV16 E6. The first is a hydrogen atom of the Lead 1 with the backbone oxygen atom of the hydrophobic residue of CYS51 with a bond distance of 2.12Å. The second is an oxygen atom of the Lead 1 with the backbone hydrogen atom of the CYS51 and has a bond length of 2.16Å. The third is an oxygen atom of Lead 1 with a side chain hydrogen atom of the GLN107 with a bond length of 1.79Å. The final is a hydrogen atom of Lead 1 with the backbone oxygen atom of the polar residue SER74 and has a bond length of 2.41Å. Furthermore, the following residues, LEU 50, ILE 73, ILE 128, CYS 66, VAL 53, PHE 45, and LEU67 were involved in hydrophobic interactions with Lead 1. The glide score and glide energy were calculated to be -8.896 kcal/mol and -57.416 kcal/mol respectively.
Docking simulation of Lead 2 into HPV16 E6
Upon binding, three hydrogen bonding interactions were observed between Lead 2 and HPV16 E6. The first is a strong interaction between a hydrogen atom of the Lead 2 with the backbone oxygen atom of the hydrophobic residue of CYS51 and a bond distance of 2.05 Å. The second is a strong interaction between a hydrogen atom of Lead 2 with the backbone oxygen atom of the polar residue of SER71 and a bond length of 1.81 Å. The third is an oxygen atom of Lead 2 with the backbone hydrogen atom of the CYS51 and a bond length of 1.98Å. The following residues, TYR 70, LEU 50, ILE 104, CYS 66, and VAL 31 are involved in hydrophobic interactions with Lead 2. The glide score and glide energy were calculated to be -8.856 kcal/mol and -57.690 kcal/mol respectively.
Docking simulation of Lead 3 into HPV16 E6
Upon binding, three hydrogen bonding interactions were observed between Lead 3 and HPV16 E6. The first is a strong interaction between a hydrogen atom of the Lead 3 with the backbone oxygen atom of the hydrophobic residue of CYS51 and a bond distance of 2.11 Å. The second is a strong interaction between a hydrogen atom of the Lead 3 with the backbone oxygen atom of the polar residue of SER71 and a bond length of 1.70 Å. The third is an oxygen atom of Lead 3 with the backbone hydrogen atom of the CYS51 and a bond length of 1.99 Å. The following residues, TYR 70, LEU 50, ILE 104, CYS 66, and VAL 31 are involved in hydrophobic interactions with Lead 3. The glide score and glide energy were calculated to be -8.496 kcal/mol and -65.257 kcal/mol respectively.
Docking simulation of Lead4 into HPV16 E6
Upon binding, three hydrogen bonding interactions were observed between Lead 4 and HPV16 E6. The first is a strong interaction between a hydrogen atom of the Lead 4 with the backbone oxygen atom of the hydrophobic residue of CYS51 and a bond distance of 1.95 Å. The second is a strong interaction between an oxygen atom of the Lead 4 with the backbone hydrogen atom of the hydrophobic residue of CYS51 and a bond length of 2.17 Å. The third is a strong interaction between a hydrogen atom of the Lead 4 with the backbone oxygen atom of the GLN 107 and a bond length of 2.46 Å. The last is an oxygen atom of Lead 4 with the backbone hydrogen atom of the polar residue of SER 74 and a bond length of 1.98 Å. One π-π interaction was found between His 78 with a phenyl ring in lead 4. The following residues, TYR 70, LEU 50, ILE 104, CYS 66, and VAL 31 are involved in hydrophobic interactions with Lead 4. The glide score and glide energy were calculated to be -8.462 kcal/mol and -50.920 kcal/mol respectively.
ADME Properties prediction
ADME describes the destiny of drug-like compounds within living organisms, especially in the human system. The ADME properties of the four newly identified lead were assessed using QikProp. The above four lead molecules were found to have desirable drug-like properties based on Lipinski’s rule of five, which suggests a potential drug candidate have a molecular weight under 500kDa, contain less than 5 H-bond donors, have less than 10 H-bond acceptors, and a predictedlogP below 5. Then, these lead compounds were further appraised for their drug-like behavior through analysis of pharmacokinetic parameters required for absorption, distribution, metabolism, excretion and toxicity (ADMET) by using QikProp. The aqueous solubility (QPlogS) for the four lead molecules, a factor that is critical for estimation of absorption and distribution of drug within the body, ranges between -5.686 to -3.891. The cell permeability (QPPcaco2), a key factor governing drug metabolism and its access to biological membranes, ranges from~ 49 to ~1043. The predicted IC50 values for the blockage of HERG K+ channels are in the acceptable range, which is below -5. The predicted Brain/Blood barriers are under the acceptable range of ~ -2.461 to -0.644. The percentage of human oral absorption range from 62 to 100. All the pharmacokinetics parameters are fit well with the acceptable range defined for use of human. The results were listed in Table 2. According to this result, we suggest that these four compounds may be apt for human use.
Molecular dynamics simulation
The dynamic behavior of the interaction between the four Lead compounds with the HPV16 E6 was investigated using the molecular dynamics simulation package, Desmond, at different timescales under a physiological state.
MD analysis of HPV16 E6 - Lead 1
Fig. 5a shows the Root mean square fluctuations (RMSF) of the protein and Lead 1. The RMSD of the protein backbone of the HPV16 E6-Lead 1 complex showed a deviation from 2.90Å to 4.29Å during 1.26 ns to 36 ns. The protein backbone was stable from 36 ns to 77.40 ns. However, a large deviation was observed from 77.40 ns to 96.90 ns. The RMSD changed from 4.31Å to 5.78Å. The RMSD dropped back to 4.47Å at end of 100 ns. Likewise, the RMSD of the ligand was stable before 77.40 ns, but deviated significantly from 2.40 Å to 5.53 Å between 77.40 to 94.10 ns. The ligand RMSD finally dropped to 4.20 Å at the end of 100 ns. The fluctuations of each protein amino acid residue over the simulation period are displayed in Fig. 5b. The greater fluctuation indicates a decrease in the stability of the protein-ligand complex. Initially, a slight fluctuation was observed in PHE 2, ASP 56, and SER 143. The RMSF values were 6.32 Å, 2.75 Å, and 6.44Å, respectively. A large fluctuation in the side chain of SER 143 was also observed and the RMSF value was 6.39Å. The fluctuation of c-alpha showed a significant fluctuation in the GLN 3 with an RMSD of 6.25Å. The bar diagram, 5c, depicts the intermolecular interactions, including the hydrogen bonding, hydrophobic interactions, and water bridge interactions of the Lead 1 with HPV16 E6. The hydrogen bonding interaction was observed with CYS 51, SER 74, GLN 107, and has a maximum occupancy of 2.002%, 74.2%, 45.4% respectively. Hydrophobic interactions and water bridges were formed between Lead 1 with HPV16E6. Figure 5d shows that hydrogen bonding plays a crucial role in the binding stability of the HPV16 E6-Lead 1 complex. Key residues SER 74, CYS 51, GLN 107 were involved in the formation of hydrogen bonds for 63%, 99%, 100% and 88% with active site regions of HPV16 E6 during the simulation period.
MD analysis of HPV16 E6 - Lead 2
Figure 6a shows the RMSF of the protein and Lead 2. The RMSD of the protein backbone of the HPV16 E6 was steady at around 3.81 Å till the end of the 36 ns. Then, a large deviation spike was observed at 36.2 ns. The protein was stable during the remaining time and reached 2.31 Å at 99.80 ns. The RMSD of the ligand, on the other hand, showed a spike at around 60 ns. The ligand RMSD finally reached 2.75 Å at the end of 100 ns. Both protein backbones and ligands were more stable after 70 ns. The most fluctuating was found in the following amino acids: ASP 56 and SER 14., RMSF values ranged from 2.26 Å to 2.44 Å. Similarly, the following residues,GLN 3, and ASP 56, in the backbone, had more fluctuations. RMSF values were observed ranges from 2.70 Å to 2.54. Beside the residues, GLN 3, and ARG 55, in the side chain higher fluctuations and RMSF values ranged from 4.13 Å to 3.87 Å. The bar diagram of Figure 6c depicts the intermolecular interactions, which includes the hydrogen bonding, hydrophobic interactions, and water bridge interactions, of the Lead 2 with HPV16 E6. Hydrogen bonds were observed on TYR 32 and CYS 51, with a maximum occupancy of 19.8% and 35.3%, respectively. Hydrophobic interactions were observed on SER 70 with a maximum occupancy of 43.8%. Other hydrophobic interactions and water bridges were also formed between Lead 2 with HPV16E6. Figure 6d shows that CYS 52 and GLN 107 were involved in the formation of hydrogen bonds for 33% and 12%, respectively, with active site regions of HPV16 E6 during the simulation period. Other residues, such as TYR 32, PHE 69, and TYR 70 were involved in hydrophobic interactions with Lead 2. They elevated the binding stability of the protein-ligand complex during the simulation time.
MD analysis of HPV16 E6 - Lead 3
Figure 7a shows the RMSF of the protein and Lead 3. The RMSD of the protein backbone of HPV16 E6 were examined, and there were no changes in the first 5.10ns he in tprotein RMSD. The RMSD value was noted 2.66Å, but the slight fluctuation was observed from 32.20ns to 98ns and the RMSD values ranged from 3.26Å to 3.72Å, then the deviation was observed in 98.90ns and the RMSD value was noted as 4.17Å. Likewise, the ligand RMSD was observed and the RMSD value were noted in the ranges of 2.27Å to 4.17Å. The higher fluctuation of the ligand RMSD was monitored at 90.90ns and the RMSD reached 4.17Å. The most fluctuating was found in the following amino acid residue: SER 143, with an RMSF value of 5.31Å. Similarly, the backbone had more fluctuation in residue PRO 5 and SER 143, with RMSF values of 3.60Å and 5.70Å respectively. Beside the side chain of the protein themost fluctuating amino acid residues were found to be GLN 3 and SER 142, with RMSF values of 5.13Å and 5.83Å, respectively. Also, thekey residues PRO 5 and SER 143 of the c-alpha of the protein had more fluctuations were observed. The RMSF of the protein-ligand complex wasdepicted in figure 7b. The greater fluctuations indicates a decrease the stability of the protein-ligand complex. The bar diagram in Figure 7c depicts the intermolecular interactions, including the hydrogen bonding, hydrophobic interactions, and water bridge interactions, of the Lead 3 with HPV16 E6. Hydrogen bonds were observed on TYR 32 and CYS 51 with a maximum occupancy of 59.5% and 23.8% respectively. Various other inter-molecular interactions, such as hydrogen bonding, π-cation, water bridges, and hydrophobic interactions were formed between lead 3 with HPV16 E6. Figure 7d shows that CYS 51 and TYR 32 were involved in the formation of hydrogen bonds for 122% and 59% respectively, with the active site region of HPV16E6 during the simulation period. Also, one π-cation and water bridge interaction were observed on HIS 78 with a maximum occupancy of 33% and 47%, respectively. LUE 67 was predominantly involved in a hydrophobic interaction with Lead 3 and elevates the binding stability of the protein-ligand complex during the simulation time.
MD analysis of HPV16 E6-Lead 4
Figure 8a shows the RMSF of the protein and Lead 4. The RMSD of protein backbone was stable after 2.80 ns and the RMSD was noted at 2.47Å. Larger fluctuations were observed between 48 ns to 78.40 ns. The protein was more stable afterward and the RMSD reached 2.02Å. Likewise, the deviation was found in ligand RMSD between in the 48ns and 78.50ns, and the RMSD were noted ranges of 2.68Å to 2.87Å. Even though the ligand was more stable end of the simulation, the RMSD reached 2.08Å. Root mean square fluctuations (RMSF) displayed the fluctuations of each protein amino acid residue over the simulation time period (Fig. 8b).The most fluctuating amino acid residue was found to be in GLN 6, with an RMSF value of 2.15Å. Higher fluctuations indicate a decrease the interaction. Similarly, the backbone had more fluctuations in residues GLN 5 and GLN 91, with RMSF values of 2.06Å and 2.15Å, respectively. Besides the side chain of protein, the most fluctuating amino acid residues was found to be ARG 141, with an RMSF value of 6.77Å. The bar diagram Figure 8c depicts the intermolecular interactions, including the hydrogen bonding, hydrophobic interactions, and water bridge interactions, of the Lead 4 with HPV16 E6. Hydrogen bonds were observed on ASP 49and CYS 51. Among them, CYS 51 displayed hydrogen bond interactions with a maximum occupancy of 73%. Additionally, various inter-molecular interactions such as hydrogen bonding, hydrophobic interaction, as well as water bridges, formed between Lead 4 with HPV16 E6. Figure 8d shows that the hydrogen bonding, hydrophobic, water bridge was significantly responsible for the boost up of the biding stability of lead 4-HPV16E6 complex. We identified the following residues: LYS 11, ARG 10, ASP 49, and CYS 51 were involved in the formation of hydrogen bonds for 14%, 22%, 42%, and 50% with active site regions of HPV16E6 during the simulation period. Moreover, residues ARG 10, TYR 32 CYS 51, CYS 11, and ARG 102 were predominantly involved in the formation of the water bridge interactions with a maximum occupancy of 15%, 19%, 26%, 19%, 11% with Lead 4, respectively. Only one reside, LEU 67, was predominantly involved in hydrophobic interactions with Lead 4 and elevated the binding stability of the protein-ligand complex during the simulation time.