Computational Methods
Database building and Library Screening
The most potent substituted styrylquinoline with the highest antitumour activity was taken from the literature [24], and by modification, at R1 position a library of around 2000 compounds was generated virtually. Based on the obtained fitness score using ChemT, the top 1000 hits were screened for further pre-ADMET analysis. All 1000 hits were screened through Lipinski’s Rule of 5 and around 100 hits were chosen for further study fitting with the criteria. In the virtual screening workflow protocol, all 100 hits were subjected to a flexible docking approach using AutoDock. A score cutoff of -9.20 kcal/mol for docking discarded lower-scoring compounds selecting 15 highest scoring compounds. Top 15 highest scoring, best pocket fitting, and possessing similar binding interactions with that of standard EAI045, novel and potent compounds were obtained through virtual screening and were selected for further studies.
Molecular Docking
Molecular docking was carried out on mutant EGFR enzyme (PDB ID: 5D41) to check the binding pattern and interactions. It was seen that all 15 molecules docked correctly into the allosteric binding pocket of the protein with common interacting amino acid residues LYS745, LEU788, ASP855 and PHE856. From Figures 1a and 1b, it was observed that all compounds fit into the hydrophobic pocket of the protein interacting with the common amino acid residues. The combined docking scores along with structures are shown in Table 1.
Figure 1b Docked images of top 15 compounds over T790M/C797S mutant EGFR (PDB: 5D41). Blue lines in the figures indicate π-π stacking interaction and pink lines indicate hydrogen bonding interaction.
ADMET studies
SwissADME website (http://www.swissadme.ch/index.php) was used to quantify the physicochemical and pharmacokinetic properties of the synthesized derivatives with respect to EAI045. Table 2 shows the analysis of the pharmacokinetic parameters needed for ADMET study of compounds KSK (1-15).
Table 2
ADMET study of top 15 compounds with EAI045.
Compound
|
#HB acceptor
|
#HB donors
|
TPSAa
|
MLOGPb
|
GI absorption
|
BBB permeant
|
Violation of Lipinski’s Rule
|
Bio -availability Score
|
Synthetic Accessibility
|
EAI045
|
5
|
2
|
110.77
|
1.93
|
High
|
No
|
0
|
0.55
|
3.28
|
KSK-1
|
2
|
2
|
70.83
|
3.65
|
High
|
No
|
0
|
0.55
|
3.37
|
KSK-2
|
5
|
2
|
88.68
|
0.81
|
High
|
No
|
0
|
0.55
|
3.99
|
KSK-3
|
1
|
1
|
42.15
|
3.75
|
High
|
No
|
0
|
0.55
|
3.33
|
KSK-4
|
2
|
1
|
56.4
|
3.15
|
High
|
No
|
0
|
0.55
|
4.16
|
KSK-5
|
2
|
1
|
36.36
|
3.85
|
High
|
Yes
|
0
|
0.55
|
3.05
|
KSK-6
|
2
|
0
|
58.5
|
3.78
|
High
|
No
|
0
|
0.55
|
3.94
|
KSK-7
|
3
|
2
|
75.27
|
3.38
|
High
|
No
|
0
|
0.55
|
3.34
|
KSK-8
|
3
|
1
|
100.88
|
3.91
|
Low
|
No
|
0
|
0.56
|
4.39
|
KSK-9
|
3
|
1
|
70.89
|
3.41
|
High
|
No
|
0
|
0.55
|
4.47
|
KSK-10
|
2
|
1
|
41.05
|
3.54
|
High
|
Yes
|
0
|
0.55
|
3.54
|
KSK-11
|
2
|
0
|
29.02
|
3.18
|
High
|
No
|
0
|
0.55
|
3.25
|
KSK-12
|
3
|
1
|
45.59
|
3.41
|
High
|
Yes
|
0
|
0.55
|
4.19
|
KSK-13
|
2
|
0
|
67.82
|
3.98
|
High
|
No
|
0
|
0.55
|
3.28
|
KSK-14
|
3
|
1
|
36.36
|
3.75
|
Low
|
No
|
0
|
0.55
|
3.24
|
KSK-15
|
2
|
1
|
55.04
|
3.33
|
High
|
Yes
|
0
|
0.55
|
3.41
|
aTPSA 40Å - 160Å |
bMLOGP must be less than 4.15 |
The drug must pass Lipinski’s rule to show good oral absorption in humans [25, 26]. All 15 compounds do not show Lipinski’s rule violation, passing the criteria. The MLOGP values were also within the acceptable range, showing the compounds' correct drug absorption and distribution within the body. Almost all the compounds show good GI absorption and some of them were blood-brain barrier permeant. Through bioavailability scores, it was indicated that a potent compound is available at the site of action. Overall, the compounds showed satisfactory pharmacokinetic parameters range defined for human use along with good docking scores [27–29].
Waterswap
When dealing with the cavitation and large-value problem of double decoupling, we used Waterswap, an absolute binding free energy method. Continuum solvent methods do not include the molecular detail of protein–water, ligand–water, and protein–water–ligand interaction interactions because they use an explicit model of water [23].
It uses a single simulation to replace the ligand-bound protein with bulk water molecules of the same shape and volume as shown in Figure 2. An increase in protein-ligand interactions occurs when the protein's resulting cavity is filled with a cluster of water molecules. With only one reaction coordinate, it's possible to get a more accurate free energy value because it's not the product of two huge numbers [30].
Moreover, the beneficial waters around the protein-ligand complex are also identified to obtain a stable pocket. These beneficial waters also become a potential bridge in creating interactions between likely residues and ligand. Both proteins favourable and ligand favourable waters are important to identify to assess the pocket interactions and their stability [31]. Two different types of colours are given for the protein depending on its preference i.e., the protein present in green colour states that it prefers ligand, and protein present in red colour states that it prefers water as indicated in Figure 3. The water box identified by the system further provides the required binding free energy for the protein and ligand [16].
To analyze Waterswap studies, binding free energy (∆G), shown in Figure 4 comes into play describing the stability of the waters around the pocket. If ∆Gprotein in the complete simulation decreases, it does not stabilize with the set number of iterations, the possibility of the existence of unfavourable water distorting the pocket intensity is high. In such cases, the ligand would exit the simulation box resulting in an inactive compound [32]. The similar identity of ∆Gwater is identified through drastic increase or instability of the waters resulting in distortions in the protein. This could also result in terminating the protein-ligand interactions.
EGFR protein becomes the perfect example of identifying and comparing the potential waters for the binding affinity and checking the potency of the complexed ligand. A mutant EGFR protein with known standard EAI045 is initially run for 1300 iterations using Waterswap to identify all the favourable waters and their binding free energy. Then the ligand is swapped to check the potential binding affinity and stability of the designed compounds concerning the identified favourable waters. If the binding free energy results obtained during the complete iterative simulations are similar or better than the results for standard drug EAI045, the compounds designed are the potential to withstand the waters and show potency.
Figure 5 describes the relative binding free energy graph during the 1300 iterations performed for EAI045 compared to the styrylquinoline compound identified from the literature. The blue line in the graph shows the average binding free energy change for the number of iterations. The thermodynamic properties compared in the Figure showed that the styrylquinoline stabilized better than the standard compound EAI045. Towards the end of the iterations, the graph showed stability after a drop in relative binding energy in styrylquinoline. Overall, all the attributes taken into account concludes the potential of styrylquinoline compound as an EGFR inhibitor.
The results show that Waterswap analysis offers a promising new path in the hunt for improved tools for analyzing and visualizing molecular driving forces in protein-ligand complex simulations.