3.1 Determination of pharmacophore model
In the common pharmacophore identification process three variant combinations were derived such as AADRR, ADRRR, and DRRRR encompassing of H-bond acceptors (A), H-bond donors (D), and aromatic rings (R). After rating these three common pharmacophores, 22 five-point pharmacophore hypotheses were generated. The survival active, inactive, vector, and volume scores were used to rank the generated hypotheses. For further investigation, the hypothesis AADRR associated with five pharmacophore site points taken into consideration and the geometry of AADRR is described in Figure 3. A1 and A2 spheres represent the H-bond acceptor feature, whereas D5 sphere represents the H-bond donor, and R12, R15 represents aromatic ring features.
3.2 Validation of 3D-QSAR model
Using atom-based analysis, the top three pharmacophore hypotheses were built as 3D-QSAR models. In the PLS regression approach, the pIC50 values used as dependent variables and the maximal PLS factors with PHASE descriptions used as independent variables. Table 5 summarises the detailed results of the 3D-QSAR models (Akt1) based on test and training set selection. Out of the top three hypothesis, AADRR gave good statistical model with high values of correlation coefficient; R2 = 0.90, low standard deviation; SD = 0.37, variance ratio; F = 137, high predictive coefficient; Q2 = 0.64, low RMSE = 0.56 and Pearson’s R value = 0.78. As a result, the 3D-QSAR model that was created had excellent statistical criteria and might be utilised for future optimization and development. Figure 4 displays the scatter plot of the experimental versus predicted pIC50 values for the training and test set ligands. The graph revealed that the predicted and experimental values had a positive association. As a result, the robustness and significance of the developed 3D-QSAR model may be validated.
3.3 3 D-QSAR visualization of best active compound
In atom-based 3D-QSAR model, steric incompatibilities with target receptors will be prioritised over pharmacophoric characteristics to predict the activity of screened compounds. Whereas the activity in pharmacophore-based 3D-QSAR model will be predicted based on the pharmacophore's sites and locations. Generally, the 3D-QSAR visualization aids in a better understanding of the structure-activity relationship (SAR) while evaluating the activity of screened hits. The 3D-QSAR visualization of the best active molecule 79 (pIC50=9.00) is shown in Figure 5. In Figure 5a, the green cubes signified the hydrogen bond donor region where as hydrogen bond acceptor region was showed by blue cubes and it shows the existence of H-bond donor characteristics at NH group of fused pyrazole ring, NH group of substituted aniline and pyrrolidine -NH. However, there is only one donor (D5) in the hypothesis. Two H-bond acceptor features were identified, which was validated with the hypothesis (A1&A2): one is on the Br attached to pyrazole and one more is on the piperazine ring. In Figure 5b, the green cubes represent the hydrophobic favoured region and the disfavoured region is indicating by orange cubes. The NH of pyrazole ring, benzene ring, NH of pyrrolidine ring, and the NH of piperazine ring are the four hydrophobic characteristics that can be seen superimposed. Surprisingly, additional H-bond donors, acceptors, and hydrophobic characteristics are particularly beneficial for the activity.
3.4 Pharmacophore-based virtual screening
In drug development, virtual screening is one of the most promising approaches in which the pharmacophore-based virtual screening of existing molecular libraries used to find a novel and potential entities for further development. 91,453 Molecules were taken from the Asinex elite synergy database and screened by applying the generated pharmacophore model (AADRR). As a result, 5960 hits were retrieved for efficient Akt1 inhibition. In the present work, the ligands were examined to match all the five features of the hypotheses. During this process, the screening hits obtained from the pharmacophore-based virtual screening are intelligible and will undoubtedly have a significant possibility to function as Akt1 inhibitors.
3.5 Molecular docking-based virtual screening
Molecular docking studies were performed against the hits from Asinex elite synergy database and the active site of Akt1 receptor by using two docking techniques: standard precision (SP) and extra precision (XP). Initially, 5960 molecules were subjected to SP docking in which 2% of hits (115 molecules) exhibited high dock score and that are computed to XP docking. Finally, the resulted 11 molecules with high dock score were directed to binding energy calculations by using Prime MM/GBSA analysis. The schematic diagram for complete virtual screening procedure was outlined in Figure 6.
3.6 Interaction studies of screened hits
LID (Ligand interaction diagram) of Schrodinger was used to examine the interactions of screened hits and the outcome is shown in Figure 7. At first, the best active molecule 79 exhibited three hydrogen bond interactions: the first between the nitrogen of pyrrolidine ring and Asn 54 residue; the second between the -NH of pyrazole ring and Gln 203 residue and the last one was between the nitrogen of fused pyrimidine ring and Ser 205 residue. In Addition, four pi-pi stacking interactions were observed between 1H-pyrazolo[3,4-d] pyrimidine ring and Trp 80 residue. The hit molecules with high binding energy (A1-A11) formed one or more hydrogen bonding interactions with amino acid residues in the active pocket of Akt1 protein (Supplementary Information figure S1). For example, molecule A5 with binding energy -93.93 kcal/mol demonstrated specific hydrogen bonding interactions, where the oxygen of 5-methylpyrazolo[1,5-a] pyrimidin-7(4H)-one ring with Asn 54, -NH and oxygen of phthalazin-1(2H)-one ring with Thr 211 and Ser 205 respectively. Also, four prominent pi-pi stacking interactions were observed with Trp 80 residue, which was the key binding interaction of the active compound.
3.7 Binding free energy calculation
The binding free energies of top 10% of the hit molecules retrieved from the database are shown in Table 6. From Table 6, it is evident that all hits possessed high binding free energy values and were compared with the best active compound 79. In particular A1, A2, A3, A4, A5, A6, A7, A8, A9, A10 and A11 with the good binding energy values of -90.63, -94.36, -86.66, -90.93, -93.20, -87.17, -86.89, -86.71, -87.37, -88.66 and -90.84 Kcal/mol respectively which are greater than the best active compound 79. As a result, these hits can undoubtedly behave as effective Akt1 inhibitors.
3.8 Designing of N-(3-(6-methoxybenzo[d]oxazol-2-yl)pyridin-2-yl)acetamide analogues (MPA’s)
The hits taken from the Asinex database exhibited high binding free energy values and strong binding interactions. Especially, molecules A2 and A5 are comparable with the best active compound 79 in terms of both interactions and free energy. Total 11 hits showed higher binding free energy values than the active compound (> -85.61 kcal/mol). However, A1-A11 hits exhibited two to three hydrogen bonding interactions that are responsible for Akt1 inhibition. The hypothesis generated from this current research resulted with five characteristics, including two acceptors group(A), one donor group(D) and two aromatic rings (AADRR). Interestingly, most of the screened compounds possessed either pyridine ring (A4, A8, A9) or fused pyridine derivative in their core structures. Therefore, we have designed a set of new pyridine derivatives: N-(3-(6-methoxybenzo[d]oxazol-2-yl)pyridin-2-yl)acetamide (MPA’s) analogues, based on the structural characteristics received from virtual screening, which could be exhibits strong interactions and better binding free energy values.
3.9 Molecular docking and Binding free energy calculations of MPA’s
The newly designed MPA’s further subjected to molecular docking against the active site of Akt1 (Figure 8) by using extra precision (XP) and the same grid was used, which was used for earlier docking of dataset molecules. The ligand interaction diagram of newly designed MPA’s is depicted in Figure 9. Most of the molecules formed the π- π stacking interaction with Tyr 80, which was the most promising interaction in both the active compound 79 and molecule A5 (moderate pIC50 with higher binding free energy) retrieved from Asinex elite synergy database. For example, molecule D4 exhibited two hydrogen bonding interactions in which the acyl oxygen was with Thr 211 and oxygen linked with aryl ring was with Tyr 326 residues. Also, D4 exhibited two π - π stacking interactions in which the pyridine ring and fused oxazole rings were formed with Trp 80 residue. Similarly, D5 molecule (highest free binding energy -105.457) showed four interactions: 1) two hydrogen bonding interactions with Thr 211 and Ser 205 residues, 2) two π - π stacking interactions with Trp 80 and Tyr 272 residues. Another molecule D10 displayed three π - π stacking interactions with Trp 80 and two hydrogen bonding interactions with Thr 211 and Tyr 326 residues. Overall, the newly designed MPA’s possessed a stronger binding affinity towards the Akt1 protein by forming multiple hydrogen bonds and π - π stacking interactions.
Further, MPA’s have been evaluated based on generated five-point hypothesis followed by prime MM/GBSA analysis. As a result, 10 hits were identified with good binding free energy values (Table 7), i.e., D1, D2, D3, D4, D5, D6, D7, D8, D9 and D10 showed binding free energy values of -100.30, -100.09, -104.25, -105.45, -104.46, -102.02, -102.85, -97.33, -98.13, and -97.85 kcal/mole respectively. Comparative study between the active compound and newly designed MPA’s indicated that all the 10 molecules exhibited significant free binding energies than the active compound 79.
3.10 MD Simulation analysis
RMSD value is an important parameter to obtain the stability of the protein - ligand complex during the simulation process [41]. In this study, RMSD was calculated for the backbone of 4ejn and its complex formed with D10 molecule (one of the best molecules with high binding energy and a greater number of interactions). As shown in Figure 10a, 4ejn and D10 complex accomplished a stable state after 2 ns and remained as such with exceptional leaps (less than 0.1 nm). In the 4ejn protein, the trajectory showed stability after 26 ns but earlier it was found highly dynamic in nature with more fluctuations. This clearly specifies the RMSD of the 4ejn-D10 complex is more stable than 4ejn only throughout the simulation period. RMSF is another essential parameter to analyse the fluctuation of residual variation [42] and it can also identify the rigid and flexible regions of a protein-ligand complex. The RMSF profile of the 4ejn-D10 complex (Figure 10b) exhibited lower fluctuations than the actual protein 4ejn. The higher RMSF values were observed in both complex and 4ejn in the f2 region, having certain residues namely Lys 112, Gln 113, Ala 114 and Ala 115. A variation in 4ejn near Arg 48 residue (f1 region) has shown high fluctuation when compare with the 4ejn-D10 complex. The RMSF revealed that the fluctuations of residues for the 4ejn-D10 complex are quite lower than their native protein.
Radius of gyration (Rg) is a helpful tool to understand the folding properties and compactness of the protein and protein-ligand complexes during the simulation [43]. The conformational changes of Rg can serve as a demonstration of the impact of a drug molecule on a protein structure. A protein molecule with a high Rg value indicates its loose packing, whereas a lower Rg value indicates tight packing. The Rg plot of the 4ejn-D10 complex displayed (Figure 11a) a constant gyration of ~2.22 nm after 3700 ps and till the end it was maintained. This indicates the 4ejn-D10 complex is highly stable in comparison to the 4ejn protein.
SASA values of the protein-ligand complex were analysed to examine the protein surface accessible to solvent molecules [44]. Greater SASA values represent an increase in surface area, whereas lower SASA values represent a reduction in protein volume. SASA plot is shown in Figure 11b and it is evident that there is no drastic change in the 4ejn-D10 complex compare to 4ejn. The average count of H-bonds formed in the 4ejn-D10 complex displayed in Figure 11c and found four in number throughout the simulation.
3.11 In-silico ADME properties
The results are presented in Tables 8 and 9. Estimation of partition coefficient of octanol-water (lipophilicity <5) is examined by QPlogPo/w. Higher values generally result in a high risk of metabolic clearance and are related to poor adsorption. From Table 9, it is evident that the MPA derivatives with good binding scores follow Lipinski’s rule of five and showed favourable pharmacokinetic profiles except molecular weights (Mwt≤500). PSA is another important property (should not be >140 Å) linked to drug bioavailability and the values for the derivatives MPA are in the range of 81.49– 139.42 Å. QPlogS aqua solubility values for the MPA derivatives are less than 0.5 predicting excellent intestinal absorption. QPlogBB measures the ability of a drug to cross the blood-brain barrier and the values are in the acceptable range (-2.10 - 0.45). Further, QPlogKhsa Human serum albumin binding coefficient and QPPcaco cell permeability are the other key factors for a drug to be a successful candidate and showed satisfactory predictions. The predicted human oral absorption values for the MPA derivatives are in the range of 81-96 %, which indicates the possibility of excellent oral bioavailability. Thus, MPA derivatives with good binding scores and good in-silico ADME properties were considered for the synthesis and in vitro studies.