2.1. Potential target protein structure for cancer MCM7 protease
Human cancer encodes a large number of protein structures. The structure we have considered for this study is protease MCM7 (PDB ID: 6XTX, resolution: 3.29 Å). MCM7 can metastasize and destroy the living tissues in the body. The malignancies in which MCM7 is involved are hepatocellular carcinoma, head, and neck, esophagus, etc. [26]. Thus, it is suitable to design a study that will identify the compound with inhibitor activity for preventing replication of DNA of cancerous cells. 3D structure of protein MCM7 is downloaded from “Protein Data Bank”. Removal of heteroatoms, water molecules, and addition of polar hydrogens is done with the help of the software “BIOVIA Discovery Studio Visualizer” (https://discover.3ds.com/discovery-studio-visualizer-download). The structure of receptor protease MCM7 with DNA backbone and basic pairs located at the center of the structure is shown in Figure 1.
2.2. Potential inhibitor: Tridax procumbens
Tridax procumbens is a phytochemical-rich plant constituting Baicalin, Tetrandrine, Luteolin, Apigenin, Stigmasterol, Catechin, Epicatechin, Quercetin, Myricetin, Gallocatechin, Sitosterol, Akuammidine, Kaempferol, and many more [27]. Among all the phytochemicals, Luteolin is a flavonoid having anti-cancerous properties [28]. Numerous works have been done so far reporting the anticancer properties of Luteolin [29]. Anticancer activity of Luteolin is investigated against gastric cancers [30], breast cancer [31], prostate cancer [32], brain tumors [33], cervical cancer [34], skin cancer [35], etc. Thus, it makes sure that our vision to use it as an anti-cancer agent will not disappoint us. Luteolin is a flavonoid belonging to the Vitamin B family. It is mainly present in many food supplements like parsley, broccoli, onion leaves, carrots, peppers, cabbages, apple skins, etc [36]. It is highly added as a food supplement due to its anti-oxidative property. Various pre-clinical reports have proven that Luteolin possesses a wide range of pharmacological activities like anti- hepatotoxic [37], hypotensive [38], anti-urolithiasis [39], haemostatic [40], antimicrobial or antibacterial activity [41], and many others. It is seen that the Luteolin has preventive activity against several parasitic agents like Leishmania Donovani [42] and plasmodium falciparum [43]. Many studies have been already done considering Luteolin as a potential inhibitor against the Mpro protease of COVID-19 [44]. The structure of Luteolin is downloaded from the online database “PubChem” (https://pubchem.ncbi.nlm.nih.gov/) ID:5280445) (Figure1).
2.3. Computational method for structural analysis
The ground state energy optimization is done with the help of the software “Gaussian 09” and method B3LYP with a standard 6-311G basis set. Optimized geometry is used to derive the Mulliken charge distribution and molecular electrostatic potential (MEP) surface map that helps in explaining the chemical stability of the molecule. The frontier molecular orbitals (FMO) (highest occupied molecular orbital and lowest unoccupied molecular orbital) energies are also derived with the help of optimized geometry. These HOMO-LUMO energies are used for computing ionization potential (IP), energy gap (ΔE), electron affinity (EA), chemical potential (CP), electronegativity (χ), softness (S), and hardness (η) [45]. These parameters are considered global reactivity parameters and help in determining the chemical reactivity of the molecule.
2.4. Drug-likeness and ADMET properties
We have done the virtual screening of drug-likeness rules and ADMET properties of Luteolin. Lipinski’s rule, MDDR-like rule, Veber’s s rule, Ghose filter, Egan rule, Muegge rule, lipophilicity, water-solubility, etc., are examples of drug-likeness rules [46]. Some of the drug-likeness rules are molecular weight<500, hydrogen bond donors < 5, hydrogen bond acceptor < 10, MLOGP (n-octanol-water partition coefficient) < 4.15, molar refractivity should be between 40 and 130, log P ranging between −0.4 to +5.6, solubility (log S) > -5.7 [47]. Additionally, Absorption, Distribution, Metabolism, Excretion, and Toxicity (ADMET) properties are also important in drug designing as they signify whether the compound undergoes proper metabolic processes in the human body or is toxic [48]. In the present work, all the drug-likeness and ADMET features are listed with the help of the online database SwissADME.
2.5. Prediction of activity spectra for substances (PASS)
Cytotoxicity prediction is done with the help of the freely accessible online platform CLC-Pred (Cell Line Cytotoxicity Predictor) [49]. CLC-Pred is a cytotoxicity predictor used in in-silico studies that can predict the cytotoxic effect of chemical compounds by the virtue of PASS technology. Near about 4000 kinds of biological activities like toxic and adverse effects, mechanisms of action, interaction with metabolic enzymes and transporters, pharmacological effects influence on gene expression, etc. can be predicted by PASS [50]. The structure of Luteolin is submitted in smiles format. Prediction of the activity spectrum of a compound is estimated in terms of probable activity (Pa) and probable inactivity (Pi). Pa and Pi values vary between 0.000 and 1.000 [51]. The activities following condition Pa > Pi are considered as possible for a particular compound. Experimental pharmacological action is considered high if the Pa > 0.7 and low if 0.5 < Pa < 0.7 [52]. Data can also be extracted in structured data file format. Analysis of data extracted from CLC-Pred is interpreted based on their IG50 (half-maximal inhibitory growth), IC50 (half maximal inhibitory concentration), and % inhibition (of activity) values [53]. Compounds having IG50 and IC50 values 10000nM and inhibition of more than 50% are considered as active [54].
2.6. Prediction of cardiac toxicity
Prediction of cardiac toxicity detects the cardio-related harms by the consumption of the drug. Pred-hERG 4.2 is a freely accessible validated web server that is used for the identification of cardiotoxic blockers of the compound [55]. The human ether-a-go-go-related gene (hERG) is a cardiac repolarizer that mainly encodes a protein and also activates the rectifier potassium channel (IKr) [56]. Heartbeat delay is the main caution of dysfunctioning of hERG which may often cause sudden death [57]. The structure of Luteolin is submitted to Pred-hERG in smileys format. Potency, confidence, applicability domain, and probability map are recorded as the results [58]. To be non-cardio toxic, the confidence value should not exceed 0.26 for any compound. [59]. The fragments representing hERG blockage are indicated in the probability map.
2.7. Molecular docking study
For molecular docking studies, Chain 4 and Chain 7 are chosen based on the AGS cocrystal found naturally in the CryoEM structure of human MCM7 structure (PDB ID: 6XTX) [60]. Missing residues in Chain 4 and 7 structures are completed using SWISS-MODEL (https://swissmodel.expasy.org/). Based on the cocrystal AGS in the 6LXT structure, the active site coordinates are determined as x:211.170, y:201.937, and z:140.544, and the grid box volume is chosen as 20*20*20 Å3. Molecular docking is performed with both Autodock Vina and Glide to validate the results [61, 62]. The optimized Luteolin structure for molecular docking studies is obtained from the DFT study. The protein and ligand input pdbqt files required for Autodock Vina docking are created with AutoDockTools-1.5.6. For Glide docking, the protein structure is prepared with the 'Protein Preparation Wizard' in Schrödinger Maestro 12.8 and the ligand structure is prepared with the 'LigPrep' module using OPLS4 force field. Protein-ligand interactions are visualized using BIOVIA Discovery Studio Visualizer v21 and UCSF Chimera v1.15 software. The docked structure by Glide of Luteolin with chain 7 and chain 4 of 6XTX is further used for performing MD simulations.
2.8 Molecular dynamics simulations
The software “Gromacs 2019.2 version” is used for performing the molecular dynamics simulation [63, 64]. The protein preparation topology is created with Gromos 43A1 force field and SCP water model. The ligand topology file is obtained from the GlycoBioChem PRODRG2 server (http://davapc1.bioch.dundee.ac.uk/cgi-bin/prodrg) [65]. The protein-ligand complex is simulated for 300 ps in canonical (amount of substance (N), pressure (P) and temperature (T) - NVT) and 300 ps isothermal-isobaric (amount of substance (N), volume (V), and equilibrium steps temperature (T) - NPT) ensembles. The molecular dynamics simulations run for 100 ns. The root mean square deviation (RMSD) and root mean square fluctuation (RMSF) is calculated to study the stability of the complex during the simulation. RMSD and RMSF help in the prediction of the atomic positions and the complex stability of the molecule undergoing simulation.