The best 3D model of CsSBAT was built from comparative homology modeling
Both N- and C-terminal regions of CsSBAT were predicted to be hypothetical as well as disordered. In particular, disordered regions can result in long simulation time and may lead to errors in the structural clustering process [57]. Therefore, these regions were excluded from building 3D models (Additional file 1: Fig. S1). Functional region (residues 185–492) of CsSBAT matched well with experimentally characterized ASBTs such as IF-NmASBT (PDB ID: 3zuy_A) [38] and OF-YfASBT (PDB ID: 4n7x_A) [37].
To obtain reliable homologous models of parasites, combined approach of 3D modeling methods and refinement were employed [17, 18, 19, 58, 59, 60]. All predicted 3D models of IF-CsSBAT were evaluated using homology modeling programs such as Swiss-Model [28], IntFOLD [29], Phyre2 [30], RaptorX [31], and HHpred [32], and threading-based modeling program such as I-TASSER [33] (Table 1). Swiss-Model, IntFOLD, and HHpred revealed values greater than 91.0% in the most favored region of Ramachandran plot [42]. Except for IntFOLD presenting erroneous ERRAT value, Swiss-Model, I-TASSER, and HHpred were evaluated further in terms of refinement because I-TASSER is efficient in building structure of unaligned regions by employing ab initio modeling [61].
Table 1
Quality verifications of initial models of IF-CsSBAT according to different programs
3D modeling program
|
Verification value/score
|
Ramachandran1
|
ERRAT
|
ProSA
|
Cov2
|
Swiss-Model
|
91.0% (0.8%)
|
46.0
|
−3.15
|
0.56
|
I-TASSER
|
75.8% (1.9%)
|
92.0
|
−1.07
|
1.00
|
IntFOLD
|
93.8% (0.2%)
|
n.a. 3
|
−1.81
|
1.00
|
Phyre2
|
73.2% (5.1%)
|
26.9
|
−3.29
|
1.00
|
RaptorX
|
89.3% (0.4%)
|
41.9
|
−2.87
|
1.00
|
HHpred
|
91.7% (0%)
|
50.2
|
−1.43
|
1.00
|
1 Ramachandran plot, % of most favored region (disallowed region) |
2 Cov: Sequence coverage, ratio of predicted region relative to whole region |
3 n.a.: not available |
Homology modeling of membrane proteins is challenging when the target protein shares low sequence identity (approximately 20%) with a template [62]. This issue was circumvented by refining the initial models indicating poor quality [17, 18]. Therefore, the initial models of IF-CsSBAT were refined with ModRefiner, FG-MD, and GalaxyRefine (Table 2). After comprehensive evaluation, we applied Swiss-Model for reliable 3D modeling, and thereafter, FG-MD and GalaxyRefine were used for effective refinement. In fact, Swiss-Model has been a powerful tool for transporter modeling [63, 64]. Refining enhanced the structural quality of the final model compared to that of the initial model, particularly in terms of ERRAT values, which were increased from 46.0% to 97.7% (Tables 1 and 2, Additional files 1 and 2: Figs. S1 and S2); however, I-TASSER and HHpred could not overcome poor values either in most favored regions of Ramachandran plot or in unacceptable ERRAT plot. Moreover, 3D models of OF-CsSBAT, OF-HsASBT and IF-HsASBT were prepared as aforementioned.
Table 2
Comparison on refined structural evaluation
3D modeling program
|
Refinement program
|
|
Verified score/value produced by
|
ModRefiner
|
FG-MD
|
GalaxyRefine
|
|
Ramachandran1
|
ERRAT
|
ProSA
|
Cov2
|
Swiss-Model
|
⚪3
|
|
⚪
|
|
93.3% (0.8%)
|
89.7
|
−3.84
|
0.6
|
|
|
⚪
|
⚪
|
|
92.5% (1.2%)
|
97.7
|
−3.73
|
0.6
|
I-TASSER
|
⚪
|
|
n.a.4
|
|
86.8% (1.9%)
|
89.0
|
−1.07
|
1.0
|
|
|
⚪
|
n.a.
|
|
74.7% (1.9%)
|
92.9
|
−1.24
|
1.0
|
HHpred
|
⚪
|
|
n.a.
|
|
94.1% (0%)
|
69.2
|
−1.89
|
1.0
|
|
|
⚪
|
n.a.
|
|
76.0% (1.3%)
|
81.5
|
−1.89
|
1.0
|
1 Ramachandran plot, % of most favored region (disallowed region) |
2 Cov: Sequence coverage, ratio of predicted region relative to whole region |
3 Symbol “⚪”: corresponding software was applied. |
4 n.a.: not available |
Bile acid-binding cavity and sodium-binding sites
For the translocation of bile acids across the cell membrane, alternative conformational changes in IF and OF conformations of secondary active transporters were proposed [37, 65]. One of the two conformational states was observed only in a particular state because it is difficult to crystallize the structure under other states. Structural information of one state was applied to predict another conformation of transporters [66]. Here, we predicted the OF-CsSBAT and IF-CsSBAT models based on OF-YfASBT [37] and IF-NmASBT [38], respectively. In CsSBAT, the bile acid-binding pocket was presumed to be formed with 9 residues in an extracellular cavity with a volume of 908 Å3 in OF-CsSBAT (Fig. 2A) and with 11 residues in an intracellular cavity with a volume of 986 Å3 in IF-CsSBAT (Fig. 2B). Among the pocket forming residues, five residues (Phe196, Phe222, Ala288, Ala291, and Met295) participated in both conformations.
ASBT is a symporter dependent on Na+ gradient that drives bile acid transport, which carries one bile acid with two Na+ ions [67]. Two Na+-binding sites of CsSBAT, Na1 and Na2, were predicted based on NmASBT (Fig. 3A). Na1 site comprised Ser289, Asn290, Ser303, Thr307, and Glu441 residues; whereas, Na2 site comprised Gln252, Glu441, Thr442, Ile444, and Gln445 residues (Fig. 3B and C). These two sites were highly conserved with ASBTs and NTCPs [37]. Furthermore, Glu441 was predicted to participate in Na+-binding of the two sites, implying that it might have a functionally significant role (Fig. 3C). In NmASBT, mutation at Glu260 (corresponding to Glu441 of CsSBAT) markedly alters the taurocholate transporting activity [38].
Recently, a putative third Na+-binding site was proposed, albeit rather speculative without experimental evidence [65]. The third Na+-binding site was reported from similar transporters such as glutamate [68] and leucine [69] transporters. In CsSBAT, residues Ile280, Gly281, Ser283, and Gln445 were predicted to act as binding sites of the third Na+ ion, which were superposed with corresponding residues on NmASBT (Fig. 3D). Residue Gln445 could be a key residue carrying Na+-ion from Na2 to Na3 site (Fig. 3). Mutation of Gln258 in YfASBT (corresponding to Gln445 in CsSBAT) was reported to significantly reduce the Na+-binding capacity of Na2 and Na3 sites [37]. Therefore, Glu441 and Gln445 in CsSBAT might act as molecular arms and transport Na+ ion from one site to the next.
For accurate molecular docking of CsSBAT against compounds in the library, grid center and size were precisely specified in the extracellular and intracellular bile acid-binding pockets of OF-CsSBAT (Fig. 2A) and IF-CsSBAT (Fig. 2B), respectively, rather than sodium-binding sites (Fig. 3B–D), as described in “Putative inhibitors screening” section of Materials and Methods.
Putative inhibitors targeting at bile acid-binding pocket
Structure-based virtual screening was used to select putative inhibitors of CsSBAT, which satisfied the following criteria (Fig. 1). i) A compound should interact more favorably with CsSBAT than with HsASBT to ensure accurate targeting. ii) OF-ASBT conformation should be considered as a target although IF-ASBT binding with taurocholate was used as a template for virtual docking [25], because the ASBTs transfer bile acid inward via conformational change from OF- to IF-conformation [70]. iii) The compound to be identified as a competitive inhibitor of taurocholate should reveal higher affinity than natural bile acids, ranging from −6.2 to −9.0 kcal/mol [25]. Theoretically, the binding energies of several bile acids against IF-CsSBAT and OF-CsSBAT conformations ranged from −6.1 to −8.7 kcal/mol (Table 3) [19]; however, those of HsASBT (CsSBAT homolog) were −9.0 kcal/mol and −9.2 kcal/mol against natural bile acids and PATD (lead compound blocking HsASBT), respectively [25]. Thus, a cut-off value was set at −9.2 kcal/mol since the present study aimed to explore the most probable inhibitor candidates binding to CsSBAT.
Table 3
Binding energy for bile acids OF-/IF-SBAT of Clonorchis sinensis and OF-/IF-ASBT of Homo sapiens using AutoDock vina
|
|
C. sinensis
(kcal/mol)
|
H. sapiens
(kcal/mol)
|
Bile acids
|
PubChem ID
|
OF
|
IF
|
OF
|
IF
|
Chenodeoxycholic acid
|
24875071
|
−6.7
|
−7.6
|
−7.7
|
−7.0
|
Taurochenodeoxycholic acid
|
312642451
|
−6.4
|
−7.7
|
−8.0
|
−8.0
|
Glycochenodeoxycholic acid
|
177011774
|
−6.4
|
−7.6
|
−7.7
|
−7.3
|
Glycodeoxycholic acid
|
57309861
|
n.a.1
|
n.a.
|
n.a.
|
n.a.
|
Deoxycholic acid
|
2225282
|
−6.1
|
−7.7
|
−7.3
|
−6.9
|
Taurocholic acid
|
828139
|
−6.6
|
−8.7
|
−7.1
|
−7.0
|
Glycocholic acid
|
177011773
|
−6.8
|
−8.5
|
−8.0
|
−6.8
|
1 n.a.: not available |
2 PubChem compound ID but other bile acids are PubChem substance ID |
Compounds satisfying Lipinski’s rule of five
Compounds following the Lipinski’s rule of five [47] were screened using MTiOpenScreen [48]. Of the top 1,000 scoring compounds under docking simulation against OF- and IF-conformations of CsSBAT and HsASBT, 19 compounds that could interact with only OF-CsSBAT or IF-CsSBAT were selected (Fig. 4 and Additional file 3: Table S1). Of these, two compounds met our strict criteria. Compound 49734421 formed a hydrogen bond with Ala291 of IF-CsSBAT and Asn446 of OF-CsSBAT. Compound 124948115 formed two hydrogen bonds with OF-CsSBAT but not with IF-CsSBAT (Table 4 and Fig. 5). Majority of the residues of these two compounds were involved in hydrophobic interaction with residues on CsSBAT, implying that these interactions might play a crucial role in compound–protein interactivity. It has been reported that aromatic moieties with high hydrophobicity can enable beneficial interactions with nonpolar residues in the binding pocket [71].
Table 4
Inhibitory compounds subjected to virtual screening using MTiOpenScreen and selected by applying Lipinski’s rule of five
PubChem
ID
|
Binding energy
(kcal/mol)
|
nRot
|
HBA
|
HBD
|
LogP
|
Mr
|
TPSA
|
Hydrogen bond
|
Toxicity
class
(LD50: mg/kg)
|
OF
|
IF
|
OF
|
IF
|
49734421
|
−10.9
|
−9.3
|
4
|
7
|
1
|
2.3
|
396.5
|
94.8
|
1
|
1
|
4
(1,077)
|
124948115
|
−10.3
|
−9.3
|
5
|
6
|
1
|
2.8
|
397.5
|
67.6
|
2
|
0
|
4
(1,500)
|
Abbreviations: nRot, number of rotatable bonds; HBA, hydrogen bond acceptors; HBD, hydrogen bond donors; LogP, lipophilicity; Mr, molecular weight; TPSA, topological polar surface area. |
Since Lipinski’s rule of five was introduced in 1997 [47], absorption or drug permeability is presumed to be more likely when there are less than 5 hydrogen bond donors, less than 10 hydrogen bond acceptors, a molecular weight with less than 500, and a calculated LogP smaller than 5. Recently, it was suggested that antiparasitic drugs should be exempted from this rule because several drug leads for infectious diseases do not follow Lipinski’s rule of five [47, 72]. Less stringent criteria may allow to identify more lead compounds for further assays. The suggestion motivated us to find out more effective strategy for antiparasitic drugs.
Large compounds satisfying high affinities
PATD was recently synthesized and evaluated as a potent inhibitor against ASBT [25]. Molecular weight of PATD molecule is larger than 500 Da because it has several polyacrylic acids and tetraDOCAs. Surprisingly, PATD is a hydrophobic substrate, which violates ideal molecular weight of the Lipinski’s rule of five. Nonetheless, it successfully inhibits ASBT by filling up the bile acid-binding cavity [47]. This finding motivated us to screen compounds with molecular weight higher than 500 Da, which are assumed to tightly dock CsSBAT.
Compounds of high molecular weight (500–1,200 Da) were retrieved from PubChem compound library [50] and screened using AutoDock Vina v1.1.2 [51]. Of the 1,255 compounds, 49 compounds satisfied the three given criteria. By strictly applying the third criterion (higher affinity than natural bile acids), we selected 25 compounds with high affinity for CsSBAT, but low affinity for HsASBT (Additional file 4: Table S2). Five compounds presented toxicity values of 4–6 with LD50 ranging 500–5,000 mg/kg (Table 5). Eventually, two compounds 45375808 (Fig. 6A and B) and 9806452 (Fig. 6C and D) were selected as possible candidate inhibitors for CsSBAT because these compounds could form more than two hydrogen bonds each with OF-CsSBAT and IF-CsSBAT.
Table 5
Inhibitory compounds with high molecular weight (Mr > 500 Da) selected using AutoDock Vina v1.1.2
PubChem
ID
|
Mr1
|
MF2
|
Binding energy (kcal/mol)
|
Hydrogen
bond
|
Toxicity
class
(LD50: mg/kg)
|
C. sinensis
|
H. sapiens
|
OF
|
IF
|
OF
|
IF
|
OF
|
IF
|
441243
|
670.8
|
C38H50N6O5
|
−12.3
|
−10.0
|
−8.9
|
−7.9
|
5
|
1
|
4
(500)
|
4701
|
508.6
|
C31H32N4O3
|
−11.2
|
−10.9
|
−8.9
|
−8.7
|
4
|
0
|
4
(800)
|
92727
|
628.8
|
C37H48N4O5
|
−10.0
|
−10.0
|
−9.0
|
−8.7
|
4
|
0
|
5
(5,000)
|
45375808
|
529.5
|
C22H29FN3O9P
|
−9.7
|
−9.7
|
−8.4
|
−8.4
|
2
|
4
|
6
(12,000)
|
9806452
|
512.7
|
C30H41FN2O4
|
−9.3
|
−9.3
|
−7.6
|
−8.3
|
4
|
2
|
5
(3,990)
|
1 Mr, molecular weight |
2 MF, molecular formula |
Notably, through docking simulation on compound–protein interactions, residues Glu229 and Gly287 participated in hydrogen bonding in taurocholate-IF-CsSBAT complex, whereas residues Gly287, Gln345, and Gln348 participated in hydrogen bonding in taurocholate-OF-CsSBAT complex (Additional file 2: Fig. S2). Gly287 was involved both in taurocholate–IF-CsSBAT and taurocholate–OF-CsSBAT complexes. In majority of the compound–OF-CsSBAT complexes, this conserved residue Gly287 was involved in hydrogen bond interaction (Fig. 6A, C, E, and G), and Gln345 was involved in compound 92727-OF-CsSBAT complex formation (Fig. 6I). Among compound-IF-CsSBAT complexes, only compound 441243 formed hydrogen bond with Glu229 (Fig. 6F). Compared to taurocholate-CsSBAT complexes, Ala288 acted as a key residue for either hydrogen bonds or hydrophobic interactions in three compound–CsSBAT complexes (Fig. 6A–F). Compounds 441243 and 3693566 presented 5 and 6 hydrogen bonds, respectively; however, these compounds were excluded owing to less hydrogen bonds in compound-IF-CsSBAT interactions, which could result in off-target binding with adverse drug reactions [73].
Compound 45375808, known as sofosbuvir, was proposed to inhibit nonstructural protein 5B polymerase in hepatitis C virus (HCV) [74]. Analogs of this compound exhibited inhibitory effect on HCV [75]. Compound 9806452 was reported as an inhibitor of matrix metalloproteinases [76] such as gelatinase A associated with tumor metastasis [77] and stromelysin-1 found in osteoarthritis [78]. Carboxyalkyl peptides containing a biphenylylethyl group inhibits adult Schistosoma mansoni [79]. Considering these reports, it is suggested that compounds 45375808 and 9806452 could be anthelminthic candidates for C. sinensis.