A list of 146 non-redundant disease-genes was identified from the DisGeNet database. The list includes 117 and 53 genes for SARS and MERS respectively with 24 genes common to both the diseases (Supplementary Table S1). A directed disease-gene-gene network having 2394 genes connected via 18810 interactions have constructed by graphite tool. These 18810 interactions of the network were classified into 12306 positive, 3055 negatives, and 3449 neutral interactions.
Further 52, 18, and 40 experimentally validated drug-gene interactions of quercetin, NAC, and 2-DG respectively were manually curated by extensive literature survey (Supplementary Table S2). Also, with each drug its interaction characteristics with the target gene are considered i.e. positive (+1) for agonist, negative (-1) for antagonist and neutral (0) for just binding interaction.
Further, these genes were mapped to the disease-gene-gene network. This way 33, 11, and 31 genes for quercetin, NAC, and 2-DG were retained in the network. Finally, three networks i.e. quercetin-gene-gene-disease directed network having 2395 nodes and 18843 interactions(Figure 2), NAC-gene-gene disease directed network having 2395 nodes and 18821 interactions(Figure 3) and 2-DG-gene-gene- disease directed network having 2395 and 18841 interactions (Figure 4) were constructed and visualized in Cytoscape.3.6.0.
Shortest Path and DV vector
To calculate the association between a drug i.e. quercetin, 2-DG, NAC, and diseases (SARS and MERS), the shortest paths between drug-genes and disease-genes were identified in all the three networks as explained in detail in the methodology section.
In quercetin-gene-gene-disease network, 528 shortest paths were identified (Figure 5). 1% of the shortest path had common genes interacting with both the drug and the disease directly. These include ESR1, EGFR, ACE, CASP3, BCL2L1 (highlighted as yellow nodes in Figure 5). 7% of the shortest path had no connecting gene between drug-gene and disease-gene pairs. Lastly, 79% of these shortest-path contain 1 connecting gene (highlighted as green nodes in Figure 5) between drug-gene and disease-gene, whereas 13% of these shortest-path contains 2 connecting genes (highlighted as pink nodes in Figure 5) between drug-gene and disease-gene (Figure 5). This shows that quercetin could be an effective drug and would be able to affect many disease-associated genes.
Likewise, NAC-gene-gene-disease directed network contains 243 shortest paths of 11 drug-genes (Figure 6). 8% of these shortest paths have no connecting genes. Whereas, 22% and 79% of the shortest path have 1 connecting gene and 2 connecting genes between drug-gene and disease-gene pair respectively.
385 shortest paths of 31 genes targets were identified in the 2-DG-gene-gene-disease directed network (Figure 6). 14% of the shortest paths in the 2-DG affected network have no connecting genes between drug-gene and disease-gene pair. While 68% and 18% of the shortest path contains either 2 connecting genes or 1 connecting gene respectively.
Thus quercetin has a more number of associations with SARS and MERS disease-genes than that of NAC or 2-DG, as shown by the shortest paths between the drug-gene to diasease-gene. In quercetin, 87% of its total shortest paths have at least 1 connecting gene. While NAC and 2-DG have 30% and 82% of their total shortest paths have at least 1 connecting gene respectively. The influence of NAC on disease-genes seem to be complicated as 70% of the shortest paths have atleast 2 connecting genes between drug-gene and disease-gene.
Further, this effect was quantified and expressed as a vector DV. DV is a vector and denoted the extent to which a disease gene was influenced by a drug. The DV values for quercetin, NAC and 2-DG are -70.19 , -39.99 and -13.71 respectively. Quercetin has a greater negative influence on SARS and MERS disease-gene with a DV value of -70.19 (Supplementary Table S3). Whereas NAC and 2-DG show lesser negative influence on SARS and MERS disease-gene target with a DV value of -39.99 (Supplementary Table S4), and -13.71 respectively (Supplementary Table S5).
Thus from the present analysis, quercetin potentially appears to be a better drug that can be repurposed against SARS and MERS than NAC and 2-DG. Thus, quercetin could also be presumably be effectively used against nSARS-CoV-2 infection because it shares 82% and 43% genome similarity with SARS-CoV and MERS-CoV betacoronaviruses.
While analysing the shortest path network of quercetin and 2-DG, some important biological interactions were discovered. Some of these interactions could have therapeutic implications e.g. in quercetin-gene-gene-disease shortest path network, ESR1, EGFR, ACE, CASP3, BCL2L1 are common direct interactors of both SARS/MERS and quercetin. ESR1 can induce classical IFN-induced antiviral genes, which inhibits the virus entry in the cell [83, 84]. Quercetin is an agonist of the ESR1 receptor that could help to boost the antiviral response against SARS-CoV or nSARS-CoV-2[85]. EGFR dysregulation protects against pulmonary fibrosis development caused by SARS-CoV and nSARS-CoV-2. Quercetin is an antagonist of EGFR and helps to reduce the risk of developing pulmonary fibrosis during SARS-CoV and nSARS-CoV-2[86]. ACE is an important enzyme activated during SARS that converts angiotensin I (AT I) to angiotensin II (AT-II), which further binds to either angiotensin II receptor 1a (AT1aR), leading to tissue damage and lung edema[87]. Quercetin is the inhibitor of the ACE that can help to reduce the tissue damage and formation of lung edema[88]. The protein encoded by ORF-6 present in both SARS-CoV and nSARS-CoV-2 can induce apoptosis through caspase-3 mediated, ER stress, and JNK-dependent pathway[89]. Quercetin inhibits the CASP3 protein and can decrease excessive apoptosis that reduces the formation of ischemic injuries in SARS-CoV patients [90]. It has found that E protein of the SARS-CoV induces the NFKB1 that triggers the expression of pro-inflammatory cytokines such as IL12RB1, IL6, TNF, IL1B, etc[91-93]. This increases the infiltration of more neutrophils in lung tissues. The clinical finding of critical nSARS-CoV-2 patients admitted to intensive care units shows consistent high-level pro-inflammatory cytokines in their plasma[93]. Quercetin has also been found as a direct inhibitor of NFKB1 in the shortest path network that further regulates expression of pro-inflammatory cytokines such as IL12RB1, IL6, TNF, IL1B. and it was observed that treatment with drugs that inhibited NF-κB activation led to a reduction in inflammation and lung pathology in SARS-CoV-infected cultured cells and also increases the mice survival rate[90]. Literature reports have shown that during both SARS-CoV and nSARS-CoV-2 infection there is an overproduction of pro-inflammatory cytokines (TNF, IL-6, and IL-1β) that results in a cytokine storm [94]. When this high cytokine concentrations get persist over time could lead to an increased risk of vascular hyperpermeability, multiorgan failure, and mortality[94]. Therefore, most of the therapeutical strategies develop until now are directed towards maintaining an adequate inflammatory response for pathogen clearance this includes the use of interleukin-1 inhibitors drug-like anakinra for immune-based therapy of nSARS-CoV-2. Both quercetin and anakinra are antagonists of the human IL-1 receptor[95]. Quercetin and its derivative quercitrin are strong antioxidant agents with anti-inflammatory properties. They are widely used for the treatment of cardiovascular disease, osteoporosis, pulmonary disease, etc[96]. Quercetin reduces the levels of TNF, IL-1β, and IL-6 helps in maintaining the oxidative and inflammatory balance of the body [96, 97]. Quercetin dysregulating the expression of cytokines pro-inflammatory cytokines like TNF, IL-1β, and IL-6 etc. can be easily visualized in the quercetin targeted shortest path network. Hence both DV vector calculation and biological implication of quercetin targeted shortest path network show quercetin can be effectively used as a therapeutic intervention against nSARS-CoV-2.
N-acetylcysteine (NAC) is a known mucolytic drug for chronic respiratory diseases and has an established safety profile (oral doses of 600mg/day or 150 mg/kg in the nebulized format in patients with acute bronchopulmonary diseases like pneumonia, bronchitis, tracheobronchitis). It makes bronchial mucous less viscous and being a cysteine derivative, helps in breaking disulfide bridges between macromolecules, which leads to a reduction in mucus viscosity[98]. NAC is also an inducer of Glutathione synthetase (GSS), which is an important enzyme in glutathione biosynthesis. Glutathione (GSH) in the body has an antioxidant effect and it reduces the formation of proinflammatory cytokines, such as IL-9 and TNF-α, and also has vasodilator properties by increasing cyclic GMP levels and by contributing to the regeneration of endothelial-derived relaxing factor. The inhibition of glutamate receptors (GRIN1, GRIN2A, GRIN2D, GRIN3A) also promotes glutathione synthesis. This interaction can be easily visualized in NAC targeted shortest path network, where NAC promotes the GSS and its downstream signaling and inhibits glutamate receptors (GRIN1, GRIN2A, GRIN2D, GRIN3A). Because of these two properties NAC is already proposed as a potential treatment, preventive, and/or adjuvant against nSARS-CoV-2[49]. This shows that NAC could be an important drug for the treatment of nSARS-CoV-2 infection but DV calculation shows it presumably would be less effective as an nSARS-CoV-2 therapeutic agent as compared to quercetin.
2-DG is a mimicking agent of glucose and is an antagonist of glucokinase/hexokinase (GCK/HCK), an enzyme that converts glucose to glucose-6 phosphate in the glycolysis cycle [99]. It is also a rate-limiting step of the oxidative metabolism. The inhibition of GCK reduces the ATP levels in the cells. Hence there would be more AMP levels than ATP levels in the cells. This leads to the activation of AMP-Activated protein kinases (PRKAG, PRKAB, PRKAA) [100]. These AMP-activated kinases inhibit Janus kinase (JAK1) and activate STAT transcription factors, especially Signal transducer and activators of transcription-3 (STAT3) [101]. STAT3 regulates cell growth, differentiation, and apoptosis[102]. During SARS infection in Vero cells, it was found that the induction of STAT3 leads to cell apoptosis, thus helping to stop viral replication [103]. This interaction can be easily visualized in 2-DG targeted shortest path network where 2-DG inhibits the GCK and its downstream signaling and leads to the activation of STAT3 through different AMP-activated kinase intermediates. Though 2-DG has a therapeutic effect against SARS, it follows a more complex route to target genes associated with SARS and MERS. Hence as hinted by DV calculation, presumably it might not be as effective as quercetin and NAC as a re-purposed drug for COVID-19.
Network properties and Biological System Features to assess safety and efficacy balance
Therapeutic Index is a measure of relative safety of a drug. It is calculated as ratio of the dose that produces toxicity to the dose needed to produce the desired therapeutic response. Drugs with TI≤3 are considered less safe and referred to as Narrow Therapeutic Index (NTI) drugs. Drugs with TI >3 are considered better than NTI on safety standards and referred to as Not -Narrow Therautic Index (NNTI) drugs. However, determination of TI is very complicated for many drugs and is also highly susceptible to the variations of drug responses.
The efficacy-safety balance of a drug may be inferred from the network properties and biological system profile of the drug-genes[104]. So, an effort was made to assess the safety efficacy balance (i.e. NTI or NNTI) of the three drugs (quercetin, NAC and 2-DG) in the drug-gene-gene-disease network. Several connectivity and adjacency-based network properties, properties based on shortest path length, have been found to be significantly different (p-value <0.05) between the targets of NTI and NNTI drugs[104] (Table 2). Network properties were grouped based on their innate mutual dependence are highlighted in similar colours.
|
Parameter
|
NNTI
|
NTI
|
Quercetin
|
NAC
|
2DG
|
1
|
Closeness Centrality (avg)
|
Smaller
|
Larger
|
0.241
|
0.264
|
0.272
|
2
|
Harmonic Closeness Centrality
|
Smaller
|
Larger
|
635.697
|
695.509
|
715.510
|
3
|
Residual Closeness Centrality
|
Smaller
|
Larger
|
180.569
|
223.146
|
237.773
|
4
|
Avg Shortest Path Length
|
Larger
|
Smaller
|
4.242
|
3.846
|
3.709
|
5
|
Deviation
|
Larger
|
Smaller
|
3493.062
|
2473.818
|
2156.581
|
6
|
Distance Deviation
|
Larger
|
Smaller
|
1247.556
|
1572.156
|
1789.97
|
7
|
Distance Sum
|
Larger
|
Smaller
|
13093.437
|
9241.818
|
8910.581
|
8
|
Betweenness Centrality
|
Smaller
|
Larger
|
0.002
|
0.0069
|
0.0065
|
9
|
CurrentFlow Betweenness
|
Smaller
|
Larger
|
0.0049
|
0.01
|
0.0099
|
10
|
Load Centrality
|
Smaller
|
Larger
|
0.0020
|
0.0070
|
0.0068
|
11
|
Normalized Betweenness
|
Smaller
|
Larger
|
0.0121
|
0.03636
|
0.03280
|
12
|
Eccentric
|
Smaller
|
Larger
|
0.574
|
0.433
|
0.593
|
13
|
Eccentricity
|
Larger
|
Smaller
|
9.0625
|
9.1818
|
8.9354
|
14
|
Eccentricity Centrality
|
Smaller
|
Larger
|
0.1111
|
0.1091
|
0.1123
|
15
|
Degree
|
Smaller
|
Larger
|
21.5312
|
32.1818
|
25.9032
|
16
|
Degree centrality
|
Smaller
|
Larger
|
0.0088
|
0.0132
|
0.0107
|
17
|
Z score
|
Smaller
|
Larger
|
0.0180
|
0.04490
|
0.0287
|
18
|
Radiality
|
Smaller
|
Larger
|
0.7505
|
0.7810
|
0.7916
|
19
|
Clustering coefficient
|
Larger
|
Smaller
|
0.2505
|
0.432
|
0.2311
|
20
|
Interconnectivity
|
Larger
|
Smaller
|
0.3883
|
0.4976
|
0.3312
|
21
|
Neighbor connectivity
|
Smaller
|
Larger
|
35.0512
|
44.2683
|
37.0124
|
22
|
Topological coefficient
|
Larger
|
Smaller
|
0.3444
|
0.2276
|
0.2091
|
Biological System Features
|
1
|
No. of pathways affiliated by the primary therapeutic target
|
Smaller
|
Larger
|
45.59
|
54
|
59.51
|
2
|
No. of similarity proteins outside the target family
|
Smaller
|
Larger
|
32.53
|
53.27
|
38.45
|
3
|
Number of director interactors which are core essential genes
|
Smaller
|
Larger
|
7.6
|
24.02
|
9.1
|
Table 2: The network properties that have been found to be significantly different (p-value<0.05) between the targets of NTI and NNTI drugs. Network properties were grouped based on their innate mutual dependence at highlighted in similar colors. The values larger/smaller in NTI/NNTI drugs is listed. The corresponding average values of the target genes of quercetin, NAC and 2-DG are also tabulated. The profiles which match the NNTI are highlighted in red color.
Some key features such as average shortest path length show increase from the targets of NTI drug to that of NNTI one[104]. Among the three, quercetin showed the highest average shortest path length, followed by NAC and 2-DG (Table 2). Also some others such as average closeness centrality demonstrated a decrease from the targets of NTI drug to that of NNTI drugs[104]. The lower value of average closeness centrality of drug-target has been shown to demonstrate a less lethality risk [105]. The interconnectivity values were lower for lethal diseases like cardiovascular and oncogenic diseases [106]. Quercetin had the smallest average closeness centrality among the three (Table 2).
Based on these parameters, Quercetin and NAC were found to be closer to the profile of NNTI drugs. Properties which determine the connectivity of the target (i.e average shortest path length, bridging-coefficient, interconnectivity) showed an increasing trend from the targets of NTI to NNTI drugs[104]. These parameters for quercetin, NAC and 2-DG are depicted in Figure 8. Average shortest path length, bridging-coefficient were highest in quercetin; whereas interconnectivity was highest in NAC.
Parameters which determine the centrality of the target in the network (i.e. average closeness centrality, degree, radiality) showed a decreasing trend from the targets of NTI to NNTI drugs[104]. These parameters for quercetin, NAC and 2-DG are depicted in Figure 8. Average closeness centrality, degree, radiality were lowest in quercetin.
Biological system properties (i.e. affiliated pathways, number of similarity proteins) also showed decreasing trend from the targets of NTI to NNTI drugs[104]. The affiliated pathways and similarity proteins were lowest in quercetin targets (Table 2).
It has been proven that the TI-related mechanism could be a result of synergistically effects among these eight features (Figure)[104]. As per these parameters, among the three, quercetin followed the trends of NNTI drug. Other studies have also shown that number of similarity proteins and affiliated pathways as a good indicator of target drugability [107, 108].
Thus the targets of NTI drugs were highly centralized and connected in network, and the numbers of similarity proteins and target-affiliated pathways were higher than those of NNTI drugs [104].
A list of 1580 core-essential-genes was obtained[109]. These genes without any context-dependence have been important for survival. For each drug-gene, its top 50 direct interactors were obtained from STRING. The percentage of these direct interactors which were part of core-essential-genes were identified. It is presumed that the drugs which target the core-essential-genes would be NTI drugs. It was found that quercetin had the minimum number of core-essential-gene interactors (Table 2).