P. aeruginosa PA01 is an omnipresent pathogenic bacterium that can cause acute and chronic infection to humans by contaminating environmental water and food, daily food spoilage, and infections. It is a rising concern for its increasing resistance against a broad range of antimicrobials. The biofilm-forming ability and evolution of antibiotic tolerance shapes pseudomonas isolate highly resistant against imipenem (95.3%), trimethoprim-sulfamethoxazole (69.8%), aztreonam (60.5%), chloramphenicol (45.3%), and meropenem (27.9%) [69]. Factors like chromosomal mutations and transferring of resistant genes through horizontal gene transfer contribute to its broad-spectrum drug resistance property [70]. Thus, it is necessary to introduce a new drug target when there will be noticed multi-drug resistance for any diseases or problem. In silico process has a great advantage for the identification of new drug targets in that situation within a very short time. Consequently, to combat the ever-increasing danger of antibiotic resistance, identifying novel drug targets is a dire necessity. A drug target should have some properties before it is considered as a new target which includes being non-homologous to human proteome, human anti-targets, human gut microbiota, having virulence capability, having druggability, and so on. For this reason, we scrutinized the properties of our targeted essential hypothetical proteins where analyzing these EHPs from multidrug resistance bacteria can lead to the identification of new potential therapeutic solutions.
We searched for essential hypothetical proteins (EHPs) among the 336 essential proteins of this bacterial strain to meet this need. Essential genes/proteins are those that are vital for a pathogen's survival and thereby analyzing their functions and metabolic pathways, crucial information that may be central to life can be retrieved. In this research, we discovered 18 EHPs for the first time that may provide valuable information about the pathogenesis, molecular mechanisms, and functions of this bacteria. Functional annotation is a prerequisite in understanding the pathogen metabolic pathways and the products that they synthesize for their survival in adverse conditions. Moreover, domain analysis, which is a basic, distinctive, and stable unit of a protein structure that is fiercely conserved during the evolutionary process, is crucial for further investigation [71]. Moreover, the function of a protein is directly or indirectly related to the subcellular localization [72]. The physicochemical properties of a protein depict a chemical assessment that shows the identity of chemical nature, physical hazards and to understand or predict molecular attributes. The combined analysis of the physicochemical properties helps to characterize the proteins annotated as hypothetical proteins from the genome of an opportunistic pathogen like P. aeruginosa PAO1(Table 3) [73].
In this study, the PPI network has provided congruent meaningful insights into the protein’s function. Here, we have looked for potential relativity to our predicted function of EHPs and their connectivity with proteins involved with functionally important activities. The protein PA2986 (NP_251676.1) related to the MacB-like periplasmic core domain, represents a connection with 45 proteins of which 8 are hypothetical proteins (HP). A notable number of proteins grouped with PA2986 are involved in protein translocation activities such as translocation protein TolQ, TolR and TolB. Tol proteins show activity in gram-negative bacteria by providing stability to the outer membrane [74]. Moreover, ABC transporter ATP-binding protein (PA0073) is related to it as it functions by utilizing TolC exit duct by shifting substrates to extracellular space from the periplasm [75]. This finding supports the idea of PA2986 being a member of the MacB-like periplasmic core domain. Another important protein for bacterial survival lysS, a lysine tRNA ligase was found to interact with PA2986 which is a mutant in some gram-negative bacteria conferring resistance against the OP0595, diazabicyclooctane b-lactamase inhibitor (an antibiotic) [76]. We have found Penicillin-Binding Protein 1 called ponA protein in this group of networks. Alteration in the ponA protein ( penicillin-binding protein 1A) has a significant role in harnessing Chromosomally mediated resistance against penicillin in N. gonorrhoeae [77]. Similarly, other considerable proteins like outer-membrane lipoprotein carrier protein lolA, transporter ExbB, penicillin-binding protein 1A (ponA) interacted with PA2986 Fig. 3.
PA0759 (NP_249450.1) has got the 2nd largest degree value having 41 nodes (Table 4) in connection of which 17 are HPs. The highest betweenness centrality value of 17881.76 determines its significance towards cell signaling pathways as in the case of directed or regulated networks, Betweenness centrality is considered to be a much robust essentiality indicator than degree value [78]. Genes in this hub include proteins having prime roles in translational regulation and cellular metabolic activities such as glycine cleavage system protein T2 [79], translation elongation factor (tsf) [80], and ribosomal large subunit pseudouridine synthase C (rluC) [81], respectively. Moreover, RecO protein in this network is a replication repairing protein from the RecF recombination repair pathway that facilitates both DNA strand annealing and DNA recombination in complex with RecA protein found in high radiation tolerant bacteria Deinococcus radiodurans [82]. This property may also contribute to better survival efficacy for P. aeruginosa PA01 in extreme conditions.
The PA4562 (NP_253252.1) protein is a probable member of the Lipid II flippase MurJ family which is used for the genesis of lipid II on both inner and outer leaflets and that ultimately produces peptidoglycan in almost every bacterial species. Peptidoglycan is the primary protective foundation for shielding against environmental hazards and is involved in cell wall organizations [83]. Proteins related with morphological importance in bacteria such as flagellar basal body rod protein ( FlgC) [84], rod shape-determining protein (rodA) [85], type 4 fimbrial biogenesis outer membrane protein (PilQ) [86] are present in a connection with the PA4562 proteins that strengthen our prediction regarding this protein function.
The proteins PA3685 (NP_252375.1) and PA3767 (NP_252456.1) are adjoined with 12 and 6 HPs respectively. Both of these proteins are largely involved with enzymes of different molecular functions-tRNA N6-adenosine threonyl carbamoyl transferase (gcp) is a universal structural modifier found at position 37 of tRNAs that provides the anticodon loop with greater binding efficiency to ribosomes invitro in E. coli [87]. The protein UDP-2,3-diacyl glucosamine hydrolase (PA1792) is hypothesized to be catalyzing lipid-A biogenesis in E. coli bacteria [88]. Lipid-A is a saccharolipid that modulates lipopolysaccharide (LPS) anchorage on the outer leaflet of the outer membrane in gram-negative bacteria which is an essential component for the bacteria shielding from antibiotics and sustain its viability [89]. Besides other proteins having enzymatic properties include thiamine monophosphate kinase (thiL), ATP-dependent DNA helicase DinG (PA1045), riboflavin-specific deaminase/reductase (ribD), amidotransferase(PA1742), acetyltransferase (PA2631) Fig. 3.
The majority of the proteins connected with PA4992 (NP_253679.1) from aldo/keto reductase family are uncharacterized proteins. The PA4167 protein has a contributing role as a source of carbon and energy for a large number of bacteria [90]. The hub proteins PA4093 (NP_252782.1) and PA4992 (NP_253679.1) are interconnected through the intermediate protein PA4098, a probable short-chain dehydrogenase enzyme [91].
The mgtE is an Mg transporter that represents a connection with the PA3481 (NP_252171.1) which can be an opportunistic inhibitor for the type III secretion system (T3SS). T3SS is a formidable toxin injected by P. aeruginosa that can ultimately cause cell death into its host. The mgtE interrupts with the T3SS transcription regulation system by provoking rsmYZ gene transcription hence inhibits T3SS protein expression [92]. Interestingly another analogous protein, DNA polymerase II (polB) is associated with this same hub protein PA3481. PolB functions as a crucial candidate for repressing the translation process of master T3SS regulator ExsA. ExsA operates a major role in maintaining the regulatory cascade of T3SS. Thus affecting ExsA expression can prohibit T3SS toxin secretion process. Furthermore, S Chakravarty et al., found that T3SS transcription is attenuated when polB is overexpressed. Therefore, polB may act as a promising target for therapeutic interventions [93]. Besides, proteins responsible for exopolysaccharide biosynthesis and biofilm formation namely pslA [94] and pslD [95] are both members of the psl operon. The presence of such virulent protein types in this protein hub suggests PA3481 as a crucial protein involved in multiple virulence pathways in P. aeruginosa. The PA4636 (NP_253326.1) protein harbors some of the virulent proteins like lptA and algQ that is required for the biogenesis of lipid bilayer in the outer membrane in P. aeruginosa [96] and facilitates in developing a chronic infection in cystic fibrosis [97].
Some notable mutual interactions also have been observed between two hub proteins like PA2986 and PA4562 where interrelated proteins include – mraY, a potential target for antibiotic development is a crucial element for the bacterial cell wall synthesis [98]; opr86, an outer membrane protein found previously in all gram-negative bacteria. Likewise suggested as a potential drug target with significant therapeutic potential against P. aeruginosa in earlier studies [99]; rpoH, a 32-kDa heat shock protein in E. coli can also take part as a complementary for sigma factor during the increasing temperature in the environment as well as while starving [100]; PA5568 possess an inner membrane translocation subunit protein YidC which facilitates proteins to be passed onto inner membranes without the help of Sec translocase complex proteins [101]; ComL is a lipoprotein that facilitates the DNA transformation process in N. gonorrhoeae [102]. Lastly, organic solvent tolerance protein OstA holds interaction simultaneously with the top 3 hub genes of maximum node connection. Concurrently, OstA is a protein of high molecular significance as it is found in almost all gram-negative bacteria and is involved in the bacterial envelope biogenesis process. A study by HC Chiu et al., found that OstA deficiency in Helicobacter pylori causes sensitivity to organic solvent, impaired membrane permeability, and vulnerability to antibiotics [103]. The function of the proteins in this network shows relational integrity with our predicted HPs. Knowing the protein’s function in a protein-protein interaction network can facilitate the process of discovering the proteins with unknown functions [104].
Herein, we analyzed these properties of our targeted essential hypothetical proteins where PBIT servers direct categorized all the non- homology features (Table 5). We have selected NP_249450.1 and NP_251676.1 respectively for being virulent determined by two of our tools with strong confidence scores. Targeting these virulent factors can limit the pathogenicity of P. aeruginosa. Even antivirulence drugs insist a pathogen towards a weaker selection for resistance in them compared to antibiotics [11]. Therefore, understanding the virulence factors and their role in pathogenesis can lead us to a new potential therapeutic solution. Besides, druggability analysis also confirmed that NP_249450.1 and NP_251676.1 can be a new and potential drug targets.
Structure prediction and quality assessment of the predicted structure are also parallelly important to evaluate the molecular and biological functions of a protein in cells for in-depth analysis and drug target identification [105]. Before structure prediction, the information of Alpha helix (Hh), Extended strand (Ee), Beta turn (Tt), or Random coil (Cc) helps to establish the secondary structure. That’s why the secondary structure was annotated to complete the structure related to all the information of our selected two proteins (Fig. 4, Fig. 5, and supplementary table 3). Thus, we further predicted the 3D structure of two EHPs (NP_249450.1 and NP_251676.1) and assessed the quality of these structures to decipher their unique conformation. The 3D structure and Ramachandran plot are depicted in Fig. 6 and Fig. 7, respectively. The predicted structure's quality assessment parameters are listed in Table 6, and the overall quality factor is shown in Supplementary Fig. 1. Our predicted structure is accurate and reliable, according to Ramachandran plot analysis, since more than 90% of residues are considered the cutoff value and our findings surpass that range by more than 94%. Therefore, this structural and functional information will open a new window for further identification of potential drug candidates that can halt the surge of this pathogenic bacterium from becoming resistant.