The USPs were first reported in bacterial systems as a 13.5 kDa cytoplasmic protein triggered by various stressors, including nutritional deprivation, toxic chemicals such as oxidants, heavy metals, acids, and antibiotics, and play an essential role in cell survival under stressful situations(Nyström and Neidhardt, 1992). Subsequently, many USPs in prokaryotes were identified and categorized (Tkaczuk et al., 2013). Proteins of the USPA superfamily play a crucial role in the body's defense mechanism against a variety of stressors. USPA is generated abundantly during growth arrest in E. coli in response to general stress conditions. Universal stress proteins have been reported to play a role in bacteria's persistence during stressful environmental conditions. Previous research has revealed that strains of E. coli(Nyström and Neidhardt, 1993, Nyström and Neidhardt, 1994, Nachin et al., 2005) and Salmonella Typhimurium (Liu et al., 2007) with a deletion of the uspA gene have a reduced ability to survive for lengthy periods of time during stasis caused by various stressors such as nutrient deprivation, temperature fluctuations, and oxidative stress. Although the biochemical mechanisms underlying the USPA proteins' ability to protect against stress are still mostly unclear. Here we demonstrated that the Salmonella Typhimurium strain possesses a functional uspA homolog and fallowing are the results of our study on USPA of Salmonella Typhimurium.
Identification of the uspA gene and sequencing
The uspA gene from the Salmonella Typhimurium strain was amplified by PCR using self-designed primers (Fig. 1). The amplicons were successfully cloned into the pET28c expression vector. The clones were then transformed into E. coli DH5α cells, and after growing on LB-agar with kanamysin (30mg/ml), the clones were screened by colony PCR, plasmid PCR, re-digestion by restriction enzymes to release insert and finally sequenced (Fig. 2). The nucleotide sequence was submitted to NCBI GenBank with the accession number OM105654.
The comparison of the sequence of uspA gene with that reported in GenBank showed that identities were 100% (Table 2). Furthermore, sequence analysis also revealed the uspA gene is 435 bp long, encodes a protein of 144 amino acids with a calculated molecular mass of 16 kDa, and belongs to the USP family. Also uspA of Salmonella Typhimurium was 100% identical to the uspA gene of Salmonella LT2 strain available in the database(Fig. 5).
Homology modeling and validation
The availability of the 3D structure of the target is most crucial to proceed for the structure-based drug design. Since the 3D structure of Universal stress protein A (USPA) of Salmonella Typhimurium, is not available in the Protein Data Bank (PDB), therefore its 3D structure was predicted by the homology modelling method using the Swiss model and HHpred server,respectively(Kumawat et al., 2020). To identify the best template(s) for the target protein, database searching was performed using the BLASTp program. A template (PDB ID: 1JMV_A) having 68.57% sequence identity was selected for predicting the 3D model of the USPA. The sequence identities and GMQE value are important parameters that are considered during the template(s) selection. The templates which are having high sequence identity (> 30%) and GMQE score closer to 1.0 are often selected for the reliable model generation of the target protein. The multiple sequence alignment was performed for the target-template sequences using the Clustal Omega program and aligned target-template sequences were submitted to the Swiss model server(Waterhouse et al., 2018)and HHpred server (Guindon et al., 2010, Gabler et al., 2020, Eswar et al., 2006) respectively. Then 3D model of the USPA was created (Fig. 3A). In order to evaluate the stereo-chemical properties as well as to find any anomaly in the predicted model, it was uploaded to the SAVES v6.0 server. The quality of models predicted by Swiss-model as well as HHpred server were compared, and it was found that better quality model was predicted by the Swiss-model server (Table 1).
Table 1
Comparison of quality of model predicted by Swiss-Model and HHpred Server
S.No.
|
Module
|
Swiss Model Server
|
HHpred Server
|
1.
|
Errat (Overall quality factor)
|
100
|
81.82
|
2.
|
Verrify 3D(% of the residues have
averaged 3D-1D score > = 0.2)
|
81.02 (Passed)
|
70.71 (Failed)
|
3.
|
Procheck (Ramachandran plot)
|
i. Core region: 92.7%
ii. Allowed region: 7.3%
iii. Gen. allowed: 00
iv. Disallowed region: 00
v. Bad contacts: 00
|
i. Core region: 93.7%
ii. Allowed region: 4.7%
iii. Gen. allowed: 00
iv. Disallowed region: 1.6
v. Bad contacts: 01
|
Table 2
Result of nucleotide blast showing comparison of uspA gene of S.enterica serovar Typhimurium strain 3232 with other Salmonella serovar
S.No.
|
Description
|
Max Score
|
Total Score
|
Ident
|
Accession
|
1
|
Salmonella sp. A39 chromosome, complete genome
|
804
|
804
|
100%
|
CP084194.1
|
2.
|
Salmonella enterica subsp. enterica serovar Newlands strain ZC-S1 3rd chromosome, complete genome
|
804
|
804
|
100%
|
CP082916.1
|
3.
|
Salmonella enterica strain SP chromosome, complete genome
|
804
|
804
|
100%
|
CP077668.1
|
4.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S2188 chromosome, complete genome
|
804
|
804
|
100%
|
CP082523.1
|
5.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S2170 chromosome, complete genome
|
804
|
804
|
100%
|
CP082526.1
|
6.
|
Salmonella enterica subsp. enterica serovar Hadar strain CVM N18S2154 chromosome, complete genome
|
804
|
804
|
100%
|
CP082531.1
|
7.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S1677 chromosome, complete genome
|
804
|
804
|
100%
|
CP082543.1
|
8.
|
Salmonella enterica subsp. enterica serovar Hadar strain CVM N18S1943 chromosome, complete genome
|
804
|
804
|
100%
|
CP082540.1
|
9.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S1634 chromosome, complete genome
|
804
|
804
|
100%
|
CP082553.1
|
10.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S1595 chromosome, complete genome
|
804
|
804
|
100%
|
CP082558.1
|
11.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S0981 chromosome, complete genome
|
804
|
804
|
100%
|
CP082577.1
|
12.
|
Salmonella enterica subsp. enterica serovar Enteritidis strain SE211 chromosome, complete genome
|
804
|
804
|
100%
|
CP084532.1
|
13.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S0666 chromosome, complete genome
|
804
|
804
|
100%
|
CP082596.1
|
14.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S0645 chromosome, complete genome
|
804
|
804
|
100%
|
CP082674.1
|
15.
|
Salmonella enterica subsp. enterica serovar Typhimurium strain CVM N18S0597 chromosome, complete genome
|
804
|
804
|
100%
|
CP082681.1
|
Procheck results of swiss-model show that on the Ramachandran plot (RC), 92.7% of amino acid residues fall in core (most favored) regions, 7.3%in allowed, and no residue falls in the disallowed region. The wrong contacts in the 3D model were also found to be zero. These evaluation parameters suggest that the Swiss-model server predicted a good quality model of USPA(Fig. 3B). Moreover, the protein model in which > 90% residues fall in core regions on RC plot can be utilized for further studies such as molecular dynamics simulation or structure-based drug designing (Yadav et al., 2013).
Phylogenetic analysis of uspA gene
All the related uspA gene sequences were aligned by the multiple sequence alignment method using the Clustal omega server. The aligned sequences were given as input files in PHYLIP file format in the PhyMLv3.0 server, keeping default parameters. A phylogenetic tree was constructed using the SMS (Smart Model Selection) algorithm(Lefort et al., 2017)incorporated in the PhyML program. For tree construction, standard bootstrap value was set to as 100, which indicates how many times out of 100 the same branch was observed when repeating the phylogenetic reconstruction on a re-sampled set of input data. The phylogenetic tree was constructed and subsequently the tree layout was visualized in linear and radial view respectively (Fig. 4A, B).
The phylogenetic tree was constructed using a 0.5–2.5% scale bar. The number on the scale in Fig. 4A suggests the percentage of genetic variation in the corresponding taxon. The evolutionary history of the uspA gene was traced back to the phylogram depicted in Fig. 4A. The tree represents that the uspA gene of Salmonella Typhimurium is closely related to the uspA gene of bacteria Shigella sonnei, E. coli K-12 and Proteus mirabilis. All these share a common lineage with Serratia marcescens. Furthermore, these genes were also sharing a significant evolutionary relationship with other bacteria such as Salmonella enterica, Pseudomonas simiae, and Bacillus subtilis subsp. subtilis 168.
Interestingly, uspA gene from another plant, such as Brassica rapa, also shows some evolutionary relationship with bacteria. It indicates that uspA genes might have been evolved from bacteria to some other plants, which are commonly expressed in stress conditions. The uspA genes from Oryza sativa and Vitis vinifera are distantly related to other organisms, and have evolved from a separate lineage.