Cloning of Sod2 from HKI 335 maize inbred line
The ZmSod2 gene was amplified from cDNA using gene specific primers and cloned in standard plasmid vector, followed by sequencing. The ZmSod2 HKI 335 sequence was deposited in the GenBank database of National Centre for Biotechnology Information (NCBI), USA with accession number ALF00121.1. The cloned sequence (151 amino acid) contained all the characteristic conserved domains of Cu-Zn Superoxide Dismutase superfamily (PLN02386 family protein) (Figure S1).
Phylogenetic Analysis
The multiple sequence alignment shows aligning of different plant species for stress tolerance (Figure 1). Orthologous sequences of SOD2 from different species were aligned and further used for phylogenetic analysis using the neighbour joining (NJ) method. The results show that SOD2 in different plant species demarcated into two prominent clusters i.e., monocots (6) and dicots (5) (Figure 2).
Secondary Structure and Comparative Modeling
Using an online bioinformatics tool known as SABLE, we predicted the secondary structure of the target protein sequence (Figure 3). The secondary structure of the protein sequence viewed by polyview-2D show the absence of alpha-helices. At the same time, there are a large number of coils and beta-sheets with fully exposed relative solvent accessibility (RSA). The result indicate that it is a soluble protein and no transmembrane domains have been found (Figure S2). Most of the beta-sheets have RSA value greater than 2, which shows that the amino acids in the protein secondary structure are completely buried. The high value of the RSA of the target protein suggests that the protein is stable for a number of biological functions within the cell. The theoretical structure of SOD2 was generated using MOD web by comparative modeling of protein structure prediction through which identical or non- identical information about the target sequences is analyzed. We developed 14 models calculated for SOD2 out of which two models were selected. The model with LONGEST_DOPE had a score of -1.834 by using Modweb which was dynamically refined and validated. Comparative modeling revealed that the cloned SOD2 exhibits 83% identity with Potentilla atrosanguinea SOD2 (PDB ID: 2Q2L_A) with an e-value of 4e-86 (Figure 4).
Potentilla atrosanguinea, commonly known as Himalayan cinquefoil or ruby cinquefoil, is a vigorous herbaceous perennial of the rose family native to mountain slopes at lower elevations in the Himalayas. Surprisingly, SOD2 from Potentilla atrosanguinea have been previously reported as a highly thermostable SOD that can tolerate autoclaving as well as sub-zero temperatures [US Patent US20070269811A1; 19]. Further, a single C95A amino acid substitution in the wild type Potentilla atrosanguinea SOD2 is known to enhance thermal properties [20]. These findings prompted us to further investigate the SOD2 from HKI 335 maize inbred.
Model Validation
With the PDBsum, it has been shown that generated model of SOD2 protein revealed (~98.0% expected):193( 84.6%) residues falling in most favoured region, (~2.0% expected) : 34 (14.9%) residues in additionally allowed region, and 0.4% residues in generously allowed region with no residues in the disallowed region of the Ramachandran plot (Figure 5). Z-score of PROSA energy indicating overall model quality was used to check 3D models of protein structures for potential errors. In these plots displaying Z-scores, value (-6.51) of the target model was determined by X ray crystallography (represented in light blue) and nuclear magnetic resonance (represented in dark blue). This value was extremely close to the value of template 2Q2L (-6.44) (Figure 6). Root Mean Squared Deviation (RMSD) value indicates the degree to which the two three dimensional structures are similar. RSMD analysis of the SOD2 model was measured from its template (2Q2l_A). The Cα RMSD and backbone RSMD deviation for both the target and the template was 0.27 Å (Figure 7).
Physiochemical Properties
The target protein consists of very high percentages of glycine (18.5%), alanine (7.9%) and valine (9.9%) in comparison to other amino acids (Table 1). The high percentage of glycine (18.5%) shows that triple helical structure of the protein is likely to be more stable. Like that of glycine residues in protein, the proline also plays a major role in stability of helix of protein’s secondary structure. Here, the residues of proline show less percentage (5.3%). On the other hand, the percentage of glycine, valine and leucine among the hydrophobic groups are 32.56% and 17.44% respectively, while serine and threonine (both hydrophilic group) are 20.51% and 28.21% respectively. The isoelectric point (pI) value was 5.43, while the instability index was 20.1 (Table 2). The aliphatic index was 80.66, which is extremely high. The value of grand average of hydropathy (GRAVY) was -0.132 that lies between -2 and +2, suggesting that the protein is hydrophobic in nature and rated positively.The atomic composition of the protein is shown in Figure 8.
Table 1
Amino acid composition of SOD2, Zea mays, HKI 335
Amino Acid (AA)
|
AA
|
AA (Number)
|
AA (%)
|
Hydrophobic Group (%)
|
Hydrophilic Group (%)
|
Ala
|
A
|
12
|
7.9
|
13.95
|
|
Arg
|
R
|
3
|
2
|
|
|
Asn
|
N
|
6
|
4
|
|
15.38
|
Asp
|
D
|
11
|
7.3
|
|
|
Cys
|
C
|
2
|
1.3
|
|
5.13
|
Gln
|
Q
|
3
|
2
|
|
7.69
|
Glu
|
E
|
6
|
4
|
|
|
Gly
|
G
|
28
|
18.5
|
32.56
|
|
His
|
H
|
9
|
6
|
|
23.08
|
Ile
|
I
|
8
|
5.3
|
9.3
|
|
Leu
|
L
|
9
|
6
|
10.47
|
|
Lys
|
K
|
6
|
4
|
|
|
Met
|
M
|
2
|
1.3
|
2.33
|
|
Phe
|
F
|
4
|
2.6
|
4.65
|
|
Pro
|
P
|
8
|
5.3
|
9.3
|
|
Ser
|
S
|
8
|
5.3
|
|
20.51
|
Thr
|
T
|
11
|
7.3
|
|
28.21
|
Trp
|
W
|
0
|
0
|
|
0
|
Tyr
|
Y
|
0
|
0
|
|
0
|
Val
|
V
|
15
|
9.9
|
17.44
|
|
Pyl
|
O
|
0
|
0
|
|
|
Sec
|
U
|
0
|
0
|
|
|
|
|
|
|
|
|
|
B
|
0
|
0
|
|
|
|
Z
|
0
|
0
|
|
|
|
X
|
0
|
0
|
|
|
Table 2
Physio-chemical properties of SOD2
Number of AA
|
Molecular weight
|
Theoretical pI
|
Formula
|
Negatively charged residues (Asp + Glu)
|
Positively charged residues (Arg + Lys)
|
Extinction coefficients
|
Instability index
|
Aliphatic index
|
GRAVY
|
151
|
15103.76
|
5.43
|
C650H1033N193O214S4
|
17
|
9
|
125
|
20.1
|
80.66
|
-0.132
|
Post Translational Modification (PTM) Site Prediction
The post translational modification (PTM) is described as amino acid modification on the basis of protein sequence and also considered as vital issues for regulating the physiological and biological functions inside the cell. We could identify only one PTM site (Phosphorylation), located at position 20 having residue S (sequence: GTDVKGTIFFSQEGDGPTTVT) with score 0.692.
Protein-Protein Interaction (PPI) Analysis
It has been observed that the interacting partner proteins with query protein are IDP712 (uncharacterized LOC100283786 in Zea mays), sod3, pco095461 (Superoxide dismutase in Zea mays), GRMZM2G139680_P01 (2-cys peroxiredoxin BAS1 in Zea mays), SODA.3, 541646, GRMZM2G157018_P01 (ATP synthase subunit d, mitochondrial in Zea mays), GRMZM2G125151_P01 (uncharacterized LOC100283203 in Zea mays) and PER1 (period circadian regulator 1 in humans) with similarity scores 0.979, 0.883, 0.852, 0.847, 0.847, 0.837, 0.837, 0.835, 0.789 and 0.761 respectively (Figure 9, Table 3). The process and function of SOD2 has also been compared with other proteins (Table 4). Other analysis such as protein interaction descriptions and other parameters are furnished in supplementary files (Table S1 and Table S2)
Table 3
Interacting proteins and their similarity scores with our query protein SOD2
Interacting Partner
|
Score
|
IDP712
|
0.979
|
sod3
|
0.883
|
pco095461
|
0.852
|
GRMZM2G139680_P01
|
0.847
|
1E+08
|
0.847
|
SODA.3
|
0.837
|
541646
|
0.837
|
GRMZM2G157018_P01
|
0.835
|
GRMZM2G125151_P01
|
0.789
|
PER1
|
0.761
|
Table 4
Process and function of SOD2 compared with other proteins from protein-protein interaction
Process
|
#term ID
|
term description
|
observed gene count
|
background gene count
|
strength
|
false discovery rate
|
GO:0098869
|
cellular oxidant detoxification
|
4
|
14
|
3.01
|
6.57E-10
|
GO:0019430
|
removal of superoxide radicals
|
3
|
5
|
3.33
|
6.60E-09
|
GO:0006950
|
response to stress
|
4
|
68
|
2.32
|
1.29E-08
|
GO:0055114
|
oxidation-reduction process
|
4
|
116
|
2.09
|
6.96E-08
|
Functions
|
GO:0016209
|
antioxidant activity
|
4
|
14
|
3.01
|
1.29E-10
|
GO:0004784
|
superoxide dismutase activity
|
3
|
5
|
3.33
|
5.85E-09
|
GO:0016491
|
oxidoreductase activity
|
4
|
107
|
2.13
|
6.25E-08
|
GO:0046872
|
metal ion binding
|
3
|
136
|
1.9
|
1.50E-05
|
Domain Composition Prediction
Domains play a significant role in the functional activities of proteins in the cell, and are either active regularly or during the process of evolution. The domain present in SOD2 protein sequence is Sod_Cu, which is metalloprotein that helps in the prevention of damage by free radicals with the catalysation of SOD into O2 and H2O2. Two outliers homologous and homologous of known structures are also found (Table S3). The fragment GFHVHALGDTT was identified as the plant Dismutase signature motif (42-52 residues). This was a consensus pattern in both target sequences of Zea mays (HKI 335) and template Potentilla atrosanguinea. GNAGGRUACGII signature motif was identified from heme ligand site which was conserved in both SOD2 of target and template sequences.