In-silico Saturation Mutagenesis of PR65. The in-silico saturation mutagenesis study of PR65 was performed using a recently introduced structure- and dynamics-based machine learning methodology, implemented in the online accessible tool Rhapsody32, 37 (Figure S1). The approach allows for assessing the impact, neutral or pathogenic, of any substitution at any residue along the protein based on sequence (conservation and co-evolution), structure (accessible surface area), and dynamics (equilibrium fluctuations, allosteric couplings, and mechanical behavior) of the protein. In parallel, we estimated the change in folding free energies associated with point mutations using ProTSPoM33 (Figure S2A). ProTSPoM uses residue physicochemical and energetic properties in the folded state, environmental compatibility, and evolutionary information to predict the change in Gibbs free energy (DDG) of folding associated with point mutations.
Figure 1C-F presents the results for the apo structure of PR65. The diagram in panel C is color-coded by average pathogenicity score for each residue I, i.e., the probability of having a deleterious/pathogenic effect upon mutating the ith residue, averaged over all 19 amino acid substitutions at that position. The effects of the individual substitutions are described by the elements of the ith column in the saturation mutagenesis heat map (see Figure S1 for a PR65 segment). The scores vary from 0 (neutral, blue) to 1 (strongly pathogenic, red), with a cutoff of 0.65 determining the decision between neutral and deleterious. Using the residue ranges defined earlier36, we evaluated the pathogenicity scores of the residues within each of the 15 HEAT repeats. The results are presented in Figure 1D, organized by repeat number (ordinate) and corresponding residue positions (abscissa). The heat map shows that the counterparts of the repeat 1 residues P11 (in helix 1), L26, R27, S30, and L34 (in helix 2) in all repeats consistently exhibit high pathogenicity probabilities. These residues are indicated by blue arrows along the abscissa. Their resistance to tolerate mutations is consistent with their high degree of sequence conservation at those positions, usually occupied by hydrophobic residues (leucine, valine, and isoleucine) or by arginine. See the counterpart of this heat map corresponding to change in free energy of folding, ΔΔG, in Figure S2B.
Closer examination of HEAT repeat structural elements (loop 0, helix 1, loop 1, helix 2 and loop 2; Figure 1E) revealed the distinctive behavior of inter- and intra-repeat elements. In Figure 1F we present the average pathogenicity for these structural elements. The corresponding residue ranges are listed in Table S136. Notably, the loops 0 and 2 linking successive repeats generally exhibit relatively high pathogenicity if mutated, suggesting the high sensitivity of PR65 if not inability to tolerate mutations at inter-repeat regions. See the peaks in Figure 1F between repeats 1-2, and those at the loop 0 or 2 between repeats 6-7, 8-9, 10-11, 11-12, and 12-13. The latter two represent kinks of single residues (G434 and G473, respectively), rather than loops, that presumably play a critical role. The counterpart of this analysis for ΔΔG is presented in Figure S3, which also draws attention to the critical role of S119 between repeats 3-4.
This analysis therefore identified the inter-repeat loop residues to generally play a critical role in ensuring the overall stability and/or functional flexibility of PR65. Closer analysis also identified specific mutations at the regulatory and catalytic subunit interfaces of PR65 that would induce the strongest destabilization and pathogenicity. Table 1 lists these mutations, Figure S4 displays their location in the PP2A structure. We note again the propensity of helix 2 residues among these critical sites.
Table 1. PR65 mutations distinguished by highly destabilizing and/or pathogenic effects
Location
|
Mutation
|
Repeat #,
secondary structure
|
ΔΔG (kcal/mol)
|
Pathogenicity score
|
Interface with the regulatory subunit
|
M179H
|
Repeat 4, helix 2
|
1.52
|
0.71
|
R182T
|
Repeat 5, helix 2
|
1.46
|
0.77
|
S255F
|
Repeat 6, loop 1
|
1.47
|
0.86
|
W256H
|
Repeat 6, helix 2
|
2.08
|
0.82
|
Interface with the catalytic subunit
|
W416F
|
Repeat 11, helix 2
|
1.23
|
0.82
|
R497T
|
Repeat 13, helix 2
|
1.34
|
0.81
|
Selection of Hinge Site Mutations. Our goal in this study was to explore the possibility of altering the conformational state and flexibility, and thereby function, of PR65 without completely destabilizing the scaffold or abolishing its function. Toward this goal, we turned our attention at residues predicted to play a key mechanical role as hinges/anchors during cooperative movements (global modes of motion) of PR65. We focused on 21 residues predicted by the Gaussian network model (GNM)38, 39 to participate in hinge regions modulating the softest six modes (Table S2) and explored how substitutions at those sites could alter the PR65 structure and dynamics. Figure 2A-D shows the shapes of GNM modes 1-4 (left) and corresponding diagrams color-coded by the direction (middle) and size (right) of residue displacements along those modes.
Based on the pathogenicity scores and ΔΔG values predicted for selected point mutations at those hinge sites (Table S2), we categorize the hinge site mutations into two broad groups, (a) pathogenic and (b) non-pathogenic depending on their pathogenicity scores. The former group also entails an increase in ΔG, indicating that these mutations would be destabilizing. The latter group, on the other hand, which is of interest as potential mutations that can alter the function while retaining the fold, is divided into three subgroups based on their DDG values: (b1) not destabilizing (negative ΔΔG), (b2) mildly destabilizing (ΔΔG < 1.25 kcal/mol), and (b3) highly destabilizing ΔΔG ≥ 1.25 kcal/mol, as mentioned in Table S2.
Experimental assessment of the thermodynamic stability of mutants. We previously used E. coli to express PR65 WT and mutants for folding studies20. To test the thermodynamic stabilities of the mutants in Table S2, we first examined the mutations in group (a), which were predicted to adversely affect the protein or function (pathogenicity score > 0.65). As seen in Table S2, the group comprises of 12 mutations. Most of them are also predicted to be highly destabilizing (DDG ≥ 1.25 kcal/mol). We performed small-scale protein expression tests in E. coli on them and the results showed that 11 of these mutants had no expression or were insoluble, indicative of instability and thus consistent with the predicted impact of these mutations from the computational analysis. On the other hand, out of the six mutations predicted to be stabilizing and non-pathogenic (subgroup b1), three - D315E, S323L, E375D - were expressed in good quantity. One mildly destabilizing and non-pathogenic mutation (subgroup b2), F502W, was also successfully expressed. Notably, two subgroup b3 mutations Y186V, and L197V (predicted to be neutral by Rhapsody but destabilizing by ProTSPoM), were also expressed in good yield.
We next performed large-scale expression of these six mutants and used thermal unfolding to qualitatively assess their thermodynamic stabilities as measured by melting temperature (the temperature at which the protein is 50% unfolded) (Table S3). All mutants had melting temperatures within 1 °C of the WT value (51.3 °C), indicating that the mutations had only very small effects on stability. Therefore, experiments were in general in accordance with predicted potential pathogenicity; and they were also consistent with the predicted mild-to-none effects of mutations on fold stability except for the b3 mutations. We therefore moved on to examine the impact of these hinge site point mutations on the structure, dynamics, and potentially function, of PR65 by molecular dynamics (MD) simulation and optical tweezer experiments.
MD simulations indicate that the mutants S323L, E375D, and F502W preferentially sample extended conformations. Simulations were initiated from the compact form of PR65, taken from the heterotrimeric PP2A3. The distributions of the end-to-end distances, defined as the distance between the Cα atoms of N29 and F577 (as in earlier work36), are presented in Figure 3A and B-G (six mutants). In each case, the average histogram deduced from triplicate runs is shown in the left panel, and the individual histograms from each of the three runs are shown in the middle. The mean values and standard deviations (SD) are written in each case, and their averages over triplicate runs are reported in Table 2 columns 3 and 4. The panels on the right of Figure 3 show the time evolution of the end-to-end distance for each run.
Table 2. Pathogenicity and conformational dynamics of the six selected mutations
Mutation
|
Results from theory and computations
|
Results from Nanoplasmonic
Optical Tweezers experiments
|
Pathogenicity score [0-1]
|
Mean
end-to-end distance (Å)
|
Standard deviation in end-to-end distance (Å)
|
Corner frequency (Hz)
|
Normalized RMSD
|
(mean ± SD)
|
S323L
|
0.24
|
70.3
|
9.4
|
122.82 ± 5.4
|
0.0018 ± 0.0009
|
F502W
|
0.36
|
70.4
|
9.9
|
45.07 ± 2.0
|
0.0019 ± 0.0007
|
WT
|
0
|
66.2
|
9.8
|
24.66 ± 2.9
|
0.0021 ± 0.0009
|
E375D
|
0.20
|
68.4
|
8.9
|
25.75 ± 5.9
|
0.0030 ± 0.0007
|
Y168V
|
0.18
|
64.6
|
11.2
|
20.97 ± 4.3
|
0.0033 ± 0.0006
|
L197V
|
0.04
|
67.2
|
9.7
|
16.78 ± 1.2
|
0.0067 ± 0.0015
|
D315E
|
0.03
|
58.2
|
15.6
|
15.99 ± 3.1
|
0.0099 ± 0.0028
|
Consistent with our prior study16, WT PR65 (Fig 3A) samples a broad range of end-to-end distances, from 36.3 Å to 95.8 Å, including those resolved for the compact (47.7 Å; see Figure 1B) and extended structures (76.3 Å), with a mean value of 66.2 Å. All three independent runs consistently exhibited similar distributions.
The triplicate runs conducted for each of the mutants S323L, E375D, and F502W also showed overlapping distributions of end-to-end distances despite small shifts (middle histograms in Figure 3E-G). However, the main difference from the WT PR65 was the shifts in the end-to-end distances towards more extended states. The corresponding mean end-to-end distances (70.3, 68.4 and 70.4 Å, respectively; cumulative histograms on the left) are larger than that of the WT. Thereby, these mutations favor more extended conformations in comparison to the WT PR65.
In contrast, the mutants Y168V and L197V (Figure 3B-D) were observed to sample end-to-end distances comparable to that of the WT PR65, if not more compact forms. D315E was able to sample much more compact conformations (as low as 17.5Å) than WT and gave the lowest mean end-to-end distances. For Y168V, L197V, and D315E, the main effect of mutations seems to compromise the ability of the structure to uniformly sample the conformational space; instead, the individual runs tend to gravitate/drift towards different forms, as evidenced by the histograms (middle diagrams) that show only a partial overlap. This effect was particularly pronounced in the mutant D315E, where two of the runs sampled rather compact forms with new peaks appearing at end-to-end distance of 41.3 and 48.0 Å -; whereas the third run sampled an extended form (mean value of 72.7 Å) with no transition to the compact form (Figure 3D).
The interface between repeats 12 and 13 significantly contributes to the opening and closing of PR65. Figure 4 displays the root-mean-square-fluctuations (RMSF) profile of residues (average size of fluctuations observed in triplicate trajectories). The regular patterns of the repeat units can be distinguished. Panel B displays the mutants colored by their RMSFs, in line with the shades in panel A. Examination of the RMSFs shows that the structure can be divided into three substructures: a middle section comprised of repeats 3-12 that shows small displacements (in blue), flanked by two segments (N-terminal repeats 1-2 and C-terminal repeats 13-15) that move significantly in space (in green).
Further examination of these individual sections shows that their spatial displacements do not necessarily reflect their conformational flexibilities. For example, even though the middle section is subject to minimal motions, it undergoes substantial internal rearrangements or deformations, as measured by the internal RMSDs (between 2.7 Å and 4.0 Å) evaluated for each mutant (by aligning with compact PR65 (PDB: 6NTS)). These rearrangements are presumably required to accommodate the local conformational fluctuations with minimal effects on the flanking regions. In contrast, the N- and C-termini that significantly move in space show much smaller internal RMSDs indicative of en-bloc movements of the repeats. In particular, the C-terminal section undergoes such rigid overall reorientation with respect to the middle section, enabled by hinge-bending at the interface between repeats 12 and 13. The internal RMSDs are confined in this case to 1.2-1.6 Å. These rigid-body movements of the C-terminal section, combined with the conformational rearrangements of the remaining structure, enable the opening/closing of PR65 that may sample compact and extended forms, as illustrated in Figure 4. Notably, F502W, located at the inner helix (helix 2) of repeat 13 near this hinge region, significantly alters PR65 equilibrium dynamics in favor of more extended conformations, underscoring the mechanical significance of this particular site.
Compared to WT PR65, F502W exhibits an increased ability to transition between open and closed forms, whereas D315E exhibits a decreased ability. To further investigate the effect of these point mutations on PR65 structural dynamics, we evaluated the correlation cosines between the global movements observed during the simulations and the deformation vector representing the experimentally observed difference between the compact and extended PR65 structures. The global movements sampled in MD simulations were characterized by principal component analysis (PCA)40. The PCA was performed using the triplicate MD trajectories (total of 1.962 µs) generated for WT PR65 and each of the six mutants. First, we aligned the conformations observed in MD with the compact PR65 structure to exclude rotational and translational motions. Thus, PCA yields 3N-6 internal motions. The first principal component (PC1) describes the most dominant mode of collective motion, which is also energetically favorable (soft mode), succeeded by PC2 and PC3.
Our goal was to assess whether the mutations impaired (or enhanced) the ability of PR65 to undergo its functionally required transitions between extended and compact forms. To this aim, we calculated the 3N-dimensional deformation vector pointing from the compact (PDB: 6NTS) to the extended (PDB: 1B3U) structures and examined if/how the PCs derived from MD simulations correlated with this structural change. As a quantitative measure of the ability of the PR65 mutants to effectuate these functional movements, we used the correlation cosine between the PCs (from simulations) and the deformation vector (from experiments)16. The heatmap in Figure 5 presents the results for all simulated systems. First, we note that the WT PR65’s PC1 yields a correlation cosine of 0.756 with the experimentally observed structural change, indicating this mode’s ability to predict 75.6% of the structural transition between the compact and extended PR65 structures. This is in accord with the previously reported intrinsic ability of the scaffold to accommodate, if not drive, these functional changes in structure. More important is to see to what extent the mutants exhibit similar abilities. In the cases of D315E and L197V mutants, PC1’s ability to describe the conformational transition between the compact and extended forms decreased to 69.55% and 73.37% respectively, indicating a small loss in the ability of the mutant to accommodate these changes in structure. In contrast, Y168V, S323L, and E375D, yielded values of 78.23%, 77.88%, and 78.37%, respectively, suggesting that the PR65 global dynamics is robust to those point mutations. Strikingly, the PC1 of F502W stood out from the others by yielding a correlation cosine of 0.8844 between PR65’s compact and extended forms, which is even higher than that of the WT. This result draws attention to the significance of F502 on PR65’s functional dynamics (and enhancing it when replaced by a tryptophan), which may have important ramifications for redesign or alteration of catalytic activity.
We further evaluated the cumulative correlation cosines with subset of 2 modes (PC1-2 in Figure 5), in accord with our previous study41. The cumulative correlation cosines for F502W PC1-2 reaches 0.895, which is higher than that of WT (0.858). D315E results in lower cumulative correlation cosines (0.739 in PC1-2) than the WT. Therefore, our findings suggest that the single point mutations F502W could impart alterations in protein dynamics that promote the ability of PR65 to undergo transitions between the compact and extended structures to accommodate trimeric assembly or diverse regulatory subunit binding. Conversely, D315E nudges the system towards a state that less favorably accommodates such transformations. These shifts could potentially impact the function of PR65.
Nanoaperture Optical Tweezer based characterization of the PR65 protein and its variants. Optical tweezers have been used widely to probe the biophysics of proteins at the single molecule level42-44. By using enhanced field confinement and sensitivity, nanoaperture-based plasmonic tweezers have been adopted by several groups to study single proteins, protein complexes and their interactions without the need for (or potential impact from) tethers or labels30, 31, 45-49. Here we trap the protein using double nanoholes50 fabricated by a random colloidal lithography technique28. A 980 nm laser with a 1.3 NA 100× objective is focused on the aperture, with 22.5 mW of power incident on the aperture in a diffraction limited spot. The transmission through the aperture is monitored on an avalanche photodiode (using a 1.3 OD filter to prevent saturation). When trapped, the PR65 protein undergoes a characteristic step as it enters the trap, with an increase in the noise amplitude, as shown in Figure 6A.
Once in the trap, the Brownian motion of the particle results in increased “noise” in the photodiode signal (transmission through the aperture, T, normalized to the pre-trap level). A histogram of this noise sampled after the trapping event is given in Figure 6B. The stiffer the optical tweezer potential, the less motion that the particle undergoes42, so the amplitude of the noise is expected to be smaller (RMSD). The stiffness is proportional to polarizability of the particle, which is higher for longer particles, so less deviation is expected if the protein is extended by a point mutation. Another quantity that can be extracted from the detected transmission is the power spectral density shown in Figure 6C. This has a corner frequency proportional to the trap stiffness divided by the hydrodynamic drag on the protein42 and so it is expected to show the opposite trend of increasing as the particle lengths (opposite to the RMSD). These two quantities for the point mutations are compared with the WT in Figure 6D-E and Table 2.
As noted above, S323L, E375D, and F502W shifted towards the extended conformation, and these showed the highest corner frequencies among all studied mutants as well as the WT PR65 (Figure 6D). They also showed the lowest RMSD (Figure 6D), however, E375D was larger than the WT. We stress that this RMSD is the result of Brownian motion of the protein in the optical potential, and therefore reflect both overall tumbling and global internal motions, different from the internal motions seen in MD simulations which occur at a much faster timescale, even though the end-to-end fluctuations observed in MD and the extracted PCs reflect relatively slow events. Both of these findings separately confirm the in-silico prediction that the protein is extended in the presence of these three mutations S323L, E375D, and F502W. The mutants L197V, Y168V and D315E, on the other hand, showed the opposite trend in the experiments, and this confirms the in-silico predictions of a more compacted form (Figure 6F). Finally, we note that the most dramatic behavior departing from the WT (and other mutants) has been observed in the mutant D315E, which is also consistent with MD results where D315E is distinguished by the impact of the point mutations on its structure (Figure 4) and functional dynamics (Figure 5).