The curated genome of the soil amoeba D. discoideum shows a single gene (DictyBase Gene ID DDB_G0293374; [7]) predicted to encode a dUTPase polypeptide containing the five hallmark motifs (M1-M5) of homotrimeric dUTPases [8], seen in the alignments of the amino acid sequences from mustard, yeast and human (Fig. 1a). While the dUTPases of Arabidopsis thaliana and D. discoideum have substantial stretches of identity (73%) within the 138-residue segment containing M1-M5 [9], their N-termini have very low sequence similarity to each other, and to the human and yeast N-termini. Notably, within the lengthy N-terminus of the D. discoideum dUTPase, atypical of most dUTPases, computational analyses predict a mitochondrial targeting sequence (MTS).
Results
A single gene codes for an active enzyme in the homotrimeric dUTPase family
To characterize the unusual D. discoideum dUTPase, we established first that the recombinant protein was an active dUTPase and determined its kinetic parameters. Also catalytically-active was a core version comprised of polypeptide subunits that retain M1-M5 but lack residues 1–37 of the N-terminus. A homotrimeric structure of the core dUTPase enzyme was confirmed by crystallographic analyses that additionally showed interactions of the truncated N-termini with the C-termini likely enhanced substrate access of the dUTP substrate to the active sites.
Recombinant His-tagged dUTPase proteins were expressed in E. coli, purified by metal-chelating chromatography and after removal of the His-tag, assayed for activity by measuring the release of protons with the pH indicator cresol red (SM Text S1). As determined by the kinetic parameters calculated from stopped-flow measurements (Fig. 1b, c and Table 1), the kcat/KM of the core protein, lacking N-terminal residues 1–37 (Fig. 1a), was 60-fold greater than that of the full-length species, indicating the core was a more efficient enzyme.
Table 1 Calculated kinetic parameters of D. discoideum dUTPases
|
KM
mM
|
Vmax
mM·s-1
|
kcat*
s-1
|
kcat/KM
µM-1s-1
|
core
|
0.5 ± 0.1
|
1.4 ± 0.06
|
9.3
|
18.6
|
full-length
|
1.0 ± 0.2
|
0.5 ± 0.03
|
3.3
|
0.3
|
*kcat= Vmax/[ET]; [ET]=0.15 mM
Activity assays using an end-point method showed the full-length and core dUTPases both were metal-dependent enzymes inhibited by EDTA. Mg2+was identified as the optimal divalent cation (SM Table S1), as seen with homotrimeric dUTPases [6]. Both proteins specifically used dUTP (≤1% hydrolysis of other dNTPs; data not shown) and exhibited optimal activities at 60˚C and pH 8 (SM Fig. S1). The catalytically efficient core dUTPase indicated that the 37 N-terminal residues were not essential for activity.
The crystal structure of the core dUTPase showed it to be a homotrimer, indicating that the absent 37 N-terminal residues were dispensable for oligomer formation (SM Fig. S2; PDB ID 5F9K). In most homotrimeric dUTPases, the C-termini, containing M5 which aids in coordinating the ligand at the active site, are flexible. In the core structure, PISA interaction analyses revealed that the shorter N-termini interacted with the C-termini of adjacent subunits, constraining their engagement with their respective active sites. The interactions are schematically shown for Chains A and C, and Chains B and A (Fig. 1d). The limited movement of the C-termini contributed to the different inhibitor coordinates in each active site of the core dUTPase (SM Fig. S2, insets). Notable is the position of Inhibitor B. Interactions (hydrogen bonds and salt bridges) by three glutamates of the C-terminus of Chain C with residues of the N-terminus of Chain A (E120::H3+F5, E123::K5 and E126::K9) produced the most displaced orientation compared to Inhibitors A and C (detailed in SM Fig. 3). The C-termini, which act normally as lids for the substrate binding pockets [10–12], instead were restricted in their movement, which may have allowed increased access of dUTP to the three active sites compared to the full-length protein with its extended N-termini. We postulate this to be why the core enzyme was more catalytically efficient in vitro compared to the full-length dUTPase.
The N-terminus of D. discoideum dUTPase contained a mitochondrial targeting sequence
The role of the N-terminus of the D. discoideum dUTPase was explored first by expressing in Ax2 cells a full-length dUTPase with GFP at the C-terminus of the polypeptide subunit (Fig. 2a). By microscopy, the fusion protein was observed to localize to Mitotracker™–labeled mitochondria (Fig. 2b), supporting the hypothesis that N-terminus contained an MTS. Contrary to expectations, nuclei did not display any GFP fluorescence (Fig. 2c). A helical wheel projection showed that N-terminal residues 1–20 of dUTPase formed an amphipathic helix [13] (Fig. 2e), with a net positive charge, a characteristic common to presequences of imported mitochondrial proteins [14]. Programs predicting targeting sequences indicated a high probability of dUTPase to be located in mitochondria, when fungi were used in the organism category, with cleavage after Leu18 upon import into the organelle (BaCelLo [15], MitoFates [16], MitoProt II [17]).
As a test for the presence of an MTS within the N-terminus of dUTPase, we added the forty N-terminal residues of dUTPase to GFP, producing N1–40-dUTPase-GFP (Fig. 2a). Like the full-length dUTPase-GFP fusion protein, the expressed N1–40-dUTPase-GFP co-localized with Mitotracker™–labeled mitochondria (Fig. 2d), reinforcing the prediction of an MTS within the N-terminus of dUTPase. The majority of the mitochondria had both GFP and Mitotracker™ signals, but not all Mitotracker™–stained mitochondria had a GFP signal. No fluorescence was observed in nuclei (data not shown).
Protein blots, of whole cell lysates and mitochondria prepared from cells expressing N1–40-dUTPase-GFP, were probed with antibodies against porin, a mitochondria-specific protein [18] or GFP (Fig. 2f). As expected, the signal for anti-porin was enhanced in the mitochondria preparation (upper panel). Observed also in mitochondria was a strong anti-GFP signal (lower panel) that corroborated the microscopy images, indicating that N1–40-dUTPase-GFP localized to mitochondria. The two anti-GFP signals observed both in the lysate and mitochondria were interpreted to be the entire N1–40-dUTPase-GFP, predicted to be 32.5 kDa and a processed version of it, migrating at 27.5 kDa. The approximate size difference of 5 kDa corresponded to the predicted 4.8 kDa mass of the forty residues of dUTPase fused to GFP, suggesting the loss of the dUTPase amino acids after import into the organelle.
To determine that cleavage of N1–40-dUTPase-GFP occurred, we obtained the N-terminal sequence of the 27.5 kDa protein immunoprecipitated from cell lysates using anti-GFP antibodies. The processed fusion protein was cleaved after Gly41, the first glycine of the (Gly-Ala)5 linker between the dUTPase and GFP sequences (Fig. 2a), confirming the loss of all forty dUTPase residues. Though within the range of presequence lengths, this was lengthier than the predicted presequence that identified cleavage after Leu18, a residue common within an R–2 motif recognized by the mitochondrial processing peptidase [16, 19] [reviewed in 14]. An intermediate was not detected by immunoblotting, but it is possible that the fusion protein was cleaved twice, once after Leu18 and then after Gly41, generating the final A(GA)4-GFP molecule inside the mitochondria.
Summary
Activity and structural analyses of the D. discoideum dUTPase showed it to be a functional enzyme with attributes typical of most eukaryotic dUTPases. The unusually long N-terminus of the polypeptide subunit was found to be dispensable for catalytic activity as well as homotrimer formation. Microscopy and biochemical evidence showed that fusion proteins, where GFP was attached to the full-length polypeptide or the forty N-terminal amino acids of dUTPase, localized to mitochondria, but not to nuclei, indicating the presence of an MTS within the N-terminus of the D. discoideum dUTPase.