Taxonomic distribution of prokaryotic L-asparaginase. Prokaryotic L-asparaginase is ecologically involved in transformation of organic nitrogen to release ammonia via the ammonification process [18]. Nitrogen rich compounds, peptides and amino acids are among available nitrogen sources in aquatic habitats where type II secretory L-asparaginase can be utilized for scavenging nitrogen by a range of prokaryotes [18, 19]. The secretory L-asparaginase act as a public good; providing surrounding cells by nitrogen source. To explore the phylogenomic distribution of L-asparaginase containing bacteria we perform an extensive screening of 27,000 publicly available bacterial genomes/metagenome assembled genomes (MAGs) via Annotree [44]. Our results show a vast distribution of L-asparaginase genes in bacteria where 54 out of 112 bacterial phyla contain the genetic potential for L-asparaginase activity. Many of these bacterial phyla have no representative in culture and consequently remain inaccessible to culture/expression-dependent screening approaches (Fig. 2).
A total of 6329 bacterial genomes containing genes annotated as TIGR00519 (L-asparaginase, type I) were recovered. These genomes are affiliated to 36 different bacterial phyla. 5294 bacterial genomes affiliated to 42 different phyla contain genes annotated as TIGR00520 (L-asparaginase, type II) (Fig. 2). Representatives of eight bacterial phyla contain both type I and type II L-asparaginase genes in their genomes (Proteobacteria 817, Bacteroidota 249, Firmicutes 175, Firmicutes_A 8, Myxococcota 5, Firmicutes_C 5, Riflebacteria 4, and Firmicutes_B 2). Phyla Proteobacteria and Firmicutes contain more representatives harboring the secretory L-asparaginase compared to the cytoplasmic one, whereas phylum Bacteroidota represent the opposite with higher frequency of representatives containing cytoplasmic L-asparaginase (Supplementary Fig. S1). These screened genomes/MAGs originate from different habitats ranging from aquatic, soil, to host associated microbes showing the vast distribution of this enzyme among bacterial representatives.
In-silico screening of L-asparaginase from the Caspian Sea metagenomes. The expansive medical use of L-asparaginase necessitate continuous screening for novel L-asparaginases of superior features in order to resolve emerging complications against the existing L-asparaginases used for therapeutic purposes. Screening novel environments by tapping into the vast metabolic reservoir of yet to be cultured prokaryotic majority can serve as a highly promising resource (Figs. 1 and 2). Current therapeutic L-asparaginase enzymes originating from E. coli and Erwinia chrysanthemi respectively lose 40 and 80 percent of their activity in blood salinity (0.9%) [15, 45]. Here we target the brackish microbiome of the Caspian Sea for in-silico metagenomics exploration of novel putative L-asparaginase. The brackish salinity of the Caspian Sea represents a highly similar salinity and main ionic concentration to the human serum (130–145 and 132.44 mM Sodium in the human serum and the Caspian Sea water respectively and 3.5–5.3 and 3.04 mM potassium in the human serum and the Caspian Sea water respectively [24, 46] (Supplementary table S1). Therefore, we hypothesize that secretory enzymes in the brackish microbiome of the Caspian See could potentially represent higher stability in physiologic conditions of the human serum thus, promising desirable therapeutic applications.
In total 703171, 1169997 and 1214607 predicted open reading frames (ORF) were screened for L-asparaginase activity respectively from 15, 40, and 150 m depth profile metagenomic datasets of the Caspian Sea. A total of 175, 296, and 284 putative L-asparaginase genes were annotated as TIGR00519 or TIGR00520 HMMs from 15, 40, and 150 m depth metagenomes respectively. The putative annotation of these genes was further evaluated by inspecting the conserved regions and protein secondary structure (as explained in the methods section). Amongst them 87 genes were verified for L-asparaginase activity after in-silico evaluations (18, 30, and 39 respectively from 15, 40, and 150 m metagenomes). Recovered L-asparaginase genes had the length in the range of 245 to 489 amino acids (median 336 amino acids) with the highest sequence identity to representatives of Bacterial taxa Verrucomicrobiota, Gemmatimonadota, Alphaproteobacteria, Acidobacteriota, Chloroflexota, Bacteroidota, Gammaproteobacteria, Patescibacteria, Actinobacteriota, Cyanobacteria, Firmicutes_C, and ‘Candidatus Rokubacteria’ (n = 84), a single gene showing 75.7% identity to an archaeal L-asparaginase and two genes with 86.2 and 87% identity to single cell eukaryotes affiliated to Bathycoccus and Micromonas respectively (Supplementary Table S2).
To assess the activity and other characteristics of recovered enzymes we have selected three candidates to synthesis the enzyme coding sequence to experimentally verify their activity, Km, survival percentage at human serum salinity and cellular analysis. Sequences of secretory enzymes (Supplementary Table S3) related to fastidious bacterial taxa were preferably selected for cloning and enzyme activity verification. Additionally, bacteria spotted in all three depths of 15, 40 and 150 meter were assumed to be more desirable as are expected to be adapted to sustain wide range of environmental conditions. Three genes were selected CAspI from 40 m metagenomics dataset and affiliated to phylum Acidobacteriota (79.9% sequence identity), CAspII also from 40 m metagenomic dataset and affiliated to phylum Gemmatimonadota (51.8% sequence identity), and CAspIII from 15 m metagenomics dataset affiliated to phylum ‘Candidatus Rokubacteria’ (50.7% sequence identity) based on their best BLAST hit against NCBI NR database (Supplementary Table S2). Phylogenetic reconstruction of L-asparaginase genes annotated from the Caspian Sea metagenomes together with the secretory L-asparaginase genes recovered from screening bacterial genomes through Annotree is shown in Fig. 3. Highlighting the enzyme phylogeny based on genome taxonomy bring forward the possibility of horizontal transfer of type II L-asparaginase gene between different taxa.
In-silico characterizations and cloning of putative L-asparaginase. To estimate the probability of possible cross-reaction between antibodies against E. coli and Erwinia chrystanthemi L-asparaginases and selected genes from the Caspian Sea metagenomes, pairwise nucleotide sequence alignment was performed. The maximum similarity was between CAspI and Erwinia (49.2% similarity, 34.3% identity) which is lower than the similarity between E. coli type II and Erwinia L-asparaginase (62.8% similarity, 42.3% identity). Erwinia L-asparaginase is used as the second line of chemotherapy in cases that show intense immune reaction against E. coli L-asparaginase. As antibodies against E. coli L-asparaginase do not cross-react with Erwinia considering their sequence similarity of 62.8%, we estimated that our genes should be safe against E. coli and Erwinia antibodies based on their sequence similarity (Table 1).
Table 1
pairwise nucleotide alignment results of the selected L-asparaginases from the enzymes from E. coli and E. chrysanthemi.
Pairwise sequence alignment (%) | E.coli type I | E. coli type II | E. chrysanthemi |
CAspI | S: 36.5 I: 21.9 | S: 47.0 I: 31.7 | S: 49.2 I: 34.3 |
CAspII | S: 32.5 I: 19.7 | S: 46.5 I: 31.1 | S: 47.4 I: 34.0 |
CAspIII | S: 35.6 I: 22.5 | S: 48.3 I: 32.6 | S: 49.9 I: 32.3 |
E. chrysanthemi | S: 35.6 I: 19.0 | S: 62.8 I: 47.3 |
E. coli type II | S: 36.6 I: 23.6 |
Three L-asparaginases screened from the Caspian Sea metagenomes named CAspI, CAspII and CAspIII (1092, 1218 and 1011 bp respectively) were codon optimized for expression in E. coli, synthesized and cloned in pET21a (+) vector in fusion with hexa-histidine tag at C-terminus (Supplementary Table S4).
Protein expression and purification. Subsequent to plasmid transformation and accuracy confirmation by re-sequencing, enzyme expression and activity was examined. Successfully all three recombinant enzymes showed L-asparaginase activity while the specific activity of the crude enzyme from untransformed E. coli was negligible; indicating that our computation metagenome-wide screening method is a promising approach for functional annotation of predicted open reading frames. After culture condition optimization, maximum specific activity of crude extract was achieved by induction of mid-log phase transformed cells for three hours at 30 °C using LB as culture medium. Optimum IPTG concentration was found 0.3 mM for CAspI and 0.2 mM for CAspII and CAspIII (Supplementary Fig. 2).
Recombinant CAspI, CAspII and CAspIII were purified to homogeneity by nickel-agarose affinity chromatography. SDS-PAGE single band matching the enzymes estimated molecular mass indicates the accuracy of purification (Supplementary Fig. 3). Molecular mass of the monomeric enzymes using their amino acid sequences were predicted to be 38.2, 43.3 and 35.6 kDa respectively which was consistent with protein migration on SDS-PAGE gel. The molecular weight of L-asparaginase protein varies according to the enzyme source i.e. for bacterial L-asparaginases, the molecular weight according to SDS-PAGE analysis is usually in the range of 35 to 40 kDa [47–49]. However, there are reported cases of very low molecular weight of 11.2 kDa for Streptobacillus sp. KK2S4 [50] and the high molecular weight (near 97.4 kDa) of purified L-asparaginase from Streptomyces tendae [51]. The specific activities of the purified enzymes were 700, 240 and 100 U/mg respectively for CAspI, CAspII and CAspIII while the specific activity of the commercial L-asparaginases from E. coli and E. chrysanthemi are between 280–400 U/mg and 650– 700 U/mg respectively [4]. Phetsri et al., reported maximum specific activity of 113 U/mg for Streptococcus thermophiles among four species of lactic acid bacteria tested [52]. Specific activities of 833 and 155 U/mg are also stated for the L-asparaginases purified from Thermococcus Kodakarensis and Acinetobacter soli, respectively [53, 54].
Kinetic parameters of recombinant L-asparaginases. Kinetic parameters of the enzymes were calculated according to the classical Michaelis–Menten equation using L-asparagine as substrate (Fig. 4 and Table 2). Km values of 10, 0.35 and 0.15 mM were achieved for CAspI, CAspII and CAspIII respectively. Enzyme affinity toward its substrate is reflected by the value of the Km; The lower the Km value, the better binding ability of the enzyme. While the Km value of all three enzymes are higher than E. coli type II and E. chrysanthemi L-asparaginases [4], our data suggests that CAspIII Km is among the lowest reported L-asparaginase Km values to date. The Km values of CAspIII is lower than the L-asparaginases produced by Streptomyces fradiae NEAE-82, Halomonas elongata and Enterobacter cloacae [15, 55, 56]. This gene was recovered from a 1161 bp long contig assembled from the 15 m depth metagenomes of the Caspian Sea showing the highest protein sequence identity (50.7%) to the reconstructed metagenome assembled genome ‘Candidatus Rokubacteria’ bacterium AR30 (The BioSample accession number SAMN08911936). This MAG was reconstructed from meadow soil samples at 30-40cm depth, Angelo Coast Range Reserve, CA, USA (The BioSample accession number SAMN08902845). Representatives of this candidate phylum so far have evaded the bound of culture and remained inaccessible to the culture/expression-screening campaigns [57–59]. Representatives of this Candidatus phylum are shown to be involved in biogeochemical cycling of elements in the soil [57–59] and harbor a vast potential for secondary metabolites biosynthesis [60]. While the CAspIII is only distantly related to the L-asparaginase gene of the ‘Candidatus Rokubacteria’ bacterium AR30, it most probably belongs to the rare and fastidious microbiome of the Caspian Sea; however, the taxonomic affiliation cannot be assured only based on the BLAST identity. Additionally, the representatives of the ‘Candidatus Rokubacteria’ are ubiquitous in a diverse range of terrestrial ecosystems and subsurface habitats with no reported marine representative [59]. CAspII and CAspIII also revealed catalytic efficiencies of approximately 10-fold higher than CAspI suggesting that they could metabolize asparagine more efficiently [61].
Table 2
Biochemical characteristics of the recombinant L-asparaginases.
L-asparaginase | kcat (s− 1) | Km (mM) | kcat/Km (mM− 1s− 1) | Vmax (µmol min− 1) |
CAspI | 446 ± 20 | 10 ± 1 | 44.6 | 0.35 ± 0.02 |
CAspII | 174 ± 8 | 0.35 ± 0.02 | 497.14 | 0.12 ± 0.01 |
CAspIII | 59.3 ± 4 | 0.15 ± 0.02 | 395.4 | 0.05 ± 0.004 |
Effect of pH and NaCl concentration on enzyme activity. The activity of the purified enzymes was studied in the pH range of 5–8 (Fig. 5a). All recombinant enzymes displayed maximal L-asparaginase activity at pH 7.5 that is favorable feature of these enzymes where maintaining optimal enzymatic activity at the physiological pH is one of the perquisites for antitumor activity [62]. A sharp decrease in activity was observed at more acidic or basic pH in the case of CAspII. CAspI activity was increased gradually up to pH 7 and retained its maximum activity up to pH 8. CAspIII showed maximum activity at pH 6.5–7.5. The amidases enzymes such as L-asparaginases are mostly active and stable at neutral and alkaline pH ranges of 5–9 [63]. L-asparaginase, purified from alkaliphilic Streptomyces fradiae NEAE-82, exhibited maximum activity at pH 8.5 [55]. The optimal L-asparaginase activity from Halomonas elongata was reported to be at pH 6–9 [15]. Maximum activity at pH 8 was obtained for purified L-asparaginase from Pyrococcus furiosus [64].
Effect of NaCl concentration on the enzyme activity is shown in Fig. 5b. As expected, it can be observed that 140 mM NaCl (equal to physiologic 0.9% saline solution) had no adverse effect on enzymatic activity of the recombinant L-asparaginases; furthermore, it is consistent with blood Na + concentration that ranges between 130–145 mM [46]. It should be noted that both commercial L-sparaginases and most reported enzymes show decreased activity when subjected to physiological salinity [45]. This reiterate the adaptive advantage of secretory enzymes recovered from the brackish waters of the Caspian Sea to retain their activity in the salinity of human serum due to similar ionic concentration. L-asparaginase isolated from the halophilic H. elongata also retained its maximum activity at physiologic salinity [15].
Anti-leukemic assessment. L-asparaginases have been isolated from various sources but all do not have cytotoxic effects on cancerous cells. The cytotoxicity of recombinant L-asparaginase was examined on human lymphoblastic leukemia cell line, Jurkat, by MTT assay (Fig. 7a). After 24 h of incubation, CAspIII and II proved to be highly effective against the leukemic cell line with IC50 of 120 and 33 nM which are equal to 0.6 and 0.06 IU/ml of the enzymes respectively. However, the commercial L-asparaginase from E. coli has IC50 of 1.0 IU/ml and that of Erwinia has been reported to have IC50 of 7.5 to 10.0 IU/ml [65]. These results clearly indicate that the purified recombinant CAspII and CAspIII can be considered as effective chemotherapeutic agents in killing human leukemic cell line, Jurkat, primarily due to depletion of the asparagine pool. Although asparagine is a nonessential amino acid, it is vital for some leukemia and cancer cells for two reasons. Firstly, asparagine is required for the synthesis of glycoproteins and other cellular proteins, and secondly, these cells have low expression level of L-asparagine synthetase for de novo synthesis of asparagine [64]. The IC50 value for CAspI was higher than 1000 nM (13 IU/mL), the highest concentration used in the dose-response curve. As seen in Fig. 6a, no tested concentration of CAspI could reduce cell viability to less than 60%, therefore the IC50 value is approximated (calculated based on the non-linear regression model).
Enzyme cytotoxicity. As a chemotherapeutic agent, asparaginase is routinely administered intravenously and thus it would come into contact with both leukemic and non-cancerous or non-blood cell types. As such, any probable anti-proliferative or cytotoxic effect of enzyme on both target and other cell types should be carefully assessed. Thus, we inspected the cytotoxicity of the purified enzymes with non-leukemic, non-myeloid cell line (HUVEC), along with the dominant cells of the blood, erythrocytes, to monitor any possible side effects of the enzymes on other cell types. No detectable adverse effect was observed for HUVEC cells (Fig. 6b and c). Additionally, no sign of erythrocyte hydrolysis was observed for any of the in vitro hemolysis test experiments conducted with the CAspI, II and III enzymes (data not shown).