In this study, we applied LC-MS-based metabolomics and bioinformatic tools (DAFDiscovery and NP Analyst) to investigate the toxicity/bioactivity of cyanobacteria crude extract from a tropical and eutrophic reservoir. We hypothesize that the combination of metabolomic methods with bioinformatic tools is efficient for evaluating the toxicity of different cyanopeptides contained in the crude cyanobacteria extract without requiring compound isolation.. Our results revealed the presence of compounds belonging to four cyanopeptides classes, microcystins, microginins, cyanopeptolins, and aeruginosins, in the tested cyanobacterial biomass. All these compound classes consist of biologically active metabolites that can influence the function of cellular enzymes, disrupt cellular signaling pathways, and induce tissue apoptosis, potentially leading to the mortality of organisms (Huang and Zimba, 2019). However, only microcystins figure in the management legislation for freshwater legislation worldwide (Chorus and Welker, 2021). Both chemometric tools used in this work efficiently graded the biological activity of the compounds found in the cyanobacterial biomass, each one with its specific characteristics of the data interpretation method.
Biomass toxicity
In aquatic systems, the cyanobacterial community produces a complex array of metabolites. All these compounds can be released mainly after the cyanobacterial cells’ lysis (Rohrlack and Hyenstrand, 2007), affecting aquatic organisms (Pearson et al., 2016; Toporowska et al., 2014; Ferrão-Filho and Kozlowsky-Suzuki, 2011; Metcalf and Codd, 2012; Osswald et al., 2007). Although cyanobacterial toxins and secondary metabolites co-occurrence are ecologically relevant (Fernandes et al., 2019), few studies explored the biological activities of crude extracts, especially in tropical systems. In this study, to assess the biomass ecotoxicity, we performed an acute toxicity test with crude extract from collected sample against A. salina. The evaluated biomass was composed predominantly of Dolichospermum spp. (~ 75%) and Microcystis spp. (~ 25%); and we have observed 100% mortality of A. salina nauplii with 500 µg.mL− 1 of crude extract after 24 hours of exposure, generating a LC50 of 278 mg.L− 1. According to Nguta et al. (2011) criteria, the crude extract used in this study was categorized as moderately toxic (LC50 = 278 µg.mL− 1). The LC50 provided by our test is higher than previously reported LC50 values for Microcystis bloom extracts collected from tropical reservoirs in Morocco (ranging from 1 to 46 mg.L− 1) that used similar evaluation methodology (Douma et al., 2017, 2010; Sabour et al., 2002). Although both Dolichospermum spp. and Microcystis spp. are known producers of microcystins (Chorus, 2001; Dreher et al., 2019), the variation in the results from acute testes can be attributed to the difference in the biomass species composition, the fluctuation of toxic clones in the samples, and the specific environmental conditions from which the biomass originated (Ibelings et al., 2021). For example, a comparison between extracts of Microcystis spp. and Planktothrix agardhii, which had similar total microcystin concentrations, revealed significant differences in the profile of microcystin types and other oligopeptides, leading to varying toxic effects in the organisms tested (Pawlik-Skowrońska et al., 2019). Furthermore, stronger toxic effects were observed by the binary mixture of anabaenopeptin-B with MC-LR at a total concentration equal to the concentration of the compounds when tested individually (Pawlik-Skowrońska & Bownik, 2022). This wide variation in results reinforces the need for toxicity studies with crude extracts from biomass environmental samples. Although there are already several studies investigating the toxicity of cyanopeptides, the biomass used generally comes from cultures of isolated strains, and toxicity tests are mainly performed with extracts from a single strain. The problem with these methodologies is that they ignore the chemo-ecological complexity of the interaction of different compounds. Also, the tested concentrations will hardly be found in natural conditions, distancing the results obtained from environmental reality.
Cyanopeptide profile and Bioactivity Correlation
The bioactivity correlation analysis in this study links different classes of cyanopeptides (MC, MG, CP, and AER) with the toxicity observed in the tests with A. salina. The cyanopeptides group with the highest diversity of congeners identified in our extract were MCs, with seven congeners. This class is the most widely described and includes diverse cyanobacterial toxins documented in the literature (Preece et al., 2017), with nearly 300 known congeners (Jones et al., 2020). All MC variants recorded in this study, except the MC-YL, already have an LC50 described in the literature, ranging from 50 (-LR) to 600 (-RR) µg.kg− 1 of body weight intraperitoneal injection in mice bioassays (Botes et al., 1985; del Campo and Ouahid, 2010; Kusumi et al., 1987; Meriluoto et al., 1989; Namikoshi et al., 1992; Stoner et al., 1989). For all MC analogs found in our extract, except the MC-RR, results from the NP Analyst platform gave the second-highest activity scores (0.35–0.40). The DAFdiscovery platform results attributed bioactivity correlations to the MC variants ranging from 0.44 (-YL) to 0.99 (-RR). Although MC-RR showed the lowest activity score in the NP Analyst analysis (0.08), the bioactivity correlation in the DAFdiscovery analyses presented the highest value (0.99). This variation is probably due to the different algorithms used by each platform. NP Analyst considers the molecule's presence in the active fraction(s) and its absence in the inactive ones (cluster score). If a molecule is present only in the active fraction, it will have a higher score, and if it also appears in inactive fractions, even at much lower concentrations, it will receive a lower score. The DAFdiscovery platform considers the peak intensity in each fraction. High concentrations in bioactive fractions pull the bioactivity correlation up. In a small number of samples, if a compound has a high concentration in a given bioactive fraction and a low concentration in a non-active fraction, it will still have a high bioactivity correlation.
The MG are linear peptides that vary, with few exceptions, from four to six amino acids and generally present a decanoic acid derivative, Ahda (3-amino-2-hydroxy-decanoic acid), at the N-terminus (Welker and Von Döhren, 2006). Approximately 40 congeners are known, and this number is still increasing (Lodin-Friedman and Carmeli, 2018; Riba et al., 2019; Ujvárosi et al., 2020). In this work, we found the presence of cyanostatin B (m/z 754.4426 [M + H]+), microginin MG KR787 (m/z 788.4016 [M + H]+), microginin MG KR604 (m/z 605.3879 [M + H]+). We proposed the molecular formula C41H61N5O9 to the feature m/z 768.4589 [M + H]+ that may be a potentially new MG congener.
MG analogs have been named as microginines, nostoginins, oscillaginins and cyanostatins, according to their producing species (Bagchi et al., 2016; Ujvárosi et al., 2020). These compounds are inhibitors of zinc metalloprotease and are known to be inhibitors of aminopeptidase M (APM), leucine aminopeptidase (LAP) and angiotensin-converting enzyme (ACE) (Ishida et al., 2000; Lifshits et al., 2011; Lodin-Friedman and Carmeli, 2018; Sano et al., 2005). The mechanism of enzyme inhibition is mainly attributed to the Ahda portion, although NeMe-Tyr-Tyr at the C terminus favors binding to the active site of the enzymes (Ishida et al., 2000). Data on the toxic activity of MG on animal models are scarce. Ujvárosi et al. (2020) determined in vitro cytotoxicity and genotoxicity of four MG congeners (cyanostatin B, MG FR3, MG GH787, MGL 402) and a well-characterized cyanobacterial extract encompassing these metabolites. In that study, the extract significantly affected the viability of cells in the MTT assay with the human hepatocellular carcinoma cell line (HepG2). Also, the extract and all tested congeners induced DNA strand breaks. Fernandes et al. (2019) tested a crude extract of Microcystis sp. containing several MG. They demonstrated its acute toxicity in larvae of a tropical freshwater fish Astyanax altiparanae, reporting numerous abnormalities, including abdominal and pericardial edema. In our study, the DAFdiscovery shows a high bioactivity correlation (0.75–0.81) for MG compounds. However, a low activity score (0.1–0.18) was assigned by NP Analyst. This difference is likely due to the different parameters considered by each algorithm. For example, MG KR787 appears in higher concentration in the active fraction F3 which gives it a high correlation of activity in DAFdiscovery. However, it is also present, although at a lower concentration, in the non-active fraction F4, which makes it assigned a low activity score by the NP Analyst. The scores assigned to the cyanostatin B variant followed the same reasoning.
The CP are a broad group of cyclic depsipeptides that can be produced by various cyanobacterial genera, including Planktothrix, Anabaena, Nostoc and Microcystis (Welker and Von Döhren, 2006). These molecules comprise a six-amino acid ring and a side chain with one or two residues. All CP are characterized by 3-amino-6-hydroxy-2-piperidone (Ahp) in position 3 (Mazur-Marzec et al., 2018). All positions in the ring, except Thr and Ahp, can be occupied by variable amino acids, giving the CP a wide structural variability (Welker and Von Döhren, 2006).
Certain CP have demonstrated inhibitory effects on serine proteases, including elastase, chymotrypsin, thrombin, and trypsin (Gesner-Apter and Carmeli, 2009; Linington et al., 2008; Weckesser et al., 1996). In our study, both platforms attributed to CP compounds the highest bioactivity correlations (0.54 by NP Analyst and 0.71–0.81 by DAFdiscovery), including the potentially novel analogs (C49H77N11O14 and C51H81N11O14). These results provide compelling evidence supporting the hypothesis that CP are indeed one of the classes that should be considered among the monitoring cyanotoxins. Furthermore, these findings illustrate the importance of environmental monitoring for discovering new cyanopeptides congeners. Gademann et al. (2010) has already shown that CP 1020, isolated from Microcystis, was toxic (LC50 = 8.8 µM) to the freshwater crustacean Thamnocephalus platyurus in similar concentrations to some well-known MC. Additionally, Faltermann et al. (2014) pointed out that exposure of zebrafish embryos to different concentrations of CP 1020 resulted in transcriptional alterations of genes related to several biological pathways.
The AER are a class of linear tetrapeptides with uncommon 2-carboxy-6-hydroxyoctahydroindole (Choi) and (4-hydroxy) phenyl lactic acid (Hpla) moieties and an arginine derivative at the C terminus (Ishida et al., 1999). AER is a cyanopeptide family with some potent inhibitors of serine proteases (Ersmark et al., 2008). The inhibitory mechanisms have been elucidated by X-ray crystallography of aeruginosin-protease complexes (Hicks et al., 2006). In our analysis, the presence of AER 298A was correlated with bioactivity by both platforms (scores 0.18 and 0.31 using NP Analyst and DAFdiscovery, respectively). Scherer et al. (2016) stated that some AER variants match the well-known MC-LR in the toxicity against crustaceans. Their studies reported a LC50 = 34.5 µM for the synthetic AER 828A, a chlorosulfopeptide analogue, against the organism T. platyurus. Similar results were found by Kohler et al. (2014), that reported a LC50 value of 22.4 µM for this AER variant against T. platyurus.
Using chemometric tools to predict the toxicity of compounds in complex mixtures
In this study, we used DAFdiscovery and NP Analyst tools to investigate the bioactivity correlation of different cyanopeptides in a complex extract from a biomass collected in a tropical reservoir. Both tools proved to be effective, each one with its particularities. Bioinformatics tools are suggested as a cost-effective and straightforward way to determine if a compound contributes to toxicity in a complex extract without the need to isolate the compound or use standards for biological tests (Atasanov et al., 2021). The bioactivity correlation results from this study reinforce that different cyanopeptides in the sample contribute to the overall toxicity observed. Based on current literature, MC was expected to exhibit the highest bioactive activity. However, to our surprise, microginins, aeruginosins, and, mainly, cyanopeptolins also strongly correlated with the toxicity observed in our tests. This study also suggested that the biomass extract contained potentially novel cyanopeptolins with high bioactivity correlations. Furthermore, we cannot rule out the synergistic action of these substances, as we are talking about a complex extract and not isolated compounds. Regarding the particularities of the tested tools, DAFdiscovery is suggested when the number of samples is high and when the separation achieved between each extract or fraction is inefficient since it considers the ionized compound's peak intensity in each sample. In turn, NP Analyst works better for a small number of samples and when the separation of compounds occurs efficiently, as the tool tends to smooth the correlation values if a compound appears in non-active fractions. Although this can lead to underestimating the molecule's toxicity, NP Analyst also provides another index, the cluster compound value, that helps interpret the results.