Primer in silico design and ARGs diversity
Sequences of the target ARGs with orthology degrees higher than 70% were retrieved from the corresponding KEGG entries. The number of sequences selected was as follows: aadA: 96 sequences, aadB: 21, ampC: 81, blaSHV: 89, blaTEM: 81, dfrA1: 42, ermB: 58, fosA: 20, mecA: 37, qnrS: 15, tetA(A): 49. All the ARGs’ sequences retrieved from the KEGG website are listed in supplementary Table 2SI. Subsequently, alignments were conducted using the MAFFT algorithm implemented in the Geneious Prime software to perform the in silico primer design, following the recommendations based on efficacy and specificity scores of Dreier et al. [21]. According to the parameters suggested by Bustin and Huggett [17], the adequate annealing temperatures (50–65ºC), base composition (GC content ranging of 50–60%), length (between 18–25 pb), absence of secondary structure, lack of potential hairpin formation, and lack of self-annealing were accomplished. Besides, to guarantee the total in silico coverage of the newly designed primers on the template sequences, degenerated bases were introduced into the primers to match all the sequences (maximum 2 degenerations per primer) wherever necessary. The new primers’ set guaranteed the total coverage of the template sequences (allowing two ambiguities per primer), generating amplicon sizes between 215 and 494 bp. In addition, the specificity of amplification of the new primers was tested in the 589 genomes used in this study, without obtaining in silico annealings out of the genetic zones of the genomes previously described as antibiotic resistance coding sequences. The candidate primers that matched over every targeted sequence and only exclusively over the target regions were considered for further research. The sequences of the new primers designed here and their main features are presented in Table 2. Higher coverages of the newly developed primers for the quantification of the selected sequences of aadA, ampC, blaSHV, dfrA1, mecA, qnrS, and tetA(A) genes were found compared to those of all the available primers tested in this study (Table 3SI). Additionally, the primers available for the fosA gene presented very low coverages, except those developed recently by Abbott et al. [32]; however, the presence of a triplet at the beginning of the forward primer and a higher level of homodimers in the reverse primer are detrimental in comparison with the new primer set (fosA-7F/fosA-306R) designed in this study. On the other hand, several available primers for aadB, blaTEM, and ermB presented coverages of 100% over the selected sequences. Nevertheless, a large number of non-compliances with the recommendations of the MIQE guidelines were found for these primers, underlining the necessity to develop new primers that solve those issues. In this sense, Keenum et al. [6] recently reported poor coverages of some of the most common primers used in the literature for the quantification of ARGs, highlighting the necessity to improve the molecular tools available for qPCR quantification. Hence, the de novo design of the new primers achieved better performance and allowed higher representativity of the absolute abundance of the different ARGs, as a higher detection coverage was warranted in this study, since all the current sequences deposited in the KEGG database (> 70% orthology degree) for a given ARG were specifically amplified in silico by the new primers.
Table 2
Sequences of the primers designed in thit study and their major features.
Gene Target
|
Primer ID
|
Sequence (5´- 3´)
|
Primer length (pb)
|
Amplicon size (pb)
|
aadA
|
aadA-336F
|
CATTCTTGCRGGTATCTTCGAGC
|
23
|
215
|
aadA-550R
|
GCACTACATTYCGCTCATCGC
|
21
|
aadB
|
aadB-118F
|
GACACAACGCAGGTCACATT
|
20
|
419
|
aadB-536R
|
GGTGGTACTTCATCGGCATAG
|
21
|
ampC
|
ampC-535F
|
GTGAAGCCRTCTGGTTTGAG
|
20
|
494
|
ampC-1028R
|
GCGACATAGCTACCAAATCCG
|
21
|
blaSHV
|
blaSHV-286F
|
CAGGATCTGGTGGACTAYTC
|
20
|
219
|
blaSHV-504R
|
CGCCTCATTSAGTTCCGTTTC
|
21
|
blaTEM
|
blaTEM-335F
|
CGGATGGCATGACAGTAAGAG
|
20
|
275
|
|
blaTEM-609R
|
TTGCCGGGAAGCTAGAGTAAG
|
21
|
|
dfrA1
|
dfrA-127F
|
GTMGGSCGCAAGACDTTYGA
|
20
|
255
|
dfrA-381R
|
GWARACATCACCYTCTGGCT
|
20
|
ermB
|
ermB-109F
|
GGAACAGGTAAAGGGCAT
|
18
|
434
|
|
ermB-543R
|
TCTGTGGTATGGCGGGTAAG
|
20
|
|
fosA
|
fosA-7F
|
ACCGGTCTCAATCACCTGAC
|
20
|
300
|
fosA-306R
|
GAGGAAGTAGAACGAATCGCC
|
21
|
mecA
|
mecA-1196F
|
CTTCACCAGGTTCAACTCA
|
19
|
370
|
mecA-1565R
|
CCTTGTCCRTAACCTGAATC
|
20
|
qnrS
|
qnrS-244F
|
GCCAATTGYTACGGKATWGAG
|
21
|
227
|
qnrS-470R
|
GACTCTTTCARTGATGCRCC
|
20
|
tetA(A)
|
tetA(A)-737F
|
TCATGCARCTYGTAGGMCAGG
|
21
|
454
|
tetA(A)-1190R
|
AKCCATGCCMAWCCGTTCCA
|
20
|
In vivo amplification of ARGs using the new primer sets
The specificity and the efficacy of the new primer sets were tested in vivo using the corresponding reference strains previously described as carrying ARGs in their genomes, observing in all the cases a unique amplification band with the expected amplicon size (Table 3). According to the results of the different PCR reactions, all new primer pairs generated single bands of the expected size from both pure cultures (reference strains) and environmental DNA pools. An example of the in vivo validation of the specificity and the efficacy of the primers is showed in Fig. 1SI.
Table 3
Reference bacterial strains carrying target ARGs selected for this study.
Gene
|
Reference strain
|
aadA
blaTEM
|
Escherichia coli ATCC 35218 (CECT 943)
|
ampC
|
Escherichia coli K12 (CECT 433)
|
aadB/
blaSHV
|
Klebsiella pneumoniae subsp. pneumoniae ATCC 700603 (CECT 7787)
|
mecA
|
Staphylococcus epidermidis RP62A,
ATCC 35984 (CECT 4184)
|
fosA
|
Pseudomonas aeruginosa PAO1 (CECT 4122)
|
dfrA1, ermB,
qnrS, tetA(A)
|
Environmental DNA extracted from sludge samples
|
Construction of the qPCR standars
The PCR products obtained for the in vivo validation of the primers were used as templates for the construction of the corresponding qPCR standards. For that purpose, purifications of the amplicons were made and the resulting cloning reactions were performed. Subsequently, eight transformant colonies obtained in each reaction were randomly selected to verify the presence of an insert of the right size by PCR, using agarose gel electrophoresis. Lately, four positive plasmids were selected for each ARG to confirm the correct identity of the inserts, by means of Sanger sequencing. All the sequenced clones were matched to the control sequences using the Map to Reference command of the Geneious Prime, obtaining sequence identities values ranging from 98–100%. The plasmid sequences with the highest homology were deposited in GenBank and used as standards for the qPCR assays. Generally considered, these results confirmed the specificity of the new primers designed here and their utility to accurately quantify ARGs in DNA isolated from complex environmental matrices, representing useful tools for monitoring the antibiotic-resistance phenomenon in the environment [13].
Validation of real-time quantitative PCR assays
The quantification of the gene copy numbers by qPCR is based on the linear relationship between the logarithm of the initial template quantity and the threshold cycle (Ct) value during amplification, allowing the calculations of the amplification efficiency and delineating its limits of detection (LOD) and quantification (LOQ) [33]. Hence, after validating different temperatures and times of amplification by qPCR, the thermal profiles that generated better amplification efficiencies are described in Table 4. Besides, the equations of linear regression and major properties of the linearity assays are presented in Table 5.
Table 4
Thermocycler conditions for quantification of the different ARGs by qPCR.
|
|
aadA
|
aadB
|
ampC
|
blaSHV
|
blaTEM
|
dfrA1
|
ermB
|
fosA
|
mecA
|
qnrS
|
tetA(A)
|
Initial denaturalization
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
95°C,
10 min.
|
Amplification
(×35 cycles)
|
Denaturalization
|
95°C,
30 sec.
|
95°C,
30 sec.
|
95°C,
30 sec.
|
95°C,
15 sec.
|
95°C,
15 sec.
|
95°C,
15 sec.
|
95°C,
15 sec.
|
95°C,
30 sec.
|
95°C,
30 sec.
|
95°C,
15 sec.
|
95°C,
30
sec.
|
Annealing
|
60°C,
30 sec.
|
60°C,
30 sec.
|
60°C,
30 sec.
|
55°C,
30 sec.
|
60°C,
30 sec.
|
55°C,
30 sec.
|
55°C,
30 sec.
|
60°C,
30 sec.
|
50°C,
30 sec.
|
55°C,
30 sec.
|
55°C,
30 sec.
|
Elongation
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
72°C,
40 sec.
|
Melting curve
|
60°C-95°C + 0.15°C/second. Fluorescence measured each 15 seconds
|
Table 5
Equation of the linear regression of standard curves, R2 value, and amplification efficiency of each qPCR assay.
Target gene
|
Standard curve
|
R2
|
Amplification
efficiency
|
aadA
|
y = -3.507x + 34.893
|
0.997
|
100%
|
aadB
|
y = -3.4644x + 31.965
|
0.995
|
94%
|
ampC
|
y = -3.5201x + 33.994
|
0.995
|
92%
|
blaTEM
|
y = -3.5512x + 35.579
|
0.998
|
91%
|
blaSHV
|
y = -3.5431x + 34.665
|
0.998
|
92%
|
dfrA1
|
y = -3.5276x + 33.713
|
0.998
|
92%
|
ermB
|
y = -3.5199x + 33.689
|
0.994
|
92%
|
fosA
|
y = -3.487x + 33.185
|
0.997
|
94%
|
mecA
|
y = -3.5821x + 34.339
|
0.997
|
90%
|
qnrS
|
y = -3.5798x + 34.293
|
0.998
|
90%
|
tetA(A)
|
y = -3.587x + 35.124
|
0.996
|
90%
|
The validation of a new qPCR assay must fulfill a set of minimal performance characteristics to assess that the reliability of the method has been achieved [34]. These characteristics are mainly working range, analytical sensitivity, trueness, LOD, and LOQ, efficiency ruggedness and precision, and selectivity. A wide dynamic working range of a well-designed assay is one of the key performance parameters to achieve in a new qPCR method, as it is one of the most important advantages of qPCR assays, and ensures the accurate quantification of the copy numbers of the target genes, even when their range of abundance spans several logarithmic units [17]. Also, the linearity in the qPCR quantification must be reported through the coefficient of determination (R2 value) between the Ct values and the logarithm of gene copy numbers [13, 14]. Accordingly, an R2 value > 0.980 provides good confidence in correlating Ct and target copy number, suggesting a proper fit of the distribution of the data along the dynamic range employed [17]. In this respect, the standard curves of the Ct values and the gene abundances here reported showed good linearity over a wide dynamic range (from 20 to 2 × 108 gene copies (see details in Supplementary Fig. 2SI)), with R2 values > 0.990. Hence, the new developed qPCR protocols provided an accurate estimation of the abundances of the different ARGs within a broad dynamic range.
Considering that the abundance of ARGs in certain types of samples may be very low, the development of a qPCR assay must be designed to differentiate a low number of copies of a given ARG in a sample from the inherent noise of the method. In this regard, the Ct values of the non-template assays (negative controls) were null or at least 3.3 cycles higher than those of the last dilution of the last standard point, subsequently, no significant background noise was found in these experiments [33]. Besides, the LOD and LOQ values for the qPCR methods were estimated from the analysis of the replicate standard curves [35]. The LOD values, based on detecting the target sequence at the lowest concentration of the standard curve [33], were 80–120 copy numbers. Similarly, the LOQ values of the assays corresponded to 4 ×104 gene copies per gram of biological sample, which reflects the assay’s capacity to precisely quantify the target genes expressed as gene copies abundance per gram of matrix [34]. According to these results, the new qPCR methods developed here achieved proper levels of analytical sensitivity and trueness.
The qPCR assays with efficiency values within the range 80–120% are considered acceptable and could be subsequently employed [6]. The efficiency values of the different qPCR assays described here ranged between 90% and 100% (Table 5), therefore confirming the good performance of the developed methods. In this respect, variable amplification efficiency have been reported in the quantification range for blaCTX−M (95.3%), blaTEM (107.4%), blaOXA1 (92.1%), ermB (91.3%), tetA (95.4%), sul1 (95.8%), sul2 (83%), dfrA1(88.5%) and dfrA12 (99.4%) [14]. The ruggedness was determined by comparing the Ct values among analytical replicates in a given qPCR assay and those obtained in the different qPCR assays. In this regard, the intra-analytical deviations of the standards among all qPCR assays were very low (average Ct value = 0.345, ranging from 1.164 to 0.024 cycles). Similarly, the mean analytical deviations among different qPCR experiments were 1.01 cycles (ranging from 0.558 to 1.316). Therefore, these qPCR methods showed a good reproducibility both in the same qPCR experiment and when comparing independent qPCR assays, confirming their effectiveness and ruggedness to quantify several ARGs reliably and accurately.
Finally, the specificity of the qPCR methods was also determined by analyzing the amplicon products after performing the assay [36]. In all cases, the melt curves displayed a single sharp peak and were shoulderless, indicating that the amplicons obtained were free of unspecific products, highlighting the reliable amplification of the target ARGs (Fig. 3SI). Also, the verification of the melting temperature of the environmental samples was compared with the expected peak of melting temperature obtained for the standard amplification products. Both melting temperature peaks were equivalent, confirming the specificity of primer annealing previously observed in the in vivo amplification assays.
Total quantification of ARGs in environmental samples
Developing new qPCR methods requires detecting and quantifying the actual occurrence of a given population, including rare, cryptic, and elusive genes in complex DNA from different environmental samples [37]. For that purpose, the efficiency of the primers, the qPCR conditions, and the effects of different matrices were validated in six natural and engineered environmental samples to show the right level of compliance with the critical considerations that need to be addressed for the validation of the design of new primers and the establishment of newly developed qPCR approaches. The total abundance of the ARGs detected in this study is presented in Fig. 2. The genes aadA, aadB, blaTEM, dfrA1 and fosA were prevalent in all samples. On the other hand, the ampC, blaSHV, ermB, qnrS, and tetA(A) genes were only detected in some samples, suggesting that these ARGs are rare in the different environments analyzed. Finally, the mecA gene was not measurable in any samples. According to these results, high ARGs’ detection frequencies (ranging from 70–100%) were found in the environmental samples using the de novo primer sets and qPCR methodologies proposed in this research, except for the mecA gene. These broad ARGs’ occurrences lend support to the idea of the ubiquitous distribution of ARGs in the environment [38, 39]. The widespread occurrences of ARGs for the most used antibiotics (beta-lactams, fluoroquinolones, tetracyclines, macrolides and sulfonamides) in natural and engineered ecosystems have been identified as a severe public health concern [40], a fact widely attributed to the subminimum concentrations of antibiotics that reach these ecosystems [9, 41].
The total abundances of ARGs among all the samples oscillated from 103 to 108 copies/g of biomass. Generally considered, the most abundant ARG was the aadA gene (average value 2.81 × 107 copies/g), followed by ermB (2.75 × 106 copies/g), dfrA1 (2.75 × 106 copies/g), ampC (1.90 × 106 copies/g), aadB (1.18 × 106 copies/g), qnrS (8.10 × 105 copies/g), blaTEM (5.45x105 copies/g), tetA(A) (3.65x105 copies/g), blaSHV (1.33x105 copies/g), and, finally, fosA (1.35x104 copies/g). According to the Kruskal-Wallis and Conover-Iman tests (Fig. 2), ermB and aadA were the most abundant genes in all the samples in which these ARGs were detected. The abundances of aadB, blaTEM, dfrA1, and tetA(A) genes presented a middle prevalence, and the lowest abundances were statistically found for blaSHV, fosA and qnrS. Finally, the numbers of gene copies of ampC were highly variable among samples.
Relative abundance of ARGs in environmental samples
Figure 3 and Table 4SI display the normalized ratio of ARGs copies to 16S rRNA copies (see details of 16S rRNA values in Fig. 4SI).
The highest relative abundance of ARGs for aminoglycosides (aadA, average 3.02%) and macrolides (ermB, 1.00%) observed in this study, are in agreement with those previously found in different WWTPs [42], in anaerobic digestates [43], in farms [44], and in riverine ecosystems [27]. The aadA gene confers resistance to a plethora of aminoglycoside antibiotics (spectinomycin, gentamicin, hygromycin B, and neomycin), which cumulatively are the third-most-used antibiotic class [45]. The high prevalence of the aadA gene in the six samples regardless of their origin could be linked to the fact that streptomycin has been amply used in animal husbandry and plant disease control since the late 1950s [46], which could have had a strong impact in the development and dissemination of aminoglycoside resistance in the natural environment. However, the use of streptomycin as a first-line antibiotic for tuberculosis treatment makes mandatory to avoid the dissemination of the aadA gene within pathogens and other sensitive bacteria [47]. On the other hand, one of the most abundant genes in the bacterial communities was ermB [46], whose dissemination is linked to bacteriophages via transduction, a prevalence that has been connected to the common clinical use of macrolides [43]. Also, the presence of ermB confers resistance to macrolides and it is also frequently associated with resistance to lincosamide and streptogramin type B, resulting in treatment failure to these three antibiotics groups, which inhibit bacterial protein synthesis [48].
The low relative abundances of of aadB, ampC, blaSHV, blaTEM, dfrA1, qnrS, and tetA(A) genes suggests that these ARGs are not the dominant resistance mechanisms in the bacterial communities here analyzed. Similarly low relative abundances of these genes have been described in activated sludge from different WWTPs, anaerobic digestate, compost, riverine sediments, and agricultural soil samples for the aadB [43, 49], ampC [25, 50, 51], blaSHV [25, 52], blaTEM [2, 12, 25, 53–56], dfrA1 [6, 53, 54, 56, 57], qnrS [59, 60], and tetA(A) genes [51, 53, 54, 57]. However, although these ARGs were detected in low relative abundances, their common occurrence implies a loss of the therapeutic effects of some of the most employed antibiotics compounds, resulting in treatment failures in both developed and developing countries and could be important agents in the exacerbation of antibiotic resistance emergence [61]. The enzyme encoded by the aadB gene confers resistance to gentamicin, kanamycin, and tobramycin [62]. This gene is usually present in microorganisms carrying extensive antibiotic resistances [63]. The plasmid-mediated extended-spectrum beta-lactamase (ESBL) blaSHV and blaTEM genes and the chromosomal ampC gene encode enzymes that can hydrolyze and inactivate broad-spectrum beta-lactam antimicrobials (third-generation cephalosporins, penicillins and aztreonam), except cefepime and carbapenems [64, 65]. The expression of the dfrA1 gene inhibits the therapeutic effect of the synthetic antibiotic combination of trimethoprim/sulfamethoxazole [66], antibiotics that are poorly removed during WW treatment processes, increasing their prevalence in the environment as a selective pressure on the bacterial communities [42]. Also, this resistance mechanism is typically carried as cassettes in integrons or insertions, facilitating their dispersion to other sensitive bacteria [67]. The qnrS gene mainly mediates the resistance to quinolones, being considered an antimicrobial resistance of the highest priority due to its significance in human medicine, particularly in developing countries [68]. In addition, the development of unexpected resistance mechanisms to synthetic antibacterial quinolones and trimethoprim encoded by the qnrS and dfrA1 genes highlights that antibiotic resistance is a process in continuous development [6, 60]. Among all tetracycline resistance genes, tetA(A) has been recently suggested as the molecular marker of the tet operon due to its abundance and relationship with anthropogenic inputs [69]. The broad presence of this gene in the environment is related to the nearly universal use of tetracyclines in livestock production, resulting in increased tetracycline resistance among Escherichia coli strains [70]. Ultimately, although all these genes were detected in low relative abundances in the samples analyzed here, monitoring their occurrence in the environment is essential to prevent their overdispersion, which will increase antibiotic resistance emergence irretrievably.
Finally, low relative abundances of fosA and the null presence of mecA genes were found in the six environmental samples here analyzed. Although only a few studies have determined the occurrence of fosA gene in environmental samples [71, 72], its scarce detection is in agreement with the previous results found in isolated bacteria [73] and in different environmental samples [74]. The enzyme encoded by the fosA gene confers resistance to fosfomycin, an antibiotic routinely used for the treatment of urinary infections in the last decades, that is also currently employed to deal with infections caused by extensively drug-resistant (XDR) Gram-negative bacteria [75]. Also, it is important to note that the prevalence of fosA in the environment supposes the proliferation of its main reservoirs, which are the well-recognized human pathogens Klebsiella spp., Enterobacter spp., and Serratia marcescens [76]. The absence of mecA gene in the different environmental samples is in agreement with the previous report of Shoaib et al. [77], which described a prevalence of this gene mainly in healthcare environments. However, a starting high prevalence of methicillin-resistant bacteria in environmental samples, mainly WWTPs, has been addressed in recent years [78, 79]. The mecA gene codes for a penicillin-binding protein (PBP2a), making microorganisms resistant to all beta-lactam compounds [80]. The broad proliferation of the mecA gene has been considered one of the most serious global health threats due to its capacity to cause nosocomial infections by multi-drug resistant human pathogens (mainly Methicillin-resistant Staphylococcus aureus (MRSA)), and to avoid its dissemination to bacteria sensitive to beta-lactams through horizontal gene transfer is a major priority [81]. Therefore, monitoring of both ARGs needs to be conducted and included in the framework for surveillance in natural environments to fill the gaps in the knowledge of the prevalence in environmental samples of fosA and mecA encoding resistance to therapeutic compounds essential to fight bacterial infections [72].
The development of effective and reliably new qPCR methods represent an outstanding contribution to quantifying the abundance of the ARGs and can provide information about the occurrence of ARB useful to propose new politics that minimize the emergence of antibiotic resistances. Hence, the improved primer sets developed in this study could be considered valuable tools to accurately monitor the ARGs in the environment as the first step to ameliorate antibiotic resistance phenomena.