Prevalence of Legionella species in air samples
Legionella species were not isolated by direct culture or amoebic co-culture methods from any of the 202 air samples collected from roads or bathrooms and other indoor sites (Table 1). Although 49 Legionella-suspected colonies grew on glycine-vancomycin-polymyxin B-cycloheximide (GVPC) agar plates, they also grew on blood agar plates, indicating that they do not belong to the genus Legionella. There was low background growth on the GVPC agar plates.
Table 1
Prevalence of Legionella species in air samples
| | No. (%) of positive samples by: | Geometric mean ± SD (log10 copies/m3) in Legionella DNA-positive samplesd |
Sampling site | No. of samples | Culture | Amoebic co-culture | qPCRc |
Road | 151 | 0 | 0a | 114 (75.5) A | 1.80 ± 0.52 C |
Bathroom | 21 | 0 | 0 | 15 (71.4) A | 1.82 ± 0.50 CD |
Indoor site | 30 | 0 | NTb | 11 (36.7) B | 0.88 ± 0.56 D |
a22 out of 151 samples were not tested. bNot tested. cValues with different letters are significantly different (P < 0.05). Data were analyzed by Fisher’s exact test followed by post hoc Holm test. dValues with different letters are significantly different (P < 0.05). Data were analyzed by the one-way ANOVA test followed by Tukey–Kramer multiple comparisons. |
However, Legionella DNA was detected in 114/151 (75.5%) air samples collected near roads at all 12 sampling sites (locations A‒L). Legionella DNA was also detected in 15/21 (71.4%) air samples collected from 14/17 bath facilities and 11/30 (36.7%) samples collected from 4/4 other indoor sites. The rates of positivity for the samples collected near roads and from bathrooms were significantly higher than those for samples collected from indoor sites other than bathrooms (P < 0.05; Fisher’s exact test followed by post hoc Holm test). The geometric means ± standard deviation (SD) (log10 copies/m3) of Legionella-specific 16S rRNA genes in the Legionella DNA-positive samples were 1.80 ± 0.52, 1.82 ± 0.50, and 0.88 ± 0.56 for roads, bathrooms, and other indoor sites, respectively. The values for these three sampling source types were determined to be significantly different by one-way analysis of variance (one-way ANOVA) test (P < 0.05). Moreover, the Tukey-Kramer method revealed that the amounts of Legionella DNA were significantly different among the samples from roads and indoor sites other than bathrooms (P < 0.05).
Of the 202 air samples, 150 (129 from roads and 21 from bathrooms) were tested by amoebic co-culture. For one sample collected from a bathroom (Legionella-specific 16S rRNA genes, 1.45 log10 copies/m3), the amount of Legionella DNA was found to be increased by 1.1 × 105-fold after amoebic co-culture. BLAST search showed that the macrophage infectivity potentiator (mip) gene sequence in the sample had 92% identity with the mip gene sequence of L. nautarum.
[Insert Table 1 here]
EMA-qPCR for the detection of viable Legionella cells in air samples
To detect DNA from viable but non-culturable (VBNC) Legionella cells, we collected another set of 30 air samples near roads and carried out EMA-qPCR. Considering the low airborne concentration of viable Legionella cells, the sampling time was extended to 30 min as it increased the volume of air sampled and thus, improved the limit of detection. Furthermore, Chang et al. reported that collection with deionized water yielded greater recovery of viable L. pneumophila than with tween mixture (containing 0.01% Tween 80, 1% peptone, and 0.005% Antifoam Y-30) by EMA-qPCR [19]. Thus, for EMA-qPCR, sterile deionized water was used as the sampling solution instead of 0.005% Tween-80 solution. Among the 30 air samples collected near asphalt roads, Legionella DNA was detected in 20 samples (66.7%) by EMA-qPCR and in 26 samples (86.7%) by qPCR (no EMA treatment, as a control). The geometric means ± SD (log10 copies/m3) of Legionella-specific 16S rRNA genes in the Legionella DNA-positive samples were 0.97 ± 0.88 by EMA-qPCR and 1.21 ± 0.70 by qPCR.
Geographic And Meteorological Characterizations
The numbers of air samples collected near roads at the 12 sampling sites were as follows: A, N = 15; B, N = 12; C, N = 12; D, N = 11; E, N = 10; F, N = 13; G, N = 11; H, N = 11; I, N = 17; J, N = 13; K, N = 11; and L, N = 15. At these sampling sites, the detection rates of Legionella DNA ranged from 60.0–93.3%. The geometric means ± SD (log10 copies/m3) of Legionella-specific 16S rRNA genes in the Legionella DNA-positive samples ranged from 1.54 ± 0.66 to 2.03 ± 0.42. The differences in the detection rates according to sampling sites and the amount of Legionella-specific 16S rRNA genes according to sampling sites were not significant by Fisher’s exact test followed by post hoc Holm test and the one-way ANOVA test, respectively.
We assessed the correlation between the climatic conditions (air temperature, relative humidity, total precipitation, and wind speed) and the amount of Legionella-specific 16S rRNA genes (log10 copies/m3) in the Legionella DNA-positive samples (Table 2). The amount of Legionella DNA correlated with the monthly total precipitation (r = 0.25, P < 0.01). It was also directly and inversely correlated with the daily total precipitation for seven days (r = 0.21, P = 0.01) and one day (r = − 0.29, P < 0.01) before the sampling day, respectively. The scatter plots of total precipitation and the amount of Legionella DNA are shown in Fig. 1. The geometric mean ± SD of the Legionella DNA-positive samples at a monthly total precipitation of > 200 mm (2.00 ± 0.40 log10 copies/m3) was higher than that at a monthly total precipitation of ≤ 200 mm (1.73 ± 0.54 log10 copies/m3; P < 0.05, Student's t-test). However, the detection rates of Legionella DNA at monthly total precipitation of > 200 mm (87.9%, 29/33 samples) and ≤ 200 mm (72.0%, 85/118 samples) were not significantly different (P ≥ 0.05, Fisher’s exact test). The detection rate and the geometric mean of Legionella DNA seven days before the sampling day at a daily total precipitation of > 10 mm (100%, 16/16 samples; 2.07 ± 0.32 log10 copies/m3) were also higher than those at a daily total precipitation of ≤ 10 mm (72.6%, 98/135 samples; 1.75 ± 0.53 log10 copies/m3) (P < 0.05; Fisher’s exact test and Student's t test, respectively). However, the geometric mean of Legionella DNA one day before the sampling day at a daily total precipitation of ≤ 10 mm (1.83 ± 0.49 log10 copies/m3) was higher than that at a daily total precipitation of > 10 mm (1.55 ± 0.64 log10 copies/m3; P < 0.05, Student's t-test); the detection rates of Legionella DNA one day before the sampling day at daily total precipitation of ≤ 10 mm (75.0%, 99/132 samples) and > 10 mm (78.9%, 15/19 samples) were not significantly different (P ≥ 0.05, Fisher’s exact test).
Table 2
Correlation between the climatic conditions and the amount of Legionella DNAa
| Air temperature (mean, °C) | | Relative humidity (mean, %) | | Total precipitation (mm) | | Wind speed (mean, m/s) |
| r | P | | r | P | | r | P | | r | P |
Daily value: | | | | | | | | | | | |
on the sampling day | −0.10 | 0.22 | | 0.13 | 0.11 | | 0.17 | 0.04c | | −0.10 | 0.22 |
one day before the sampling day | 0.01 | 0.90 | | −0.19 | 0.02c | | −0.29b | < 0.01c | | 0.16 | 0.049c |
two days before the sampling day | −0.07 | 0.39 | | −0.15 | 0.06 | | 0.12 | 0.16 | | 0.08 | 0.34 |
three days before the sampling day | −0.10 | 0.24 | | −0.01 | 0.90 | | −0.11 | 0.17 | | −0.15 | 0.06 |
four days before the sampling day | −0.06 | 0.48 | | 0.03 | 0.74 | | 0.02 | 0.85 | | −0.19 | 0.02c |
five days before the sampling day | −0.09 | 0.29 | | −0.05 | 0.53 | | −0.06 | 0.45 | | −0.08 | 0.35 |
six days before the sampling day | −0.11 | 0.17 | | 0.04 | 0.65 | | −0.09 | 0.28 | | 0.07 | 0.39 |
seven days before the sampling day | −0.13 | 0.12 | | 0.14 | 0.09 | | 0.21b | 0.01c | | −0.07 | 0.37 |
Monthly value | −0.10 | 0.24 | | −0.02 | 0.78 | | 0.25b | < 0.01c | | 0.01 | 0.88 |
aGeometric mean (log10 copies/m3) of Legionella-specific 16S rRNA genes in the Legionella DNA-positive samples. bAn absolute value of Pearson’s r ≥ 0.20 was considered to be correlated. cP < 0.05 was considered significant. |
[Insert Table 2 here]
16s Rrna Gene Amplicon Sequencing
16S rRNA gene amplicon sequencing was performed on randomly selected samples collected from roads (N = 30), and all samples collected from bathrooms (N = 21) and other indoor sites (N = 30). The median number of reads after quality filtering, denoising, merging, and removing chimeric sequences was 62,246 (range, 9,852‒246,625) from roads, 112,471 (range, 16,534‒341,587) from bathrooms and 111,835 (range, 16,009‒250,862) from other indoor sites. A total of 8,174,054 reads (100,914 reads per sample) were assigned to 18,426 amplicon sequence variants (ASVs).
At the genus level, 485, 421, and 368 bacterial genera were detected in samples collected from roads and from bathrooms and other indoor sites, respectively (Fig. 2). The top three most abundant bacterial genera in the samples collected near roads were Sphingomonas (21.1%), Streptococcus (14.6%), and Methylobacterium (1.6%); those in the samples collected in bathrooms were Sphingomonas (17.6%), Pseudomonas (5.4%), and Methylococcus (4.3%); and those in the samples collected from indoor sites other than bathrooms were Sphingomonas (19.2%), Achromobacter (5.0%), and Arthrobacter (3.8%). Reads from Legionella species were detected in 9/30 samples (30%) collected near roads (mean proportion of reads, 0.11%), 5/21 samples (24%) in bathrooms (mean, 0.04%), and 1/30 samples (3%) from indoor sites other than bathrooms (mean, 0.03%). The rate of positivity for the samples collected near roads was significantly higher than the positivity rate for samples collected from indoor sites other than bathrooms (P < 0.05; Fisher’s exact test followed by post hoc Holm test).
At the species level, L. pneumophila was detected in 2/30 samples collected near roads (mean proportion of reads, 0.11% and 0.09% in the detected samples) and in 1/21 samples collected in bathrooms (mean, 0.15% in the detected sample). Furthermore, L. birminghamensis and L. geestiana were detected in 1/30 samples collected from roads (mean, 0.81% and 0.02% in the detected samples, respectively), and L. nautarum was detected in 1/30 samples collected from indoor sites other than bathrooms (mean, 0.79% in the detected sample).
Alpha diversity indices in air samples are shown in Fig. 3. Shannon diversity indexes (a quantitative measure of bacterial community richness) and Pielou’s evenness (a measure of community evenness) were not significantly different between the three sampling source types (P ≥ 0.05, Kruskal-Wallis test). However, Faith’s phylogenetic diversity indices, a qualitative measure of bacterial community richness that incorporates phylogenetic relationships between the features associated with samples from road exhibited a much higher value than those from other indoor sites (FDR-adjusted P < 0.05; Kruskal-Wallis test, Benjamini-Hochberg correction). The principal coordinates plot showed that a portion of the air samples collected from roads and indoor sites other than bathrooms plotted separately from those from bathrooms (Fig. 4). Linear discriminant analysis (LDA) effect size (LEfSe) analysis revealed 15 genera with an LDA score of at least 3.0, which were significantly more abundant in the three sampling source types (Fig. 5; P < 0.05, pairwise Wilcoxon test). Specifically, we found the following four genera to be more abundant in the samples (LDA score > 4.0): Pseudomonas, Vibrio, and Staphylococcus from bathrooms (red color), and Achromobacter from other indoor sites (green color).