4.1 Drastic Vulnerability Analysis
The spatial distribution of the classes defined for each DRASTIC parameter is shown in Fig. 3.The maximum depth of groundwater (D) (29.7m), is found in the central zone of the study area and minimum values (around 6.3m) are located in the north and south sectors (Fig. 3a). Almost 70% of the study area has a water table depth of less than 9m, which determines that the most of the area is vulnerable to contamination due to the small thickness of the unsaturated zone.
The net recharge (R) varies from 0 to 984 mm per year. The maximum values correspond to irrigated zones (mostly located in the south), covering an area of 20%. In the central and northern zones, the recharge is a combination of rainfall and irrigation (more than 60% of the study area) (Fig. 3b). Despite the area with highest recharge is small, the recharge is higher than 254mm (maximum limit established by DRASTIC methodology) what favors contaminant infiltration from the surface.
The aquifer media factor (A) of the study area is defined by the dominant presence of sands and gravels (rating values 4–9). The highest permeability of the aquifer occurs in the central zone (35% of total area), which is considered the most vulnerable to contamination (Fig. 3c). Although the permeability of the aquifer is high, it shows little variation. Therefore, this parameter does not affect the distribution or variability of the DRASTIC vulnerability index.
The soil media factor (S) is defined by the occurrence of loamy and silty loamy textures, the latter being found in most of the study area (approximately 70%) (Fig. 3d). This type of soil texture helps to protect the vadose zone from the entry of contaminants.
More than 70% of the study area has a very low slope (T) (between 0–2%), only increases at the boundaries of the area and along the river banks (Fig. 3e). The gentle topography results in low surface runoff, which favors vulnerability to irrigation-related infiltration of pollutants.
The vadose zone (I) is defined by the occurrence of gravels, sands, clays, and silts (more than 80% of the study area). The most permeable materials are located in the central zone (Fig. 3f). Permeability contributes to the movement of pollutant movement from the surface to aquifer increasing the vulnerability there.
The aquifer hydraulic conductivity factor (C) ranges from 0 to 476 m/d. The hydraulic conductivity in more than 50% of the study area was higher than 81.49 m/d (which is the highest limit established by the DRASTIC methodology). The highest values are located in the south and in some areas in the north (Fig. 3g). These areas are susceptible to have high vulnerability, due to their high transmissivity and low unsaturated thickness. The lowest values of hydraulic conductivity are located in the central zone. Therefore, this area is less vulnerable to contamination due to its low hydraulic conductivity.
The DRASTIC Vulnerability Index (DI) ranged from 94 to 207. The distribution of the four defined vulnerability classes (low, moderate, high and very high vulnerability) is shown in the DRASTIC vulnerability map (Fig. 4). Almost 20% of the study area (mostly in the southern part) shows a very high vulnerability, influenced by the recharge related with high crop irrigation. A large part of the aquifer shows high vulnerability (53% of the study area). This is located along study area (north, central-west and edges at south zones), related to the high permeability of materials in these zones. Moderate and low vulnerability values are identified in the central-eastern part of the study area, where the water level is deeper and the hydraulic conductivity is lower. The DRASTIC vulnerability map shows that almost 70% of the study area has high and very high vulnerability. This result reveals that the “Aluviales Jarama-Tajuña” aquifer is highly vulnerable to contamination.
4.2 GOD vulnerability analysis
The spatial distribution of the classes defined for each GOD parameter is shown in Figs. 5a and 5b. The “Aluviales Jarama-Tajuña” aquifer is unconfined, according to lithological sections (Fig. 2b) and pumping test (Carreño Conde et al. 2014). Therefore, 100% of study area is unconfined aquifer and the map of groundwater occurrence (G) is defined by a single value equal to one (1) according to GOD method. The lithology of aquifer factor (O), equivalent to vadose zone factor in DRASTIC, varies from 0.5 to 0.7, as gravels, sands, clays and silts constitute 100% of study area. There is little variation of the permeability regarding the thickness of unsaturated material, which makes the area very vulnerable (Fig. 5a). As in the DRASTIC method, depth of groundwater factor (D) is low (depth is less than 10m in more than 70% of area), which is contributing to the high vulnerability (Fig. 5b).
The GOD vulnerability index ranged from 0.32 to 0.70. The map in Fig. 5c shows the distribution of the normalized and classified GOD vulnerability index. Almost 60% of the study area has very high and high vulnerability (40% and 20%, respectively). This occurs in three well-defined zones located in the south, central and north parts of the study area. The very high to high vulnerability is due to the join effect of the high permeability of the materials and the low thickness of the unsaturated zone. 36% of the study area shows moderate vulnerability, mainly in the central zone where the relatively high depth of groundwater decreases the possibility that the pollutant reaching the aquifer. Only 4.6% of area displays low vulnerability, which occurs at the lateral edge of the aquifer at east of the central zone and in the southwestern part of study area. There, the materials consist mainly of clays and gypsum that reduce the infiltration.
4.3 AVI vulnerability analysis
The hydraulic resistance values varied between 0 to 767 years (Fig. 6a). 68.2% of the study area shows very high vulnerability from north to south (only moderate to low vulnerability predominates in the southernmost part, Fig. 6b). The low values are related to the high hydraulic conductivity of the unsaturated zone, together with a low thickness (6m on average) of this layer.
4.4 K-means Cluster analysis
The parameters considered in the K-means cluster analysis are shown in Fig. 7.
The “n x d” data matrix was made of 5842 data points and six feature space (“D”, “R”, “C”, “Kv”, “Th” and “L”). After max-min normalization of database, the method resulted in an optimal number of three (3) clusters, proposed by 12 of 26 index.
The results of high dimensional K-means cluster analysis are summarized in Table 4.
Table 4 Variation of feature data in the three identified clusters (High dimensional dataset)
|
|
|
D (m)
|
R (mm/year)
|
C (m/d)
|
Kv(m/d)
|
Th(m)
|
L
|
Vulnerability
|
Cluster
|
points
|
%
|
Mean
|
Mean
|
Mean
|
Mean
|
Mean
|
Mean
|
1
|
2461
|
42.1
|
8.8
|
16.3
|
82.0
|
9.0
|
10.9
|
2.6
|
Low
|
2
|
2147
|
36.8
|
8.7
|
35.3
|
80.1
|
8.5
|
11.4
|
5.0
|
Moderate
|
3
|
1234
|
21.1
|
5.3
|
967.9
|
132.6
|
6.9
|
10.9
|
4.3
|
High
|
Cluster 3 includes the lowest values of depth of groundwater (D) and the highest values of recharge and hydraulic conductivity (R,C). In addition, land use (L), has a high value. All these conditions contribute to define high vulnerability. On the other hand, Cluster 1 represents the opposite scenario of low vulnerability with the lowest values of recharge (R) and land use (L). Cluster 2 shows moderate vulnerability with higher recharge (R) than cluster 1, but lower than cluster 3. Although land use (L) in cluster 2 has the highest value, it was very similar to cluster 3, Thus recharge (R) and land use (L) together contribute to define moderate vulnerability in cluster 2. Note that vertical permeability on unsaturated zone (Kv) and aquifer thickness (Th) did not influence vulnerability ranking, as they were very similar in all clusters.
4.4.1 K-means cluster by low dimensional analysis
To perform the K-means clustering in a low dimensional dataset, three of the six features were selected using PCA analysis (Table 5). According to the procedure described in the Materials and Methods section, the selected features explain more than 86% of the variance. The relevant features were net recharge (R), Depth of water table (D) and land use (L), in this order of importance calculated by their contribution. These three features, selected by PCA for 5842 points across the study area, produced the low dimensional data set.
Table 5
Eigen vectors and Eigen values, varimax component matrix and eigenvectors contribution obtained from the PCA. Bold numbers in eigenvectors represent the maximum eigen values associated to each parameter.
Parameters
|
PC1
|
PC2
|
PC3
|
D
|
0.1766423
|
-0.113552
|
0.9424516
|
R
|
-0.906886
|
0.334078
|
0.2368487
|
C
|
-0.124457
|
0.072504
|
-0.197747
|
Kv
|
0.0499861
|
0.0002084
|
0.0742895
|
Th
|
0.0089448
|
-0.083407
|
0.0905335
|
L
|
-0.358171
|
-0.929131
|
-0.05356
|
Standard deviation
|
0.413
|
0.3031
|
0.2198
|
Proportion of Variance
|
0.4746
|
0.2555
|
0.1344
|
Cumulative Proportion
|
0.4746
|
0.7301
|
0.8645
|
Contribution
|
1.63
|
1.53
|
1.6
|
The K-means cluster analysis on the low dimensional data set resulted in three clusters as the optimal number of clusters, as was the case for the high-dimensional dataset. The results of the low dimensional K-means cluster analysis are summarized in Table 6.
Table 6 Variation of features data in the three identified clusters (Low dimensional dataset)
|
|
|
D (m)
|
R (mm/year)
|
L
|
Vulnerability
|
Cluster
|
points
|
%
|
Mean
|
Mean
|
Mean
|
1
|
2461
|
42.1
|
8.8
|
16.3
|
2.6
|
Low
|
2
|
2147
|
36.8
|
8.7
|
35.3
|
5.0
|
Moderate
|
3
|
1234
|
21.1
|
5.3
|
967.9
|
4.3
|
High
|
The results of K-means cluster analysis on the low dimensional data set show the same behavior as the high dimensional data set. The clusters consist of the same number of points and represent the same vulnerability classes of vulnerability. Figure 8, shows the clustering vulnerability map.
4.5 Vulnerability method validation
The nitrate observation points were classified by their concentration in four categories (Table 7). The classified points have been projected onto the vulnerability maps (Figs. 4, 5c, 6b and 8).
Table 7
Nitrate pollution indicator (four classes) based on nitrate concentration in water quality monitoring wells.
|
Nitrate Concentration (mg/L)
|
|
< 12
|
12–25
|
25–50
|
> 50
|
Samples
|
5
|
10
|
3
|
5
|
Percentage (%)
|
21.7
|
43.5
|
13.0
|
21.7
|
Nitrate pollution indicator
|
low
|
moderate
|
High
|
very high
|
The graphical coincidences for high and low vulnerability and high and low nitrate pollution are noticeable in DRASTIC and K-means maps (Figs. 4 and 9). In contrast, GOD and AVI methods show less graphical agreement (Figs. 5c and 6b).
Table 8 shows the Spearman´s correlation coefficient between nitrate concentration samples and each method used to assess the vulnerability to contamination.
Table 8 Spearman correlation coefficient between nitrate concentration and vulnerability p-value of the studied methods.
Method
|
Spearman rank correlation (rho)
|
p-value
|
DRASTIC
|
0.34
|
0.049*
|
GOD
|
-0.50
|
0.007**
|
AVI
|
0.01
|
0.48
|
K-means (Low dimensional data set
|
0.48
|
0.019*
|
*Spearman test p−value<0.05 |
The vulnerability indices GOD and AVI vulnerability did not yield a valid correlation with nitrate concentration values. Better correlations were obtained by the DRASTIC and the K-means methods. However, the cluster analysis showed a better correlation with nitrate concentration, with higher correlation coefficients compared to those for DRASTIC method. K-means cluster analysis resulted in 48% of Spearman´s correlation coefficients. The p-values confirms that the best methods (DRASTIC, K-means) were statistically significant.
Figure 9 shows the percentage of area with very high, high, moderate and low vulnerability, depending on the applied assessment method, as well as the nitrate contamination range classes. The results obtained from the AVI method were completely different from the rest of the methods, as the AVI method assigned very high vulnerability to a large portion of the aquifer (more than 60% of the study area). This contrasting result is due to the fact that this assessment method only considers the travel time of the contaminant through the unsaturated zone. The low correlation of the AVI method with nitrate pollution (Table 8) shows that more characteristics need to be considered to obtain better or more adjusted vulnerability assessment. Thus, the AVI method is not suitable to be applied to an aquifer whose vulnerability is dominated by hydrological and hydrogeological features as net recharge, depth of water table and land use. The GOD method showed a negative correlation, meaning that the high values of nitrate concentration correspond with low vulnerability values and vice versa. This method does not take the aquifer recharge into account like the AVI method, which confirmed that recharge is a feature of paramount importance in the vulnerability assessment of the study area. In addition, the vulnerability assessed in the study area by the GOD method is strongly influenced by depth of water table over the other parameters considered in the methodology. The low correlation of GOD and nitrate concentration (Table 8) is due to the fact that the depth of groundwater in this case is not sufficient to define vulnerability zones, suggesting that in detrital aquifers is necessary to consider others parameters. DRASTIC resulted in a lower proportion of very high vulnerability, similar to the percentage of samples with very high nitrate contamination (around 19%). On the other hand, DRASTIC showed different proportions of high, moderate and low vulnerability compared to the percentage of samples of nitrate concentration classes (Fig. 10). Despite this, the Spearman`s correlation coefficient between the vulnerability index of DRASTIC and the nitrate concentration was higher than GOD and AVI methods (34%, Table 8), indicating that some of the parameters considered on DRASTIC method had a major influence on improving the vulnerability assessment in the aquifer. The K-means method showed the highest Spearman´s correlation coefficient between vulnerability classes and nitrate concentration (48%). This showed that it is important to select non-redundant parameters and, in this case, the most influencing parameters were net recharge, depth of groundwater and land use, as obtained by PCA analysis. Considering nitrate as an indicator contamination (Table 7, Fig. 10), almost 22% of the samples corresponded to the very high pollution class, with the nitrate concentration exceeding the recommended limit (50mg/L). These samples are located on high vulnerability values areas in the cluster map. Many water quality samples (43%) are indicative of moderate pollution (12–25 mg/L), the most numerous being located in the central zone of the aquifer coinciding with the moderate vulnerability zones in the K-means cluster map (where the water depth is higher and net recharge is lower).
K-means cluster analysis based on relevant features emerges as the best method to assess the vulnerability to pollution of a detrital aquifer, being more objective that the overlay index methods, which verifies the advantage of using this technique.