After library studies and knowing the situation of the studied area, in the first step, all the input data as the effective criteria in the hazard of flooding converted to the raster layers in ArcGIS Pro using various tools. The raster layers of seven criteria, including dem, slope, drainage density, NDVI, land cover, geology, and precipitation are shown in Fig. 2. Only land cover and geology have polygon structure among the raster layers, which were rasterized using the feature to raster tool. The value of the raster cells increases from blue to red in other raster layers with the structure of a point or polyline rasterized using the Euclidean distance, interpolation tool and kernel density. In other words, the red cells (except geology and land cover) indicated the high values. Among these layers, drainage density and NDVI were inversely related to the hazard of flooding. However, precipitation, dem, and slope were directly associated with the hazard of flooding (Ogato et al., 2020; Argaz et al., 2019).
After rasterization of the criteria, the rasterized layers were standardized by using different fuzzy membership functions based on the library and field studies. The standardized layers have values between 0 and 1. The numbers in these layers that are close to 1 indicate a high risk of flooding (Fig. 3). It shows that the precipitation criterion in the Darab watershed's center, which has the most population, has a considerable proclivity to flood. Precipitation criteria such as slope, dem, land cover, NDVI, and geology had a low potential for flash floods at the survey boundary's center, with the greatest potential for flooding towards the study area's margin. The majority of the survey borders demonstrated a high desirability to flood risk in the fuzzy layer of drainage density. Table 6 indicates the final weight of each of the criteria selected in this study. The criteria of drainage density, slope, land cover, precipitation, geology, dem, and NDVI had the highest weights, respectively. Drainage density, slope, and land cover, with a final weight of 0.2737, 0.1716, and 0.1533, respectively, had the highest value among the criteria, which had the greatest effect on overlaying the layers. In the fuzzification layer of drainage density, which is the most important among the variables, more pixels of the study area were introduced as the areas with the flood hazard compared to the other variables, excluding NDVI. Unlike in the slope layer, fewer pixels are at risk of flooding, which is mostly located in the northern regions of the study area.
Table 6
Significance weights of the main criteria and sub-criteria
Main criteria | W1 | Sub-criteria | W2 | Final Weight |
Hydroclimate | 0.411 | Drainage Density | 0.666 | 0.2737 |
| | Rain | 0.333 | 0.1368 |
Topography | 0.261 | Dem | 0.333 | 0.0858 |
| | Slope | 0.666 | 0.1716 |
Land type | 0.327 | Land cover | 0.475 | 0.1553 |
| | Geology | 0.415 | 0.1357 |
| | NDVI | 0.109 | 0.0356 |
The final layers obtained from overlaying by five fuzzy operators are shown in Fig. 4. The red and blue spots on the output maps were considered to be places with a high and low risk of flooding, respectively. Finally, the zonation of flooding potential was created by overlaying the weighted fuzzy layers by using different fuzzy operators, which is the best method for zoning the hazard of flooding based on the conducted studies (Mngutyo et al., 2013; Ogato et al., 2020; Kazakis et al., 2015). Other operators, except Or and Sum, which showed illogical results based on the flooding, indicated that the highest flood hazard is related to the north and south of the study area near the altitudes of the Darab watershed. Among these operators, the Gamma operator (0.9) showed the best and most logical result compared to the other operators, based on the value range of the cells.
Figure 5A indicates the total area of the regions with a high hazard of flooding is approximately 63432.7344 hectares, which contains pixels with a value above 0.5 and shows the red and orange spots. Moreover, the areas with moderate flood hazard (0.15–0.5) include 86204.2969 hectares of the Darab watershed. The highest area is related to the values less than 0.015, with a total area of 90131.8281 hectares, which shows the blue and safe spots on the map. As shown in Fig. 5B, the majority of the residential areas within the study area are in areas with a low to moderate hazard of flooding. The non-hazardous pixels were located at the left and right of the study area (Fig. 5A) and were removed from the next analysis of the study due to the absence of residential areas and lack of flooding in these regions. In Fig. 5A, areas with high, moderate, and low hazard include 63280.2188, 85809.4375, and 5909.7344 hectares of the secondary boundary, respectively. The georeferenced points were used for verifying the obtained results. These points were collected from the studied area using the GPS device and field visits from flooding areas under the supervision of the experts and locals during the rainy years (recent 10 years). Every 60 GPS points were converted to the ship file format in ArcGIS software and overlaid on the final zone of flooding hazard (Fig.5B).
Based on the results of the verification, most of the GPS points were located in areas with a high hazard of flooding. Each point indicates a significant and somewhat damaging flood in its surrounding areas. Based on the range of values of each zone (Fig. 4) and the ground truth map, the Gamma operator (0.9) was the best selection for the final zone of flooding hazard (Fig. 6).
In the study area, the total residential areas include 302 villages, most of which, especially the places showing the highest density on the map and considered the most populous areas, are probably located in moderate and low-hazard areas. To ensure this issue, the kernel density estimation tool was used to estimate the flood hazard density in residential areas after performing the kernel density estimation on residential areas (Fig. 7).
In Fig. 7, the darker blue pixels indicate the higher density of flood hazard in the study area. In other words, the dark blue spots in the Darab watershed, which are mostly located in the center of the boundary, show a high hazard of flooding in residential areas. However, it cannot be certainly said that high-hazard residential areas are located in the center of the study area since, in addition to the weight (flood hazard) of residential points, kernel density estimation considers the location of points and their distance from each other for estimating the density. Therefore, global and local Moran analysis was used in the next steps to further ensure. As shown in Fig. 8A, the spatial distribution pattern of unweighted population points is clustered.
In this analysis, the nearest neighbor ratio is above zero (0.607) and P-value is equal to zero, which indicates that the spatial distribution pattern of unweighted residential points, i.e., the spatial distribution pattern was evaluated only based on the location of points and their distance from each other, is significant and clustered (Z-score is equal to -13.05). Moreover, in Fig. 8B, the distribution pattern of weighted residential areas with flooding hazard is clustered. Based on the results indicating the highest degree of significance (P-value = 0), Moran's index is above zero (Moran's index = 0.359) and the Z-score is equal to 10.95, showing the spatial distribution pattern of the weighted residential areas is clustered.
Hotspots analysis (local Moran) was used to identify the high-risk and low-risk clusters after ensuring the clustering of the distribution pattern of residential areas (unweighted and weighted). After performing hotspot analysis on the weighted residential areas with flooding hazards, Fig. 9A was obtained, which indicates three categories of low-risk clusters or cold spots with a confidence level of 90, 95, and 99%, which are shown in blue. Three categories of high-risk clusters or hotspots with a confidence level of 90, 95, and 99% exist in this figure, which are indicated in red. Furthermore, there is a category without a confidence level, which is displayed in gray and smaller spots. Low-risk clusters of residential areas relative to flooding with a 99% confidence level are located in the west and east of the study area (blue). However, high-risk clusters of flooding with a 99% confidence level are dispersed in the north and northeast (red) of the study area at higher altitudes. The other points are not statistically significant and do not show a reliable result.
Cluster and outlier analysis were used to identify the high-risk and low-risk clusters of residential areas more accurately. In Fig. 9B, two categories, including High-High and Low-Low clusters, indicated the high-risk and low-risk clusters, respectively, whose distribution is similar to the output of hotspot analysis. Moreover, two categories, including High-Low outlier and Low-High outlier, show the outliers. The first category indicates that high-hazard residential areas were surrounded by low-hazard residential areas that were flooded. However, no residential properties with these characteristics were observed in this study. The second category (Low-High outlier) shows the residential areas with low flood potential around which high flood hazard residential areas are located. Such residential areas can be considered areas at high flood risk due to their proximity to areas with a high hazard of flooding. In this study, a residential point called Fatah al-Mubin town is located in this category. Therefore, the residential clusters in the north and northeast of the watershed (with a population of less than 5000 people) are at risk of flooding. In this regard, necessary measures should be taken by managers, environmental planners, and municipalities.
Two effective indices on the output of this analysis, including Z-score and P-value, were categorized using Symbology for assessing the results of the hotspot analysis in more detail. In Fig.10A, the points with Z-score higher than 2.5 and lower than − 2.5 indicate the high-risk clusters and low-risk clusters of hotspot analysis with 99% confidence. The P-value shows the significance of the identified clusters. The confidence level of clusters decreased when the distance from 0 to 1 was increased. As shown in Fig. 10B, blue points indicate the high-risk and low-risk clusters of flooding with a 99% confidence level.
Figure 10. Hotspot analysis of flooding areas, A: Z-score, B: P-value
Land cover is a key factor in the occurrence of floods. Precipitation in the land without vegetation flows quickly on the ground compared to forest areas. Therefore, severe runoff flows in some land areas (for instance, a high percentage of urban) compared to the similar areas covered by forest and grass (Bakhtyari Kia 2012). 14 Land cover variable, indicating the type of land (man-made or natural) for a specific application such as urban, agricultural, industrial, rangeland, and forest, is one of the most important factors affected by the flood hazard. The map of flooding obtained by overlaying fuzzy operators was zoned in the studied area (Fig. 11).
After applying the Zonal Statistics analysis in this zoning, the maximum rate of flooding hazard exists in poor and moderate grassland covers (blue polygons), which are located in the marginal areas of the study area. Moreover, the minimum level of hazard of flooding was observed in the urban, agricultural, forest, and garden uses (yellow, orange, and red polygons) in the boundary center and near the populous centers of the Darab watershed.