To investigate the WRF design storms' effect on flood hazard modelling, we used the nine design storms as input for the openLISEM model. The model results produced by using AB2yr and AB10yr are used as a benchmark. The flood model outputs are discussed in terms of flood hazard characteristics, including flood hydrographs, flood extent maps, flood areas, and flood impact on the number of buildings.
5.1 Flood hydrograph
Figure8 shows the resulting hydrographs at the Lubigi catchment outlet for 11 design storms. As shown in the figure, a similar low peak discharge is obtained when using AB2yr as well as the WRFL (design storm with the lower total rainfall amount and peak intensity. In contrast, the highest peak discharge is obtained when using the AB10yr and WRFU design storms, which have a higher total rainfall amount and peak intensity. In particular, a flood peak obtained at the catchment outlet when using WRF3U is about 70 m3/s, which is above the reference flood control structure's capacity (i.e., 67 m3/s).
5.2 Simulated flood extent
Table4 shows the calculated F scores for the simulated extent of the flood for 9 WRF design storms benchmarked with AB2yr and AB10yr. Compared to the AB2yr event, WRF1L, WRF2L, and WRF3L produce a better flood extent with higher F scores of 0.87, 0.94, and 0.94, respectively. Compared to AB2yr, WRFUs overestimate the flood extent, which results in a lower F score (Table4, first row). Considering the AB10yr event as a benchmark, WRF1M, WRF2U, and WRF3U produce a better flood extent with higher F scores of 0.85, 0.89, and 0.91, respectively. Compared to AB10yr, WRFLs underestimates the flood extent, which results in lower F scores (Table 4, second row). As shown in the table, the aggregate score decreases as we go from left to right (i.e., from WRFL to WRFU) when comparing with AB2yr) and vice-versa when comparing with AB10yr. The results indicate that for WRFL, the comparison with AB2yr is more appropriate, while for WRFU, the comparison with AB10yr is more appropriate.
5.3 Flood depth
In order to verify the applicability of the WRF design storms in producing flood depth maps used for flood hazard analysis, we compare flood depth maps produced when using the 9 WRF design storms with the results when using the IDF-AB storms based on a visual comparison of the maps. Figure 9 shows the depths of food water in the catchment area produced when using the WRF and IDF-AB design storms. As shown in the figure, following the topography of the catchment area, the low-lying areas and wetlands are flooded when using all design storms with flood depths varying between 0.5 to 2.6 m. However, as we go from WRFL to WRFU or as the return period increases, so too do the flood depths, as would be expected. Thus, maximum water depths of 2 m and above are simulated when using WRFU and AB10yr. The results are compatible with previous studies in the catchment (Sliuzas et al., 2013; Umer et al., 2019), whose results indicated that the wetlands of the catchment are fully flooded with design storms of typical 2-year events or more.
In Fig. 9, red circle a, we showed the relevant location used for comparison of WRFL versus AB2yr. When using all WRFL, the simulated flood depths are between 1.5 to 2 m, but with AB2yr, the flood depth is between 1.0 to 1.5 m at the same place, which is due to the lower cumulative rainfall amount of AB2yr compared to that of WRFLs. In comparing WRFU with AB10yr (at circle b), the simulated flood depths are above 2.0 m in all cases. However, the number of grid-cells flooded with flood depths of greater than 2.0 m is more in the case of WRF3U compared to AB10yr.
As the intensity for the flooding is often expressed as the maximum depth at any grid-cell, a frequency distribution of that would be directly interesting for flood hazard analysis. Toward this, we produced the histogram of the water depths versus its frequency and compared the flood depths differences at any grid-cell when using the WRF and IDF-AB storms. As a showcase, flood depth differences per grid-cell between the WRF3 design storms and the IDF-AB storms are given in Fig. 10. As shown in the figure, for WRF versus AB2yr (Fig. 10, top row), the results with WRF flood depths are slightly higher; hence, the histogram differences are skewed in the positive x-direction. However, when comparing WRF versus AB10yr, except for WRF3U, the histogram differences in water depths are negative. For instance, in the case of 'WRF3L - AB2yr', the flood depths differences per grid-cell are concentrated around zero with the frequency of 90 %, while for 'WRF3U - AB2yr', the flood depths difference is greater than zero and the frequency around the zero value is 40 % with its distribution spreads toward the positive x-axis. The figure also shows that the WRF results have little bias/slight overestimation of flood depths when using the lower and median quantiles and large differences of flood depths when using the upper quantile design storm with respect to AB2yr. The figure also shows that the flood depths when using WRF are underestimated at WRF3L and WRF3M and slightly overestimated water depths at WRF3U with respect to the AB10yr. It is important to note that the maximum flood depths differences per grid-cell for 'WRF3L – AB2yr' and 'WRF3U – AB10yr' is less than 0.2 with frequency distribution concentrated near-zero value, which indicates that the WRFL and WRFU design storm can be relevant for 2-year and 10-year return period flood hazard assessment, respectively.
5.4 Effects of flooding on buildings
To analyze the applicability of the constructed design storms for flood hazard modelling, we also compared the results in terms of flood effect on the building. The effect of the flood extent on the building is calculated considering the building's areal density of 90 m2. Table 4, row 4, shows the number of building affected by the flood extent (water depth > 0.1 m) when using 11 design storms. Notably, the number of buildings affected by the flood extent when using WRF1L, WRF2L, and WRF3L are 5058, 4425, and 4777, respectively, slightly higher than when using AB2yr (i.e., 4258). In contrast, more buildings are affected by flood extent when using AB10yr (7258) and WRFU (i.e., 5761, 6299, and 8223 for WRF1U, WRF2U, and WRF3U, respectively), which is characterized by higher total rainfall amount and peak intensity. In all cases, the number of buildings affected by flood extent is well correlated with the inundated areas (see Table 4, 3rd Row).
Moreover, model results also indicated that for all 11 design storms, the number of buildings affected by the flood is more at lower water depth (i.e., 0.1 – 0.5 m) and less at higher water depth (i.e., depths > 0.5 m) (see, Fig. 9). For instance, due to the flood depth ranges 0.1 – 0.5 m, the number of affected buildings is 3-9 times higher than at flood depths > 0.5 m. The results show that the maximum flood depth is more confined in the non-built-up areas represented by wetlands, consequently less effect on built-up.
5.5 Discussion
The study presents a new method to get a location-specific design storm based on WRF simulated high-intensity rainfall events, which proved to be suitable for flood hazard modelling in the data-scarce area. The method presented is flexible as it can be based on any desired combination of event magnitude and peak intensity. The magnitudes can be based on disaster mitigation plans of the stakeholders in the areas. However, while the magnitude is relatively straightforward to derive from a Gumbel analysis, the peak intensity may not be well known. A peak intensity could come from high-resolution rainfall measurement, or in the absence of that, from satellite imagery (30-minute intensity) or even an IDF curve analysis. All of these have associated uncertainty. Rainfall measurements and IDF curves may not have long time records, so selecting a characteristic peak intensity is less evident when time series are not very long. Besides, the construction of valid IDF curves relies on storm data. Peak intensity can also be derived from satellite imagery, and for instance, GPM-IMERG has a 30-minute time interval with global coverage dating back to the year 2000. Therefore, the time series derived from these images is already 20+ years, but while aggregated values (3-day and weekly totals) show good agreement with ground measurements, the 30-minute intensities do not show a high correlation in general (Fang et al., 2019; Chen et al., 2020).
In this study, the data are based on WRF, but operating WRF is not an easy task. The parameterization needs to be properly done and is area-dependent. Purely as a method to derive design storms, this is a large task. However, many meteorological services in countries use the WRF model or other weather models for weather forecasting, so good knowledge on the local parametrization of a weather model may be locally available.
Weather models do not produce pixel-precise results, i.e., the spatial patterns of rainfall do not coincide with ground-based measurements. The patterns are a result of complex atmospheric physics of the entire lower atmosphere, and the interaction with the earth's surface can still be improved (Ryu et al., 2016; Paul et al., 2018). This is not immediately a problem for flood hazard analysis, as a hazard is not based on a real event but is a simulation of a potential situation: for a given storm of a known size and probability of occurrence, the potential maximum effect (i.e., water level and extent) is simulated. Therefore, that event can be derived from anywhere as long as it is representative of the weather patterns of Kampala (in our case). Practically this was done by selecting the grid cells in the inner domain area as being representative, but this is only a practical choice. More research would be needed to determine which area can be considered representative for an area.