Location and Research Data
The eastern side of Caochangmen Street, was selected as the study area (Fig. 1). The coordinates of the study area are 32.060°-32.070° N; 118.747°-118.757° E, in Gulou District, Nanjing, Jiangsu Province, China, with an area of approximately 683 644 m2. The District has relatively abundant rainfall, with an average annual precipitation of ~1106 mm, and the overall topography is gentle. Because it is in the central business district, the nearby residential communities and schools were densely distributed, and the buildings and hardened road surface areas were saturated; therefore, urban flash-flooding often occurred. An analysis of the drainage network distribution characteristics indicated that the area formed a relatively independent catchment area that was not easily affected by external runoff. Additionally, two sets of road-ponding monitoring and warning devices were distributed in the area, conducive to case-study inference verification.
This case study used data provided by the local government bureau, including flood-prone-point monitoring and rainfall-station observation data from 2016 to 2019; 5 × 5 m digital elevation model, land use, road building, and other basic geographic data; and water conservation facility, and drainage network data. The data were divided into three major categories: hydro-meteorological, basic geographic, and water conservancy and drainage and pre-processed for use in building a case analysis for the UFD-GOM construction. The rainfall observation data from 2016 to 2019 were used as a set of historical case data, and a typical urban flooding geographic scenario that occurred on July 17, 2020, was selected as a reference case study.
UFD-GOM Framework
Urban flooding disasters include rainfall, surface, and drainage systems (Fig. 2); each system had a distinct hierarchy and sufficient elements, based on which the UFD-GOM with rainfall, surface and drainage ontologies as its core was built¾as a “triple-core” ontology model. Therefore, it was critical to clarify the system framework before constructing the UFD-GOM. Rainfall ontology can create a response in surface ontology and, in turn, surface ontology can create a response in the drainage system, forming a logical spatial continuity. The hierarchical structure of the three urban flooding ontology scenarios was subdivided to create a framework for constructing the UFD-GOM (Fig. 2.).
Depending on the application scenarios, the ontologies were divided into task ontologies that focused on the application of specific tasks (e.g., emergency disaster relief tasks) and domain ontologies that focused on the analysis of specific domains (e.g., disaster processes) (Zhong et al. 2017). UFD-GOM construction methods differ between ontologies; domain ontology construction methods were used in this study. Additionally, based on previous research (Chang et al. 2004), this study proposed six principles for UFD-GOM construction as follows:
(1) Definitional clarity¾the concept of word selection should be clear, the definition should be objective and unambiguous, and standard terminology should be used whenever possible.
(2) Logical consistency¾the ontology model should be logically smooth, and the meaning of the inference results consistent.
(3) Semantic constraint¾the semantic distance maintained between concepts at each level should be minimal, and information should be conveyed with the least number of words, without semantic overlap.
(4) Geospatiality¾the constructed ontology should be based on a specific geographical location, and the data should have a suitable coordinate system.
(5) Human-geographical integration¾the ontology should be constructed considering the influence of anthropomorphic activities as much as possible.
(6) Extensibility¾suitable space should be reserved for future needs or applications that can be directly extended to existing concepts.
The construction of the UFD-GOM was divided into three main elements: attributes, classes, and relationships, in which the attributes were descriptions of classes that can constrain relationships, and relationships were bridges between connected classes. The key to constructing the UFD-GOM (Fig. 3) was to first define and organize the classes within the geographic scenario. Classes, often referred to as concepts, are the core of an ontology; they refer to a collection of elements with similar properties and are normative and clear descriptions of the knowledge of the domain they cover. Being the top class, urban flooding (Fig. 2) was used as the starting point to define the rainfall, surface, and drainage classes in the next layer, and these, in turn, to define the natural element, human element, rainfall information, and drainage facility classes, in the next layer, and so on. Because urban flooding ontology involves complex anthropomorphic factors, it is necessary to define it using expert evaluation and narrative lists in this field to obtain comprehensive and distinct classes.
Relationships play an important role in urban flooding hazard ontology as links between layers and the elements within layers. The relationships in urban flooding disaster ontology include five primary categories: temporal, spatial, topological, and semantic relationships, and element interrelationships. Temporal relationships refer to the momentary state of the disaster event and the process of dynamic evolution associated with the relationship; spatial relations indicate the spatial geographic relationship between elements within the disaster scene, and together with temporal relationships formed the basis of the relationship descriptions; elemental interrelationships describe the qualitative or quantitative relationships between elements in the disaster scene in time or space (Lv et al.2017), and were the core of the relationship definitions. Topological relationships, also termed geometric relationships, refer to static or dynamic structural relationships such as adjacency, connectivity, and inclusivity between elements in a disaster scene. Semantic relationships represent the relationships between human cognition and the description of the geographic features of a disaster event or scene and are key to realizing the relationship descriptions.
Table 1
Attribute Concept Definitions
Attribute
|
Data type
|
Interpretation of attributes
|
WaterloggingID
|
long
|
Event unique identifier
|
WaterloggingName
|
string
|
Event name
|
RainfallIntensity
|
double
|
Rainfall intensity
|
StartTime
|
datatime
|
Rainfall start time
|
EndTime
|
datatime
|
Rainfall end time
|
Elevation
|
float
|
Elevation value
|
BridgeName
|
string
|
Bridge name
|
AffectedPeopleNumber
|
long
|
The number of victims
|
ReliefWorkersNumber
|
long
|
The number of rescuers
|
PlaceName
|
string
|
Location of the event
|
PlaceLongitude
|
double
|
Longitude
|
PlaceLatitude
|
double
|
Latitude
|
RainwellNumber
|
long
|
The number of rainwater well
|
The attributes are quantitative descriptions of each class within the urban flooding scenario and were divided into three main parts: class name, data type, and semantic information (Table 1) The name of the class is the name of each element, such as BridgeName, RainwellID, and AffectedPeopleNumber; the data type includes long, string, float, double and datatime; and the semantic information is the interpretation of the class. For example, the semantic information for StartTime is the rainfall start-time and can be interpreted in computer language to facilitate human-computer communication.
Bayesian Reasoning Model
Ontology inference is an application extension of ontology and an important means of verifying the constructed ontology (Li et al. 2014). The current inference methods for implementing ontologies primarily include neural networks, Bayesian networks, word graph matching, and concept similarity. Many researchers have favored Bayesian networks as a flexible probabilistic graphical model with rigorous mathematical expression logic and causal inference (Bruce et al. 2019; Nam et al. 2020). It has been favored by many researchers (Liu et al. 2014; Shi et al. 2020). The introduction of Bayesian networks to the construction of UFD-GOMs enables the formal representation and application of ontologies and contains three modules: variable definition, structure learning, and parameter learning.
Variable definition refers to the assignment of values to relevant concepts and causal logic in the UFD-GOM as a means of inferring uncertain variables; therefore, the range of values of the variables should be determined. The variables in the UFD-GOM were plotted as a directed graph, in which the likely influencing factors were screened as node variables. Structure learning is the core of the Bayesian network UFD-GOM construction; its main purpose is to determine the topology, which directly reflects the dependency or independence among variables, and facilitate screening and error correction. Parameter learning refers to calculating the posterior probability distribution of parameters based on their prior probability distribution and data sample sets. Because Bayesian networks tend to deal with discrete probability data, the data should first be discretized to calculate the conditional distribution of each node. Based on the Bayesian network topology structure, the parameters were learned by training the sample dataset, and the conditional probability distribution among the variables was obtained using the maximum likelihood estimation algorithm to determine the accuracy of the inference. The specific calculation formula is as follows.
Suppose the sample set D= {\({x}_{1},{x}_{2},{x}_{3}, \dots ,{x}_{n}\)} ,and the sampl, s were independent of each other, so:
\(P\left({A}_{i}\right)=\sum P\left(\text{B}\right)P\left({A}_{i}|B\right)\)
|
(1)
|
|
\(P({A}_{1},{A}_{2},{A}_{3}, \dots ,{A}_{n})={\prod }_{i=1}^{n}P\left({A}_{i}\right|B)\)
|
(2)
|
We obtain the Bayesian formula: |
\(P\left({A}_{i}|B\right)=\frac{P\left({A}_{i}\right)P\left(B|{A}_{i}\right)}{\sum _{j=1}^{n}P\left({A}_{j}\right)P\left(B|{A}_{j}\right)}\)
|
(3)
|
where \(P\left(B\right)\)denotes the prior probability of event B, that is, the normalization constant; \(P\left({A}_{i}\right)\)denotes the nodal probability of event A, which is independent of event B; \(P({A}_{1},{A}_{2},{A}_{3}, \dots ,{A}_{n})\)denotes the joint probability of event A. \(P\left({A}_{i}|B\right)\)denotes the conditional probability of A after the known occurrence of B, that is, the value of B and is called the posterior probability of A; \(P\left(B|{A}_{j}\right)\)denotes the conditional probability of B after the known occurrence of A.