Earthquake disasters have a profound impact on human living environment due to their suddenness and destructiveness. Severe casualties, house collapse and economic loss would be caused under the action of intense seismic ground motion. Strong earthquakes have continued to appear worldwide in the past two decades (Rossetto et al., 2007; Lara et al., 2016; Shimada, 2016). People injured or killed by the earthquake could range from a few to tens of thousands of people, distributed in different spatial locations (Zhao et al., 2018). Although the concerns of seismic problem continue to deepen and the seismic awareness of human is constantly enhanced, within the past decades, the active activities of geological structures are still affecting the environment of anthroposphere (Sun et al., 2016; Wu et al., 2020; Santos-Reyes and Gouzeva, 2020; Luo et al., 2021). Due to the unpredictability of earthquake occurrence, it is difficult to prepare before earthquake, so countries are committed to improve the emergency rescue ability after earthquake (Huang and Li, 2014). Among the various types of earthquake disaster information, modeling the earthquake casualty is particularly important for offering reference for emergency rescue and decision making. Casualty evaluation after earthquake is fast becoming an important issue increasingly responsible for significant economic, social, and environmental risk management (Huang and Huang, 2018).
The disaster under intense seismic motion is a complex result of various influencing factors. Seismic intensity, topography, population and economic level are all related to casualties due to earthquake to a certain extent. The traditional physical model or statistical regression model is difficult to reflect the nonlinear relation between earthquake-hit population and factors (Erdik et al., 2011). With the continuous improvement of computing speed in recent years, machine learning methods have been more widely used. More and more scholars apply them for disaster mapping under earthquake considering that machine learning methods could provide the ability to learn from historical data for producing insight into extreme events (Yang et al., 2015; Choubin et al., 2019; Pourghasemi et al., 2019; Jena et al., 2020; Hou et al., 2020; Si and Du, 2020; Luo et al., 2020). Aghamohammadi et al. (2013) used artificial neural network (ANN) for estimating the human loss of building damage under earthquake based on the data of the 2003 Bam earthquake. Huang et al. (2015) proposed a robust wavelet (RW) v-SVM (support vector machine) earthquake casualty prediction model. The factors including earthquake magnitude, intensity, population density, pre-warning level, in-building probability, location of occurrence, supply support and building collapse ratio were considered. It was concluded that RW v-SVM model had higher prediction accuracy and quicker learning than standard SVM and neural network. Gul and Guneri (2016) built up an ANN model for casualty prediction taking occurrence time, magnitude, population density as factors. Data of 21 earthquakes in Turkey were collected as samples for network training. Huang et al. (2020) established Extreme Learning Machine (ELM) network to predict earthquake casualty based on the data of 84 groups of earthquake victims in China. It was found that the ELM algorithm had better robustness and generalization capability than BP neural network and SVM. It can be noticed that the existing researches focused on the prediction of population numerical value affected by earthquake considering factors of multiple dimensions. Moreover, the accuracy and performance of different machine learning methods were compared based on the evaluation results of earthquake casualty. However, the input layer of different machine learning methods used numerical data without spatial information, and the spatial characteristics of disaster information of output layer have not been evaluated effectively. For earthquake emergency management, the spatial distribution of disaster information within the earthquake affected area has a greater significance for the formulation of detailed rescue plans.
The generalization capability of network refers to the ability to obtain accurate output when inputting new data other than training samples. Generalization capability is the most important index to measure the performance of network. The complexity of structure and samples are the main factors affecting the generalization capability of model. Research of Partridge (1996) on three-layer neural network found that the influence of training set on generalization capability is great, even more than the influence of neural number. Many researchers combine principal component analysis (PCA), clustering analysis and other methods with machine learning to optimize the training set aiming to improve the generalization capability of the network (Basharat et al., 2016; Li et al., 2020). Lou et al. (2012) used PCA to reduce the dimension of assessment factors, disaster-formative environment and disaster-affected bodies, and established a BP neural network to assess the economic loss under tropical cyclones in Zhejiang Province. A combined use of PCA and ANN was adopted by Gao et al. (2020) to evaluate the personal exposure level to PM2.5, and it was found that the combined use of PCA and ANN produced more accurate results than simple ANN method. It can be seen that optimizing the input samples of network could improve the generalization capability. Most of the existing sample optimization methods are based on statistical analysis on numerical dimensions. The distribution of influencing factors and training results in the spatial dimension are also related. Sample optimization based on spatial correlation characteristics might provide a novel solution to improve the generalization capability.
The study presented herein aimed at effective evaluating the spatial distribution of earthquake disaster information in each county. The earthquake-hit population spatial distribution was selected as the study content and evaluated based on correlation characteristics of influencing factors and BP neural network, using data from the 2013 Ms7.0 Lushan earthquake. The selection of samples was optimized based on the spatial characteristics resulting from correlation analysis, to improve the generalization capability of network and accuracy of evaluation results.