As one of the most destructive geological disasters on Earth, earthquakes commonly induce a large number of landslides (Gorum et al., 2011; Li et al., 2014). For example, May 12, 2008, the Wenchuan earthquake in China (Mw 7.9) triggered hundreds of thousands of landslides, causing a lot of human casualties and economic losses (Xu et al., 2013). As early as the 1970s and 1980s, studies have been conducted on the mechanism, process, and impact of earthquake-induced landslides (Pain, 1972; Garwood et al., 1979; Pearce et al., 1986; Hasegawa et al., 1987). Since the Chi-Chi earthquake in Taiwan in 1999, many studies of the mechanism and distribution characteristics of earthquake-triggered landslides have been conducted (Shou et al., 2003; Lin et al., 2004; Tang et al., 2009). These studies have revealed the mechanism of earthquake-induced landslides and the landslide distribution characteristics from different perspectives.
From the 1900s to the 2010s, much effort has been devoted to investigating the relationship between earthquake-triggered landslides and the possible impact factors (Keefer, 1984; Rodriguez et al., 1999; Mahdavifar et al., 2006). These studies have shown that the factors that have a great influence on earthquake-triggered landslides are the topographical factors (Yagi et al., 2009; Yuan et al., 2015; Serey et al., 2019), the geological structure (Mahdavifar et al., 2006; Sato et al., 2007; Huang et al., 2009), the ground shaking parameters (Meunier et al., 2007; Nowicki et al., 2014; Huang et al., 2021) . Many scholars have used remote sensing images from before and after the earthquake to identify post-seismic landslides and generate earthquake-triggered landslide inventories (ETLIs) (Sato et al., 2007; Wartman et al., 2013; Valkaniotis et al., 2018; Ferrario, 2019). Based on the above background, some scholars have constructed spatial prediction models for earthquake-triggered landslides using machine learning method and obtained good results. For example, Niu et al. (2014) used an improved support vector machine model to identify and predict the landslides triggered by the Lushan earthquake in China in 2013 and evaluated the susceptibility to landslides in the area of high seismic intensity. Chuang et al. (2021) selected the 1999 Chi-Chi earthquake in Taiwan and used the logistic regression (LR) algorithm for machine learning modeling. They then tested the model using data from the 1998 Jueili earthquake in Taiwan and gave suggestions for probability threshold. Fan et al. (2021) used a random forest (RF) model to track the landslides that occurred after the 2008 Wenchuan earthquake over ten years and established multi-temporal landslide susceptibility maps.
However, there can often be earthquake sequences or a large earthquake followed by many aftershocks over a short period in areas with high earthquake occurrences. One ETLI results from a continuous influence of multiple shakes, and there can be many slopes that suffer more than one failure under the impact of numerous shocks. For instance, Ferrario (2019) constructed two landslide inventories for the landslides that occurred in the first and second halves of the earthquake sequence in Lombok, Indonesia, in 2018. However, the previous studies used the landslides after the earthquake sequence to build an overall spatial prediction model and did not consider predicting multiple landslides in the earthquake sequence. If an ETLI is generated soon enough right after the first shock, can we predict the possibility of secondary landslides once subsequent shocks occur over a short period using machine learning? Is there any difference between the machine learning models for short-term landslide prediction and overall spatial prediction?
In response to the above questions, we chose two ETLIs for the 2018 Lombok sequence built by Ferrario and used two different strategies to perform machine learning modeling. One approach is to use the ETLI for the first half of the earthquake sequence for training and the ETLI for the second half for testing. The other strategy was to merge the two ETLIs and use 10-fold cross-validation to evaluate the prediction results. Terrain information, vegetation coverage, and seismic parameters were used as input factors for the model training. We selected three different types of commonly used machine learning methods—logistic regression (LR), random forest (RF), and neural network (NN)—to eliminate the difference between the results of the two strategies caused by the algorithm. The NN used a basic single-layer artificial neural network (ANN) and a multi-layer deep neural network (DNN), so there was a total of four different algorithms. Thus, eight machine learning models were established for the two strategies using these four algorithms. In this paper, we compare the differences in the prediction results from four models under the two strategies and analyze the reasons for the differences in the results of the two strategies.