A schematic diagram of the proposed research framework is presented in Fig. 1. First, outliers in the original water demand time series are identified using the 3σ criterion, and the identified outliers are smoothed with the weighted average method. Then, according to the seasonal characteristics of the series, the Seasonal and Trend decomposition using Loess (STL) method (Antunes et al. 2018; Chen et al. 2022) is adopted to extract the seasonal, trend and residual features of the smoothed series. The time series is thus decomposed into three components, the trend series Tt, the seasonal series St and the residual series Rt, as shown in Fig. 1. Considering the strong forecasting capability of LSTM and the fact that AdaBoost combines base learners with independent error distributions, a forecasting method that couples the two, named AdaBoost-LSTM, is developed to mitigate the overfitting of LSTM and further improve forecast accuracy. The three decomposed series serve as the inputs of three separate AdaBoost-LSTM models. The first model, displayed on the left of Fig. 1, forecasts the trend of the original data; the middle one extracts and predicts the seasonal features over time; the right one is designed to strengthen the model's ability to capture peaks in the forecast. Finally, the outputs of the three AdaBoost-LSTM deep learning models are summed to obtain the final water demand forecast. The overall model is named the STL-Ada-LSTM model.
Identification and Processing of Outliers
Outliers in a time series, depending on their nature, may have a moderate to significant impact on model forecasts (Chen and Liu 1993). To guarantee data reliability, the 3σ criterion is used to identify outliers in the original water demand series Xt. Under the 3σ criterion, Xt is constrained to a 99.73% confidence interval (Du et al. 2021), and any outliers are smoothed to meet this standard with the weighted average method of Eq. (1):
\({E_t}={\theta _{t - k}}{x_{t - k}}+ \cdots +{\theta _{t - 1}}{x_{t - 1}}+{\theta _{t+1}}{x_{t+1}}+ \cdots +{\theta _{t+k}}{x_{t+k}}\) (1)

where \({\theta _{t - k}}\) and \({x_{t - k}}\) represent the weights and the historical data near the outlier, respectively; k is a positive integer and \({E_t}\) is the smoothed value of the outlier. After processing, all data fall within the band \(\left[ {{\mu _t} - 3{\sigma _t},{\mu _t}+3{\sigma _t}} \right]\), where \(\mu\) and \(\sigma\) represent the mean and standard deviation of the original water demand series, respectively (Alvarado-Barrios et al. 2020).
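To make this preprocessing step concrete, the following Python sketch detects outliers with the 3σ criterion and replaces them according to Eq. (1). Since the weights θ are not specified here, uniform weights over the 2k neighbours are assumed purely for illustration.

```python
import numpy as np

def smooth_outliers_3sigma(x, k=2):
    """Replace points outside [mu - 3*sigma, mu + 3*sigma] with a
    weighted average of the 2k neighbouring observations (Eq. 1).
    Uniform weights are an illustrative assumption."""
    x = np.asarray(x, dtype=float).copy()
    mu, sigma = x.mean(), x.std()
    lower, upper = mu - 3 * sigma, mu + 3 * sigma
    outlier_idx = np.where((x < lower) | (x > upper))[0]
    for t in outlier_idx:
        # neighbours on both sides of the outlier, excluding t itself
        idx = [i for i in range(t - k, t + k + 1)
               if 0 <= i < len(x) and i != t]
        weights = np.ones(len(idx)) / len(idx)   # theta_{t-k}, ..., theta_{t+k}
        x[t] = np.dot(weights, x[idx])           # E_t, Eq. (1)
    return x
```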
Seasonal and Trend Decomposition Using Loess
Seasonal and Trend decomposition using Loess (STL) is a time series decomposition method based on locally weighted scatterplot smoothing (loess) (Cleveland et al. 1990). The time series is decomposed into three additive components, seasonal St, trend Tt and remainder Rt: \({X_t}={S_t}+{T_t}+{R_t}\). Compared with traditional seasonal decomposition techniques, such as X-12-ARIMA and the ratio-to-moving-average method, STL provides more robust results (Xiong et al. 2018). Because short-term water demand time series are characterized by seasonality and instability (Antunes et al. 2018), STL is well suited to them. Moreover, the method is not as complicated as the discrete wavelet transform, which requires careful tuning. STL is an iterative method consisting of two recursive procedures, an inner loop and an outer loop; the detailed steps are described by Xiong et al. (2018).
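As an illustration, STL is readily available in the statsmodels Python library. The sketch below assumes a daily demand series with a weekly (period = 7) cycle and a hypothetical file name; neither is prescribed by this study.

```python
import pandas as pd
from statsmodels.tsa.seasonal import STL

# Hypothetical input: a daily water demand series with a weekly cycle.
demand = pd.read_csv("water_demand.csv", index_col=0,
                     parse_dates=True).squeeze()

stl = STL(demand, period=7, robust=True)  # robust=True enables the outer loop
result = stl.fit()
trend, seasonal, residual = result.trend, result.seasonal, result.resid
# Additive reconstruction: demand == trend + seasonal + residual
```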
Deep Learning Using Long Short-Term Memory
Long Short-Term Memory (LSTM) is controlled by three gates, i.e., an input gate, an output gate and a forget gate, which form a self-looping update. The forget gate (f) decides how much information is kept and passed to the next stage, the input gate (i) decides how much new information is added, and the output gate (o) updates the system state using the information and cell state (c) produced by the previous two gates. The LSTM update process is briefly illustrated as follows (Han et al. 2019):
$${f_t}=\sigma \left( {{W_f} \cdot \left[ {{h_{t - 1}},{x_t}} \right]+{b_f}} \right)$$ (2)

$${i_t}=\sigma \left( {{W_i} \cdot \left[ {{h_{t - 1}},{x_t}} \right]+{b_i}} \right)$$ (3)

$${o_t}=\sigma \left( {{W_O} \cdot \left[ {{h_{t - 1}},{x_t}} \right]+{b_O}} \right)$$ (4)

$${\tilde {c}_t}=\tanh \left( {{W_c} \cdot \left[ {{h_{t - 1}},{x_t}} \right]+{b_c}} \right),\quad {c_t}={f_t} \times {c_{t - 1}}+{i_t} \times {\tilde {c}_t}$$ (5)

$${h_t}={o_t} \times \tanh \left( {{c_t}} \right)$$ (6)
where \({W_f}\), \({W_i}\), \({W_c}\), \({W_O}\) denote the weight matrices of the forget gate, input gate, cell state and output gate, respectively; \({b_f}\), \({b_i}\), \({b_c}\), \({b_O}\) are the corresponding bias terms; h represents the hidden state, x the input and t the time step; σ denotes the sigmoid function and "×" denotes point-wise multiplication.
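For clarity, a minimal NumPy sketch of one forward step of an LSTM cell, implementing Eqs. (2)-(6) directly, is given below; the dictionary-based parameter layout is an illustrative choice rather than part of the method.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One forward step of an LSTM cell, following Eqs. (2)-(6).
    W and b are dicts of gate weight matrices and bias vectors;
    each W[g] has shape (hidden, hidden + input)."""
    z = np.concatenate([h_prev, x_t])        # [h_{t-1}, x_t]
    f = sigmoid(W["f"] @ z + b["f"])         # forget gate,   Eq. (2)
    i = sigmoid(W["i"] @ z + b["i"])         # input gate,    Eq. (3)
    o = sigmoid(W["o"] @ z + b["o"])         # output gate,   Eq. (4)
    c_tilde = np.tanh(W["c"] @ z + b["c"])   # candidate cell state
    c = f * c_prev + i * c_tilde             # cell update,   Eq. (5)
    h = o * np.tanh(c)                       # hidden state,  Eq. (6)
    return h, c
```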
The Back Propagation Through Time (BPTT) algorithm is used to train the LSTM model (Tepper et al. 2016). First, the outputs of each memory cell in the LSTM are computed in a forward pass, and the error of each cell is then computed in a backward pass. The gradients of each weight matrix and bias are derived from these errors and fed into an optimization algorithm, such as Stochastic Gradient Descent (SGD) or Adam (Tepper et al. 2016). In this study, the Adam optimization algorithm is applied to train every LSTM of the ensemble model.
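In practice, deep learning frameworks apply BPTT automatically when fitting recurrent layers. A minimal sketch of one LSTM base learner trained with Adam and the mean-squared-error loss is shown below using tensorflow.keras; the window length, layer size, learning rate and epoch count are illustrative assumptions, not the tuned values of this study.

```python
from tensorflow import keras

def build_lstm(window=24, n_units=32):
    """One LSTM weak learner: a single recurrent layer plus a
    linear output; sizes are illustrative assumptions."""
    model = keras.Sequential([
        keras.layers.Input(shape=(window, 1)),
        keras.layers.LSTM(n_units),
        keras.layers.Dense(1),
    ])
    # Keras performs BPTT internally when training recurrent layers.
    model.compile(optimizer=keras.optimizers.Adam(learning_rate=1e-3),
                  loss="mse")
    return model

# model = build_lstm()
# model.fit(X_train, y_train, epochs=100, batch_size=32, verbose=0)
```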
Adaptive Boosting
As an efficient ensemble learning model, Adaptive Boosting (AdaBoost) performs well in time series forecasting (Bai et al. 2021; Xiao et al. 2019) by combining a set of weak classifiers into a more powerful learner. The weak classifiers are trained sequentially, each on a reweighted version of the sample; the weights are determined by the errors of the previous round, and boosting thereby reduces both bias and variance in supervised learning. The best weak classifiers selected in each training iteration are combined into a strong classifier (Xiao et al. 2019). The AdaBoost algorithm is constructed as follows (Bai et al. 2021):
Step 1: Input the training dataset \(\left( {X,Y} \right)=\left\{ {\left( {{x_1},{y_1}} \right),\left( {{x_2},{y_2}} \right), \cdots ,\left( {{x_N},{y_N}} \right)} \right\}\), where X is the historical water demand data, Y is the current observed data and N is the length of the data; each \({x_i}\) is a column vector with d entries, \({x_i} \in \chi \subseteq {R^d}\).
Step 2: Initialize the weights, \({D_1}=\left( {{w_{11}},{w_{12}}, \cdots ,{w_{1N}}} \right)\)
\({w_{1i}}=1/N,i=1,2,...,N\) (7)
Step 3: Repeat the following for m = 1, 2, …, M to obtain M base learners; in this study, the base learners are LSTM models:
(1) The mth base learner \({G_m}\left( x \right):\chi \to \left\{ { - 1,+1} \right\}\) is obtained by training on the data weighted according to the sample weight distribution Dm.
(2) Calculate the classification error rate of \({G_m}\left( x \right)\) on the weighted training dataset:
$${e_m}=\sum\limits_{{i=1}}^{N} {P\left( {{G_m}\left( {{x_i}} \right) \ne {y_i}} \right)} =\sum\limits_{{i=1}}^{N} {{w_{mi}}} I\left( {{G_m}\left( {{x_i}} \right) \ne {y_i}} \right)$$ (8)
In the above equation, \(I\left( \cdot \right)\) is the indicator function.
(3) The coefficient of \({G_m}\left( x \right)\) (i.e., the weight of the base learner in the final combination) is calculated as:
$${\alpha _m}=\frac{1}{2}\log \frac{{1 - {e_m}}}{{{e_m}}}$$ (9)
(4) Update weights of the training samples:
$${D_{m+1}}=\left( {{w_{m+1,1}},{w_{m+1,2}},...,{w_{m+1,N}}} \right)$$ (10)

$${w_{m+1,i}}=\frac{{{w_{mi}}}}{{{Z_m}}}\exp \left( { - {\alpha _m}{y_i}{G_m}\left( {{x_i}} \right)} \right),\quad i=1,2,...,N$$ (11)
where \({Z_m}\) is the normalization factor, so that all the elements of \({D_{m+1}}\) sum to one.
$${Z_m}=\sum\limits_{{i=1}}^{N} {{w_{mi}}} \exp \left( { - {\alpha _m}{y_i}{G_m}\left( {{x_i}} \right)} \right)$$ (12)
(5) Construct the final linear combination of the base learners:
$$f\left( x \right)=\sum\limits_{{m=1}}^{M} {{\alpha _m}} {G_m}\left( x \right)$$ (13)
The final regression model is obtained as:
$$G\left( x \right)=\sum\limits_{{m=1}}^{M} {{\alpha _m}{G_m}\left( x \right)}$$ (14)
According to Eq. (9), \({\alpha _m} \geqslant 0\) when the error rate \({e_m} \leqslant 0.5\), and \({\alpha _m}\) increases as \({e_m}\) decreases; that is, the smaller the classification error rate, the larger the weight of the base learner in the final combination. In this way, AdaBoost adapts to the training error rate of each weak classifier.
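The weight-update mechanics of Step 3 fit in a few lines of Python. The sketch below implements Eqs. (8), (9), (11) and (12) for one boosting round, given a boolean mask of misclassified samples; the small clipping constant is a numerical safeguard added here, not part of the algorithm.

```python
import numpy as np

def adaboost_round(wrong, w):
    """One AdaBoost iteration. `wrong` is a boolean mask marking the
    samples misclassified by G_m; `w` holds the current weights D_m."""
    e_m = np.clip(np.sum(w[wrong]), 1e-10, 1 - 1e-10)  # Eq. (8)
    alpha_m = 0.5 * np.log((1.0 - e_m) / e_m)          # Eq. (9)
    # y_i * G_m(x_i) = +1 if correct, -1 if wrong
    agreement = np.where(wrong, -1.0, 1.0)
    w_new = w * np.exp(-alpha_m * agreement)           # Eq. (11), numerator
    w_new /= w_new.sum()                               # divide by Z_m, Eq. (12)
    return alpha_m, w_new

# N = 100
# w = np.full(N, 1.0 / N)   # initial weights D_1, Eq. (7)
```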
Ensemble AdaBoost-LSTM Deep Learning Model
The AdaBoost algorithm was originally designed for classification; consequently, to use it for water demand forecasting, it must be modified appropriately. In this study, we develop the AdaBoost-LSTM (Ada-LSTM) model by adjusting the sample weights according to whether a sample's forecast error exceeds a specified threshold, and LSTM deep learning models are used as the weak learners. Figure 2 shows the architecture of the proposed Ada-LSTM model for SWDF.
First, the decomposed series Tt and Rt are each divided into a training set (80%) and a testing set (20%). Second, the optimal structure of a single LSTM model is selected by analyzing the sensitivities of the main hyper-parameters, including the number of hidden layers, the number of neurons per hidden layer and the number of training epochs. In the experiments, BPTT (Back Propagation Through Time) is used to optimize the training process of the LSTM networks, and the value ranges of the hyper-parameters follow existing studies (Du et al. 2021; Bai et al. 2021). To reduce the number of parameters and ease the tuning procedure, every LSTM base learner shares the same hyper-parameter values, so only four hyper-parameters need to be set within the reference intervals. The loss function of each LSTM model is the mean squared error.

After the first LSTM is trained, the results of the first base learners \(G_{{_{1}}}^{T}\) and \(G_{{_{1}}}^{S}\) are automatically passed to the Trend and Seasonal AdaBoost models, respectively. The sample weights are then updated based on the error rate of the previous round and transferred to the second LSTM; the weights of samples that were not accurately predicted in the last round increase in the next round, so that performance on these samples improves. The forecast results of the second LSTM are likewise passed on within the ensemble. Following this rule, the training sets are processed by the LSTM models one by one to continually improve the forecasts of the hard samples. After multiple iterations, the outputs of the LSTM models are weighted and combined into the final strong learner according to Eqs. (9)-(14).
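A condensed sketch of this Ada-LSTM training loop follows. The relative-error threshold used to mark a sample as "wrong", the number of base learners M and the build_lstm factory (from the earlier Keras sketch) are illustrative assumptions consistent with the threshold-based modification described above, not the exact settings of this study.

```python
import numpy as np

def ada_lstm_fit(X, y, build_lstm, M=5, threshold=0.05, epochs=100):
    """Train M LSTM weak learners with AdaBoost-style reweighting.
    A sample is 'wrong' when its relative error exceeds `threshold`."""
    N = len(y)
    w = np.full(N, 1.0 / N)                      # Eq. (7)
    learners, alphas = [], []
    for m in range(M):
        model = build_lstm()
        # sample_weight makes the LSTM focus on poorly predicted points
        model.fit(X, y, sample_weight=w, epochs=epochs, verbose=0)
        pred = model.predict(X, verbose=0).ravel()
        wrong = np.abs(pred - y) / np.abs(y) > threshold
        e_m = np.clip(np.sum(w[wrong]), 1e-10, 1 - 1e-10)   # Eq. (8)
        alpha_m = 0.5 * np.log((1.0 - e_m) / e_m)           # Eq. (9)
        w = w * np.exp(-alpha_m * np.where(wrong, -1.0, 1.0))
        w /= w.sum()                                        # Eqs. (11)-(12)
        learners.append(model)
        alphas.append(alpha_m)
    return learners, np.asarray(alphas)

def ada_lstm_predict(learners, alphas, X):
    """Weighted combination of the weak learners, cf. Eqs. (13)-(14)."""
    preds = np.stack([m.predict(X, verbose=0).ravel() for m in learners])
    return (alphas / alphas.sum()) @ preds
```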
Different from the forecast procedure for the other two decomposed series, the residual series exhibits large fluctuations; therefore, before forecasting, the series is ranked and all peaks, identified through the hypothesis test on the difference of two consecutive slopes introduced by Bramante (2019), are placed in the training set together with the other historical data. Finally, the sum of the outputs of the Trend AdaBoost-LSTM, Seasonal AdaBoost-LSTM and Residual AdaBoost-LSTM models is the final forecast.
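Because the STL decomposition is additive, recombining the three component forecasts requires only a sum; in code, with hypothetical variable names for the three model outputs:

```python
# Additive recombination of the three AdaBoost-LSTM outputs
final_forecast = trend_forecast + seasonal_forecast + residual_forecast
```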