In the ANN model, there is one input layer, one hidden layer and one output layer. Three processing elements (neurons) are used in the input layer for LST, temperature and precipitation parameters. EVI data is used in the output layer. The number of neurons in the hidden layer is one of the most important factors affecting the learning success of the network. If the number of neurons in the hidden layer is less, learning will not be as successful. Even if the number of neurons in the hidden layer is more than it should be, it may decrease the performance of the network as it will cause the models to memorize the data (Çakır 2018). In the paper, number of neurons was gradually increased until the best number of hidden layer neurons was obtained.
In the model, 2005–2015 data (264 data) were used to train, 2016–2018 data (72 data) were used to test and validation. In other words, 60% of the data was used for training, 40% for validation and testing. The model performance was calculated by Mean Absolute Percent Error (MAPE), Mean Square Error (MSE), Root Mean Square Error (RMSE), and correlation coefficient (R) statistics.
In the NARX model created, the best result was obtained when 6 hidden layer neurons were selected in both stations. The model performance according to the number of selected hidden layer neurons is given in Table 3.
Table 3
Comparison of the MSE and R values of the number of neurons in the ANN system.
|
Model
|
ANN
|
Station
|
Hidden Neuron Number
|
3
|
6
|
9
|
12
|
|
|
MSE
|
R
|
MSE
|
R
|
MSE
|
R
|
MSE
|
R
|
Iznik
|
Training
|
3.8e-4
|
0.74
|
2.4e-4
|
0.82
|
3.5e-4
|
0.75
|
3.2e-4
|
0.75
|
Validation
|
2.5e-4
|
0.84
|
4.1e-4
|
0.78
|
3.3e-4
|
0.8
|
3.3e-4
|
0.79
|
Test
|
2.1e-4
|
0.74
|
4.7e-4
|
0.77
|
4.7e-4
|
0.72
|
3.6e-4
|
0.76
|
Output
|
3.2e-4
|
|
3.7e-4
|
|
2.8e-4
|
|
3.3e-4
|
|
Alanya
|
Training
|
2.5e-4
|
0.82
|
2.7e-4
|
0.84
|
2.4e-4
|
0.82
|
2.5e-4
|
0.83
|
Validation
|
3.2e-4
|
0.82
|
2.5e-4
|
0.83
|
1.9e-4
|
0.83
|
2.7e-4
|
0.83
|
Test
|
2.7e-4
|
0.75
|
3.2e-4
|
0.79
|
2.0e-4
|
0.81
|
3.2e-4
|
0.72
|
Output
|
4.3e-4
|
|
4.1e-4
|
|
2.1e-4
|
|
4.6e-4
|
|
The relationship between the target values of training, validation and test results is as in Table 3. The training values are 82% and %84 for Iznik and Alanya, respectively. In general, 70% and 80% R values are accepted for good training in ANN scans (Sağık et al. 2021). As a result of the training with the ANN, the estimated percentage of all data are as 80% and 82% for Iznik and Alanya, respectively.
The performance results of ANN model, which has the most successful performance according to the number of hidden layer neurons, according to the MAPE and RMSE performance indications are given in Table 4.
Table 4
ANN Model performance for Iznik and Alanya.
Station
|
Model
|
Training
|
Test
|
All
|
R
|
R2
|
R
|
R2
|
R
|
R2
|
MAPE
|
RMSE
|
MSE
|
Iznik
|
ANN
|
0.82
|
0.67
|
0.77
|
0.59
|
0.80
|
0.64
|
15.9
|
0.29
|
0.002
|
Alanya
|
ANN
|
0.84
|
0.70
|
0.79
|
0.62
|
0.82
|
0.67
|
5.1
|
0.02
|
0.001
|
As a result of ANN analysis, MAPE values were calculated to be 15.9% and 5.1% for Iznik and Alanya, respectively, and 0.29 and 0.02 for RMSE. Furthermore, the MSE values were 0,002 and 0,001 for Iznik and Alanya, respectively. The low RMSE and MAPE values indicate that the proposed model is successful, in other words, the proposed model results are successful of both stations.
The literature review shows that the success ratio of model based on hybrid models by preprocessing of the data with ANN increases (Partal and Kişi 2007; Zhang et al. 2014; Rout et al. 2014). In this hybrid model, LST, temperature and precipitation data were divided into 8 sub-components with Discrete Wavelet Transform (DWT). The data was analyzed by using different wavelets and, the best model performance was obtained by using the 'd4' wavelet. This step is of critical importance as the selection of the appropriate main wavelet and the determination of the appropriate level of decomposition according to the characteristics of the data affect the performance analysis (Emhan 2013). The attributes of the outputs were obtained with the statistical operations and reanalyzed as NARX ANN input parameters. In this study, the sum of the components with a correlation coefficient greater than 0.1 was used as the input parameter of W-ANN to define the subcomponents. The W-ANN model performance according to the number of selected hidden layer neurons is given in Table 5.
Table 5
Comparison of the MSE and R values of the number of neurons in the W-ANN system.
|
Model
|
W-ANN
|
Station
|
Hidden Neuron Number
|
3
|
6
|
9
|
12
|
|
|
MSE
|
R
|
MSE
|
R
|
MSE
|
R
|
MSE
|
R
|
Iznik
|
Training
|
3.7e-4
|
0.77
|
4.2e-4
|
0.96
|
2.08e-4
|
0.95
|
2.3e-4
|
0.92
|
Validation
|
2.1e-4
|
0.82
|
2.7e-4
|
0.94
|
2.8e-4
|
0.91
|
2.3e-4
|
0.93
|
Test
|
2.7e-4
|
0.76
|
1.8e-4
|
0.94
|
4.4e-4
|
0.94
|
2.4e-4
|
0.91
|
Output
|
2.8e-4
|
|
2.9e-4
|
|
3.06e-4
|
|
2.3e-4
|
|
Alanya
|
Training
|
2.6e-4
|
0.81
|
2.7e-4
|
0.81
|
1.9e-4
|
0.91
|
2.2e-4
|
0.85
|
Validation
|
3.3e-4
|
0.84
|
2.1e-4
|
0.86
|
2.3e-4
|
0.85
|
2.3e-4
|
0.86
|
Test
|
2.6e-4
|
0.85
|
2.4e-4
|
0.83
|
3.5e-4
|
0.81
|
2.6e-4
|
0.82
|
Output
|
2.8e-4
|
|
2.5e-4
|
|
2.4e-4
|
|
2.6e-4
|
|
The best result was obtained when the number of hidden layer neurons was 6 for Iznik and 9 for Alanya. As a result of the model developed by using W-ANN, the training values are 96% and %91 and the estimated percentage of all data increase to 95% and 88% for Iznik and Alanya, respectively. With the hybrid model developed, it was determined that the network provided an 18.7% and 7.3% increase in learning success in Iznik and Alanya, respectively. The performance evaluation of the network developed using the W-ANN model is given in Table 6.
Table 6
W-ANN Model performance in Iznik and Alanya.
Station
|
Model
|
Training
|
Test
|
All
|
R
|
R2
|
R
|
R2
|
R
|
R2
|
MAPE
|
RMSE
|
MSE
|
Iznik
|
W-ANN
|
0.96
|
0.92
|
0.94
|
0.88
|
0.95
|
0.90
|
5.4
|
0.02
|
0.001
|
Alanya
|
W-ANN
|
0.91
|
0.80
|
0.85
|
0.72
|
0.88
|
0.77
|
2
|
0.02
|
0
|
The EVI results made with both models are given in Fig. 5. As seen in Fig. 5, the average value of the 2018 EVI was calculated as 0.28 and 0.30 in Iznik and Alanya, respectively. On the other hand, according to the W-ANN model results, the average EVI for 2030 is expected to be 0.22 and 0.28 in Iznik and Alanya, respectively. EVI value for the Iznik W-ANN model results will decrease by 21.4% until 2030 with a 5.4% error probability compared to 2018 (Fig. 5a). Similarly, with a 2% error probability in Alanya, the EVI value will decrease by 6.6% until 2030 compared to 2018 (Fig. 5b).
It is also important to explore the relationship between NDBI, LST and EVI to better understand the impact of the results on urbanization (Mathew et al., 2015; Malik, 2019) (Fig. 6). According to the EVI and NBDI data observed from 2014 to 2018, it was determined that there was a strong negative relationship between the variables (R = 0.87 and R = 0.98, alpha = 0.01 in Iznik and Alanya, respectively). When the data between 2005 and 2018 were examined, it was determined that a moderately negative correlation between EVI and daily average air temperature (R = 0.63 and R = 0.57, alpha = 0.01 in Iznik and Alanya, respectively) was slightly stronger than that between EVI and LST (R = 0.54 and R = 0.55, alpha = 0.01 in Iznik and Alanya, respectively). EVI was shown a strong positive relationship with precipitation during the whole period (R = 0.88 and R = 0.80, alpha = 0.01 in Iznik and Alanya, respectively) (Fig. 7).
As seen in Fig. 6, the strong relationships between these parameters can be a very good indicator for the future urbanization rates of the selected regions.
As a result of the statistical research, it was determined that relationship between EVI and LST is statistically a moderately downhill relationship (R = 0.54 and R = 0.55, alpha = 0.01 in Iznik and Alanya, respectively). As the density of vegetation biomass on the land surface increases, a decrease in land surface temperature is observed since the evaporation it causes is greater. For this reason, it causes cooling because the amount of energy lost is high in areas where the vegetation is high and dense. The weakening of the negative relationship between LST and vegetation biomass density shows the effect of continuous expansion of impermeable surfaces on increasing land surface temperature. According to this situation, it can be predicted that the land surface temperatures are high in areas where the vegetation biomass density is low and the land surface temperatures will increase as the vegetation biomass density decreases. The relation between EVI and air temperature is a moderately negative (R = 0.63 and R = 0.57, alpha = 0.01 in Iznik and Alanya, respectively). Numerous studies have found that the air temperature is lower in green spaces than in cities. It is anticipated that a rise in air temperature will result in a drop in plant density.