2.1 Reliability Verification of UAV Hyperspectral Imaging Data
In order to verify the reliability of UAV hyperspectral imaging data, the winter wheat hyperspectral data collected by ASD were resampled into UHD185 bands to calculate the average reflectance for each plot. Correlation analysis showed that the spectral reflectance of UHD185 and ASD are in high consistence at spectral bands between 458 and 830 nm, overlapping in both green peak position and red edge region (Fig. 4a). At spectral bands between 830 and 950 nm, the spectral reflectance of UHD185 gradually decreased, while that of ASD deviated from UHD185 and exhibited a flat curve (Fig. 4a). This may be due to relatively high noise between 830 and 950 nm since this range is close to the edge of UHD185 sensor spectrum. We compared spectral reflectance of UHD185 and ASD at 458–830 nm and found a high correlation (R2 > 0.99) between them (Fig. 4b). These results indicate that the spectral reflectance of UHD185 is reliable between 458 and 830 nm (3–96 wavebands) and can be used to estimate winter wheat LAI.
2.2 Correlation between Winter Wheat LAI and UAV Hyperspectral Imaging Data
Comparisons were made between the winter wheat LAI and raw UHD185 spectral reflectance (458–830 nm), and between LAI and FD-transformed UHD185 spectral reflectance (458–830 nm). Between LAI and UHD185 spectral reflectance, the maximum negative and positive correlation coefficients were at 654 nm(r = -0.80) and 802 nm (r = 0.49), respectively (Fig. 5). Between LAI and FD-transformed spectral reflectance, the maximum negative and positive correlation coefficient were at 546 nm (r = -0.74) and 774 nm (r = 0.83), respectively. Spectral bands with absolute correlation greater than 0.6 are 498–506 nm, 542 nm, 546 nm, 738–786 nm, and 830 nm (Fig. 5).
2.3 Selection of Characteristic Bands related to Winter Wheat LAI from UAV Hyperspectral Reflectance Data
Four methods including FD, SPA, CARS, and CARS_SPA were used to select characteristic bands related to wheat LAI. The FD correlation analysis demonstrated that spectral bands of 498–506 nm, 542 nm, 546 nm, 738–786 nm, and 830 nm were in high correlation with winter wheat LAI (correlation coefficients > 0.6). We selected bands corresponding to the maximum value at the inflection point, and discarded multiple highly correlated bands that are close to the inflection point. In this way, spectral bands at 506, 546, 774, and 830 nm were selected by FD, which account for 4.25% of the total variables.
During the process of SPA algorithm, the minimal and maximal numbers of characteristic bands extracted were set as 5 and 94, respectively. At the minimum RMSE of 1.049, a total of 28 optimal characteristic bands (458, 466, 474, 482, 498, 502, 506, 510, 518, 526, 530, 534, 542, 546, 558, 566, 570, 574, 610, 626, 650, 658, 686, 698, 710, 762, 814, and 830 nm), accounting for 29.8% of the total variables, were selected (Fig. 6).
The number of characteristic bands gradually decreased with the increase in CARS iteration (Fig. 7a). With the increase in sampling time, the 10-fold RMSE cross validation first slightly decreased, then increased significantly at the CARS iteration of 47 (Fig. 7b). This result indicates that certain key information was lost after performing 47 iterations of CARS, resulting to a poor performance of the model. At the iteration of 24, RMSE reached its minimum of 0.9674. Thirteen variables (566, 586, 602, 610, 634, 682, 698, 710, 730, 734, 790, 802, and 814 nm) accounting for 13.8% of the total variables were therefore selected (Fig. 7c).
The 28 characteristic bands selected using SPA algorithm may contain noise due to the complexity in SPA calculation process CARS algorithm can effectively remove the variables with small weight, and effectively select variables closely related to LAI. Therefore, the 28 characteristic bands selected by SPA algorithm were filtered by CARS algorithm to obtain the optimal combination of characteristic bands. The result showed the minimum RMSE (0.9718) appeared at the CARS iterations of 15 (Fig. 8). Therefore, nine variables (466, 474, 518, 526, 610, 658, 710, 814, and 830 nm) accounting for 9.57% of the total variables were selected.
We compared the LAI characteristic bands selected by the four algorithms, the distributions of characteristic bands selected by different algorithms were in consistent to a large extent, yet differences were also observed (Fig. 9).
2.4 Construction of Winter Wheat LAI Estimation Model by Different Modeling Methods
Based on characteristic bands of wheat LAI selected by using different variable selection algorithms and full spectrum information, three modeling methods including PLSR, SVR, and Xgboost were employed to construct LAI estimation models. Independent samples including calibration and validation sets were used to test these models. Table 2 summarizes the results for the models obtained by different modeling methods.
Table 2
Regression analysis of characteristic bands and winter wheat LAI
Modeling method
|
Variable
extraction
|
Wavelengths
numbers
|
Calibration
|
Validation
|
R²
|
RMSE
|
RPD
|
R²
|
RMSE
|
RPD
|
PLSR
|
Full_spectrum
|
94
|
0.79
|
0.81
|
2.15
|
0.78
|
0.79
|
2.06
|
FD
|
4
|
0.81
|
0.77
|
2.26
|
0.81
|
0.75
|
2.15
|
SPA
|
28
|
0.79
|
0.81
|
2.14
|
0.78
|
0.81
|
2.01
|
CARS
|
13
|
0.80
|
0.79
|
2.22
|
0.81
|
0.78
|
2.08
|
CARS_SPA
|
9
|
0.83
|
0.73
|
2.39
|
0.83
|
0.74
|
2.19
|
SVR
|
Full_spectrum
|
94
|
0.80
|
0.79
|
2.20
|
0.77
|
0.79
|
2.06
|
FD
|
4
|
0.80
|
0.78
|
2.22
|
0.80
|
0.72
|
2.24
|
SPA
|
28
|
0.81
|
0.75
|
2.31
|
0.79
|
0.76
|
2.14
|
CARS
|
13
|
0.79
|
0.79
|
2.20
|
0.77
|
0.79
|
2.04
|
CARS_SPA
|
9
|
0.82
|
0.75
|
2.31
|
0.84
|
0.65
|
2.75
|
Xgboost
|
Full_spectrum
|
94
|
0.93
|
0.50
|
3.48
|
0.80
|
0.79
|
2.04
|
FD
|
4
|
0.88
|
0.65
|
2.69
|
0.81
|
0.75
|
2.17
|
SPA
|
28
|
0.84
|
0.73
|
2.37
|
0.76
|
0.82
|
1.96
|
CARS
|
13
|
0.82
|
0.82
|
2.12
|
0.82
|
0.84
|
1.93
|
CARS_SPA
|
9
|
0.89
|
0.63
|
2.51
|
0.89
|
0.55
|
2.92
|
A reliable PLSR model was constructed using 9 characteristic bands selected by CARS_SPA algorithm as input. In this model, similar results of model evaluation indices (R², RMSE, and RPD) were obtained for the calibration and validation sets. The R², RMSE, and RPD of the calibration set were 0.83, 0.73, and 2.39, respectively, and those of the validation set were 0.83, 0.74, and 2.19, respectively. The PLSR model constructed based on 28 characteristic bands selected by SPA algorithm showed a poor performance. The R², RMSE, and RPD of the calibration set were 0.79, 0.81, and 2.14, respectively, and those of the validation set were 0.78, 0.81, and 12.01, respectively.
The SVR method based on different combinations of characteristic bands yield similar results of evaluation indices. Among these SVR models, the one using 9 characteristic bands selected by CARS_SPA algorithm as input showed the best performance. The R², RMSE, and RPD of the calibration set were 0.82, 0.75, and 2.31, and those of the validation set were 0.84, 0.65, and 2.75.
Further analysis was performed on the model constructed by using Xgboost method. The Xgboost model based on the 28 characteristic bands selected by SPA algorithm showed a poor performance. The R², RMSE, and RPD of the calibration set were 0.84, 0.73, and 2.37, respectively, and those of the validation set were 0.76, 0.82, and 1.96, respectively. The Xgboost model showed the best performance when using 9 characteristic bands selected by CARS_SPA algorithm. The R², RMSE, and RPD of the calibration set were 0.89, 0.63, and 2.51, respectively, and those of the validation set were 0.89, 0.55, and 2.92, respectively.
In summary, among all characteristic bands combinations, the one containing 9 characteristic bands selected by CARS_SPA algorithm outperformed with either of the ML modeling methods, followed by models constructed using the four characteristic bands selected by FD algorithm. This may be due to the fact that these 9 characteristic bands selected by CARS_SPA algorithm are uniformly distributed within the spectral range of 458–830 nm, which thus well maintain the spectral information of the reflectance corresponding to LAI inversion. The three LAI estimation models constructed based on characteristic bands selected by CARS_SPA were superior to LAI models constructed based on the full spectrum. However, the accuracies of LAI models constructed based on selected characteristic band were different from those of the models constructed based on full spectrum information. These results demonstrated that characteristic bands extraction could greatly reduce the number of variables used for modeling, thus reduces the modeling complexity which improves the modeling efficiency while ensures its accuracy. Comparing the three ML modeling methods, the Xgboost models performed the best, followed by PLSR and SVR. The calibration and validation results of the best-performed model (LAI_CARS_SPA_Xgboost) are shown in Fig. 10.