The methodology aims to create an automatic process that uses remote sensing data to generate a complete, objective fuel map that can be used with wildfire prevention simulation software. The Rothermel fuel models were selected for this purpose since they are the most commonly used in simulation software (Simons et al., 2013). However, since the Rothermel Models are qualitative and were developed for a climate that is quite different from that of the study area, the correspondence between the vegetation structure and type and the Rothermel Models cannot be derived systematically using remote sensing. The fuels photo-guide was used for this purpose. It includes the most common vegetation types in Galicia, and they are characterized according to qualitative and quantitative parameters which could potentially be estimated for the study area using remote sensing data. Additionally, the guide provides wildfire behavior data for each model, which facilitates the identification of correspondences between the fuels photo-guide and the Rothermel fuel models. This process allows for the objective and systematic mapping of Rothermel fuel models using remote sensing data from the study area.
The methodology consists of five main steps: 1) identification of the vegetation types in the study area using Sentinel-2 images; 2) characterization of the vegetation height (Canopy Height Model, CHM) and structure (LiDAR metrics) for the study area using ALS data; 3) establishing correspondences between the identified vegetation types and structures and the fuel models from the fuels photo-guide; 4) establishing the correspondences between the fuels photo-guide models and Rothermel fuel models; 5) mapping of Rothermel fuel models. An overview of the method is shown in Fig. 2.
4.1.- Land Cover Mapping
According to the fuels photo-guide, the principle variable that determines the assignment of a fuel type to a plot or forest stand is its dominant species. Sentinel-2 images were used to obtain a land cover map of the study area specifying the principle species and groups of species. The target classes were defined by considering the vegetation in the study area, the aim of the study, and the authors’ previous experience in multispectral classification. These classes are detailed in Table 4. The map was created following the methodology described in Alonso et al. (2021). First, the images were downloaded and pre-processed to mask out the cloudy pixels from each image. This step was performed by applying the cloud mask provided by the Sentinel-2 Level-2A product.
Table 4
Legend of the land use map created using Sentinel-2 images.
Class | Description |
Eucalyptus sp. | Adult individuals and young masses of Eucalyptus globulus. |
Conifers | Adult individuals and young masses of conifers (mainly Pinus pinaster, Pinus radiata). |
Broadleaves | Adult individuals and young masses of hardwoods (mainly Quercus robur and riparian species). |
Crops and pastures | Crops or land covered by non-woody vegetation. |
Shrubs | Areas of shrubs (mainly Ulex and Cytisus genera and ericaceae family). |
Bare soil | Land covered by rocks and very small shrubs or non-anthropogenic, non-vegetated areas. |
Anthropic | Built-up areas. |
Water | Accumulations of water, both marine and continental. |
Next, training data were collected through the photointerpretation of 2017 and 2020 PNOA images in order to perform a supervised classification. Polygons for each target class were manually delineated over the entire study region. Then, a supervised classification was performed for each image in the time series, which consisted of 12 images from 2019, one per month. The supervised classifications were performed using the random forest algorithm (Breiman, 2001). This step was carried out using the “randomForest” R package (Liaw and Wiener, 2002) with the default configuration parameters. The Sentinel-2 bands used in this step were the B02, B03, B04, B05, B06, B07, B08, B8A, B11, and B12 bands. All bands were resampled at 5 m using nearest neighborhood interpolation. Once single-date classifications were obtained, they were aggregated using the plurality voting decision criteria (Alonso et al., 2021; Lewiński et al., 2017). These decision criteria identify the class that most frequently occurs for each pixel throughout the time series and assigns that class to the pixel (Lewiński et al., 2017).
Finally, the map obtained was cross-verified. To do this, a random stratified set of points was distributed throughout the study area. The ground truth of these points was obtained through the photointerpretation of 2017 and 2020 PNOA images. Sentinel-2 images were used to provide supporting information, to address which PNOA image should ultimately be used to assign the land cover class. For example, if the 2017 PNOA image shows an area covered by conifer trees and the 2020 PNOA image shows that that same area is now covered by shrubs, Sentinel-2 images were used to confirm when the harvesting event took place, allowing us to determine if the land cover class for the year 2019 should be "Shrubs" or "Conifers". A confusion matrix was then constructed and accuracy metrics were calculated. The accuracy metrics calculated are as follows:
-
Overall Accuracy (OA): This value represents the proportion of correctly classified reference points. It is calculated by dividing the total number of correctly classified points by the number of reference points.
-
Producer’s Accuracy (PA): This value indicates how often real-world features are correctly represented on a classified map, or how likely it is that a given land cover of an area is correctly classified. It is the map accuracy. It can be computed for each category by dividing the number of correctly classified reference points for that category by the total number of reference points.
-
User’s Accuracy (UA): This metric represents the probability that the ground truth of a categorized pixel genuinely corresponds to that category. It represents the reliability of the map. It is calculated by dividing the total number of correctly categorized pixels in each category by the overall number of pixels classified in that category.
-
Kappa Index: This value compares the computed accuracy of the classification with the accuracy that would be computed randomly. It is determined by subtracting the accuracy that would be obtained through a random classification from the difference between the total accuracy (OA), divided by one minus the accuracy that would be obtained through a random classification.
4.2.- Structural Characterization Of Vegetation
According to the fuels photo-guide, the assignment of a fuel to a certain area is determined, not only by the species present in that area, but also by the height and structure of the vegetation. LiDAR data were used to characterize the vegetation height in the study area and to generate an approximation of the forest structure. The process consisted of two main steps: the first one, which includes noise removal, height noise removal, classification, and normalization, is oriented at preparing the data. The second step is the data processing; it includes canopy height modeling, computation of statistics and rasterization.
4.2.1.- Lidar Data Pre-processing
The first step of the LiDAR data pre-processing is noise removal, that is, the elimination of isolated points. This was performed using the “lasnoise” tool in the LAStools software (rapidlasso GmbH, 2014). Afterwards, the LiDAR data were classified in two classes: ground points and non-ground points. This was done with the “lasground” tool in the LAStools software (rapidlasso GmbH, 2014). The tool settings were adjusted to obtain an optimized performance. This step was needed for further normalization of the point cloud. The normalization was performed in order to transform the z coordinates. These coordinates, which originally represented the altitude of each point above the Earth’s reference surface (according to the official vertical Datum); after normalization, represent the height of each point above the ground. The normalization algorithm applied consisted of a nearest neighbourhood (KNN) interpolation with an inverse distance weighting (IDW). It was implemented using the "lidR" package in the R software (Roussel and Auty, 2021; Roussel et al., 2020).
The final pre-processing step was the vegetation height segmentation. Ground points and aerial noise were removed so that the data processing would focus solely on the vegetation points. For this step, all points below 0.5 m and above 60 m were removed from the point cloud.
4.2.2.- Lidar Data Processing
In order to estimate vegetation height and obtain information about the vertical structure of stands, several statistics were obtained from the normalized point cloud. Each was represented as a raster layer to facilitate its individual analysis, its interpretation using the reference images and its comparison with the other statistics.
First, a canopy height model (CHM) was obtained. The CHM is a raster layer where the digital value of each pixel corresponds with the height of the vegetation at that pixel. The vegetation height is estimated as the maximum value of the z coordinate, selected from among all of the points within the vertical prism defined by the pixel’s contour. These values were obtained using the “LidR” package available in the R software (Roussel and Auty, 2021; Roussel et al., 2020). The CHM was produced using a 5 m pixel size.
Additionally, a set of 36 statistics were calculated for the point clouds. Each of the statistics was computed using the normalized values of the z coordinates of the points; meaning that they ought to provide an approximation of the vertical structure of the point cloud. They were obtained and rasterized using the “LidR” package available in the R software (Roussel and Auty, 2021; Roussel et al., 2020). A raster layer with a spatial resolution of 5 meters was obtained for each metric. The complete list of metrics is detailed in Table 5. A detailed explanation of the metric “Cumulative percentage of returns in the Xth layer” can be found in Woods et al. (2008). In order to obtain these layers, the maximum height of the LiDAR data was divided into 10 intervals. The cumulative percentage of LiDAR returns was calculated for each interval. The final interval is excluded as the cumulative percentage of points in this interval is always equal to 1. Further information on the rest of the metrics can be found in Roussel et al. (2021).
Table 5
Metrics obtained from the normalized LiDAR point clouds.
ID | Metric | ID | Metric | ID | Metric | ID | Metric |
B1 | Maximum | B10 | 10th Percentile | B19 | 55th Percentile | B28 | Cumulative percentage of returns in the 1st layer |
B2 | Mean | B11 | 15th Percentile | B20 | 60th Percentile | B29 | Cumulative percentage of returns in the 2nd layer |
B3 | Standard deviation | B12 | 20th Percentile | B21 | 65th Percentile | B30 | Cumulative percentage of returns in the 3rd layer |
B4 | Skewness | B13 | 25th Percentile | B22 | 70th Percentile | B31 | Cumulative percentage of returns in the 4th layer |
B5 | Kurtosis | B14 | 30th Percentile | B23 | 75th Percentile | B32 | Cumulative percentage of returns in the 5th layer |
B6 | Entropy | B15 | 35th Percentile | B24 | 80th Percentile | B33 | Cumulative percentage of returns in the 6th layer |
B7 | Percentage of returns above the mean | B16 | 40th Percentile | B25 | 85th Percentile | B34 | Cumulative percentage of returns in the 7th layer |
B8 | Percentage of returns above 2 | B17 | 45th Percentile | B26 | 90th Percentile | B35 | Cumulative percentage of returns in the 8th layer |
B9 | 5th Percentile | B18 | 50th Percentile | B27 | 95th Percentile | B36 | Cumulative percentage of returns in the 9th layer |
Finally, horizontal sections were obtained to evaluate the number of points at different levels and the vertical continuity of the vegetation. Specifically, 7 vertical sections were considered. The height thresholds of the sections are summarized in Table 6. It should be noted that the first threshold, 0.5m, was fixed in order to discard points at ground level. The last threshold, 12 m, was established since points above that value correspond to high stand canopy. These statistics were transformed into raster layers with a spatial resolution of 5 m. In these layers the digital value of each pixel corresponds to the number of points within the prism defined by the cell contour and the minimum and maximum z values for the section in question. The rasters were obtained by using the “counter” switch that is available in the “lasgrid” tool in LAStools (rapidlasso GmbH, 2014).
Table 6
Height thresholds of the sections in which the point clouds were divided to evaluate the vertical continuity of the vegetation.
Section 1 | Section 2 | Section 3 | Section 4 | Section 5 | Section 6 | Section 7 |
0.5–2 m | 2–4 m | 4–6 m | 6–8 m | 8–10 m | 10–12 m | > 12 m |
4.3.- Definition Of Correspondence Between Remote Sensing Data And Fuel Models
The land cover map, the CHM, and the analysis of the vertical structure of the vegetation allowed for the assignation, at the pixel level, of the corresponding Lourizán CIF fuel models for the entire study area. These fuel models are specified in Table 7. An extra fuel model, fuel model 0, was added to assign to areas that do not correspond with any particular fuel model, typically built-up areas and bodies of water. Then, the fire behavior graphics of the identified Lourizán-CIF fuels were compared to the nomograms established by Rothermel; in this way, for the models present in the study area, the proper correspondences between the fuels photo-guide and the Rothermel fuel models were established. These correspondences are also specified in Table 7. All of the defined correspondences were verified through field work and visual inspection by an expert in fuel model evaluation.
Table 7
Fuel models correspondences between the fuels photo-guide, the Rothermel fuel models and the remote sensing classes.
Remote sensing classes | Lourizán CIF fuels photo-guide | Rothermel model |
Bare soil | - | 1 |
Crops and pastures | Pl-01, Pl-02, Pa-01, Pa-04 | 2 |
Shrubs greater than 1 m in height | Ea-05, Es-01, Pt-05, Pt-06, Pt-07, Ue-05, Ue-08, Ue-10, Ub-02, Ub-04, Cs-01, Cs-03 | 4 |
Shrubs less than 1 m in height | Eu-01, Eu-05, Eu-06, Pt-01, Ue-01, Ue-02, Ub-03 | 5 |
Forest areas with continuous vertical structure | PpMB-02, PpL-04, PpL-09, EgL-08, QrL-04 | 7 |
Forest areas with discontinuous vertical structure | PpF-03, PpF-05, PpF-06, EgL-05, EgL-06, EgF-03, EgF-05, QrF-02, QrF-04, BaL-01 | 10 |
Litter and other forestry remnants | - | 12 |
Anthropogenic areas, bodies of water | - | 0 |
The “bare soil” class from the land cover map was assigned to Rothermel fuel model 1. According to the Rothermel descriptions, fuel model 1 is driven by fine, dry, low grasslands with a light shrub load. Bare soils correspond to areas mainly covered by rocks, but they usually include small patches of scattered grasslands and shrubs. Therefore, as a principle of caution, bare soils were also included under fuel model 1. This particular landscape was not identified in the fuels photo-guide, therefore, this correspondence could not be cross-referenced.
Crops and pasture classes were assigned to Rothermel fuel model 2, since, according to the fuels photo-guide, the typical pastures species in the study area present a fire behavior typical of this model.
Shrubs were assigned to Rothermel fuel models 4 and 5 since the wildfire behavior associated with these classes was found to be similar to the behavior associated to shrubs in the fuels photo-guide. The height threshold for differentiating between the two models, following Rothermel’s indications, was 1 m.
Tree covered vegetation classes included in the land cover map were found to correspond to Rothermel fuel models 7 and 10. Rothermel fuel model 7 is associated to very flammable shrub species with heights of less than 2 meters or pine stands such as those found in the state of Florida. However, according to the fuels photo-guide, many tree stands in the study area can develop a fire behavior similar to the behavior described for this Rothermel fuel model. The main characteristic of these stands is that they present vertical continuity, therefore fuel model 7 was assigned to areas covered by trees that presented a continuous vertical structure. On the other hand, areas covered by trees with a discontinuous vertical structure were associated with Rothermel fuel model 10. A stand is considered discontinuous when the base height of the crown is 2 to 3 times the height of the understory.
Fuel models 8 and 9 have not been classified in this study since the difference between one and the other is the compaction of the mantle, a characteristic that can hardly be differentiated using the sentinel images and the LiDAR data implemented in this study. Areas that would correspond to those models were assigned to fuel model 10, as a principle of caution.
Finally, areas potentially covered by significant amounts of slash were assigned to Rothermel fuel model 12. Rothermel fuel model 11 was rare or found only in areas that were not large enough to study; i.e., within the minimum unit analyzed (a 5 m pixel).
4.4. Mapping Of Rothermel Fuel Models
Once the correspondences between the remote sensing data and the fuel models were defined, the next step was the generation of a map for each Rothermel fuel model. Different models involved different procedures, which are described below. The final step was the integration of all of the models into a single Rothermel fuel models map.
a) Models 0, 1 and 2
These three fuel models were obtained by establishing a direct relationship between the model and some of the classes from the land cover map. Specifically, all pixels classified as anthropogenic areas and bodies of water in the land cover map were assigned to fuel model 0. Bare soil pixels were assigned to fuel model 1 and Crops and pastures were assigned to fuel model 2.
b) Models 4 and 5
These models were identified through the land cover map and the LiDAR-derived CHM. Models 4 and 5 corresponded with shrubs: model 4 with tall shrubs and model 5 with short shrubs. Therefore, all the pixels covered by shrubs in the land cover map were classified as model 4 or 5. Furthermore, shrub pixels with a CHM value of greater or equal to 1 m were reclassified as fuel model 4. Shrub pixels with a CHM of less than 1 m were reclassified as fuel model 5. This procedure was performed using the “raster” R package (Hijmans, 2021).
c) Models 7, 10 and 12
Models 7 and 10 corresponded with areas covered by trees, with and without a continuous vertical structure, respectively. First, all of the pixels covered by trees (Eucalyptus sp., conifers and broadleaves) on the land cover map were selected.
Once the tree-covered areas of the study area were identified, the vertical continuity of the vegetation in these zones was evaluated. LiDAR metrics were needed for this purpose. Since the vertical structure of vegetation is a complex parameter, it cannot be identified by simply establishing threshold values, as was possible in previous cases. The automatic identification of Rothermel fuel models 7 and 10 was performed through supervised classification. Training and verification areas were defined for this purpose. A set of 90 regular polygons, 25 m x 25 m in size, was designed and distributed over the previously identified tree-covered areas. Half of the polygons represented model 7, and the other half, represented model 10. The identification of fuel models in these polygons was carried out by visually analyzing vertical sections of the point clouds for each training set; the 7 horizontal raster layers containing the total number of points per pixel, obtained as described in section 4.2.2, were used for support in this process. Pixels where all of the 7 raster layers had a high number of points were considered to correspond to stands with vertical continuity, and therefore assigned to fuel model 7. Conversely, pixels where the upper and lower layers presented a much greater number of points than the intermediate layers were considered to correspond to stands with vertical discontinuity, and therefore assigned to fuel model 10. The 90 regular polygons were randomly divided into training and verification data. A total of 60 polygons were used as training areas, encompassing a total of 1500 pixels.
Once the training data were obtained, the supervised classification was performed. The supervised classification was performed using the machine learning algorithm Random Forest (Breiman, 2001); it was applied using the “randomForest” package in R (Liaw and Wiener, 2002) with the pre-defined configuration parameters. The predictor variables, used in the implementation of the Random Forest predictive model, were the 36 metric rasters obtained from the LiDAR point cloud (see section 4.2.2). As a result, the random forest algorithm creates a model that can be applied to each individual pixel in the study area to predict its corresponding class, or in this case, fuel model. Afterwards, the model was applied to the selected pixels (pixels covered by a tree-related class), resulting in a fuel model map with fuel models 7 and 10. The different variables’ importance, based on their Mean decrease in Gini, was also calculated (Breiman and Cutler, 2004) in order to find out which variables were the most vital in predicting the fuel model.
Finally, a cross-verification was performed. For this, the verification dataset comprised of 30 regular polygons was used. A confusion matrix was then obtained and accuracy metrics were calculated. The same accuracy metrics were calculated as the ones presented in Section 4.2.2.
Fuel model 12 is associated with recently harvested areas. All pixels with tree land cover (either Eucalyptus sp., Conifers or Broadleaves) on the land use map but that have no data in the LiDAR metrics layers were assigned to this model. As explained previously, the land cover map is obtained by assigning the most frequent land cover throughout the year in question (in this case, 2019); therefore, harvested areas are classified as a tree-related cover. However, since the LiDAR point-cloud was obtained in November of 2019, areas where harvest events had taken place prior to the flight do not present enough returns to calculate the statistics due to the absence of trees or any other vertical structure.