Study area
The Czech Republic has a mild four-season climate that is between the oceanic and continental. The country` climate is characterized by the prevailing western winds, intense cyclonic activity, and relatively high precipitation. The average daily temperature ranges from about 3°C in January to 17°C in July, and the average annual rainfall ranges from 600 to 800 mm in most of the country (Tolasz et al. 2007). The altitude ranges from 115 to 1,603 m a.s.l. with a median of 430 m a.s.l. The predominant type of relief is hills and highlands. The average population density is 133 people.km− 2 (Pokorná et al. 2019).
In the absence of humans, mixed beech forests would dominate in most of the Czech Republic, with oak-dominated deciduous forests in the lowlands and coniferous forests with spruce at higher altitudes (Chytrý 2012). Because of the intensive forest management performed since the 19th century, the current forest composition differs significantly from the natural state. Forests currently cover 33.9% of the territory and consist predominantly of Picea abies (52%), Pinus sylvestris (17%), Fagus sylvatica (7%), Quercus spp. (7%), Larix decidua (4%), Betula pendula (3%), and Abies alba (1%). Other deciduous species (e.g., Carpinus betulus, Acer spp., Fraxinus spp., Populus spp., Salix spp., and Tilia spp.) occupy about 8% of the forested area (Ústav pro hospodářskou úpravu lesů 2019).
Forest Fire Database
Detailed information on FFs were obtained from the Fire Rescue System of the Czech Republic (Hasičský záchranný sbor České republiky 2019), which keeps statistics on all fires in the country that required intervention. The database covers the whole area of the Czech Republic in the period 2006–2015 (except for military areas which represent < 5% of the area). From 2006 to 2010, the database indicated the districts where FFs occurred. The exact geographical locations were not indicated until after 2010.
Because of problems with the clear identification of some FFs, all records (approx. 10,000) were manually checked for misclassification; based on the descriptive information provided for each fire, the fire was excluded or included in the final database (FFD) (Berčák et al. 2018; Holuša et al. 2018). Also, FFs that occurred between January-March and November-December were not included in the final FFD (around 100 FFs in total), because these FFs were mostly associated with events such as accidental fuel leaks or fireworks. In addition, due to the minimal forest area, FFs from the administrative district of the capital city (Prague with more than 1,500,000 inhabitants) were also removed from the analysis.
In total, the FFD contains data on 7,105 FFs, with an average of 710 FFs per year over the reporting period.
The Dependent Variable
As the dependent variable, we used the numbers of FFs in individual municipality districts (n = 205) (State Administration of Land Surveying and Cadastre 2019) rather that than accurate spatial coordinates of the FFs for the following reasons: (i) data for the period 2006–2010 indicate FF occurrence in districts but without coordinates within districts; and (ii) socio-economic data (population, tourism, etc.) are only available for districts; and (iii) burned area is not very useful variable for understanding FF characteristics (Mohammadi et al. 2021). FFs in the Czech Republic are often extinguished quickly, and the small resulting burned areas reflect the effectiveness of the intervention rather than the nature of the FF ignition (Berčák et al. 2018). Moreover, burned area depends on fuel quality, but it is not related to the ignition.
Predictor Variables
We used several predictors, which represent key factors driving the fire risk in the temperate forests; climate data, forest-related data, socio-economic data, and landscape context data (Appendix 1) (Clark and Merkt 1989; Emmons 1973; Ellenberg 1996; Osvald 1997; Roy 2003; Flannigan et al., 2005; Tinner et al. 2005; Martinez et al. 2009; Zachar 2009; Thomas and McAlpine 2010; Niklasson et al. 2010; Aldersley et al. 2011; Zumbrunnen et al. 2012; Donis et al. 2017; Vacchiano et al. 2018; Dupire et al. 2019; Müller et al. 2020; Elia et al. 2020; Calheiros et al. 2021).
On a monthly and summer scale, we used the average monthly air temperature and monthly rainfall as predictor variables. On an annual scale, we used the average annual air temperature, the annual total rainfall, and the number of days with snow cover per year as predictor variables. The climate data and their differentiation were used from the climate atlas and specific data for the studied period measured by Czech Hydrometeorological Institution (2019).
Forest-related data at the district level were obtained from the national forest statistics (Ústav pro hospodářskou úpravu lesů 2019). The predictor variables used were the percentage of forest area, the percentage of conifers (%), and the percentage of pine (eAGRI 2019). In the Eurasian boreal zone, FFs predominantly occur in pine forests (Engelmark 1987; Tanskanen 2007), because pine trees are the most flammable (Ubysz and Valette 2010). In addition, pine forests usually grow under relative dry conditions, produce resinous and easily combustible debris, and in the case of older stands have a relatively thin canopy that allows the debris to dry (Lecomte et al. 2005). The values for these variables were kept constant throughout the study period.
The interface between forest and urban areas and between forest and agricultural areas is related to the incidence of human-caused fires (Rodrigues et al. 2020; Ganteaume et al. 2021). Values for these variables were derived for districts using spatial analysis of Corine LandCover data European Environment Agency (2019). The used variables were the lengths of the borders relative to the area of the forest (m.ha− 1) between the forest and urban and agricultural areas, i.e., the forest-urban interface and the forest-agricultural land interface. The values for these variables were kept constant throughout the study period.
The two variables used were the number of residents per forest area (inhabitants. ha− 1) and the number of overnight tourists per forest area represented by the number of overnight guests in recreational facilities in the district in 2012–2016 (overnight guests. ha− 1y− 1). Information on the total population in individual districts on 1 January 2011 and data on overnight guests was obtained from the public database of the Czech Statistical Office (2019).
Analyses
Relationships between the predictors and the number of FFs were analyzed using generalized additive models (GAM) (Wood 2017). This model predicts values of the dependent variable based on a linear combination of predictor variables that are approximated by the so-called smoother functions. The degree of smoothness of the function (the number of nodes, k) is determined separately for each predictor variable by cross-validation to avoid overfitting (over-adaptation of the function to data and loss of ability to generalize). The main results of the analysis are the deviance explained by the model, the statistical significance of the individual predictor, and the shape of the smooth function along with the Effective Degrees of Freedom, which reflect the degree of non-linearity of a curve. The mgcv library of statistical program R was used for the analysis.
Given the high number of candidate predictor variables (Appendix 1), their mutual substitutability (concurvity) was evaluated, and redundant variables were discarded. Subsequently, several models involving different predictor variables were iteratively tested, and variables that had a statistically significant association with the number of FF were used for the interpretation. The influence of first-order interactions among the variables was tested too. The quality of the created model was assessed based on the percent of deviance explained, distribution of residuals, their autocorrelation, and the correlation of predicted and measured data.