A virtual species study to establish baseline for assessing the predicted current and future distribution ranges of real species in mountainous areas

doi:10.21203/rs.3.rs-4443811/v1

Research Article

This work is licensed under a CC BY 4.0 License

Version 1

Simulation and model prediction of virtual species aim to establish a baseline for assessing the projected contemporary and future distribution ranges of real species in mountainous areas. Fundamental niches and geographic ranges of 5 virtual species were defined in the diagram of principal components analysis based on a high-resolution climate dataset generated from meteorological data. The heterogeneity of the climate dataset was shown to influence the relationships between species responses and suitable environments, consequently affecting the geographical distributions of virtual species. The performances of 11 algorithms were evaluated by the expected fraction of shared presences (ESP), instead of TSS and AUC. ESP calculates the overlap between simulated suitable ranges and predicted current potential ranges of virtual species. According to ESP, ensemble modeling outperformed the 11 algorithms. A small sample size strongly reduced model performance, as shown by extremely low ESP values, and the use of only 5 sample points was evidently a limitation of model predictions. Furthermore, geographical distance among sample points provides signals of niches that can be identified through accurate predictions of ensemble modeling in our analyses. By the 2050s and 2090s, climate change may drive the range expansion of real species currently distributed in inland areas or on leeward slopes, while causing range restriction or local extinction of real species in coastal areas or on windward slopes. Our study can inform the application of species distribution models to provide scientific support for conservation planning in mountainous areas and forecasts of species distributions under climate change.

commission error

ensemble modeling

expected fraction of shared presences (ESP)

omission error

species distribution model

Taiwan

virtual species

Species distribution models (SDMs) are powerful tools for exploring the habitat characteristics associated with occurrence patterns of species (Chambers et al. 2013; El-Gabbas & Dormann 2018; Qazi et al. 2022; Zimmer et al. 2023). SDMs correlate either presence-only or presence/absence data of species with relevant environmental variables and subsequently generate a geographical map indicating the potential distribution ranges of species (Peterson et al. 2011). Accurate distribution ranges projected by SDMs are crucial for making informed assessments and are particularly important for improving conservation management of rare or endangered species (Guillera‐Arroita et al. 2015; Hama & Khwarahm 2023; HamadAmin & Khwarahm 2023; Lannuzel et al. 2021; Zurell et al. 2023). The accuracy of SDMs largely depends on the sample size and unbiased sampling of spatial data (Bean et al. 2012; El‐Gabbas & Dormann 2018; Elith et al. 2011; Guisan et al. 2007; Inman et al. 2021; Kadmon et al. 2003). However, a large sample size and unbiased sampling of spatial data are great challenges for rare or endangered species (Laskey et al. 2020). Particularly in mountainous areas, complex topography, habitat fragmentation, and steep climate gradients along mountain slopes usually result in small populations and biased distributions of rare or endangered species. Small population sizes and fragmented or biased distributions of rare or endangered species are critical issues in model predictions (Lannuzel et al. 2021; Liao & Chen 2021).

SDMs are seldom applied in mountainous areas not only because of the sample size of species, but also because of the resolution of climate datasets. Global climate datasets with 30 arc-second resolution, such as WorldClim (Fick & Hijmans 2017) or CHELSA (Karger et al. 2017), do not precisely reflect the heterogeneous climate environments in mountainous areas (Fick & Hijmans 2017), even when the climate dataset is downscaled to finer resolutions (Dobrowski 2011; Pradervand et al. 2014; Wang et al. 2016). Recently, a statistical method to generate high-resolution climate datasets in mountainous areas was developed to improve the performance of SDMs (Liao & Chen 2021; Liao et al. 2023). The high-resolution climate dataset of Liao et al. (2023) precisely reflected the heterogeneous climate environments in mountainous areas, and SDMs accurately projected potential distribution ranges of grassland when using this climate dataset (Liao et al. 2023). However, this high-resolution climate dataset has not been directly validated by modeling rare or endangered species, and it may not be optimal for predicting species distributions in mountainous areas, potentially limiting its application.

Before being used in modeling studies, the high resolution climate dataset needed to be assessed. Virtual species are increasingly used in SDMs mostly because they allow the separation of the effects of individual features in complex models (De Marco & Nóbrega 2018). In this study, virtual species were simulated to assess whether the high-resolution climate dataset had reflected the heterogeneity of climate environments in mountainous areas. A virtual species was generated by resembling real species to create a simulated ecological niche in an n-dimensional environmental space (Duan et al. 2015; Hirzel et al. 2001; Leroy et al. 2016; Qiao et al. 2016). The high-resolution climate dataset generated by Liao et al. (2023) was used to simulate virtual species in the principal components analysis (PCA) by estimating the probability of each cell belonging to the climate niche (Leroy et al. 2016). Subsequently, the gridded cells that were categorized as presence were mapped to show the suitable or geographical ranges in the study area. Different virtual species were simulated in different areas of the PCA diagram, and they were assumed to have distinct geographical distributions due to the heterogeneous climate environments in the study area.

SDMs were then employed to project the potential distribution ranges of the virtual species. Virtual species have become increasingly popular for testing SDMs because simulated virtual species offer known distribution ranges, which support a comprehensive understanding of species-environment relationships (De Marco & Nóbrega 2018; Hirzel et al. 2001; Meynard & Kaplan 2013). Virtual species can provide controlled, unbiased presence and absence data, which are unavailable to field ecologists (Hirzel et al. 2001; Meynard et al. 2019). The “true” presence and absence data of virtual species are more appropriate for assessing model performance and overcoming the effects of biased samples of spatial points (Bombi & D’Amen 2012; Hirzel et al. 2001; Leroy et al. 2016; Qiao et al. 2019; Qiao et al. 2016).

In addition to predicting the current distribution range, SDMs were also employed to project the future distribution range under future climate scenarios in this study. Climate change is a strong force that can cause shifts or expansion in species ranges and significantly increases the risk of extinction for rare or endangered species (Jiang et al. 2022; Qazi et al. 2022; Zurell et al. 2023). Rare or endangered species, which are characterized by small population sizes, are especially threatened by climate change (IUCN 2014). SDMs that accurately predict current distribution ranges and evaluate future range shifts are urgently needed for the effective design of conservation strategies for rare or endangered species (Ali et al. 2023; Jiang et al. 2022; Ning et al. 2021; Wan et al. 2021; Zimmer et al. 2023; Zurell et al. 2023). Therefore, future climate datasets are crucial for SDMs to assess the vulnerability of species to climate change (HamadAmin & Khwarahm 2023; Zurell et al. 2023). If the current climate dataset generated by Liao et al. (2023) accurately reflects climate heterogeneity in mountainous areas, the same approach can be used to generate future climate datasets. The future climate datasets were generated following the methodology of Liao et al. (2023) and were used in SDMs to project range shifts of both virtual and real species in the study area.

This study aims to generate high-resolution current and future climate datasets for a mountainous area that adequately capture the heterogeneous climate characteristics both in the current state and under future climate change. Five virtual species were simulated to validate the heterogeneity of the climate datasets. Several questions were addressed. (1) The characteristics of the current climate dataset were hypothesized to influence the relationship between species response (presence) and environments (habitat suitability), referred to as the species-environment relationship. (2) Various virtual species, simulated using the high-resolution climate dataset to determine habitat suitability, were assumed to have distinct geographical distribution ranges in the mountainous area. (3) SDMs were used to explore the current and future distribution ranges of virtual species in a mountainous area, in order to examine the extent of range expansion, restriction, shift or local extinction under climate change. (4) The algorithms were then applied to real species to predict both the current potential range and future distribution ranges under climate change. These predictions aim to inform applications for conservation management.

Study area

Taiwan is a subtropical island located at the western edge of the Pacific Ocean, with coordinates ranging from 21° 55’ to 25° 20’ N and 119° 30’ to 122° 00’ E (Fig. 1). The subtropical island is located 150 km off the southeast coast of Mainland China and is characterized by a monsoon climate (Chen & Tsai 1983; Su 1984). The northeast monsoon during winter and the southwest monsoon during summer influence the weather conditions of Taiwan Island. Particularly, the northeast monsoon during winter prevails in Taiwan for six months, bringing heavy rainfall and strong winds to the northern and eastern slopes of the Central Mountain Range.

In northern Taiwan (NTWN), a steep precipitation gradient extends from the coast to inland areas and significantly influences the distribution of plant species (Liao & Chen 2022; Liao et al. 2023). The annual precipitation decreases from more than 6,000 mm on the northeastern slope to 1,900 mm on the southwestern slope of the mountain ridge in NTWN (Liao et al. 2023). The mean monthly temperatures at the mountain ridge range from 11.3 ℃ in winter to 20.5 ℃ in summer, while those in the coastal area range from 17.9 ℃ in winter to 26.6 ℃ in summer (Liao & Chen 2022; Liao et al. 2023).

The study area in NTWN ranges from 24° 57’ to 25° 17’ N and 121° 24’ to 122° 00’ E (Fig. 1), covering an area of approximately 1,031 square kilometers (103,100 hectares). The highest mountain peak in the study area is Qixingshan, which stands at an elevation of 1,120 meters above sea level (asl.). The major vegetation type in NTWN is evergreen broad-leaved forest (Hsieh et al. 1997; Li et al. 2013; Liao et al. 2012). The forests in NTWN are dominated by genera such as Castanopsis, Cleyera, Cyclobalanopsis, Dendropanax, Elaeocarpus, Engelhardia, Gordonia, Helicia, Ilex, Keteleeria, Limlia, Litsea, Machilus, Meliosma, Michelia, Pinus, Schefflera, Symplocos, and Trochodendron (Li et al. 2013). The mean canopy height of these forests is approximately 15 meters. There is no deciduous forest in NTWN, although native deciduous tree species are scattered within the forests of the region. Natural grasslands are commonly found along the mountain ridges spanning from the coast to the inland regions of NTWN, and the elevations of natural grassland vary across these ridges within the study area (Liao et al. 2023). The predominant species of natural grassland in these mountain areas are Miscanthus sinensis and Pseudosasa usawai (Liao et al. 2014; Liao et al. 2023). It is worth noting that the climatic niches of these two species are similar and may demonstrate convergence of climatic niches (Liao et al. 2023).

Downscaling of the current climate dataset

In this study, a gridded climate dataset with a spatial resolution of 50 × 50 m² was created to represent the historical climate environments of the study area. This dataset was generated from daily meteorological data downloaded from the Central Weather Bureau’s website (CWB, https://e-service.cwa.gov.tw/HistoryDataQuery/index.jsp). The 50 × 50 m² gridded climate dataset was adopted to accurately capture the heterogeneous climate environments along the mountain slopes. The detailed process of downscaling the historical climate dataset, described in Liao et al. (2023), includes the following steps: (1) interpolation of the meteorological climate dataset to generate smooth climate variable surfaces; (2) creation of gridded cells, each with a spatial resolution of 50 × 50 m², to extract data from the smooth climate variable surfaces; (3) altitudinal adjustment of the extracted climate data.

Climate data from 30 meteorological stations in and around the study area were downloaded from the CWB website. Mean monthly temperature and total monthly precipitation obtained from the 30 meteorological stations were imported into ArcInfo software (ESRI, Redlands, California, USA) to generate smooth surfaces of climate variables using the Kriging method, resulting in the generation of .tif files for the climate variables. Gridded cells with a spatial resolution of 50 × 50 m² were also created using ArcInfo software. A total of over 0.4 million gridded cells were generated within the study area. For each gridded cell, the longitude, latitude, and elevation data were extracted from a digital terrain model (DTM) developed by the Department of Geography, Chinese Culture University. The DTM had a resolution of 20 by 20 meters. The elevation data obtained from the DTM were named DElev. Subsequently, the 50 × 50 m² gridded cells were mapped and overlapped with the .tif files of the meteorological climate surfaces to extract climate data. The meteorological climate dataset with a spatial resolution of 50 × 50 m² was named MCD50. Furthermore, the elevations of the meteorological stations were also interpolated using the Kriging method in ArcInfo software, resulting in the generation of a smooth elevation surface named MElev. The gridded cells of MCD50 were overlapped with MElev to extract the elevation data. The differences between DElev and MElev were calculated for the altitudinal adjustment of MCD50. The altitudinal adjustment function is: AdjMCD50 = slope × (DElev – MElev) + MCD50. The abbreviation AdjMCD50 represents the altitudinally adjusted meteorological climate data with a spatial resolution of 50 × 50 m². The slope of the function, also known as the empirical lapse rate, was calculated as the slope of the linear regression between the elevation and climate data of the nearest 12 meteorological stations. The linear regression model was implemented using the "stats" package within the R environment (Chambers & Hastie 1992). The detailed methodology for generating the current climate dataset followed the study conducted by Liao et al. (2023).
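The altitudinal adjustment can be illustrated with a minimal sketch. It is written in Python for illustration only (the study itself used the R "stats" package), the function names and station values are hypothetical, and the selection of the nearest 12 stations is omitted for brevity.

```python
def lapse_rate(elevations, values):
    """Least-squares slope of a climate variable regressed on elevation."""
    n = len(elevations)
    mean_x = sum(elevations) / n
    mean_y = sum(values) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(elevations, values))
    sxx = sum((x - mean_x) ** 2 for x in elevations)
    return sxy / sxx


def adjust_for_altitude(mcd50, d_elev, m_elev, slope):
    """AdjMCD50 = slope * (DElev - MElev) + MCD50 for a single grid cell."""
    return slope * (d_elev - m_elev) + mcd50


# Hypothetical stations: elevation (m) and mean monthly temperature (deg C).
# The study fits the regression to the nearest 12 stations; six are used here.
station_elev = [0.0, 200.0, 400.0, 600.0, 800.0, 1000.0]
station_temp = [22.0, 20.8, 19.6, 18.4, 17.2, 16.0]

slope = lapse_rate(station_elev, station_temp)

# A cell whose DTM elevation (DElev) is 400 m above the interpolated
# station elevation surface (MElev) is cooled accordingly.
adjusted = adjust_for_altitude(mcd50=20.0, d_elev=700.0, m_elev=300.0, slope=slope)
```

With these made-up values the fitted lapse rate is about −0.006 ℃ per meter, so a cell lying 400 m above the station elevation surface is adjusted downward by about 2.4 ℃.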

The altitudinally adjusted climate data, AdjMCD50, was utilized as the historical climate dataset for modeling species distributions. The climate dataset was generated by considering the following nine variables: mean annual temperature (Tmean), mean maximum temperature of the warmest month (Twrm), mean minimum temperature of the coldest month (Tcld), mean temperature in summer (Tsmr) and winter (Twnt), temperature differences between warmest and coldest months (Tdif), annual total precipitation (Pann), total precipitation in summer (Psmr) and winter (Pwnt).

Downscaling of future climate projections

The history of the Intergovernmental Panel on Climate Change’s (IPCC) assessment reports covers several generations of emissions scenarios (IPCC 2013, 2022; Pedersen et al. 2021). Emission scenarios are generally developed to describe different socio-economic and policy choices, allowing for the assessment of different potential futures and their implications for long-term climate systems (Kebede et al. 2018). In 2022, the global Shared Socioeconomic Pathways (SSPs) were introduced in the Sixth Assessment Report (AR6), the report most recently published by the IPCC, integrating socioeconomic developments into future climate scenarios (IPCC 2022). There are five SSPs (SSP1 to SSP5), and these SSPs are used in conjunction with climate models to generate a range of possible future climate and environmental conditions based on different societal choices and policy directions (IPCC 2022; Pedersen et al. 2021). Among the five pathways, SSP1 is the most optimistic scenario and emphasizes sustainable development, while SSP2 represents a middle pathway (Pu et al. 2020). SSP3 and SSP4 are the most undesirable pathways, assuming unsustainable development trends. SSP5 assumes an energy-intensive, fossil-fuel-based economy, but also relatively optimistic development (Pu et al. 2020).

Regarding the future climate scenarios, the working group of the Taiwan Climate Change Information and Adaptation Knowledge Platform (TCCIP) has generated 5 × 5 km² gridded climate datasets to present future climate projections for the island of Taiwan (Wang et al. 2021). The future climate projections for the island of Taiwan were downscaled from 49 general circulation models (GCMs) of the 6th phase of the Coupled Model Intercomparison Project (CMIP6). Among the 49 available GCMs, five climate system models were used in this study: ACCESS-CM2 (Meucci et al. 2023), FGOALS-g3 (Pu et al. 2020), GFDL-ESM4 (Dunne et al. 2020), MIROC6 (Kataoka et al. 2020), and TaiESM1 (Wang et al. 2021). The five GCMs were used to generate downscaled future climate datasets at 50 × 50 m² resolution for the study area under different climate scenarios.

The TCCIP published 5 × 5 km² gridded climate datasets covering climate data from 1960 to 2100. Since the climate dataset of AdjMCD50 was at a spatial resolution of 50 × 50 m², the gridded climate dataset projecting future climate at 50 × 50 m² was recalculated based on the relative changes observed in the TCCIP’s historical and future climate datasets. We selected three time periods of climate data from TCCIP’s climate datasets to represent the early (2000–2020), mid (2045–2055), and end (2091–2100) of the 21st century. The TCCIP’s climate datasets for the three time periods were used to calculate the mean monthly temperature and total monthly precipitation. Subsequently, differences in monthly temperature and total monthly precipitation between the early and mid, as well as the early and end, of the 21st century were calculated to generate relative changes in climate data. The relative changes in temperature and precipitation were also represented as 5 × 5 km² gridded data, which were used to generate .tif files of smooth climate surfaces using the Kriging method in ArcInfo software. The gridded cells with a spatial resolution of 50 × 50 m² were overlapped with the .tif files of smooth climate surfaces to extract the relative changes in climate data. The historical climate datasets, AdjMCD50, were combined with the 50 × 50 m² gridded relative changes to project future climates.
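The change-based projection described above can be sketched as follows. This is a minimal illustration that assumes the changes are additive differences (future minus early period) applied to the historical baseline of each cell; the function names and numbers are hypothetical, and the Kriging interpolation of the 5 × 5 km² changes is not reproduced.

```python
def monthly_change(early, future):
    """Difference between a future period and the early-century period, per month."""
    return [f - e for e, f in zip(early, future)]


def project_future(baseline, change):
    """Add the extracted change to the historical 50 m baseline, per month."""
    return [b + c for b, c in zip(baseline, change)]


# Hypothetical two-month example for one grid cell (deg C).
early_period = [17.5, 24.0]   # coarse-grid value, 2000-2020
mid_century = [18.5, 25.5]    # coarse-grid value, 2045-2055
baseline = [18.0, 25.0]       # AdjMCD50 value at the cell

change = monthly_change(early_period, mid_century)
future = project_future(baseline, change)
```

Because only the change is taken from the coarse grid, the fine-scale spatial pattern of the historical dataset is carried over unchanged into the projected climate.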

Simulations of virtual species

The gridded cells with historical climate data (2000–2020) were analyzed using principal components analysis (PCA). PCA was employed to reduce dimensionality and mitigate collinearity among environmental variables, enabling a clearer quantification of environmental overlap (Journé et al. 2020; Meynard et al. 2019; Qiao et al. 2016). Nine climate variables were initially considered for the PCA. Notably, significant collinearity was observed between Pwnt and Pann, Tsmr and Twrm, and Twnt and Tcld. Thus, Pann, Twrm, and Tcld were excluded from the PCA. The remaining climate variables were Tmean, Tsmr, Twnt, Tdif, Psmr, and Pwnt. The first two principal components (PC1 and PC2) were retained and together explained 84.1% of the overall variation.

To create a range of spatial patterns in habitat suitability, the function “generateSpFromPCA” implemented in the R package “virtualspecies” was employed to simulate virtual species (Leroy et al. 2016). When using the function, the parameter nb.points was set to 5000. The other two parameters, means and sds, were configured to simulate five virtual species located in various regions of the PCA diagram, representing distinct niches and suitable environments of different virtual species. Running the function resulted in distinct geographical distribution patterns in the study area (Fig. 1 and Supplement 1). The first virtual species was designed to be located at the center of the PCA diagram, resulting in a geographic distribution range covering the windward slopes near the mountain ridge (Fig. 1). The other four virtual species were designed to be located at the lower, left, right, and upper sides of the PCA diagram (Supplement 1). The simulation of the five virtual species aimed to ensure that they occupied different environments and that their geographical distribution ranges were spread widely across the study area.
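The idea of placing a virtual species in the PCA diagram can be illustrated with a simplified stand-in. The sketch below assumes a Gaussian response on each of the two retained axes, multiplies the two responses into a suitability value, and converts suitability to presence with a hypothetical threshold of 0.5. It is written in Python and is not the code of the “virtualspecies” package; the `means` and `sds` arguments only mirror the roles of the parameters named in the text.

```python
import math


def gaussian_response(score, mean, sd):
    """Unnormalised Gaussian response, equal to 1 at the niche optimum."""
    return math.exp(-0.5 * ((score - mean) / sd) ** 2)


def suitability(pc1, pc2, means, sds):
    """Product of the responses along PC1 and PC2 (assumed combination rule)."""
    return (gaussian_response(pc1, means[0], sds[0])
            * gaussian_response(pc2, means[1], sds[1]))


def is_present(pc1, pc2, means, sds, threshold=0.5):
    """A cell is part of the suitable range when suitability reaches the threshold."""
    return suitability(pc1, pc2, means, sds) >= threshold


# Hypothetical species centred on the origin of the PCA diagram.
centre_means = (0.0, 0.0)
centre_sds = (1.0, 1.0)

at_optimum = suitability(0.0, 0.0, centre_means, centre_sds)
far_cell_present = is_present(3.0, 0.0, centre_means, centre_sds)
```

Shifting `means` toward the lower, left, right, or upper side of the diagram would move the niche optimum in the same way that the five virtual species occupy different regions of the PCA space.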

Inventory of real species

In this study, we selected eleven real plant species and grasslands for modeling analysis. Among them, five species are rarely observed in the study area: Maackia taiwanensis, Benthamidia japonica, Lilium speciosum, Rhododendron pseudochrysanthum, and Bretschneidera sinensis. Their rarity is attributed to their limited geographic range on this island, mainly confined to the mountainous regions of NTWN, with only a few occurrences of these species recorded in the fieldwork. The remaining six species include four woody angiosperms, Rhododendron nakaharai, Euscaphis japonica, Ficus fistulosa, and Saurauia tristyla var. oldhamii, as well as two fern species, Dipteris conjugata and Sphaeropteris lepifera. Occurrences of these eleven plant species and grasslands were collected along the roads and mountain trails within the study area. The coordinates of occurrences collected in the fieldwork were spatially verified to delete duplicated occurrence records.

Modelling technique

In this study, 11 algorithms were employed to predict the potential distribution ranges of virtual and real species in mountainous areas. The 11 algorithms include artificial neural network (ANN), classification tree analysis (CTA), flexible discriminant analysis (FDA), generalized additive model (GAM), generalized boosting model (GBM, or usually called boosted regression trees), generalized linear model (GLM), multiple adaptive regression splines (MARS), Maximum Entropy (MAXENT), random forest (RF), surface range envelop (SRE, or usually called BIOCLIM), and extreme gradient boosting training (XGBOOST). In addition, ensemble modeling, which combines the 11 algorithms, was employed to predict virtual and real species in mountainous areas. The 11 algorithms and ensemble modeling were implemented using the “biomod2” package in R software (Thuiller et al. 2016).

For each virtual species, occurrence and background points were integrated to generate a modeling dataset. The coordinates of the occurrences and background points were used to extract climate data from the climate surfaces. The occurrences were randomly sampled from the geographic distribution range of the virtual species, as shown in Fig. 1 and Supplement 1. Various numbers of sample points were used in the model prediction to assess the impacts of sample size on the model performances. The numbers of sample points imported into the model predictions were 5, 20, 50, 100, and 200. The number of background points was 100 times the number of sample points randomly selected in the study area. When the sample and background points were used for the model predictions, a random set comprising 80% of the occurrence and background data was selected to train the model, and the remaining 20% was used for evaluation.
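The sampling design above can be sketched as follows. This is a minimal illustration in Python with hypothetical cell identifiers; it pools occurrences and background points before the 80/20 split, which is an assumption, since the text does not state whether the split was stratified.

```python
import random

SAMPLE_SIZES = [5, 20, 50, 100, 200]   # numbers of sample points tested
BACKGROUND_RATIO = 100                 # background points per sample point
TRAIN_FRACTION = 0.8                   # share of records used for training


def build_dataset(presence_cells, all_cells, n_samples, rng):
    """Draw occurrences and background points, then split them 80/20."""
    occurrences = rng.sample(presence_cells, n_samples)
    background = rng.sample(all_cells, n_samples * BACKGROUND_RATIO)
    records = [(cell, 1) for cell in occurrences]
    records += [(cell, 0) for cell in background]
    rng.shuffle(records)
    n_train = round(len(records) * TRAIN_FRACTION)
    return records[:n_train], records[n_train:]


# Hypothetical grid: 50,000 cell ids, the first 1,000 forming the suitable range.
all_cells = list(range(50_000))
presence_cells = all_cells[:1_000]

rng = random.Random(42)
train_set, eval_set = build_dataset(presence_cells, all_cells, 20, rng)
```

With 20 sample points the dataset holds 2,020 records, of which 1,616 are used for training and 404 for evaluation.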

Model performance

To assess the accuracy of the algorithms and ensemble modeling, the training dataset was resampled and modeled 10 times to quantify uncertainties in predictions. The true skill statistic (TSS) and the area under the receiver operating characteristic curve (AUC) are commonly used to assess the accuracy of species distribution models (Fois et al. 2015; Lannuzel et al. 2021; Qiao et al. 2019; Xu et al. 2021). For creating the final ensemble models, only those models with a TSS score greater than 0.8 were used (Khan & Verma 2022).
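For reference, TSS is conventionally defined as sensitivity plus specificity minus one. The sketch below computes it from a confusion matrix and applies the 0.8 cutoff used for ensemble building; the counts and model scores are hypothetical, and the code is an illustration in Python and not the “biomod2” implementation.

```python
def tss(tp, fp, fn, tn):
    """True skill statistic: sensitivity + specificity - 1."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1.0


def select_for_ensemble(scores, cutoff=0.8):
    """Keep only the models whose TSS exceeds the cutoff."""
    return [name for name, score in scores.items() if score > cutoff]


# Hypothetical confusion matrix: 45 true presences, 10 false presences,
# 5 false absences, 90 true absences.
example_tss = tss(tp=45, fp=10, fn=5, tn=90)

# Hypothetical TSS scores of three single-algorithm models.
kept = select_for_ensemble({"GLM": 0.85, "RF": 0.92, "SRE": 0.61})
```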

Notably, a previous study proposed that error indices such as TSS and AUC do not imply accuracy of suitability, since these indices provide a single-number discrimination measure across all possible ranges of thresholds (Lobo et al. 2008). High values of TSS and AUC do not guarantee accurate model performance (Liao & Chen 2022). Thus, a similarity index called the expected fraction of shared presences (ESP) was introduced to evaluate model performance in this study. The ESP was modified from the Sørensen similarity index to compare the similarity of potential ranges between two species (Godsoe 2014; Inman et al. 2021). In this study, the ESP was revised to compare the suitable range of a virtual species simulated by PCA with its potential ranges predicted by 11 algorithms and ensemble modeling. The function of the ESP is:

$$\text{ESP}=\frac{2\sum_{j} P_{s(j)}\,P_{p(j)}}{\sum_{j}\left(P_{s(j)}+P_{p(j)}\right)}$$

where P_s(j) denotes the presence of the simulated suitable range at a given cell j, and P_p(j) denotes the presence of the predicted potential range at the same cell j. The product P_s(j)P_p(j) is therefore non-zero only when cell j belongs to both the suitable range and the potential range, and the sums run over all cells j in the study area. An ESP value of 1 indicates perfect agreement between the suitable and potential ranges of a virtual species, while a value of 0 indicates complete geographic separation (Godsoe 2014; Inman et al. 2021).
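The ESP calculation can be illustrated with a short sketch. It assumes that both ranges are given as binary values per cell (1 for presence, 0 for absence), in the same cell order; the function name and the example maps are hypothetical.

```python
def esp(simulated, predicted):
    """Expected fraction of shared presences between two range maps.

    Both arguments are sequences of per-cell values in the same cell order.
    """
    shared = sum(s * p for s, p in zip(simulated, predicted))
    total = sum(s + p for s, p in zip(simulated, predicted))
    if total == 0:
        return 0.0  # neither range contains any presence
    return 2.0 * shared / total


# Hypothetical six-cell maps: three suitable cells, three predicted cells,
# two of which coincide.
simulated_range = [1, 1, 1, 0, 0, 0]
predicted_range = [1, 1, 0, 1, 0, 0]

partial_overlap = esp(simulated_range, predicted_range)
perfect_overlap = esp(simulated_range, simulated_range)
no_overlap = esp([1, 1, 0, 0], [0, 0, 1, 1])
```

In this example two of the three cells in each range are shared, giving an ESP of 2 × 2 / 6, or about 0.67.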

Suitable ranges of virtual species

Five virtual species were simulated by selecting gridded cells from different regions in the PCA diagram, each representing distinct suitable ranges of the five virtual species in the study area. The gridded cells at the center of the PCA (PCAC) diagram represented a distribution range on windward slopes near mountain ridges in the study area (Fig. 1). The gridded cells at the lower (PCAO), left (PCAL), right (PCAR), and upper (PCAU) sides of the PCA diagram represented distribution ranges in the coastal area, mountain ridge, inland area, and leeward slopes near the mountain ridge, respectively (Supplement 1). We generated 5 virtual species with distinct climatic niches in the PCA diagram, and the 5 virtual species evidently presented distinct geographical distribution ranges in the study area. It is evident that heterogeneous climate environments can provide diverse suitable habitats for the growth of species. By simulating the virtual species, the high-resolution gridded climate dataset generated from meteorological data was validated to reflect the climate heterogeneity of mountainous areas.

Performances of 11 algorithms

The potential distribution ranges of the five virtual species were predicted by 11 algorithms to evaluate their performances, aiming to select appropriate algorithms for predicting the potential distribution ranges of real species in mountainous area. Calculations of the ESPs were then used in this study to estimate the degree of overlap between suitable ranges of virtual species simulated in the PCA diagram and potential ranges predicted by the algorithms (Fig. 2). FDA, GBM, GLM, MARS, MAXENT, and SRE have relatively higher mean ESP values. Relatively higher mean ESP values demonstrated greater overlap between the simulated suitable ranges and the projected potential ranges, indicating better performances of these 6 algorithms.

In the virtual species approach, ESP together with false presences and false absences can be used to detect algorithm characteristics. A lower ESP value indicates that the commission (false presence) and omission (false absence) errors are likely to be higher (Fig. 2). A high number of false presences indicates overestimation of the potential ranges of virtual species, while a high number of false absences indicates underestimation. A small sample size reveals the characteristics of the various algorithms. When the number of sample points was 5, the predicted ranges of CTA, SRE and XGBOOST showed relatively low values of false presences but very high values of false absences. CTA, SRE, and XGBOOST thus tend to have higher omission errors, that is, to underestimate the species’ potential ranges. On the contrary, the predicted ranges of GLM and MAXENT showed relatively high values of false presences and slightly lower values of false absences. GLM and MAXENT thus tend to have higher commission errors, that is, to overestimate the species’ potential ranges. Algorithms that either overestimate or underestimate potential ranges can serve as technological options to provide scientific support for designing conservation strategies.
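The two error types can be counted directly when the true range is known, as it is for a virtual species. The following sketch assumes binary per-cell maps and uses hypothetical data; it only illustrates how an under-predicting and an over-predicting model differ.

```python
def range_errors(simulated, predicted):
    """Return (false presences, false absences) for two binary range maps."""
    false_presences = sum(1 for s, p in zip(simulated, predicted)
                          if s == 0 and p == 1)   # commission errors
    false_absences = sum(1 for s, p in zip(simulated, predicted)
                         if s == 1 and p == 0)    # omission errors
    return false_presences, false_absences


# Hypothetical eight-cell study area with four suitable cells.
simulated_range = [1, 1, 1, 1, 0, 0, 0, 0]
under_prediction = [1, 0, 0, 0, 0, 0, 0, 0]   # misses most of the range
over_prediction = [1, 1, 1, 1, 1, 1, 1, 0]    # spills beyond the range

under_errors = range_errors(simulated_range, under_prediction)
over_errors = range_errors(simulated_range, over_prediction)
```

The under-predicting map produces only false absences, while the over-predicting map produces only false presences, which mirrors the contrast drawn between the two groups of algorithms.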

The potential ranges predicted by algorithms based on large sample size were highly overlapped with the simulated suitable ranges of virtual species. Our findings demonstrate that sample size is a significant factor affecting the model performances, particularly when the sample size is 5. Five sample points are evidently the limitation of model predictions, because the ESP values were consistently lower than 0.45 for all of the algorithms.

In addition, the distinct geographical ranges of the five virtual species have, to some extent, affected the ESP values. The full range of ESP values is slightly narrower for the virtual species distributed at the coastal area (PCAO, the first diagram of Supplement 2). On the contrary, the full range of ESP values is slightly wider for the virtual species distributed at the leeward slopes near mountain ridge (PCAU, the lowest diagram of Supplement 2).

In most previous studies, the TSS and AUC were calculated to assess the predictive performance of algorithms. Higher TSS and AUC should indicate better model performance. However, high values of TSS and AUC did not guarantee better model performance in this study. When the ESP values were relatively high, the TSS and AUC were not always high, and they were sometimes contradictory to the ESP index. In addition, there was no specific trend in TSS and AUC values in this study (Fig. 3 and Supplement 3). In conclusion, the TSS and AUC were not appropriate indices for representing the predictive performance of the algorithms.

This study also employed virtual species distributions to evaluate the predictive power of ensemble modeling. Ensemble modeling accurately represented the potential geographical ranges of the five virtual species (Fig. 4 and Supplement 4). Ensemble modeling performed better than the 11 algorithms, as the ESP values were mostly close to or higher than 0.8 when the sample points were greater than 5. Sample size and geographical distribution ranges had less effect on ensemble modeling. Even so, an extremely small sample size had strong effects on the projection results of ensemble modeling. When 5 sample points were applied for ensemble modeling, a low ESP value demonstrated poor model performance (Fig. 5). Ensemble modeling performed well when the number of sample points was higher than 20. A large sample size resulted in a precise overlap between the simulated suitable range and the predicted potential range of virtual species. Since ensemble modeling precisely projected potential distribution ranges of virtual species, it is a good model for predicting species’ potential distribution ranges. All 5 virtual species can be accurately predicted by ensemble modeling, and this pattern does not depend on the ecological characteristics of the virtual species. Our results demonstrate that ensemble modeling produced reliable information on the potential ranges of species.

Importance of predictors

For the 5 virtual species, the relative importance of predictors varied among algorithms, but temperature was clearly more important than precipitation (Table 1 and Supplement 4). The study area has complex topography and elevation gradients that affect species distributions. Because temperature is highly correlated with the elevation gradient, temperature variables are important in predicting virtual species. Precipitation, on the other hand, differs significantly between windward and leeward slopes in the study area. Two species, B. sinensis and B. japonica, are mainly observed on the windward slopes, and their distributions are likely related to the high precipitation there. However, the relationship between plant distribution and the precipitation gradient in the study area was not clearly represented in the model predictions.

Table 1

The importance values of predictors for the 11 algorithms and ensemble modeling in predicting virtual species. The virtual species are located at the center of the PCA diagram.
Predictors	ANN	FDA	CTA	GAM	GBM	GLM	MAXNET	MARS	SRE	RF	XGBOOST	Ensemble
Psmr	0.11	0.00	0.00	0.05	0.03	0.00	0.01	0.04	0.09	0.06	0.07	0.04
Pwnt	0.08	0.01	0.01	0.07	0.03	0.02	0.01	0.05	0.09	0.06	0.16	0.05
Tmean	0.30	0.31	0.00	0.18	0.02	0.15	0.00	0.10	0.20	0.21	0.04	0.14
Twnt	0.16	0.27	0.33	0.25	0.23	0.27	0.25	0.28	0.18	0.18	0.26	0.24
Tsmr	0.20	0.32	0.51	0.24	0.28	0.50	0.38	0.35	0.22	0.26	0.18	0.31
Tdif	0.15	0.09	0.15	0.21	0.42	0.05	0.35	0.18	0.22	0.23	0.30	0.21
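
The comparison between temperature and precipitation predictors can be checked directly from the Ensemble column of Table 1. The short sketch below sums the importance values by group and ranks the predictors. It assumes that predictor names beginning with "T" are temperature variables and those beginning with "P" are precipitation variables, as the abbreviations suggest; the helper name `group_importance` is hypothetical.

```python
# Ensemble column of Table 1 (importance values as reported).
ensemble_importance = {
    "Psmr": 0.04, "Pwnt": 0.05,
    "Tmean": 0.14, "Twnt": 0.24, "Tsmr": 0.31, "Tdif": 0.21,
}


def group_importance(importance, prefix):
    """Sum importance over predictors whose name starts with the given prefix."""
    return round(sum(v for k, v in importance.items() if k.startswith(prefix)), 2)


# Assumption: "T..." = temperature predictors, "P..." = precipitation predictors.
temperature_total = group_importance(ensemble_importance, "T")
precipitation_total = group_importance(ensemble_importance, "P")

# Predictors ordered from most to least important in the ensemble.
ranked = sorted(ensemble_importance, key=ensemble_importance.get, reverse=True)
```

Under that assumption the four temperature predictors together account for 0.90 of the ensemble importance and the two precipitation predictors for 0.09, with Tsmr ranked first.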

Model evaluation of real species

Since ensemble modeling performed well in predicting virtual species, it was employed to predict the potential distribution ranges of the 11 plant species and grassland (Fig. 6). The most significant factor affecting model performance is the number of occurrence records. A large sample is usually difficult to collect in mountainous areas, particularly when the target species is rare or endangered. If a species had fewer than 50 occurrence records, all of its occurrences were used for ensemble modeling. If a species had more than 50 records, 50 sample points were randomly selected from the occurrences. Ensemble modeling produced reasonable projections of the potential distribution ranges of the 11 plant species and grassland (Fig. 6).
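
The selection rule described above can be sketched as follows. The function name `select_occurrences` and the `seed` parameter are hypothetical, and the text does not say how a species with exactly 50 records was handled; the sketch assumes that all 50 are used.

```python
import random


def select_occurrences(records, max_points=50, seed=None):
    """Return all records if there are at most max_points, else a random subset."""
    records = list(records)
    if len(records) <= max_points:
        return records
    rng = random.Random(seed)  # a fixed seed makes the subset reproducible
    return rng.sample(records, max_points)


# Hypothetical occurrence ids: a rare species (14 records) and a common one (120).
rare = select_occurrences(range(14))
common = select_occurrences(range(120), seed=1)
```

Sampling without replacement keeps every selected point a distinct occurrence record.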

A small sample size had a significant effect on the potential distribution ranges of real species. The sample sizes of L. speciosum and M. taiwanensis are relatively small, with 9 and 14 occurrence records, respectively. However, the two species have dramatically different patterns of potential distribution. A small sample spread over a wide distribution range, as for L. speciosum, resulted in a larger potential geographical range. In contrast, the occurrences and potential distribution range of M. taiwanensis are confined to a small part of the study area. Sample size is therefore not the only factor that affects the potential distribution range of a species; the geographical distances among occurrence points also matter.

Range shifts of virtual and real species under future climate change

Virtual species with distinct contemporary distribution ranges showed marked range shifts, restrictions, or expansions under climate change (Supplement 6). Virtual species distributed on windward slopes, on mountain ridges, or in coastal areas exhibited range shifts by mid-century and range restrictions by the end of the century (Supplement 6.1, 6.2 and 6.3), particularly under the SSP585 scenario. In contrast, virtual species distributed in inland areas and on leeward slopes near inland areas exhibited range expansion by the middle and end of this century (Supplement 6.4 and 6.5).

Ensemble modeling also showed that climate change may lead to different outcomes for the 11 plant species and grassland (Supplement 7). The predictions indicate that climate change could lead to the local extinction of some rare species, such as M. taiwanensis (Supplement 7.8), R. nakaharai (Supplement 7.9), and R. pseudochrysanthum (Supplement 7.10). Surprisingly, rare species do not always exhibit range restriction or local extinction under climate change: L. speciosum is predicted to expand its range by the middle and end of this century (Supplement 7.7). Several plant species exhibit range shifts by the middle and end of this century, including B. sinensis (Supplement 7.2), D. conjugata (Supplement 7.3), S. tristyla (Supplement 7.11), and S. lepifera (Supplement 7.12). On the other hand, B. japonica (Supplement 7.1) and F. fistulosa (Supplement 7.5) exhibit range expansions over the same periods. The remaining two, E. japonica (Supplement 7.4) and grassland (Supplement 7.6), exhibited range restriction or local extinction, depending on the climate model.
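
Outcomes such as expansion, restriction, and local extinction can be quantified by comparing current and future binary range maps cell by cell. The sketch below is one minimal way to do this; the function name `range_change` and the example maps are hypothetical, and the criteria the study used to label a change as a shift, restriction, or expansion are not given in this section.

```python
def range_change(current, future):
    """Compare binary presence maps (1 = suitable, 0 = unsuitable) cell by cell."""
    if len(current) != len(future):
        raise ValueError("maps must cover the same cells")
    stable = sum(1 for c, f in zip(current, future) if c and f)
    lost = sum(1 for c, f in zip(current, future) if c and not f)
    gained = sum(1 for c, f in zip(current, future) if not c and f)
    current_size = stable + lost
    future_size = stable + gained
    return {
        "stable": stable,
        "lost": lost,
        "gained": gained,
        # Relative change in range size; None if the species is absent today.
        "net_change": (future_size - current_size) / current_size
        if current_size else None,
        "locally_extinct": future_size == 0,
    }


# Hypothetical six-cell landscape.
current_map = [1, 1, 1, 1, 0, 0]
future_map = [0, 0, 1, 1, 1, 0]
change = range_change(current_map, future_map)
```

A range that loses and gains cells at the same time, as in this example, corresponds to a shift combined with a net contraction.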

Discussion

Simulation of virtual species

Simulating virtual species has been recommended as a way to test any new SDM method before applying it to real data (Austin 2007; Austin et al. 2006). In previous studies, virtual species have been used to assess the influence of environmental structure, data aggregation strategies, and resolution and scale on SDM performance (Hirzel et al. 2001; Meynard et al. 2019; Qiao et al. 2019). In this study, virtual species were simulated in a mountainous area to test the heterogeneity of the climate dataset, the characteristics of algorithms, the effects of sample size, and range shifts and/or local extinctions under climate change.

Heterogeneity of climate dataset

The WorldClim dataset (Fick & Hijmans 2017) is widely used in SDM studies. Despite its global coverage, the environmental data still carry uncertainties caused by spatial interpolation in mountainous areas and in regions with few observation stations (Fick & Hijmans 2017). WorldClim variables have been downscaled to finer resolutions for SDM studies, but this may only retain the information in the original data source without truly increasing the spatial resolution of the variables (Peterson et al. 2011; Sillero & Barbosa 2021). In this study, high-resolution historical and future climate datasets were generated with the statistical method of Liao et al. (2023). Interpolation of meteorological data captured the horizontal variation of climate features, while lapse rates were calculated to adjust the climate data of each grid cell for variation along elevation. The resulting high-resolution climate dataset reflected the topographical and altitudinal variation of climate environments in mountainous areas. The simulation of virtual species validated the heterogeneity of climate environments in the study area, and the high-resolution climate dataset is appropriate for addressing issues in SDM studies.
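
The elevation adjustment mentioned above can be illustrated with a simple linear lapse-rate correction. This is a minimal sketch, not the method of Liao et al. (2023): the function name `adjust_temperature` is hypothetical, and the default lapse rate of -0.0065 °C per metre is the standard-atmosphere value used only as a placeholder, whereas the study calculated its own lapse rates.

```python
def adjust_temperature(t_interpolated, elev_reference, elev_cell,
                       lapse_rate=-0.0065):
    """Shift an interpolated temperature (deg C) to the cell's own elevation.

    lapse_rate is in deg C per metre. The default is the standard-atmosphere
    value and is a placeholder, not a value taken from the study.
    """
    return t_interpolated + lapse_rate * (elev_cell - elev_reference)


# Hypothetical example: 20 deg C interpolated for a 200 m reference surface,
# adjusted to a grid cell at 1000 m elevation.
t_cell = adjust_temperature(20.0, elev_reference=200.0, elev_cell=1000.0)
```

With these placeholder values the cell at 1000 m is 5.2 °C cooler than the reference surface, which shows how strongly elevation alone can differentiate neighbouring cells in steep terrain.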

The future climate datasets derived from GCMs had a spatial resolution of 5 km × 5 km, which is too coarse to capture the heterogeneous climate environments in mountainous areas. This study applied the statistical method developed by Liao et al. (2023) to downscale future climate datasets from 5 GCMs to a finer resolution. The downscaled future climate datasets, with a spatial resolution of 50 m × 50 m, proved reliable for predicting the future distribution ranges of the virtual species. The same statistical method could be used to generate high-resolution historical and future climate datasets in other mountainous areas.

Performances and characteristics of algorithms

A virtual species approach can be used to explore the predictive accuracy of algorithms. TSS and AUC, evaluated by split-sample validation, are commonly used to assess model accuracy. However, the discrimination capacity estimated by TSS and AUC did not reflect the actual predictive ability of the SDMs. TSS and AUC tend to be over-optimistic compared with real model performance, both when predicting under current conditions and especially when projecting to future conditions (Santini et al. 2021). In this study, the TSS and AUC of the 11 algorithms were mostly close to or higher than 0.8, except when the sample size was 5 (Fig. 3 and Supplement 3). High values of TSS and AUC do not guarantee accurate predictions of species’ potential ranges. In practice, TSS and AUC varied only slightly across algorithms in our analyses and were not appropriate for representing model performance.
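
For readers less familiar with the two indices, the sketch below computes TSS from a confusion matrix and AUC as the probability that a presence receives a higher suitability score than an absence. The counts and scores are hypothetical and chosen only to illustrate the calculation; they are not results from this study.

```python
def tss(tp, fp, fn, tn):
    """True skill statistic: sensitivity + specificity - 1."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1


def auc(presence_scores, absence_scores):
    """Probability that a random presence outscores a random absence.

    Ties count as half a win (rank-based definition of AUC).
    """
    wins = 0.0
    for p in presence_scores:
        for a in absence_scores:
            if p > a:
                wins += 1.0
            elif p == a:
                wins += 0.5
    return wins / (len(presence_scores) * len(absence_scores))


# Hypothetical confusion matrix with many true absences.
tss_value = tss(tp=40, fp=60, fn=10, tn=890)

# Hypothetical suitability scores at presence and absence points.
auc_value = auc([0.9, 0.8, 0.4], [0.7, 0.3, 0.2, 0.1])
```

In this hypothetical case TSS is about 0.74 even though the prediction contains more false presences (60) than true presences (40), because the large number of true absences keeps specificity high.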

This study suggests using the ESP value to assess model performance in SDM studies. As an index, the ESP clearly distinguished the predictive power of the algorithms and was a much better indicator than TSS and AUC. Although the ESP is a good indicator of model performance, it was strongly affected by small sample sizes. In our analyses, five sample points were clearly too few for model prediction, since the ESP value was consistently lower than 0.45 for all of the algorithms. The ratios of false presence and false absence were also particularly high when the sample size was 5. That is, a very small sample size resulted in low algorithm accuracy.
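
The idea behind an overlap-based index can be sketched with sets of grid cells. The exact formula for ESP is defined outside this section, so the example below uses a Jaccard-style ratio (shared cells divided by the union of simulated and predicted cells) as a stand-in, together with false presence and false absence ratios. The function name `overlap_metrics`, the chosen denominators, and the cell ids are all assumptions made for illustration.

```python
def overlap_metrics(simulated, predicted):
    """Compare simulated-suitable and predicted-suitable sets of grid-cell ids."""
    simulated, predicted = set(simulated), set(predicted)
    shared = simulated & predicted            # correctly predicted presences
    false_presence = predicted - simulated    # predicted but not simulated
    false_absence = simulated - predicted     # simulated but not predicted
    union = simulated | predicted
    return {
        # Stand-in for an overlap index; not necessarily the study's ESP formula.
        "overlap": len(shared) / len(union) if union else 0.0,
        "false_presence_ratio":
            len(false_presence) / len(predicted) if predicted else 0.0,
        "false_absence_ratio":
            len(false_absence) / len(simulated) if simulated else 0.0,
    }


simulated_cells = range(1, 11)   # hypothetical: cells 1..10 are truly suitable
predicted_cells = range(6, 14)   # hypothetical: model predicts cells 6..13
metrics = overlap_metrics(simulated_cells, predicted_cells)
```

Unlike TSS, an index of this kind ignores correctly predicted absences, so it cannot be inflated by a large unsuitable background.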

Future distribution range of species

Our analysis revealed that climate change had distinct effects on climate environments in different areas and consequently caused different responses in the five virtual species. According to the climate models, climate environments in coastal areas are projected to shift toward higher temperatures and lower precipitation by the middle or end of this century. The shifted climate environments in coastal areas will resemble the current climate environments in inland areas. The virtual species currently distributed in inland areas or on leeward slopes of the study area will therefore expand or shift their distribution ranges toward coastal areas under climate change. Meanwhile, the virtual species currently distributed in coastal areas or on mountain ridges will undergo range restriction or local extinction because they have almost no chance of tracking their suitable ranges within the study area. The distinct responses of the virtual species under climate change are thus related to their climatic demands.

Different responses of virtual species under climate change provide a baseline for studying the fates of plant species. Many of our study species were under threat of local extinction, such as B. japonica, B. sinensis, E. japonica, M. taiwanensis, R. nakaharai, and R. pseudochrysanthum. The suitable ranges of these six species were in coastal areas or on windward slopes near mountain ridges. Some species, such as D. conjugata, grassland, S. tristyla, and S. lepifera, undergo range restrictions by the middle or end of this century. The suitable range of grassland is widely distributed along mountain ridges, while the three plant species are widely distributed from leeward to windward slopes. Only two species will expand their distribution ranges under climate change: F. fistulosa and L. speciosum. The expansion of F. fistulosa is reasonable because the species is widely distributed in the study area. The expansion of L. speciosum, however, is surprising. L. speciosum was once widely distributed in the lowlands of northern Taiwan, but the species has suffered from extensive collection for trade or cultivation since the last century. Its population in northern Taiwan has greatly diminished, and only a few remnant individuals are currently scattered throughout the study area. Because of human influences, L. speciosum is now absent from most environmentally suitable areas and occupies only a portion of its fundamental niche in the study area. The signal of absences caused by human factors will be present in the distribution of presence records and will be projected by the algorithms (Elith et al. 2011). In addition, the future range expansion of L. speciosum by the middle or end of this century indicates that if a rare or endangered species is threatened not by climate but by human factors, it may not go extinct under climate change.

Niche study

Recently, numerous studies have correlated species occurrence data with important environmental dimensions, and researchers have developed algorithms to estimate ecological niches and explore potential distribution ranges (Peterson et al. 2011). In the R package virtualspecies, users generate virtual species by defining suitability from a PCA, which allows ecological niches to be analyzed in both multivariate environmental space and geographical space and effectively links the views of niche and distribution (Leroy et al. 2016; Qiao et al. 2016). This study focused on the niche from the climatic perspective to explain the geographical distribution and ecological characteristics of species. Our approach can define the fundamental niche of virtual species at the landscape scale and can be used to link the climate demands and geographical distribution of species in mountainous areas.
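
Defining suitability in PCA space can be illustrated with a simple response function. The sketch below assumes a Gaussian response on each of the first two PCA axes and multiplies the two responses, so suitability peaks at the niche centre and declines with distance from it. The function name `gaussian_suitability` and its parameters are hypothetical; this is not a reproduction of the virtualspecies implementation or of the niche parameters used in the study.

```python
import math


def gaussian_suitability(pc1, pc2, means=(0.0, 0.0), sds=(1.0, 1.0)):
    """Suitability in [0, 1] from Gaussian responses on two PCA axes.

    means: niche optimum on each axis; sds: niche breadth on each axis.
    Both are illustrative assumptions, not values from the study.
    """
    r1 = math.exp(-0.5 * ((pc1 - means[0]) / sds[0]) ** 2)
    r2 = math.exp(-0.5 * ((pc2 - means[1]) / sds[1]) ** 2)
    return r1 * r2


# A cell at the niche centre and a cell one standard deviation away on axis 1.
at_centre = gaussian_suitability(0.0, 0.0)
one_sd_away = gaussian_suitability(1.0, 0.0)
```

Applying such a function to the PCA scores of every grid cell links a position in environmental space to a suitability value in geographical space, which is the connection between niche and distribution described above.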

Conclusion

This study demonstrates that a virtual species study is a powerful tool for validating the heterogeneity of high-resolution climate datasets, evaluating the characteristics of algorithms, and providing a baseline for assessing predicted potential distribution ranges and future range shifts of real species in mountainous areas. (1) Using a real landscape to study virtual species has the advantage of correlating explanatory environmental variables with geographical distributions. In this study, the high-resolution climate dataset generated from meteorological data effectively reflected climate features in the mountainous area. This dataset can be used in a PCA to define the fundamental niches of virtual species and relate them to their geographical distribution ranges. (2) Virtual species simulated in the PCA were used to evaluate the performance of algorithms, and ensemble modeling performed better than the 11 individual algorithms. Ensemble modeling effectively captures the processes linking climate demand and geographical distribution of virtual species and has predicted the potential ranges of rare or endangered species. (3) Algorithm performance was evaluated using the ESP, which calculates the degree of overlap between the suitable ranges of virtual species simulated by PCA and their potential ranges predicted by algorithms. The ESP, together with the false presence and false absence ratios, is therefore a better index than TSS and AUC. (4) In this study, rare or endangered species with scattered or biased distributions in mountainous areas yielded either wide or narrow potential geographical ranges. Under these circumstances, a virtual species study provides a baseline for assessing predicted contemporary and future distribution ranges of real species. (5) Predictions for rare or endangered species made by ensemble modeling with high-resolution climate datasets can be applied to the design of conservation areas and management strategies in mountainous areas. (6) Our study can inform the application of species distribution models to provide scientific support for conservation planning in mountainous areas and forecasts of species distributions under climate change.

Author Contribution

C.C. Liao conceived and designed the research method, collected data, conducted the analyses, and wrote the manuscript. Y. H. Chen collected some of the data and contributed significant ideas for improving the quality of the manuscript. H. Y. Lin contributed significant input to improve the analytical methods, particularly the method of downscaling high-resolution historical and future climate datasets.

Acknowledgement

The authors would like to thank Mr. Shi-Chieh Kuo and several students from the Department of Life Science at Chinese Culture University, Taipei, Taiwan for their assistance in data collection. In particular, Ms. Weng, Yu-Ting and Zhang, Qiao-Zhu gave the greatest help in the field work. The authors are grateful to Mr. Kai-Jie Yang from the Institute of Geography at Chinese Culture University, Taipei, Taiwan for providing technical support for ArcGIS software. This work was partially supported by Yangmingshan National Park, Construction and Planning Agency, Ministry of the Interior, Executive Yuan, Taipei, Taiwan. Professor Hung-Yang Tseng from the Department of Atmospheric Sciences at Chinese Culture University assisted in coordinating with the Taiwan Climate Change Projection Information and Adaptation Knowledge Platform (TCCIP) to obtain climate model data, and we thank him for his assistance. The authors also thank TCCIP for providing gridded datasets on the magnitude of temperature and precipitation changes predicted by climate models for the 21st century.

References

Ali F., Khan N., Khan A.M., Ali K. and Abbas F. 2023. Species distribution modelling of Monotheca buxifolia (Falc.) A. DC.: Present distribution and impacts of potential climate change. Heliyon 9.
Austin M. 2007. Species distribution models and ecological theory: a critical assessment and some possible new approaches. Ecological modelling 200: 1–19.
Austin M., Belbin L., Meyers J.A., Doherty M. and Luoto M. 2006. Evaluation of statistical models used for predicting plant species distributions: role of artificial data and theory. Ecological modelling 199: 197–216.
Bean W.T., Stafford R. and Brashares J.S. 2012. The effects of small sample size and sample bias on threshold selection and accuracy assessment of species distribution models. Ecography 35: 250–258.
Bombi P. and D’Amen M. 2012. Scaling down distribution maps from atlas data: a test of different approaches with virtual species. Journal of Biogeography 39: 640–651.
Chambers D., Périé C., Casajus N. and de Blois S. 2013. Challenges in modelling the abundance of 105 tree species in eastern North America using climate, edaphic, and topographic variables. Forest Ecology and Management 291: 20–29.
Chen W.K. and Tsai C.Y. 1983. The climate of Yangmingshan National Park. Yangmingshan National Park, Construction and Planning Agency Ministry of the Interior, Executive Yuan, Taipei, Taiwan.
De Marco P. and Nóbrega C.C. 2018. Evaluating collinearity effects on species distribution models: An approach based on virtual species simulation. PLoS One 13: e0202403.
Dobrowski S.Z. 2011. A climatic basis for microrefugia: the influence of terrain on climate. Global Change Biology 17: 1022–1035.
Duan R.Y., Kong X.Q., Huang M.Y., Wu G.L. and Wang Z.G. 2015. SDMvspecies: a software for creating virtual species for species distribution modelling. Ecography 38: 108–110.
Dunne J.P., Horowitz L., Adcroft A., Ginoux P., Held I., John J., Krasting J.P., Malyshev S., Naik V. and Paulot F. 2020. The GFDL Earth System Model version 4.1 (GFDL-ESM 4.1): Overall coupled model description and simulation characteristics. Journal of Advances in Modeling Earth Systems 12: e2019MS002015.
El-Gabbas A. and Dormann C.F. 2018. Wrong, but useful: regional species distribution models may not be improved by range‐wide data under biased sampling. Ecology and Evolution 8: 2196–2206.
Elith J., Phillips S.J., Hastie T., Dudík M., Chee Y.E. and Yates C.J. 2011. A statistical explanation of MaxEnt for ecologists. Diversity and Distributions 17: 43–57.
Fick S.E. and Hijmans R.J. 2017. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology 37: 4302–4315.
Fois M., Fenu G., Lombrana A.C., Cogoni D. and Bacchetta G. 2015. A practical method to speed up the discovery of unknown populations using Species Distribution Models. Journal for nature conservation 24: 42–48.
Godsoe W. 2014. Inferring the similarity of species distributions using Species’ Distribution Models. Ecography 37: 130–136.
Guillera-Arroita G., Lahoz‐Monfort J.J., Elith J., Gordon A., Kujala H., Lentini P.E., McCarthy M.A., Tingley R. and Wintle B.A. 2015. Is my species distribution model fit for purpose? Matching data and models to applications. Global Ecology and Biogeography 24: 276–292.
Guisan A., Zimmermann N.E., Elith J., Graham C.H., Phillips S. and Peterson A.T. 2007. What matters for predicting the occurrences of trees: techniques, data, or species characteristics? Ecological Monographs 77: 615–630.
Hama A.A. and Khwarahm N.R. 2023. Predictive mapping of two endemic oak tree species under climate change scenarios in a semiarid region: Range overlap and implications for conservation. Ecological Informatics 73: 101930.
HamadAmin B.A. and Khwarahm N.R. 2023. Mapping Impacts of Climate Change on the Distributions of Two Endemic Tree Species under Socioeconomic Pathway Scenarios (SSP). Sustainability 15: 5469.
Hirzel A.H., Helfer V. and Metral F. 2001. Assessing habitat-suitability models with a virtual species. Ecological modelling 145: 111–121.
Hsieh C.F., Chao W.C., Liao C.C., Yang K.C. and Hsieh T.H. 1997. Floristic composition of the evergreen broad-leaved forests of Taiwan. Nat. Hist. Res. 4: 1–16.
Inman R., Franklin J., Esque T. and Nussear K. 2021. Comparing sample bias correction methods for species distribution modeling using virtual species. Ecosphere 12: e03422.
IPCC. 2013. The Physical Science Basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA.
IPCC. 2022. Climate Change 2022: Mitigation of Climate Change. Contribution of Working Group III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. In: Shukla P. R., Skea J., Slade R., Khourdajie A. A., Diemen R. v., McCollum D., Pathak M., Some S., Vyas P., Fradera R., Belkacemi M., Hasija A., Lisboa G., Luz S. and Malley J. (eds), Cambridge University Press, Cambridge, UK and New York, NY, USA.
IUCN. 2014. IUCN red list.
Jiang R., Zou M., Qin Y., Tan G., Huang S., Quan H., Zhou J. and Liao H. 2022. Modeling of the potential geographical distribution of three Fritillaria species under climate change. Frontiers in Plant Science 12: 749838.
Journé V., Barnagaud J.Y., Bernard C., Crochet P.A. and Morin X. 2020. Correlative climatic niche models predict real and virtual species distributions equally well. Ecology 101: e02912.
Kadmon R., Farber O. and Danin A. 2003. A systematic analysis of factors affecting the performance of climatic envelope models. Ecological Applications 13: 853–867.
Karger D.N., Conrad O., Böhner J., Kawohl T., Kreft H., Soria-Auza R.W., Zimmermann N.E., Linder H.P. and Kessler M. 2017. Climatologies at high resolution for the earth’s land surface areas. Scientific Data 4: 170122.
Kataoka T., Tatebe H., Koyama H., Mochizuki T., Ogochi K., Naoe H., Imada Y., Shiogama H., Kimoto M. and Watanabe M. 2020. Seasonal to decadal predictions with MIROC6: Description and basic evaluation. Journal of Advances in Modeling Earth Systems 12: e2019MS002035.
Kebede A.S., Nicholls R.J., Allan A., Arto I., Cazcarro I., Fernandes J.A., Hill C.T., Hutton C.W., Kay S. and Lázár A.N. 2018. Applying the global RCP–SSP–SPA scenario framework at sub-national scale: A multi-scale and participatory scenario approach. Science of the Total Environment 635: 659–672.
Khan S. and Verma S. 2022. Ensemble modeling to predict the impact of future climate change on the global distribution of Olea europaea subsp. cuspidata. Frontiers in Forests and Global Change 5: 977691.
Lannuzel G., Balmot J., Dubos N., Thibault M. and Fogliani B. 2021. High-resolution topographic variables accurately predict the distribution of rare plant species for conservation area selection in a narrow-endemism hotspot in New Caledonia. Biodiversity and Conservation 30: 963–990.
Laskey H., Crook E.D. and Kimball S. 2020. Analysis of Rare Plant Occurrence Data for Monitoring Prioritization. Diversity 12: 427.
Leroy B., Meynard C.N., Bellard C. and Courchamp F. 2016. virtualspecies, an R package to generate virtual species distributions. Ecography 39: 599–607.
Li C.F., Chytrý M., Zelený D., Chen M.Y., Chen T.Y., Chiou C.R., Hsia Y.J., Liu H.Y., Yang S.Z. and Yeh C.L. 2013. Classification of Taiwan forest vegetation. Applied Vegetation Science 16: 698–719.
Liao C.C., Chang C.R., Hsu M.T. and Poo W.K. 2014. Experimental evaluation of the sustainability of dwarf bamboo (Pseudosasa usawai) sprout-harvesting practices in Yangminshan National Park, Taiwan. Environmental Management 54: 320–330.
Liao C.C. and Chen Y.H. 2021. Improving performance of species distribution model in mountainous areas with complex topography. Ecological research 36: 648–662.
Liao C.C. and Chen Y.H. 2022. The effects of true and pseudo-absence data on the performance of species distribution models at landscape scale. Taiwania 67: 9–20.
Liao C.C., Kuo S.C. and Chang C.R. 2012. Forest distribution on small isolated hills and implications on woody plant distribution under threats of global warming. Taiwania 57: 242–250.
Liao C.C., Lin H.Y. and Fan S.W. 2023. A statistical method to generate high-resolution climate datasets for modeling plant distribution range and range shifts under climate change in mountainous areas. Taiwania 68: 8–22.
Lobo J.M., Jiménez-Valverde A. and Real R. 2008. AUC: a misleading measure of the performance of predictive distribution models. Global Ecology and Biogeography 17: 145–151.
Meucci A., Young I.R., Hemer M., Trenham C. and Watterson I.G. 2023. 140 years of global ocean wind-wave climate derived from CMIP6 ACCESS-CM2 and EC-Earth3 GCMs: Global trends, regional changes, and future projections. Journal of Climate 36: 1605–1631.
Meynard C.N. and Kaplan D.M. 2013. Using virtual species to study species distributions and model performance. Journal of Biogeography 40: 1–8.
Meynard C.N., Leroy B. and Kaplan D.M. 2019. Testing methods in species distribution modelling using virtual species: what have we learnt and what are we missing? Ecography 42: 2021–2036.
Ning H., Ling L., Sun X., Kang X. and Chen H. 2021. Predicting the future redistribution of Chinese white pine Pinus armandii Franch. under climate change scenarios in China using species distribution models. Global Ecology and Conservation 25: e01420.
Pedersen J.S.T., Santos F.D., van Vuuren D., Gupta J., Coelho R.E., Aparício B.A. and Swart R. 2021. An assessment of the performance of scenarios against historical global emissions for IPCC reports. Global Environmental Change 66: 102199.
Peterson A.T., Soberón J., Pearson R.G., Anderson R.P., Martínez-Meyer E., Nakamura M. and Araújo M.B. 2011. Ecological niches and geographic distributions (MPB-49). Princeton University Press.
Pradervand J.-N., Dubuis A., Pellissier L., Guisan A. and Randin C. 2014. Very high resolution environmental predictors in species distribution models: Moving beyond topography? Progress in Physical Geography 38: 79–96.
Pu Y., Liu H., Yan R., Yang H., Xia K., Li Y., Dong L., Li L., Wang H. and Nie Y. 2020. CAS FGOALS-g3 model datasets for the CMIP6 scenario model intercomparison project (ScenarioMIP). Advances in Atmospheric Sciences 37: 1081–1092.
Qazi A.W., Saqib Z. and Zaman-ul-Haq M. 2022. Trends in species distribution modelling in context of rare and endemic plants: a systematic review. Ecological Processes 11: 1–11.
Qiao H., Feng X., Escobar L.E., Peterson A.T., Soberón J., Zhu G. and Papeş M. 2019. An evaluation of transferability of ecological niche models. Ecography 42: 521–534.
Qiao H., Peterson A.T., Campbell L.P., Soberón J., Ji L. and Escobar L.E. 2016. NicheA: creating virtual species and ecological niches in multivariate environmental scenarios. Ecography 39: 805–813.
Santini L., Benítez-López A., Maiorano L., Čengić M. and Huijbregts M.A. 2021. Assessing the reliability of species distribution projections in climate change research. Diversity and Distributions 27: 1035–1050.
Sillero N. and Barbosa A.M. 2021. Common mistakes in ecological niche models. International Journal of Geographical Information Science 35: 213–226.
Su H.J. 1984. Studies on the climate and vegetation types of the natural forests in Taiwan (II) Altitudinal vegetation zones in relation to temperature gradient. Quarterly Journal of Chinese Forestry 17: 57–73.
Thuiller W., Georges D., Engler R., Breiner F., Georges M.D. and Thuiller C.W. 2016. Package ‘biomod2’. Species distribution modeling within an ensemble forecasting framework.
Wan J.-N., Mbari N.J., Wang S.-W., Liu B., Mwangi B.N., Rasoarahona J.R., Xin H.-P., Zhou Y.-D. and Wang Q.-F. 2021. Modeling impacts of climate change on the potential distribution of six endemic baobab species in Madagascar. Plant Diversity 43: 117–124.
Wang T., Wang G., Innes J., Nitschke C. and Kang H. 2016. Climatic niche models and their consensus projections for future climates for four major forest tree species in the Asia–Pacific region. Forest Ecology and Management 360: 357–366.
Wang Y.C., Hsu H.H., Chen C.A., Tseng W.L., Hsu P.C., Lin C.W., Chen Y.L., Jiang L.C., Lee Y.C. and Liang H.C. 2021. Performance of the Taiwan earth system model in simulating climate variability compared with observations and CMIP6 model simulations. Journal of Advances in Modeling Earth Systems 13: e2020MS002353.
Xu Y., Huang Y., Zhao H., Yang M., Zhuang Y. and Ye X. 2021. Modelling the effects of climate change on the distribution of endangered Cypripedium japonicum in China. Forests 12: 429.
Zimmer S.N., Holsinger K.W. and Dawson C.A. 2023. A field-validated ensemble species distribution model of Eriogonum pelinophilum, an endangered subshrub in Colorado, USA. Ecology and Evolution 13: e10816.
Zurell D., Fritz S.A., Rönnfeldt A. and Steinbauer M.J. 2023. Predicting extinctions with species distribution models. Cambridge Prisms: Extinction 1: e8.

No competing interests reported.
