Motivated by a wealth of unconventional location data from reintroduced Channel Island foxes, we developed a novel approach to integrating spatial data types of varying quality to estimate animal movement through time. We showed that incorporating imprecise polygon data, derived from field notes and expert interpretation, can spatially enhance and temporally extend animal location datasets and improve estimation of movement trajectories. In our island fox case study, the inclusion of polygon data more than doubled the number of location points available for the 61 study individuals and reduced the variation in the fits of the trajectories, especially for individuals that had few precise location estimates. This method has broad potential applications to long-term monitoring datasets that were not collected with the explicit aim of studying movement.
With our newly developed algorithm, we reconstructed population-level movement summaries to better understand the initial movement of foxes after reintroduction. By stratifying by birth status (captive vs. wild), we examined the effects of reintroduction on broad behavioral movement patterns in the fox population. Captive-born foxes, which by definition had never roamed the island, exhibited a strong seasonal signal, moving longer distances in fall and winter and moving shorter distances in spring and summer. One direct explanation for this pattern is that most captive-born foxes were released in the fall (Sept-Nov), and it is natural to see long-range movements by individuals recently introduced to a novel landscape. Captive-born foxes may have explored the island more widely after release from captivity before establishing a home range, since there was little competition for space. In contrast, wild-born foxes had more stable movement behavior and showed a much more subtle seasonal signal. When stratified by time on the island, we found that wild-born individuals move longer distances in their first year and then move much less in subsequent years. There is a seasonal signal in the first twelve months of wild-born movement, similar to the captive-born movement pattern, but more subtle. The wild-born seasonal pattern aligns broadly with known island fox ecology: fox births occur in dens in the mid-to-late spring, leading to a period of limited long-range movement while pups are reared; juvenile dispersal occurs in the fall, consistent with the longer-range movement pattern we see at that time of year (20, 22, 30).
Our analyses demonstrate the interacting roles of spatial and temporal resolution in enabling movement reconstruction and highlight that the quality of the reconstruction depends primarily on temporal frequency and regularity of location data. Even when an individual’s time series consists primarily of relatively imprecise polygon data (i.e., Fox F3D2F; Fig. 3), the algorithm can yield a similar median trajectory if the data are abundant, with no large temporal gaps. In our case study, polygon data filled large gaps in the time series of GPS data, delivering major improvements to the reconstruction. Data quantity and temporal regularity are related, but data quantity does not imply temporal regularity as is seen with Fox 73D0D (Figure S6). When incorporating polygon data increases the amount of data available but concentrates the data in time, leaving long gaps, the smoothing spline algorithm can give problematic results, as seen by the fox movement trajectory projected into the ocean (Figure S6). In sum, the quality of the movement reconstruction is determined by both spatial and temporal resolution of the data, but good temporal coverage is essential.
We have demonstrated the considerable benefit of including polygon data in trajectory estimation, but initial translation of the recorded location descriptions into digital polygons was onerous and time-consuming. In future studies, polygon creation could be streamlined by having pre-defined areas on the island that the field notes map onto, rather than translating each phrase into its own precise polygon. These pre-defined areas could be delineated by topographical features much as our polygons were. Some degree of spatial resolution could be lost by this method, but the tradeoff of greater data availability could compensate for this loss. There is also scope to apply artificial intelligence methods, including natural language processing, whether trained by an expert or untrained and working from landscape features, to identify polygons. Further, although the inclusion of polygons added crucial information to our analysis, our uniform sampling of the polygon areas did not always match our beliefs and our expert knowledge of the fox’s position. If the location of a fox was concentrated at one end of a long canyon, this positioning would not be captured by our current algorithm formulation due to uniform sampling of the area. Filtering the polygon areas and length-to-width ratios was a first step to managing uninformative polygons, but weighting the area of the polygon based on landscape characteristics or nearby GPS location data has the potential to further improve positional estimates.
There is great opportunity for data imputation methods to be built into modern spatial statistical techniques. Spatial data resampling methods could lend great strength and flexibility to statistical analysis by accommodating nearly any data type and facilitating the link of polygon locations to all spatial methods. A useful extension of this study would be to integrate the resampling of the polygons (to account for their shape/error) into the Bayesian framework developed by Buderman et al. (16). Their analysis assumed that the spatial dataset is fixed and described the error distributions of their observations parametrically. To integrate the polygon data into their framework, the spatial data could be resampled during MCMC estimation of the spline. Our resampling framework could be incorporated more broadly into other spatial and movement analyses and provides an avenue to integrate polygon-type location data into other spatial methods, as long as the data have sufficient temporal frequency.
Our algorithm utilized smoothing splines to interpolate location data through space and time. Smoothing splines offer a flexible approach to data interpolation via tuning a single smoothing parameter either manually or through a generalized cross-validated procedure. For our analysis, we fixed the smoothing parameter to ensure that the curvature was consistent across individuals, rather than fitting it by various procedures (e.g., generalized cross-validation, maximum likelihood, Akaike information criterion). We selected a smoothing parameter value of 0.1 (on a scale of 0 to 1), balancing several considerations. If we had chosen a larger smoothing parameter, the trajectories would have become less curved and more like connected lines between the observation points. This would lead to a lower degree of averaging across the locations, which could capture the precision of the high-resolution locations more confidently but would also put more confidence in the lower-quality polygon locations than desired, given that our uniform resampling algorithm can generate apparent locations anywhere within a polygon. Our choice of a low smoothing parameter enables the method to smooth out these errant points, but at the cost of occasionally failing to capture real, short-term, long-distance movements, plus the risk of overshooting the data when there are temporal gaps. Depending on the nature of the system and dataset and the goals of the analysis (e.g., home range estimation or construction of a contact network), more or less curvature may be optimal, but further study is required to uncover how the temporal and spatial resolution of the dataset influences the optimal choice of smoothing parameter.
The choice of spline can also impact the results and interpretation of the trajectories. We opted to use a smoothing spline to fit the coordinates since it could robustly accommodate the patterns in our data, but a more complex spline could add more flexibility to the fitting and may be appropriate for other systems where individuals make frequent, meaningful movements that are important to capture. An alternative B-spline framework would require the data to be more regular through time and is much more sensitive to temporal gaps in the data. Thus, the algorithm would fail for individuals with insufficient temporal regularity or data quantity. The smoothing spline we chose converges for all individuals but can give biologically dubious results when data are insufficient. Additional study is needed on the influence of data quantity and resolution on the choice of spline and associated parameters to improve the robustness of our approach to non-ideal datasets.
In our study of a terrestrial species on an oceanic island, the island boundary introduces a further challenge to movement estimation. The island coastline is a hard ecosystem boundary for the foxes, but such boundaries have only been addressed in a few estimation techniques (e.g., LoCoH and special implementations of kernel density estimation) (5, 8). Most other studies address this challenge by clipping the estimated home ranges or trajectories at the boundary, which could bias the estimates because it does not redistribute the density appropriately. In our method, the polygon resampling approach restricts point observations to the allowed region (i.e., the island’s terrestrial surface). However, the spline can fit curves that extend beyond the island boundary (as in Figure S6C). A more restrictive spline fitting or explicit approach to reject any splines which project movement into the excluded area could address this shortcoming.
This novel estimation of animal movement trajectories has wide-ranging applicability. It is essential to recognize that, under our current parameterization, the reconstructed trajectories are intended to capture the central tendency of an animal’s location in a given time period and will not reconstruct all short-term excursions from the broader path. Nevertheless, our approach generates systematic trajectories that align with expert knowledge, even if some quantitative details are imperfect – and our analysis of 61 foxes shows the potential to recover valuable population-scale insights into movement patterns. Such information, at individual and population scales, has important application to support studies of behavioral ecology, conservation, and disease spread (29–36).
Crucially, our approach enables such insights to be derived from existing and currently unused location data for a wide range of wildlife systems. We have detailed several clear steps that can be taken to further fine-tune our algorithm and improve the precision and accuracy of the estimated trajectories. We hope this work can be a useful first step towards analyzing these valuable data to gain important insights into animal movement ecology, as well as the spatiotemporal aspects of related fields.