Data collection
Data were collected on the Sunshine Coast region in Queensland, Australia (-26.65o S, 153.07o E), from February – April 2019. All methods were carried out in accordance with relevant guidelines and regulations. All experimental protocols and methods were approved and carried out in compliance with the ARRIVE guidelines under the approval of the University of the Sunshine Coast (USC) Animal Ethics permit (ANA/16/109T); Human Ethics permit (A181114) and in conjunction with the Sunshine Coast Council (SCC) Local Law permit (OM18/19).
Animals used in the trials
We recruited 10 domestic cats through an approved media release (males n = 8; females n = 4; weight 2.8–8.4 kg; Age 1.5–12 years; body length 38–53 cm; foreleg length 16–19 cm). As per the Sunshine Coast Council local law requirements, all cats had to be neutered, registered and microchipped to participate in the study.
Equipment
We fitted each cat with a retail harness, to which we attached a tri-axial accelerometer (AX3; Axivity, Newcastle University, UK; 23 x 32.5 x 8.9 mm; 11 g) using cable ties (Fig. 1a). The accelerometer was initialised using the Open Movement Graphical User Interaction application (OMGUI; V1.0.0.37). Because a trade-off exists between data resolution and battery life, we logged data at 50 Hz and with a dynamic range of ± 8 g, with a 13-bit resolution, similar to a previous study (Godfrey et al. 2015). When combined with the in-built memory storage capacity of 512 MB, and battery limitations, this configuration resulted in a maximum of 8 − 14 days of data collection. The quartz Real Time Clock (RTC) and calendar provided a timestamp with a frequency of 32.768 kHz and a precision of ± 50 ppm, with manufacturer specifications indicating a drift of 0.18 seconds per hour. To overcome this drift over the eight days, we calibrated devices by video recording the signals of five claps/taps on the device, at the start and end of each individual data collection period, and also at random times during the day.
We positioned the accelerometer on the scapular brace-strap of the harness, inverted such that the accelerometer was on the sternum of the cat (Fig. 1a–c). Field trials over four months on four cats in the study determined that this position, in comparison with mounting on the dorsal cranial median plane, did not interfere with the animals’ balance; it also removed all of the abnormal movement behaviours and unnecessary discomfort to the cat (Ryan et al., 2013). The positioning of the logging device on the frontal anterior, median plane, resulted in the primary axis for fore-aft (surge), lateral (sway) and dorso-ventral (heave) movement to be reflected in the X, Y and Z signals, respectively (Fig. 1c).
The accelerometer harness was used in conjunction with the CatBib™ for the relevant treatment periods. The total combined weight of the harness, accelerometer and Catbib™ came to 34.1 g, all cats weighed 2.8 kg or more, weight of equipment did not weigh above 1.2% of total body weight of any cats studied. The CatBib™ is a prey protector device, manufactured from a lightweight, washable neoprene material, that is attached to a cat’s safety collar (Fig. 1b). The dimensions of the bib are 17.5 mm x 17.5 mm x 6.5 mm, with a total mass of 23.1 g and it is purple in colour. All cats adjusted to the harness and CatBib™ within the first hour of deployment and no subsequent adjustments were required. All cats had unrestricted access to roam freely outside during the eight days of field trials.
To capture training data, each cat was filmed with a GoPro + 3 Hero device (H.264–1920 x 1080; f/2.8; 60 fps), undertaking natural or stimulated active behaviours through play (Fig. 1b). These activities or behaviours were manually documented to track the activity, date and the timestamps. We conducted two treatments over the eight days: in the first, cats were fitted with CatBib™, whereas in the other, bibs were not worn. Each treatment was conducted for four consecutive days, and the sequence of treatments for each cat was randomised. The accelerometer device on the harness was left on the cats for the entire field trial and recorded continuously for the eight days (~ 192 hrs per cat; total = 2304 hrs).
Data analysis
Each accelerometer trace file was exported as a raw binary file through OMIGUI and imported into a custom-built MATLAB GUI. To build our training dataset, the video file timestamp information, determined using Mediainfo (version 18.08, 2018; Martinez et al. 2002), was used to define the start time for a subset of the accelerometer trace, and the video length to define the end point (Supp. Figure 1). Offsets between the accelerometer trace and video files were determined using the closest calibrated tap signal trace for each day. We were able to watch each video file in synchrony with the accelerometer trace, and manually annotate each movement/activity from the video files to the accelerometer subset (Clemente et al., 2016) (Supp. 1.1. Matlab interface instructions) (Supp. Figure 1).
We grouped activities according to behaviour into three classes: Sedentary, Eating and Locomotive and Hunting. We further subdivided each group into behaviours. Sedentary included lying, sitting, grooming and watching; Eating and Locomotive included – eating/drinking, walking, trotting; and for Hunting – galloping, jumping, pouncing, swatting, biting/holding (Supp. Table 1).
The accelerometer trace was then further divided into rolling epochs of 50 samples in length, using 1 second duration at 50 Hz to ensure intensive acceleratory bursts of short duration such as jumping and pouncing are captured. The activity/movement with the maximum duration within each epoch was assigned as that epoch’s label. Raw accelerometer data in each epoch was assigned as that epoch’s label. Raw accelerometer data in each epoch was summarized using 26 of the most effective variables for model accuracy identified by Tatler et al. (2018). We included: axial acceleration (X, Y, Z); mean acceleration (X, Y, Z); minimum acceleration.(X, Y, Z); maximum acceleration (X, Y, Z); standard deviation of acceleration (X, Y, Z); Signal Magnitude Area, minimum Overall Dynamic Body Acceleration (ODBA); maximum ODBA, minimum Vectorial Dynamic Body Acceleration VDBA; maximum VDBA, sum ODBA; sum VDBA; correlation (XY, YZ, XZ); skewness (X, Y, Z); and kurtosis(X, Y, Z) (Tatler et al. 2018) (See Supp. Table. 3 for a detailed description of each variable). Finally, we coded the two treatments: BibON and BibOFF and exported this information as the training data set.
Classification modelling
To determine whether we could predict cat hunting behaviours, we analysed the training data sets using a Kohonen super Self Organising Map (SOM) in the R package ‘Kohonen’ version 2.0.19 (Jahan et al., 2013; Wehrens & Kruisselbrink 2018).
Machine learning models such as Random Forest and Support Vector Machines each provide computationally powerful methods of data classification, however each method is not equal in how it visualises its output. SOMS have been used in behavioural studies (Chon et al., 2004, Park et al., 2005, Ji et al., 2007; Chon 2011) for their ability to efficiently create easily interpreted maps and identify patterns of behaviour. In this study, a Self-Organising Map algorithm was chosen for its efficiency in visualising multi-dimensional and complex data onto an easily interpreted two dimensional map output. SOMs also have the ability to visualise which variables are most influential with the use of component planes (Fig. 3b-e)) and unlike other models mentioned, SOMs use cluster analysis which in this study aids in identifying similar behaviours and visualising them closer together (in clusters) on the map output.
To prepare data for the SOM function a random sample of the classifiers for the trained data were extracted, along with their associated behaviour, and combined into a list with 2 elements (measurements and activity). This list was then input into the function supersom.R function, with the grid argument defined using the somgrid.R function [e.g. supersom(TrainingData, grid = somgrid(7, 7, "hexagonal"))]. The 7 x 7 grid function was chosen based on 12 behaviours, with 4 elements for each behaviour (12 x 4 = 48). For map symmetry we rounded the map output to 49 hexagonal spaces to spread the output data in a 2-dimensional space (Jahan et al. 2013; Schulz & Dominik, 2013; Stefanovic & Kurasova (2011b). The output of the supersom function was then used as the input into a predict.R function, with the newdata argument directed to a testing data set, which was a similar 2 element list containing all samples not included in the training data set [e.g. predict(ssomOutput, newdata = testData)].
We then built a confusion matrix from the output of the predict function, using the table.R function with predictions compared with the testData set [e.g. table(predictions = ssom.pred$predictions$activity, activity = testData$activity) ]. The confusion matrix was then finally used to compute four specific accuracy metrics – sensitivity (or recall), precision, specificity, as well as overall accuracy.
To identify relationships between the size of training dataset, we trained a randomised subset of the BibOFF training data, to predict the remaining BibOFF data from all cats. We tested 35 different subset sample sizes from 100–100,000, replicating each sample size ten times (with replacement) to determine variation at each sample size.
We then tested the extent to which accelerometer traces are modified by the presence of the CatBib™. This modification was indicated by a change in overall prediction accuracy of the SOM between BibOFF and BibON treatments. To do this, we trained the SOM using a subset of the trained data for BibOFF and tested it against annotated classified BibON samples. In order to statistically compare results from bootstrap resampling, we took the median among bootstrap samples as the estimate of performance and quantified uncertainty using the corresponding 2.5th and 97.5th percentiles to represent credible 95% confidence intervals (CIs). If CIs for any pair of estimates (medians) do not overlap, then this is evidence of a significant difference between the estimates. If, however, one estimated median fell within the confidence interval for another estimate, then this was used as evidence of a lack of significant difference. For all other outcomes, differences are equivocal, and we interpreted them tentatively on the basis of the relative overlap in CIs.