Aging is associated with progressive loss of motor function. These deficits are heterogeneous and may manifest as reduced walking speed, poor balance, increased, gait variability, increased fear of falling, and shorter stride length1–3. Objective measures of gait obtained during brief supervised gait testing in a lab or clinic predict survival, varied adverse health outcomes, and loss of independent living4–6. However, these brief assessments provide only a limited snapshot of an individual's gait abilities and may not reflect function and variability during the manifold demands of daily living7,8. Advances in unobtrusive sensor technology afford investigators the opportunity to obtain a more comprehensive assessment of mobility via remote multi-day recordings of daily living. However, the automated analytic tools employed for many commercially available devices focus nearly exclusively on healthy younger adults and do not account for the impairments observed in older adults during device development or validation9,10. Hence, there is an urgent need for the development and validation of automated tools to quantify daily living gait among the full health spectrum of older adults that reside in the community-setting11,12.
Previous studies investigating real-world gait employed accelerometers worn on the lower back, leveraging the inherent quasi-periodicity of lumbar movement during walking13. While these studies have demonstrated the potential of assessing daily living gait, sensor placement on the lower back may present limitations for long-term adherence due to potential discomfort14. A different approach is to ask participants to wear a wrist-worn accelerometer. Wrist-worn accelerometers have gained widespread use to measure daily living physical activity 15–18. In this regard, the ubiquity and popularity of smartwatches make wrist-worn accelerometers a practical choice for ensuring high compliance in daily living studies. Wrist-worn accelerometers enable the extraction of a wide range of daily living behaviors, including sleep patterns19, circadian metrics20, and levels of physical activity21. While estimated physical activity levels can provide many insights22,23, to date, most studies using a wrist-worn accelerometer lacked detailed and high-resolution information about other crucial facets of gait quality21. Therefore, recent efforts have focused on leveraging these accelerometers to assess walking and gait quality.
The first step in deriving gait metrics from an accelerometer is the detection of gait sequences from the raw accelerometer signals24,25. Gait detection from a wrist-worn accelerometer is more challenging compared to other locations, such as lower limbs or lower back, due to the non-gait related hand movement and the fact that wrist movements often deviate from the expected periodic swinging during the gait cycle. This may occur, for instance, when an individual walks while simultaneously engaging in other activities, such as texting. This challenge is exacerbated for older adults and people with gait disturbances, such as Parkinson’s disease who manifest reduced arm swing during walking26. People with Parkinson’s disease also exhibit symptoms of tremor and dyskinesia, which impact wrist movements and contribute to an overall less stable and consistent gait pattern, factors complicating gait detection algorithms24.
Classical gait detection algorithms employ signal processing techniques, such as peak detection and wavelet analysis, to extract features both from the time and frequency domain25,27. These features are then used to identify gait sequences based on the repeated periodic waveforms manifested during gait. However, the complex wrist movements render the differentiation between gait and non-gait movements very challenging. Alternative approaches are needed to detect gait from wrist-worn accelerometers.
Previous studies addressed this goal by employing supervised machine learning algorithms that were trained to identify patterns in the signal associated with gait17,28,29. Kluge et al.25 conducted a comprehensive analysis of gait detection algorithms using accelerometer data from lower-back and wrist-worn accelerometers. The algorithms were trained on data from healthy young adults and subsequently tested on diverse subsets of adults from the Mobilise-D technical validation study30, including older adults with and without varied diagnoses. They found, not surprisingly, that algorithms based on lower-back data outperformed wrist-based algorithms. Yet, the reduced performance of wrist-based algorithms may be attributed, in part, to being trained on data from healthy young adults, potentially leading to suboptimal performance among older adults. This highlights the need to optimize wrist-based algorithms for older adults, who more commonly show heterogeneous gait abnormalities that do not occur as frequently in younger adults.
The best performing wrist-based algorithm identified in the study by Kluge et al. was initially developed and validated in Brand et al.24. In this study, we employed a supervised convolutional neural network with U-Net architecture31 for gait detection, focusing on older adults and people with Parkinson’s disease (PD). The results were then compared to those of a control group comprising healthy young adults. Our findings indicated that biological meaningful measures of gait quality (e.g., cadence and gait regularity) and quantity (e.g. daily walking duration) could be derived from a wrist-worn accelerometer. However, it is crucial to note that the model's performance was reduced when applied to older adults and individuals with PD, compared to the healthy young adult control group. An important impediment for training a supervised model that can be applied to older adults and varied clinical conditions derives from the scarcity of ground-truth labels indicating the temporal location of the gait sequences, especially for recordings of unsupervised movement during daily living.
Recently, there has been a growing interest in leveraging self-supervised learning (SSL) methods to overcome the gap imposed by the shortage of labeled data32. SSL generally comprises two main stages. First, learning feature representations of varied signals using a substantial amount of unlabeled data, which can be achieved through methods such as multi-task learning (MTL)33 and contrastive learning32,34. An example of contrastive learning is the SimCLR method: “A Simple Framework for Contrastive Learning of Visual Representations”32. In these approaches, the model's objective is to predict characteristics of the signal that do not require any labels. This stage is commonly referred to as the 'pretext' stage. The second stage involves fine-tuning the SSL model with a smaller set of labeled data in a supervised manner for a downstream task (e.g., gait detection).
The SSL approach has demonstrated significant potential in varied human activity recognition tasks35–37. For example, Yuan et al.38 utilized the UK Biobank dataset, which comprised daily living recordings from a wrist-worn accelerometer, to develop an SSL model for activity recognition and exhibited improved performance in several tasks and datasets. Small et al.39 fine-tuned this SSL model for gait detection on a semi-living dataset, termed OxWalk, which included approximately one hour of recording in a home environment. However, the dataset used for fine-tuning included only healthy adults (N = 39, mean age = 38.5 years). Thus, their model may not be optimized for older adults or individuals with gait disturbances.
Here, we developed and evaluated a gait detection deep learning approach, termed ElderNet, that was oriented and optimized for older adults and, in particular, those who might have impaired gait. The first stage involved the training of an SSL model, utilizing the pre-trained UK Biobank model of Yuan et al.38. This SSL model was extensively modified in both architecture and training cohorts to include a large unlabeled dataset of more than 1000 older adults with and without impaired gait who wore a wrist-worn device for up to 10 days (mean age 83 years old) and were participating in the RUSH Memory and Aging Project (MAP)40–42. Next, we fine-tuned the model on a labeled dataset consisting of 83 older adults (mean age = 71.9 years) from the Mobilise-D technical validation study30, each wearing a wrist-worn accelerometer for approximately 2.5 hours. The Mobilise-D dataset is one of the largest available labeled datasets that include daily living recordings from a wrist-worn accelerometer in older adults. It contains a ground-truth reference for indicating the presence or absence of gait sequences. Additionally, the dataset contains participants with different health conditions, presenting a range of gait patterns, including individuals with Parkinson's disease, proximal femoral fracture, chronic obstructive pulmonary disease, congestive heart failure, and healthy adults. To explore the added value of the putative enhancements of the ElderNet, we compared it to two state-of-the-art algorithms: the U-Net architecture, which achieved the highest results in the study by Kluge et al.25, and the model developed by Small et al., termed OxWalk, utilizing the strong UK Biobank SSL model.
Finally, we applied ElderNet to a set of new participants – not previously trained by the model - to begin to explore its construct validity and generalizability. Construct validity refers to the degree to which a measurement tool, like ElderNet, accurately evaluates its intended purpose, specifically gait detection. In this context, we examined walking duration obtained through ElderNet across cohorts whose clinical status is likely to lead to reduced daily living walking.