Injuries at the workplace constitute a common and substantial public health problem with major economic consequences worldwide [24]. As time off work due to disabling injuries increases, injury-related costs such as indemnity payments, medical and legal expenses, and employee substitution costs rise. In the case journey of a work injury case, a disability prognosis is essential as this will impact the expected medical intervention from medical professionals, the resource allocation from an operational perspective, or the compensation reserved by insurance companies [25, 26]. The existing prognosis forecasting in industry relies on human decisions based on different companies’ internal guidelines and the experience of doctors, rehabilitation professionals, or case managers. However, there are limitations of case management quality in Hong Kong, such as the high turnover rate of staff, the lack of experience, the lack of formal training, and the lack of a regulation or standard regarding the qualification requirement for case managers. This creates inconsistency, with potential over- or under-estimation of the risk, which will affect the relevant organizations from the long-term portfolio management perspective. Importantly, inaccurate estimation of injury recovery will have adverse impact on the case management and the treatment plan developed for injured workers [27]. Otherwise, this will affect the expectation on recovery outcome and can lead to misallocation of medical resources and postponement of treatment. Clinical decision-making tools using artificial intelligence to build a disability forecasting system can solve this problem by increasing consistency and controlling the risk of mis-assessment on medical prognosis effectively [28, 29]. In the journey of a work injury case, the injured worker will always be influenced by many psycho-social factors: for example, the injured employee may adopt a sick role. The Parsonian Model has outlined the role of a sick person be exempted from responsibility for the incapacity and from normal social role obligations. This incapacity as illness provides a legitimate basis for the sick individual’s exemption from normal role and task obligations [30]. This causes the worker to be on sick leave for longer. Eggert found that frustration, depression, discrimination, and obstacles in understanding how the workers’ compensation system works affect workers’ ability to return to work [31]. Booth-Kewley’s 2013 study found that recovery expectations was a key predictor of injury recovery and suggested early interventions to modify workers’ expectations [32]. In 1995, Rohling conducted a meta-analytic review of financial compensation (sometimes called compensation neurosis) and the treatment of chronic pain and found that compensation is related to increased reports of pain and decreased treatment efficacy [33]. Compensation neurosis is a controversial concept that the psychological and physical symptoms that may arise in injured workers who are involved in litigation for the personal injury, and that may be influenced by their expectations of compensation [34, 35].
Sick Leave Days
The results of this study showed that for non-litigated cases, SWIM simulated predictions of sick leave days in line with the estimations of CM-A, who had approximately four years of case management experience. However, this was not the case in regard to litigated cases. As a case progresses, numerous psycho-social factors affect its progress, and experienced case managers consider these potential factors when making estimations based on injury severity and demographic information [36, 37]. The reserving guidelines of their companies also influence their estimations. A further analysis between non-litigated and litigated cases, which this case legal or non-legal status was unknown to the case managers during the experiment, showed that the mean sick leave days estimated by CM-A were 109 days for non-litigated cases and 97 days for litigated cases, compared to SWIM’s predictions of 110 and 120 days, respectively. There was a noticeable difference between the estimations of CM-B and CM-C: CM-B estimated an average of 190 days for non-litigated cases and 225 days for litigated cases, while CM-C estimated an average of 169 days for non-litigated cases and 213 days for litigated cases. The results suggest that CM-A may not have considered any psycho-social factors when making estimations as the mean was similar between non-legal and legal cases in her estimations. On the other hand, the results showed that CM-B’s and CM-C’s estimations were significantly similar regardless of case status. This could be because case managers B and C were working in the same company and using the same reserving guidelines, and thus their case management approaches should be similar.
The results suggested that compared to CM-A, SWIM predicted a higher number of sick leave days for litigated cases but still could not simulate CM-B and CM-C, who had more experience. The results of the Krustal-Wallis test were consistent with the ICC test on inter-rater reliability, showing that SWIM and CM-A were moderately reliable (ICC = 0.637, p < .001) for non-litigated cases, while CM-B and CM-C were strongly reliable across all cases (ICC = 0.837, p < .001 non-litigated; ICC = 0.845, p < .001 litigated).
Interestingly, it was found that the estimation of CM-B (mean = 190) was significantly similar to the actual data (mean = 271) for non-litigated cases. This finding revealed that the mean of CM-B’s sick leave estimation was higher than that of CM-C for both litigated and non-litigated cases, even though their results were significantly similar. This suggests inconsistency among case managers, even when they adopt similar estimation approaches, and that their differing levels of experience can still yield varied results. Therefore, a clinical decision-making tool can address this inconsistency, which encourages further research into developing an accurate and validated system for work injury rehabilitation prediction similar to that of an experienced case manager.
Permanent Disability Percentage
It was unfortunate to find that SWIM’s predictions on PD% were statistically significantly different (p = < 0.001) across the actual data and from any of the case managers’ predictions under the Krustal-Wallis post hoc test. The ICC result was consistent with the findings of the Krustal-Wallis test, showing that the average measures were close to zero. The ICC also illustrated that CM-A was moderately reliable with CM-B (ICC = 0.682 non-legal; ICC = 0.559 legal, p = < 0.001) and with CM-C (ICC = 0.663 non-legal; ICC = 0.602 legal, p = < 0.001). CM-B and CM-C had good reliability for all cases (ICC = 0.715 non-legal; ICC = 0.795 legal, p = < 0.001). Therefore, the results suggested that further development or validation is required for the PD% predictions of SWIM.
Upon further analysis of the estimation pattern by nature of injury, the deviation of SWIM estimations from one injury type to another did not vary as much as those of the case managers, particularly CM-B and CM-C (Fig. 2). The boxplot in Fig. 2 is based on the Actual SL days. The most prevalent injury within the dataset of this study was fractures, accounting for 147 out of the 422 cases. The findings revealed a clear pattern indicating that the predictions made by SWIM and CM-A were similar but not accurate compared to CM-B and CM-C when compared to the actual outcomes of the cases. As the cases progressed, various psycho-social factors emerged that could impact the progress of the cases and potentially lead to an extension of sick leave. These factors included cases becoming legal matters, the injured employees maintaining a belief that they were unfit for work, or developing a compensation-oriented mindset rather than a recovery-oriented one. It is important to note that the case managers in this study conducted the initial estimation of sick leave without considering any psycho-social factors. Hence, it is expected that the actual data deviates from the predictions made by SWIM and other human case managers. The accurate estimation of prognosis is highly dependent on experienced judgment and the identification of high-risk factors that may deviate the course of individual cases while more case information is obtained from the injured employee or by the case development facts. This limitation in predicting the actual sick leave duration suggests the need for further research into the impact of psycho-social factors on rehabilitation prognosis.
The other observation in Fig. 2 is CM-B and CM-C estimated a higher average number of sick leave days for non-musculoskeletal injuries such as burns, concussions, or electric shocks, and severe injuries such as amputations, dislocations, and multiple injuries. The SWIM result was only slightly higher than the CM-A estimation, which aligns with this study’s finding that the estimations of SWIM and CM-A were statistically significantly similar. The pattern displayed in Fig. 2 also suggests that SWIM may have limitations in reflecting the actual risk for non-musculoskeletal and severe injuries. This is likely because insurance companies typically appoint rehabilitation service providers for musculoskeletal injuries rather than for other injuries, and therefore, the number of cases of the validation dataset had limited number on the other type of injuries. Alternatively, the dataset of SWIM may lack sufficient data to generate an effective algorithm for non-musculoskeletal cases.
Figure 3 shows that the average PD% predicted by SWIM did not vary much across different injury types, which is consistent with the Krustal-Wallis and ICC results showing that it was significantly different from other groups. The boxplot in Fig. 3 is based on the Actual PD%. Different cases should vary in PD% on the basis of demographic information and injury severity; however, SWIM’s predictions did not demonstrate this.
In conclusion, SWIM can simulate the prediction capabilities of a case manager with approximately four years of case management experience regarding sick leave days for non-litigated cases. However, it requires further development to simulate the prediction capabilities of more experienced case managers, particularly in relation to sick leave certification or PD% for all types of cases. It is crucial to consider psycho-social factors as they may be one of the factors that more experienced case managers consider when estimating injury prognosis. Case management is a complex process that involves numerous human factors and varying reactions from injured workers. The rehabilitation journey for each work injury case is unique; thus, continued efforts to simulate human estimation in this area are strongly recommended.
Limitations
The development of SWIM provides a valuable tool for clinical decision-making in Hong Kong, particularly for predicting work injury rehabilitation and sick leave days in non-litigated cases. However, there is room for SWIM to improve its accuracy in replicating the expertise of more experienced case managers, thereby addressing the resource shortage in the market. Despite the absence of psycho-social factors in the data provided to case managers, these factors are often considered by case managers due to their historical case handling experience. Given the jurisdiction of Hong Kong, it is common for injured workers to file litigation claims, which typically have a significant impact on return to work and compensation. Therefore, it is recommended that clinical decision-making tools in Hong Kong incorporate psycho-social factors, including litigation risk, when devising treatment plans. Medical treatment outcomes are notoriously poor in patients with pending litigation following disability claims, especially those covered by workers’ compensation programmes [38]. Consequently, this study suggests further research on work injury rehabilitation predictions, specifically examining or considering psycho-social factors to address high-risk factors.