This paper developed the Persian WHODAS 2.0 by translation, then described psychometric properties of the Persian version of phone-based interviewer-administered tool for road traffic injury survivors in Tabriz, Iran, six months after crash, between 21 May 2020 and 20 March 2021. This questionnaire was translated and validated in different languages even in Persian [35, 36] but neither specifically through phone survey nor in RTIs patients. So, in this study, we translated the tool into Persian, and indicated that Persian WHODAS 2.0 is psychometrically reliable to the great extent in phone-based assessment of disability. Satisfactory internal consistency reveals the Persian WHODAS to measure the construct of disability and deliver high reliable score at item level and total score in our representative sample. The schedule is robust to changes in the same subjects over time and less prone to measurement error caused by time elapse, as ICC showed excellent reliability. Exploratory factor analysis discloses the high validity of the schedule as items hang consistently together to create the single construct of disability. The final model represented four factors of social/self-activities, cognition, mobility, and self-care, accounted for 86% of the total variance.
According to the exploratory factor analysis, the Persian WHODAS 2.0 measured disability with four components accounted for 86% of total variance and demonstrated better results compared with some studies [35, 37-39]. The factor loadings of the individual items were strong, as eleven items’ factor loadings were above 0.60 and one item showed 0.39, consistent with factor loadings in a study [37]. One factor, titled social/self-activities, contains items pertained to the subscales of Getting along, Life activities, and Participation, similar to a study in terms of the items of the extracted factor, not the number of extracted factors [35]. Three separate factors certainly distinguished cognition, mobility, and self-care subscales. The structure is quite similar to the structure found in the original English version [21]. A study indicated different groupings, attained a two-factor solution in older people from different dementia sites, with subscales of mobility, life activities, cognition, and participation in its first dominant factor and the self-care and getting along subscales in its second factor [40]. It seems higher number of sample size would help to extract the remaining three subscales separately as the same three distinct factors as in the original version. However, it is better to consider the “most appropriate” rather than the “correct” number of factors.
No measurement difference was indicated between the test and retest results. In spite of importance of using the spectrum of non-extreme in face-to-face interviews, it is not practical in phone-based surveys. However, we attained high consistency between total and subscales’ scores in phone-based surveys.
The Cronbach’s α coefficient for the Persian version indicated the ideal internal consistency and better results compared with a study among Syrian refugees (α=0.74)[37]. The Cronbach’s α coefficient was similar to several studies with comparable target groups (Ethiopian Amharic version α=0.88, in trauma patients α=0.91[35]) and different target groups (a global study [25], Brazilian study in patients with chikungunya (α=0.93) [41], in only phone-based psychometric study of WHODAS among critically ill patients (α=0.91) [26], Spanish version in depressed patients (α=0.89)[38], and Portuguese version in patients with musculoskeletal pain (α=0.84) [42]). The highest Cronbach’s alpha was related to Cognition in consistent with a study in Ethiopia [35].
The study confirmed that the scale was consistent over time representing excellent test-retest reliability as ICC fell in a range from 0.89 (Getting along) to 0.99 (Mobility) which is in line with a study in people with and without hand injuries (ICC value of 0.88) [43]. A study in patients with Kashin-Beck disease affirmed a good repeatability of the 12-item WHODAS 2.0 [44]. We reached higher time consistency at total and subscale levels compared with a study in Polish society (ICC value of 0.91, ranged from 0.50 (Self-care) to 0.97 (Mobility)) [16]. Both studies showed the highest ICC for the subscale of Mobility with the items of “Standing for long periods such as 30 minutes?” and “Walking a long distance such as a kilometre [or equivalent]?” It could be explained by the fact that these activities are of physical activities rarely affected by any actions through the time between test and retest. Generally, the results might differ regarding the mode of administration (in-person interviews versus our phone-based interview) and characteristics of the study population such as age, health status, country of origin) [45]. Moreover, the quality of translation definitely influence the understanding of questions by interviewees.
Data showed floor effects but no ceiling effects, except for one item (item seven). It was almost in line with baseline result of a study in critically ill patients [26] as well as a study of polish society [16]. Overall, study setting and way of using a tool might bring about floor and ceiling effects in data. In this study, the tool was used through phone interviews. However, the content of the tool was examined by asking the opinion of interviewees about the clarity and comprehensibility [46].
This tool is short, and have less complex questions, so it is a proper choice to use in such prospective cohort study with various questionnaires and examinations. The tool was completed via phone calls. This technique is essential for a couple of reasons. It takes around five minutes to complete it on phone calls [47]. It enabled data collection without unnecessary travel of patients to the study site/ hospital or travel of interviewers to the place of patients’ residence, due to patients’ movement restrictions. Consecutively it helped to avoid their exposure to pathogens in the current situation of the Pandemic COVID-19. Also, in-person interview is not an easy-to-access opportunity for every researcher/ participant. Our data were attained by one assistant interviewer with more than four years of experience in running phone interviews to fill out questionnaires determined in the IRTIRS. So, lack of such skills would also lead to different results in phone-based interviews. Since conducting telephone interviews requires special communication skills, it is necessary to repeat the research on the basis of phone-interview conducted by different interviewers. On the other hand, some patients such as elderly patients and those with hearing problems may have problem with phone survey. So it is recommended to reassess the tool in specific target population such as elderly people.
In this study, subjects were recruited at six months after crash. It is important to ensure that their function was partially stable and patients have got recovered from their minor injuries causing slight physical restrictions [19].
Strengths and Limitations
To the best of our knowledge, this is the first study in which the psychometric properties of the Persian version of the 12-item WHODAS 2.0 have been evaluated through phone interviews in road traffic injury patients above 18 years in the region and even the region. This study enables assessing functioning, and disability in injured population unable to attend the study site and conducting studies to compare results between different settings.
The age limitation in the tool assessment is defined owing to some feasibility issues. So, the psychometric assessment of brief WHODAS in teenagers is recommended. We plan to extend the research by developing and administrating the tool to the population with Azeri language, the most common language in the northwest Iran. Since it is necessary to use the valid and reliable tool for people whose mother language is Azeri, and there are difficulties in running the Persian version for aged and illiterate people. Regarding the effect of various communication skills, we recommend running phone-based survey by different interviewers.
The potential sources of bias might be considered when making inferences based on our findings. This study as an exploratory cross-sectional one, six months after crash, never provides evidence on the longitudinal performance of the tool for short or long term timing after crash. Moreover, this was a cohort study involving patients with road traffic injuries hospitalized in the only two referral hospitals in Northwest Iran and registered. Thus, it mainly represents the patients needed to be hospitalized post-crash. Conducting the research on the basis of phone interviews in the same and different contexts but in larger sample size is needed. We suggest using a longitudinal design to evaluate responsiveness to change and test–retest reliability.