Validation of Indian version of musculoskeletal tumor society score questionnaire- hospital- based cohort study

doi:10.21203/rs.3.rs-1757397/v1

Background- Scoring systems are regarded as quantitative methods to assess the severity of a disease and its treatment modalities. The Musculoskeletal tumor society score (MSTS) questionnaire which was developed in 1985 and revised in 1993 to measure the functional outcome in patients with neoplasm still holds good. All scoring systems need validation in the native language of the people being used. Here, we have created an Indian version of the MSTS scoring system and validated the psychometric properties of the created version.

Material and methods- all the patients who were operated on for aggressive benign and malignant bone tumors of lower limbs were included in the study. two independent observers who were well versed in the Hindi language made a consensual draft. the questionnaire was put to the patients at two different point of time and the results were analysed.

Results- the lower extremity version demonstrated an excellent internal consistency (Cronbach's alfa = 0.999: CI = 0.9-1.0) at day 0 and excellent reliability (Cronbach alfa = 0.999: CI = 0.9-1.0) except for good reliability (Cronbach's alfa = 0.87: CI = 0.80–0.92) of gait score at day 10.

Conclusion - from the above validation we conclude that the Hindi version of the MSTS score can be easily used in place of the English in our Indian population

Scoring systems are regarded as quantitative methods to assess the severity of a disease and its treatment modalities. The Musculoskeletal tumor society score (MSTS) questionnaire which was developed in 1985 and got its revised version in 1993 to measure the functional outcome in patients with neoplasm still holds good in its present form. (1) The outcome of various neoplasm of the musculoskeletal system is still evaluated MSTS system. (2) (3) Although used extensively, all standard scoring systems still need validation in the native language of the patients being used. The MSTS scoring system has been translated and validated in Brazilian, Japanese, Chinese, Turkish, and Danish language before. (4) (5) (6) (7) (8)To the best of our knowledge, the Indian version of MSTS scoring system still needs validation in our study population.

Translating the content of the standardized scoring system needs language translation and cultural adaptation. It helps in ensuring that the content of the evaluation system is well understood by the narrator and the response provided is after a thorough understanding of the question asked rather than mere assumptions. There are also some studies that recommend various ways to assess the psychometric properties of these adapted instruments. (9)

Also, few studies have done test-retest and inter-rater reliability of the version created for the assessment tool. Where test-retest assesses the reliability of the same test which is conducted at a different time and inter-rater reliability is an assessment of the tool when the same test is conducted by two different people. (10)

Here in our study, we have created the Indian version of the MSTS scoring system and validated the psychometric properties of the created version. We have also tried to compare our results with the different translated versions of MSTS scoring found in the literature.

The designing road for the study- we translated the MSTS questionnaire into the Hindi language in the department of orthopaedics by one of the residents who are well versed with the subject and had this language till his high school level. The same questionnaire was given to a clerical staff who had the Hindi language till his graduation level.

The patients who were operated on for malignant and aggressive benign lower limb tumours and had to follow up for at least one year were included in the study. Due to the pandemic condition the patients were called telephonically by two different residents at the interval of one week. The patients were asked the same questionnaire telephonically and different scores of the assessment tool vis a vis pain score, functional score, emotional score, walking score, support score and gait score were asked. Verbal consent was obtained from all the patients and the study was approved by the institutional ethical committee. The translational method used was according to the international standard published for the translation of assessment tools. (11)

Preparation of MSTS tool in the Hindi language-

Two independent translators one from the departmental resident who was well versed in Hindi language and had it till high school level and another translator who was a clerical staff and had the Hindi language till graduation as the main subject was identified. Both the translations were prepared and discussed for discrepancies. The discrepancies were sorted out by both translators and a concurred draft was created with the consent of both translators. The author then reviewed the translated version and components of the questionnaire and any discrepancies were discussed and sorted out.

Study population- all the patients who were operated on in our institute for sarcoma and aggressive benign tumours of the lower limb were included in the study. Table 1 shows the Baseline Characteristics of the patients fulfilling inclusion criteria.

The patients who refused to participate or who have succumbed to the disease were excluded from the study. Patients who were not well versed in the Hindi language were excluded from the study. Of the total 97 cases shortlisted we included 85 cases. The rest of the cases either succumbed to disease, refused to participate or the information of their mobile number was not correct. Table 2 demonstrates the distribution of the included patients by the diagnosis and intervention.

Measurement of MSTS- as the scoring system is based on the factors related to the patient as a whole and specific to the upper and lower extremity. the items specific to lower limbs are pain, daily function, emotional acceptance, use of aids, walking ability and gait. (1) Each of the items was assigned a value of 0 to 5 and the final score is calculated as the percentage of the maximum obtainable value.

Data analysis

The data were analysed in Stata, version 12.1. Patients’ clinical demographic were recorded as descriptive data. The psychometric assessment was done using domains such as reliability and estimation of floor and ceiling effects were also determined. Reliability was further assessed using internal consistency, reliability and measurement errors. The internal consistency was measured using Cronbach’s α which was considered good when found between 0.70 and 0.95. The inter-rater reliability was assessed by asking the questionnaire by two independent observers at two different points of time and measured by the intraclass correlations coefficient. The floor and ceiling effects were considered significant when more than 15% of the patients receive the lowest and highest score.

The demographic characteristics, the surgery performed and surgical staging is described in Table 1. The distribution of various surgical methods along with the diagnosis is described in Table 2.

Table 1: Baseline Characteristics

Characteristic

Value

Age (yrs., mean)

2 – 65, 33.5

Sex (number, %)

Male

Female

54, 55.6

43, 44.4

Histological type (number, %)

CHONDROSARCOMA

OSTEOSARCOMA

EWING SARCOMA

ANEURYSMAL BONE CYST

SYNOVIAL SARCOMA

BONE METS

GIANT CELL TUMOR

OTHER

3, 3.1

20, 20.6

11, 11.3

3, 3.1

5, 5.2

47, 48.4

5, 5.2

Type of surgery (number, %)

TUMOR PROSTHESIS

CURETTAGE & BONE GRAFT

CURETTAGE & BONE CEMENT

ARTHRODESIS

AMPUTATION

WIDE EXCISION

IN SITU FIXATION

24,24.7

17, 17.5

13, 13.4

16, 16.5

5, 5.1

Surgical Staging (Number, %)

1A

1B

2A

2B

3

18, 18.5

24, 24.7

19, 19.6

20, 20.6

16, 16.6

Table 2

DISTRIBUTION OF THE OPERATION METHOD BY DIAGNOSIS
Diagnosis	Tumor prosthesis	Wide excision	Curettage &bone graft	Curettage and Bone cement	Arthrodesis	Amputation	In situ fixation
Osteosarcoma	7	0	0	0	4	9	0
GCT	10	0	16	15	4	2	0
Ewings Sarcoma	4	0	0	0	4	3	0
Metastasis	0	0	0	0	0	0	5
Synovial Sarcoma	0	3	0	0	0	0	0
Chondrosarcoma	2	0	0	0	0	1	0
Aneurysmal BC	0	0	1	2	0	0	0
Other	1	2	0	0	1	1	0
Total	24	5	17	17	13	16	5

MSTS scoring of all the patients called on day 0 is detailed in Table 3 with upper and lower limits

Table 3

MSTS Day 0
Variables	Cronbach’s alpha (Intraclass correlation^a)	95% confidence interval		F test with true value 0
Variables	Cronbach’s alpha (Intraclass correlation^a)	Lower bound	Upper bound	Value	Df1	Df2	sig
Pain	1.000^b	.	.	.	83	.	.
Functional	.995^b	.992	.997	203.107	83	83	.000
Emotional	.983^b	.974	.989	58.600	83	83	.000
Support	.984^b	.975	.989	61.484	83	83	.000
Gait	.963^b	.943	.976	27.986	83	83	.000
Cumulative	.999^b	.999	1.000	1727.689	83	83	.000
a. Type A intraclass correlation coefficients using an absolute agreement definition b. This estimate is computed assuming the interaction effect is absent, because it is not estimable otherwise.

MSTS scoring of all the patients called on day 10 is detailed in Table 4 with upper and lower limits

Table 4

MSTS Day 10
Variables	Cronbach’s alpha (Intraclass correlation^a)	95% confidence interval		F test with true value 0
Variables	Cronbach’s alpha (Intraclass correlation^a)	Lower bound	Upper bound	Value	Df1	Df2	sig
Pain	.991^b	.986	.994	112.299	83	83	.000
Functional	.971^b	.955	.981	33.991	83	83	.000
Emotional	.979^b	.968	.987	47.793	83	83	.000
Support	.964^b	.944	.976	27.187	83	83	.000
Gait	.874^b	.805	.918	7.897	83	83	.000
Cumulative	.999^b	.999	1.000	1355.781	83	83	.000
a. Type A intraclass correlation coefficients using an absolute agreement definition b. This estimate is computed assuming the interaction effect is absent, because it is not estimable otherwise.

The measurement of errors was assessed by Bland-Altman plots in the inter-rater test (Fig. 1: day 0 and Fig. 2: day 10).

(Footnote: The graph is plotted on the XY axis where X represents the difference between the two measurements, and the Y-axis shows the mean of the two measurements. The plot showed the intervals of agreements i.e. 95% of the data points should lie within +/-1.96 SD of the mean difference – limits of agreement)

Excellent overall internal consistency (Cronbach’s α = 0.999: CI = 0.9–1.0) of all the parameters of the MSTS score variables at day 0 of the study (Table 3) (Cronbach’s α = 0.999: CI = 0.9–1.0). Except for the gait score at day 10, good reliability (Cronbach’s α = 0.87: CI = 0.80–0.92), all other parameters showed excellent reliability (Cronbach’s α = 0.999: CI = 0.9–1.0) (Table 4). Bland-Altman plot at day 0 and day 10 shows good results with most values lying within th 95 % of limits of agreement Figure (1), Figure (2).

Disability scales have become prominent and complementary to the traditional outcome measures such as survival or physical assessment in musculoskeletal cancer evaluation. Multiple–language versions of the existing validated questionnaires allow us to standardize the outcome assessment and to increase the statical power of clinical studies. The English version of the MSTS questionnaire had been translated, validated and culturally adapted by other countries including Japan, Turkey, Brazil, Denmark and China in patients with sarcoma of the upper and/or lower extremity. (4) (5) (6) (7) (8) However, it was never translated to the Hindi language and this study was conducted to test whether this tool could evaluate health status outcomes and its psychometric properties. To the best of our knowledge, this is the first time the lower extremity version of the MSTS questionnaire has been translated into the Hindi language following a standardized guideline.

This study revealed that the Hindi versions of the MSTS questionnaires for lower limbs are suitable and adequate tools for measuring functional outcomes in the Indian population. The Hindi version of the MSTS questionnaire showed good internal consistency, inter and intra-observer reliability and construct validity during the postoperative evaluation of patients with lower extremity sarcoma in our study. MSTS is usually the standard measurement tool to evaluate the functional outcome, which is evaluated by the physician. Considering the fact that almost all of the Indian population are Hindi speaking, some questions of the English version of the score were not appropriate for Indian patients, so we developed the Hindi version of MSTS for use in Indian patients. To the best of our knowledge, the current study is the first to test the factor structure of the MSTS rating scale for lower limbs and its reliability properties in the Indian population.

An existing questionnaire must undergo a proper cross-cultural translation to ensure that it measures the same concept as the original measurement while using it in a different subset of population. (11)In this study, we present a cross-cultural adaptation of the original MSTS rating for lower limbs in the Indian population.

The good internal consistency and inter-rater reliability found in this study are comparable to those found by Rebolledo et al (Cronbach's alpha = 0.84) and reliability (test-retest reliability and interobserver agreement of 0.92 and 0.98, respectively), (4) Iwata et al (Cronbach’s alpha coefficient was 0.87 correlation coefficient (0.92; 95% CI, 0.88–0.95) (5)and Xu et al (Cronbach's α of 0.86 correlation coefficient of 0.85–0.96) (6). It is noted that the original English version did not report a Cronbach’s α although it reported good inter-observer reliability. The translation and back translation in our study resulted in minimal discrepancies that were resolved by consensus. The resultant Indian rating score reflects both the semantic and conceptual equivalence to the original English version. Analysis from our study indicated an excellent internal consistency (0.9). This result is similar with that found by Davis et al. (3) (0.91) in a study with 83 patients with lower extremity sarcoma and better than that of Rebolledo et al. (4) In their study, Lee et al. (12)reported Cronbach’s alpha of 0.88 in a study with 49 patients with musculo- skeletal tumors, thus reflecting the internally valid nature of our score – MSTS Hindi.

Measurement errors in the MSTS has been studied previously by Saebye CKP et al. (8) where they demonstrated low mean bias on all plots, however, with wide limits of agreement, which indicated a possible high measurement error. In the present study, there was low mean bias as demonstrated in Figs. 1 and 2 with a narrow limit of agreement indicating a low measurement error. No other study has included measurement error into the analysis. A change in the MSTS score greater than the measurement error should be considered a possible ‘real’ change in the functional outcome and hence the test for measurement error becomes important part of the validation process. (13)

Our study included 97 patients with lower extremity sarcomas. Considering the previous guidelines concerning the validation of instruments have set a minimum of 100 patients as an excellent sample size, while 50 to 99 patients constitute a good sample size, this study included a very good sample size.

This study had several limitations. The limited number of patients and the oncologic type distribution of the patients may have influenced the results. There was a gap of 10 days interval between the two-measurement taken by the same observer and it could potentially cause a deviation in their score. Although it is necessary to wait for this period in validity studies, this could be one of the limitations of our study. Patients were identified and evaluated retrospectively using our institutional records based on diagnosis and the data related to imaging, pathology examination, or intraoperative findings were not available in detail for each patient. A major limitation of our study was that the functional scores like SF-36 and Toronto Extremity Salvage Score (TESS) were not evaluated. This was, in part, also impacted by ongoing global pandemic (COVID-19), which resulted in alteration of routine patient visit and follow up.

The Hindi version of MSTS scoring system is a reliable method to evaluate the quality of life of native Indians with Hindi as their mother tongue. Though the number of patients was less but this study can prove as a benchmark to carry out further studies in Indian population.

Ethics approval and consent to participate

Appropriate ethical approval from the institute ethical committee was taken along with the consent to participate from the patients

Consent for publication

I, the corresponding author, on behalf of all the co-authors give the consent to publish the article

Competing Interests

The authors and co-authors declare no conflict of interest in the above study.

Author contributions

M.D. – Planning and forming the outline of the study, final manuscript

S.A. – Manuscript writing, statistical analysis

S.B. – Data collection, review of literature, statistical analysis

C.K.K. – Review of literature, statistical analysis

S.P.S. – Review of literature

M.K. – Data collection

All authors reviewed the manuscript

Funding

The authors and the co-authors did not receive any funding from any source for the study

Availability of data and materials

All the data of the patients is there with the corresponding author and can be produced on request

Enneking WF, Dunham W, Gebhardt MC, Malawar M, Pritchard DJ. A system for the functional evaluation of reconstructive procedures after surgical treatment of tumors of the musculoskeletal system. Clin Orthop. 1993 Jan;(286):241–6.
Ginsberg JP, Rai SN, Carlson CA, Meadows AT, Hinds PS, Spearing EM, et al. A comparative analysis of functional outcomes in adolescents and young adults with lower-extremity bone sarcoma. Pediatr Blood Cancer. 2007 Dec;49(7):964–9.
Schreiber D, Bell RS, Wunder JS, O’Sullivan B, Turcotte R, Masri BA, et al. Evaluating function and health related quality of life in patients treated for extremity soft tissue sarcoma. Qual Life Res Int J Qual Life Asp Treat Care Rehabil. 2006 Nov;15(9):1439–46.
Rebolledo DCS, Vissoci JRN, Pietrobon R, de Camargo OP, Baptista AM. Validation of the Brazilian version of the musculoskeletal tumor society rating scale for lower extremity bone sarcoma. Clin Orthop. 2013 Dec;471(12):4020–6.
Iwata S, Uehara K, Ogura K, Akiyama T, Shinoda Y, Yonemoto T, et al. Reliability and Validity of a Japanese-language and Culturally Adapted Version of the Musculoskeletal Tumor Society Scoring System for the Lower Extremity. Clin Orthop. 2016 Sep;474(9):2044–52.
Xu L, Li X, Wang Z, Xiong J, Wang S. Functional evaluation for patients with lower extremity sarcoma: application of the Chinese version of Musculoskeletal Tumor Society scoring system. Health Qual Life Outcomes. 2017 May 19;15(1):107.
Ocaktan B, Deveci MA, Tokgöz MA, Yapar A, Şimşek A. Cross-cultural adaptation and validation of the Turkish version of the Musculoskeletal Tumor Society scoring system in patients with musculoskeletal tumors. Acta Orthop Traumatol Turc. 2021 Mar;55(2):141–6.
Saebye CKP, Keller J, Baad-Hansen T. Validation of the Danish version of the musculoskeletal tumour society score questionnaire. World J Orthop. 2019 Jan 18;10(1):23–32.
Mason TC. Cross-cultural instrument translation: assessment, translation, and statistical applications. Am Ann Deaf. 2005;150(1):67–72.
The 4 Types of Reliability | Definitions, Examples, Methods [Internet]. Scribbr. 2019 [cited 2022 Feb 2]. Available from: https://www.scribbr.com/methodology/types-of-reliability/
Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine. 2000 Dec 15;25(24):3186–91.
Lee SH, Kim DJ, Oh JH, Han HS, Yoo KH, Kim HS. Validation of a functional evaluation system in patients with musculoskeletal tumors. Clin Orthop. 2003 Jun;(411):217–26.
Terwee CB, Mokkink LB, Knol DL, Ostelo RWJG, Bouter LM, de Vet HCW. Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012;21(4):651–7.

No competing interests reported.

Validation of Indian version of musculoskeletal tumor society score questionnaire- hospital- based cohort study

Status:

Version 1

Abstract

Figures

Introduction

Materials And Methods

Data analysis

Results

Discussion

Conclusion

Declarations

References

Additional Declarations

Status:

Version 1