The reliability and validity of a slightly revised Chinese version simplified modified Rankin scale questionnaire

DOI: https://doi.org/10.21203/rs.2.13565/v2

Abstract

Background: The slightly revised English version simplified modified Rankin scale questionnaire smRSq(2011) was shown to be reliable, valid, and useful in scoring the modified Rankin scale (mRS) after stroke. Our aim was to assess the inter-rater reliability and validity of a novel Chinese version smRSq(2011). Methods: The English version smRSq(2011) was translated into Chinese by a standard process. We recruited 300 consecutive hospitalized ischemic stroke patients in the department of neurology, Beijing Chaoyang Hospital. Six randomly paired raters scored the conventional mRS, the novel Chinese version smRSq(2011), the National Institutes of Health Stroke Scale (NIHSS), and the Barthel index (BI) in-person. Inter-rater reliability and validity were assessed. Results: Among the 300 ischemic stroke patients, mean age was 64.9±12.1 years, and 220 (73%) were male. For inter-rater reliability of the smRSq(2011), the percent agreement among the paired raters was 87%, the kappa (κ) was 0.84 (95% CI, 0.79-0.88), and the weighted kappa (κw) was 0.96 (95% CI, 0.95-0.98). The percent agreement between the smRSq(2011) scores by the first rater and the conventional mRS scores by the second rater in each pair was 55%, κ=0.47 (95% CI, 0.40-0.54), and κw=0.91 (95% CI, 0.89-0.93). In construct validity testing, the Spearman’s correlation coefficients comparing the smRSq(2011) scores by the first rater with the NIHSS and the BI scores by the second rater were 0.83 (P<0.001) and -0.86 (P<0.001), respectively. Conclusions: Our results suggest usefulness of the novel Chinese version smRSq(2011) in scoring the mRS in Chinese stroke patients. Further validation in other clinical settings, including in communities and by remote methods in China is warranted.

Background

Stroke is quite common throughout the world [1, 2] including China [3]. Assessing functional outcomes after stroke accurately and reliably is a critical part of clinical research and stroke registries. The modified Rankin Scale (mRS) is the most commonly utilized scale for assessing functional outcome after stroke [4]. However, scoring the mRS involves some subjectivity and its reliability is limited [5]. Multiple mRS scoring aids have been reported to improve its reliability [6-10].

The simplified mRS questionnaire (smRSq)(2010) was initially reported in 2010 with good reliability among a wide variety of raters in addition to being relatively simple to use and an average time to score the mRS of <2 minutes [7]. The initial version of the smRSq has been translated and validated in Chinese stroke patients [11].   

A slightly revised version of the smRSq(2010), with improved agreements for mRS scores 3 to 5 (moderate to severe disability) was reported in 2011, the smRSq(2011) [9]. A panel of experts for the International Consortium of Health Outcomes has recommended the smRSq(2011) for standardized mRS scoring [12]. In this study we test the clinimetric properties of a novel Chinese version smRSq(2011). A useful Chinese version smRSq(2011) could facilitate the collection of internationally standardized functional outcome data in Chinese stroke patients. More accurate and standardized data could lead to a better understanding of stroke prognosis in China.

Materials and Methods

First, we translated the smRSq(2011) from English to Chinese with forward-backward translation to allow for inconsistency detection, and the draft questionnaire was checked for face validity, as with our prior translation process of the original smRSq [11]. We anticipated that the Chinese version smRSq(2011)(Figure 1.e) would be improved in comparison to the original Chinese version smRSq(2010), similar to the results for the English version smRSq(2011) [9].

We enrolled 300 consecutive ischemic stroke patients in the department of neurology, Beijing Chaoyang Hospital, between July and December 2014. We used the World Health Organization definition of stroke [13].  All strokes were confirmed by CT or MRI.

Six stroke experienced physicians performed all the ratings within 7 days after admission blinded to the patients’ clinical data and to the other raters’ scores. The six raters were randomly allocated into 300 pairs, one pair for each patient. The first rater in each pair assesses a patient on day one and the second rater on day two, in order to minimize the risk of change in the patient's condition. If patients could not answer the questionnaire, we interviewed their caregivers when possible. Each rater scored the conventional mRS first, followed by the Chinese version smRSq(2011), and the National Institutes of Health Stroke Scale (NIHSS). Only the second rater scored the Barthel Index (BI), either before or after the NIHSS. The NIHSS indicates stroke severity. The Barthel Index (BI) measures activities of daily living. Each rater estimated their average time to score the smRSq.

Our study was approved by the ethics committee of Beijing Chaoyang Hospital, Capital Medical University. Every patient gave a valid informed consent to participate.

Statistical Analysis 

For inter-rater reliability of the conventional mRS and the smRSq(2011), we compared scores between the first and the second rater. We calculated the percent agreement and determined kappa (κ) and weighted kappa (κw) with 95% confidence intervals (CI). For validity of the smRSq(2011) against the conventional mRS, we compared the smRSq(2011) scores of the first rater to the mRS scores of the second rater. We correlated the Chinese version smRSq(2011) scores by the first rater with the NIHSS scores and the BI scores by the second rater using the Spearman’s correlation. We used the Statistical Package for Social Sciences (SPSS) version 16.0 (SPSS Inc., Chicago, IL, USA) for data analysis. P values less than 0.05 were considered statistically significant.

Results

Among 300 ischemic stroke patients, mean age was 64.9±12.1 years, 220 (73%) were male, 210 (70%) had hypertension, 88 (29%) had diabetes mellitus, 68 (23%) had prior stroke, 55 (18%) had coronary disease, 21 (7%) had atrial fibrillation, and 160 (53%) were current cigarette smokers. The median scores on the NIHSS was 4.5, the BI was 73.2, the conventional mRS was 2.5, and the smRSq(2011) was 2.6. The average time to score the smRSq(2011) among all the raters was 70 seconds.

For the conventional mRS inter-rater reliability, the percent agreement between the raters was 80%, the κ=0.76 (95% CI, 0.70-0.81), and the κw=0.93 (95% CI, 0.90-0.96).

Table 1 shows the cross-tabulation between the smRSq(2011) scores by the first rater with the second rater. For inter-rater reliability, the percent agreement between the raters was 87%, the κ=0.84 (95% CI, 0.79-0.88), and the κw=0.96 (95% CI, 0.95-0.98). Figure 1 illustrates agreement between the smRSq(2011) scores by the paired raters.

Comparing the smRSq(2011) scores by the first rater with the conventional mRS scores by the second rater in each pair (Table 2), percent agreement was 55%, κ=0.47 (95% CI, 0.40-0.54), and κw=0.91 (95% CI, 0.89-0.93). Comparing the smRSq(2011) scores by the second rater with the conventional mRS scores by the first rater produced a similar result (agreement 54%, κ=0.46 (95% CI, 0.40-0.52), and κw=0.90 (95% CI, 0.88-0.93).

In construct validity testing, comparing the smRSq(2011) scores by the first rater with the NIHSS scores by the second rater, the Spearman correlation coefficient was 0.83 (P<0.001). Comparing the smRSq(2011) scores by the second rater with the NIHSS scores by the first rater gave a similar result (Spearman correlation coefficient 0.82). Comparing the smRSq(2011) scores by the first rater with the BI scores by the second rater, the Spearman correlation coefficient was -0.86 (P<0.001).

Discussion

Our primary objective in this study was to test a novel Chinese version smRSq(2011)(Figure 1.e). We assessed the inter-rater reliability of the smRSq(2011) and validated it against the conventional mRS interview. We tested the construct validity of the smRSq(2011) against the NIHSS and the BI. We found good reliability and validity of the Chinese version smRSq (2011).

The inter-rater reliability of the Chinese version smRSq(2011) was slightly better than that of the conventional mRS (κ=0.84 vs κ=0.76, respectively), and that of the Chinese version original smRSq(2010) [11]. In this study the inter-rater reliability is similar to those reported with a structured interview mRS [5,6,14].

Construct validity testing showed good correlations between the smRSq(2011) and the NIHSS and the BI. This result is consistent with other validity studies using the conventional mRS [15], the English version smRSq(2011) [16], and our prior Chinese smRSq(2010) study [11].

As in our prior study with the Chinese smRSq(2010), the Chinese smRSq(2011) questions were understood by the majority of patients and caregivers with little or no explanation, and the scale was easily performed by the raters. Time to score the smRSq was relatively brief (average 70 seconds).

This study has limitations. First, the paired ratings were done on two consecutive days, which may have introduced some recall bias. To limit recall bias we instructed the patients to treat each interview independently of the others. Second, the mRS should ideally be scored after some period of recovery from stroke and in a community setting. Thus, although the scores in our patients likely do not represent their ultimate functional outcome, the paired ratings were done under similar circumstances. Third, we did not test the novel Chinese version smRSq (2011) over the telephone or via telemedicine, and remote outcome assessments are often more practical than in-person assessments.

Conclusion

In conclusion, this study demonstrates good reliability and validity of the novel Chinese version smRSq (2011) in scoring the mRS in Chinese stroke patients. Furthermore, the simplicity of the smRSq(2011) tool further augments its usefulness. For generalizability, confirmation of our findings in Chinese stroke patients in community settings, by remote methods, by non-stroke physicians, and by non-physicians is warranted.

Abbreviations

mRS: modified Rankin scale; BI: Barthel index; NIHSS: National Institutes of Health Stroke Scale; smRSq: simplified modified Rankin scale questionnaire.

Declarations

Acknowledgments

We thank all the clinical neurologists (Z.S, Y.Z., Y.L., S.W., L.D., X.Z.) for their ratings of the clinical stoke scales.

Funding

This work was supported by the National Natural Science Foundation of China (81301016) and the Beijing Municipal Administration of Hospitals Incubating Program (PX2019009). The funders did not play any role in study design; in the collection, analysis, and interpretation of data; in the writing of the report; nor in the preparation, review, or approval of the manuscript.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author upon request.

Authors’ contributions

JLY, WLH and AB conceived and designed the experiments. JLY analyzed the data and drafted the manuscript. JLY and YXW collected and analyzed data. All authors have read and approved the final manuscript to be published.

Ethics approval and consent to participate

Our work was approved by the Ethics Committee of Beijing Chaoyang Hospital. This study was performed in accordance with the Declaration of Helsinki and all authors agreed the publish statements of BMC Neurology.

Consent for publication

Written informed consent was obtained from all the subjects.

Competing interests

The authors declare that they have no competing interests.

References

  1. Donkor ES. Stroke in the 21(st) Century: A Snapshot of the Burden, Epidemiology, and Quality of Life. Stroke Res Treat. 2018; 2018: 3238165.
  2. Katan M, Luft A. Global Burden of Stroke. Semin Neurol. 2018; 38(2): 208-11.
  3. Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L, et al. Prevalence, Incidence, and Mortality of Stroke in China: Results from a Nationwide Population-Based Survey of 480 687 Adults. Circulation. 2017; 135 (8): 759-71.
  4. van Swieten JC, Koudstaal PJ, Visser MC, Schouten HJ, van Gijn J. Interobserver agreement for the assessment of handicap in stroke patients. Stroke. 1988; 19(5): 604-7.
  5. Quinn TJ, Dawson J, Walters MR, Lees KR. Reliability of the modified Rankin Scale: a systematic review. Stroke. 2009; 40(10): 3393-5.
  6. Wilson JT, Hareendran A, Grant M, Baird T, Schulz UG, Muir KW, et al. Improving the assessment of outcomes in stroke: use of a structured interview to assign grades on the modified Rankin Scale. Stroke. 2002; 33(9): 2243-6.
  7. Bruno A, Shah N, Lin C, Close B, Hess DC, Davis K, et al. Improving modified Rankin Scale assessment with a simplified questionnaire. Stroke. 2010; 41(5): 1048-50.
  8. Saver JL, Filip B, Hamilton S, Yanes A, Craig S, Cho M, et al. Improving the reliability of stroke disability grading in Clinical Trials and Clinical Practice: the Rankin Focused Assessment (RFA). Stroke. 2010; 41(5): 992–5.
  9. Bruno A, Akinwuntan AE, Lin C, Close B, Davis K, Baute V, et al. Simplified modified rankin scale questionnaire: reproducibility over the telephone and validation with quality of life. Stroke. 2011; 42(8): 2276-9.
  10. Patel N, Rao VA, Heilman-Espinoza ER, Lai R, Quesada RA, Flint AC. Simple and reliable determination of the modified Rankin scale score in neurosurgical and neurological patients: The mRS-9Q. Neurosurgery. 2012; 71(5): 971–5.
  11. Yuan JL, Bruno A, Li T, Li SJ, Zhang XD, Li HY, et al. Replication and extension of the simplified modified rankin scale in 150 Chinese stroke patients. Eur Neurol. 2012; 67(4): 206-10.
  12. Salinas J, Sprinkhuizen SM, Ackerson T, Bernhardt J, Davie C, George MG, et al. An international standard set of patient-centered outcome measures after stroke. Stroke. 2016; 47(1):180-6.
  13. Goldstein M, Barnett HJM, Orgogozo JM, Sartorius N, Symon L, Vereshchagin NV. Stroke--1989. Recommendations on stroke prevention, diagnosis, and therapy. Report of the WHO Task Force on Stroke and other Cerebrovascular Disorders. Stroke, 1989, 20(10): 1407-31.
  14. Wilson JT, Hareendran A, Hendry A, Potter J, Bone I, Muir KW. Reliability of the modified Rankin Scale across multiple raters: benefits of a structured interview. Stroke. 2005; 36(4):777-81.
  15. Banks JL, Marotta CA. Outcomes validity and reliability of the modified Rankin Scale: implications for stroke clinical trials: a literature review and synthesis. Stroke. 2007; 38(3):1091- 6.
  16. Bruno A, Close B, Switzer JA, Hess DC, Gross H, Nichols FT 3rd, et al. Simplified modified Rankin Scale questionnaire correlates with stroke severity. Clin Rehabil. 2013; 27(8):724-7.

Tables

Table 1. Cross-tabulation between the smRSq(2011) scores by the first and the second rater in each pair. 

 

 

Second rater

 

0

1

2

3

4

5

Total

First

 

 

rater

0

90

0

0

0

0

0

90

1

8

24

3

2

2

0

39

2

0

2

21

1

0

0

24

3

1

0

0

29

8

1

39

4

0

0

1

1

41

3

46

5

0

0

1

0

5

56

62

Total

99

26

26

33

56

60

300



Table 2. Cross-tabulation between the smRSq(2011) scores by the first rater and the conventional mRS scores by the second rater.

 

 

smRSq(2011) rater1

 

 

0

1

2

3

4

5

Total

mRS

rater2

0

28

3

0

1

0

0

32

1

62

18

1

0

0

0

81

2

0

14

22

1

1

1

39

3

0

2

1

24

1

0

28

4

0

2

0

13

41

28

84

5

0

0

0

0

3

33

36

Total

90

39

24

39

46

62

300