The reliability and validity of a novel Chinese version simplified modified Rankin scale questionnaire(2011)

DOI: https://doi.org/10.21203/rs.2.13565/v5

Abstract

Background The modified Rankin Scale (mRS) is a key global outcome measure after stroke internationally. The latest English version of the simplified modified Rankin scale questionnaire (smRSq)(2011) is a reliable and valid tool in scoring the mRS after stroke. In order to use this tool in Chinese patients, we translated it into Chinese and tested its clinimetric properties. Methods The English version smRSq(2011) was translated into Chinese by a standard process. We recruited 300 consecutive hospitalized ischemic stroke patients in the department of neurology, Beijing Chaoyang Hospital. Six randomly paired raters scored the conventional mRS, the novel Chinese version smRSq(2011), the National Institutes of Health Stroke Scale (NIHSS), and the Barthel index (BI) in-person. Inter-rater reliability and validity were assessed. Results Among the 300 ischemic stroke patients, mean age was 64.9±12.1 years, and 220 (73%) were male. For inter-rater reliability of the smRSq(2011), the percent agreement among the paired raters was 87%, the kappa ( κ) was 0.84 (95% CI, 0.79-0.88), and the weighted kappa ( κ w ) was 0.96 (95% CI, 0.95-0.98). The percent agreement between the smRSq(2011) scores and the conventional mRS scores was 55%, κ =0.47 (95% CI, 0.40-0.54), and κ w =0.91 (95% CI, 0.89-0.93). In construct validity testing, the Spearman’s correlation coefficients comparing the smRSq(2011) scores with the NIHSS and the BI scores were 0.83 ( P <0.001) and -0.86 ( P <0.001), respectively. Conclusions Our results show good to excellent clinimetric properties of the novel Chinese version smRSq(2011) in scoring the mRS in Chinese stroke patients. Further validation in other clinical settings, including in communities and by remote methods in China is warranted.

Background

China comprises nearly one fifth of the world’s population and the age-standardized prevalence of stroke among Chinese adults has been estimated at 1.1-2.1% (approximately 16-27 million people) [1, 2]. However, the age-specific stroke prevalence increases sharply after the age of 50 years to approximately 5.0-6.7% among people aged 70-79 years.

Assessing functional status after stroke accurately and reliably is a critical part of clinical research and stroke registries. The modified Rankin Scale (mRS) has emerged as the most commonly utilized scale for assessing functional status after stroke [3]. However, because scoring the mRS involves collecting various patient performance data by interviewing patients and caregivers, some subjectivity is inherent. Consequently, its reliability has been measured as suboptimal [4].

Multiple standardized mRS scoring aids have been proposed to improve its reliability [5-9]. These aids consist of prespecified questions and an algorithm to determine the mRS score. One of the simplest, shortest, and validated mRS scoring aids is the latest simplified modified Rankin Scale questionnaire (smRSq)(2011) [8]. This tool has been validated among a wide variety of raters, including over the telephone, with an average time to complete of <2 minutes.

We previously translated into Chinese and validated the original smRSq(2010) [10]. Subsequently, the smRSq(2011) showed improved agreements between raters over the original smRSq(2010) [8]. A panel of experts for the International Consortium of Health Outcomes has recommended the smRSq(2011) for standardized mRS scoring [11]. Thus, in this study our aim was to validate a novel Chinese version smRSq(2011). A validated Chinese version smRSq(2011) could facilitate the collection of internationally standardized functional outcome data in Chinese stroke patients. More accurate and standardized data could lead to a better understanding of stroke prognosis in China.

Materials And Methods

We translated the smRSq(2011) from English to Chinese with forward and backward translation to allow for inconsistency detection, and the draft questionnaire was checked for face validity. We enrolled 300 consecutive ischemic stroke patients in the department of neurology, Beijing Chaoyang Hospital, between July and December 2014. We excluded stroke patients who were critically ill on respirators, neurologically unstable, or refused to participate. Also, patients with mild strokes who were released soon after admission, were excluded. We used the World Health Organization definition of stroke [12]. All strokes were confirmed by CT or MRI.

Six raters performed all the ratings within 7 days after admission blinded to the patients’ clinical data and to the other raters’ scores. The six raters consisted of neurology residents between 1-3 years in training at our hospital. Each patient was rated by two of the six randomly selected raters. To limit recall bias patients were rated only once on the first day, and to avoid a change in patients functional status, the second rating was done no later than the following day. The first rater in each pair assesses a patient on day one and the second rater on day two, in order to minimize the risk of change in the patient's condition. If patients could not answer the questionnaire, we interviewed their caregivers. Each rater scored the conventional mRS first, followed by the Chinese version smRSq(2011), and the National Institutes of Health Stroke Scale (NIHSS). Only the second rater scored the Barthel Index (BI), either before or after the NIHSS. The NIHSS indicates stroke severity. The Barthel Index (BI) measures activities of daily living. Each rater estimated their average time to score the smRSq.

Our study was approved by the ethics committee of Beijing Chaoyang Hospital, Capital Medical University. Every patient gave a valid informed consent to participate.

Statistical Analysis

For inter-rater reliability of the conventional mRS and the smRSq(2011), we compared scores between the first and the second rater. We calculated the percent agreement and determined kappa (κ) and weighted kappa (κ_w)with 95% confidence intervals (CI). For validity of the smRSq(2011) against the conventional mRS, we compared the smRSq(2011) scores of the first rater to the mRS scores of the second rater. We correlated the Chinese version smRSq(2011) scores with the NIHSS scores by the two raters in each pair. We correlated the Chinese smRSq(2011) scores by the first rater with the BI scores by the second rater (only the second rater scored the BI) using the Spearman’s correlation. We used the Statistical Package for Social Sciences (SPSS) version 16.0 (SPSS Inc., Chicago, IL, USA) for data analysis. We considered kappa values and correlation coefficients >0.75 as good and >0.90 as excellent. P values less than 0.05 were considered statistically significant.

Results

Table 1 shows the clinical characteristics and the aggregate scores for each scale of the 300 patients. The average time to score the Chinese smRSq(2011) among all the raters was 70 seconds.

For the conventional mRS inter-rater reliability, the percent agreement between the raters was 80%, the κ=0.76 (95% CI, 0.70-0.81), and the κ_w =0.93 (95% CI, 0.90-0.96).

Table 2 shows the cross-tabulation of the smRSq(2011) scores between the paired raters. For inter-rater reliability, the percent agreement between the raters was 87%, the κ=0.84 (95% CI, 0.79-0.88), and the κ_w=0.96 (95% CI, 0.95-0.98). Figure 1 illustrates agreement between the smRSq(2011) scores by the paired raters.

Comparing the smRSq(2011) scores by the first rater with the conventional mRS scores by the second rater in each pair (Table 3), percent agreement was 55%, κ=0.47 (95% CI, 0.40-0.54), and κ_w=0.91 (95% CI, 0.89-0.93). Comparing the smRSq(2011) scores by the second rater with the conventional mRS scores by the first rater produced a similar result (agreement 54%, κ=0.46 (95% CI, 0.40-0.52), and κ_w=0.90 (95% CI, 0.88-0.93).

In construct validity testing, comparing the smRSq(2011) scores by the first rater with the NIHSS scores by the second rater, the Spearman correlation coefficient was 0.83 (P<0.001). Comparing the smRSq(2011) scores by the second rater with the NIHSS scores by the first rater gave a similar result (Spearman correlation coefficient 0.82). Comparing the smRSq(2011) scores by the first rater with the BI scores by the second rater, the Spearman correlation coefficient was -0.86 (P<0.001).

Discussion

Our primary objective in this study was to test the clinimetric properties of a novel Chinese version smRSq(2011)(Figure 1.e). We assessed the inter-rater reliability of the smRSq(2011) and validated it against the conventional mRS interview. We tested the construct validity of the smRSq(2011) against the NIHSS and the BI. We found good to excellent reliability and good validity of the Chinese version smRSq (2011). The Chinese smRSq(2011) questions were understood by the majority of patients and caregivers with little or no explanation, and the scale was easy to administer. Time to score the smRSq was relatively brief (average 70 seconds).

The inter-rater reliability of the novel Chinese smRSq(2011) was good to excellent (κ=0.84, κ_w=0.96) and somewhat better than that of the conventional mRS interview (κ=0.76, κ_w=0.93). Comparing the Chinese smRSq(2011) to the conventional mRS interview showed a lower agreement (κ=0.47), but the disagreements were relatively small as indicated by the excellent weighted kappa of 0.91. Similar inter-rater reliabilities and comparisons to the conventional mRS have been reported using various other aids involving a structured interview [4,5,7,13].

Construct validity testing showed good correlations between the novel Chinese smRSq(2011) and both the NIHSS (0.82-0.83) and the BI (-0.86). This result is consistent with other validity studies using the conventional mRS [14], the English version smRSq(2011) [15], and our prior study of the Chinese smRSq(2010)[10].

Our results suggest that the novel Chinese smRSq(2011) may be a suitable aid for scoring the mRS in Chinese stroke patients. The advantage of this aid over the conventional mRS interview is simplicity, brevity, and perhaps improved reliability. The significance of this aid is magnified by the relatively large prevalence of stroke in China, and the advantage of acquiring standardized functional outcome measures.

This study has limitations. First, the paired ratings were done on two consecutive days, which may have introduced some recall bias. To limit recall bias we instructed the patients to treat each interview independently of the others. Second, the mRS should ideally be scored after some period of recovery from stroke and in a community setting. Thus, although the scores in our patients likely do not represent their ultimate functional outcome, the paired ratings were done under similar circumstances. Third, we did not test the novel Chinese version smRSq (2011) over the telephone or via telemedicine, and remote outcome assessments are often more practical than in-person assessments.

Conclusion

In conclusion, this study demonstrates good to excellent reliability and good validity of the novel Chinese smRSq (2011) in scoring the mRS in Chinese stroke patients. The simplicity of the smRSq(2011) aid further augments its usefulness. Additional confirmatory testing of the Chinese smRSq(2011) in out-of-hospital settings, by remote methods, by non-stroke physicians, and by non-physicians is warranted.

Abbreviations

mRS: modified Rankin scale; BI: Barthel index; NIHSS: National Institutes of Health Stroke Scale; smRSq: simplified modified Rankin scale questionnaire.

Declarations

Acknowledgments

We thank all the clinical neurologists (Zheng Sun, Yue Zhao, Yue Li, Shumei Wang, Lingling Ding, Xiaoyu Zhang) for their ratings of the clinical stoke scales.

Funding

This work was supported by the National Natural Science Foundation of China (81301016) and the Beijing Municipal Administration of Hospitals Incubating Program (PX2019009). The funders did not play any role in study design; in the collection, analysis, and interpretation of data; in the writing of the report; nor in the preparation, review, or approval of the manuscript.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author upon request.

Authors’ contributions

JLY, WLH and AB conceived and designed the experiments. JLY analyzed the data and drafted the manuscript. JLY and YXW collected and analyzed data. All authors have read and approved the final manuscript to be published.

Ethics approval and consent to participate

Our work was approved by the Ethics Committee of Beijing Chaoyang Hospital. This study was performed in accordance with the Declaration of Helsinki. Written informed consent was obtained from all the subjects.

Consent for publication

Not applicable

Competing interests

The authors declare that they have no competing interests.

References

Wang W, Jiang B, Sun H, et al. Prevalence, Incidence, and Mortality of Strokein China: Results from a Nationwide Population-Based Survey of 480 687 Adults. Circulation 2017;135:759-71.
Li Q, Wu H, Yue W, et al. Prevalence of Stroke and Vascular Risk Factors in China: a Nationwide Community-based Study. Sci Rep 2017. DOI:10.1038/s41598-017-06691-1.
van Swieten JC, Koudstaal PJ, Visser MC, Schouten HJ, van Gijn J. Interobserver agreement for the assessment of handicap in stroke patients. Stroke. 1988; 19(5): 604-7.
Quinn TJ, Dawson J, Walters MR, Lees KR. Reliability of the modified Rankin Scale: a systematic review. Stroke. 2009; 40(10): 3393-5.
Wilson JT, Hareendran A, Grant M, Baird T, Schulz UG, Muir KW, et al. Improving the assessment of outcomes in stroke: use of a structured interview to assign grades on the modified Rankin Scale. Stroke. 2002; 33(9): 2243-6.
Bruno A, Shah N, Lin C, Close B, Hess DC, Davis K, et al. Improving modified Rankin Scale assessment with a simplified questionnaire. Stroke. 2010; 41(5): 1048-50.
Saver JL, Filip B, Hamilton S, Yanes A, Craig S, Cho M, et al. Improving the reliability of stroke disability grading in Clinical Trials and Clinical Practice: the Rankin Focused Assessment (RFA). Stroke. 2010; 41(5): 992–5.
Bruno A, Akinwuntan AE, Lin C, Close B, Davis K, Baute V, et al. Simplified modified rankin scale questionnaire: reproducibility over the telephone and validation with quality of life. Stroke. 2011; 42(8): 2276-9.
Patel N, Rao VA, Heilman-Espinoza ER, Lai R, Quesada RA, Flint AC. Simple and reliable determination of the modified Rankin scale score in neurosurgical and neurological patients: The mRS-9Q. Neurosurgery. 2012; 71(5): 971–5.
Yuan JL, Bruno A, Li T, Li SJ, Zhang XD, Li HY, et al. Replication and extension of the simplified modified rankin scale in 150 Chinese stroke patients. Eur Neurol. 2012; 67(4): 206-10.
Salinas J, Sprinkhuizen SM, Ackerson T, Bernhardt J, Davie C, George MG, et al. An international standard set of patient-centered outcome measures after stroke. Stroke. 2016; 47(1):180-6.
Goldstein M, Barnett HJM, Orgogozo JM, Sartorius N, Symon L, Vereshchagin NV. Stroke--1989. Recommendations on stroke prevention, diagnosis, and therapy. Report of the WHO Task Force on Stroke and other Cerebrovascular Disorders. Stroke, 1989, 20(10): 1407-31.
Wilson JT, Hareendran A, Hendry A, Potter J, Bone I, Muir KW. Reliability of the modified Rankin Scale across multiple raters: benefits of a structured interview. Stroke. 2005; 36(4):777-81.
Banks JL, Marotta CA. Outcomes validity and reliability of the modified Rankin Scale: implications for stroke clinical trials: a literature review and synthesis. Stroke. 2007; 38(3):1091- 6.
Bruno A, Close B, Switzer JA, Hess DC, Gross H, Nichols FT 3rd, et al. Simplified modified Rankin Scale questionnaire correlates with stroke severity. Clin Rehabil. 2013; 27(8):724-7.

Tables

Table 1. Characteristics of the 300 patients in this study.

Characteristic	Result
Age, years, mean (SD)	64.9 (12.1)
Men, n (%)	220 (73)
Hypertension, n (%)	210 (70)
Diabetes mellitus, n (%)	88 (29)
Prior stroke, n (%)	68 (23)
Coronary artery disease, n (%)	55 (18)
Atrial fibrillation, n (%)	21 (7)
Current cigarette smoker, n (%)	160 (53)
NIHSS score, median (IQR)	4 (1-7)

SD – standard deviation; IQR – interquartile range;

Table 2. Cross-tabulation of the smRSq(2011) scores between the paired raters.

		Second rater
		0	1	2	3	4	5	Total
First rater	0	90	0	0	0	0	0	90
	1	8	24	3	2	2	0	39
	2	0	2	21	1	0	0	24
	3	1	0	0	29	8	1	39
	4	0	0	1	1	41	3	46
	5	0	0	1	0	5	56	62
Total		99	26	26	33	56	60	300

Table 3. Cross-tabulation of the smRSq(2011) scores by the first rater and the conventional mRS

scores by the second rater.

		smRSq(2011) rater1
		0	1	2	3	4	5	Total
mRS rater2	0	28	3	0	1	0	0	32
	1	62	18	1	0	0	0	81
	2	0	14	22	1	1	1	39
	3	0	2	1	24	1	0	28
	4	0	2	0	13	41	28	84
	5	0	0	0	0	3	33	36
Total		90	39	24	39	46	62	300

Additional File

- File name: Additional File 1

- Title of data: The Chinese language smRSq(2011)

- Brief description of the data: This additional File 1 was about the Chinese language smRSq(2011),

which was translated from the smRSq(2011) with forward and backward translation to allow for

inconsistency detection, and the draft questionnaire was checked for face validity.