“From Where I Stand”: Using Multiple Anchors Yields Different Benchmarks for Meaningful Improvement and Worsening in the Rheumatoid Arthritis Flare Questionnaire (RA-FQ)

doi:10.21203/rs.3.rs-427688/v1

Download PDF

Research Article

“From Where I Stand”: Using Multiple Anchors Yields Different Benchmarks for Meaningful Improvement and Worsening in the Rheumatoid Arthritis Flare Questionnaire (RA-FQ)

https://doi.org/10.21203/rs.3.rs-427688/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Purpose

The Rheumatoid Arthritis Flare Questionnaire (RA-FQ) is a patient-reported measure of disease activity in RA. We estimated minimal and meaningful change from the perspective of RA patients, physicians, and using a disease activity index.

Methods

Data were from 3- and 6-month visits of adults with early RA enrolled in the Canadian Early Arthritis Cohort. Participants completed the RA-FQ, the Patient Global Assessment of RA, and Patient Global Change Impression at consecutive visits. Rheumatologists recorded joint counts and MD Global. Clinical Disease Activity Index (CDAI) scores were computed. We compared mean RA-FQ change across categories using patients, physicians, and CDAI anchors.

Results

The 808 adults were mostly white (84%) women (71%) with a mean age of 55 and moderate-high disease activity (85%) at enrollment. At V2, 79% of patients classified their RA as changed; 59% were better and 20% were worse. Patients reporting they were a lot worse had a mean RA-FQ increase of 8.9 points whereas those who were a lot better had a -6.0 decrease. Minimal worsening and improvement were associated with a mean 4.7 and -1.8 change in RA-FQ, respectively, while patients rating their RA unchanged had stable scores. Physician and CDAI classified more patients as worse than patients, and minimal and meaningful RA-FQ thresholds differed by group.

Conclusion

Thresholds to identify meaningful change vary by anchor used. These data offer new evidence demonstrating robust psychometric properties of the RA-FQ and offer guidance about improvement or worsening, supporting its use in RA care, research and decision-making.

Health Economics & Outcomes Research

Rheumatology

Rheumatoid Arthritis Flare Questionnaire (RA-FQ)

improvement

Patient Global Assessment of RA

Patient Global Change Impression

The RA-FQ is a new tool co-created with patients to identify current disease activity in RA. Understanding the score change that reflects minimal and meaningful change is essential to understanding whether RA control is improving or worsening between two timepoints. Different anchors can be used to identifying meaningful change; treat-to-target strategies for managing RA emphasize physician and CDAI anchors. In this study, we compared thresholds that represent minimal and meaningful change using patients, physicians, and a composite disease activity index (CDAI) anchors. We found that change in scores representing minimal and meaningful improvement and worsening differed by anchor and direction of change. Patients were most likely to view their RA as improving and required larger changes for worsening compared to physician and CDAI; CDAI was least sensitive to improving RA. Understanding where there is overlap and disagreement when assessing changing RA status is important to patient-physician communication, interpreting the results of drug changes and new medications, and optimizing treatment and outcomes.

Rheumatoid arthritis (RA) affects up to 1% of adults, and is three times more common in women than men [1, 2]. The joint pain, swelling, and damage associated with RA greatly affects physical, emotional, and social health, and significantly impairs health-related quality of life (HRQL) [3-5]. Up to half of people with RA are unable to work due to disability after 10 years, a trend that remains unchanged despite the introduction of biologics 20 years ago [6]. One reason for this may be that people with RA experience both unexpected temporary and sustained increases (flares) in RA activity which vary in frequency, severity, and impact. Disease flares are periods of increased inflammatory disease, and have been defined as a cluster of symptoms of sufficient duration and intensity to require a review and possible change of existing treatment [7]. RA flares are important to identify and manage as they contribute to joint damage, disability, and increased cardiovascular disease that greatly impact health-related quality of life (HRQL) in people with RA [8].

Current treatment guidelines emphasize tight disease control and a “treat-to-target approach” in RA with remission or low disease activity as the treatment goal to reduce long-term disability and improve HRQL [9]. To achieve this, RA is closely monitored, a patient’s level of disease activity is calculated using a validated algorithm, and therapies adjusted as needed until this target is reached. However, reliable monitoring of disease activity in real time is hampered by the lack of a gold standard and use of different approaches and tools of varying complexity that can yield different results.

Patient reported outcomes (PROs) have been a cornerstone of RA disease monitoring since the late 1970s when the Patient Global Assessment (PGA), an 11-point numeric rating scale (NRS) was introduced [10]. In clinical practice, physicians monitor the number of swollen and tender joints and ask patients about their pain and disability to form an overall impression of disease activity (MD Global Assessment [MDGA]). Joint counts and MDGA constitute clinician reported outcomes (CLIN-ROs). In the early 1990s, researchers created a composite disease activity index by combining the PGA with MD joint counts and a biomarker (first erythrocyte sedimentation rate and later C-reactive protein) in a weighted algorithm to create the Disease Activity Score (DAS) [11]. Thresholds were identified to classify RA patients as being in remission, low, moderate or high disease activity states and better match treatment to current disease activity. The inclusion of biomarkers largely relegated the DAS to research given the need for laboratory results and complex calculation. More recently, the PGA and CLIN-ROs (i.e., joint counts and MDGA) were combined into a simple summative score known as the Clinical Disease Activity Index (CDAI). CDAI can be easily calculated during clinical encounters to inform treatment decision-making at the visit [12]. However, growing evidence suggests that CLIN-ROs overestimate improvement and underestimate progression of disease activity [13]. This prompted international members of the Outcome Measures in Rheumatology (OMERACT) RA Flare Working Group to co-create with patients a new composite PRO of RA disease activity – the RA Flare Questionnaire (RA-FQ) [14]. In the RA-FQ, patients rate RA symptoms and function over the past week [15]. International patient focus groups were conducted to identify relevant domains [16] and items were refined through a Delphi exercise that included several hundred patients, clinicians, researchers and other stakeholders [17]. The RA-FQ has been shown to be valid, reliable, and responsive to change in international clinical trials and observational studies [17, 18]. To increase utility of any questionnaire, evidence supporting score interpretation is needed. Meaningful change, sometimes referred to as the minimally clinically important difference (MCID; a lot better or a lot worse). In RA, MCIDs help to establish if treatment is sufficiently controlling RA inflammation to allow patients to feel and function better in everyday life. While the patient perspective of whether change is meaningful is considered the most relevant, in clinical practice physician assessment and CDAI level often drive treatment decisions given the emphasis on treat-to-target for RA management.

Our goal was to identify thresholds for minimal and meaningful within-person change in the RA-FQ. We compared thresholds derived from patients, treating physicians, and in relation to changes in CDAI levels. We also explored score changes in response to both improving and worsening RA as others have noted these may vary based on the direction of change [19].

Design

We used data from two consecutive visits (the 3- and 6-month follow-ups) of RA patients enrolled in the Canadian Early Arthritis Cohort (CATCH). CATCH is a prospective observational inception cohort of adults with early RA who are enrolled around the time of diagnosis and beginning treatment and followed at pre-determined intervals [20]. These time points were chosen as 3 months is a typical time frame used to judge the effectiveness of initial RA treatments, and it also allowed for sufficient variation in patients experiencing improvement and worsening at the second visit. Ethics approval was previously obtained at each of the 16 participating CATCH sites across Canada. Written informed consent was obtained from participants at enrollment, and the study was conducted in accordance with the Declaration of Helsinki.

Participants

Participants were adults 18+ years of age who were enrolled in the Canadian Early Arthritis Cohort (CATCH) from 2011 when the RA-FQ was implemented to March 2017 (when the Patient Global Impression of Change (PGIC) was discontinued) and who had RA-FQ and PGIC scores available at the 3 and 6 month visits.

Outcomes

Rheumatoid Arthritis Flare Questionnaire (RA-FQ). The RA-FQ contains 5 items that ask respondents to rate their pain, physical function, stiffness, fatigue, and participation over the past week using 11-point NRS (0 = none to 10 = severe). The RA-FQ was co-created with patients and in accordance with best practice methods [21-23]. The conceptual framework evolved from international focus groups with RA patients [16] and Delphi exercises with RA patients, clinicians, researchers, and other stakeholders [24]. Psychometric performance of the RA-FQ was examined in multiple international clinical trials and longitudinal observational studies, including CATCH [18, 25, 26]. Rasch analysis showed acceptable fit to the Rasch model, with items and people covering a broad measurement continuum with appropriate targeting of items to people, ordered thresholds, minimal differential item functioning by language, sex, or age [17]. A summative score across items is defensible, yielding an interval score (0-50) where higher scores reflect worsening disease activity.

Patient Reported Outcomes (PROs). The Patient Global Assessment is an 11-point NRS that asks “Considering all the ways arthritis affects you, how well are you doing today” with responses ranging from 0=very well to 10=very poorly [27]. At the second visit, they also completed an 5-point RA Global Impression of Change rating (“Compared to your last visit would you say that your arthritis is: a lot better, a little better, the same, a little worse, a lot worse”).

Physician-reported outcomes (CLIN-ROs). At each visit, physicians counted the number of swollen and tender joints (from a total of 28). Swollen joint counts represent the most widely used “objective” indicator of inflammatory activity reflecting inflamed synovial tissue, while tender joint counts are thought to indicate the patient’s level of pain [28]. Among the 7 RA Core Data set measures in the American College of Rheumatology for clinical trials, joint counts are weighted heavily compared to the other five Core Data Set measures [29]. The MD Global Assessment is a 0-10 rating scale with higher scores representing higher RA disease activity.

Clinical disease activity. The Clinical Disease Activity Index (CDAI), an index that sums tender and swollen joint counts and patient and MD Global Assessments, was calculated for each visit [30]. CDAI is widely used to classify RA disease activity level (remission ≤2.8, low 2.8-≤10, moderate 10-≤22, and high >22) and help guide treatment decisions. We used CDAI and not indices that include biomarkers as DAS-28-CRP and SDAI as scores on all of measures are moderately-highly correlated [31] and CDAI is widely used in care as part of shared decision making [32].

Statistical Methods

Descriptive statistics were calculated to summarize patient characteristics and outcomes. Our overall approach was to calculate the mean within-patient change scores (difference) between the two visits from three perspectives – patients, physicians, and between CDAI categories. Among patients, meaningful change was based on mean RA-FQ differences for patient ratings themselves a lot better or a lot worse, whereas minimal differences reflected mean differences associated with patients rating themselves a little better or a little worse [33]. We calculated mean difference in traditional RA clinical indicators of disease activity (CDAI, Patient Global, swollen and tender joints) across change categories.

To identify meaningful and minimal RA-FQ score changes from the physician perspective, we calculated the difference in the MD global, a static global impression of (RA) severity (0-10) obtained at the two visits. This method was recently recommended by the U.S. FDA as it may be less likely to recall error [23]. We are aware of only one study by an OMERACT committee that has identified clinically important changes in physician global assessments of RA activity using a consensus process among rheumatologists, methodologists, and other stakeholders [34]. They reported a change of 2.3 in an 11-point NRS represented clinically important changes in RA patients, and 1.4 in clinical trials of new therapeutic agents [34]. As 1- and 2- point change in patient global assessments have been associated with being “slightly better” and “much better” in patients in studies of musculoskeletal pain [35] including RA [36], we used these thresholds to represent minimal and meaningful change from the physician perspective. This approach is also consistent with the widely used ACR20 criteria (i.e., 20% improvement of core outcomes including physician global) to define improvement.

We also estimated score changes associated with a 1 or 2 category change in CDAI levels, as change in CDAI level is a common secondary outcome in RA research. Finally, to visually examine the separation between change among groups across all levels of the RA-FQ change scores, we generated empirical cumulative distribution function (eCDF) curves by patient, MD and CDAI change categories. Analyses were completed using SAS (v. 9.4) and plots were constructed using R (v 3.4.3).

Participants

Participants were 808 middle-aged adults who were mostly white (84%) and female (71%). At enrolment, 85% were classified as having moderate or high RA disease activity (Table 1). At 3 months, about half (43%) were in moderate-high disease activity, 39% in Low Disease Activity and 18% in Remission; by 6 months 65% were in Remission or Low Disease Activity. Mean RA-FQ scores at enrollment through the 3- and 6- months visits were 25.5, 15.6, and 13.9, respectively.

Patient Perspective: Minimal and Meaningful Change

At the second visit, 59% of patients rated their RA disease activity as improved, 20% worse, and 21% the same as the previous visit. Mean changes in RA-FQ total score and component scores by patient change categories are shown in Table 2. RA-FQ scores changed in the expected directions, and the magnitude of change was similar within components. Worsening disease activity was associated with larger mean changes in RA-FQ scores than improvement. Similar patterns were evident in changes in traditional disease activity indicators (CDAI, patient and MD global assessments, and tender and swollen joint counts). Cumulative distribution function (CDF) curves by patient RA change categories, shown in Figure 1, suggest that throughout most of the range of change scores, there was clear separation of curves supporting discrimination among categories by patients, with larger spread across scores related to worsening.

Physician Ratings: Minimal and Meaningful Change

At the second visit, 44% of physicians rated RA disease activity as improved, 24% had worsened, and 32% rated it the same as the previous visit. As compared with patients, physicians were more likely to classify patients as the same or worse. Mean changes in RA-FQ total score and component scores by physician change categories 0, |1|, |≥2| are shown in Table 3. RA-FQ scores changed in the expected directions, and the magnitude of change was similar within components. In contrast to patients, improvement was associated with larger mean changes in RA-FQ scores than worsening disease activity. CDF curves by physician RA change categories shown in Figure 2, suggest that throughout much of the range of scores, physicians had difficulty discriminating between patients who were the same or a little worse at the second visit.

CDAI Categories: Minimal and Meaningful Change

Scores changes in relation to CDAI change by |1| or |≥2| categories are shown in Tables 4 and 5. At the second visit, 56%remained within the same, 29% had improved and 15% had worsened by one or more CDAI categories. The thresholds for a 1 category change were similar to patient and physician thresholds representing meaningful change. Improvement was associated with a larger mean RA-FQ score change for patients starting with moderate-high disease activity where a two-category change (i.e., to remission) was nearly double that for 1 category (to low disease activity). Similarly, for worsening disease activity, patients starting in low disease activity had numerically larger mean changes than individuals who started in remission at the first visit. Across disease activity levels, patients whose CDAI category was unchanged had similar scores at both visits. CDF curves illustrate considerable variability within and wide separation of curves between change categories among patients within the same CDAI category at both visits (see Figure 3).

The RA-FQ is one of only a few RA measures that can summarize the extent and complexity of the broad impact of RA symptoms on how people feel and function as a single number. We have previously shown the RA-FQ is sensitive to changes in RA symptoms and function [17]. This is the first study to identify within-person minimal and meaningful change scores for the RA-FQ. Values derived from patients are contrasted with those of treating physicians and in relation to change in CDAI levels, the keystone of the treat-to-target approach.

Our results hold implications for researchers, methodologists, and clinicians. First, the thresholds for minimal and meaningful change varied depending on the anchor used. This in turn impacted the proportion of patients who were classified as having improved or worsened. For example, among patients, 59% classified themselves as better or much better, whereas physicians classified 44% as improved; 29% had improved at least one CDAI category. Patients were least likely to be classified as worse using a change in CDAI level (15%) and most likely to be seen as having worsened disease activity by physicians (24%), with patients landing in between (20%). Second, patients had larger thresholds for defining worsening RA whereas physicians had larger thresholds for defining improvement. In effect, physicians were looking for more resolution of symptoms/impacts than patients to be confident RA was much better (score change -7.3 vs. -6.0, respectively), but had a lower threshold to judge symptoms and RA activity as much worse (score change +5.7 vs. +8.9). Establishing the overlap among thresholds that both patients and physicians consider meaningful and worthwhile holds important implications for clinical trials, comparative effectiveness research, and optimal management of RA in care settings. Discrepancies in thresholds are also important to identify as these may contribute to patient dissatisfaction with care, treatment non-adherence, poorer disease outcomes, and increase in health care utilization and costs [37].

Two recent systematic reviews concluded that patient and physician perspectives regarding RA disease status often significantly differ [34, 35]; when discordance exists, up to 79% of patients generally perceived their disease as being more active than their treating physician. In part this is because pain, disability and for fatigue may feature prominently in patient assessments whereas joint counts and acute phase reactants influence physician ratings [38]. Our data suggest assessments of change also differ where patients are more likely to perceive improvement than providers. Further, among patients the RA-FQ score change associated with meaningful improvement vs. worsening differed. Meaningful improvement for patients was associated with 6-point decrease in RA-FQ whereas meaningful worsening was associated with a 9-point increase; similar patterns were seen for minimal worsening or improvement. This suggests that either patients may be more vigilant to identify improvement, or perhaps that the threshold for disease to be perceived as worse is higher than that for improvement. At the same time, the graphical displays suggest patients appear better able to identify worsening RA at a finer level than improvement.

The pattern observed with physicians differed from patients. First, clinicians classified fewer patients as improved between visits with more classified as the same or a lot worse. In contrast to patients, on average, a numerically larger RA-FQ score change was associated with improvement versus worsening. It is interesting to note that the change in joint count (and CDAI) also was higher for patients rated as a lot worse or a lot better as compared with patient-based assessments of change. This is likely due to the fact that physicians view joint counts as a reliable indicator of inflammatory disease activity. Physicians also appeared less able to differentiate patients who were a little worse from those who had the same level of disease activity at the previous visit. Changes in RA-FQ scores in relation to changes in CDAI levels were robust and similar in the direction of worsening or improvement; scores were also stable between visits in participants whose CDAI level was unchanged. Overall, triangulating meaningful RA-FQ change scores among patients and clinicians ratings with RA disease activity classifications yielded robust results adding further evidence that disease activity as captured by the RA-FQ represents a well-defined concept that can be reliably measured. Evaluation of within-patient change establishes both the responsiveness of the RA-FQ and patient-relevant thresholds of change. The use of multiple anchors also demonstrates that benefits of a treatment as judged by physicians and using a treat-to-target approach are also perceived as clinically meaningful by patients. Conversely, both physicians and CDAI change may suggest patients are deteriorating before patients perceive their RA has changed.

While thresholds identified at the group level can inform policy and comparisons between different treatments, individual-level thresholds are necessary to inform clinical treatment decisions [39]. Examining within-patient change visually also can offer new information. The CDF curves presenting a continuous view of the proportion of patients within each category experiencing scores changes. The curves suggest that the vast majority of patients who said they were a lot better or a lot worse were relatively distinct from those reporting they were a little better or worse or the same throughout most of the continuum of score changes. We also evaluated changes in traditional RA clinical indicators and observed that changes in mean CDAI and joint counts were largest when using the CDAI categories followed by physician change categories. This is not surprising since joint counts and physician global impression of disease activity constitute 3 of 4 components of the CDAI. When physicians rated patients as a lot better CDAI decreased by 12 points; similarly, when patients were rated as a lot worse, CDAI increased 11 points. The change in CDAI scores we observed were notably larger than MCIDs identified for CDAI (i.e., -6 points when starting with moderate disease activity, and 2 points for worsening when starting in remission/low disease activity) in one study [40]. The change in CDAI when using patient reports of feeling a lot better was -5 points, whereas a lot worse was 7 points.

These findings add to the growing body of evidence supporting strong measurement properties (reliability, construct, content and criterion validity, responsiveness) of the RA-FQ. One reason may be that the RA-FQ was developed in accordance with best practice methods [21-23]. The conceptual framework of the RA-FQ evolved from international focus groups with RA patients [16] and Delphi exercise with RA patients, clinicians, researchers, and other stakeholders [24]. Performance of the RA-FQ was examined in multiple international clinical trials and longitudinal observational studies, including CATCH [18, 25, 26]. Rasch analysis showed acceptable fit to the Rasch model, with items and people covering a broad measurement continuum with appropriate targeting of items to people, ordered thresholds, minimal differential item functioning by language, sex, or age [17].

The strengths of this study include the use of multiple anchors to identify meaningful change in a well characterized and diverse real-world sample of people with RA. Patients had been recently diagnosed and had started disease modifying treatments which often take several months to reach maximum therapeutic effectiveness; only 21% rated their RA as the same at both visits. We used patient and clinician impressions and CDAI, a disease activity index that combines these perspectives with joint counts is viewed as “the most specific quantitative clinical measure” in RA research and care [29]. In this early RA cohort, many patients reported a change in their RA status at the second visit, and these reports were supported by changes in the standardized indicators of RA disease activity recommended in international clinical practice guidelines [41, 42] and used in clinical trials as part of a treat-to-target approach. We also visualized scores associated with patient change categories with CDFs, a technique recently recommended by the US Food and Drug Administration [43]. There are limitations. We evaluated thresholds in relation to disease activity levels at the first visit for CDAI only. In patients with established RA, different thresholds may be obtained for improvement and worsening as patients gain more experience with disease flares. Meaningful change thresholds may also be influenced by the presence of painful comorbidities such as osteoarthritis.

In summary, in a large diverse cohort of real-world patients recently diagnosed with RA, we compared multiple anchors to derive meaningful and minimal changes from the perspective of patients, clinicians, and using CDAI, a disease activity index used widely as part of treat-to-target approaches. We found similar patterns overall, but some important differences in the actual value of thresholds. Further, our results suggest that the benchmarks used to classify RA treatments as a success or failure differ depending on the anchor used. These findings contribute new information that can be used to interpret RA-FQ scores in research and patient care. They also demonstrate that proportion of patients classified as “responders” to a new treatment could vary considerably depending on the anchor used to define meaningful change.

Acknowledgements: The authors wish to acknowledge and thank CATCH Principal Investigators (M Baron, I Colmegna, S Fallavollita, D Haaland, B Haraoui, S Jamal, R Joshi, B Nair, P Panopoulos, L Rubin, E Villeneuve, M Zummer) and CATCH participants for their contributions.

Compliance with Ethical Standards

Funding - The Canadian Early Arthritis Cohort (CATCH) study was independently designed and is implemented by the investigators. It has been financially supported through unrestricted research grants from Amgen, Pfizer Canada, AbbVie, Medexus, Inc., Eli Lilly, Merck Canada, Sandoz Canada Biopharmaceuticals, Hoffmann‐La Roche, Janssen Biotech, UCB Canada, Bristol Myers Squibb Canada, and Sanofi Genzyme. The funders played no part in planning or conducting the study.

Conflicts of interest/Competing interests – All authors declare no conflicts of interest relevant to this article.

Availability of data and material – available upon request via email to corresponding author

Code availability – not applicable

Authors’ contributions - The CATCH study was conceived by VB. VB, SB, JP, GB, CH, GH, LB, EK, CT, DT serve on the CATCH Steering committee; OS serves as Scientific Director. SB, VB, and CO conceived the study design, and MFV completed the statistical analyses under the supervision of OS and SB. SB and CO wrote the first draft of the manuscript. All critically revised the manuscript and approved the final published version.

Ethics approval - Ethics approval was previously obtained from each of the 16 participating CATCH sites across Canada. The study was conducted in accordance with the Declaration of Helsinki.

Consent to participate - Written informed consent was obtained from participants at study enrollment

Consent for publication - not applicable

Myasoedova, E., Crowson, C. S., Kremers, H. M., Therneau, T. M., & Gabriel, S. E. (2010). Is the incidence of rheumatoid arthritis rising?: results from Olmsted County, Minnesota, 1955–2007. Arthritis Rheum, 62, 1576–1582.
Myasoedova, E., Davis, J. M. 3rd, Crowson, C. S., & Gabriel, S. E. (2010). Epidemiology of rheumatoid arthritis: rheumatoid arthritis and mortality. Curr Rheumatol Rep, 12, 379–385.
Bartlett, S. J., Hewlett, S., Bingham, C. O. 3rd, Woodworth, T. G., Alten, R., Pohl, C., Choy, E. H., Sanderson, T., Boonen, A., Bykerk, V., et al. (2012). Identifying core domains to assess flare in rheumatoid arthritis: an OMERACT international patient and provider combined Delphi consensus. Ann Rheum Dis, 71, 1855–1860.
Sanderson, T., Morris, M., Calnan, M., Richards, P., & Hewlett, S. (2010). Patient perspective of measuring treatment efficacy: the rheumatoid arthritis patient priorities for pharmacologic interventions outcomes. Arthritis Care Res (Hoboken), 62, 647–656.
Gossec, L., Dougados, M., Rincheval, N., Balanescu, A., Boumpas, D. T., Canadelo, S., Carmona, L., Daures, J. P., de Wit, M., Dijkmans, B. A., et al. (2009). Elaboration of the preliminary Rheumatoid Arthritis Impact of Disease (RAID) score: a EULAR initiative. Ann Rheum Dis, 68, 1680–1685.
Ward, M. M.: Trends in permanent work disability associated with rheumatoid arthritis in the United States, 1999–2015. Arthritis Care Res (Hoboken) 2021.
Bingham, C. O. III, Pohl, C., Woodworth, T. G., Hewlett, S. E., May, J. E., Rahman, M. U., Witter, J. P., Furst, D. E., Strand, C. V., Boers, M., & Alten, R. E. (2009). Developing a standardized definition for disease "flare" in rheumatoid arthritis (OMERACT 9 Special Interest Group). J Rheumatol, 36, 2335–2341.
Myasoedova, E., Chandran, A., Ilhan, B., Major, B. T., Michet, C. J., Matteson, E. L., & Crowson, C. S. (2016). The role of rheumatoid arthritis (RA) flare and cumulative burden of RA severity in the risk of cardiovascular disease. Ann Rheum Dis, 75, 560–565.
Smolen, J. S., Breedveld, F. C., Burmester, G. R., Bykerk, V., Dougados, M., Emery, P., Kvien, T. K., Navarro-Compan, M. V., Oliver, S., Schoels, M., et al. (2016). Treating rheumatoid arthritis to target: 2014 update of the recommendations of an international task force. Ann Rheum Dis, 75, 3–15.
Nikiphorou, E., Radner, H., Chatzidionysiou, K., Desthieux, C., Zabalan, C., van Eijk-Hustings, Y., Dixon, W. G., Hyrich, K. L., Askling, J., & Gossec, L. (2016). Patient global assessment in measuring disease activity in rheumatoid arthritis: a review of the literature. Arthritis Research & Therapy, 18, 251.
Prevoo, M. L., 't Hof, M. A., Kuper, H. H., van Leeuwen, M. A., van De Putte, L. B., & van Riel, P. L. (1995). Modified disease activity scores that include twenty-eight-joint counts. Development and validation in a prospective longitudinal study of patients with rheumatoid arthritis. Arthritis Rheum, 38, 44–48.
Aletaha, D., & Smolen, J. (2005). The Simplified Disease Activity Index (SDAI) and the Clinical Disease Activity Index (CDAI): a review of their usefulness and validity in rheumatoid arthritis. Clin Exp Rheumatol, 23, S100–S108.
Oderda, G. M., & Balfe, L. M. (2011). Comparative effectiveness research (CER): a summary of AHRQ's CER on therapies for rheumatoid arthritis. J ManagCare Pharm, 17, S19–S24.
Bingham, C. O. 3rd, Alten, R., Bartlett, S. J., Bykerk, V. P., Brooks, P. M., Choy, E., Christensen, R., Furst, D. E., Hewlett, S. E., Leong, A., et al. (2011). Identifying preliminary domains to detect and measure rheumatoid arthritis flares: report of the OMERACT 10 RA Flare Workshop. J Rheumatol, 38, 1751–1758.
Bykerk, V. P., Lie, E., Bartlett, S. J., Alten, R., Boonen, A., Christensen, R., Furst, D. E., Hewlett, S., Leong, A. L., Lyddiatt, A., et al. (2014). Establishing a core domain set to measure rheumatoid arthritis flares: report of the OMERACT 11 RA flare Workshop. J Rheumatol, 41, 799–809.
Hewlett, S., Sanderson, T., May, J., Alten, R., Bingham, C. O. 3rd, Cross, M., March, L., Pohl, C., Woodworth, T., & Bartlett, S. J. (2012). 'I'm hurting, I want to kill myself': rheumatoid arthritis flare is more than a high joint count–an international patient perspective on flare where medical help is sought. Rheumatology, 51, 69–76.
Bartlett, S. J., Barbic, S. P., Bykerk, V. P., Choy, E. H., Alten, R., Christensen, R., den Broeder, A., Fautrel, B., Furst, D. E., Guillemin, F., et al. (2017). Content and Construct Validity, Reliability, and Responsiveness of the Rheumatoid Arthritis Flare Questionnaire: OMERACT 2016 Workshop Report. J Rheumatol, 44, 1536–1543.
Bartlett, S. J., Bykerk, V. P., Cooksey, R., Choy, E. H., Alten, R., Christensen, R., Furst, D. E., Guillemin, F., Halls, S., Hewlett, S., et al. (2015). Feasibility and Domain Validation of Rheumatoid Arthritis (RA) Flare Core Domain Set: Report of the OMERACT 2014 RA Flare Group Plenary. J Rheumatol, 42, 2185–2189.
Strand, V., & Singh, J. A. (2010). Newer biological agents in rheumatoid arthritis: impact on health-related quality of life and productivity. Drugs, 70, 121–145.
Bykerk, V. P., Jamal, S., Boire, G., Hitchon, C. A., Haraoui, B., Pope, J. E., Thorne, J. C., Sun, Y., & Keystone, E. C. (2012). The Canadian Early Arthritis Cohort (CATCH): Patients with New-onset Synovitis Meeting the 2010 ACR/EULAR Classification Criteria But Not the 1987 ACR Classification Criteria Present with Less Severe Disease Activity. J Rheumatol, 39, 2071–2080.
Patrick, D. L., Burke, L. B., Gwaltney, C. J., Leidy, N. K., Martin, M. L., Molsen, E., & Ring, L. (2011). Content validity–establishing and reporting the evidence in newly developed patient-reported outcomes (PRO) instruments for medical product evaluation: ISPOR PRO Good Research Practices Task Force report: part 2–assessing respondent understanding. Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research, 14, 978–988.
Reeve, B. B., Wyrwich, K. W., Wu, A. W., Velikova, G., Terwee, C. B., Snyder, C. F., Schwartz, C., Revicki, D. A., Moinpour, C. M., McLeod, L. D., et al. (2013). ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Qual Life Res, 22, 1889–1905.
US Food and Drug Administration. (2020). Principles for Selecting, Developing, Modifying, and Adapting Patient-Reported Outcome Instruments for Use in Medical Device Evaluation; Draft Guidance for Industry, Food and Drug Administration Staff, and Other Stakeholders; Availability. In vol. FDA-2020-D-1564 (pp. 53820–53822). Food and Drug Administration, health and Human Services. 53820–53822 (53823 pages).
Bartlett, S. J., Hewlett, S., Bingham, C. O. 3rd, Woodworth, T. G., Alten, R., Pohl, C., Choy, E. H., Sanderson, T., Boonen, A., Bykerk, V., et al. (2012). Identifying core domains to assess flare in rheumatoid arthritis: an OMERACT international patient and provider combined Delphi consensus. Ann Rheum Dis, 71, 1855–1860.
Bartlett, S. J., Bykerk, V. P., Fautrel, B., Guillemin, F., den Broeder, A., Alten, R., Christensen, R., Choy, E., Furst, D., Hewlett, S., et al. (2017). THE RA FLARE QUESTIONNAIRE (RA-FQ) IS RESPONSIVE TO CHANGE IN RA SYMPTOMS AND IMPACTS IN CLINICAL AND OBSERVATIONAL TRIALS. Annals of the Rheumatic Diseases, 76, 470–470.
Bykerk, V. P., Bingham, C. O., Choy, E. H., Lin, D., Alten, R., Christensen, R., Furst, D. E., Hewlett, S., Leong, A., March, L., et al. (2016). Identifying flares in rheumatoid arthritis: reliability and construct validation of the OMERACT RA Flare Core Domain Set. RMD Open, 2, e000225.
Nikiphorou, E., Radner, H., Chatzidionysiou, K., Desthieux, C., Zabalan, C., van Eijk-Hustings, Y., Dixon, W. G., Hyrich, K. L., Askling, J., & Gossec, L. (2016). Patient global assessment in measuring disease activity in rheumatoid arthritis: a review of the literature. Arthritis Research & Therapy, 18, 251.
Scott, I. C., & Scott, D. L. (2014). Joint counts in inflammatory arthritis. Clin Exp Rheumatol, 32, S–S7.
Sokka, T., & Pincus, T. (2009). Joint counts to assess rheumatoid arthritis for clinical research and usual clinical care: advantages and limitations. Rheum Dis Clin North Am, 35, 713–722, v-vi.
Aletaha, D., & Smolen, J. S. (2007). The Simplified Disease Activity Index (SDAI) and Clinical Disease Activity Index (CDAI) to monitor patients in standard clinical care. Best Pract Res Clin Rheumatol, 21, 663–675.
Dhaon, P., Das, S. K., Srivastava, R., & Dhakad, U. (2018). Performances of Clinical Disease Activity Index (CDAI) and Simplified Disease Activity Index (SDAI) appear to be better than the gold standard Disease Assessment Score (DAS-28-CRP) to assess rheumatoid arthritis patients. International Journal of Rheumatic Diseases, 21, 1933–1939.
Yun, H., Chen, L., Xie, F., Patel, H., Boytsov, N., Zhang, X., & Curtis, J. R. (2020). Do Patients With Moderate or High Disease Activity Escalate Rheumatoid Arthritis Therapy According to Treat-to-Target Principles? Results From the Rheumatology Informatics System for Effectiveness Registry of the American College of Rheumatology. Arthritis Care & Research, 72, 166–175.
Revicki, D., Hays, R. D., Cella, D., & Sloan, J. (2008). Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J Clin Epidemiol, 61, 102–109.
Goldsmith, C. H., Boers, M., Bombardier, C., & Tugwell, P. (1993). Criteria for clinically important changes in outcomes: development, scoring and evaluation of rheumatoid arthritis patient and trial profiles. OMERACT Committee. J Rheumatol, 20, 561–565.
Farrar, J. T., Young, J. P. Jr., LaMoreaux, L., Werth, J. L., & Poole, M. R. (2001). Clinical importance of changes in chronic pain intensity measured on an 11-point numerical pain rating scale. Pain, 94, 149–158.
Salaffi, F., Stancati, A., Silvestri, C. A., Ciapetti, A., & Grassi, W. (2004). Minimal clinically important changes in chronic musculoskeletal pain intensity measured on a numerical rating scale. Eur J Pain, 8, 283–291.
Starfield, B., Wray, C., Hess, K., Gross, R., Birk, P. S., & D'Lugoff, B. C. (1981). The influence of patient-practitioner agreement on outcome of care. Am J Public Health, 71, 127–131.
Desthieux, C., Hermet, A., Granger, B., Fautrel, B., & Gossec, L. (2016). Patient-Physician Discordance in Global Assessment in Rheumatoid Arthritis: A Systematic Literature Review With Meta-Analysis. Arthritis Care Res (Hoboken), 68, 1767–1773.
Beaton, D. E., Bombardier, C., Katz, J. N., Wright, J. G., Wells, G., Boers, M., Strand, V., & Shea, B. (2001). Looking for important change/differences in studies of responsiveness. OMERACT MCID Working Group. Outcome Measures in Rheumatology. Minimal Clinically Important Difference. J Rheumatol, 28, 400–405.
Curtis, J. R., Yang, S., Chen, L., Pope, J. E., Keystone, E. C., Haraoui, B., Boire, G., Thorne, J. C., Tin, D., Hitchon, C. A., et al. (2015). Determining the Minimally Important Difference in the Clinical Disease Activity Index for Improvement and Worsening in Early Rheumatoid Arthritis Patients. Arthritis Care Res (Hoboken), 67, 1345–1353.
Anderson, J., Caplan, L., Yazdany, J., Robbins, M. L., Neogi, T., Michaud, K., Saag, K. G., O'Dell, J. R., & Kazi, S. (2012). Rheumatoid arthritis disease activity measures: American College of Rheumatology recommendations for use in clinical practice. Arthritis Care Res (Hoboken), 64, 640–647.
Bykerk, V. P., Akhavan, P., Hazlewood, G. S., Schieir, O., Dooley, A., Haraoui, B., Khraishi, M., Leclercq, S. A., Legare, J., Mosher, D. P., et al. (2012). Canadian Rheumatology Association recommendations for pharmacological management of rheumatoid arthritis with traditional and biologic disease-modifying antirheumatic drugs. J Rheumatol, 39, 1559–1582.
U.S. Food and Drug Administration. Discussion Document for Patient-Focused Drug Development Public Workshop on Guidance 3: Select, Develop, or Modify Fit-for-purpose Clinical Outcome Assessments. In Methods to Identify What is Important to Patients & Select, Develop or Modify Fit-for-Purpose Clinical Outcomes Assessments Rockville, MD; Workshop Date October 15–16, 2018.

Table 1. Characteristics of participants at enrolment (n=808).

	Mean (SD) or n (%)
Age (years)	55 (15)
Women (%)	575 (71%)
White race	679 (84%)
Education* > High school (%)	508 (63%)
Patient Global Disease Activity (0-10)	5.8 (2.9)
Swollen Joints (0-28)	6.9 (5.9)
Tender Joints (0-28)	7.8 (6.4)
MD Global Assessment (0-10)	4.8 (2.5)
Clinical Disease Activity Index (0-76)	25.3 (13.9)
Remission	10 (1%)
Low disease activity	98 (12%)
Moderate disease activity	263 (33%)
High disease activity	421 (52%)
Missing	16 (2%)

*missing 20 (2%)

Table 2. Change in RA-FQ scores at second visit by patient global impression of change categories.

Domain	A Lot Better (N=346; 43%)			A Little Better (N=132; 16%)			The Same (N=174; 21%)			A Little Worse (N=94; 12%)			A Lot Worse (N=62; 8%)
Domain	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD
RA-FQ Total (0-50)	-6.0	(-7.1, -4.9)	10.3	-1.8	(-3.2, -0.3)	8.4	-0.1	(-1.3, 1.1)	8.1	4.7	(2.9, 6.6)	9.1	8.9	(5.1, 12.7)	15.0
RA-FQ Components (0-10)
Pain	-1.2	(-1.4, -0.9)	2.4	-0.4	(-0.8, 0.0)	2.3	0.0	(-0.2, 0.3)	1.8	1.3	(0.8, 1.7)	2.2	2.0	(1.2, 2.9)	3.3
Physical Function	-1.3	(-1.6, -1.1)	2.4	-0.3	(-0.6, 0.1)	2.1	0.0	(-0.3, 0.3)	2.1	0.9	(0.4, 1.4)	2.4	1.8	(0.8, 2.7)	3.7
Fatigue	-1.1	(-1.4, -0.8)	2.6	-0.4	(-0.7, 0.0)	1.9	0.0	(-0.3, 0.3)	2.1	0.7	(0.3, 1.1)	2.1	1.3	(0.5, 2.1)	3.2
Stiffness	-1.1	(-1.4, -0.9)	2.4	-0.4	(-0.7, 0.0)	2.0	-0.1	(-0.4, 0.2)	2.0	1.1	(0.6, 1.5)	2.2	1.8	(1.0, 2.7)	3.3
Participation	-1.2	(-1.5, -1.0)	2.5	-0.1	(-0.5, 0.3)	2.1	-0.1	(-0.4, 0.2)	2.2	0.8	(0.4, 1.3)	2.2	2.0	(1.1, 2.8)	3.4
Disease Activity Indicators
CDAI*	-5.3	(-6.3, -4.3)	9.1	-3.3	(-5.4, -1.3)	11.5	-0.8	(-2.0, 0.5)	8.1	1.7	(-0.1, 3.5)	8.8	6.8	(3.7, 9.8)	12.0
Patient Global (0-10)	-1.3	(-1.5, -1.0)	2.7	-0.5	(-0.9, -0.1)	2.1	-0.1	(-0.4, 0.2)	2.1	1.3	(0.8, 1.8)	2.4	2.9	(2.1, 3.6)	3.1
MD Global (0-10)	-1.2	(-1.4, -1.0)	1.9	-0.7	(-1.1, -0.3)	-0.1	-0.1	(-0.4, 0.2)	1.9	0.1	(-0.3, 0.5)	2.8	0.7	(0.0, 1.5)	2.8
Swollen Joints (28)	-1.4	(-1.7, 1.0)	3.2	-1.0	(-1.8, -0.2)	4.6	-0.4	(-0.9, 0.0)	3.0	0.0	(-0.7, 0.7)	3.4	1.3	(0.2, 2.5)	4.6
Tender Joints (28)®	-1.5	(-1.9, -1.1)	3.9	-1.3	(-2.2, -0.3)	5.5	0.0	(-0.7, 0.6)	4.3	0.3	(-0.7, 1.2)	4.5	2.2	(0.8, 3.5)	5.4

*Clinical Disease Activity Index

Table 3. Change in RA-FQ by physician change categories* (N=787).

	A Lot Better (N=198; 25%)			A Little Better (N=149; 19%)			The Same (N=253; 32%)			A Little Worse (N=93; 12%)			A Lot Worse (N=94; 12%)
	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD
RA-FQ Total (range 0-50)	-7.3	(-8.8, -5.7)	10.8	-3.3	(-5.0, -1.7)	10.2	0.2	(-0.9, 1.2)	8.6	0.8	(-1.4, 3.0)	10.7	5.7	(3.4, 8.1)	11.3
RA-FQ Components (0-10)
Pain	-1.5	(-1.9, -1.2)	2.5	-0.6	(-1.0, -0.2)	2.4	0.1	(-0.1, 0.4)	2.1	0.4	(-0.1, 0.9)	2.7	1.2	(0.7, 1.8)	2.7
Physical Function	-1.6	(-1.9, -1.2)	2.4	-0.9	(-1.2, -0.5)	2.5	0.1	(-0.2, 0.4)	2.1	0.1	(-0.5, 0.6)	2.7	1.3	(0.8, 1.9)	2.6
Fatigue	-1.0	(-1.4, -0.6)	2.7	-0.6	(-1.0, -0.2)	2.4	-0.2	(-0.5, 0.1)	2.3	-0.1	(-0.6, 0.4)	2.3	0.8	(0.2, 1.3)	2.6
Stiffness	-1.7	(-2.1, -1.4)	2.5	-0.6	(-1.0, -0.2)	2.4	0.2	(-0.1, 0.4)	2.1	0.2	(-0.3, 0.7)	2.2	1.5	(1.0, 2.0)	2.4
Participation	-1.4	(-1.8, -1.1)	2.6	-0.7	(-1.1, -0.3)	2.4	0.0	(-0.3, 0.3)	2.1	0.2	(-0.4, 0.8)	2.8	1.0	(0.4, 1.5)	2.7
Disease Activity Indicators
Clinical Disease Activity Index	-12.4	(-13.9, -11.0)	10.3	-3.9	(-4.8, -3.1)	5.2	0.0	(-0.5, 0.5)	4.1	3.0	(1.8, 4.3)	6.2	10.6	(8.6, 12.6)	9.8
Physician Global	-3.8	(-4.2, -3.4)	1.4	-1.0	--	--	0.0	--	--	1.0	--	--	2.4	(1.7, 3.2)	0.8
Patient Global (0-10)	-1.7	(-2.0, -1.3)	2.7	-0.6	(-1.0, -0.1)	2.8	0.2	(-0.1, 0.5)	2.2	0.5	(-0.1, 1.1)	2.8	1.2	(0.7, 1.8)	2.7
Swollen Joints (28)	-3.6	(-4.1, -3.0)	4.0	-1.2	(-1.6, -0.8)	2.3	-0.2	(-0.4, 0.1)	2.0	0.7	(0.2, 1.2)	2.4	3.0	(2.1, 3.9)	4.4
Tender Joints (28)	-3.9	(-4.7, -3.1)	5.7	-1.1	(-1.6, -0.7)	2.8	0.0	(-0.3, 0.3)	2.3	0.8	(0.1, 1.6)	3.7	3.5	(2.4, 4.5)	5.0

*1- and 2- point change in physician global assessments were used to identify a little or a lot of change, respectively.

Table 4. Change in RA-FQ by change in Clinical Disease Activity levels between visits.

	Better									Worse
	2 Categories MOD-HIGH to REM (N=38; 5%)			1 Category MOD-HIGH to LDA (N=97; 12%)			1 Category LDA to REM (N=94; 12%)			1 Category REM to LDA (N=35; 5%)			1 Category LDA to MOD-HIGH (N=61; 8%)			2 Categories REM to MOD-HIGH (N=14; 2%)
	Δ	95% CI	Δ	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD	Δ	95% CI	SD
RA-FQ Total	-15.8	(-19.7, -11.8)	-5.4	-8.4	(-10.3, -6.5)	9.6	-5.4	(-6.7, -4.1)	6.3	6.8	(2.4, 11.2)	12.9	7.4	(4.5, 10.3)	11.4	16.9	(10.6, 23.1)	10.8
Components
Pain	-3.6	(-4.5, -2.8)	-1.1	-1.8	(-2.3, -1.4)	2.1	-1.1	(-1.4, -0.8)	1.5	1.8	(0.9, 2.7)	2.6	2.0	(1.3, 2.6)	2.7	4.3	(2.8, 5.8)	2.6
Function	-3.1	(-4.0, -2.3)	-1.2	-1.7	(-2.2, -1.2)	2.4	-1.2	(-1.6, -0.9)	1.5	1.4	(0.4, 2.4)	2.9	1.6	(0.9, 2.4)	2.8	3.5	(1.8, 5.2)	3.0
Fatigue	-2.3	(-3.4, -1.2)	-1.0	-1.4	(-1.8, -0.9)	2.3	-1.0	(-1.5, -0.6)	2.2	1.3	(0.3, 2.2)	2.9	0.7	(0.0, 1.4)	2.6	2.4	(0.3, 4.4)	3.5
Stiffness	-3.7	(-4.5, -2.9)	-1.0	-1.8	(-2.2, -1.3)	2.3	-1.0	(-1.3, -0.6)	1.6	1.4	(0.4, 2.4)	3.0	1.8	(1.2, 2.5)	2.6	3.3	(2.0, 4.6)	2.3
Participation	-3.0	(-3.9, -2.1)	-1.1	-1.7	(-2.3, -1.2)	2.6	-1.1	(-1.4, -0.8)	1.6	1.0	(0.0, 2.0)	2.9	1.3	(0.6, 2.0)	2.6	3.4	(1.7, 5.2)	3.1
Disease Activity
CDAI*
Patient Global	-2.0	(-2.3, -1.7)	-0.5	-1.2	(-1.4, -1.0)	0.9	-0.5	(-0.5 -0.4)	2.5	0.4	(0.4, 0.5)	0.2	1.0	(0.8, 1.2)	0.8	1.9	(1.2, 2.5)	1.1
MD Global	-4.2	(-5.0, -3.4)	-1.4	-1.8	(-2.3, -1.3)	2.5	-1.4	(-1.7, -1.1)	1.5	2.4	(1.4, 3.3)	2.7	2.5	(1.8, 3.2)	2.7	5.1	(3.8, 6.5)	2.3
Swollen Joints	-6.0	(-7.5, -4.5)	-1.0	-3.4	(-4.1, -2.6)	3.6	-1.0	(-1.2, -0.7)	1.2	0.4	(0.0, 0.8)	1.1	2.4	(1.4, 3.4)	4.1	4.0	(0.9, 7.1)	5.4
Tender Joints	-5.9	(-7.8, -4.0)	-1.0	-4.6	(-5.6, -3.6)	4.9	-1.0	(-1.2, -0.8)	1.2	0.9	(0.4, 1.4)	1.5	3.1	(2.2, 4.1)	3.6	6.9	(3.7, 10.1)	5.5

*Clinical Disease Activity Index

Table 5 is not available with this version

Download PDF

Reviewers invited by journal
29 Jun, 2021
Reviews received at journal
29 Jun, 2021
Editor invited by journal
03 May, 2021
Editor assigned by journal
16 Apr, 2021
First submitted to journal
14 Apr, 2021

You are reading this latest preprint version

“From Where I Stand”: Using Multiple Anchors Yields Different Benchmarks for Meaningful Improvement and Worsening in the Rheumatoid Arthritis Flare Questionnaire (RA-FQ)

Status:

Version 1

Abstract

Figures

Plain English Summary

Background

Methods

Results

Discussion

Declarations

References

Tables

Status:

Version 1