Registration and Protocol
The protocol for this review was registered prior to the start of the study on the Open Science Framework (https://doi.org/10.17605/OSF.IO/27EDF). We followed the working procedure developed by COSMIN for conducting a systematic review of validity, reliability, and measurement error [9, 76]. This report follows the most recent PRISMA guideline [87].
Eligibility Criteria
Population
This review targeted studies that included adults (aged ≥ 18 years). Studies including both a clinical group and a healthy group were included for the assessment of discriminative validity, while studies including a clinical group, healthy group, or heterogeneous group were included for the assessment of reliability and responsiveness. A clinical group was defined in this review as participants currently experiencing an episode of non-specific spinal pain (i.e., neck, upper, or low back pain) of any duration (i.e., acute or persistent/chronic) who were included in the study based on self-identification or an outcome measure of pain (e.g., visual analogue scale, numeric pain rating scale) or function/disability (e.g., Oswestry Disability Index). A healthy group was defined as participants with no current pain or recent history of non-specific spinal pain. We excluded data from clinical groups consisting of participants with a specific cause of spinal pain (e.g., infection, malignancy, fracture, history of spine surgery), or from studies whose authors did not screen for these potential causes of pain.
Outcome
We included studies that calculated the FRR from surface electromyography (sEMG) of spine muscles (i.e., neck, upper, or lower back) as a dependent measure. We deviated from our protocol to accept studies with or without a concurrent measure of spine angle (e.g., motion capture, inertial motion sensor, or accelerometer/inclinometer), so long as the instructions for the FRR trial defined the motion of the trial (flexion and extension phases). Studies that employed any method of calculating the FRR were accepted (e.g., maximum root mean square (RMS) sEMG during the flexion phase divided by maximum RMS sEMG at full flexion; maximum RMS sEMG during the extension phase divided by maximum RMS sEMG at full flexion). These variants share a common general form, shown below.
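As a generic illustration only (individual studies differed in their sEMG processing windows and in how the flexion, full-flexion, and extension phases were defined):

```latex
\mathrm{FRR} \;=\; \frac{\max\left(\mathrm{RMS\ sEMG}\right)_{\text{flexion or extension phase}}}{\max\left(\mathrm{RMS\ sEMG}\right)_{\text{full flexion}}}
```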
Included studies must have assessed the reliability or responsiveness of the FRR and/or the discriminative validity of the FRR through hypothesis testing, or must have presented enough data (mean and standard deviation per group) to assess validity or responsiveness against our review hypotheses. We used the data available that matched our inclusion criteria (i.e., if only the healthy/asymptomatic population matched our review inclusion criteria and could be used for reliability or responsiveness, it was included in those parts of the review, respectively). Both populations were considered for studies of reliability or measurement error. Discriminative validity was assessed by comparing populations that are known to be different [4]. Studies of responsiveness included either an experimental or clinical intervention applied to one or both populations.
Validity
Validity refers to the extent to which a measure assesses the construct it is supposed to measure [5]. Several aspects of validity need to be addressed when assessing the suitability of a measurement outcome. Construct validity refers to the degree to which the scores of a measurement are consistent with hypotheses that align with the construct of interest [5]. For this review, the form of construct validity that we explored was discriminative validity, which refers to the ability of a measurement score to distinguish between predictably different individuals or groups. We hypothesized that clinical groups (i.e., individuals with non-specific spinal pain) would have significantly different FRR values (95% confidence intervals (CI) for mean group differences not overlapping zero) compared to healthy/asymptomatic groups.
Reliability
Reliability refers to the extent to which scores are the same for repeated measurements, whether observed by different persons on the same occasion (inter-rater), over time (test-retest), or by the same person on different occasions (intra-rater), given that the value of the construct has remained stable [5]. The construct must therefore be stable for evaluations of test-retest and intra-rater reliability. Measurement error, a component of reliability, refers to the systematic and random error of an observed score that is not attributable to true changes in the construct being measured [5]. Any measure of reliability was accepted for this review.
Responsiveness
Responsiveness refers to the ability of a measurement instrument to detect change over time in the construct being measured, when such change has occurred, for example in response to treatment or during progression of disease [5]. To confirm responsiveness of the FRR outcome measure, we hypothesized that significant differences in FRR (95% CI for the mean pre/post-exposure difference not crossing zero) would be found before and after exposures/interventions.
Types of studies
We deviated from our protocol to include only articles published in peer-reviewed journals or full papers published in peer-reviewed conference proceedings. We included randomized controlled trials, cohort studies, case-control studies, cross-sectional studies, quasi-experimental studies, and laboratory experiments. No language limits were set. Attempts were made to translate studies into English for inclusion; where this was not possible, the identified studies were listed for future reviews to use. We excluded the following types of studies: feasibility studies, pilot studies, systematic and non-systematic reviews, protocols, theses/dissertations, commentaries, reports, and any other non-peer-reviewed studies.
Context
Studies conducted in either a clinical or laboratory setting were included.
Information Sources and Search Strategy
Six databases (MEDLINE via Ovid, Embase via Embase.com, CINAHL via EBSCO, SPORTDiscus via EBSCO, Web of Science Core Collection, and Scopus) were searched for published studies from inception to June 1, 2022. Search terms consisted of subject headings specific to each database (e.g., MeSH in MEDLINE) and free-text words relevant to the search concepts, such as "flexion relaxation" and "spine". The search string was developed by content experts (DDC, SH, SM, MF) together with a health services librarian (KR), and the search strategy was peer-reviewed by a second librarian according to the PRESS guidelines [88]. The complete search strategies for all databases are included in Supplementary Information S3 online.
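To illustrate the general structure of these strategies (the complete, peer-reviewed versions are in Supplementary Information S3), a hypothetical Ovid MEDLINE fragment combining subject headings with the free-text concepts named above might look like the following; the line set and exact terms are illustrative only:

```
1. exp Spine/ or exp Back Pain/
2. (spine or spinal or lumbar or neck or "low back").ti,ab.
3. "flexion relaxation".ti,ab.
4. (1 or 2) and 3
```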
Selection Process
Results from each database were combined and imported into Covidence (Veritas Health Innovation, Melbourne, Australia), where duplicates were removed prior to screening. Results for each stage of the review were tracked in Microsoft Excel (Microsoft Corporation, Redmond, USA). Two pairs of reviewers independently screened titles and abstracts and reviewed full texts for inclusion in the review. The reviewers met at each stage to reach consensus and resolve any discrepancies through discussion; a third reviewer was consulted when necessary. Backward citation tracking was conducted on all included studies, and the reference lists of systematic reviews, pilot studies, feasibility studies, and protocols were screened for articles missed by our search. A forward search of all included articles was also performed (articles citing the included articles were identified and screened).
Data Collection Process and Data Items
Data were extracted from the included articles by one reviewer and independently checked by a second reviewer. Discrepancies between reviewers were resolved through a consensus meeting, and a third reviewer was available to resolve any discrepancies that remained. Available supplementary files were consulted during data extraction for any relevant data not directly presented in the original study, and study authors were contacted for clarification where necessary. Data extracted on the study populations included author, year, country, setting (i.e., clinical or laboratory), sample size, patient characteristics (e.g., age, location of pain, duration of pain, outcome measure used for inclusion into the study), and healthy population characteristics (e.g., age, definition). Information pertaining to the characteristics of the study investigators (e.g., professional background, level of training, and/or years of experience) was extracted where possible. Relevant methodological information included data collection and processing methods (e.g., equipment, preparatory actions/instructions to participants, preparation of patients, unprocessed data collection, data processing and storage, and session information), description of the FRR calculation, measurement properties assessed, components repeated (for reliability), source(s) of variation (i.e., days, raters), and classification thresholds (if used). Data on the description of the FRR calculation, the FRR result (mean and variance), and the statistical analysis and results for each measurement property assessed in each relevant study were extracted, and the criteria for good measurement properties were applied [9]. Data displayed only in graphs were extracted by one reviewer and checked by a second using WebPlotDigitizer (Version 4.3, https://automeris.io/WebPlotDigitizer). Inter-rater reliability of the digitized data was assessed using an ICC2,1 calculated with the psych package [89] in R [90]. Final data tables were checked by a fourth person for errors.
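As a sketch of this digitization check, the ICC2,1 corresponds to the 'Single_random_raters' row returned by the psych package; the data below are hypothetical values invented for illustration, not taken from the review:

```r
library(psych)

# Hypothetical digitized values: each row is one graph-extracted data point,
# each column is one reviewer (rater).
digitized <- data.frame(
  reviewer1 = c(9.8, 5.2, 7.1, 3.4, 6.0),
  reviewer2 = c(9.6, 5.3, 7.0, 3.6, 6.1)
)

icc_results <- ICC(digitized)  # computes ICC(1,1), ICC(2,1), ICC(3,1), and average-rater forms
icc_results$results["Single_random_raters", ]  # ICC2,1: two-way random effects, single rater
```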
Risk of Bias Assessment and Quality of Reporting
Two pairs of reviewers independently assessed the quality of each included study using the COSMIN Risk of Bias (RoB) tools/checklists to assess reliability and measurement error, construct validity (Box 9b, discriminative validity), and responsiveness (Boxes 10b, 10c, 10d) [9]. The COSMIN RoB tool and checklist are modular, meaning that the boxes in the tool and checklist were completed based on the measurement properties evaluated in each study. If a study reported multiple outcomes of one measurement property (e.g., inter-rater and intra-rater reliability), the corresponding box in the COSMIN RoB tool/checklist was completed more than once. Each standard within a COSMIN RoB box was rated as ‘very good’, ‘adequate’, ‘doubtful’, or ‘inadequate’ according to the criteria outlined by the tool. We followed the “worst score counts” principle, where the overall rating of the quality of each study was determined by taking the lowest rating of any of the standards in the COSMIN boxes used [9, 76] (illustrated below). Reviewers met for consensus, and a third reviewer helped to resolve discrepancies that could not be resolved through discussion. Quality of reporting was assessed through Part A of the COSMIN RoB tool, which is the same for all measurement properties and focuses on the reporting of parameters specific to the instructions to participants, data collection, processing, and analysis; it was rated with the same levels and criteria presented above. The results of Part A were not used in the judgement of RoB (Part B); however, we present a summary of the Part A results (percentage of each rating by domain) together with our RoB results, and traffic-light plots (results of individual domains by study) were prepared as Supplementary Fig. S2A-E using ROBVIS [91].
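The “worst score counts” principle reduces to taking the lowest rating across the standards in a box. A minimal sketch in R, with hypothetical standard ratings:

```r
# Hypothetical ratings for three standards within one COSMIN RoB box
ratings <- c(standard1 = "very good", standard2 = "adequate", standard3 = "doubtful")

# Ratings ordered from worst to best
levels_ord <- c("inadequate", "doubtful", "adequate", "very good")

# Overall box rating = lowest (worst) rating across standards
overall <- levels_ord[min(match(ratings, levels_ord))]
overall  # "doubtful"
```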
Effect Measures
Standardized mean difference and 95% confidence intervals were calculated where possible (and not reported by the study authors) for discriminative validity and responsiveness. A standard effect size was also calculated (change in the mean score divided by the standard deviation of the baseline) if responsiveness was not explicitly reported in a study but enough information was provided. There was no effect measure used for the synthesis of reliability and/or measurement error.
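Written out, these effect measures take the following general forms (the pooled-standard-deviation expression shown for the SMD is the conventional two-group form and is an assumption here, as the exact variant is not stated above):

```latex
\mathrm{SMD} = \frac{\bar{x}_{1} - \bar{x}_{2}}{SD_{\text{pooled}}},
\qquad
SD_{\text{pooled}} = \sqrt{\frac{(n_{1}-1)\,SD_{1}^{2} + (n_{2}-1)\,SD_{2}^{2}}{n_{1}+n_{2}-2}},
\qquad
\mathrm{ES} = \frac{\bar{x}_{\text{post}} - \bar{x}_{\text{pre}}}{SD_{\text{baseline}}}
```

Here groups 1 and 2 correspond to the clinical and healthy groups for discriminative validity, and ES is the standard effect size used when responsiveness was not explicitly reported.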
Synthesis Methods
Results of individual studies for each measurement property (e.g., the range of values, percentage of confirmed hypotheses) were summarized according to the COSMIN methodologies for systematic reviews. Specifically, the methodology for patient-reported outcomes [76] was followed for discriminative validity, and the methodology for clinician-reported outcome measurement instruments, performance-based outcome measurement instruments, and laboratory values [9] was followed for reliability, responsiveness, and measurement error. We checked study results against our review hypotheses for the assessment of construct validity and responsiveness.
Explanations for inconsistent results between studies for a measurement property (e.g., test-retest reliability) were explored, and subgroups of homogeneous studies were summarized (e.g., by study population or study quality). If no explanation for the inconsistency was found, we concluded that the results were inconsistent. The overall results were then compared to the criteria for good measurement properties to determine whether the FRR has sufficient (+), insufficient (-), or indeterminate (?) construct validity, reliability, measurement error, and/or responsiveness [9]. Results were reported by two reviewers for each measurement property. The reviewers met for consensus through discussion, and a third reviewer helped to resolve persistent discrepancies.
Where possible, results from studies on reliability and discriminative validity were statistically pooled in a random-effects meta-analysis using the meta package in R [92]. Statistical heterogeneity was assessed with the I² statistic. Only studies that reported confidence intervals (or from which we could calculate confidence intervals) and that used the same population, context, study design, FRR calculation, and statistical model/formula were quantitatively pooled and visualized with forest plots. The results of the remaining studies were presented in tables, and a synthesis without meta-analysis was conducted in adherence with the SWiM Reporting Guidelines [93].
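A minimal sketch of this pooling step with the meta package is shown below; the study labels and summary statistics are invented for illustration and do not come from the review:

```r
library(meta)

# Hypothetical group-level FRR summaries for three studies
frr_data <- data.frame(
  study        = c("Study A", "Study B", "Study C"),
  n.pain       = c(25, 40, 30),
  mean.pain    = c(5.1, 4.3, 6.2),
  sd.pain      = c(2.0, 1.8, 2.4),
  n.healthy    = c(25, 38, 32),
  mean.healthy = c(9.4, 8.1, 10.3),
  sd.healthy   = c(3.1, 2.6, 3.5)
)

m <- metacont(n.e = n.pain, mean.e = mean.pain, sd.e = sd.pain,
              n.c = n.healthy, mean.c = mean.healthy, sd.c = sd.healthy,
              studlab = study, data = frr_data,
              sm = "SMD", random = TRUE)  # random-effects pooling of standardized mean differences

summary(m)  # pooled SMD with 95% CI and the I² statistic
forest(m)   # forest plot of individual and pooled estimates
```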
Certainty Assessment: Grading the quality of cumulative evidence
Two reviewers independently assessed the overall quality of the evidence on the validity, reliability, measurement error, and responsiveness of the FRR using the modified GRADE approach outlined in the COSMIN methodology for systematic reviews of patient-reported outcome measures [9, 76]. The quality of the evidence was graded as high, moderate, low, or very low, reflecting our confidence in the measurement property estimates. Four factors were considered when evaluating the quality of the evidence: risk of bias (methodological quality of the studies); inconsistency (unexplained inconsistency of results across studies); imprecision (total sample size of the available studies); and indirectness (evidence from populations other than the population of interest). The evidence for each overall result was assumed to be of high quality initially and could be downgraded by one to three levels based on each of these four factors. The rules for downgrading were presented a priori in our protocol. The two reviewers met for consensus, and a third reviewer helped resolve any persistent discrepancies. The final grading of the quality of the evidence was recorded in a Summary of Findings Table together with the rules (table footnotes) and justifications for any decisions to downgrade.