The results of the search strategy are presented in Fig. 2. A total of 6732 studies were identified and screened. Of these 6450 were excluded from the title and abstract. Of the remaining 282 full-text studies reviewed, 255 were excluded. Twenty-three studies met the inclusion criteria and were included in the review (2, 27–48)
Study characteristics – Population and location
A total of 21,968 participants were recruited across the 23 included studies. There were 17,604 participants with non-specific neck pain and 4364 participants with CSR. Sixteen studies included neck pain populations, five were cohort study designs (27, 29, 34, 43, 46) and 11 were cross-sectional in study design (2, 28, 32, 35–37, 39, 41, 42, 45, 47). Of the seven studies that included CSR populations, five were observational (30, 31, 38, 44, 48) and two were secondary analyses of healthcare records (33, 40). The characteristics of the included studies are presented in Table 1 (summary study characteristics). A full table of study characteristics can be accessed in supplementary file 1.
Table 1
Summary study characteristics
Author and year
|
Spinal diagnosis
|
Mental health diagnosis or symptoms
|
Health outcome
|
Alipour (2009)
|
Non-specific neck pain
|
Anxiety symptoms regarding changed
|
Sick Leave from employment
|
Beltran-Alacreu
(2018)
|
Non-specific neck pain
|
Kinesiophobia
|
Presence of pain (NPRS)
|
Bohman (2019)
|
Neck pain for 3 months or longer
|
Depressive symptoms
|
Neck Disability Index
|
Carroll (2004)
|
Non-specific neck pain
|
Depressive symptoms
|
Development of pain (NPRS)
|
Diebo (2018)
|
Cervical spine radiculopathy
|
Psychological outcomes with SF-36
|
Neck Disability Index (NDI)
|
Divi
(2020)
|
Cervical spine radiculopathy
|
Psychological outcomes with SF-12
|
Neck Disability Index (NDI)
|
Elbinoune (2016)
|
Neck pain for 3 months or longer
|
Anxiety and depressive symptoms
|
Presence of Pain (NPRS)
|
Engquist (2015)
|
Cervical radiculopathy
|
Depressive symptoms
|
Neck Disability Index
|
Grimby-Ekman (2012)
|
Non-specific neck pain
|
Stress
|
Presence of pain (NPRS)
|
Hill
(2007)
|
Non-specific neck pain
|
Psychological distress
|
Presence of pain (NPRS)
|
Hoe (2012)
|
Non-specific neck pain
|
Job strain & SF-12 MCS
|
Presence of pain (NPRS)
|
Hurwitz (2006)
|
Non-specific neck pain
|
SF-36 Mental health
|
Neck Disability Index
|
Kim
(2018)
|
Cervical spine radiculopathy
|
Depressive symptoms
|
Neck Disability Index and Numeric Pain Rating Score
|
Lee (2007)
|
Non-specific neck pain
|
Psychological distress
|
Presence of pain (NPRS)
|
MacDowell (2018)
|
Cervical radiculopathy
|
Anxiety and depressive symptoms
|
Neck Disability Index
|
McLean (2011)
|
Neck pain for 3 months or longer
|
Anxiety and depressive symptoms
|
Disability of arm and shoulder (DASH)
|
Meisingset (2018)
|
Non-specific neck pain
|
Catastrophising
|
Pain (NPRS)
|
Myhre (2013)
|
Non-specific neck pain
|
Emotional distress
|
FABQ-W
|
Peolsson (2006)
|
Cervical spine radiculopathy
|
Distress
|
Neck Disability Index
|
Pico-Espinosa (2019)
|
Non-specific neck pain
|
Depressive symptoms
|
Pain levels (NPRS)
|
Rodriguez-Romero (2016)
|
Non-specific neck pain
|
Psychological outcomes with SF-36
|
Presence of pain (NPRS)
|
van den Heuvel (2005)
|
Non-specific neck pain
|
Job strain
|
Presence of neck and upper limb pain shoulder pain (NPRS)
|
Wibault
(2014)
|
Cervical spine radiculopathy
|
Depression and Anxiety
|
Neck Disability Index
|
Seven studies included participants with CSR recruited from elective spinal surgery waiting lists. The CSR diagnosis was made using imaging associated with a neurological deficit on clinical examination (30, 31, 33, 38, 40, 44, 48). Despite contacting the corresponding authors for further information, no further details were obtained.
Nine studies measured depressive symptoms (2, 29, 32, 33, 38, 40, 43, 45, 48). Five studies measured anxiety symptoms (27, 32, 40, 43, 48) and three studies measured job-strain and stress (34, 35, 46). Three studies used the psychological components of SF-36 (30, 39, 47). Two studies used the psychological components of SF-12 (31, 35). One study measured kinesiophobia (28) and one study measured catastrophising (41). Three studies used more than one mental health symptom measurement (32, 35, 43). A summary of the mental health symptoms and tools to measure the severity of mental health conditions across the 23 included studies are presented in Table 1.
Neck pain associative outcomes. Depressive symptoms.
Of the 16 studies with people with non-specific neck pain, there were positive and negative associations between mental health symptoms and health outcomes. Four studies reported a positive association (2, 32, 43) and one study reported a negative association (29) with depression health outcomes. Using GRADE classifications, the overall strength of evidence was ‘low’, which is attributed to a high risk of bias.
Depressive symptoms measured through Hospital Anxiety and Depression Scale (HADS) was positively associated with the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire (r:0.245, p = 0.004) (43), Odds Ratio (OR): 3.46 (95% CI: 2.01–5.95) (45) and OR: 1.02 (95% CI: 0.98–1.06) (32). When measured through the CES-D, depressive symptoms were positively associated with pain (Hazard Ratio (HR): 3.97, 95% CI: 1.81–8.72) (2). Depressive symptoms measured by the Montgomery Asberg Depression Rating Scale were negatively associated with Neck Disability Index (NDI) (OR: 0.94, 95% CI: 0.86–1.03) (29).
Neck pain associative outcomes. Anxiety symptoms.
Anxiety symptoms were positively associated with poorer health outcomes in two studies (27, 32) and had no significance in one study (43). The overall strength of evidence was ‘very low’ in the GRADE assessment which is attributed to a high risk of bias and imprecision.
Anxiety symptoms measured through the Nordic musculoskeletal questionnaire were more likely to be associated with sick leave (OR: 1.4, 95% CI: 0.9–2.1) (27). Anxiety symptoms measured through HADS were more likely to be associated with the presence of pain (OR: 1.02, 95% CI: 0.98–1.05) (32). Whereas in one study, anxiety symptoms measured through HADS had no statistical significance with DASH (r: 0.104, p = 0.220) (43).
Neck pain associative outcomes. Kinesiophobia
Kinesiophobia was associated with poorer health and the presence of pain (r: 0.566, P = < 0.05) in one study (28).
Neck pain associative outcomes. Catastrophising
Catastrophising, measured by the catastrophising pain scale, was positively associated with pain (OR: 1.03, 95% CI 0.97–1.09) in one study (41).
Neck pain associative outcomes. Stress
Stress was positively associated with the presence of pain (OR: 0.32, 95% CI: 0.25–0.39) in one study (34).
Neck pain associative outcomes. Job strain
A higher job strain was negatively associated with the presence of pain in the neck and shoulder in two studies (Relative Risk (RR): 1.79, 95% CI: 1.19–2.69) (46) and OR: 1.51 (95% CI: 0.88–2.59) (35). This was rated as ‘low’ in the GRADE assessment, attributed to imprecision across the studies.
Neck pain associative outcomes. Distress
Distress was positively associated with health outcomes in three studies (37, 39, 42) and negatively associated with health outcomes in two studies (36, 47). The overall strength of evidence using the GRADE approach is ‘very low’, which is attributed to a high risk of bias and imprecision.
Psychological distress measures were positively associated with the presence of pain when measured by SF-36 (r2: 0.12, p < 0.01) (39) and Hopkins Check List-10 (OR: 2.32, 95% CI: 1.20–3.43) (42). Similarly, this was positively associated with NDI (OR: 1.75, 95% CI 0.83–3.70) (37). Two studies reported a negative association between distress and the presence of pain (OR: 0.88, 95% CI: 0.62–1.24) (36) and OR: -0.3, 95% CI -0.4-0.1 (47).
Cervical spine radiculopathy associative outcomes. Depressive symptoms
Of the seven studies with CSR populations, there were both positive and negative associations between depressive symptoms and health outcomes. Three studies reported a negative association (33, 40, 44), whereas one study reported a positive association (38). The overall strength of evidence using the GRADE approach was ‘very low’, this is attributed to a high risk of bias and imprecision.
Depressive symptoms were positively associated with NDI when measured through the Zung Self-Reporting Scale (NDI with depression 42.8 (High) (SD: 19.9) vs 20.9 (SD: 15.9), p < 0.0001) (38). Three studies reported negative associations (OR: 0.71, p < 0.001) (44), regression coefficient 0.25 (95% CI: -0.01-0.50) (40) and risk of depression not being significant (p = 0.3) (33).
Cervical spine radiculopathy associative outcomes. Distress
There were two studies that reported a positive association between SF-36 (p < 0.05) (30) and SF-12 (p = 0.04) (31) and NDI. Whereas one study reported distress being negatively associated with NDI (r2 = 0.80, p = 0.0005) (44). The overall strength of evidence using the GRADE approach was ‘very low’. This is attributed to a high risk of bias and imprecision.
Cervical spine radiculopathy associative outcomes. Anxiety symptoms
In one study, anxiety symptoms were positively associated with NDI in CSR populations (OR: 0.63, p = 0.006) (48). All associative outcomes data are populated in Table 4.
Quality Assessment. Neck pain populations
Five cohort studies included patients with non-specific neck pain as their exposure (27, 29, 34, 43, 46). These studies scored between five and seven out of nine on the NOS. All studies met the ‘representativeness of exposed cohort’ and ‘adequate follow-up’. All five studies did not complete the ‘assessment of outcome’ item.
Eight studies were cross-sectional in study design. Scores ranged from five to seven out of nine on the NOS (2, 28, 32, 35, 39, 41, 47). All studies met the ‘representativeness of exposed cohort’ and ‘adequate follow-up’. All eight studies did not meet the ‘assessment of outcome’ item. Three studies completed a secondary analysis of data (36, 37, 45). These studies scored six to seven out of a possible nine. All studies did not meet the ‘demonstration that outcome of interest was not present at the start of study’ item and ‘assessment of outcome’. The overall strength of evidence measured through GRADE is populated in Table 2. The quality assessment tables are populated in Table 3.
Table 2
Certainty of evidence. GRADE approach for health outcomes
Study Design
|
Study lead author
|
Number of studies/patients
|
Risk of bias
|
Imprecision
|
Inconsistency
|
Indirectness
|
Overall strength of evidence
|
|
Observational
Neck pain without CSR
|
Depression
|
Bohman
Caroll
Elbinoune
McClean
Pico-Espinosa
|
5/1,718
|
High
|
Serious
|
Moderate
|
No seriousness
|
Low
|
|
Anxiety
|
Alipour
Elbinoune
McClean
|
1/12,415
|
High
|
Serious
|
High
|
No seriousness
|
Very low
|
|
Catastrophising
|
Meisingset
|
1/70
|
High
|
Serious
|
High
|
No seriousness
|
Very low
|
|
Stress
|
Grimby-Ekman
|
1/1200
|
High
|
Serious
|
High
|
No seriousness
|
Very low
|
|
Job strain
|
Van den Heuvel
Hoe
|
2/1898
|
High
|
Serious
|
Moderate
|
No seriousness
|
Low
|
|
Distress
|
Lee
Hill
Hurwitz
|
3/802
|
High
|
Serious
|
Moderate
|
No seriousness
|
Very Low
|
|
Kinesiophobia
|
Beltran-Alacreu
|
1/128
|
High
|
Serious
|
Moderate
|
No seriousness
|
Low
|
|
Observation
CSR
|
|
Distress
|
Diebo
Divi
Peoplsson
|
3/639
|
High
|
Serious
|
Moderate
|
No seriousness
|
Very Low
|
|
Depression
|
Kim;
Peolsson
Enquist
MacDowell
|
4/471
|
High
|
Serious
|
Moderate
|
No seriousness
|
Very Low
|
|
Anxiety
|
Wilbault;
|
1/254
|
High
|
Serious
|
Moderate
|
No seriousness
|
Very Low
|
|
Through this, the certainty of the evidence was either increased (upgraded) or decreased (downgraded) against the following five criteria: |
(1) Methodological limitations using the Cochrane Risk of Bias tool (downgraded where there was a high risk of bias for three or more items; upgraded where all items demonstrated a low risk of bias); |
(2) Indirectness relating to similarity to clinical practice (downgraded where reviewers felt the study design was not generalisable to UK practice; upgraded where study design was generalisable to UK practice); |
(3) Imprecision relating to the number of participants and events (downgraded where outcomes reported less than 300 participants or five events; upgraded where effects reported in excess of 450 participants or 20 events); |
(4) Inconsistency in effect estimates across trials for a given analysis (downgraded where the CIs were four-times the magnitude of the effect estimate; upgraded where CIs were two-times the magnitude of the effect estimate) |
(5) Likelihood of publication bias (downgraded when reviewers observed asymmetry in funnel plot shape; upgraded when reviewers observed symmetry in funnel plot shape). |
Table 3
Quality assessment scoring for all included studies
Author and year
|
Representativeness of the exposed cohort
|
Selection of the non-exposed cohort
|
Ascertainment of exposure
|
Demonstration that outcome of interest was not present at start of study
|
Comparability of cohorts based on the design or analysis
|
Assessment of outcome
|
Was follow-up long enough for outcomes to occur
|
Adequacy of follow up of cohorts
|
TOTAL STARS
|
Alipour (2009)
|
1
|
1
|
0
|
1
|
2
|
0
|
1
|
0
|
7
|
Beltran-Alacreu
(2018)
|
1
|
1
|
0
|
0
|
0
|
0
|
1
|
1
|
5
|
Bohman (2019)
|
1
|
1
|
0
|
0
|
0
|
0
|
1
|
1
|
5
|
Diebo (2018)
|
1
|
1
|
1
|
0
|
2
|
0
|
1
|
1
|
7
|
Divi
(2020)
|
1
|
1
|
1
|
0
|
2
|
0
|
1
|
1
|
7
|
Carroll (2004)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
7
|
Elbinoune (2016)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
6
|
Engquist (2015)
|
1
|
1
|
1
|
0
|
0
|
0
|
1
|
1
|
5
|
Grimby-Ekman (2012)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
7
|
Hill
(2007)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
0
|
6
|
Hoe (2012)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
7
|
Hurwitz (2006)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
0
|
6
|
Kim
(2018)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
6
|
Lee (2007)
|
1
|
1
|
0
|
0
|
0
|
0
|
1
|
1
|
5
|
MacDowell (2018)
|
1
|
1
|
1
|
0
|
2
|
0
|
1
|
1
|
7
|
McLean (2011)
|
1
|
1
|
0
|
0
|
1
|
0
|
1
|
1
|
6
|
Meisingset (2018)
|
1
|
1
|
0
|
0
|
2
|
0
|
0
|
0
|
5
|
Myhre (2013)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
7
|
Peolsson (2006)
|
1
|
1
|
1
|
0
|
2
|
0
|
1
|
0
|
6
|
Pico-Espinosa (2019)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
7
|
Rodriguez-Romero (2016)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
0
|
6
|
van den Heuvel (2005)
|
1
|
1
|
0
|
1
|
2
|
0
|
1
|
0
|
7
|
Wibault
(2014)
|
1
|
1
|
0
|
0
|
2
|
0
|
1
|
1
|
6
|
Quality assessment. Cervical spine radiculopathy populations
Five cohort studies included patients with CSR as their exposure population (30, 31, 38, 44, 48). These studies scored between six and seven out of a possible nine on NOS. All studies met the ‘representativeness of exposed cohort’ and ‘adequate follow-up’. All five studies did not complete the ‘assessment of outcome’ item. Two studies with a CSR study population were retrospective secondary data analyses where each study scored five (33) and seven (40), respectively. The overall strength of evidence measured through GRADE is populated in Table 2. The quality assessment tables are populated in Table 3.
Table 4
Associative data between health outcome and mental health.
Author and year
|
Associative data between health outcome and mental health
|
Alipour (2009)
|
OR: 1.4
(95% CI: 0.9–2.1)
|
Beltran-Alacreu
(2018)
|
Association kinesiophobia and presence of pain (r = 0.566)
|
Bohman (2019)
|
OR: 0.94
(95% CI: 0.86–1.03)
|
Carroll (2004)
|
Hazard Rate Ratio 3.97
(95% CI 1.81–8.72)
|
Diebo (2018)
|
When NDI is low
MHC = 25.81
(SD: 8.85)
When NDI is high
MCS = 25.60
(SD: 8.87)
|
Divi
(2020)
|
MHC low score
23.9
(95% CI: 21.0-26.7)
vs
MHC high score
31.8
(95% CI: 24.7–38.9) (p = 0.04)
|
Elbinoune (2016)
|
HADS-Anxiety
OR: 1.02
(95% CI: 0.98–1.05)
HADS-Depression OR: 1.02
(95% CI: 0.98 to 1.06)
|
Engquist (2015)
|
No risk of depression
4 (95% I: -4 to 15)
At risk of depression
10 (95% CI: 1–19)
(p = 0.3)
|
Grimby-Ekman (2012)
|
OR 0.32
(95% CI: 0.25–0.39)
|
Hill
(2007)
|
OR 0.88
(95% CI: 0.62–1.24)
|
Hoe (2012)
|
High Job Strain
OR: 1.51
(95% CI: 0.88–2.59)
SF-12 Mental Health Component OR: 0.98
(95% CI: 0.96–0.99)
|
Hurwitz (2006)
|
OR 1.75 (95% CI 0.83–3.70)
|
Kim
(2018)
|
NDI
Depression 42.8 (SD: 19.9)
vs
Low-depression 20.9 (SD: 15.9) (p < 0.0001)
NPRS
Depression 5.5 (SD: 2.2)
vs
Low depression 3.0 (SD: 2.4)
(p < 0.0001)
|
Lee (2007)
|
SF-36 MCS and Physical activity (r2: 0.12 p < 0.01)
|
MacDowell (2018)
|
Regression Coefficient
0.25
(95% CI: -0.01-0.50)
|
McLean (2011)
|
Depression
r: 0.245
(p = 0.004)
Anxiety
r:0.104
(p = 0.220)
|
Meisingset (2018)
|
OR: 1.03
(95% CI 0.97–1.09)
|
Myhre (2013)
|
OR: 2.32
(95% CI: 1.20–3.43)
|
Peolsson (2006)
|
NDI r2 = 0.80
to DRAM
(p = 0.0005)
|
Pico-Espinosa (2019)
|
OR: 3.46
(95% CI 2.01–5.95)
|
Rodriguez-Romero (2016)
|
OR: -0.3
(95% CI: -0.4-0.1)
|
van den Heuvel (2005)
|
Low job strain
RR: 1.00 (95% CI 0.76–1.92)
High job strain
RR: 1.79
(95% CI 1.19–2.69)
|
Wibault
(2014)
|
Depression
OR: 0.71
(p = < 0.001)
Anxiety
OR: 0.63 (p = 0.006)
|