The Validity and Reliability of the Patient Health Questionnaire-9 in Screening for Post-Stroke Depression

doi:10.21203/rs.2.11681/v3

Research article

This work is licensed under a CC BY 4.0 License

Journal Publication

published 09 Jun, 2020

Read the published version in BMC Psychiatry →

You are reading this latest preprint version

Abstract

Background: Depression affects about 30% of stroke survivors within five years. Timely diagnosis and management of post-stroke depression facilitate motor recovery and improve independence. The original version of the Patient Health Questionnaire-9 (PHQ-9) is recognized as a good screening tool for post-stroke depression. However, no validation studies have been undertaken for the use of the Thai PHQ-9 in screening for depression among Thai stroke patients.

Methods: The objectives were to determine the criterion validity and reliability of the Thai PHQ-9 in screening for post-stroke depression by comparing its results with those of a psychiatric interview as the gold standard. First-ever stroke patients aged ≥ 45 years with a stroke duration of 2 weeks–2 years were administered the Thai PHQ-9. The gold standard was a psychiatric interview leading to a DSM-5 diagnosis of depressive disorder. The summed-scored-based diagnosis of depressive disorder with the PHQ-9 was obtained. Validity and reliability analyses, and a receiver operating characteristic curve analysis, were performed.

Results: In all, 115 stroke patients with a mean age of 64 years (SD: 10 years) were enrolled. The mean PHQ-9 score was 5.2 (SD: 4.8). Using the DSM-5 criteria, 23 patients (20%) were diagnosed with depressive disorder. The Thai PHQ-9 had satisfactory internal consistency (Cronbach’s alpha: 0.78). The algorithm-based diagnosis of the Thai PHQ-9 had low sensitivity (0.52) but very high specificity (0.94) and positive likelihood ratio (9.6). Used as a summed-scored-based diagnosis, an optimal cut-off score of six revealed a sensitivity of 0.87, specificity of 0.75, positive predictive value of 0.46, negative predictive value of 0.95, and positive likelihood ratio of 3.5. The area under the curve was 0.87 (95% CI: 0.78–0.96).

Conclusions: The Thai PHQ-9 has acceptable psychometric properties for screening for post-stroke depression, with a recommended cut-off score of ≥ 6 for a Thai population.

Psychiatry

Neurology

depression

Patient Health Questionnaire-9

reliability

screening

stroke

Thai

validity

Background

Depression is the most common psychological problem experienced by survivors of a stroke.¹ The pooled frequency of depression is 31% among stroke survivors at any time up to five years after their stroke.² However, a review of prospective longitudinal research³ showed a biphasic pattern in post-stroke depression rates: depressive symptoms gradually rise in the first 6 months, ease slightly at around 12 months, and worsen again during the second year after the stroke. Post-stroke depression (PSD) is associated with a longer hospital stay and decreased participation in rehabilitation programs, resulting in less functional improvement.⁴,⁵ After stroke patients are discharged, they tend to become physically inactive and socially isolated.⁶ Depressed patients have fewer daily activities and a lower quality of life.⁷ This may lead to more cognitive impairment⁸ and increased mortality during the 2–5 years following the stroke.⁹

It is difficult to diagnose depression after a stroke because the symptoms of depression can be confused with symptoms that are typical of stroke patients.¹⁰ Screening for mood disorders after a stroke is recommended by many stroke and stroke-rehabilitation guidelines.¹¹,¹² Given that the availability of psychiatrists is limited in Thailand, there is a need for a screening tool to assist primary care physicians and other specialists in assessing for depression. The Patient Health Questionnaire-9 (PHQ-9) has been extensively studied in non-Thai populations, including post-stroke patients, and has been reported to be a good PSD screening tool and to have the highest sensitivity.¹³,¹⁴ The PHQ-9 has also been translated into Thai (Thai PHQ-9) and validated in primary care patients.¹⁵ The cut-off score of the Thai PHQ-9 for major depression in primary care patients is 9, which differs from that of the original version of the PHQ-9.¹⁶ As to PSD, Williams et al.¹⁷ reported a cut-off score of 10 for the original version for the diagnosis of major depression, with a sensitivity of 91% and a specificity of 89%. However, the PHQ-9 has not yet been validated for PSD among Thais. Because Thailand and western countries have different health care systems, cultures, attitudes, mindsets, and family support systems, this study investigated the validity and reliability of the Thai PHQ-9 in screening for depressive disorder after stroke among Thais.

Methods

Subjects and procedures

Ethics approval was obtained from the Medical Ethics Committee of the Human Research Protection Unit, Faculty of Medicine Siriraj Hospital. The patients were recruited between November 2017 and December 2018 from the Department of Rehabilitation Medicine, Faculty of Medicine Siriraj Hospital, a tertiary hospital in Thailand. All patients gave written consent to participate. They were informed that their emotional status would be assessed via a questionnaire and a psychiatric interview. The inclusion criteria were age ≥ 45 years; a first-ever stroke, as per WHO criteria,¹⁸ with a stroke duration of 2 weeks–2 years; stable vital signs, neurological signs, and stroke symptoms, as confirmed by a neurologist; and the ability to communicate in Thai. Patients were excluded if they had cognitive impairment, defined as a score of < 24 on the Thai Mental State Examination,¹⁹ or a previous diagnosis of dementia, a psychiatric disorder, or another neurological disease.

Demographic characteristics were gathered from interviews with the enrolled patients, and information related to their stroke (such as any comorbid illnesses and the type of stroke diagnosed from imaging studies) was obtained from medical records. Modified Rankin Scale scores were also obtained to determine the level of disability of the participants. The Thai PHQ-9¹⁵ was administered by one of the researchers (PD) at either the inpatient rehabilitation ward or the outpatient rehabilitation clinic, depending on the patient’s visit. On the same day, a psychiatrist interviewed each patient in a private area and made a diagnosis according to the criteria detailed in the American Psychiatric Association’s Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5). The researcher and the psychiatrist were blinded to each other’s assessment.

Measures

Thai Mental State Examination¹⁹

The Thai Mental State Examination (TMSE) was the first neuropsychiatric test used to provide a standard mental status examination of Thais. The maximum TMSE score is 30 points. A cut-off score of 24 points is used to identify cognitively normal, healthy older Thais.

Modified Rankin Scale

The Modified Rankin Scale (MRS), a clinician-reported measure of global disability, has been widely applied to evaluate stroke recovery.²⁰,²¹ It is an ordinal scale with 7 categories ranging from zero (no symptoms) to six (death). The MRS assesses an individual’s ability to ambulate and complete the activities of daily living. MRS scores > 3 are defined as severe disability.²²

Thai PHQ-9¹⁵

The PHQ-9 consists of 9 questions that are based on the 9 DSM-IV criteria for major depressive disorder. The questionnaire explores the symptoms experienced by patients during the 2 immediately preceding weeks. Each PHQ-9 item is scored 0 (not at all), 1 (several days), 2 (more than half of the days), or 3 (nearly every day). The PHQ-9 can be used in two ways. First, it provides a preliminary diagnosis of major depressive disorder through an algorithm-based diagnosis: ≥ 5 items, including Item 1 and/or Item 2, must be rated ≥ 2, which means that the total score for the questionnaire is necessarily 10 or higher. Second, it can be used as a screening tool for depression through a summed-scored-based diagnosis. The summed scores range from 0 to 27, and various cut-off scores allow for the determination of different degrees of depression. A study on the Thai PHQ-9 in the general Thai population reported that a summed score of 9 or greater signified a major depressive disorder, with a sensitivity of 0.84 and specificity of 0.77.
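
The two scoring approaches described above can be sketched in a few lines of Python. This is only an illustration of the rules as stated in this section, not the software used in the study; the function names are hypothetical, and the cut-off is left as a parameter because it depends on the population being screened.

```python
from typing import Sequence


def phq9_summed_score(items: Sequence[int]) -> int:
    """Return the summed PHQ-9 score (0-27) for nine item scores of 0-3."""
    if len(items) != 9 or any(score not in (0, 1, 2, 3) for score in items):
        raise ValueError("expected nine item scores, each 0, 1, 2, or 3")
    return sum(items)


def phq9_algorithm_positive(items: Sequence[int]) -> bool:
    """Algorithm-based diagnosis as described in the text (an assumption of
    this sketch): at least 5 items rated >= 2, including Item 1 and/or Item 2."""
    phq9_summed_score(items)  # reuse the input validation
    endorsed = [score >= 2 for score in items]
    return sum(endorsed) >= 5 and (endorsed[0] or endorsed[1])


def phq9_screen_positive(items: Sequence[int], cutoff: int) -> bool:
    """Summed-score-based screening: positive when the total reaches the cut-off."""
    return phq9_summed_score(items) >= cutoff


# Hypothetical respondent: five items rated 2, including Items 1 and 2.
example = [2, 2, 2, 2, 2, 0, 0, 0, 0]
print(phq9_summed_score(example))         # 10
print(phq9_algorithm_positive(example))   # True
print(phq9_screen_positive(example, 9))   # True
```

The example respondent shows why a positive algorithm-based diagnosis implies a summed score of at least 10: five items rated 2 already add up to 10.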

DSM-5

The DSM-5 criteria for depressive disorders were used as the reference standard.²³ A psychiatric interview was conducted for each patient. Three psychiatrists had a process of standardization whereby they discussed and agreed on the content of the interviews before they were conducted. Depressive disorders could be classified as a major depressive disorder, a persistent depressive disorder (dysthymia), a depressive disorder due to another medical condition, another specified depressive disorder, or as an unspecified depressive disorder.

Data analysis

PASW Statistics for Windows, version 18.0 (SPSS Inc., Chicago, Ill., USA)²⁴ and MedCalc for Windows, version 15.0 (MedCalc Software, Ostend, Belgium)²⁵ were used for the statistical analyses. The demographic data, MRS scores, and PHQ-9 scores were summarized with descriptive statistics. Age was compared between groups with an independent-sample t-test, while stroke durations and Thai PHQ-9 scores were compared with the Mann–Whitney U test. Gender, education level, risk factors, stroke pathology, side of weakness, and MRS score were analyzed by chi-square tests.

The stroke patients were divided into normal and depression groups based on their psychiatric diagnoses. The psychiatrist determined the type of depressive disorder by using the relevant DSM-5 criteria. The depression scores of the normal and depression groups were analyzed by the independent-sample t-test. Statistical significance was set at a p-value of < 0.05. Internal consistency was analyzed by Cronbach’s alpha. The psychiatric diagnosis of depression, treated as a binary outcome, was used as the reference standard to calculate the sensitivity and specificity of every possible PHQ-9 cut-off score. The positive and negative predictive values as well as the positive and negative likelihood ratios were calculated for each PHQ-9 cut-off score. Receiver operating characteristic (ROC) analysis then summarized the sensitivity and specificity across all possible cut-off scores in a single measure, the area under the curve (AUC).
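
As a rough illustration of the accuracy measures named above, the sketch below derives them from a 2×2 table of index-test results against the reference standard. The function name and the counts are hypothetical and are not taken from the study data; the formulas are the standard definitions.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Compute standard accuracy measures from a 2x2 table.

    tp/fn: reference-standard positives with a positive/negative index test.
    fp/tn: reference-standard negatives with a positive/negative index test.
    """
    if min(tp + fn, fp + tn, tp + fp, fn + tn) == 0:
        raise ValueError("every row and column of the 2x2 table must be non-empty")
    sens = tp / (tp + fn)
    spec = tn / (tn + fp)
    return {
        "sensitivity": sens,
        "specificity": spec,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        # Likelihood ratios are undefined at the extremes; report infinity there.
        "lr_positive": sens / (1 - spec) if spec < 1 else float("inf"),
        "lr_negative": (1 - sens) / spec if spec > 0 else float("inf"),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "youden_index": sens + spec - 1,
    }


# Hypothetical counts, for illustration only.
for name, value in diagnostic_metrics(tp=40, fp=10, fn=10, tn=40).items():
    print(f"{name}: {value:.2f}")
```

Repeating this calculation at each candidate cut-off gives one row per cut-off, which is the layout used when comparing thresholds.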

Results

In all, 190 stroke patients were approached for participation. Seventy-five of those were excluded: 21 had recurrent stroke, 17 had cognitive impairment, 17 had aphasia, 10 were < 45 years, and 10 had a stroke duration > 2 years (Fig. 1). After applying the exclusion criteria, 115 stroke patients were enrolled. They comprised 63 males (54.8%) and 52 females (45.2%), with a mean age of 64 years (SD: 10 years; min, max: 45, 88). Primary school was the most common education level, followed by lower-secondary school and upper-secondary school. The comorbid illnesses found were, in descending order of frequency, hypertension, dyslipidemia, diabetes mellitus, and heart disease. The median duration of stroke was 59 days. The large majority of patients (81.7%) had an ischemic stroke, and left-side weakness predominated (61%). Most patients (65.2%) were recruited from inpatient rehabilitation.

All patients were administered the PHQ-9 as the index test. The reference standard was the psychiatric interview conducted on the same day, with the resultant diagnosis based on the DSM-5 criteria. The psychiatrist who administered the interview was blinded to the corresponding score for the index test, and all interviews were conducted regardless of the index test scores. The mean Thai PHQ-9 score was 5.2 ± 4.8. According to the DSM-5 criteria, 23 patients (20%) were diagnosed with PSD, whereas 92 patients (80%) were normal. In the PSD group, eight (6.9%) were classified as having a major depressive disorder, two (1.7%) with an unspecified depressive disorder, and one (0.9%) with another specified depressive disorder. The remaining 12 patients (10.5%) were diagnosed as having an adjustment disorder with a depressed mood.

Table 1. The baseline characteristics of the stroke patients

Variables	Normal (N=92)	PSD (N=23)	P-value
Demographic-related
Age¹	64.7 (9.5)	64.6 (12.2)	0.960
Gender²			0.092
  Male	54 (58.7)	9 (39.1)
  Female	38 (41.3)	14 (60.9)
Education level²			0.430
  Primary school	42 (45.7)	13 (56.6)
  Secondary school	26 (28.3)	5 (21.7)
  Bachelor degree and higher	24 (26.0)	5 (21.7)
Comorbid illness²
  Hypertension	77 (83.7)	21 (91.3)	0.518
  Dyslipidemia	53 (57.6)	17 (73.9)	0.152
  Diabetes mellitus	37 (40.2)	12 (52.2)	0.300
  Smoking	21 (22.8)	4 (17.4)	0.572
  Heart disease	19 (20.7)	6 (26.1)	0.572
Duration of stroke²			0.293
  < 3 months	58 (63.0)	16 (69.6)
  3–6 months	14 (15.2)	5 (21.7)
  > 6 months	20 (21.7)	2 (8.7)
Pathology of stroke²			0.561
  Infarction	74 (80.4)	20 (87.0)
  Hemorrhage	18 (19.6)	3 (13.0)
Side of weakness²			0.339
  Left	54 (58.7)	16 (69.6)
  Right	38 (41.3)	7 (30.4)
Setting²			0.793
  Inpatient	60 (65.2)	15 (65.2)
  Outpatient	32 (34.8)	8 (34.8)
Disability-related
Modified Rankin Scale²			0.036*
  1	7 (7.6)	2 (8.7)
  2	16 (17.4)	0 (0.0)
  3	18 (19.6)	3 (13.0)
  4	50 (54.3)	15 (65.2)
  5	1 (1.1)	3 (13.0)
Depression-related
Median PHQ-9 score³	4.0 (0.5, 5.75)	10.0 (7.0, 15.0)	<0.001*

¹ Mean (SD); ² number (%); ³ median (25th, 75th percentile); * significant at p-value < 0.05

The demographic characteristics of the normal and depression groups showed no statistically significant differences (Table 1). However, the MRS and the median PHQ-9 scores of the groups differed significantly. MRS scores of 0–3 were defined as non-severe disability, while MRS scores > 3 were defined as severe disability; a larger proportion of patients were severely disabled in the depression group (78%) than in the normal group (55.4%).

Reliability and item analysis

As presented in Table 2, the highest mean score of the nine PHQ-9 items was found for Item 3 (“trouble falling or staying asleep, or sleeping too much”). Item 9 (“thoughts that you would be better off dead or of hurting yourself”) had the lowest mean score. As to the internal consistency of the PHQ-9, Cronbach’s alpha was 0.78. Deleting any single item would decrease the total scale alpha. The lowest item-total correlation was for Item 5 (“poor appetite or overeating”).

Table 2. Mean score, standard deviation, and internal reliability statistics for each PHQ-9 item

PHQ-9 items	Mean	Standard deviation	Corrected item-total correlation	Cronbach's alpha if item deleted
1. Little interest or pleasure in doing things	0.72	0.881	0.612	0.708
2. Feeling down, depressed, or hopeless	0.64	0.926	0.516	0.723
3. Trouble falling or staying asleep, or sleeping too much	1.11	1.256	0.404	0.749
4. Feeling tired or having little energy	0.68	0.984	0.321	0.755
5. Poor appetite or overeating	0.47	0.955	0.199	0.773
6. Feeling bad about yourself – or that you are a failure	0.71	1.015	0.612	0.704
7. Trouble concentrating on things	0.27	0.641	0.345	0.749
8. Moving or speaking so slowly that other people have noticed	0.35	0.731	0.555	0.722
9. Thoughts that you would be better off dead or of hurting yourself	0.25	0.662	0.525	0.729
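
For readers unfamiliar with the statistic, Cronbach’s alpha can be computed from item-level responses as k/(k − 1) × (1 − Σ item variances / variance of the total score), where k is the number of items. The sketch below is a minimal illustration with a made-up response matrix; the study’s item-level data are not reproduced here, and the function name is hypothetical.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(responses: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for a matrix with one row per respondent
    and one column per questionnaire item (sample variances)."""
    if len(responses) < 2 or len(responses[0]) < 2:
        raise ValueError("need at least two respondents and two items")
    k = len(responses[0])
    if any(len(row) != k for row in responses):
        raise ValueError("every respondent must answer the same number of items")
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Made-up responses from five hypothetical respondents to three items (0-3 each).
toy_responses = [
    [0, 1, 0],
    [1, 1, 2],
    [2, 2, 1],
    [3, 2, 3],
    [1, 0, 1],
]
print(round(cronbach_alpha(toy_responses), 2))
```

The “alpha if item deleted” column of Table 2 corresponds to rerunning the same calculation with one column removed at a time.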

Validity analysis

A comparison was made of the performance of the Thai PHQ-9 against the diagnosis of depressive disorder (based on the DSM-5 criteria for depressive disorders as the standard). According to the DSM-5 criteria, 23 patients (20%) met the diagnosis of PSD. The median Thai PHQ-9 score for the depression group was 10 (IQR 25%, 75%: 7, 15) whereas the median score of the normal group was 4 (IQR 25%, 75%: 0.5, 5.75). The differences in the median PHQ-9 scores of the 2 groups were statistically significant.

Table 3. The performance of different PHQ-9 cut-off scores in detecting depression

Score	Sensitivity (%) (95% CI)	Specificity (%) (95% CI)	Positive predictive value (%) (95% CI)	Negative predictive value (%) (95% CI)	Positive likelihood ratio (95% CI)	Negative likelihood ratio (95% CI)	Accuracy (95% CI)	Youden’s index
The algorithm-based diagnosis
≥ 10	34.8 (16.4, 57.3)	97.8 (92.4, 99.7)	80.0 (47.6, 94.6)	85.7 (81.6, 89.0)	16.0 (3.6, 70.3)	85.7 (81.6, 89.0)	85.2 (77.4, 91.2)	-----
The summed-item-based diagnosis
≥ 5	91.3 (71.9, 98.9)	65.2 (54.6, 74.8)	39.6 (32.6, 47.2)	96.8 (88.8, 99.1)	2.62 (1.9, 3.6)	0.13 (0.04, 0.5)	70.4 (61.2, 78.6)	0.565
≥ 6	87.0 (66.4, 97.2)	75.0 (64.9, 83.4)	46.5 (37.1, 56.2)	95.8 (88.8, 98.5)	3.5 (2.4, 5.1)	0.2 (0.1, 0.5)	77.4 (68.6, 84.7)	0.620
≥ 7	78.3 (56.3, 92.5)	81.5 (72.1, 88.8)	51.4 (39.6, 63.1)	93.8 (87.3, 97.0)	4.2 (2.6, 6.8)	0.3 (0.1, 0.6)	80.9 (72.5, 87.6)	0.598
≥ 8	65.2 (42.7, 83.6)	83.7 (74.5, 90.6)	50.0 (36.6, 63.4)	90.6 (84.5, 94.4)	4.0 (2.3, 6.9)	0.42 (0.2, 0.7)	80.0 (71.5, 86.9)	0.489
≥ 9	56.5 (34.5, 76.8)	90.2 (82.2, 95.4)	59.1 (41.4, 74.7)	89.3 (83.8, 93.0)	5.8 (2.8, 11.8)	0.5 (0.3, 0.8)	83.5 (75.4, 89.7)	0.467
≥ 10	52.2 (30.6, 73.2)	94.6 (87.7, 98.2)	70.6 (48.4, 85.9)	88.8 (83.7, 92.4)	9.6 (3.7, 24.5)	0.5 (0.3, 0.8)	86.1 (78.4, 91.8)	0.467

When using the algorithm-based diagnosis, an assessment of the validity of the Thai PHQ-9 index test revealed a sensitivity of 34.8%, specificity of 97.8%, positive predictive value of 80%, negative predictive value of 85.7%, and positive likelihood ratio of 16.0 (Table 3). As to the summed-scored-based diagnosis, the corresponding values for different PHQ-9 thresholds in diagnosing PSD are also detailed in Table 3. The cut-off score of 6 showed the highest Youden’s index. This cut-off score had a sensitivity of 87.0% (95% CI: 66.4, 97.2), specificity of 75.0% (95% CI: 64.9, 83.4), positive predictive value of 46.5% (95% CI: 37.1, 56.2), negative predictive value of 95.8% (95% CI: 88.8, 98.5), positive likelihood ratio of 3.5 (95% CI: 2.4, 5.1), and negative likelihood ratio of 0.2 (95% CI: 0.1, 0.5). The ROC curve illustrates that the PHQ-9 performed well in identifying patients with PSD (Figure 2). The AUC in our study was 0.87 (95% CI: 0.78, 0.96), which represents good discrimination.
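
The choice of cut-off can be retraced from the published figures. The sketch below copies the rounded sensitivities and specificities of the summed-score rows in Table 3, computes Youden’s index (sensitivity + specificity − 1) for each, and then recovers the predictive values at the selected cut-off from the 20% prevalence via Bayes’ theorem. The function names are hypothetical, and because the inputs are rounded percentages, the last decimal can differ slightly from the table.

```python
# (cut-off, sensitivity, specificity) as proportions, copied from Table 3.
TABLE3_SUMMED = [
    (5, 0.913, 0.652),
    (6, 0.870, 0.750),
    (7, 0.783, 0.815),
    (8, 0.652, 0.837),
    (9, 0.565, 0.902),
    (10, 0.522, 0.946),
]


def youden_index(sens: float, spec: float) -> float:
    """Youden's J statistic."""
    return sens + spec - 1.0


def best_cutoff(rows) -> int:
    """Return the cut-off whose row has the largest Youden's index."""
    return max(rows, key=lambda row: youden_index(row[1], row[2]))[0]


def predictive_values(sens: float, spec: float, prevalence: float):
    """Positive and negative predictive values via Bayes' theorem."""
    ppv = sens * prevalence / (sens * prevalence + (1 - spec) * (1 - prevalence))
    npv = spec * (1 - prevalence) / (spec * (1 - prevalence) + (1 - sens) * prevalence)
    return ppv, npv


for cutoff, sens, spec in TABLE3_SUMMED:
    print(f">= {cutoff}: J = {youden_index(sens, spec):.3f}")

print("best cut-off:", best_cutoff(TABLE3_SUMMED))  # best cut-off: 6

ppv, npv = predictive_values(0.870, 0.750, prevalence=0.20)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

The same `predictive_values` helper shows how the modest positive predictive value follows from the 20% prevalence: in a setting with a different prevalence of PSD, the same sensitivity and specificity would give different predictive values.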

Discussion

This study was the first in Thailand to determine the validity of a depression screening questionnaire in stroke patients. The questionnaire investigated was the PHQ-9, which has been reported to be a good screening tool for PSD.¹⁴ The reference standard was a psychiatric interview based on the DSM-5 criteria for depressive disorders. In this study, the validity of the PHQ-9 in screening for PSD was good in terms of its discriminatory power (AUC: 0.87) relative to the gold-standard DSM-5 criteria. In addition, its internal consistency was acceptable (Cronbach’s alpha: 0.78).

Twelve patients were diagnosed with an adjustment disorder with depressed mood. In clinical practice, such stroke patients are usually given antidepressant medications to help them adjust to their physical disability. Although adjustment disorders are a separate diagnostic entity from depressive disorders, this study included the cases of adjustment disorder with depressed mood in the PSD group. PSD was found in 23 patients (20%), which was lower than the corresponding figures reported by other studies. A meta-analysis conducted by Hackett and Pickles²⁶ found that 31% of stroke patients developed depression or depressive symptoms in any setting and at any time up to 5 years following their stroke. Robinson²⁷ undertook a pooled analysis and reported mean incidences for major and minor depression of 19.3% and 18.5%, respectively, among hospitalized patients in acute care and rehabilitation hospitals. By comparison, the low incidence in the present study probably stemmed from the criterion that only stroke patients aged ≥ 45 years would be included. Previous research has found that younger stroke survivors are more likely to become depressed than older survivors.²⁸,²⁹ Nevertheless, the incidence established by the current study is in line with that of research by Fuentes et al., which recruited stroke patients of the same age group and found a low depression incidence of 9.9%.³⁰

Regarding the demographic characteristics of stroke patients with and without PSD, our study revealed no significant differences in the demographic-related variables of the groups. For the disability-related variable, the MRS was used to determine the level of disability after stroke. Patients with an MRS score > 3, who were classified as having a severe disability, appeared more frequently in the depression group. PSD has been found to be associated with more severe neurological deficits and physical disabilities in the acute and chronic phases.³¹,³²

The internal consistency of the Thai PHQ-9 administered to the stroke patients in this study was 0.78, which is considered acceptable. However, this level of internal consistency was lower than that of the original version of the PHQ-9. The original studies, performed in primary care and in obstetrics and gynecology settings, showed an internal consistency of 0.89 and 0.86, respectively.¹⁶ In addition, Turner et al., who utilized the PHQ-9 to screen for PSD, found an internal consistency of 0.82.¹³ In the case of the Thai version of the PHQ-9, a validity study on the Thai population reported an internal consistency of 0.79.¹⁵ Later, Lee and Dajpratham, who employed the Thai version on elderly Thais, reported an internal consistency of 0.76.³³ The value of 0.78 found in the current research is therefore closely in line with those two earlier studies using the Thai version of the PHQ-9.

The Thai PHQ-9 can be used as a screening tool since the AUC showed a good level of discriminatory power (AUC: 0.87). The results of our study are in line with several other investigations that have reported a good discriminatory power for the PHQ-9, with an AUC of > 0.8.¹³,¹⁷,³⁴⁻³⁶ As to its validity, the PHQ-9 score can be used in 2 ways to diagnose depression. The first is an algorithm-based diagnosis for major depression, with a cut-off score of 10. In 2015, Manea et al.³⁷ conducted a diagnostic meta-analysis of the PHQ-9 algorithm-based scoring method as a screening tool for depression. They found that although the sensitivity was as low as 53% (95% CI: 42–65), the specificity was as high as 94% (95% CI: 91–96). Our study applied the algorithm-based diagnosis for PSD in a tertiary-hospital setting. Our evaluation of the diagnostic accuracy revealed low sensitivity and high specificity (Table 3), consistent with the results of the work by Manea et al.³⁷ Low sensitivity is not a good property of a screening tool. Therefore, all of the previous PHQ-9 validation studies for the detection of PSD have used the alternative diagnostic approach, summed-scored-based diagnosis, for their comparisons with various structured interviews as their reference standard.¹³,¹⁷,³⁴,³⁶,³⁸ Pettersson et al.³⁹ performed a systematic review to explore the diagnostic accuracy of the structured interviews as index tests. The only structured interviews which were found to have sufficient accuracy for the diagnosis of depressive disorders were the Structured Clinical Interview for DSM-IV (SCID) and the Mini International Neuropsychiatric Interview (MINI). The summed-scored-based PHQ-9 diagnoses in the current research were validated against the psychiatric interviews that were based on DSM-5 criteria. Our analysis revealed an optimum cut-off score of 6 for the diagnosis of depression. This finding differed from those of other studies.¹³,¹⁷ Turner et al.¹³ validated the PHQ-9 for the detection of PSD against the DSM-IV criteria; they reported a summed score greater than 8 as the cut-off score for diagnosis. Similarly, Williams et al.¹⁷ reported a summed score of 10 or greater as the cut-off score for diagnosis.

There were some limitations to this study. Firstly, the high mean age of the participants, 64 years, means that the findings cannot be generalized to younger stroke patients. However, the incidence of stroke at a younger age is lower, and younger patients represent only a small proportion of those seen in clinical practice. Another limitation is that only participants who could communicate were recruited. Stroke patients who are unable to communicate may well be more severely depressed, and the mood assessment scales used for such patients are different. Finally, this study did not assess test–retest reliability; consequently, the temporal stability of the measure for Thai people with a stroke is presently unknown.

Conclusion

The Thai version of the PHQ-9 had good validity and acceptable reliability for the screening of PSD. The summed-scored-based depression diagnosis should therefore be employed for screening, with a cut-off score of ≥ 6 signifying PSD.

Abbreviations

AUC: area under the curve; MINI: Mini International Neuropsychiatric Interview; PHQ-9: Patient Health Questionnaire-9; PSD: post-stroke depression; ROC: receiver operating characteristic; SCID: Structured Clinical Interview for DSM-IV

Declarations

Ethics approval and consent to participate

Ethics approval was obtained from the Medical Ethics Committee of the Human Research Protection Unit, Faculty of Medicine Siriraj Hospital (COA no. 623/2017). All participants gave their written consent to participate.

Consent for publication

Not applicable.

Availability of data and materials

All data generated or analyzed during this study are included in this published article and its supplementary information files.

Competing interests

The authors declare that they have no competing interests.

Funding

No funding was obtained for this study.

Authors’ contributions

PD conceived the study, designed the protocol, analyzed the data, and prepared the manuscript. PP participated in the study design, designed the protocol, assisted with the data collection, and commented on the manuscript. WA and KW participated in the study design, assisted with the data collection, and commented on the manuscript. JB and KP designed the protocol and commented on the manuscript. All authors read and approved the final version of the manuscript.

Acknowledgements

The authors thank Dr. Chulaluk Komoltri and Mr. Sutthipol Udompunturak for their assistance with the statistical analyses.

References

1. Rogers S. Poststroke depression screening: an executive summary. J Neurosci Nurs. 2017;49(2):66–8.
2. Hackett M, Pickles K. Part I: frequency of depression after stroke: an updated systematic review and meta-analysis of observational studies. Int J Stroke. 2014;9:1017–25.
3. Werheid K. A two-phase pathogenic model of depression after stroke. Gerontology. 2016;62:33–9.
4. Rigler S. Management of poststroke depression in older people. Clin Geriatr Med. 1999;15(4):765–83.
5. Sugawara N, Metoki N, Hagii J, Saito S, Shiroto H, Tomita T, et al. Effect of depressive symptoms on the length of hospital stay among patients hospitalized for acute stroke in Japan. Neuropsychiatr Dis Treat. 2015;11:2551–6.
6. Gillen R, Tennen H, McKee T. Depressive symptoms and history of depression predict rehabilitation efficiency in stroke patients. Arch Phys Med Rehabil. 2001;82:1645–9.
7. Arwert H, Meesters J, Boiten J, Balk F, Wolterbeek R, Vlieland TV. Poststroke depression: a long-term problem for stroke survivors. Am J Phys Med Rehabil. 2018;97(8):565–71.
8. Serrano S, Domingo J, Rodríguez-Garcia E, Castro M, Ser Td. Frequency of cognitive impairment without dementia in patients with stroke: a two-year follow-up study. Stroke. 2007;38(1):105–10.
9. Bartoli F, Lillia N, Lax A, Crocamo C, Mantero V, Carrà G, et al. Depression after stroke and risk of mortality: a systematic review and meta-analysis. Stroke Res Treat. 2013;2013:862978.
10. Lipsey J, Spencer W, Rabins P, Robinson R. Phenomenological comparison of functional and poststroke depression. Am J Psychiatry. 1986;143:527–9.
11. National Institute for Health and Care Excellence. Stroke rehabilitation: long term rehabilitation after stroke [Clinical guideline 162. Methods, evidence and recommendations]. 2013 [updated 29 May 2013; cited 2016 Nov 21]. Available from: https://www.nice.org.uk/guidance/cg162/resources/cg162-stroke-rehabilitation-full-guideline3.
12. Miller E, Murray L, Richards L, Zorowitz R, Bakas T, Clark P, et al. Comprehensive overview of nursing and interdisciplinary rehabilitation care of the stroke patient: a scientific statement from the American Heart Association. Stroke. 2010;41:2402–48.
13. Turner A, Hambridge J, White J, Carter G, Clover K, Nelson L, et al. Depression screening in stroke: a comparison of alternative measures with the structured diagnostic interview for the diagnostic and statistical manual of mental disorders, fourth edition (major depressive episode) as criterion standard. Stroke. 2012;43(4):1000–5.
14. Meader N, Moe-Byrne T, Llewellyn A, Mitchell A. Screening for poststroke major depression: a meta-analysis of diagnostic validity studies. J Neurol Neurosurg Psychiatry. 2014;85:198–206.
15. Lotrakul M, Sumrithe S, Saipanish R. Reliability and validity of the Thai version of the PHQ-9. BMC Psychiatry. 2008;8:46–52.
16. Kroenke K, Spitzer R, Williams J. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16(9):606–13.
17. Williams L, Brizendine E, Plue L, Bakas T, Tu W, Hendrie H, et al. Performance of the PHQ-9 as a screening tool for depression after stroke. Stroke. 2005;36(3):635–8.
18. WHO MONICA Project Principal Investigators. The World Health Organization MONICA projects (Monitoring trends and determinants in cardiovascular disease). J Clin Epidemiol. 1988;41:105–14.
19. Train the brain forum committee. Thai Mental State Examination. Siriraj Hosp Gaz. 1993;45:359–74.
20. Rankin J. Cerebral vascular accidents in patients over the age of 60. II. Prognosis. Scott Med J. 1957;2:200–15.
21. Swieten J, Koudstaal P, Visser M, Schouten H, Gijn J. Interobserver agreement for the assessment of handicap in stroke patients. Stroke. 1988;19:604–7.
22. Sulter G, Steen C, Keyser JD. Use of the Barthel index and modified Rankin scale in acute stroke trials. Stroke. 1999;30(8):1538–41.
23. American Psychiatric Association. Diagnostic and statistical manual of mental disorders: DSM-5. 5th ed. Washington DC: American Psychiatric Press, Inc; 2013.
24. SPSS Inc. Released PASW Statistics for Windows, Version 18.0. Chicago: SPSS Inc.
25. MedCalc Statistical Software version 16.4.3. MedCalc Software bvba, Ostend, Belgium; http://www.medcalc.org; 2016.
26. Hackett M, Pickles K. Part I: frequency of depression after stroke: an updated systematic review and meta-analysis of observational studies. Int J Stroke. 2014;9:1017–25.
27. Robinson R. Poststroke depression: prevalence, diagnosis, treatment and disease progression. Biol Psychiatry. 2003;54:376–87.
28. Barker-Collo S. Depression and anxiety 3 months poststroke: prevalence and correlates. Arch Clin Neuropsychol. 2007;22:519–31.
29. Chatterjee K, Fall S, Barer D. Mood after stroke: a case control study of biochemical, neuro-imaging and socio-economic risk factors for major depression in stroke survivors. BMC Neurol. 2010;10:125–34.
30. Fuentes B, Ortiz X, Sanjose B, Frank A, Díez-Tejedor E. Post-stroke depression: can we predict its development from the acute stroke phase? Acta Neurol Scand. 2009;120(3):150–6.
31. Kutlubaev M, Hackett M. Part II: predictors of depression after stroke and impact of depression on stroke outcome: an updated systematic review of observational studies. Int J Stroke. 2014;9(8):1026–36.
32. Blöchl M, Meissner S, Nestler S. Does depression after stroke negatively influence physical disability? A systematic review and meta-analysis of longitudinal studies. J Affect Disord. 2019;15(247):45–56.
33. Lee S, Dajpratham P. Criterion validity of the Thai version of the PHQ-9 and the PHQ-2 for screening major depression in Thai elderly. J Thai Rehabil. 2017;27(1):30–7.
34. deMan-vanGinkel J, Hafsteinsdóttir T, Lindeman E, Burger H, Grobbee D, Schuurmans M. An efficient way to detect poststroke depression by subsequent administration of a 9-item and a 2-item Patient Health Questionnaire. Stroke. 2012;43(3):854–6.
35. Wang Z, Zhu M, Su Z, Guan B, Wang A, Wang Y, et al. Post-stroke depression: different characteristics based on follow-up stage and gender-a cohort perspective study from Mainland China. Neurol Res. 2017;39(11):996–1005.
36. Prisnie J, Fiest K, Coutts S, Patten S, Atta C, Blaikie L, et al. Validating screening tools for depression in stroke and transient ischemic attack patients. Int J Psychiatry Med. 2016;51(3):262–77.
37. Manea L, Gilbody S, McMillan D. A diagnostic meta-analysis of the Patient Health Questionnaire-9 (PHQ-9) algorithm scoring method as a screening for depression. Gen Hosp Psychiatry. 2015;37(1):67–75.
38. Wang E, Meyer C, Graham G, Whooley M. Evaluating screening tests for depression in post-stroke older patients. J Geriatr Psychiatry Neurol. 2018;31(3):129–35.
39. Pettersson A, Bostrom K, Gustavsson P, Ekselius L. Which instruments to support diagnosis of depression have sufficient accuracy: a systematic review. Nord J Psychiatry. 2015;69(7):497–508.

Editorial decision: Major revision
08 Apr, 2020
Review #1 received at journal
02 Apr, 2020
Reviewer #1 agreed at journal
19 Mar, 2020
Editor assigned by journal
28 Feb, 2020
Reviewers invited by journal
28 Feb, 2020
Submission checks completed at journal
27 Feb, 2020
Editor invited by journal
27 Feb, 2020
