Collaborative arrangements
The SBP was established through a collaboration between the Karolinska Institutet and the St. Göran bipolar outpatient clinic at the Northern Stockholm Psychiatry in 2005. Subsequently, in 2009, a second study center was started through a partnership between the Sahlgrenska University hospital in Gothenburg and the Institute of Neuroscience and Physiology at the University of Gothenburg.
Recruitment approach
Participants in the SBP study were consecutively enrolled from new or ongoing individuals treated at the respective bipolar outpatient clinics in Stockholm (SBP-S) and Gothenburg (SBP-G), Sweden. Both these outpatient units provide long-term (usually lifelong) follow-up care for individuals diagnosed with bipolar disorder. Individuals were diagnosed after an initial evaluation using the Affective disorder evaluation (ADE). Individuals meeting the inclusion criteria (see below) were eligible for inclusion. Prior to inclusion, the study’s objectives, procedures, and the risks and benefits were presented to potential participants. The voluntary nature of research participation was explained, along with a presentation of the alternatives for non-research treatment. Patients retained the right to withdraw their consent or exit the study at any time during their participation.
For each included patient, seven age and sex-matched controls were randomly selected by Statistics Sweden and contacted by mail. Out of the controls that received the invitation mail, 14% reached out to the research team, a response rate that is on par with similar studies according to Statistics Sweden. A telephone screening was conducted by trained research nurses to exclude persons with severe mental health issues, neurological problems, and severe substance abuse. Persons who passed the telephone screening were scheduled for a 1-day comprehensive assessment as part of the study.
Baseline study procedures
The study procedures were performed when patients were judged to be in a mood stable state. At the visit for blood sampling and lumbar puncture, study participants were assessed using the Montgomery-Åsberg Depression Rating Scale (MADRS) [2] and Young Ziegler Mania Rating Scale (YMRS) [3]. From 2012 and onwards, MADRS and YMRS were also administered in conjunction with the neuropsychological test sessions.
The structured interview instrument ADE (the Affective Disorder Evaluation) was used in the STEP-BD project [1]. The ADE includes a social anamnesis, medical history, and the affective module of the Structured Clinical Interview for the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV). The number and characteristics of lifetime affective episodes are documented. With the permission of the originator Gary S. Sachs (Bipolar Clinic and Research Program, Harvard Medical School, Boston, MA, USA), we modified the ADE to suit Swedish conditions. Specifically, we added a more detailed assessment of participants’ occupational level including details about sick- leave. We also included questions pertaining to the Swedish weapon law, as well as details about non-pharmacological treatment including electroconvulsive therapy, psychotherapy, and patient education. The structured interview instrument, referred to a as the Patient History Form, underwent minor refinements during the study and a more thorough revision was undertaken in 2014.
Further, the structured the Mini International Neuropsychiatry Interview (M.I.N.I.) [4] was administered to screen for other psychiatric diagnoses than bipolar disorder. The information for the Patient history form and the M.I.N.I. interviews were collected by board-certified psychiatrists, residents in psychiatry or a specialist nurse. Assessments were supplemented with information from medical records and interviews with next of kin when needed. A final best-estimate diagnostic decision was made at a diagnostic case-conference by a consensus panel (n = 2–5) of experienced board-certified psychiatrists specialized in bipolar disorder.
Participants also filled out self-report versions of the Alcohol Use Disorders Identification Test (AUDIT) [5] and the Drug Use Disorders Identification Test (DUDIT) [6]. Furthermore, both the symptom and function domains of the Global Assessment of Functioning (GAF) scales were scored to assess the overall psychosocial functioning.
Controls completed a structured interview to collect information on dwelling, educational and occupational level including details about sick-leave, questions pertaining to the Swedish weapon law, family history of psychiatric disorders, as well as reproductive and medical history. In addition, symptom and function domains of the Global Assessment of Functioning (GAF) scales were scored to assess the overall psychosocial functioning. The M.I.N.I. interview was administered to exclude psychiatric morbidity and SCID-II was administered to screen for personality disorders. Substance abuse was screened for at the telephone interview by the nurse, in the psychiatric interview, by AUDIT and DUDIT, as well as by determining serum levels of carbohydrate-deficient transferrin.
Inclusion and exclusion criteria
The inclusion criteria for patient participation in the SBP were as follows: 1) age of 16 years or older; 2) a diagnosis meeting the DSM-IV Criteria for bipolar I, bipolar II, bipolar not otherwise specified (NOS), cyclothymia, or schizoaffective disorder of the manic / bipolar subtype, and 3) completed the basic diagnostic Affective disorder evaluation. Exclusion criteria were: 1) Unwilling or unable to comply with study requirements or 2) Not competent to give informed consent in the opinion of the investigator. Notably, participants were not obliged to participate in all the study’s investigations. They had the option to choose which components of the study they wished to participate in.
Overconsumption of alcohol as revealed by carbohydrate-deficient transferrin (> 2.0%) or responses indicating large consumption (> 8 standard drinks per time more than 2 times per week) and/or amnesia and/or loss of control more than once per month were exclusion criteria for controls. Other exclusion criteria were neurological conditions (other than mild migraine), untreated endocrinological disorders, pregnancy, dementia, recurrent depressive disorder, and suspected severe personality disorders (based on interview and SCID-II personality assessment), and a family history of schizophrenia or bipolar disorder in first-degree relatives. We did not exclude controls with past minor depressive episodes, isolated episodes of panic disorder, eating disorders, or obsessive-compulsive disorder that had remitted spontaneously or with brief psychotherapy counselling.
Follow-up visit study procedures
The aim of the study was to conduct follow-up assessments with study participants at 7 and 14 years after their baseline visits. The time interval was chosen to balance study site workload, provide sufficient follow up length, and have an acceptable rate of study dropouts. The Patient History Form was modified for these follow-up visits. Changes included the addition of data collection regarding medication, episode relapse, inpatient care, suicide attempts that occurred during the follow-up period. Controls completed a shortened version of the modified Patient History Form at the follow-up visits.
Self-reported rating scales
The St. Göran study incorporates several self-reported rating scales. While some of these scales have been consistently used from study start throughout follow-up visits, other rating scales have been introduced or existing ones removed during the study. Table 1 provides an overview of rating scales across visits for both study centres.
Table 1
– Self-assessment rating scales in the St Göran project
| SBP-S | SBP-G |
| Patients | Controls | Patients | Controls |
Instrument | Baseline | 7 y | 14 y | Baseline | 7 y | 14 y | Baseline | 7 y | Baseline | 7 y |
Alda scale | no | yes1 | yes | - | - | - | no | yes1 | - | - |
BWAS | yes1 | yes1 | yes1 | yes | no | no | yes1 | yes | no | no |
BIS-4 | no | no | yes | no | yes2 | yes | no | no | no | no |
BRIEF-A | no | yes1 | yes | no | no | yes | no | no | no | no |
BSRI | no | no | no | no | yes | no | no | no | no | no |
DES | yes1 | yes | no | yes | no | no | yes | no | yes | no |
EHI | yes1 | yes1 | no | yes | no | no | yes | no | yes | no |
EQ-5D | no | yes | yes | no | yes | yes | yes | yes | no | yes |
ISEL | yes | no | no | yes | no | no | no | no | no | no |
Kinsey scale | no | yes | yes1 | no | yes | no | no | no | no | no |
KSQ | no | yes | no | no | yes | no | yes | yes | no | yes |
PDI | yes1 | yes | no | yes | yes | no | yes | yes | no | yes |
PGWB | no | yes | yes | no | yes | yes | no | no | no | no |
SCID-II | yes | yes | no | yes | no | no | yes | yes | yes | yes |
SDS | yes | yes | yes | no | yes | yes | no | no | no | no |
CAQ | yes1 | no | yes | yes | no | no | yes | no | no | no |
SOC | yes | yes1 | yes | yes | no | yes | no | no | no | no |
SPAQ | yes | no | no | - | - | - | no | no | - | - |
SSP | yes | yes | no | yes | no | no | yes | yes | yes | yes |
STAI-S | no | yes1 | yes | no | no | yes | no | no | no | no |
TEMPS-A | yes1 | yes | no | yes | no | no | yes | yes | yes | yes |
Table legend: Alda scale - The Retrospective Assessment of the Lithium Response Phenotype Scale, BWAS - Barron Welsh Art Scale, BIS-4 - Berlin Intelligence Structure Model test, BRIEF-A - Behavior Rating Inventory of Executive Function - Adult Version, BSRI - Bem sex role inventory, DES - Dissociative Experiences Scale, EHI - Edinburgh Handedness Inventory, ISEL - Interpersonal Support Evaluation List, KSQ - Karolinska Sleep Questionnaire, PDI - Delusions Inventory, PGWB - Psychological Well-Being Index, SCID-II - Structured Clinical Interview for DSM-IV - Axis II disorders, SDS - Sheehan Disability Scale, CAQ - Creative Achievement Questionnaire, SOC - Sense of Coherence Scale, SPAQ - Seasonal Pattern Assessment Questionnaire, SSP - Swedish Universities Scales of Personality, STAI-S – Spielberger State-Trait Anxiety Inventory, and TEMPS-A - Temperament Evaluation of Memphis, Pisa, Paris, and San Diego Autoquestionnaire. 1Administered to a subset of patients and 2administered to a subset of controls. |
Delusional ideation and dissociative experiences were measured using the Peters et al. Delusions Inventory (PDI) [7] and the Dissociative Experiences Scale (DES) [8]. Anxiety was measured using the short version of the Spielberger State-Trait Anxiety Inventory (STAI-S) [9]. Patients in the SBP-S cohort were assessed using the Seasonal Pattern Assessment Questionnaire (SPAQ) [10] at baseline. The Edinburgh Handedness Inventory (EHI) [11] was used to ascertain handedness. Gender traits were assessed in the SBP-S control group at the 7-year follow-up using the Bem sex role inventory (BSRI) [12]. Sexual orientation was assessed using the Kinsey Scale [13]. Data on personality were collected using the Structured Clinical Interview for DSM-IV - Axis II disorders (SCID-II) [14], the Swedish Universities Scales of Personality (SSP) [15], and the Temperament Evaluation of Memphis, Pisa, Paris, and San Diego Autoquestionnaire (TEMPS-A) [16]. Subjective well-being was rated using the Psychological Well-Being Index (PGWB) [17]. Study persons view of life was rated using the Sense of Coherence Scale (SOC) [18]. Functional level and quality of life were assessed using the Sheehan Disability Scale (SDS) [19], Interpersonal Support Evaluation List (ISEL) [20], and EQ-5D questionnaires [21]. The Karolinska Sleep Questionnaire (KSQ) [22] was used to collect information on sleep difficulties. Creativity was measured using the Barron Welsh Art Scale (BWAS) [23], the Creative Achievement Questionnaire (CAQ) [24], and the Berlin Intelligence Structure Model test (BIS-4) [25]. Everyday behavior associated with executive function was assessed by the Behavior Rating Inventory of Executive Function - Adult Version (BRIEF-A) questionnaire [26]. Finally, lithium response was measured using the The Retrospective Assessment of the Lithium Response Phenotype Scale (Alda scale) [27].
2.2.6 Neuropsychiatric assessment
As shown in Table 2, the presence of autism spectrum traits was assessed using the Autism-Spectrum Quotient Test (AQ test) [28] and the Ritvo Autism and Asperger Diagnostic Scale – 14 Screen (RAADS-14) [29]. ADHD symptoms were rated using the Adult ADHD Self-Report Scale (ASRS) [30], the Brown Attention-Deficit Disorder Scales (BADDS) [31], and the Wender Utah Rating Scale (WURS) [32]. Further, controls in the SBP-S cohort completed the Barkley self-assessment test for ADHD symptoms [33] at the 7-year follow-up visit.
Table 2
– Neuropsychiatric rating scales used in the St. Göran project
| SBP-S | SBP-G |
| Patients | Controls | Patients | Controls |
Instrument | Baseline | 7 y | 14 y | Baseline | 7 y | 14 y | Baseline | 7 y | Baseline | 7 y |
AQ test | yes | yes1 | no | yes | no | no | no | no | no | no |
ASRS | yes | yes | no | yes | yes | yes2 | yes | yes | yes | yes |
Barkley self-assessment | no | no | no | no | yes | no | no | no | no | no |
BADDS | yes | yes | no | yes | no | yes2 | yes | yes | yes | yes |
RAADS-14 | no | yes | no | no | yes | no | yes | yes | no | yes |
WURS | yes | yes | no | yes | no | no | yes | yes | no | yes |
Table legend: AQ test – Autism-Spectrum Quotient Test, ASRS – Adult ADHD Self-Report Scale, BAADS – Brown Attention-Deficit Disorder Scales, RAADS – Ritvo Autism and Asperger Diagnostic Scale, and WURS – Wender Utah Rating Scale. 1Administered to a subset of patients and 2administered to a subset of controls. |
Neuropsychological tests
Study participants were assessed with a comprehensive neuropsychological test battery covering several cognitive domains [34]. In the SBP-S cohort, the tests included the Claeson-Dahl Verbal Learning and Retention test (CDT), the Color-Word Interference Test, the Design Fluency Test, the Tower Test, the Trail Making Test, and the Verbal Fluency Test from Delis-Kaplan Executive Function System (D-KEFS), the Wechsler Adult Intelligence Scale (WAIS version III or IV; all subtests except All the Object Assembly were administered), the Continuous Performance Test II, and the Rey Complex Figure Test (REY). To reduce the workload for study participants, certain tests were omitted in the SBP-G cohort (at baseline and follow-up) and for follow-up examinations in SBP-S. Table 3 details the administered neuropsychological tests to the patient and control groups across visits.
Table 3
– Neuropsychological tests used in the St. Göran Project
SBP-S | SBP-G |
| Baseline | 7-year follow-up | 14-year follow-up | Baseline | 7-year follow-up |
CDT | yes | yes1 | yes | no | no |
D-KEFS | yes | yes2 | yes3 | yes3 | yes3 |
WAIS | yes | yes | yes | yes | yes |
REY | yes | yes | yes | yes | yes |
CPT-II | yes | yes | yes1 | no | no |
Table legend: CDT – Claeson-Dahl Test, D-KEFS – Delis-Kaplan Executive Function System, WAIS – Wechsler Adult Intelligence Scale, REY – Rey Complex Figure Test, and CPT-II – Connors Continuous Performance Test 2nd edition. 1Administered to controls and a subset of patients, 2the Design Fluency Test and the Tower Test from D-KEFS were only administered to a subset of patients, 3the Design Fluency Test and the Tower Test from D-KEFS were excluded. |
Blood sampling and lumbar puncture
Blood and CSF sampling was performed on the same day. Participants were in a stable euthymic mood and fasted overnight before the blood and CSF collection. Blood samples were drawn between 8–9 AM and CSF samples between 9–11 AM. An identical procedure was performed for the controls.
Blood samples were allowed to clot in room temperature for 30–60 min pending centrifugation (10 min at 1700 × g). In Stockholm, the supernatant was kept in low temperature (< 5°C) pending direct transport to the biobank within 4 h for long-term storage at − 70°C. In Gothenburg, the supernatant was immediately stored in a local − 70°C freezer awaiting bulk transport to the biobank.
For CSF sampling via lumbar puncture, the spinal needle was inserted into either the L3/L4 or L4/L5 interspace. A total volume of 12 ml of the CSF was collected, gently inverted to prevent gradient effects, and divided into 1.0–1.6 ml aliquots that were stored at − 80°C pending analysis. In SBP-S, baseline samples were not centrifugated up to May 2011, but in SBP-G all samples were centrifugated. The procedure for CSF and blood sampling remained identical at the 7-year follow-up visit. At the 14-year follow-up visit, blood but not CSF was collected.
Blood and CSF samples are stored at the Karolinska Institutet Biobank.
Magnetic resonance imaging
At the time of MRI scanning, patients were in a mood stable state. MRI scans for the SBP-S cohort were conducted at the MR Research Centre, Karolinska University Hospital, Stockholm. All subjects were examined in the same scanner for the baseline assessment and the first follow-up on a 1.5-Tesla MRI scanner (General Electric Signa Excite 1.5T) equipped with an eight-channel head coil. At second follow-up, the MRI scanner has been changed to a 3-Tesla General Electric Discovery MR750. Coronal 3D T1-weighted images were obtained using a spoiled gradient echo recall sequence (3D-SPGR), with the following parameters: repetition time (TR) of 21.0 milliseconds, echo time (TE) of 6 milliseconds, field of view (FOV) of 18 cm, flip angle of 30°, acquisition matrix of 256×256, 128 slices, and voxel size of 0.7×0.7×1.8 mm³. Axial fluid attenuation inversion recovery T2-weighted scans were acquired for examination by a senior radiologist. This assessment aimed to exclude clinically significant anatomical abnormalities or neuropathology. During the study, the calibration filter for the scanner was changed. This led to variations in the intensity distribution among subjects scanned with different filters. At the first follow-up, the scanning protocol was repeated, and two additional sequences were added: a resting-state functional magnetic resonance imaging (fMRI) sequence and a diffusion tensor imaging (DTI) scan. The resting-state fMRI sequence lasted 7.5 minutes and included the acquisition of 180 volumes. Each volume consisted of 39 BOLD-sensitive T2-weighted axial echo-planar images each. The imaging parameters were as follows: resolution of 3.79 mm x 3.79 mm, slice thickness 4 mm covering the whole brain, TR = 2.5 seconds, TE = 40 milliseconds, FOV = 243 mm, and flip angle of 85°. The DTI protocol included TR = 7.39 seconds, TE = 85 milliseconds, FOV = 220 mm, flip angle 90°, acquisition matrix 96x96, voxel size 0.94 x 0.94 mm, slice thickness 2.3 mm.
For the SBP-G cohort, MRI scanning was initially conducted on a Philips Achieva dStream 3T scanner using a 32-channel head coil. The scanner was replaced with a Philips M7700 3T scanner in January 2022. We conducted a structural T1-weighted 3D turbo field echo (TFE) HR sequence (TR = 6 milliseconds, TE = 2.8 milliseconds, FOV 240 mm x 240 mm, flip angle 8°, acquisition matrix 242x232, 180 slices, voxel size 1x1x1 mm3), a resting-state fMRI sequence (resolution of 3.06 mm x 3.06 mm, slice thickness 3 mm, TR = 2.47 seconds, TE = 30 milliseconds, FOV = 220 mm x 220 mm, flip angle = 75°), and a DTI sequence (TR = 7.62 seconds, TE = 92.46 milliseconds, FOV 224 mm x 224 mm, flip angle 90°, acquisition matrix 96x98, voxel size 2.33 mm x2.29 mm, slice thickness 2.3 mm).