Frequency-specific changes in prefrontal activity associated with maladaptive belief updating in volatile environments in euthymic bipolar disorder

doi:10.21203/rs.3.rs-4202591/v1


Article


This work is licensed under a CC BY 4.0 License


Bipolar disorder (BD) involves altered reward processing and decision-making, with inconsistencies across studies. Here, we integrated hierarchical Bayesian modelling with magnetoencephalography (MEG) to characterise maladaptive belief updating in this condition. First, we determined if previously reported increased learning rates in BD stem from a heightened expectation of environmental changes. Additionally, we examined if this increased expectation speeds up belief updating in decision-making, associated with modulation of rhythmic neural activity within the prefrontal, orbitofrontal, and anterior cingulate cortex (PFC, OFC, ACC). Twenty-two euthymic BD and 27 healthy control (HC) participants completed a reward-based motor decision-making task in a volatile setting. Hierarchical Bayesian modelling revealed BD participants anticipated greater environmental volatility, resulting in a more stochastic mapping from beliefs to actions and paralleled by lower win rates and a reduced tendency to repeat rewarded actions than HC. Despite this, BD individuals adjusted their expectations of action-outcome contingencies more slowly, but both groups invigorated their actions similarly. On a neural level, while healthy individuals exhibited an alpha-beta suppression and gamma increase during belief updating, BD participants showed dampened effects, extending across the PFC, OFC, and ACC regions. This was accompanied by an abnormally increased beta-band directed information flow in BD. Overall, the results suggest euthymic BD individuals anticipate environmental change without adequately learning from it, contributing to maladaptive belief updating. Alterations in frequency-domain amplitude and functional connectivity within the PFC, OFC, and ACC during belief updating underlie the computational effects and could serve as potential indicators for predicting relapse in future research.

Biological sciences/Neuroscience

Biological sciences/Psychology/Human behaviour

Bipolar disorder (BD) is a chronic affective condition characterised by episodes of elation, depression, and mixed states, interspersed with periods of clinical remission[1, 2]. Alterations in reward processing and impaired decision-making performance have been associated with the condition[3, 4], pointing to disrupted functional connectivity between the prefrontal cortex (PFC) and the mesolimbic reward system[5, 6]. Yet findings across studies are variable, even when considering euthymic periods alone[5–7]. BD research has reported both heightened sensitivity to negative feedback and decreased learning from rewards, or the reverse[8, 10–13]. Recently, Ossola et al.[14] found that in euthymic BD, attenuated belief updating from positive feedback forecasts relapse, highlighting the importance of investigating dynamic belief updating during euthymia.

Influential proposals advocate for the application of computational models in BD research to understand fluctuations in mood and reward processing[15–17]. These models, building upon previous neurocomputational work on mood instability[18], suggest that altered affective reactivity to reward and punishment in BD may elevate learning rates, even during euthymic phases, predisposing individuals to form stronger expectations about rewards or punishments. New empirical work supports this, revealing a tendency in BD for reward perception to be biased by fluctuations in the momentum of recent reward prediction errors[19]. An increased learning rate could also reflect a heightened anticipation of environmental changes in BD[15]. Indeed, seminal modelling frameworks support that agents learn faster when anticipating more frequent transitions in the environment[20, 21]. However, the extent to which individuals with BD perceive their environment as more volatile and how this perception influences their belief updating and decision-making remains unexplored.

This study investigates the computational processes underlying alterations in decision-making in euthymic bipolar patients, compared to healthy participants, as they undertake a probabilistic reward-based learning task in a volatile environment. Our primary hypothesis is that individuals with euthymic BD overestimate environmental volatility, thereby increasing their learning about the statistical correlations within their environment. Such inflated volatility estimates could introduce 'noise' into the decision-making process[22], leading to incorrect decisions. Alternatively, the reported deficits in decision-making performance during euthymic BD[3, 4] could be explained by slower belief updating, aligning with recent empirical observations in valence-dependent learning[14].

To test these hypotheses, we employ the Hierarchical Gaussian Filter (HGF), a validated modelling framework based on Hierarchical Bayesian inference that describes individual learning dynamics in environments characterised by uncertainty and volatility[23–26]. We used the HGF to model how input about probabilistic reward outcomes and their change over time is integrated with prior beliefs during learning, resulting in posterior beliefs about the hidden states causing the observed outcomes[23, 24]. Belief updates in the HGF are driven by prediction errors (PE)—the discrepancy between predictions and outcomes—and are modulated by the precision weights, where precision is defined as the inverse variance of belief distributions. This computational framework has already proven useful for understanding psychiatric conditions[13, 27, 28], and could offer insights into how affective states and mood dynamically shape adaptive learning[29–31].

To gain a more mechanistic understanding of the processes underlying the hypothesised computational alterations in euthymic BD, we additionally investigated the neural correlates of hierarchical belief updating using magnetoencephalography (MEG). Existing research supports the role of cortical oscillations in maintaining predictions and encoding PEs[32–35]. Specific frequency rhythms such as alpha (8–12 Hz) and beta (13–30 Hz) oscillations have been associated with the transmission of top-down predictions, and encoding precision, while gamma-band activity (> 30 Hz) has been linked to the propagation of PEs and precision-weighted PEs, pwPEs[33, 35–37]. Importantly, disruptions in these rhythms are suggested to contribute to learning deficits observed in various psychiatric conditions, including anxiety, schizophrenia, and autism[37–39].

On a neural level, we hypothesised that biases in probabilistic reward-based learning in BD in a volatile setting can be reflected in alpha, beta, and gamma activity during the encoding of pwPE and precision. These alterations are expected to manifest in the orchestrated activity across decision-making brain areas, such as the prefrontal, anterior cingulate, and orbitofrontal cortex (PFC, ACC, OFC). These regions are involved in learning in volatile and uncertain settings[20, 37, 40, 41], and form part of the fronto-striatal reward circuit, which exhibits disturbed connectivity in BD[9, 10]. We therefore additionally hypothesised that changes in frequency-domain connectivity patterns between these regions during belief updating would occur in BD relative to healthy control participants.

Lastly, we aimed to determine whether the computations underlying decision-making deficits in euthymic BD influence the motivational aspects associated with the invigoration of movements. Evidence suggests that reward expectations can speed motor performance[42, 43]. The nigrostriatal dopamine pathway, central to the 'dopamine hypothesis' of BD[44, 45], is crucial for invigorating future movements[46]. Moreover, individuals with BD have been shown to experience increases in energy and effort following success, thus demonstrating enhanced motor vigour effects[4, 47]. Consequently, our final complementary hypothesis posits that the strength of predictions about reward contingencies in the environment will speed decision-related movements in euthymic BD more than in healthy individuals[48].

Participants

Participants included 22 bipolar patients (mean age: 29.1 years [SEM = 1.67], 17 females; Table 1), and 27 healthy participants (27.5 years [SEM = 1.18], 15 females). Bipolar participants were assessed by a consultant psychiatrist who confirmed the diagnosis of BD (I or II) using the structured clinical interview for the International Statistical Classification of Diseases and Related Health Problems (ICD-11)[49]. Patients included in the study were euthymic for at least 2.5 months before recruitment. Additional inclusion criteria were: most recent episode being depression, aged 18–50 years, absence of symptoms from other mental health conditions beyond BD, and no history of substance abuse. We assessed residual mood symptoms and cognitive performance using validated scales on mania, anxiety, depression and tasks on executive and general cognitive performance. See Table 1 for further details, and Supplemental Material for sample size estimates (minimum sample size estimated was 20 per group, to identify volatility effects at 0.8 power). The study was approved by the Institutional Review Board of National Research University Higher School of Economics and the Local Ethical Committee of the First Moscow State Medical University. All participants provided written informed consent.

Table 1

Group demographics and variables describing the measures of affective state, cognitive and executive function in euthymic bipolar disorder patients (BD, N = 22) and healthy control participants (HC, N = 27).
	Bipolar group (N = 22)	Healthy control group (N = 27)	Statistics (Permutation test p-value and Bayes factor)
Age (years), M (SEM)	29.1 (1.67)	27.5 (1.18)	P = 0.4921, BF₁₀ = 0.3472
Sex (females), n (%)	17 (77.3%)	15 (56%)	χ² = 1.6559, df = 1, P = 0.1849; BF₁₀ = 1.102
Duration of illness (years), M (SEM)	7.4 (1.04)	NA	NA
Last episode polarity, (%)	depression (100%)	NA	NA
Time in euthymia (months), M (SEM)	4.9 (0.68)	NA	NA
Bipolar II, n (%)	17 (77.3%)	NA	NA
Bipolar I, n (%)	5 (22.7%)	NA	NA
Antidepressant, n (%)	10 (45.5%)	NA	NA
Antipsychotic, n (%)	14 (63.6%)	NA	NA
Mood stabiliser, n (%)	16 (81.8%)	NA	NA
Summary: Taking psychiatric medication, n (%)	19 (86.4%)	NA	NA
HADS anxiety, M (SEM)	5.5 (0.94)	5 (0.64)	P_FDR = 0.64, BF₁₀ = 0.31
HADS depression, M (SEM)	4 (0.69)	3.30 (0.57)	P_FDR = 0.4003, BF₁₀ = 0.3824
Beck Depression Inventory, M (SEM)	8.3 (1.75)	5 (1)	P_FDR = 0.1038, BF₁₀ = 0.8668
Altman's Mania Scale, M (SEM)	1.7 (0.50)	2.6 (0.50)	P_FDR = 0.2040, BF₁₀ = 0.5652
State-Trait Anxiety Inventory (state subscale on MEG day)¹, M (SEM)	35.5 (1.72)	33.7 (1.90)	P_FDR = 0.4571, BF₁₀ = 0.3833
Trail Making Test (Part 1), M (SEM)	23.3 (1.21)	21.2 (1.22)	P_FDR = 0.2216, BF₁₀ = 0.5296
Trail Making Test (Part 2), M (SEM)	87.8 (8.5)	55.7 (5.6)	P_FDR = 0.0022*, BF₁₀ = 14.907
Wisconsin Card Sorting Test, M (SEM)	29.3 (0.92)	31.6 (0.67)	P_FDR = 0.0364, BF₁₀ = 1.6560
Mini-Mental State Examination, M (SEM)	29.5 (0.18)	29.5 (0.17)	P_FDR = 0.8678, BF₁₀ = 0.2875

For MEG analysis, data were available for 27 HCs and 21 BDs. Values are provided as mean and SEM (M, SEM) or as count and percentage (n, %). Between-group differences in scale values were assessed using permutation tests. Our alpha significance level was set at 0.05, and we controlled the false discovery rate (FDR) at q = 0.05 for multiple tests. Permutation test p-values were complemented with Bayes factor analysis to assess the amount of evidence in favour of H₁ or H₀. Age and sex distributions were comparable across groups (Age: P_FDR = 0.4921, no significant difference; BF₁₀ = 0.3472, providing anecdotal evidence for H₀; Sex: Chi-squared statistic, χ² = 1.6559, df = 1, P = 0.1849, not significant; BF₁₀ = 1.102, no evidence for H₀ or H₁). There was substantial to anecdotal evidence supporting no differences between BD and HC groups in anxiety, depression, mania, and general cognitive scores (Bayes factor, BF₁₀, in the range 1/3–1: anecdotal evidence; 1/10–1/3: substantial evidence). Between-group differences after FDR control were observed exclusively for Part 2 of the Trail Making Test (denoted by *), which assesses task switching, based on strong evidence. There was anecdotal evidence for differences in Wisconsin Card Sorting Test scores, which assess executive functions. However, this effect was not significant after FDR control. ¹State anxiety scores on the day of the experimental session were available for 22 bipolar patients and 18 healthy participants. Abbreviations: HADS, Hospital Anxiety and Depression Scale. The mood stabiliser taken by bipolar participants was mainly Lamotrigine (n = 11), but a few patients were taking Carbamazepine (n = 2), Valproic acid (n = 2), or Lithium (n = 1). See Supplementary Materials for further details on the scales and the corresponding references.
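The FDR control at q = 0.05 mentioned in the table note can be illustrated with a short sketch. The note does not name the procedure, so the Benjamini-Hochberg step-up rule is assumed here; the function name `benjamini_hochberg` and the example p-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Flag which tests survive FDR control at level q.

    Assumption: the Benjamini-Hochberg step-up procedure is used;
    the source only states that the FDR was controlled at q = 0.05.
    """
    m = len(p_values)
    # Indices of the p-values sorted in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest 1-based rank k such that p_(k) <= (k / m) * q.
    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            max_rank = rank
    # Every test ranked at or below max_rank is declared significant.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Illustrative (made-up) uncorrected p-values for four scales.
example_p = [0.001, 0.04, 0.03, 0.5]
survivors = benjamini_hochberg(example_p, q=0.05)
```

In this toy input only the first test survives, because the second-smallest p-value (0.03) exceeds its threshold of 0.025.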

Reward-based motor sequence learning task

Participants underwent an initial fine motor control assessment, then completed a validated motor-based decision-making paradigm[48] (Fig. 1A), which combines probabilistic binary reward-based learning within a volatile setting (reminiscent of reversal learning) with the execution of motor sequences to express decisions. Participants learned two sequences of four finger presses, followed by a 320-trial test phase. In each trial, they were required to choose and perform one of the sequences to potentially earn a reward (5 points; Fig. 1A). Reward probabilities for sequences were reciprocal (p, 1-p) and changed pseudorandomly (Fig. 1B). The aim was to infer the reward probability associated with each sequence ('action-outcome' contingencies henceforth) and adjust their choices considering changing contingencies. Accumulated points translated to monetary rewards. See timeline in Fig. 1C. The task, programmed in MATLAB using Psychtoolbox, recorded participants' keypress timings to evaluate reaction time (RT) and performance tempo (Fig. 1D). See Supplemental Material.

MEG recording and preprocessing

MEG was performed using a 306-channel system (Elekta Neuromag VectorView), with head movements tracked by a head position indicator with four coils. Concurrently, ECG and EOG were recorded for MEG artefact rejection. Recordings were sampled at 1000 Hz and band-pass filtered at 0.1–330 Hz. MEG preprocessing involved head movement correction, noise reduction, and channel selection using standard methods ([50]; Elekta Maxfilter software; Supplemental Material). MEG data were further processed using MNE-Python[51] (Python version 3.11.5) and custom Python scripts: they were low-pass filtered at 125 Hz, downsampled to 250 Hz, and notch filtered at 50 and 100 Hz. Independent component analysis (FastICA) removed eye and heart artefacts (3.3 ICs on average per participant).

General task performance

General task performance was assessed using the win rate (rewarded trials), average tempo (mean inter-key press interval in a sequence, mIKI), and RT (time from stimulus to first key press).
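As a minimal sketch of these three measures, the snippet below computes them from per-trial reward flags and keypress timestamps. The function names, the millisecond timestamps, and the example values are illustrative assumptions; the task code itself was written in MATLAB with Psychtoolbox.

```python
def win_rate(rewards):
    """Proportion of rewarded trials (rewards is a list of 0/1 flags)."""
    return sum(rewards) / len(rewards)


def mean_inter_key_interval(key_times_ms):
    """Mean interval between consecutive key presses in one sequence (mIKI)."""
    intervals = [b - a for a, b in zip(key_times_ms, key_times_ms[1:])]
    return sum(intervals) / len(intervals)


def reaction_time(stimulus_onset_ms, key_times_ms):
    """Time from stimulus onset to the first key press (RT)."""
    return key_times_ms[0] - stimulus_onset_ms


# Hypothetical trial: stimulus at 0 ms, four key presses of one sequence.
keys = [500.0, 800.0, 1150.0, 1500.0]
trial_rt = reaction_time(0.0, keys)          # 500 ms
trial_miki = mean_inter_key_interval(keys)   # mean of 300, 350, 350 ms
```

A four-press sequence yields three inter-key intervals, so the tempo measure for a trial is the mean of those three values.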

To assess practice effects in performance tempo and RT over trials as a function of the group, we employed Bayesian multilevel regression modelling (BML) with the logarithm of trial-wise RT or mIKI (in milliseconds) as the dependent variables. BML was performed with the brms R package[52, 53]. For each dependent variable, models of decreasing complexity were constructed (Table S1). Details on model comparison can be found in the Supplemental Material.

Modelling decision-making behaviour using hierarchical Gaussian filters

To assess probabilistic learning in our task, we used a validated hierarchical Bayesian model, the 3-level perceptual HGF for binary categorical inputs[23, 24] (Fig. 2a). This model describes how participants infer hidden states about the tendency of the action-outcome contingencies on trial k, x₂^(k) (level 2), and the rate of change in that tendency (log-volatility), x₃^(k). Level 1 represents the binary reward input. Gaussian belief distributions on levels 2 and 3 are represented by their posterior mean (µ₂^(k), µ₃^(k)) and posterior variance (uncertainty: σ₂, σ₃). Precision, π_i (i = 2, 3), is the inverse of this variance. Belief updating on each level i and trial k is driven by prediction errors and modulated by precision ratios, which weight the influence of precision or uncertainty in the current level and the level below. The resulting quantity is termed the precision-weighted PE (pwPE). For level 2, belief updating takes the simple form:

$$\Delta \mu_{2}^{(k)} = \mu_{2}^{(k)} - \mu_{2}^{(k-1)} = \sigma_{2}^{(k)} \delta_{1}^{(k)} \tag{1}$$

Thus, updating beliefs about the tendency of the action-outcome contingencies is proportional to the PE about action outcomes, δ₁^(k), weighted with the estimation uncertainty on that level, σ₂^(k). Here, pwPE is equal to σ₂^(k)δ₁^(k). See general equation, representing updates on level 3, in Supplementary Material, and [24].
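The level-2 update above can be sketched numerically. The snippet follows the standard binary HGF update equations as published by Mathys and colleagues[23, 24], with the level-3 belief held fixed for simplicity. It is not the TAPAS implementation used in the study, and the function name and default parameter values (`mu3_prev`, `kappa`, `omega2`) are illustrative assumptions.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def update_level2(mu2_prev, sigma2_prev, outcome,
                  mu3_prev=1.0, kappa=1.0, omega2=-4.0):
    """One level-2 belief update of a binary HGF (level 3 held fixed).

    Returns the new mean mu2, the new variance sigma2, and the pwPE.
    Default parameter values are hypothetical, not fitted estimates.
    """
    # Prediction of the outcome probability before seeing the outcome.
    mu1_hat = sigmoid(mu2_prev)
    # Prediction error about the binary outcome.
    delta1 = outcome - mu1_hat
    # Predicted precision: prior variance inflated by expected volatility.
    pi2_hat = 1.0 / (sigma2_prev + math.exp(kappa * mu3_prev + omega2))
    # Posterior precision and variance (estimation uncertainty).
    pi2 = pi2_hat + mu1_hat * (1.0 - mu1_hat)
    sigma2 = 1.0 / pi2
    # Precision-weighted prediction error and resulting belief update.
    pwpe = sigma2 * delta1
    return mu2_prev + pwpe, sigma2, pwpe


# A rewarded outcome shifts the belief upwards; the step is larger
# when tonic volatility omega2 is higher.
step_high = update_level2(0.0, 1.0, 1, omega2=-2.0)[2]
step_low = update_level2(0.0, 1.0, 1, omega2=-6.0)[2]
```

With all else equal, a lower ω₂ shrinks σ₂ and therefore the size of each update, which is the sense in which lower tonic volatility implies slower adjustment of beliefs.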

States x₂, x₃ evolve as random Gaussian walks, with volatility states x₃ directly influencing the time evolution of x₂ through its variance (conditional on past values):

$$x_{2}^{(k)} \sim \mathcal{N}\left(x_{2}^{(k-1)},\, f_{2}\left(x_{3}\right)\right) \tag{2}$$

with (dropping k for simplicity)

$$f_{2}\left(x_{3}\right) \stackrel{\scriptscriptstyle\text{def}}{=} \exp\left(\kappa x_{3} + \omega_{2}\right) \tag{3}$$

In (3), ω₂ represents the invariant (tonic) portion of the log conditional variance of x₂, while κ is a coupling constant that modulates the influence of phasic volatility, x₃, on the size of action-outcome belief updates. The step size on level 3 is modulated by ω₃, a high-level tonic volatility parameter. See Supplemental Material for modelling details.
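To make the generative structure concrete, the sketch below simulates the coupled random walks and the resulting binary outcomes. It assumes that the step variance on level 3 is exp(ω₃) and that x₃ on the current trial enters the variance of x₂; the parameter values, seed, and function name are hypothetical and do not correspond to the priors in Table S2.

```python
import math
import random


def simulate_task(n_trials=320, kappa=1.0, omega2=-4.0, omega3=-6.0, seed=0):
    """Simulate hidden states x2, x3 and binary outcomes of a 3-level HGF.

    Assumptions: level-3 step variance is exp(omega3); the x2 step
    variance is exp(kappa * x3 + omega2); outcomes are Bernoulli draws
    from sigmoid(x2). Parameter values are illustrative only.
    """
    rng = random.Random(seed)
    x2, x3 = 0.0, 0.0
    x2_path, x3_path, outcomes = [], [], []
    for _ in range(n_trials):
        # Log-volatility performs a Gaussian random walk.
        x3 = rng.gauss(x3, math.sqrt(math.exp(omega3)))
        # The tendency walks with a variance set by the current volatility.
        x2 = rng.gauss(x2, math.sqrt(math.exp(kappa * x3 + omega2)))
        # Binary outcome drawn from the sigmoid-transformed tendency.
        p_reward = 1.0 / (1.0 + math.exp(-x2))
        outcomes.append(1 if rng.random() < p_reward else 0)
        x2_path.append(x2)
        x3_path.append(x3)
    return x2_path, x3_path, outcomes


x2_path, x3_path, outcomes = simulate_task()
```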

To assess how beliefs mapped to decisions, we coupled this perceptual model to response models previously used in similar tasks[24, 37, 38]. First, we considered a unit-square sigmoid response model where choice probability is shaped by a free fixed (time-invariant) parameter ζ, interpreted as inverse decision noise: the sigmoid approaches a step function as ζ tends to infinity. This constituted our model M1. Model M2 was similar but employed a two-level HGF with constant volatility. M3 combined the 3-level HGF with a response model where the sigmoid function depends on the trial-wise prediction of log-volatility, $\zeta = e^{-\mu_{3}^{(k-1)}}$[22] (Fig. 2a). In this model, higher estimates of volatility lead to a more stochastic mapping from beliefs to decisions. As a result, there is an increased likelihood of choosing responses that deviate from predictions, consistent with increased exploration (exploring whether the contingency has changed). In models M1 and M3, parameters ω₂ and ω₃ were free; ω₂ was also free in M2. Additionally, ζ was free in M1 and M2, while initial values µ₃⁽⁰⁾ and σ₃⁽⁰⁾ were free in M3. A fourth model, M4, was constructed similarly to M3 but replaced the free parameter ω₂ with κ[28].
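The two response models can be illustrated with the usual form of the unit-square sigmoid, p = m^ζ / (m^ζ + (1 − m)^ζ), where m is the predicted probability that the chosen option is rewarded. This is a minimal sketch under that assumed form; the function names are hypothetical and the code is not taken from the TAPAS toolbox.

```python
import math


def unit_square_sigmoid(pred, zeta):
    """Choice probability for a prediction pred in (0, 1).

    zeta acts as inverse decision noise: large values approach a step
    function, values near zero approach random choice.
    """
    return pred ** zeta / (pred ** zeta + (1.0 - pred) ** zeta)


def choice_prob_volatility(pred, mu3_prev):
    """M3-style mapping: zeta = exp(-mu3) from the previous trial."""
    return unit_square_sigmoid(pred, math.exp(-mu3_prev))


# Same prediction (0.8), different expected log-volatility.
p_low_vol = choice_prob_volatility(0.8, -2.0)   # deterministic-like choice
p_high_vol = choice_prob_volatility(0.8, 2.0)   # closer to chance
```

Holding the prediction fixed, a higher expected log-volatility pulls the choice probability towards 0.5, which is the "more stochastic mapping" described for M3.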

Models were fit to individual behavioural data (series of chosen and rewarded sequences: responses and observed outcomes), using prior values described in Table S2. Log model evidence was used for model comparison using random-effects Bayesian model selection (Supplemental Materials). As in previous work, simulations were conducted to quantify how well free model parameters could be estimated[27, 37]. Relevant belief and uncertainty trajectories were used subsequently for our MEG analysis (Fig. 2b; see Results). The models were implemented as part of the TAPAS toolbox[54]. We used the HGF release v7.1 in MATLAB R2020b, and the function 'tapas_ehgf_binary'.

Assessing motor invigoration

We additionally investigated whether trial-by-trial predictions about the probabilistic action-outcome mapping, ${\stackrel{\prime }{\mu }}_{2}^{\left(k\right)}$, differentially influenced motor performance in our groups. Following Tecilla et al.[48], given that the sign of ${\stackrel{\prime }{\mu }}_{2}^{\left(k\right)}$ is arbitrary, we used its absolute value, $\left|{\stackrel{\prime }{\mu }}_{2}^{\left(k\right)}\right|$, representing the strength of those predictions, as the predictor in Bayesian multilevel modelling, and assessed its association with performance variables (log-mIKI, log-RT). We hypothesised a negative association, such that stronger expectations about reward contingencies speed performance. We also hypothesised a greater sensitivity to these predictions (steeper slope) in BD compared to HC. Details of the models are in Table S3.
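The logic of this analysis can be illustrated with a deliberately simplified stand-in: an ordinary least-squares slope of log-mIKI on the unsigned prediction for a single participant. This is not the Bayesian multilevel model fitted with brms in the study; the function names and the synthetic data are assumptions made only for illustration.

```python
import math


def ols_slope(x, y):
    """Ordinary least-squares slope of y regressed on x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def vigour_slope(predictions, miki_ms):
    """Slope of log-mIKI on prediction strength |prediction|.

    A negative slope means stronger predictions go with faster tempo.
    """
    strength = [abs(p) for p in predictions]
    log_miki = [math.log(v) for v in miki_ms]
    return ols_slope(strength, log_miki)


# Synthetic example: tempo speeds up by 10% (in log units) per unit strength.
preds = [-2.0, -1.0, 0.5, 1.5, 3.0]
miki = [math.exp(6.0 - 0.1 * abs(p)) for p in preds]
slope = vigour_slope(preds, miki)
```

Taking the absolute value first means that predictions of equal strength but opposite sign contribute identically, matching the rationale that the sign is arbitrary.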

Source reconstruction of MEG signals

We reconstructed MEG signals using Linearly Constrained Minimum Variance beamforming (LCMV[55]) in MNE-Python, with cortical divisions derived from individual T1-MRI images using Freesurfer 6.0[56, 57] (http://surfer.nmr.mgh.harvard.edu/). We aligned MRI and MEG coordinate systems, selected the Desikan-Killiany–Tourville atlas for cortical parcellation (DKT[58]), and performed forward modelling with boundary element models[37].

We focused on the alpha and beta frequency bands, band-pass filtering signals between 1–40 Hz before LCMV beamforming. Theta-band activity was also examined given its robust association with feedback processing[34, 59], relevant for win/lose outcomes in our task. Gamma frequency analysis followed a similar process (30–124 Hz band-pass filter). Time courses were extracted for regions of interest (ROIs) associated with decision-making under uncertainty and reward processing[35, 60–65], and linked with impairments in fronto-striatal reward circuitry in BD[5, 10, 66, 67]. These included the (1) ACC, (2) OFC, including the ventromedial PFC (vmPFC), (3) dorsomedial PFC (dmPFC), and (4) dorsolateral PFC (dlPFC). We also included the (5) primary motor cortex (M1) and (6) premotor cortex (PMC), to assess motor activity during decision-making[68].

Our study's ROIs comprised 16 bilateral labels in eight areas from the DKT atlas: (1) rostral and caudal ACC, (2) lateral and medial OFC (including vmPFC), (3) superior frontal gyrus (dmPFC, and supplementary motor area, SMA), (4) rostral middle frontal gyrus (rMFG), (5) precentral gyrus (M1), and (6) caudal MFG. Time series extraction utilised the PCA flip method in MNE-Python. Although the 'flip' operator was not relevant for our time-frequency analysis, it was essential for preparing the source-reconstructed time series for subsequent connectivity analysis. See anatomical label references in Supplemental Material.

Convolution modelling of time-frequency responses during outcome processing

We used a validated convolution-modelling approach to analyse frequency-domain amplitude changes related to belief updating and uncertainty following outcome presentation[36, 38, 69]. Building on previous work[37], this frequency-domain general linear model (GLM) included as parametric regressors the unsigned pwPE updating beliefs on level 2 (representing precision-weighted Bayesian surprise; the absolute value is preferred for the binary HGF where sign on level 2 is arbitrary[70, 71]), and uncertainty measures (σ₂, σ₃). It also included discrete regressors for win/lose outcomes and error trials. To avoid regressor collinearity and potential GLM misspecification, we excluded the level 3 pwPE[37, 38], due to its high linear correlation with the unsigned pwPE on level 2 (Supplementary Materials).

The GLM was applied to concatenated epochs of source-reconstructed data in our ROIs, using Morlet wavelets for time-frequency (TF) analysis in the 4–100 Hz range and within −0.5 to 1.8 s (Figure S1). We hypothesised between-group differences in gamma and concomitant alpha/beta activity during pwPE processing, and in alpha/beta activity during uncertainty encoding. Theta (4–7 Hz) activity was assessed for the win/lose regressors[34, 59].
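For readers unfamiliar with wavelet-based TF analysis, the following sketch shows how a complex Morlet wavelet yields an amplitude estimate at one frequency. It is a generic illustration, not the SPM12 pipeline used here: the number of cycles (5), the truncation at ±3 standard deviations, and the normalisation are assumptions, as are the function names.

```python
import cmath
import math


def morlet_wavelet(freq, sfreq, n_cycles=5.0):
    """Complex Morlet wavelet at freq (Hz), sampled at sfreq (Hz).

    Assumptions: Gaussian width of n_cycles / (2 * pi * freq) seconds,
    truncated at +/- 3 SD, normalised so the envelope sums to one.
    """
    sigma_t = n_cycles / (2.0 * math.pi * freq)
    half = int(math.ceil(3.0 * sigma_t * sfreq))
    times = [i / sfreq for i in range(-half, half + 1)]
    wavelet = [cmath.exp(2j * math.pi * freq * t)
               * math.exp(-t ** 2 / (2.0 * sigma_t ** 2)) for t in times]
    norm = sum(abs(w) for w in wavelet)
    return [w / norm for w in wavelet]


def amplitude_envelope(signal, freq, sfreq, n_cycles=5.0):
    """Amplitude over time at freq via convolution with a Morlet wavelet."""
    wavelet = morlet_wavelet(freq, sfreq, n_cycles)
    half = len(wavelet) // 2
    n = len(signal)
    envelope = []
    for t in range(n):
        acc = 0j
        for j, w in enumerate(wavelet):
            idx = t + half - j  # convolution index, zero-padded at the edges
            if 0 <= idx < n:
                acc += w * signal[idx]
        envelope.append(abs(acc))
    return envelope


# A 2 s, 10 Hz sine sampled at 250 Hz (the downsampled rate of the MEG data).
sfreq = 250.0
sine = [math.sin(2.0 * math.pi * 10.0 * n / sfreq) for n in range(500)]
amp_alpha = amplitude_envelope(sine, 10.0, sfreq)
amp_gamma = amplitude_envelope(sine, 40.0, sfreq)
```

The 10 Hz wavelet responds strongly to the 10 Hz sine while the 40 Hz wavelet barely responds, which is the frequency selectivity that the TF decomposition relies on.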

We conducted this analysis using SPM12 software (http://www.fil.ion.ucl.ac.uk/spm/), adapting original code by Spitzer et al.[72], as used in Hein et al.[37], with additional details available in the Supplemental Materials.

Frequency-resolved functional connectivity

To analyse directed functional connectivity between frequency-resolved activity in our ROIs, we employed time-reversed Granger causality (TRGC[73]; Supplemental Materials) as a robust metric for directed information flow[74]. Following the recommendations of Pellegrini et al.[74], we applied TRGC in the frequency domain to LCMV-based source-reconstructed time series from our 16 ROIs after the PCA flip transformation.

Our analysis focused on between-group differences in the directionality of information flow within the 8–30 Hz range during the 0.5–1 s interval of outcome processing for trials with large unsigned pwPEs updating beliefs at level 2. We employed a median split of unsigned pwPE values, yielding approximately 160 high-|pwPE| trials per participant. This frequency range was selected based on evidence that beta-band functional connectivity from the PFC effectively differentiates levels of predictability, exhibiting reduced values during unpredictable trials[34]. By examining TRGC in trials with high unsigned pwPE values, we anticipated a general decrease in beta-band TRGC in HC, in parallel with alpha/beta amplitude suppression during belief updating. We hypothesised that this pattern would be disrupted in BD. The TRGC analysis was conducted using the ROIconnect plugin for EEGLAB[74], adapted for our MNE-Python LCMV outputs. See Supplemental Materials.
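The trial selection step can be sketched as follows. The rule for values exactly equal to the median is not stated in the text, so the strict inequality used here is an assumption, and the function name and synthetic trajectory are hypothetical.

```python
import statistics


def high_pwpe_trials(pwpe):
    """Indices of trials whose unsigned pwPE exceeds the median.

    Assumption: trials exactly at the median are excluded.
    """
    unsigned = [abs(v) for v in pwpe]
    median = statistics.median(unsigned)
    return [i for i, v in enumerate(unsigned) if v > median]


# Synthetic pwPE trajectory over 320 trials with alternating sign.
pwpe = [(-1) ** i * (i + 1) / 100.0 for i in range(320)]
selected = high_pwpe_trials(pwpe)
```

With 320 trials and no ties, the split returns 160 indices, in line with the approximate count reported per participant.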

Statistical analysis

Between-group analyses of behavioural, computational, and TRGC-derived variables used independent-sample permutation tests (5000 permutations) in MATLAB®. Within-subject analyses used paired permutation tests. We maintained an alpha significance level at 0.05 and controlled false discovery rates (FDR) at q = 0.05 for multiple tests. Non-parametric effect sizes are reported as probability of superiority[75, 76] (Δ). Non-significant effects were further evaluated using Bayes Factors (BF₁₀), interpreted following Wetzels and Wagenmakers[77].
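A minimal sketch of an independent-sample permutation test and of the probability of superiority is given below. The analyses in the study were run in MATLAB; here the test statistic (absolute difference in group means), the two-sided form, and the +1 correction of the p-value are assumptions, and all names and data are illustrative.

```python
import random


def permutation_test(group_a, group_b, n_perm=5000, seed=0):
    """Two-sided permutation p-value for a difference in group means."""
    rng = random.Random(seed)
    n_a = len(group_a)
    observed = abs(sum(group_a) / n_a - sum(group_b) / len(group_b))
    pooled = list(group_a) + list(group_b)
    count = 0
    for _ in range(n_perm):
        # Randomly reassign group labels by shuffling the pooled sample.
        rng.shuffle(pooled)
        perm_a, perm_b = pooled[:n_a], pooled[n_a:]
        stat = abs(sum(perm_a) / len(perm_a) - sum(perm_b) / len(perm_b))
        if stat >= observed:
            count += 1
    # The +1 terms keep the estimated p-value strictly above zero.
    return (count + 1) / (n_perm + 1)


def probability_of_superiority(group_a, group_b):
    """P(a > b) + 0.5 * P(a == b) over all cross-group pairs."""
    score = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                score += 1.0
            elif a == b:
                score += 0.5
    return score / (len(group_a) * len(group_b))


# Made-up data with complete separation between groups.
high = [10, 11, 12, 13, 14, 15]
low = [1, 2, 3, 4, 5, 6]
p_value = permutation_test(high, low)
delta = probability_of_superiority(high, low)
```

A value of 0.5 for the probability of superiority indicates no tendency for one group to exceed the other, and 1.0 indicates complete separation.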

Statistical analysis of source-level time-frequency images used cluster-based permutation testing in the FieldTrip Toolbox[78, 79] (1000 permutations). We averaged TF activity across frequency bins within each band (theta, alpha, beta; 60–100 Hz for gamma[37]). Temporal intervals of interest for statistical analyses were selected based on previous research[37, 38, 59]: 0.5–1.8 s for parametric regressors, 0.2–1 s for win/lose regressors. We controlled the family-wise error rate (FWER) at 0.025. See Supplemental Materials.

Demographics

BF analysis provided anecdotal evidence for a balanced distribution of age and sex across the groups. Furthermore, substantial to anecdotal evidence indicated similar scores in mania, anxiety, depression, and general cognitive functioning between groups. Significant differences were observed exclusively in executive functioning, with the BD group demonstrating lower performance. See Table 1.

Altered reward-based decision dynamics in bipolar disorder during euthymia

Euthymic BD patients exhibited lower win rates compared to HC individuals (P_FDR = 0.0014; Δ = 0.79, CI = [0.60, 0.90]; Fig. 2c). They also demonstrated lower win-stay rates (P_FDR = 0.0194; Δ = 0.71, CI = [0.55, 0.85]; Fig. 2d). This indicates that, after securing a win on a trial, BD individuals were less likely to repeat the sequence compared to HC. Their tendency to switch sequences after a loss was similar to that of HC, based on anecdotal evidence (lose-shift rate: P = 0.0966; BF₁₀ = 0.8905; Fig. 2d), despite an overall increased total switch rate in BD relative to HC (see details in Supplemental Materials, including evidence for similar performance error rates).
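Win-stay and lose-shift rates can be computed from the sequence of choices and outcomes as in the sketch below. The exact definitions used in the study are given in the Supplemental Materials; the conventional definitions are assumed here, and the function name and example data are hypothetical.

```python
def win_stay_lose_shift(choices, rewards):
    """Return (win-stay rate, lose-shift rate) for one participant.

    choices: chosen sequence per trial (e.g. 0 or 1).
    rewards: 1 if the trial was rewarded, 0 otherwise.
    A rate is None when there is no qualifying previous trial.
    """
    win_stay = win_total = lose_shift = lose_total = 0
    for k in range(1, len(choices)):
        stayed = choices[k] == choices[k - 1]
        if rewards[k - 1]:
            # Previous trial was a win: did the participant repeat the choice?
            win_total += 1
            win_stay += int(stayed)
        else:
            # Previous trial was a loss: did the participant switch?
            lose_total += 1
            lose_shift += int(not stayed)
    ws_rate = win_stay / win_total if win_total else None
    ls_rate = lose_shift / lose_total if lose_total else None
    return ws_rate, ls_rate


# Hypothetical five-trial example.
ws, ls = win_stay_lose_shift([0, 1, 1, 1, 0], [1, 0, 1, 0, 1])
```

In this toy sequence the participant repeats the choice after one of two wins and switches after one of two losses, giving rates of 0.5 for both measures.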

To test our computational hypotheses, we used the HGF framework[24]. Bayesian model selection identified as the best model overall, and for each group separately, a three-level HGF with a response model in which the decisions depend on dynamic trial-by-trial expectations of log-volatility, µ₃^(k−1), and with ω₂, ω₃, µ₃⁽⁰⁾, and σ₃⁽⁰⁾ as free model parameters (M3; Table S4). Simulation analyses confirmed good parameter recovery (Figure S2).

Using this model, we found that BD participants had higher expectations of log-volatility initially and on average (µ₃⁽⁰⁾: P_FDR = 0.0142; Δ = 0.70, CI = [0.55, 0.85]; trial-average µ₃: P_FDR = 0.0428; Δ = 0.66, CI = [0.52, 0.82]; Fig. 2ef). This suggests increased stochasticity in their responses, as also indicated by a correlation between log-volatility µ₃ and the response switch rate (Figure S3). Additionally, BD participants exhibited lower tonic volatility, ω₂, compared to HCs (P_FDR = 0.0174; Δ = 0.67, CI = [0.52, 0.82]; Fig. 2g), suggesting a slower adjustment of beliefs about action-outcome contingencies (see simulation analysis in Figure S4). No significant between-group differences were found in ω₃.

We additionally assessed the association between residual symptoms in BD participants and relevant HGF variables. Prior work suggests a positive correlation between volatility and trait anxiety[37]. Accordingly, we analysed the relationship between trait anxiety levels in BD and µ₃, confirming a significant positive correlation (Spearman's rank correlation ρ = 0.46, 95% confidence interval [0.04, 0.75], P_FDR = 0.030). For mania scores, we hypothesised a correlation with the precision weights term, σ₂, which scales the influence of prediction errors on belief updates about action-outcome contingencies (Eq. 1). The rationale was that higher mania levels in BD might be associated with an enhanced reactivity to PEs[19], leading to faster belief updating driven by σ₂. Linear regression analyses revealed a negative association between mania and σ₂ (ρ = -0.46 [-0.75, -0.02], P_FDR = 0.037). Conversely, we considered that depression scores might be associated with attenuated reward-based belief updating (lower σ₂), yet found a lack of association between these variables (ρ = 0.04 [-0.34, 0.41], P = 0.836; BF₁₀ = 0.464, anecdotal evidence). See Figure S5 and Supplemental Materials.
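Spearman's rank correlation, as reported for the trait anxiety analysis, is the Pearson correlation of the ranked data. The sketch below uses average ranks for ties; the function names and example values are illustrative, and the confidence intervals and FDR correction reported above are not reproduced.

```python
import math


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mean_x = sum(rx) / len(rx)
    mean_y = sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Made-up scores: any monotonic increasing relation gives rho = 1.
rho = spearman_rho([1, 2, 3, 4], [10, 20, 25, 100])
```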

Similar timing of actions during motor decision-making in euthymic bipolar and healthy participants

Bipolar participants, although slower in baseline motor performance (Supplemental Materials), matched HC in tempo during the main task (mIKI: HC 339 [16.6] ms, BD 354 [18.2] ms, P = 0.6223; BF₁₀ = 0.33, substantial evidence against H₁). RTs were also comparable between groups (meanRT HC 501 [20.1] ms, BD 553 [32.2] ms, P = 0.2262; BF₁₀ = 0.55).

Additionally, both groups improved in tempo and RT over trials, as expected, but BD participants exhibited faster improvements in tempo (Fig. 3a-c). Bayesian multilevel modelling confirmed this (Table S5; Figure S6). The best model demonstrated a negative slope in practice effects for both timing variables (negative posterior point estimate, with the 95% credible interval [CI] not including zero, denoting a credible non-zero Bayesian estimate; Fig. 3ab; 3d-f). The slope of the practice effect on tempo was steeper in BD (95% CI with negative slope difference; Fig. 3c). See Supplemental Materials.

Expectation about reward probability invigorates motor performance similarly in euthymic bipolar and healthy participants

Further analysis revealed that expectations about the tendency of the action-outcome probability similarly influenced performance tempo in BD and HC groups. BML identified as the best model one in which performance tempo (log-mIKI) was modulated by variations in the predictor $\left|{\stackrel{\prime }{\mu }}_{2}^{\left(k\right)}\right|$, without necessitating the inclusion of group differences (Supplemental Materials; Table S6). This model demonstrated that stronger predictions led to faster performance across all participants, irrespective of group, confirmed by the negative slope of this association (95% CI not including zero; Fig. 3gi; Figure S7). Conversely, log-RT was not modulated by the strength of predictions or other variables (Table S6).

Attenuated neural representation of precision-weighted prediction errors updating beliefs about the action-outcome contingencies in bipolar disorder

During the processing of unsigned pwPEs about the tendency of action-outcome contingencies, HC and BD participants exhibited suppression of 8–30 Hz activity across prefrontal, orbitofrontal, cingulate, and motor regions (negative cluster within 0.5–0.9 s, post relative to pre-outcome baseline, P_FWER = 0.001, 0.024 in each group; Fig. 4ab). This suppression effect was less widespread in BD, and the between-group difference was significant across the caudal and rostral ACC, MFG, and OFC; as well as in the SFG and M1 (BD − HC positive cluster at 0.6–0.9 s, P_FWER = 0.0130; Fig. 4cd; Figure S8). Alongside these 8–30 Hz effects, the BD group exhibited significantly attenuated high gamma activity (60–100 Hz) compared to HC (negative cluster, P_FWER = 0.0090; Fig. 4e). The latency of the gamma effect coincided with the timing of the alpha-beta modulations, spanning 0.5–0.82 seconds, and overlapping within the aforementioned ROIs. No significant within-subject changes in gamma activity to the unsigned pwPE regressor were observed in either group (Supplemental Material).

In addition, for the uncertainty regressors σ₂ and σ₃, despite a significant widespread increase in 8–30 Hz activity to estimation uncertainty σ₂ in HC, no significant between-group differences were observed (Figure S9). Regarding theta modulation by win and lose events, no significant differences were observed between groups either. However, as expected, both groups showed significant increases in theta activity from baseline in the ACC, extending to prefrontal and orbitofrontal ROIs (Figure S10).

In a post-hoc analysis, we investigated alpha and beta raw power during inter-trial intervals. This aimed to determine whether BD's reduced suppression in the 8–30 Hz range to the pwPE regressor indicated a limited dynamic range of activity at these frequencies. Significantly lower power was observed in BD compared to HC, yet exclusively at 13–20 Hz. This effect emerged in most of the ROIs where the pwPE effect was expressed (Figure S11; Supplemental Material).

Frequency-domain functional connectivity patterns during unsigned pwPE processing

We next assessed group differences in the directionality of information flow during outcome processing for trials with large unsigned pwPEs updating beliefs at level 2. The BD cohort exhibited significantly larger TRGC coefficients than HC participants from the cACC to the rMFG and rACC, as well as from the SFG to the cMFG, in the beta frequency range (P_FDR = 0.0032, 0.0064, 0.0064, respectively; Fig. 5). The effect from the cACC to the rMFG extended to the alpha range (Fig. 5a,c). These findings indicate stronger evidence for statistical dependencies between sources in the identified directions for BD than HC in the beta (alpha) range. Importantly, these between-group effects were not attributable to differences in signal-to-noise ratio (Figure S12).
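
The logic of time-reversed Granger causality can be illustrated with a deliberately simplified sketch. It assumes a time-domain, lag-1 autoregressive model fitted by ordinary least squares to two simulated signals, whereas the analysis reported here is frequency-resolved and applied to source-reconstructed MEG data. All function names and the simulated signals are hypothetical. The idea is that a genuine directed influence should reverse its net direction when both time series are reversed in time, so the net Granger score of the original data is contrasted with that of the time-reversed data.

```python
import math
import random

def centered(v):
    """Subtract the mean so that models without an intercept are appropriate."""
    m = sum(v) / len(v)
    return [a - m for a in v]

def resid_var(y, cols):
    """Residual variance of an OLS fit of y on one or two predictor columns."""
    if len(cols) == 1:
        (p,) = cols
        beta = [sum(a * b for a, b in zip(p, y)) / sum(a * a for a in p)]
    else:
        p, q = cols
        spp = sum(a * a for a in p)
        sqq = sum(a * a for a in q)
        spq = sum(a * b for a, b in zip(p, q))
        spy = sum(a * b for a, b in zip(p, y))
        sqy = sum(a * b for a, b in zip(q, y))
        det = spp * sqq - spq * spq  # 2x2 normal equations solved by Cramer's rule
        beta = [(spy * sqq - sqy * spq) / det, (sqy * spp - spy * spq) / det]
    resid = [yi - sum(b * c[i] for b, c in zip(beta, cols)) for i, yi in enumerate(y)]
    return sum(r * r for r in resid) / len(y)

def granger(source, target):
    """Lag-1 Granger causality source -> target (log ratio of residual variances)."""
    past_t, past_s, now_t = target[:-1], source[:-1], target[1:]
    restricted = resid_var(now_t, [past_t])       # target's own past only
    full = resid_var(now_t, [past_t, past_s])     # plus the source's past
    return math.log(restricted / full)

def net_gc(x, y):
    """Net flow x -> y: positive when x predicts y better than y predicts x."""
    x, y = centered(x), centered(y)
    return granger(x, y) - granger(y, x)

def trgc(x, y):
    """Difference between the net flow of the original and time-reversed data."""
    return net_gc(x, y) - net_gc(x[::-1], y[::-1])

def simulate(n=5000, seed=0):
    """Toy system in which x drives y with a one-sample delay (hypothetical)."""
    rng = random.Random(seed)
    x, y = [0.0], [0.0]
    for _ in range(n - 1):
        x_new = 0.5 * x[-1] + rng.gauss(0, 1)
        y_new = 0.5 * y[-1] + 0.6 * x[-1] + rng.gauss(0, 1)
        x.append(x_new)
        y.append(y_new)
    return x, y

x, y = simulate()
print(net_gc(x, y), net_gc(x[::-1], y[::-1]), trgc(x, y))
```

For this toy system the net flow is positive for the original data and negative for the time-reversed data, so the contrast is clearly positive. A spurious asymmetry caused by, for example, unequal noise levels would not be expected to flip in this way, which is the motivation for the time-reversal step.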

Discussion

Completing our reward-based motor decision-making task, euthymic BD patients demonstrated lower win rates and a decreased tendency to repeat rewarded actions than HC, despite similar post-loss decision-making behaviour. Furthermore, employing the HGF to probe the computational processes underpinning decision-making, we found that BD participants expected more environmental volatility than HC, leading to a more stochastic mapping from beliefs to actions and higher switch rates, particularly after wins. These findings align with previous reports of heightened risk-taking and inconsistent behaviour in BD[80, 81], mirroring elevated win-switch tendencies in BD adolescents[82] and deficits in response reversal during remission[3, 4]. This suggests that decisions in euthymic BD are misaligned with their beliefs about recent successes, favouring suboptimal actions due to an overestimation of environmental changes, potentially overriding the influence of their beliefs about action-outcome contingencies on decisions.
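
How a higher volatility estimate can make choices more stochastic may be easier to see in a small sketch. It assumes a unit-square sigmoid response model whose inverse decision temperature is exp(−µ₃), which is one of the response models commonly paired with the HGF; this section does not restate the response model, so the exact form is an assumption, and the function name is hypothetical.

```python
import math

def p_choose_option(m, mu3):
    """Probability of choosing the option whose predicted reward probability is m.

    Assumed response model: unit-square sigmoid with inverse temperature
    zeta = exp(-mu3). A higher log-volatility estimate mu3 lowers zeta and
    flattens the mapping from belief to choice towards 0.5.
    """
    zeta = math.exp(-mu3)
    return m ** zeta / (m ** zeta + (1.0 - m) ** zeta)

# Same belief (m = 0.8) under increasing volatility estimates
for mu3 in (0.0, 1.0, 2.0):
    print(mu3, round(p_choose_option(0.8, mu3), 3))
```

Under this assumed model, an agent who believes an action is rewarded with probability 0.8 chooses it with probability 0.8 when µ₃ = 0, and with a probability progressively closer to 0.5 as µ₃ grows, which corresponds to more frequent switching even after wins.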

Despite expecting increased volatility, BD patients were slower to adjust their expectations of action-outcome contingencies compared to HC, with a lower tonic volatility parameter ω₂ indicating slower adaptation. Similar results in HGF modelling for paranoia[27] suggest that this propensity to anticipate change without learning from it appropriately may be a common feature across paranoia and bipolar disorder. Additionally, despite similar residual symptom levels in BD and HC, trait anxiety in BD correlated with volatility estimates, aligning with findings that high trait anxiety exacerbates difficulties in adapting to environmental changes[37, 83].
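
The role of ω₂ can be made concrete with a minimal sketch of the level-2 update in a binary HGF, following the standard update equations of the filter[23, 24]. As a simplifying assumption, the log-volatility estimate µ₃ is held fixed rather than updated, and the coupling κ is set to 1. Function and argument names are hypothetical.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def update_level2(mu2, sigma2, outcome, omega2, mu3=0.0, kappa=1.0):
    """One trial of the level-2 update of a binary HGF, with level 3 held fixed.

    mu2, sigma2: previous posterior mean and variance of the belief (logit space)
    outcome: 1 if the tracked action was rewarded, else 0
    omega2: tonic log-volatility; mu3: log-volatility estimate (constant here)
    """
    mu1_hat = sigmoid(mu2)                                # predicted reward probability
    sigma2_hat = sigma2 + math.exp(kappa * mu3 + omega2)  # predictive variance
    pi2 = 1.0 / sigma2_hat + mu1_hat * (1.0 - mu1_hat)    # posterior precision
    sigma2_new = 1.0 / pi2                                # weight applied to the PE
    delta1 = outcome - mu1_hat                            # prediction error
    mu2_new = mu2 + sigma2_new * delta1                   # precision-weighted update
    return mu2_new, sigma2_new

# The same rewarded outcome processed with a higher and a lower tonic volatility
print(update_level2(0.0, 0.1, 1, omega2=-1.0))
print(update_level2(0.0, 0.1, 1, omega2=-4.0))
```

In this sketch, a lower ω₂ yields a smaller σ₂ and therefore a smaller belief shift for the same prediction error. Because ω₂ and κµ₃ enter the same exponential term, a lower ω₂ can offset a higher volatility estimate, which is one way to read the combination of expecting change while updating slowly.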

Of particular relevance in BD, we observed that residual mania symptoms negatively correlated with estimation uncertainty, σ₂, which scales the influence of PEs about action-outcomes on level-2 belief updating. Therefore, those with higher mania scores struggled more with updating these predictions. Given that early relapse in BD has been associated with a reduction in an empirical measure of belief updating in response to positive feedback[14], future work could investigate if computational metrics of belief updating like σ₂ enhance prediction of clinical progression over behavioural indicators. Further investigations should also explore the effect of comorbid anxiety on volatility responses and relapse.

Despite deficits in decision-making and baseline executive function in our BD sample, motor performance invigoration was comparable to HC, indicating preserved motivational drive in euthymic BD. This contrasts with previous findings that rewards and success amplify energy and effort in BD[4, 47]. Our Bayesian analyses revealed a similar sensitivity of performance tempo to expectations about reward contingencies in both groups, highlighting that the alterations in euthymic BD were confined to decision-making processes.

On a neural level, convolution modelling on source-reconstructed time-frequency activity revealed that BD individuals had attenuated neural representations of unsigned pwPEs, which update beliefs about action-outcome contingencies, compared to HC. This was marked by attenuated gamma activity and weaker alpha-beta suppression (that is, relatively higher alpha-beta amplitude) 0.5–0.9 seconds post-outcome across multiple PFC, OFC, ACC, and motor regions. Spatial effects in anatomical PFC labels corresponded with functional vmPFC, dmPFC, and dlPFC, aligning with the neural correlates of decision-making under uncertainty[35, 62–64], and BD-specific neural alterations during reward processing[5, 10, 66, 67].

Recent rhythm-based formulations of predictive coding suggest distinct roles of oscillatory activity at different frequencies in conveying predictions and PE during perception[33, 84]. Alpha and beta oscillations in deep cortical layers are implicated in conveying top-down predictions, while gamma oscillations in superficial layers are associated with the representation of PE, particularly in sensory cortices and related areas[33, 84]. This division has received empirical validation in both human and animal studies, supporting generative models like predictive coding[85–87] and hierarchical Bayesian inference[36, 88], extending across perceptual and cognitive domains[35, 89]. In models of hierarchical Bayesian inference, like the HGF, these oscillatory activities may underpin pwPE encoding, demonstrating an antithetical modulation of alpha/beta and gamma activity[36]. The observed dysregulation of these rhythms in conditions like anxiety[37, 38] suggests a neurophysiological basis for symptoms resulting from imbalances in belief updating.

Our findings indicate that in euthymic BD, exacerbated alpha and beta activity may inhibit gamma activity during unsigned pwPE encoding, potentially accounting for maladaptive belief updating. This may reflect an under-reliance on predictions about action-outcome contingencies to optimise behaviour, in line with the computational results. Such rhythmic changes match electrophysiological evidence of heightened beta and reduced gamma activity in BD during oddball processing[90, 91]. Moreover, using the TRGC to assess directional influences in frequency-domain activity, we observed stronger evidence for beta-band directional flow in BD compared to HC, from cACC to rACC and rMFG, and from SFG to cMFG, during trials with larger unsigned pwPEs. TRGC values increased in BD but decreased in HC. The decrease in HC aligns with primate research, in which beta-band Granger causality in the PFC decreases during unpredictable trials[34], suggesting that it reflects a normative response. Thus, our study revealed that euthymic BD was associated both with altered frequency-domain amplitude changes and functional connectivity during belief updating.

Insufficient GABAergic neurotransmission and excessive glutamatergic system activity have previously been linked to electrophysiological alterations in BD[92]. Considering that mood stabilisers for BD, such as valproate and lithium, may have opposing effects on beta activity and potentially on beta/gamma connectivity[92, 93], a promising avenue for future research is to assess the modulation of alpha-beta and gamma amplitude and connectivity during pwPE encoding in BD as potential markers for tracking treatment response and for diagnostic purposes. A key limitation of our study is the inclusion of patients on diverse psychiatric medications, including mood stabilisers, antipsychotics, and antidepressants. These treatments impact various neurotransmitter systems like dopamine and serotonin, affecting neural and behavioural aspects of decision-making[9, 94]. The varied effects of these medications may have influenced the magnitudes of the effects reported, a factor future research should consider. Additionally, the study was not preregistered, yet all analyses followed established pipelines from our recent work involving similar tasks[37, 48], except for the TRGC analysis. This was specifically designed based on similar Granger-causality analyses that assess rhythm-based hypotheses of predictive processing[34]. Lastly, our study did not contrast the HGF framework with alternatives like Bayesian change-point models[95] or those jointly estimating volatility and stochasticity[21]. Future research would benefit from such comparisons to validate the computational processes underlying belief updating alterations in BD.

In sum, our findings highlight significant alterations in belief updating among BD individuals during euthymia, when learning reward-based probabilistic mappings in volatile environments, without affecting the motivational aspects of motor execution. Importantly, the identification of frequency-domain amplitude and functional connectivity alterations underpinning these computational maladaptations provides crucial insights for enhancing relapse prediction and monitoring treatment response in future research.

Code availability

The code for the analyses will be shared at https://osf.io/m63a8/

Conflict of Interest

The authors declare that they have no competing financial interests in relation to the work described.

Funding

This research was partially supported by the Basic Research Programme of the National Research University Higher School of Economics (Russian Federation). The research used the Elekta Neuromag 306-channel MEG system at the Centre for Neurocognitive Research (MEG-Centre) in Moscow (Russian Federation).

References

1. Grande I, Berk M, Birmaher B, Vieta E. Bipolar disorder. Lancet. 2016; 387(10027):1561–72.
2. Merikangas KR, Jin R, He JP, Kessler RC, Lee S, Sampson NA, et al. Prevalence and Correlates of Bipolar Spectrum Disorder in the World Mental Health Survey Initiative. Arch Gen Psychiatry. 2011; 68(3):241.
3. Gorrindo T, Blair RJR, Budhani S, Dickstein DP, Pine DS, Leibenluft E. Deficits on a probabilistic response-reversal task in patients with pediatric bipolar disorder. Am J Psychiatry. 2005; 162(10):1975–7.
4. Johnson SL, Edge MD, Holmes MK, Carver CS. The Behavioral Activation System and Mania. Annu Rev Clin Psychol. 2012; 8(1):243–67.
5. Mason L, O’Sullivan N, Montaldi D, Bentall RP, El-Deredy W. Decision-making and trait impulsivity in bipolar disorder are associated with reduced prefrontal regulation of striatal reward valuation. Brain. 2014; 137(8):2346–55.
6. Trost S, Diekhof EK, Zvonik K, Lewandowski M, Usher J, Keil M, et al. Disturbed Anterior Prefrontal Control of the Mesolimbic Reward System and Increased Impulsivity in Bipolar Disorder. Neuropsychopharmacology. 2014; 39:1914–23.
7. Adida M, Jollant F, Clark L, Guillaume S, Goodwin GM, Azorin JM, et al. Lithium might be associated with better decision-making performance in euthymic bipolar patients. Eur Neuropsychopharmacol. 2015; 25(6):788–97.
8. Schreiter S, Spengler S, Willert A, Mohnke S, Herold D, Erk S, et al. Neural alterations of fronto-striatal circuitry during reward anticipation in euthymic bipolar disorder. Psychol Med. 2016; 46(15):3187–98.
9. Jiménez E, Solé B, Arias B, Mitjans M, Varo C, Reinares M, et al. Characterizing decision-making and reward processing in bipolar disorder: A cluster analysis. Eur Neuropsychopharmacol. 2018; 28(7):863–74.
10. Bart CP, Titone MK, Ng TH, Nusslock R, Alloy LB. Neural reward circuit dysfunction as a risk factor for bipolar spectrum disorders and substance use disorders: A review and integration. Clin Psychol Rev. 2021; 87:102035.
11. Adida M, Jollant F, Clark L, Besnier N, Guillaume S, Kaladjian A, et al. Trait-related decision-making impairment in the three phases of bipolar disorder. Biol Psychiatry. 2011; 70(4):357–65.
12. Brambilla P, Perlini C, Bellani M, Tomelleri L, Ferro A, Cerruti S, et al. Increased salience of gains versus decreased associative learning differentiate bipolar disorder from schizophrenia during incentive decision making. Psychol Med. 2013; 43(3):571–80.
13. Powers RL, Russo M, Mahon K, Brand J, Braga RJ, Malhotra AK, et al. Impulsivity in bipolar disorder: relationships with neurocognitive dysfunction and substance use history. Bipolar Disord. 2013; 15(8):876–84.
14. Ossola P, Garrett N, Sharot T, Marchesi C. Belief updating in bipolar disorder predicts time of recurrence. Elife. 2020; 9:1–17.
15. Mason L, Eldar E, Rutledge RB. Mood Instability and Reward Dysregulation—A Neurocomputational Model of Bipolar Disorder. JAMA Psychiatry. 2017; 74(12):1275.
16. Eldar E, Felso V, Cohen JD, Niv Y. A pupillary index of susceptibility to decision biases. Nat Hum Behav. 2021; 5(5):653–62.
17. Pulcu E, Saunders KEA, Harmer CJ, Harrison PJ, Goodwin GM, Geddes JR, et al. Using a generative model of affect to characterize affective variability and its response to treatment in bipolar disorder. Proc Natl Acad Sci U S A. 2022; 119(28).
18. Eldar E, Niv Y. Interaction between emotional state and learning underlies mood instability. Nat Commun. 2015; 6(1):6149.
19. Moningka H, El-Deredy W, Bentall RP, Mason L. Misperceiving momentum: computational mechanisms of biased striatal reward prediction errors in bipolar disorder. bioRxiv. 2023; 2023.07.11.548610.
20. Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS. Learning the value of information in an uncertain world. Nat Neurosci. 2007; 10(9):1214–21.
21. Piray P, Daw ND. A model for learning based on the joint estimation of stochasticity and volatility. Nat Commun. 2021; 12(1):6587.
22. Diaconescu AO, Mathys C, Weber LAE, Daunizeau J, Kasper L, Lomakina EI, et al. Inferring on the Intentions of Others by Hierarchical Bayesian Learning. PLOS Comput Biol. 2014; 10(9):e1003810.
23. Mathys C, Daunizeau J, Friston KJ, Stephan KE. A Bayesian foundation for individual learning under uncertainty. Front Hum Neurosci. 2011; 5:9.
24. Mathys CD, Lomakina EI, Daunizeau J, Iglesias S, Brodersen KH, Friston KJ, et al. Uncertainty in perception and the Hierarchical Gaussian Filter. Front Hum Neurosci. 2014; 8.
25. Friston K, Kilner J, Harrison L. A free energy principle for the brain. J Physiol. 2006; 100(1–3):70–87.
26. Friston K, Schwartenbeck P, FitzGerald T, Moutoussis M, Behrens T, Dolan RJ. The anatomy of choice: active inference and agency. Front Hum Neurosci. 2013; 7.
27. Reed EJ, Uddenberg S, Suthaharan P, Mathys CD, Taylor JR, Groman SM, et al. Paranoia as a deficit in non-social belief updating. Elife. 2020; 9:1–55.
28. Deserno L, Boehme R, Mathys C, Katthagen T, Kaminski J, Stephan KE, et al. Volatility Estimates Increase Choice Switching and Relate to Prefrontal Activity in Schizophrenia. Biol Psychiatry Cogn Neurosci Neuroimaging. 2020; 5(2):173–83.
29. Pulcu E, Browning M. The Misestimation of Uncertainty in Affective Disorders. Trends Cogn Sci. 2019; 23(10):865–75.
30. Clark JE, Watson S, Friston KJ. What is mood? A computational perspective. Psychol Med. 2018; 48(14):2277–84.
31. Aylward J, Valton V, Ahn WY, Bond RL, Dayan P, Roiser JP, et al. Altered learning under uncertainty in unmedicated mood and anxiety disorders. Nat Hum Behav. 2019; 3(10):1116–23.
32. Arnal LH, Giraud AL. Cortical oscillations and sensory predictions. Trends Cogn Sci. 2012; 16(7):390–8.
33. Sedley W, Gander PE, Kumar S, Kovach CK, Oya H, Kawasaki H, et al. Neural signatures of perceptual inference. Elife. 2016; 5.
34. Bastos AM, Lundqvist M, Waite AS, Kopell N, Miller EK. Layer and rhythm specificity for predictive routing. Proc Natl Acad Sci. 2020; 117(49):31459–69.
35. Domenech P, Rheims S, Koechlin E. Neural mechanisms resolving exploitation-exploration dilemmas in the medial prefrontal cortex. Science. 2020; 369(6507).
36. Auksztulewicz R, Friston KJ, Nobre AC. Task relevance modulates the behavioural and neural effects of sensory predictions. PLOS Biol. 2017; 15(12):e2003143.
37. Hein TP, Gong Z, Ivanova M, Fedele T, Nikulin V, Ruiz MH. Anterior cingulate and medial prefrontal cortex oscillations underlie learning alterations in trait anxiety in humans. Commun Biol. 2023; 6(1):271.
38. Hein TP, Ruiz MH. State anxiety alters the neural oscillatory correlates of predictions and prediction errors during reward-based learning. Neuroimage. 2022; 249:118895.
39. Tarasi L, Trajkovic J, Diciotti S, di Pellegrino G, Ferri F, Ursino M, et al. Predictive waves in the autism-schizophrenia continuum: A novel biobehavioral model. Neurosci Biobehav Rev. 2022; 132:1–22.
40. Rushworth MFS, Behrens TEJ. Choice, uncertainty and value in prefrontal and cingulate cortex. Nat Neurosci. 2008; 11(4):389–97.
41. Rouault M, Drugowitsch J, Koechlin E. Prefrontal mechanisms combining rewards and beliefs in human decision-making. Nat Commun. 2019; 10(1):301.
42. Summerside EM, Shadmehr R, Ahmed AA. Vigor of reaching movements: reward discounts the cost of effort. J Neurophysiol. 2018; 119(6):2347–57.
43. Sedaghat-Nejad E, Herzfeld DJ, Shadmehr R. Reward Prediction Error Modulates Saccade Vigor. J Neurosci. 2019; 39(25):5010–7.
44. Ashok AH, Marques TR, Jauhar S, Nour MM, Goodwin GM, Young AH, et al. The dopamine hypothesis of bipolar affective disorder: The state of the art and implications for treatment. Mol Psychiatry. 2017; 22(5):666–79.
45. Zhang CY, Cai X, Guo L, Wang L, Liu Z, Luo XJ, et al. Genetic evidence for the “dopamine hypothesis of bipolar disorder.” Mol Psychiatry. 2023; 28(2):532–5.
46. da Silva JA, Tecuapetla F, Paixão V, Costa RM. Dopamine neuron activity before action initiation gates and invigorates future movements. Nature. 2018; 554(7691):244–8.
47. Fulford D, Johnson SL, Llabre MM, Carver CS. Pushing and Coasting in Dynamic Goal Pursuit. Psychol Sci. 2010; 21(7):1021–7.
48. Tecilla M, Großbach M, Gentile G, Holland P, Sporn S, Antonini A, et al. Modulation of Motor Vigor by Expectation of Reward Probability Trial-by-Trial Is Preserved in Healthy Ageing and Parkinson’s Disease Patients. J Neurosci. 2023; 43(10):1757–77.
49. World Health Organization. The ICD-10 classification of mental and behavioural disorders: clinical descriptions and diagnostic guidelines. World Health Organization; 1992.
50. Taulu S, Hari R. Removal of magnetoencephalographic artifacts with temporal signal-space separation: Demonstration with single-trial auditory-evoked responses. Hum Brain Mapp. 2009; 30(5):1524–34.
51. Gramfort A. MEG and EEG data analysis with MNE-Python. Front Neurosci. 2013; 7.
52. Bürkner PC. brms: An R Package for Bayesian Multilevel Models Using Stan. J Stat Softw. 2017; 80(1).
53. Bürkner PC. Advanced Bayesian Multilevel Modeling with the R Package brms. R J. 2018; 10(1):395.
54. Frässle S, Aponte EA, Bollmann S, Brodersen KH, Do CT, Harrison OK, et al. TAPAS: An Open-Source Software Package for Translational Neuromodeling and Computational Psychiatry. Front Psychiatry. 2021; 12:1–25.
55. Van Veen BD, Van Drongelen W, Yuchtman M, Suzuki A. Localization of brain electrical activity via linearly constrained minimum variance spatial filtering. IEEE Trans Biomed Eng. 1997; 44(9):867–80.
56. Dale AM, Fischl B, Sereno MI. Cortical Surface-Based Analysis. Neuroimage. 1999; 9(2):179–94.
57. Fischl B, Sereno MI, Dale AM. Cortical Surface-Based Analysis. Neuroimage. 1999; 9(2):195–207.
58. Klein A, Tourville J. 101 Labeled Brain Images and a Consistent Human Cortical Labeling Protocol. Front Neurosci. 2012; 6.
59. Andreou C, Frielinghaus H, Rauh J, Mußmann M, Vauth S, Braun P, et al. Theta and high-beta networks for feedback processing: a simultaneous EEG–fMRI study in healthy male subjects. Transl Psychiatry. 2017; 7(1):e1016–e1016.
60. Paulus MP, Feinstein JS, Simmons A, Stein MB. Anterior cingulate activation in high trait anxious subjects is related to altered error processing during decision making. Biol Psychiatry. 2004; 55(12):1179–87.
61. Hayden BY, Heilbronner SR, Pearson JM, Platt ML. Surprise Signals in Anterior Cingulate Cortex: Neuronal Encoding of Unsigned Reward Prediction Errors Driving Adjustment in Behavior. J Neurosci. 2011; 31(11):4178–87.
62. Rolls ET, Deco G, Huang CC, Feng J. The human orbitofrontal cortex, vmPFC, and anterior cingulate cortex effective connectome: emotion, memory, and action. Cereb Cortex. 2022; 33(2):330–56.
63. Nour MM, Dahoun T, Schwartenbeck P, Adams RA, FitzGerald THB, Coello C, et al. Dopaminergic basis for signaling belief updates, but not surprise, and the link to paranoia. Proc Natl Acad Sci U S A. 2018; 115(43):E10167–76.
64. Schulreich S, Schwabe L. Causal Role of the Dorsolateral Prefrontal Cortex in Belief Updating under Uncertainty. Cereb Cortex. 2021; 31(1):184–200.
65. Grupe DW, Nitschke JB. Uncertainty and anticipation in anxiety: an integrated neurobiological and psychological perspective. Nat Rev Neurosci. 2013; 14(7):488–501.
66. Dutra SJ, Man V, Kober H, Cunningham WA, Gruber J. Disrupted cortico‐limbic connectivity during reward processing in remitted bipolar I disorder. Bipolar Disord. 2017; 19(8):661–75.
67. Acuff HE, Versace A, Bertocci MA, Ladouceur CD, Hanford LC, Manelis A, et al. Baseline and follow-up activity and functional connectivity in reward neural circuitries in offspring at risk for bipolar disorder. Neuropsychopharmacology. 2019; 44(9):1570–8.
68. Kobayashi K, Hsu M. Neural Mechanisms of Updating under Reducible and Irreducible Uncertainty. J Neurosci. 2017; 37(29):6972–82.
69. Litvak V, Jha A, Flandin G, Friston K. Convolution models for induced electromagnetic responses. Neuroimage. 2013; 64:388–98.
70. Hein TP, de Fockert J, Ruiz MH. State anxiety biases estimates of uncertainty and impairs reward learning in volatile environments. Neuroimage. 2021; 224:117424.
71. Stefanics G, Heinzle J, Horváth AA, Stephan KE. Visual Mismatch and Predictive Coding: A Computational Single-Trial ERP Study. J Neurosci. 2018; 38(16):4020–30.
72. Spitzer B, Blankenburg F, Summerfield C. Rhythmic gain control during supramodal integration of approximate number. Neuroimage. 2016; 129:470–9.
73. Haufe S, Nikulin VV, Müller KR, Nolte G. A critical assessment of connectivity measures for EEG data: A simulation study. Neuroimage. 2013; 64(1):120–33.
74. Pellegrini F, Delorme A, Nikulin V, Haufe S. Identifying good practices for detecting inter-regional linear functional connectivity from EEG. Neuroimage. 2023; 277:120218.
75. Grissom RJ, Kim JJ. Effect Sizes for Research. Routledge; 2012.
76. Ruscio J, Mullen T. Confidence Intervals for the Probability of Superiority Effect Size Measure and the Area Under a Receiver Operating Characteristic Curve. Multivariate Behav Res. 2012; 47(2):201–23.
77. Wetzels R, Wagenmakers EJ. A default Bayesian hypothesis test for correlations and partial correlations. Psychon Bull Rev. 2012; 19(6):1057–64.
78. Oostenveld R, Fries P, Maris E, Schoffelen JM. FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data. Comput Intell Neurosci. 2011; 2011:1–9.
79. Maris E, Oostenveld R. Nonparametric statistical testing of EEG- and MEG-data. J Neurosci Methods. 2007; 164(1):177–90.
80. Lasagna CA, Pleskac TJ, Burton CZ, McInnis MG, Taylor SF, Tso IF. Mathematical modeling of risk-taking in bipolar disorder: Evidence of reduced behavioral consistency, with altered loss aversion specific to those with history of substance use disorder. Comput Psychiatry. 2022; 6(1):96.
81. Yechiam E, Hayden EP, Bodkins M, O’Donnell BF, Hetrick WP. Decision making in bipolar disorder: A cognitive modeling approach. Psychiatry Res. 2008; 161(2):142–52.
82. Dickstein DP, Finger EC, Brotman MA, Rich BA, Pine DS, Blair JR, et al. Impaired probabilistic reversal learning in youths with mood and anxiety disorders. Psychol Med. 2010; 40(7):1089–100.
83. Browning M, Behrens TE, Jocham G, O’Reilly JX, Bishop SJ. Anxious individuals have difficulty learning the causal statistics of aversive environments. Nat Neurosci. 2015; 18(4):590–6.
84. Bastos AM, Usrey WM, Adams RA, Mangun GR, Fries P, Friston KJ. Canonical microcircuits for predictive coding. Neuron. 2012; 76(4):695–711.
85. Xing D, Yeh CI, Burns S, Shapley RM. Laminar analysis of visually evoked activity in the primary visual cortex. Proc Natl Acad Sci. 2012; 109(34):13871–6.
86. Michalareas G, Vezoli J, van Pelt S, Schoffelen JM, Kennedy H, Fries P. Alpha-Beta and Gamma Rhythms Subserve Feedback and Feedforward Influences among Human Visual Cortical Areas. Neuron. 2016; 89(2):384–97.
87. Roberts MJ, Lowet E, Brunet NM, TerWal M, Tiesinga P, Fries P, et al. Robust gamma coherence between macaque V1 and V2 by dynamic frequency matching. Neuron. 2013; 78(3):523–36.
88. Palmer CE, Auksztulewicz R, Ondobaka S, Kilner JM. Sensorimotor beta power reflects the precision-weighting afforded to sensory prediction errors. Neuroimage. 2019; 200:59–71.
89. van Pelt S, Heil L, Kwisthout J, Ondobaka S, van Rooij I, Bekkering H. Beta- and gamma-band activity reflect predictive coding in the processing of causal events. Soc Cogn Affect Neurosci. 2016; 11(6):973–80.
90. Ethridge LE, Hamm JP, Shapiro JR, Summerfelt AT, Keedy SK, Stevens MC, et al. Neural Activations During Auditory Oddball Processing Discriminating Schizophrenia and Psychotic Bipolar Disorder. Biol Psychiatry. 2012; 72(9):766–74.
91. Özerdem A, Güntekin B, Tunca Z, Başar E. Brain oscillatory responses in patients with bipolar disorder manic episode before and after valproate treatment. Brain Res. 2008; 1235:98–108.
92. Lu Z, Wang H, Gu J, Gao F. Association between abnormal brain oscillations and cognitive performance in patients with bipolar disorder: Molecular mechanisms and clinical evidence. Synapse. 2022; 76(11–12):e22247.
93. Özerdem A, Güntekin B, Atagün I, Turp B, Başar E. Reduced long distance gamma (28–48 Hz) coherence in euthymic patients with bipolar disorder. J Affect Disord. 2011; 132(3):325–32.
94. Moncrieff J, Cohen D, Porter S. The Psychoactive Effects of Psychiatric Medication: The Elephant in the Room. J Psychoactive Drugs. 2013; 45(5):409–15.
95. Moens V, Zénon A. Learning and forgetting using reinforced Bayesian change detection. PLOS Comput Biol. 2019; 15(4):e1006713.

Editorial decision: revise (19 Aug, 2024)
Review #1 received at journal (06 Jun, 2024)
Review #2 received at journal (29 May, 2024)
Reviewer #2 agreed at journal (08 May, 2024)
Reviewer #1 agreed at journal (08 May, 2024)
Reviewers invited by journal (08 May, 2024)
Submission checks completed at journal (03 Apr, 2024)
First submitted to journal (02 Apr, 2024)
Unknown event (02 Apr, 2024)
Editor assigned by journal (01 Apr, 2024)