Modeling choice history biases
In general, the modeling approach started with an unbiased baseline model, followed by including the actor’s identity and choice history data. The modeling steps were hypothesis-driven, and the objective was to arrive at an arguably less complex model that is simpler to interpret and performs well. In the following, we report the results of each modeling step and how we arrived at the best-fitting model.
First, we examined the Task-Only model, where only the current trial stimulus was included as a predictor to model the participants’ performance without additional constraints, such as what the previous response was or the identity of the previous trial actor. This model served as a baseline model and assumed the estimation of the current response depends solely on the current trial stimulus, which was the actual task. The results showed a statistically significant effect of the current stimulus on the current response (β = 1.04, 95% CI [1.01, 1.06], SE = 0.01, p < .001). The accuracy of the model’s prediction was 73.41%, with an MSE of 0.193. The accuracy value was computed by comparing the model’s predicted probabilities (in values 0s and 1s) against the true labels of the outcome variable. The MSE value was calculated as the average squared difference between the predicted and the true values. The threshold for transforming the predicted probabilities into predicted labels was set at 0.5. If the predicted probability for a rightward response exceeded 0.5, we interpreted it as a prediction for class 1 (rightward response). Otherwise, the predicted probabilities below 0.5 were classified as a prediction for class 0 (leftward response). The predicted probability for a rightward response given a rightward-moving stimulus derived from the model’s estimates was 73.8%, which aligned with our intention of the task design to yield a goal of about 75% accuracy performance. The significant and positive association between the predictor and the response suggested the participants followed task instructions and behaved as they should.
To investigate the influence of past trial responses, we built on the baseline model to additionally include the variable “own last response”, which coded for the last response by the acting participant. Note that the variable included only the last response and did not account for the number of trials that may have passed since the same participant last acted. In line with the second hypothesis, which posits that the participant ignores his partner’s response, this Own History model assumes any of the partner’s past choices as irrelevant but allows modeling of the own history bias. The results of the model showed that the participant’s own last response positively and significantly predicted the current response (β = 0.15, 95% CI [0.12, 0.18], SE = 0.01, p < .001), while the current stimulus variable remained significant as seen in the baseline model (β = 1.04, 95% CI [1.01, 1.07], SE = 0.01, p < .001). The model exhibited an AIC value of 31334.72 (Δ = -118.39 units from the Task-Only model). The AIC measures the model’s goodness of fit and complexity by penalizing additional parameters. The observed decrease in AIC values thus suggested the additional variable led to a better fit. The model's accuracy remained at 73.4% with an MSE value of 0.192. Note, however, that the accuracy value was computed based on predictions thresholded at 0.5, thus not fully reflecting the variations in the model’s factual prediction values. The results suggested a repeat bias based on the acting participant’s last response.
Given the participant exhibited a choice history bias, we extended the previous Own History model to include the partner’s last response. This step tested how the partner’s history response influences the model’s prediction. This Own & Partner History model did not account for the number of trials that had passed since the same or different participant (acting or observing participant) last acted and assumed distinct contributions of the participant’s last choice response versus that of their partner’s. Therefore, the model further tested the second hypothesis on whether there is a dyadic choice history or only individual choice history bias. The results showed that both the current stimulus (β = 1.04, 95% CI [1.01, 1.07], SE = 0.01, p < .001) and the participant's own last response (β = 0.15, 95% CI [0.12, 0.18], SE = 0.01, p < .001) had a significant positive effect on the current response. In contrast, the last response made by the participant's partner showed a significant negative association with the current response (β = -0.04, 95% CI [-0.07, -0.02], SE = 0.01, p < .01). The accuracy of the model was 73.4% with an MSE value of 0.192. The AIC value, however, decreased compared to the Own History model (AIC = 31326, ΔAIC= -8.32). The results indicated that including the partner’s last response slightly improved the model’s predictive performance. This suggested that the participant tended to repeat his last decision but did not ignore his partner’s last decision. Nevertheless, the influence of the partner’s last response on switching the choice to be made was relatively slight compared to the participant’s own last response.
The results thus far indicate an influence of the acting and observing participant’s last response on choice behavior; however, it is unclear whether this choice history bias effect reaches further back in trial history for both acting and observing participants. Therefore, in the next step, we developed a family of models (Trace History model), which considered more and more decisions further into the past, to investigate the influence of the past response as the number of the last trial lag increases. These models are indexed up to the fifth lag, reflecting the maximal lag considered for the acting (own) and observing (partner) participant. The indexing notation (i, j) in the Trace History model represents the number of the last trial lags for the acting and observing participants, respectively. As such, the Own & Partner History model is identical to the Trace History (1,1) model. Including the participant’s own second-last choice response results in the Trace History (2,1) model. Following this, we included the same data for the dyadic partner, leading to the Trace History (2,2) model. In this order, we repeated the modeling steps until we reached the Trace History (5,5) model, where both the own and partner's five last trials are considered. From this series of model fitting, we observed the inclusion of the variables for the own last trial data from lag 2 until lag 5 showed a consistent statistical significance (p < .001), reduction in the AIC values (Mean ΔAIC = -176.80), constant accuracy value of 73.41% with slight decrease in the MSE values (Mean ΔMSE = -0.013). Nevertheless, the partner’s trial history response data beyond the last one did not systematically improve the model’s predictive performance. Specifically, the model that showed statistically significant variables with the lowest AIC and MSE is the Trace History (5,1) model (AIC = 30619; MSE = 0.187) in comparison with the rest of the family of models fitted. The Trace History (5,1) model exhibited estimates for each of the acting participant’s trial lag of β1 = 0.10, 95% CI [0.08, 0.13]; β2 = 0.23, 95% CI [0.20, 0.25]; β3 = 0.17, 95% CI [0.14, 0.20]; β4 = 0.15, 95% CI [0.12, 0.18]; β5 = 0.12, 95% CI [0.10, 0.25], each estimate with a SE = 0.01 and a p < .001. In contrast, the partner's last response variable showed a negative estimate of -0.04, 95% CI [-0.06, -0.01], SE = 0.01, p < .01. The stimulus variable showed an estimate of 1.08, 95% CI [1.05, 1.11], SE = 0.01, p < .001. Figure 3 presents a schema illustrating the statistically significant log-odds estimates observed in the Trace History (5,1) model for each variable (own and partner) at different lag indices. This schema highlights the influence of the own trial history compared to that of the partner in predicting the participant’s choice behavior. Taken together, the results suggested only the participant’s own choice bias effect traces further back in trial history, with a small dyadic influence observed in which the partner’s last response predicted a switching of choice response.
Having examined the role of the acting and observing participant’s last five decisions in predicting choice response, we investigated whether a simpler model that assumes exponential decay of past choices could provide an equally good fit. For this, we developed the Joint Weighted History model that fitted the acting and observing participants’ trial history responses in a way that accounted for a memory-decaying effect. The Joint Weighted History model builds on the Own & Partner History model which additionally included two variables that combined the participant’s and the partner’s responses, respectively. These responses were weighted as a function of the lag to the current trial. This approach conceptually aligned with past work that computed a “history kernel” to quantify the effect of stimuli and responses from past trials on the current choice processes (Urai et al., 2017). There, positive and negative weights were assigned to each previous stimulus and choice to indicate a tendency to repeat or alternate. Then, every set of seven previous trials was convolved with exponentially decaying functions sensitive to the changes in history data due to time, e.g., more distant trials having less impact (Fründ et al., 2014). Here, the exponentially weighted moving averages for the own and the partner trial responses were computed with a loss factor gamma (γ) of 0.8. The γ value determined how quickly the influence of the older data points decayed. This value approximated for a window size of 5 last trial lags (1 / (1 - γ) = 5), where the last decision was assumed to receive the highest weight and exponentially diminished as it moved further back in trial history. The results of the Joint Weighted History model showed the current stimulus (β = 1.08, 95% CI [1.05, 1.11], SE = 0.01, p < .001) and the weighted own trial history variable (β = 0.95, 95% CI [0.88, 1.02], SE = 0.04, p < .001) had a significant positive effect on the current response. In contrast, the partner's last response variable showed a significant negative estimate (β = -0.04, 95% CI [-0.07, -0.00], SE = 0.02, p < .05). Notably, during model fitting, the participant's own last response variable was corrected into a negative estimate (β= -0.18, 95% CI [-0.21, -0.14], SE = 0.02, p < .001). However, the additional weighted partner trial history variable did not suggest significance. The model improved its predictive power as indicated by a decrease in the AIC value compared to the Own & Partner History model (AIC = 30616, ΔAIC = -710.02). The model's accuracy is 73.41% and an MSE of 0.262. Further removing the weighted partner trial history variable led to improvement in the model’s performance as observed in the 2 units decrease in the AIC (AIC = 30614, ΔAIC = -2), along with a prediction accuracy of 73.41% and an MSE of 0.187. This reduced model, named the Individual Weighted History model, therefore exhibited more of a balance between model complexity and explanatory power. Specifically, it showed significantly positive estimates for the current stimulus (β = 1.08, 95% CI [1.05, 1.11], SE = 0.01, p < .001) and the weighted own trial history variable (β = 0.95, 95% CI [0.88, 1.02], SE = 0.04, p < .001). The partner's last response variable, however, showed a significant negative estimate (β = -0.04, 95% CI [-0.06, -0.01], SE = 0.01, p < .01). Similarly, the own last response variable exhibited a significant negative estimate (β= -0.18, 95% CI [-0.21, -0.14], SE = 0.02, p < .001). Together, the simpler Individual Weighted History model indicated a choice bias in which the participant tended to repeat based on his own trial history. However, the participant was also more likely to switch after his own last response, similar to the partner’s last response.
In summary, we performed a stepwise modeling procedure guided by our hypotheses and selected the Individual Weighted History model as the best-fitting one based on the model selection criteria. This model assumed exponential decay and achieved a better fit with few parameters. The model specifies an effect of the task stimulus, the last response by the acting, as well as that of the observing participant, and the exponentially weighted moving averages of the participant’s own trial history responses on the choice to be made. Figure 4a shows a coefficient plot for the variable estimates. Figure 4b shows the estimated marginal means (the predicted probability values) for the response at the level of the exponentially weighted moving averages of the participant's own trial history predictor while holding the other predictors constant. As the average weighted response increases from − 1 to + 1, the predicted likelihood of repeating the same response increases, indicating that one’s own cumulative choice history leads to a stronger bias in the participant’s choice decision. Figure 4c, d, e also shows the model’s predicted probabilities for the response at the levels of the other variables in the model, including the trial stimulus and the own and partner’s last response with their associated standard errors. Lastly, Fig. 5 summarizes the regression steps to model choice history biases in dyadic decision-making. The model indicated that while the participant had a repeat bias that spans several trials in the past, he tended to alternate the choice of his own last response. The dyadic partner’s last response also influenced the participant to alternate his choice, albeit to a lesser extent. Thus, the participant did not ignore the partner’s decision as stated in the second hypothesis, rather, he acknowledged the partner's decision by not following it, similar to his own last response.