We investigated cultural fit in emotions using natural language descriptions, as well as intensity ratings, of emotional experiences collected from Dutch-speaking Belgians and Turkish migrants in Belgium. We included language in study of (dis)similarities in emotions because of its potential to provide insight into emotional meaning-making beyond the intensity rating scales provide. We began by assessing whether measures of emotional fit (both rating fit and language fit) reflected cultural differences in how participants described four types of emotion-eliciting situations varying in valence (positive vs. negative) and interpersonal motive (relationship- vs. autonomy-promoting). Consistent with our expectations, we found that cultural groups could be clearly differentiated based on their patterns of language use – for example, Turkish migrants’ language use was more similar to other Turkish migrants than it was to Belgians. Surprisingly, these cultural differences were weaker and less consistent for the traditional rating fit. We next assessed the relationship between the two measures of fit. Contrary to our expectations, we did not find a positive association between language fit and rating fit. In fact, in some analyses the association was negative, particularly when we compared Turkish migrants’ ways of fitting with Belgians in negatively-valenced situations.
The present analyses suggest that language fit may be more reliable than rating fit in differentiating cultural groups. To follow up on the inconsistency of expected cultural differences in rating fit (H1a-d), we examined the associations between fitting in with one culture and with the other. For both rating fit and language fit, people who fit better with the Turkish culture also fit better with the Belgian culture. However, our results showed that positive associations between cultural fits were stronger for intensity ratings than for language use. This imbalance might be partly explained by methodological differences in calculating the two types of fit. While we include all types of language properties (i.e., all LIWC dictionaries) in estimating language fit, we remove the emotion items that cluster differently across cultures when estimating rating fit. This preprocessing step ensures cultural equivalence of the concepts we are comparing, but leaves out items that are cross-culturally different in meaning. It may thus lead to an underestimation of the actual differences that does not occur in calculating language fit. Also, rating fit might be more susceptible to prompt-specific constraints than language fit, because the emotion items used in the rating task more closely relate to the prompts (e.g. valence) than most language categories used to estimate language fit (exceptions are positive vs. negative affect categories).
The lack of a consistent relationship between the fit measures suggests that self-reported intensity ratings and natural language descriptions capture distinct facets of emotional meaning-making. Even if two people give a similar pattern of responses on rating scales, they may diverge in how they conceptualize the emotional experiences connected to those ratingswhich may reveal itself in language. The words people use can be seen as indicators of their mental attention (Boyd & Schwartz, 2021) – in descriptions of emotional experience, perhaps the features (bodily sensations, cognitive processes, involvement of others, etc.) that are being foregrounded. For instance, looking at the words Chinese Americans used in their free descriptions of emotional experiences, Tsai and colleagues (2004) found that migrants’ focus on social and somatic aspects varied as a function of their acculturation orientation. Accordingly, the present findings might be better explained by considering language fit to represent similarity in how attention is deployed across a wide range of emotional features. These features may not be equally foregrounded when people rate emotion words.
The negative association between Turkish migrants’ rating fit and language fit with Belgian culture in negative situations is noteworthy. This finding suggests that when Turkish migrants reported feelings similar to those of an average Belgian, they tended to focus on different features of the experience in their descriptions. For example, when Turkish migrants described offenses, they might have been feeling angry to a similar extent as Belgians but differentially attending to aspects of the situation, for instance paying more attention to social concerns, resulting in different patterns of language, for instance more social words (Tsai et al., 2004). The types of events described might also differ, for example, offenses might involve a close or a distant other, which lead to different appraisal patterns, e.g. attribution of blame, across cultures (Boiger et al., 2018). The appraisal of blame, and who to blame, might reveal itself in language, in form of differing (co)-occurrence of certain words, such as pronouns and intentional verbs (Windsor et al., 2014). One limitation of the current approach to measuring cultural fit in language is that the properties causing differences in fit are not directly observable; only overall estimates of fit are compared. Although including all types of words to estimate language fit allowed us to take a comprehensive, holistic perspective on patterns of language use, it does not allow us to examine which aspects of experience are driving emotional meaning-making. Future research is needed to explore these possibilities further; for instance, by manually coding relevant components of emotions (e.g., social concerns, or appraisals of self- and other-blame), by examining the types of events using automated approaches such as topic modelling (Blei, 2012), and by doing a targeted examination of language properties and their co-occurence to uncover the associations between the types of emotional situations, appraisals, features of the experience being attended to, and subjective feelings.
A multimethod approach to emotional fit may provide valuable insights into the intricate and diverse nature of emotional meaning-making in intercultural contexts, and language is a powerful tool by which we can examine multiple aspects of the emotional experience. As a first step, we demonstrated that cultural differences can be captured by the language used to talk about emotional experience; people have higher language fit with their own culture than the other, showing face validity of our approach. Our tests of convergent validity were not met – language fit and rating fit were not positively related – leaving questions for future research about why these measures deviate and what they each represent. At the same time, the language data revealed differences in migrants’ emotional meaning-making that were not captured by the intensity ratings alone. Future research will need to assess predictive validity by examining whether language fit is differentially associated with relational, psychological, and societal outcomes of interest in the context of immigration.