In this study, we aimed to study the clusters of multidimensional social inclusion (mSI) conceptualized by social functioning and QoL among the patients diagnosed with SSD; and to test mSI prediction using MLR and RF models. Through K-means clustering analyses, five mSI subgroups were identified, including VLL, LL, HL, MH and HH. The MLR and RF models consistently regard genetic predisposition for schizophrenia, premorbid social functioning, symptoms, baseline environment and social domains and numbers of met needs as paramount. Comparatively, the RF model is cautiously considered better because of its capability of discriminating all mSI subgroups. The mSI clusters intend to preliminarily define and bring awareness to social inclusion, an understudied but crucial outcome. The mSI prediction models have clinical and policy implications.
Clusters of multidimensional social inclusion among patients with schizophrenia spectrum disorder
We identified five mSI subgroups among the patients with SSD. The “very low (social functioning)/very low (QoL)” (VLL) and “low/low” (LL) both indicated a low mSI level. VLL was featured by the lowest social functioning and QoL in the spectrum with an even worse QoL, while LL was characterized by relatively low levels of social functioning and QoL with a slightly elevated QoL. These two clusters were merged for the analysis; however, disparities exist when the patients in VLL compared to those in LL might have restricted access to the labor market, recreational activities, and social engagement with simultaneously affected QoL and mSI. Clinicians are possibly not concerned with VLL and LL when advising interventions for similarly low social functioning among VLL and LL. As clinical interventions would probably be a priority, the two clusters could be regarded as one.
The “high/low” (HL) and “medium/high” (MH) clusters implied a medium mSI level. Compared to LL, HL had a better level of social functioning, particularly in the subscales of independence performance, interperson, recreation and prosociality, but shared a nearly similar level of QoL (i.e., moderately lower than the average QoL of all patients with SSD), pointedly with a distinctively lower satisfaction towards general health, environment, and physical conditions. Differently, MH showed an average social functioning level and a mildly higher QoL than the average QoL of all patients with SSD. It is noteworthy that about 50% of patients with SSD are in HL. The feature disparity across LL, HL and MH might point out that the patients in HL had better performance in independent living and (possibly resulted) increasing social exposure through recreational and social activities. Such an involvement could cause patients to compare themselves to the healthy population. This could shift the perception and stimulate other psychological impacts such as internalized stigma, which consequently may lead to a systematically perceived low QoL [38]. Moreover, a significant difference in occupation was absent. A likely elucidation for these patients ending up in different mSI levels is that patients in LL might join a simple task from a program or struggle to continue at work due to low functioning and a lack of support from workplaces [39, 40]. Comparatively, patients in HL and MH who function better are likely to have and stay at a job. However, with increasing functioning, patients in HL might be not satisfied with work suitability, financial earnings, skills required and meaning [40, 41], which results in a lower QoL and mSI.
Eventually, the “high/high” (HH) showed a high mSI level with the highest levels of social functioning and QoL, suggesting that this patient subgroup possibly mimics healthy controls, albeit on an overall lower level in comparison to healthy controls presumably due to their psychiatric conditions.
The characteristics displayed in the mSI subgroups implicated that good functioning or satisfaction in one aspect does not necessarily signify a better perceived mSI. For instance, patients doing equivalently well in occupation do not equate to showing similarly good mSI. Therefore, mSI is essential, as a holistic approach, to provide a comprehensive overview of social inclusion of an individual. The subgroup characteristics could also guide the intervention focus. The patients in VLL and LL may require more psychosocial interventions to manage symptoms and improve social functioning. They should also be prioritized to the eligibility for protected living when independent living is not achievable. The patients in HL and MH could target current programs designed for stigma reduction. The patients in HH could be offered training for advanced skills and opportunities for more challenging working position to further grow self-esteem and self-actualization that are beneficial to mSI enhancement.
Common factors predictive of multidimensional social inclusion
PRSSCZ, symptoms (i.e., positive, negative, depressive), premorbid social functioning, baseline environment and social domain satisfactions and number of met needs were found to be predictors of mSI.
Congruent with previous studies [18, 42–45], in the MLR model, more severe core negative symptoms increase the risk of having low-to-medium mSI (i.e., LL and HL relative to HH). Surprisingly, worse positive symptoms significantly predict good mSI (i.e., MH relative to HH) with a mild protective effect. Other studies have found that positive symptoms do not contribute much to QoL or social cognition [46–48], although the cross-sectional symptomatic remission [49] can significantly improve social functioning [50]. Our result might be the consequence of heterogeneous effects from various positive symptoms at baseline on a prospective mSI [42]. A higher genetic vulnerability towards SSD displayed a significant protective effect on good mSI (HL and MH relative to HH). This is possibly due to the single comparison in a relatively low sample size of HH in a multivariate model, which may occupy the variability of mSI concerning PRSSCZ, and consequently yield a dubious finding. Therefore, the relationship between the genetic predisposition and mSI should be independently investigated in well-powered research. Counterintuitively, we found that the more often an SSD patient has depressive symptoms, the more likely the patient is to be in high mSI (i.e., HH) than in medium mSI (i.e., HL) in the MLR model, which is discordant with previous studies [51–53]. One possible explanation is that in the diagnostic categories of SSD, patients with a higher level of depressive symptoms are more likely to have affective symptoms and to be associated with affective dysregulation, which results in a better outcome (i.e., mSI in our case) than the one of the patients with non-affective symptoms such as withdrawal in HL [54]. Specifically, the early detection and interventions of depressive symptoms could be essential to help patients with SSD, which might further impact on their lives and subsequent mSI. All the aforesaid factors were confirmed informative in the RF model as well.
Compared to the previously mentioned factors, premorbid social functioning has the highest negative effect on low-to-medium mSI in the MLR model (i.e., LL and HL relative to HH). Thus, although ranked after symptoms and next to genetic predisposition in the RF model, premorbid social functioning shows moderate predictability of the 3-year mSI. Earlier studies have shown that worse premorbid social functioning, resulting from environmental (socio-economic status and cultural specificity) and genetic factors [55, 56], may lead to severer negative symptoms and poorer social outcomes later in the course of SSD [57, 58]. Therefore, the premorbid social functioning is undoubtfully vital [18], and can be potentially used for screening of low mSI. Yet, it has not caught enough attention in the field given the limited literature.
As expected, the baseline environment and social domain satisfactions are consistently crucial. Abundant studies have emphasized the importance of occupation (and thus financial income and opportunities for acquiring new skills) and social relationships [2, 59], a safe stable housing, family support [60] and inclusive and accessible support systems across sectors such as transportation [60, 61] for social inclusion. On the other hand, a higher level of fulfilled needs significantly distinguishes MH and HH only in the MLR model. It is conceivable that increased met needs might indicate better social functioning and thus improve mSI among the patients with medium-to-high mSI. In other mSI clusters where needs were met equivalently to the HH subgroup, the heterogeneity within and between subgroups (i.e., LL, HL and HH) and subjective criteria and expectations on met needs could attenuate the significant effects. Particularly in the RF model, the number of met needs is regarded important. Therefore, with a growing emphasis on extramural care [62], the local communities and mental health organizations need to incorporate the heterogeneous environmental and social needs, beyond the medical needs, of patients with SSD at different mSI levels. More generally, given the possibility of a blend of a healthy population and a population with varying physical and mental conditions along severity scales, the mixed demands of such a population should be well considered. Stakeholders with different responsibilities should unitedly, collaboratively and actively participate in the course and co-design the community with local residents. Thus, the right infrastructures and additional human touches might boost perceived satisfaction and elevate mSI.
Model performance in prediction
By the goal of prediction, the predictivities of MLR and RF models are fair and comparable. We infer the RF model outperforms as it supposedly allocates people into the right mSI subgroup significantly better than solely by chance. It means that the MLR model should be applied with caution as some predictors in the MLR model are indiscriminative to mSI subgroups except for the HL cluster due to the imbalanced mSI outcome. Therefore, cautiously, the RF model outperforms regarding discriminability. The RF model successfully passing the generalizability test can be readily implemented in clinical practices through the electronic health record system (see Supplementary Clinical Illustration) [63]. Such prediction could serve as an indicator on how much external assistance an SSD patient may require in the upcoming future to be prevented from and/or helped get through deteriorating mSI. Furthermore, exemplified by our study (i.e., different approaches with different modelling procedures), when constructing prediction models, one should seek candidate strategies in a wide range of methodologies, try and compare different combinations, and choose the most appropriate approach to achieve an optimized balance among predictivity, discriminability, interpretability and applicability of a model, depending on the research goal and population at stake.
Future perspectives
Future studies should continue working on the conceptualization of mSI and study the generalizability of mSI across diagnoses. Following this study, the clinical differences of the identified mSI clusters and comparability to healthy controls should be investigated to derive more clinical meanings and understanding of the mSI subgroups in the practices. Meanwhile, developing a validated composite score might bring more implications due to a higher level of data granularity and a resultant possibility of monitoring over time. Methodologically, future studies should take extra steps in modeling procedures such as outer cross-validation and different feature-selection algorithms and give opportunities to the latest interpretable machine learning algorithms for exertion to pick up next-level clinical utility. Clinically, to build prognostic models, future studies should test the utilities of the potential factors that measure the same clinical outcomes but with slight differences in submodels and to understand the multidimensional mechanisms hidden under the effect sizes of such as premorbid social functioning. Furthermore, the investigations on the genetic effects on behaviors and mSI along the course of SSD could be necessary for early screening and interventions. Statistically, we underscored the importance of satisfaction towards environment and social life. This can possibly be modified by other observable and non-observable contributive factors, such as personality traits, coping strategies, diversity of community residents, community social-economic status, relationship with caregivers, and so forth. Therefore, future studies might consider extending coverage of the data collection from clinical practices in a balance of feasibility and costs. With growing awareness of social inclusion and more efforts in building increasingly sophisticated prediction models on mSI, and in the implementation of personalized interventions and supporting policies, patients with SSD would be able to harness the necessary skills and receive indispensable resources, which consequently assist them in managing their conditions better and be better included in society, eventually.
Study strengths and limitations
We quantified the multidimensional nature of social inclusion by combining the subscales of the self-reported SFS and WHOQOL-BREF. Considering the situation that the questionnaires previously developed regarding social inclusion and its similar terms for other target populations are not well validated, our conceptualization emphasizes the multidimensionality that provides a more comprehensive overview of an individual’s status of social inclusion through a broad range of activities, the perception of an individual, and the exploitation of the existing large cohort and standardly collected data, which can be of convenience for the use beyond clinics. Furthermore, the longitudinal measurements in the cohort were utilized. We also built and compared standard and data-driven models to examine the robustness and enhance the credibility of the factors and the mSI predictability. However, the models were not externally validated.
On the other hand, despite earlier efforts in developing conceptual frameworks of social inclusion, one cannot ignore that the constitution of social inclusion might be meant to be nebulous, implying a variable boundary of this multifaceted construct. The readily accessible data do not provide many flexibilities or balance the eligibility and validity of the elements used for the construct. No interview was conducted to preliminary determine the most applicable and relevant scope of social inclusion. Regardless of the limitations, our attempt to pick up the measurement development of multidimensional social inclusion is vitally implicative to clinical practices and societies.