Assessing learning processes' effectiveness in developing GWR skills requires sound measurement tools. We used an exploratory sequential mixed methods design to investigate the latent construct’s underlying factor structure. The process involved: 1) item development, 2) stakeholder review, 3) scale administration, 4) factor analysis, and 5) reliability testing, as recommended by (Kyriazos & Stalikas, 2018). For scale development, we define broad domains as first-order constructs and domains as second-order constructs. The items are dimensions or third-order skills expressed as statements. We tested the theoretical model in Mauritius, a small island state aspiring to become a knowledge economy. The first author has in-depth knowledge of and familiarity with this context, ensuring robust data collection, analysis and interpretation. Moreover, model testing in Mauritius allowed us to assess the theoretical model adaptability of the, which was informed by Australian frameworks.
Method
Item extraction.
The second-order construct definition guided the extraction of measurement items from the literature (Furr, 2011) or writing new ones. This step required discussion among the researchers before discarding or revising them (Furr, 2011). The item selection criteria included conceptual relevance, language simplicity, clarity, reading level, grammar, conciseness, objectivity, and singular focus (Worthington & Whittaker, 2006; DeVellis, 2017). The number of items for the second-order construct varied as some had more facets to measure (Furr, 2011). Item identification proceeded with the expectation that it would yield a theoretically stable factor set (Worthington & Whittaker, 2006). The item development identified the third-order skills to capture the breadth and depth of the second-order construct constituting the first-order construct. All items were positively worded. As the most crucial stage in developing sound measures, item development was iterative (Hinkin, 1995).
Scale and item refinement
The scale items were reviewed for content validity. The process involved evaluating items’ relevance in measuring the construct, contextual relevance and need for new item inclusion (Worthington & Whittaker, 2006). Seventeen participants from recruitment and employment agencies, the research and innovation, accounting, information technology and engineering field, administrators, academics and policymakers from the higher education sector reviewed the second-order constructs and corresponding items. Participants chose either a focus group discussion or an individual face-to-face or online interview. There were three focus group discussions of two to five participants and eight individual interviews. The sessions were audio recorded, and de-identified transcripts were analysed using NVivo.
Factor determination
The items were used to develop a questionnaire for an employer survey to provide empirical evidence of scale alignment with the theoretical model. The survey was sent to 6690 participants, from both public and private, via email addresses sourced from official employer databases provided by relevant authorities in Mauritius. Participation was voluntary and non-rewarded. A minimum sample size of 101 was required with a set effect size (f2) of 0.2, power at 0.80, and a significance level of 0.05, using the G*Power statistical program.
Data collection. Over three months, participants from establishments employing graduates could access the survey hosted on the Qualtrics platform after providing their consent. The questionnaire comprised three parts: (a) a GWR scale, (b) an open-ended question about any other relevant GWR skill and (c) demographic items with a five-point Likert scale to measure each item from strongly disagree (1) to strongly agree (5). The scaling technique is more reliable, valid, and easy to use (Spooren et al., 2007). There were no reverse scoring items as it may introduce systematic error to a scale (Jackson et al., 1993). The demographic information encompassed establishment size by number of employees, field of operation, and number of graduates employed. All statistical analyses were conducted using R version 4.3.1.
Analysis. Data collected was used to examine the item's relevancy to the scale and explore the new scale’s factor structure. A sample size of 100–200 is considered reasonable for examining psychometric properties (Tay & Jebb, 2017), with 200 being large and 50 being the absolute minimum (Gorsuch, 1983). Given its small sample size, this study used a Bayesian Exploratory factor analysis (BEFA) to identify latent constructs (van de Schoot & Miocević, 2020; Schmid, 2022) instead of conventional factor extraction methods. The BEFA represents a coherent, straightforward, theoretically based and true alternative for factor analysis by taking away arbitrary choices which take place during the frequent data analysis process (Conti et al., 2014; Dux Speltz et al., 2023). The BEFA used the R package “BayesFM” and guidelines from Conti et al. (2014) and Dux Speltz et al. (2023). The underlying factors and their structure were extracted following recommended practices in MCMC modelling and results reporting (van de Schoot & Miocević, 2020; Schmid, 2022).
Result
Item development
The minimum psychometric requirement to ensure measurement adequacy is content validity, built-in through item development (Hinkin, 1995). This ensures that each first-order construct, for example, cognitive skills, is adequately measured by the corresponding second-order construct ‘critical thinking skills’, which is itself measured by third-order skills reflected by items. The second-order constructs form composite measures to reflect cognitive skills (Hinkin, 1995). Building from a theoretical foundation allowed consideration for item inclusion or exclusion (Worthington & Whittaker, 2006) to avoid poor scale validity and ambiguous meaning (Furr, 2011). The outcome was a set of observable items reflecting expected behaviours lined up with their corresponding second-order and first-order construct. This process generated sixty-seven items. According to deVellis (2017), the item generation phase is considered completed after experts review it. The next stage involved seeking the labour market stakeholders’ views on the proposed constructs and items.
Construct and item refinement
The labour market stakeholders review provided empirical evidence to refine and contextualise the theoretical model (Hinkin, 1995). The review process generated seventy-one items spanned over a 13-factor scale (see Fig. 3). The findings are as follows: (1) the term ‘innovation’ was added to ‘creativity’ although the researcher explained that creativity encompasses the ability to propose innovative ideas; (2) some items were critical but did not fit their allocated constructs, resulting in item reshuffling to four new constructs to underscore their importance: planning and organisational skills, digital proficiency, leadership and communication. Some of the new constructs had fewer items, and as item reduction was expected with factor analysis, four new items were added to ensure at least three to four items in each second-order construct. This phase produced thirteen constructs and seventy-one items (see Fig. 3).
Figure 3
GWR scale -Reviewed theoretical model
Establishing the factor structure
This phase explored the empirical structure alignment with the theoretical one by identifying common factors in employers’ perceptions using BEFA.
Item responses. The response rate (Appendix 2) for each item (n = 101) was computed. Next, the factor analysis established the underlying factor structure that would be meaningful for understanding the required skill characteristics.
Demographics. The minimum sample size requirement was met. After data cleaning, out of 156 participants' consent, 101 employer responses were usable. Incomplete responses were deleted. Response to scale items was not forced, leading to one missing response. Data imputation was performed using the median value (Zhang, 2016). Figure 4 summarises the participant characteristics. Most respondents (78.35%) were private employers, with 78.39% being large establishments, primarily private ones and employing fewer than ten graduates. Participants spread across different industries, including agriculture, manufacturing, transportation, construction, IT, professional services, administration, and human resources.
Factor structure. The 71 scale items were suitable for factor analysis. The overall Kaiser-Meyer Olkin index of sampling adequacy (0.81) indicated that the sample was factorable. Figure 5 illustrates that, in general, most items show a positive correlation; therefore, the variables were appropriate for factor analysis. These interrelationships were examined using BEFA to extract the underlying factors.
Table 1
Summary of Posterior Results of BEFA
| Overall | Cognitive | Metacognition | Interpersonal | Intrapersonal |
Maximum number of factors (Kmax) | 4 | 2 | 2 | 3 | 3 |
Identification restriction (Nid) | 10 | 3 | 3 | 5 | 4 |
MCMC iterations | 50000 | 50000 | 200000 | 50000 | 50000 |
Burn-in period | 5000 | 5000 | 5000 | 5000 | 5000 |
Metropolis-Hastings Acceptance rate | 1 | 1 | 1 | 1 | 0.78 |
Posterior frequency of number of factors | K = 4 (100%) | K = 2 100% | K = 2 100% | K = 3 (100%) | K = 3 (93.14%) |
After examining the dataset, the researchers retained four factors, as the aim was first to identify the first-order constructs. No item loaded on more than one factor. The four factors were named cognitive, metacognition, intrapersonal and interpersonal skills. Table 2–6 shows the item factor loading for each factor and sub-factors. Items with factor loading less than 0.5 were removed from the scale. This led to the elimination of four digital proficiency items: DP1, DP2, DP3 and DP4.
Given the likelihood of subfactors being present, the item groupings were examined to identify them. The extraction generated two factors for cognitive and metacognition and three factors for interpersonal and intrapersonal skills. Some items loaded poorly on the subfactors; however, at this stage, the researchers refrained from discarding any item without prior knowledge of each scale's reliability. The posterior correlation matrix indicated that the factors were moderately associated (Table 7)
Table 2
Posterior means of factor loadings for a four-factor model
Item | Cognitive | Metacognition | Interpersonal | Intrapersonal | | Item | Cognitive | Metacognition | Interpersonal | Intrapersonal |
CT1 | 0.60 [0.43,0.78] | | | | | LD1 | | | 0.62 [0.45, 0.79] | |
CT2 | 0.73 [0.57,0.90] | | | | | LD2 | | | 0.69 [0.53, 0.86] | |
CT3 | 0.71 [0.55, 0.88] | | | | | LD3 | | | 0.72 [0.55, 0.88] | |
CT4 | 0.63 [0.46, 0.81] | | | | | LD4 | | | 0.71 [0.56, 0.88] | |
CT5 | 0.70 [0.53, 0.86] | | | | | LD5 | | | 0.76 [0.61, 0.92] | |
CI1 | 0.81 [0.66, 0.96] | | | | | CM1 | | | 0.66 [0.50, 0.83] | |
CI2 | 0.75 [0.59, 0.91] | | | | | CM2 | | | 0.48 [0.30, 0.67] | |
CI3 | 0.78 [0.63, 0.94] | | | | | CM3 | | | 0.63 [0.46, 0.80] | |
CI4 | 0.83 [0.68, 0.98] | | | | | CM4 | | | 0.68 [0.52, 0.85] | |
CI5 | 0.76 [0.61, 0.93] | | | | | CM5 | | | 0.79 [0.64, 0.95] | |
CI6 | | 0.65 [0.43, 0.86] | | | | CM6 | | | 0.53 [0.35, 0.71] | |
MC1 | | 0.61 [0.44, 0.78] | | | | DP1 | | 0.35 [0.16, 0.54] | | |
MC2 | | 0.63 [0.46, 0.80] | | | | DP2 | | 0.41 [0.23, 0.60] | | |
MC3 | | 0.73 [0.58, 0.89] | | | | DP3 | | 0.37 [0.18, 0.56] | | |
MC4 | | 0.68 [0.52, 0.85] | | | | DP4 | | 0.46 [0.29, 0.65] | | |
MC5 | | 0.68 [0.52, 0.86] | | | | DP5 | | | | 0.57 [0.39, 0.75] |
MC6 | | 0.68 [0.52, 0.86] | | | | MO1 | | | | 0.74 [0.59, 0.91] |
PS1 | | 0.73 [0.58, 0.89] | | | | MO2 | | | | 0.72 [0.56, 0.89] |
PS2 | | 0.74 [0.59, 0.90] | | | | MO3 | | | | 0.84 [0.70, 1.00] |
PS3 | | 0.80 [0.66, 0.96] | | | | MO4 | | | | 0.87 [0.72, 1.02] |
PS4 | | 0.81 [0.66, 0.96] | | | | MO5 | | | | 0.80 [0.65, 0.96] |
PS5 | | 0.78 [0.63, 0.94] | | | | SE1 | | | | 0.63 [0.46, 0.81] |
PO1 | | 0.67 [0.50, 0.84] | | | | SE2 | | | 0.68 [0.51, 0.84] | |
PO2 | | 0.71 [0.55, 0.87] | | | | SE3 | | | 0.72 [0.56, 0.89] | |
PO3 | | 0.76 [0.60, 0.91] | | | | SE4 | | | 0.66 [0.49, 0.83] | |
PO4 | | 0.68 [0.52, 0.85] | | | | SE5 | | | 0.60 [0.43, 0.77] | |
CO1 | | | 0.40 [0.23, 0.61] | | | SE6 | | | 0.68 [0.52, 0.85] | |
CO2 | | | 0.54 [0.36, 0.71] | | | CS1 | | | | 0.62 [0.44, 0.79] |
CO3 | | | 0.66[ 0.49, 0.83] | | | CS2 | | | | 0.75 [0.59, 0.91] |
CO4 | | | 0.62 [0.45, 0.79] | | | CS3 | | | 0.72 [0.56, 0.89] | |
CO5 | | | 0.57 [0.40, 0.74] | | | CS4 | | | 0.58 [0.41, 0.76] | |
CO6 | | | 0.58 [0.41, 0.76] | | | GR1 | | | | 0.66 [0.49, 0.83] |
CO7 | | | 0.54 [0.36, 0.71] | | | GR2 | | | 0.61 [0.44, 0.79] | |
CO8 | | | 0.61 [0.44, 0.79] | | | GR3 | | | | 0.68 [0.52, 0.85] |
| | | | | | GR4 | | | | 0.56 [0.38, 0.74] |
| | | | | | GR5 | | | 0.69 [0.53,0.86] | |
| | | | | | GR6 | | | | 0.73 [0.58, 0.90] |
Table 3
Cognitive: Posterior factor loading and 95% High Probability Density (HPD)
Item | Critical thinking | Innovative thinking |
CT1 | 0.72 [0.55, 0.90] | |
CT2 | 0.84 [0.68, 1.01] | |
CT3 | 0.83 [0.67, 0.99] | |
CT4 | 0.70 [0.52, 0.88] | |
CT5 | | 0.70 [0.54, 0.88] |
CI1 | | 0.83 [0.68, 1.00] |
CI2 | | 0.80 [0.64, 0.97] |
CI3 | | 0.85 [0.69, 1.01] |
CI4 | | 0.86 [0.71, 1.02] |
CI5 | | 0.86 [0.70, 1.01] |
Table 4
Metacognition: 2 Posterior factor loading and 95% HPD
| Sub-factor 2 Loading |
Item | Problem-solving | Planning & Organisation |
CI6 | 0.63 [0.46,0.80] | |
MC1 | 0.60 [0.43,0.78] | |
MC2 | 0.64 [0.47, 0.78] | |
MC3 | 0.74 [0.59, 0.91] | |
MC4 | 0.70 [0.54,0.86] | |
MC5 | 0.69 [0.52, 0.85] | |
MC6 | 0.69 [0.53,0.86] | |
PS1 | 0.75 [0.59, 0.91] | |
PS2 | 0.76 [0.60, 0.92] | |
PS3 | 0.80 [0.65, 0.95] | |
PS4 | 0.83 [0.68, 0.98] | |
PS5 | 0.81 [0.66, 0.97] | |
PO1 | | 0.77 [0.61, 0.93] |
PO2 | | 0.86 [0.71, 1.01] |
PO3 | | 0.82 [0.66, 0.97] |
PO4 | | 0.76 [0.60,0.92] |
Table 5
Interpersonal: Posterior factor loading and 95% HPD
Item | Collaboration | Leadership | Self-efficacy |
CO1 | 0.54 [0.35, 0.72] | | |
CO2 | 0.65 [0.47, 0.82] | | |
CO3 | | 0.64 [0.48, 0.84] | |
CO4 | 0.81 [0.65, 0.97] | | |
CO5 | 0.77 [0.61, 0.94] | | |
CO6 | 0.80 [0.65, 0.97] | | |
CO7 | 0.76 [0.60, 0.93] | | |
CO8 | 0.76 [0.60, 0.93] | | |
LD1 | | 0.73 [0.57, 0.90] | |
LD2 | | 0.82 [0.67, 0.98] | |
LD3 | | 0.77 [0.61, 0.93] | |
LD4 | | 0.82 [0.67, 0.98] | |
LD5 | | 0.84 [0.70, 1.00] | |
CM1 | | | 0.66 [0.49, 0.83] |
CM2 | | | 0.48 [0.30, 0.67] |
CM3 | | | 0.66 [0.50, 0.83] |
CM4 | | | 0.69 [0.53, 0.86] |
CM5 | | | 0.79 [0.64, 0.96] |
CM6 | | | 0.55 [0.37, 0.73] |
SE2 | | | 0.75 [0.59, 0.91] |
SE3 | | | 0.80 [0.64, 0.95] |
SE4 | | | 0.71 [0.55, 0.88] |
SE5 | | | 0.62 [0.45, 0.79] |
SE6 | | | 0.72 [0.55, 0.88] |
CS3 | | | 0.76 [0.60, 0.93] |
CS4 | | | 0.59 [0.42, 0.77] |
GR2 | | | 0.59 [0.43, 0.78] |
GR5 | | | 0.72 [0.55, 0.88] |
Table 6
Intrapersonal: Posterior factor loading and 95% HPD
Item | Grit | Motivation | Conscientiousness |
DP5 | 0.64 [0.47, 0.83] | | |
MO1 | 0.77 [0.60, 0.93] | | |
MO2 | | 0.74 [0.58, 0.91] | |
MO3 | | 0.92 [0.77, 1.06] | |
MO4 | | 0.92 [0.78, 1.07] | |
MO5 | 0.86 [0.71, 1.02] | | |
SE1 | | | 0.86 [0.71, 1.02] |
CS1 | | | 0.66 [0.49, 0.84] |
CS2 | | | 0.82 [0.66, 0.99] |
GR1 | 0.74 [0.57, 0.91] | | |
GR3 | | | 0.75 [0.59, 0.93] |
GR4 | | | 0.61 [0.43, 0.79] |
GR6 | | | 0.74, [0.58, 0.92] |
Table 7
Posterior factor correlation matrix
| Mean | [95% HPD] |
F1↔ F2 | 0.70 | [0.61, 0.83] |
F1↔ F3 | 0.56 | [0.44, 0.75] |
F1↔ F4 | 0.59 | [0.46, 0.73] |
F2↔ F3 | 0.73 | [0.64, 0.83] |
F2↔ F4 | 0.59 | [0.47, 0.76] |
F3↔ F4 | 0.64 | [0.54, 0.79] |
F1.1↔ F1.2 | 0.77 | [0.67, 0.87] |
F2.1↔ F2.2 | 0.75 | [0.65, 0.85] |
F3.1↔ F3.2 | 0.58 | [0.45, 0.74] |
F3.1↔ F3.3 | 0.57 | [0.43, 0.72] |
F3.2↔ F3.3 | 0.73 | [0.62, 0.83] |
F4.1↔ F4.2 | 0.81 | [0.73, 0.91] |
F4.1↔ F4.3 | 0.74 | [0.32, 0.95] |
F4.2↔ F4.3 | 0.73 | [0.33, 0.93] |
The result indicates that literature and empirical data played an essential role in laying a sound theoretical foundation, developing a systematic approach to derive a skill classification system, and establishing a set of interrelated latent constructs and their corresponding measurement instrument in the form of work behaviours (Peersia et al., 2024). For details of items dropped and retained, see Appendix 3. The findings of this study identified four first-order constructs as the GWR scale backbone, cascading into ten second-order constructs (scales) and 67 items (Fig. 6). In line with the theoretical construct from literature, GWR skills can be grouped under three broad constructs, namely cognitive, interpersonal and intrapersonal, and the study identified a fourth one, ‘metacognition skills, ' promoted from cognitive skills to a first-order construct. The ten second-order constructs from literature, critical thinking, problem-solving, motivation, self-efficacy, conscientiousness, grit and collaboration, were retained. The second-order construct ‘creativity and innovation’ (renamed ‘innovative thinking’), ‘planning’ and ‘leadership skills’ generated from the stakeholders’ review were also kept. Self-efficacy was grouped under interpersonal skills instead of intrapersonal skills. The reason might be that social self-efficacy is of interest in the workplace context, that an individual trusts their ability to make personal decisions when participating in a collaborative task at the workplace (Smith & Betz, 2000).
Figure 6
GWR scale hypothetical structure
Internal reliability. This study performed a Bayesian reliability analysis using Package Bayesrel version 0.7.7. The analysis assessed the overall scale’s internal consistency, reported as the posterior mean, with Bayes Cronbach’s α and Bayes MacDonald’s ω.
The reliability test results for the overall scale are shown in Table 8. Cronbach’s α and McDonald’s ω show internal consistencies of data as a scale. The findings indicated that Bayes Cronbach’s α coefficients for the four extracted factors range from 0.93 to 0.95, indicating a high reliability. Similarly, Bayes ωt indicated good internal consistency with figures ranging from 0.95 to 0.96, and Bayes ωh total was slightly lower, ranging from 0.85 to 0.89. The prior and posterior probability of those reliability statistics is given in Table 9.
Table 8
Bayesian reliability statistics
| | | | | | 95% Confidence interval |
Factor | | Bayes Reliability statistics | | Posterior mean | | lower | | upper |
Cognitive | | alpha | | 0.93 | | 0.91 | | 0.95 |
| | omega_t | | 0.95 | | 0.93 | | 0.93 |
| | omega_h | | 0.89 | | 0.85 | | 0.93 |
Metacognition | alpha | | 0.95 | | 0.94 | | 0.96 |
| | omega_t | | 0.96 | | 0.95 | | 0.96 |
| | omega_h | | 0.90 | | 0.86 | | 0.84 |
Interpersonal | | alpha | | 0.95 | | 0.94 | | 0.97 |
| | omega_t | | 0.96 | | 0.95 | | 0.97 |
| | omega_h | | 0.85 | | 0.78 | | 0.91 |
Intrapersonal | | alpha | | 0.93 | | 0.92 | | 0.95 |
| | omega_t | | 0.95 | | 0.93 | | 0.96 |
| | omega_h | | 0.88 | | 0.84 | | 0.92 |
Table 9
The probability of Bayesian multidimensional reliability statistic
| | | | Probability |
Factor | | Reliability tests | | Prior | | Posterior |
Cognitive | | omega_t > 0.8 | | 0.28 | | 1.00 |
| | omega_h > 0.6 | | 0.28 | | 1.00 |
Metacognition | omega_t > 0.8 | | 0.17 | | 1.00 |
| | omega_h > 0.6 | | 0.15 | | 1.00 |
Interpersonal | | omega_t > 0.8 | | 0.19 | | 1.00 |
| | omega_h > 0.6 | | 0.17 | | 1.00 |
Intrapersonal | | omega_t > 0.8 | | 0.24 | | 1.00 |
| | omega_h > 0.6 | | 0.21 | | 1.00 |