Study context
This study was situated within an interpretivist paradigm and sought to understand how, when, and why the complex phenomenon of programmatic assessment worked by developing a program evaluation framework (a ToC) using data from mixed sources (Rees et al., 2023). The research was conducted across four settings, and ethics approval was obtained at all institutions (Monash University Research Ethics Committee approval no. 28847, Edith Cowan University Research Ethics Committee approval no. 02561, University of Canberra Research Ethics Committee approval no. 9369, and University of Wollongong Human Research Ethics Committee approval no. 2021/333). All team members held academic roles and had experience in HPE. Four members (JJ, CP, RB, SG) had led the design and introduction of programmatic assessment in their respective programs. These members, along with JL, had hands-on experience with both programmatic and traditional assessment approaches at their institutions.
Terminology for contribution analysis
Terminology is used interchangeably and inconsistently within the contribution analysis literature, so we first present the terms and definitions used in this paper (Table 2) (Mayne, 2019). Notably, outputs and outcomes are distinct concepts: outputs refer to the tangible goods and services derived from program activities, while outcomes denote changes in behaviours and actions (Hall et al., 2021). Within the ToC, outcomes are stratified into capacity change, behavioural change, direct benefit, and well-being change (Mayne, 2017).
Table 2. Terms and definitions used in contribution analysis, including alternative terms evident in published literature (Lemire et al., 2012; Mayne, 2015, 2017, 2019).
| Term (alternative term(s)) | Explanation |
|---|---|
| Impact pathway (results chain, causal chain, logic model) | A pathway depicting the sequence of steps or events from activities to outcomes. |
| Assumption | Salient connections, events, and/or conditions necessary for a link within an impact pathway to function as expected and fulfil program outcomes. |
| Theory of change | Structured assembly of impact pathways and assumptions presenting how program activities lead to outcomes. Components are activities, outputs, reach and reaction, capacity change, behavioural change, direct benefit, and well-being change. |
| Activities | Observable actions undertaken as part of the program. |
| Outputs | Tangible goods and services that directly result from the program activities being undertaken. |
| Reach and reaction | Identification of the target group (reach) who are intended to receive program outputs, and their reaction to the program (reaction). |
| Capacity change | Changes in the knowledge, attitudes (beliefs, opinions, feelings, perspectives), skills (mental and physical ability to use new or alternative practices), aspirations (ambitions, hopes, objectives, or desires), and opportunities of the target group who receive or use the program outputs; established using the COM-B model, in which capabilities (C) and opportunities (O) influence motivation (M), all three being necessary for behaviour (B) change. |
| Behavioural change | Changes in practice that occur in the target group due to capacity change. |
| Direct benefit | Improvements in the target group derived from behavioural change. |
| Well-being change | Long-term accrued improvement in the well-being of beneficiaries, who may or may not be the program target group. |
| External influences | Events and conditions unrelated to the program that contribute to the realisation of intended outcomes. |
| Nested theory of change | Additional theories of change that capture a particular component of the complex program. |
| Contribution claim | Statement(s) presenting evidence and describing the mechanism, or lack thereof, for the contribution the program (or a component) makes to observed outcomes. |
| Contribution story | Central narrative explaining how a program, and its components, contribute to the observed outcomes. |
| Relevant Explanation Finder | Structured framework facilitating critical review of collected data (in step 3 of contribution analysis) against the theory of change. |
The following sections describe the six steps of contribution analysis and their application in the present study to evaluate programmatic assessment, including the qualitative multi-centre study undertaken in step 3.
Step 1: set out the cause-effect issue to be addressed
The first step, describing the problem which the evaluand seeks to address and developing cause-effect questions, serves to focus the evaluation (Mayne, 2012) and is usually undertaken by a team with program experience (Biggs et al., 2014; Buregeya et al., 2020; Choi et al., 2023; Riley et al., 2018). Following this approach, we (JJ, CP, MH, SG) used our experience of programmatic assessment to develop cause-effect questions. Three of us had led the design and implementation of programmatic assessment in two dietetic programs (JJ at Edith Cowan University (Jamieson et al., 2017), and CP and SG at Monash University (Palermo et al., 2017)), with the fourth researcher (MH) having extensive experience in teaching programmatic assessment. We had each evaluated programmatic assessment within and across the two programs, providing a comprehensive understanding (Dart et al., 2021; Jamieson et al., 2022; Jamieson et al., 2021). One researcher (JJ) developed cause-effect questions which were reviewed and agreed upon by the other researchers (CP, MH, SG). The cause-effect questions were: (i) what factors have influenced the implementation of programmatic assessment? (ii) what contribution, if any, did programmatic assessment make to intended outcomes? (iii) what conditions were necessary to achieve this contribution to the outcomes? These questions guided the ToC development (in step 2) and the gathering of data (in step 3) by framing focus group questions with key stakeholders.
Step 2: develop the postulated theory of change and risks to it, including rival explanations
An initial ToC is then iteratively developed, often using existing evaluative data (Biggs et al., 2014), stakeholder consultation (Biggs et al., 2014; Delahais & Toulemonde, 2017; Downes et al., 2019; Koleros & Mayne, 2019), and expert discussion (Delahais & Toulemonde, 2012; Delahais & Toulemonde, 2017; Koleros & Mayne, 2019), providing a sound comprehension of program activities, outputs, and intended outcomes (Mayne, 2012). Top-level outcomes (well-being change) are usually identified first and then, working progressively backwards, proposed impact pathways are constructed (Riley et al., 2018). Importantly, a robust and credible ToC is critical as it is the framework against which collected evidence is later evaluated (Mayne, 2012, 2019).
We applied several steps to develop the initial ToC for programmatic assessment. First, we selected the Edith Cowan University dietetic course as a case study, as we possessed extensive experience designing, implementing, and evaluating programmatic assessment in this context and it had been operational for several years. The lead researcher (JJ) first familiarised themselves with the contribution analysis literature and completed training in program theory. The same researcher then conducted a focus group with faculty staff (n = 2) who had co-designed the programmatic assessment at Edith Cowan University and had been key stakeholders in its subsequent implementation over the prior two years. These practical experiences provided valuable insight when building the ToC. Both focus group participants were provided the questions in advance (Online Resource 1), including an explanation of the ToC and key terms, to optimise discussion. As the purpose of this focus group (in step 2) was to develop the initial ToC, the questions were derived from the cause-effect questions established in step 1. During the focus group, the researcher wrote the goals, outcomes, activities, assumptions, and influencing factors identified by participants onto post-it notes. Then, as a group, we iteratively discussed the ordering of links, starting with outcomes and working backwards to determine how outcomes were achieved (this initial mapping is provided in Online Resource 2, item (a)).
After the focus group, one researcher (JJ) identified and unpacked the root causes and consequences of the problem which programmatic assessment sought to address using problem analysis. Consulting published research on competency-based assessment, this involved articulating the core problem and identifying, and untangling, contributing factors and consequences. This produced a Problem Tree which was verified by co-researchers (CP, MH, SG) and, along with the focus group mapping, became the ToC starting point. Next, we (JJ, CP, MH, SG) iteratively developed impact pathways through repeated reading of the Problem Tree, the focus group mapping, the literature on programmatic assessment, and personal experience, and developed and refined the initial ToC using the COM-B model proposed and detailed by Mayne (2019) (Online Resource 2, items (b) and (c)). COM-B is a social science model positing that capabilities (C) and opportunities (O) influence motivation (M), with all three inter-related conditions needed for behaviour (B) change (Mayne, 2017). COM-B was introduced to contribution analysis by John Mayne in 2016 as a structured model to explore and explain drivers of behaviour change, enabling a robust ToC on which to base inferences (Mayne, 2018, 2019). In the final stage, ToC robustness was evaluated by one researcher (JJ) using the ToC Analysis Criteria given by Mayne (2017), resulting in minor revisions which were reviewed and agreed by all other researchers (CP, MH, SG).
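The COM-B relationships described above can be summarised schematically. The following is a minimal sketch of our reading of the model, in which capabilities and opportunities shape motivation and behaviour change requires all three elements; the functional notation is illustrative and ours, not Mayne's:

```latex
% Schematic reading of COM-B (after Mayne, 2017); notation is illustrative.
% Capabilities (C) and opportunities (O) influence motivation (M),
% and behaviour change (B) requires all three inter-related conditions.
\[
  M = g(C, O), \qquad B = f(C, O, M)
\]
```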
Step 3: gather existing evidence on the Theory of Change
Step 3 involves gathering evidence to determine ToC validity (Riley et al., 2018). Sufficient and rigorous evidence is required to determine whether the postulated ToC impact pathways and outcomes occurred as posited (if at all), to validate or challenge assumptions, and to identify influencing factors and insufficiencies; all of which underpin the contribution claims (in step 4) and the contribution story (in step 6) (Mayne, 2012). Evidence can be obtained from existing published and unpublished evaluations (Biggs et al., 2014; Choi et al., 2023), from purpose-designed mixed methods research (Buregeya et al., 2020; Hersey & Adams, 2017; Junge et al., 2020), or from a blend of multiple methods (Delahais & Toulemonde, 2012; Delahais & Toulemonde, 2017; Downes et al., 2019). Due to the paucity of evaluative data on programmatic assessment at the time of this research, we chose to conduct a multi-centre qualitative study in this step. The following paragraphs present the methods for this multi-centre qualitative study, which occurred in step 3 of contribution analysis.
Participant recruitment
In June 2021, two researchers (JJ and CP) delivered a videoconference workshop (Zoom™, Zoom Video Communications Inc) on programmatic assessment, to which representatives from all 16 accredited Australian dietetic programs were invited; attendees represented 12 of the 16 programs. During the workshop, the researchers first presented programmatic assessment using the clustering of the 12 principles into three themes proposed by Bok et al. (2021). Attendees then completed an activity which determined the extent, if any, to which programmatic assessment had been implemented in their respective programs, serving the dual purpose of identifying institutions that were implementing programmatic assessment. At the end of the workshop, participants were notified of the research project and asked to email completed activity responses to the researchers (JJ and CP) if they had an interest in joining as co-researchers. Representatives from the four universities not in attendance at the workshop were also notified of the research by email, provided the opportunity to appraise their programs and express an interest in collaboration, and sent one reminder. Of the 16 accredited programs, eight returned the form, which was reviewed by four researchers (JJ, CP, MH, SG); of these, five were deemed to have implemented programmatic assessment in accordance with the published principles (Heeneman et al., 2021). One researcher (JJ) then held a videoconference meeting with each program's representative to verify adherence to the published principles of programmatic assessment through discussion. After the meeting, one university declined to participate in the research as it exceeded their work capacity at that time. The four remaining programs, at Edith Cowan University, Monash University, University of Canberra, and University of Wollongong, were included in the study and representatives joined the research team (RB at University of Canberra; JL at University of Wollongong; with JJ at Edith Cowan University; and CP, MH, and SG at Monash University). The collaboration with co-researchers was critical to connecting with key stakeholders and providing broad context to explore the research questions.
From these four participating universities, three key stakeholder groups were recruited for the qualitative study: faculty-employed academics who had responsibility for teaching and assessment, graduates who had met requirements for program completion, and workplace supervisors who were employed at a placement provider and oversaw learner tasks during placement. Inclusion criteria required affiliation with one of the four programs in the prior 12 months. Twenty-one focus groups with faculty (n = 19), graduates (n = 15), and supervisors (n = 32) were held across the four universities. Participant characteristics are presented in Table 3. Participants who were employed (n = 57) worked in community or health promotion (n = 23), hospitals (n = 28), education/teaching (n = 22), aged care (n = 5), food service (n = 5), research and development (n = 5), private practice (n = 4), and/or disability (n = 1); participants could report more than one area of practice. Participants reported being employed on a full-time (n = 32), part-time (n = 21), casual (n = 2), or other (n = 2; parental leave, contract) basis.
Table 3. Demographics of focus group participants (from step 3 of contribution analysis), presented by stakeholder group.

| | Faculty (n = 19) | Graduates (n = 15) | Supervisors (n = 32) |
|---|---|---|---|
| University | | | |
| Edith Cowan University | 4 | 6 | 1 |
| Monash University | 5 | 7 | 11 |
| University of Canberra | 5 | 1 | 9 |
| University of Wollongong | 5 | 1 | 11 |
| Age (years), mean ± SD (range) | 47 ± 7 (35–58) | 30 ± 10 (22–50) | 38 ± 8 (28–57) |
| Gender | | | |
| Female | 19 | 12 | 32 |
| Male | – | 2 | – |
| Non-binary | – | 1 | – |
| Employment location | | | |
| Metropolitan | 18 | 5 | 25 |
| Regional | 1 | 1 | 4 |
| Rural or remote | – | – | 3 |
| Not employed | – | 9 | – |
Research setting
All four universities offered a postgraduate dietetic program that mandated 100 days of workplace-based placement where learners undertook authentic activities under the supervision of workplace supervisors. Programmatic assessment had been introduced to the dietetic programs in 2016 at Edith Cowan University, 2018 at Monash University, 2016 at University of Canberra, and 2018 at University of Wollongong. Three of the dietetic programs were postgraduate only, with cohorts of varying size (20 students at Edith Cowan University, 25–40 at University of Canberra, and up to 61 at Monash University). The University of Wollongong had both an undergraduate (30 students) and a postgraduate (20 students) program. Each program adhered to the twelve principles of programmatic assessment (Online Resource 3), as each uniquely utilised an andragogically justified (principle 5) mix (principle 4) of feedback-rich (principle 2) assessment moments, conceptualised as low-stakes data-points (principles 1 and 6). Learners participated in learning meetings (principle 11) where performance was reviewed and discussed based on low-stakes data-points, functioning as intermediate check-in moments and providing an opportunity for individualised remedial action (principle 10). Low-stakes data-points were collated and reviewed by at least two independent assessors who used consensus building to make high-stakes decisions which determined program progression and graduation (principles 3, 7, and 9). All four programs applied the Dietitians Australia National Competency Standards (Palermo et al., 2016) as the framework for designing the programmatic assessment and making high-stakes progression decisions (principle 8), and adopted a learner-centred education paradigm where learner agency was promoted (principle 12).
Data collection
Each co-researcher (JJ, SG, RB, JL) sent the expression of interest email to stakeholders affiliated with their respective programs. The email provided study information and a Qualtrics™ (Provo, UT) survey link for interested individuals to indicate availability for a focus group and provide demographic data (age, gender, geographical location, area(s) of work practice, current workload). Graduates were also contacted using personal messages on LinkedIn to maximise participation. One researcher (JJ) reviewed all Qualtrics survey responses and organised the focus groups.
Separate videoconference semi-structured focus groups were held for each of the four programs and each stakeholder type between October 2021 and February 2023, with interruptions incurred due to the COVID-19 pandemic. Separate stakeholder focus group sessions (i.e., graduates, faculty, or supervisors) for each university were conducted to enable a homogeneous discussion about each programmatic assessment. One researcher (JJ or SG), not affiliated with the program, facilitated each focus group, with the co-researcher affiliated with the respective program and known to participants (JJ, SG, JL, or RB) also in attendance. While we recognise this propinquity may have given rise to pre-determined aspirations and judgements (Berger, 2015), and that the pre-existing relationships may have influenced participants' sharing, we determined that the insider knowledge was important to contextualise discussions and ensure subtleties were not missed. In mitigation, the primary focus group facilitator was external to the program under study, giving an outsider positioning. Perspectives and assumptions were handled through multiple researchers being involved in data analysis and interpretation, whereby researchers checked the findings against elements of the raw data.
Nine focus group questions were developed by researchers (JJ, CP, SG, MH) with consideration of the cause-effect questions (developed in step 1) and the postulated ToC (developed in step 2). After the first seven focus groups (two with faculty, two with graduates, three with supervisors), an additional five questions were added to capture discussions not explicitly explored in the initial questions but deemed relevant to the contribution analysis evaluation. These additional questions further investigated participants' views and experiences by exploring differing opinions regarding high-stakes progression decisions, relationships, negotiating learner underperformance, and employability (Online Resource 4). Focus groups were between 24 and 81 minutes in duration and were audio-recorded. Otter.ai (AISense) was used to transcribe the focus groups, with one researcher (JJ) reviewing and editing all transcriptions for accuracy.
Data analysis
Using the Framework Analysis Method (Gale et al., 2013), two researchers (JJ and SG) abductively coded three focus groups, one from each stakeholder group, in an iterative approach referring to the ToC and research questions. The researchers then discussed the coding and agreed upon an analytical framework by grouping the codes into categories which reflected the components of the ToC. The initial framework had eight codes and 46 sub-codes, each with a definition and illustrative quotation. The framework and all transcribed focus groups were then entered into NVivo™ Version 12 (QSR International) for subsequent analysis by one researcher (JJ). During analysis, the researcher amended the framework by adding four sub-codes relating to the use of professional competency standards, client outcomes, preparing learners for practice, and improved competency-based assessment practices. All changes were reviewed and agreed to by a second researcher (SG). The final framework is given in Online Resource 5. During data analysis, themes were mapped to the ToC, noting modifications to the initial ToC indicated by the data, reporting by stakeholder group (graduates, supervisors, faculty), and data frequency, as needed in step 4 of contribution analysis to verify the ToC and develop the contribution claims. All researchers involved in the focus groups (JJ, SG, RB, JL) reviewed the results and discussed them over two meetings. These conversations confirmed agreement with the analysis and noted the consistency across stakeholders and universities. The researchers also highlighted the key findings which were carried forward into step 4.
Step 4: assemble and assess the contribution claim, and challenges to it
Evidence gathered in step 3 is then analysed to identify and scrutinise links and influencing factors. Influencing factors are contextual conditions that determine an outcome by enabling or hindering the link (Lemire et al., 2012). The Relevant Explanation Finder, introduced to contribution analysis by Lemire et al. (2012) and adapted by others (Biggs et al., 2014; Buregeya et al., 2020), has been applied to structure data analysis and support the construction of contribution claims (Delahais & Toulemonde, 2012). A contribution claim asserts the presence (or lack thereof) of change, the contributing links(s), and influencing factor(s) (Delahais & Toulemonde, 2012). Iteratively, contribution claims are mapped to links within the ToC and the ToC is modified (Biggs et al., 2014; Mayne, 2012). A preliminary contribution story is then developed including a revised ToC and supporting narrative (Delahais & Toulemonde, 2012; Mayne, 2012) which may be reviewed by stakeholders to facilitate identification of new evidence which must be obtained in subsequent steps to strengthen the evaluation (Delahais & Toulemonde, 2012).
One researcher (JJ) used Microsoft Excel to create a Relevant Explanation Finder with an incorporated Evidence Analysis Database (Delahais & Toulemonde, 2012). The structure and application of the Relevant Explanation Finder have been well described elsewhere (Biggs et al., 2014; Delahais & Toulemonde, 2012; Lemire et al., 2012). The same researcher systematically assembled and critically assessed the synthesised data according to each column heading, which was then reviewed by a second researcher (SG). Iteratively, the researcher (JJ) used the data compiled in the Relevant Explanation Finder, with frequent reference to the step 3 data synthesis, to develop contribution claims and revise the ToC. Adapting the approach described by Delahais and Toulemonde (2012), each contribution claim included a mechanism label and description, with any further actions, including ToC revision or the need for additional evidence, noted for future contribution analysis steps. All researchers then met, discussed the contribution claims and revised ToC, and reached agreement. The result was nine contribution claims with nine assumptions, two external influences, and three threats to programmatic assessment (presented in Online Resource 6).
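The exact column headings of our Relevant Explanation Finder are described in the sources cited above rather than reproduced here. Purely as an illustration of the row structure implied by this step (a ToC link, a mechanism label and description, supporting evidence, influencing factors, and any further action), a minimal sketch follows; all field names and example content are hypothetical assumptions, not the instrument's actual headings or our data.

```python
from dataclasses import dataclass, field

# Minimal, hypothetical sketch of one Relevant Explanation Finder row.
# Field names are illustrative assumptions only; the actual column headings
# are given in Lemire et al. (2012) and the adaptations cited in the text.
@dataclass
class REFRow:
    toc_link: str                  # link in the theory of change under review
    mechanism_label: str           # short label for the claimed mechanism
    mechanism_description: str     # narrative description of the mechanism
    evidence: list = field(default_factory=list)             # synthesised step 3 data
    influencing_factors: list = field(default_factory=list)  # enabling/hindering conditions
    further_action: str = ""       # e.g. "revise ToC" or "seek additional evidence"

# Hypothetical usage:
row = REFRow(
    toc_link="outputs -> capacity change",
    mechanism_label="feedback-rich low-stakes data-points",
    mechanism_description="Frequent low-stakes feedback builds learner capacity.",
    evidence=["faculty focus group", "graduate focus group"],
    influencing_factors=["assessor training", "placement workload"],
    further_action="none",
)
```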
Step 5: seek out additional evidence
Additional data is gathered in step 5 to enhance links within the ToC and contribution story credibility (Mayne, 2012). A range of approaches to this penultimate step are reported including merging with earlier steps (Downes et al., 2019; Hersey & Adams, 2017), expert review (Choi et al., 2023; Delahais & Toulemonde, 2012; Delahais & Toulemonde, 2017), further targeted data collection (Koleros & Mayne, 2019), and accessing secondary data sources (Koleros & Mayne, 2019; Riley et al., 2018).
As all components of the ToC and contribution claims were substantiated by evidence collected in step 3, and capacity for further data collection was limited by the study timeline, we decided to conduct a review by stakeholders (from step 3) and obtain secondary data via published evaluations of programmatic assessment, which had increased in number since the research commenced. For the stakeholder review, one researcher (JJ) created a video (17 minutes in length) presenting the ToC and contribution story. The video content was reviewed by two other researchers (MH and SG), minor edits were made, and the video was then recorded. The video was then emailed to all participants from the study in step 3, with an invitation to view it and provide feedback using a Qualtrics™ survey. The survey asked respondents to confirm that they had viewed the video, identify their stakeholder group (graduate, supervisor, faculty), identify three main learnings from the video, comment on whether the findings reflected their experience of (programmatic) assessment, note what needed further clarification, and indicate what areas they would like to know more about. Participants were provided two weeks to respond, after which one researcher (JJ) compiled and reviewed the survey data. Feedback indicated that the ToC and contribution story accurately reflected experiences with programmatic assessment and were sufficiently clear for participants to understand. As such, no further modifications were made to the ToC in response to the participant feedback.
A literature review was then undertaken to strengthen the ToC and contribute to the robustness and transferability of the findings. One researcher (JJ) searched the electronic databases MEDLINE (Ovid), Embase (Ovid), Web of Science, Scopus, and Cumulative Index to Nursing and Allied Health Literature Plus (EBSCO) on 23 February 2023 for the term "programmatic assessment" in the title. Inclusion criteria were empirical evaluation evidence on programmatic assessment within any discipline, written in English, and published after 8 December 2019. This date was selected as Schut et al. (2021) had published a literature review on programmatic assessment providing a synthesis of research to that date. The search yielded 407 publications which were imported into Covidence (Veritas Health Innovation). Covidence was used to identify and remove 279 duplicates, leaving 128 publications for title and abstract screening, at which 41 publications were excluded as they were not about programmatic assessment or did not provide empirical evidence, leaving 87 publications for full text review. Eleven articles met the inclusion criteria, with one (Jamieson et al., 2021) excluded because we (JJ, CP, MH, SG) authored the paper and its findings had been incorporated into the ToC in step 2. The remaining ten articles (Baartman et al., 2023; Baartman et al., 2022; Dart et al., 2021; de Jong et al., 2022; Jamieson et al., 2022; Roberts et al., 2022; Ross et al., 2023; Schut et al., 2020; Schut et al., 2021; Torre et al., 2022) were included. The same researcher (JJ) extracted data (publication year, title, aim, setting, methods, participants, results) into an Excel worksheet (summarised in Online Resource 7), which was then mapped, using colour coding to indicate the source, to the contribution claims. Through iterative reading of the extracted data and the ToC, the contribution claims were revised. Revisions were reviewed by another researcher (CP) and then, along with the ToC, the contribution claims were updated and finalised. After revisions, we had seven contribution claims and eleven assumptions, with no change to the three threats and two external influences. One initially proposed mechanism had insufficient evidence and was discarded (Online Resource 6, contribution claim 9).
Step 6: revise and strengthen the contribution story
Revising, strengthening, and presenting the contribution story is the final step (Mayne, 2012) and often involves critical review by a steering group or expert panel (Choi et al., 2023; Delahais & Toulemonde, 2012; Delahais & Toulemonde, 2017; Downes et al., 2019). Based on the collection, analysis, and integration of additional data in step 5, one researcher (JJ) revised and finalised the contribution story, which was reviewed by all researchers with no further amendments indicated.