We designed a program consisting of two consecutive pathways titled Freshman and Graduate. Targeted to quality professionals from all ages and backgrounds (see Fig. 1), the aim was to empower them to make their own self-service data requests and to perform a self-guided descriptive analysis using commonly available spreadsheet software such as Microsoft (MS) Excel. To accomplish this, the program would enable them to develop an understanding of basic data vocabulary in conjunction with a sense for statistical thinking to effectively use data products - such as dashboards - developed by quality data analysts and scientists. Among the audience on the first pathway we wanted to identify a group of quality data champions that would assist as Subject Matter Experts (SMEs) in the development of such data products. The former pathway would establish a common ground understanding and the latter would help identify and educate data champions. (See Fig. 2)
Freshman pathway
The main challenge for designing the Freshman pathway was the heterogeneity of the target audience. Some of them having even studied statistics (or a similar field) and others not having engaged with statistics since their high-school-level education. We needed to heavily iterate over the basic concepts to educate the latter while still keeping the former engaged and interested in order to onboard them to our Graduate training and identify them as possible future SMEs. We therefore initially opted for a multi-day, face-to-face, on-site training as opposed to a virtual training or e-learning. This would ensure we had enough time to cover the basics and by recruiting our data analysts/scientists along with the training designers as trainers so the more advanced participants could actively engage with them and discuss expert topics during breaks.
During the training design of the Freshman pathway we implemented several elements that made the training more interactive and entertaining for participants of all levels:
- Used relevant examples for each topic discussed on every slide, prioritizing illustrative health care examples over entertaining or abstract examples (such as astonishing scientific studies or coin flips). See Fig. 3.
- Introduced or closed each section covering a certain concept with an interactive poll using PollEverywhere, an online tool for conducting real-time polls within the audience to keep the engagement [12]. See Fig. 4.
- Every half-day session would include two or more publicly available videos of around 3-5 min length covering a data relevant topic with the help of entertaining animations.
- Each half-day session would be closed with an interactive knowledge check; time was given to repeat key concepts if necessary.
- A quarter of our training time was dedicated to a hands-on session in which participants would have to work in groups to design an analysis plan to address a relevant business problem and afterwards in the second part they would have to implement that analysis in MS Excel. The problem statement allowed the audience to design from a simple and basic to a very complex and creative solution. During these dynamic sessions we made sure that enough trainers (1 trainer per 7 participants) were present to ensure all questions could be answered.
Graduate pathway
After undertaking the Freshman pathway, participants could apply to enroll on the Graduate pathway with the prerequisite to demonstrate their learning by submitting examples of how they have used data-driven approaches and solutions to help them in their daily work and how they plan to use analytics in a future project as well as taking a supplementary class on how analytics can help on the decision making. For this pathway’s design, we kept the same principles as in the Freshman, however these classes were focussed on action-oriented problem solving skills with MS Excel (more than 50%), and the in-depth understanding of descriptive and visual analytics. The pathway concluded with the completion of a final work-related certification project.
In order to properly recognize the achievement of each participant individually and to ensure that the training met industry quality standards the program was certified by the CPD Certification Service [13], which issues online certificates that can be included in digital CVs.
Training modules
The topics covered in both pathways were developed between the data analytics/science teams together with several business SMEs and inspired by other educational formats and materials [14, 15] in conjunction with probable business problems that could be addressed using data analytics.
Freshman pathway
The contents of these modules were imparted in 2 full days divided in 4 modules of about 2 hours each.
- Demystifying Analytics: Explain common data terminology, namely - Machine Learning, Artificial Intelligence (AI), Big Data, Data Science with a focus on the health care sector.
- Naked Statistics: Descriptive Analytics on different data types in conjunction with how to calculate and interpret summary metrics.
- Time to Play with Data: Discuss a business problem at hand and address it individually using Excel.
- How Data might fool you: Understanding common data fallacies and critically interpreting data visualisations.
Graduate pathway
This pathway started with a prerequisite module that had to be completed offline, followed by 2 full days divided in 3 modules of about 2.5 hours each and with breaks in-between, ending with a final Certification Project.
- Class 1 - Advanced Strategies for Data Analysis: Addressed business problems using advanced MS Excel functions, namely - Pivot Tables, advanced formulas and Data Joins.
- Class 2 - Data Visualisations and More: Basic Understanding of Random Variables and Data Distributions followed by the principles of data visualisation and how to implement them in MS Excel.
- Class 3 - GxP Problem Solving (with data!!): Students were asked to put their learning into test by individually solving an analytical problem statement. This was part of the requirements needed to progress onto their Certification Project. This project would take place within the last class so students could have a chance to raise questions.
- Certification Project: On this final task students had yet a final opportunity to demonstrate their newly learned skills, this time on their own real work challenge. They were asked to propose an analytical challenge they were currently facing within their area of expertise, (e.g. GCP or GVP) and attempt to solve it by applying the knowledge acquired. This was a “take-home” project where the students would have to solve it on their own without the instructors’ help. After completion, they shared and discussed their work with the other students.
Evaluation
We performed evaluations on both pathways with two goals in mind. The first one was to give attendees a chance to check and demonstrate their knowledge and understanding; the second was to assess the impact DAU had on their learning as well as to identify any section of the program that could benefit from improvement (e.g. spend more time on a topic or to find a more illustrative example).
Freshman pathway
The exam consisted of 14 multiple choice questions. To earn a Certificate in Data Basics a pass mark of at least 80% was required.
As the aim of this pathway was to give a common ground of understanding, the answer to the questions were fairly straightforward. The questions focused on ensuring concepts, definitions and principles were understood. A few questions were also designed for the student to perform some basic analyses that required simple calculations.
Right after the exam’s completion, a learning transfer check was performed by running Focus Groups sessions where students formed small groups by their work areas to have an opportunity to identify and come up with real-world scenarios to put on practice what they learned. This activity encouraged them to also do a peer-to-peer knowledge check, when bringing up ideas in the discussion, as the small number of members per group allowed everyone to actively participate.
Graduate pathway
With the purpose of assessing an individual’s capability to solve a data analysis problem on their own, where there is not always a clear problem statement that indicates the start to finish path, a typical exam with a set of questions would not be enough. The best way to prove the ability to solve a real data analysis problem would be to do precisely that.
To maintain both the motivation and learning impact, we provided an initial analytical problem statement that would be challenging while at the same time achievable.
After successful completion of it, the attendees would then embark on their Certification Project. This time the challenge consisted in the search and successful resolution of an existent data analysis problem in their work field. The project would have to be ambitious enough so they could put in practice the lessons learned in a practical and useful deliverable in MS Excel.
Upon submission to the quality analytics team, the student would have to present their project, walk through the problem found, the reasoning and approach to the solution as well as the impact it had on the business.
Once reviewed and approved, successful students would be awarded their Certificate as Data Analytics University graduates.