Analytic platform design
This article describes the utility of accumulative hospital database as a platform for decision-making and research was a result of a technological and conceptual evolution. Table 1 demonstrates the successive steps of the Israeli Big Data Analytic Platform (BDAP) formation over the last decade (a flowchart of three years joint venture).
Step 1- Initiation: Following initiating "Digital Israel" in 2009 (www.gov.il/en/departments/digital_israel) one of the authors (OT) was involved in the foundation of the national Digital Health project inspired by the need to establish a united national medical record for all citizens. During 2018 a call for innovative projects to encourage research based on this accumulative data was published, meeting the following criteria: (A) a public- private joint venture, (B) support by a start-up company, (C) showing a potential of a long lasting beneficial continuity to professionals and/or the public. We accepted a two-year research grant to fulfill this mission.
Step 2- Design Partnership: to create an effective hospital- start up encounter, a steering committee was established including the hospital deputy director, director of internal medicine ward, head of hospital research forum, director of IRB Helsinki committee, the legal consultant and head of Information Technology (IT) unit. Their role was to establish guidelines for action, project milestones and a contract among the partners. This was followed by an acceptance among all stakeholders: hospital's steering committee, the regulatory and innovation departments of the Ministry of Health (MOH) and the start-up company.
Step 3- Data warehouse: all the hospital medical records were screened to ensure completion of documentation, and accessibility. To certify that these records are not affected by research activity, a replica of the original data was used for further anonymization.
Step 4- Anonymization: A strategy prioritizing better anonymization at the expense of flexible retrieval of individual data was chosen. Therefore, a dual anonymization was performed: a).concealment of demographic details by the hospital cyber team. b). camouflage of all other identifying elements by the startup company.
Step 5- Authorizing and initiating research activities:
5.1 Role of the hospital: a sub-committee was established to assess and regulate research proposals. Criteria for approval were: involvement of an internal researcher, research benefit of big data (vs. epidemiologic) analysis, and approval by the local IRB committee. All proposals received access to the data, unless a conflict of interest and/or potential harm the patient or the organization were identified.
5.2 Data outsourcing: Using a newly established start-up company, all anonymized data was transferred and stored in a "cloud" based "virtual research room", accessible only to institutional authorized predefined researchers for further data analysis. An academic data analyst was recruited and further trained in medical data retrieval and processing by the startup company. A "cloud committee" was established to set guidelines for data storage.
An external cyber consultant validated information security and our preparedness to block potential threats (Kim, 2017). A double verification was conducted to reassure a tight barrier, using simulations of cyber-attacks in hospitals elsewhere (McKeon, 2021). Thus, patient privacy and confidentiality were kept and facilitated access to data was secured.
5.3 Sharing data with additional healthcare organizations: Application was made to additional hospitals to share their data and expand pooling towards a national database. However, we witnessed hesitance among managers to do so, possibly due to uncertainty or reduced trust in the innovative process itself, or the startup company.
5.4 An AI- ML based algorithm was created for data accumulative continuous data flow analyzing trends of services utilization. The algorithm was later adjusted for each research, based on research objective and need.
Step 6- Call for researches: An early call for hospital researchers ("internal path") resulted in three approved proposals by in-house physicians, followed by a forth proposal after 6 months. A successive call for partners from other organizations including other hospitals, academic institutions and industry ("external path") yielded seven proposals.
Step 7- Pilot research projects: To determine the function of the newly established data analysis system four independent researchers received access to the cloud based virtual data room with an ongoing support by a professional data analyst. Three of these research projects successfully completed data mining and processing reaching final analysis and reporting, in spite of some difficulties and barriers (Table 2).
Step 8- Applying an Artificial Intelligence (AI)(3) algorithm and Machine Learning (ML)(4) : following mining the data through a secured cloud, an AI algorithm suitable to each research question was used to set a model and to validate its result.
Step 9- Merging with a Big Data (BD)(5) national research project: In parallel to our initiative and in view of the rising clinical need, the MOH established a group of data engineers to support decision making by hospital managers. Our local project was presented to the MOH, for feedback and continuous funding.
Step 10- towards a nationwide venture: four other state hospitals joined the project, to create a research community utilizing the same mechanism and database. An academic committee prioritized research proposals from these hospitals.