In the field of medicine, the goal of clinical trials is always to learn more about the human body in an effective and positive way. The ultimate aim of a medical study is always to improve patient care and therapeutic standards through the application of clinical trials. In order to observe the influence of the new interventions and preventive measures in terms of safety and efficiency, a certain number of participants are recruited according to certain criteria. Such recruitment is always a challenging and time-consuming stage in the medical trial pipeline, resulting in a large number of canceled or incomplete trials (1–3).
In each clinical trial, eligibility criteria (eligibility criteria) are defined at the beginning of study development describing the relevant characteristics shared by the participants. Patient cohort identification and recruitment usually is carried out by research staff or primary care personnel through querying the clinical systems manually for patients matching the eligibility criteria. The latter process is a time-consuming and cost-intensive part of a clinical research study pipeline. Therefore, utilizing digital tools can significantly improve the recruitment of subjects and reduce costs and required labor resources (4).
Clinical Trial Recruitment Support Systems (CTRSS) or patient recruitment systems (PRS) can booster patient inclusion of clinical trials by automatically analyzing eligibility criteria based on electronic health records (4–6). Although these systems nowadays are integrated increasingly in many information and communication technology information and communication technology medical research projects, most of the study centers do not tend to use such digital tools for patient recruitment (7). The main reason is the requirement to domain technical staff as well as equipment to perform implementation or supportive tasks for the CTRSS. To address this issue, an individual system can be launched for multiple platforms or to integrate the CTRSS with the existing research systems.
In addition to the existing CTRSS or commercial solutions developed as part of different projects, several open-source tools like the Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) developed by Observational Health Data Sciences and Informatics (OHDSI) or Informatics for Integrating Biology and the Beside (i2b2) established by National Institutes of Health, are free to use for patient recruitment. Both systems are based on the idea of using standard terminologies within data repositories however, they are not bound to a specific nomenclature. In addition, data can be extracted from different sources and in various models, so they may be utilized in different medical contexts and at multiple study centers with different prerequisites (8, 9). Therefore an architecture for a CTRSS utilizing OMOP as its data basis can be used for a patient recruitment system for example.(10, 11)
1.1 Observational Health Data Sciences and Informatics (OHDSI)
The OMOP CDM is part of a whole tool suite created and maintained by OHDSI, as a collaboration among 150 organizations around the world that collect and process healthcare data. OHDSI offers a wide range of open-source tools to support various data-analytics use cases on observational patient-level data, all are interacting with databases using the OMOP CDM. (8, 12)
Inspired by OHDSI, the ATLAS software was designed and developed as a web-based open source application. The primary goal of ATLAS is to be used for the observational analysis and generating real-world evidence through patient data extracted from clinical practice. ATLAS operates as a user interface on the OMOP CDM. According to (10) ATLAS allows defining cohorts, selecting analysis configurations, and tagging diseases with appropriate codes. Also, it allows sharing the project easily with other researchers (8).
ATLAS can also be used for subject recruitment by formalizing the eligibility criteria of a trial for the corresponding cohort. The definition of trials in ATLAS works with two main designs: cohorts and concepts. Concepts are individual codes that belong to a terminology system integrated in the OMOP CDM. They can be grouped into concept sets. A concept set contains several concepts from the standardized vocabulary in combination with logical indicators. It allows the user to specify whether related concepts should be included or excluded from the vocabulary hierarchy. Cohorts are the collection of individuals searched for by a particular query based on specific criteria, so all of a study's eligibility criteria are part of the cohort definition (8).
1.2 Terminology systems
The database of ATLAS enforces the feeding data to follow standardized classification systems like Logical Observation Identifiers Names and Codes (LOINC) or Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT). These nomenclatures work as an interface for the study centers and built the basis for generating patient cohorts. (13, 14)
Several analyses have been conducted to investigate the use of digital systems based on international terminologies.(15, 16) For example, (17) investigated the use of a Clinical Data Interchange Standard Consortium (CDISC) based model in conjunction with a patient-centered model to structure eligibility criteria of clinical studies. International classification systems, such as International Statistical Classification of Diseases and Related Health Problems (ICD), formed the basis for this project. The authors used 200 randomly selected inclusion criteria from other studies, and the majority of the criteria could be represented in the system. (17)
Automatic tools like criteria2query use natural language processing (NLP) to automate formalization of eligibility criteria into the ATLAS cohorts. The latter system works just on specific terminology systems and local specialties are not considered. As the definition of an ATLAS cohort depends highly on the engaged terminology systems, such tools cannot be applied in the presence of other supported coding systems. (18–21)
Although there are more studies available addressing the latter issue, there still is a gap left between the theory and practically introducing the eligibility criteria with different terminology systems into the software (15, 22). Therefore, further analysis should be conducted regarding to this matter in order to explore the hidden potentials behind the different software and coding systems. This study considers the possibility to apply inclusion and exclusion criteria of different clinical trials in ATLAS as a proof of concept.