The importance of laboratory tests in the medical diagnostic processes is unquestioned; recent surveys show that 60–70% of clinical decisions are affected by laboratory test results[13], and at least 80% of guidelines, which are aimed at establishing a diagnosis or managing a disease, require laboratory testing[14]. The low error rate of laboratory tests makes laboratory test results a valuable base for AI-supported medical diagnostics. Numerous studies have shown that laboratory tests can effectively identify chronic diseases or infections[15–19]. However, there are certain common and rare diseases where traditional diagnostic processes are not always effective, resulting in delayed diagnoses and the eventual loss of early screening opportunities. Inflammatory bowel diseases (IBD) and rare metabolic disorders, for example, similarly share diagnostic challenges despite having different clinical and laboratory characteristics. IBD is notoriously difficult to diagnose based on non-specific blood test results, while rare metabolic diseases have more specific findings but are often overlooked due to their rarity. Nevertheless, most patients undergo routine phlebotomies during their hospital stays, which presents a significant opportunity for early screening and risk assessment based on blood test result patterns.
We previously showed that machine learning is ideal for diagnosing rare dyslipidaemias[20, 21], but analyzing a wider set of rare diseases shows that many rare diseases may be ideal candidates for computer-aided screening efforts, even ones that usually require (multiple) specific diagnostic tests for confirming the diagnosis. Although our initial data is promising due to the lack of a larger specific dataset, further studies are needed with the involvement of multiple sites and a higher number of patients to quantify this possibility properly.
Moreover, we tested the software’s general capability for early screening of supported diseases by evaluating the blood test results of patients who appeared regularly at hospitals for screening or other purposes but had not yet received an official diagnosis. As shown in Table 3 in the Results, we found that AI-aided diagnostics can boost early screening efforts, save time, and may allow earlier therapeutic intervention. The model's performance statistics did not, or only slightly, drop within six months before the medical diagnosis. Our retrospective data analysis determined that with the AI-aided blood test interpretation, most of the analyzed disease groups may be diagnosed at least one-to-six or even one-to-nine months before the current medical diagnosis. A significant drop in the performance of early diagnosis was only visible in more acute disease groups, like liver diseases and some cardiovascular pathologies.
Regarding early diagnosis, the regional and hospital-specific changes and patient behavior (e.g., frequent hospital goers) can impact the time of possible early diagnosis. It is also a question of debate what the earliest point is when a disease can be diagnosed, as most of the time, exact thresholds of parameters are to be met for definitive diagnosis. However, we can achieve an earlier diagnosis with more frequent follow-ups for those patients in a "pre-definitive disease stage". Furthermore, guideline updates may later include these pre-definitive disease stages with early treatment options to improve outcomes. Our data suggest that it is plausible that many common diseases may be diagnosed 30–90 days before the clinical diagnosis using AI-aided evaluation based on routine phlebotomy results (see Table 4). Our results also indicate that diagnosis earlier than 180 days will only be possible in a few cases without using other specific blood markers. It is also important to mention that the lowest amount of early diagnostic data (i.e., people’s participation in screening programs) is available for malignancies, a category where early diagnosis is most critical for effective therapy and where our findings show a possibility for very effective AI-assisted early screening. On the national level, significant savings may come from using AI-aided inexpensive and non-specific blood test evaluation for early diagnosis, thus, involving AI in laboratory result evaluation may facilitate the much-needed paradigm shift in healthcare towards early treatment and prevention.
The list of laboratory tests included in our AI analysis covers screening laboratory tests that are commonly used worldwide: basic and complete metabolic panel (ALP, ALT, AST, bilirubin, BUN, creatinine, sodium, potassium, chloride, albumin, total protein, glucose, and calcium), complete blood count with differential, and screening tests for lipid, iron, hepatic and thyroid metabolisms. These tests are among the most frequently ordered[22], allowing easy translation of our findings to help efficiently screen a wide range of diseases.
The common laboratory tests used as input for our software are part of routine phlebotomy due to their importance in general diagnostics of many acute and chronic diseases. For example, TSH tests are widely recommended for first-line thyroid dysfunction screening[23, 24]. At the same time, AST, ALP, GGT, and bilirubin levels alone may often identify the correct type and etiology of liver disease, allowing for a targeted investigation approach during clinical examination and further diagnostic testing[25]. Although these tests are part of general medical education and common medical knowledge, the clinical interpretations are far from trivial[25, 26]. The proper evaluation of liver tests, besides diagnosing acute liver pathologies, also helps diagnose Wilson’s disease; thus, they can distinguish rare pathologies from common medical conditions[25]. The comparison of certain blood analytics and other factors may also raise issues that are hard to handle and may lead to misinterpretation or misdiagnosis. For example, sTSH test reference ranges and values are highly sensitive to measurement technology, circadian patterns, analytical platforms, and geographic regions[27]. Therefore, following AI-aided analysis, including interpretative comments in laboratory reports, could decrease error rates, thus improving the quality of laboratory information and patient safety[28]. Although the major driver for including interpretative comments is new and complex tests in the laboratory report[29], interpretation of common lab tests is also welcome. In a survey in the UK, 88% of primary care doctors and nurse practitioners found interpretative comments on thyroid, gonadotropin, and glucose tolerance test reports helpful[30]. AI-supported decision-making may also decrease the rate of missed or late diagnoses originating from HCP’s incorrect interpretation of even basic laboratory tests. Self-reported testing practices for anemia show the overuse of screening laboratory tests, the underuse of bidirectional endoscopy to evaluate new-onset iron-deficiency anemia, and the misinterpretation of iron studies also included in our analysis[7].