Background: Machine learning (ML) based risk stratification models of Electronic Health records (EHR) data may help to optimize treatment of COVID-19 patients, but are often limited by their lack of clinical interpretability and cost of laboratory tests. We develop a ML based tool for predicting adverse outcomes based on EHR data to optimize clinical utility under a given cost structure. This cohort study was performed using deidentified EHR data from COVID-19 patients from ProMedica Healthcare in northwest Ohio and southeastern Michigan.
Methods: We tested performance of various ML approaches for predicting either increasing ventilatory support or mortality and the set of model features under a budget constraint was optimized via exhaustive search across all combinations of features.
Results: The optimal sets of features for predicting ventilation under any budget constraint included demographics and comorbidities (DCM), basic metabolic panel (BMP), D-dimer, lactate dehydrogenase (LDH), erythrocyte sedimentation rate (ESR), CRP, brain natriuretic peptide (BNP), and procalcitonin and for mortality included DCM, BMP, complete blood count, D-dimer, LDH, CRP, BNP, procalcitonin and ferritin.
Conclusions: This study presents a quick, accurate and cost-effective method to evaluate risk of deterioration for patients with SARS-CoV-2 infection at the time of clinical evaluation.

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5
The full text of this article is available to read as a PDF.
No competing interests reported.
This is a list of supplementary files associated with this preprint. Click to download.
Loading...
Posted 08 Jun, 2021
On 04 Aug, 2021
Received 04 Aug, 2021
On 31 Jul, 2021
Invitations sent on 20 Jul, 2021
On 19 Jul, 2021
On 23 Jun, 2021
On 22 Jun, 2021
On 05 Jun, 2021
Posted 08 Jun, 2021
On 04 Aug, 2021
Received 04 Aug, 2021
On 31 Jul, 2021
Invitations sent on 20 Jul, 2021
On 19 Jul, 2021
On 23 Jun, 2021
On 22 Jun, 2021
On 05 Jun, 2021
Background: Machine learning (ML) based risk stratification models of Electronic Health records (EHR) data may help to optimize treatment of COVID-19 patients, but are often limited by their lack of clinical interpretability and cost of laboratory tests. We develop a ML based tool for predicting adverse outcomes based on EHR data to optimize clinical utility under a given cost structure. This cohort study was performed using deidentified EHR data from COVID-19 patients from ProMedica Healthcare in northwest Ohio and southeastern Michigan.
Methods: We tested performance of various ML approaches for predicting either increasing ventilatory support or mortality and the set of model features under a budget constraint was optimized via exhaustive search across all combinations of features.
Results: The optimal sets of features for predicting ventilation under any budget constraint included demographics and comorbidities (DCM), basic metabolic panel (BMP), D-dimer, lactate dehydrogenase (LDH), erythrocyte sedimentation rate (ESR), CRP, brain natriuretic peptide (BNP), and procalcitonin and for mortality included DCM, BMP, complete blood count, D-dimer, LDH, CRP, BNP, procalcitonin and ferritin.
Conclusions: This study presents a quick, accurate and cost-effective method to evaluate risk of deterioration for patients with SARS-CoV-2 infection at the time of clinical evaluation.

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5
The full text of this article is available to read as a PDF.
No competing interests reported.
This is a list of supplementary files associated with this preprint. Click to download.
Loading...