Maintaining proper health records improves machine learning predictions for novel 2019-nCoV

doi:10.21203/rs.3.rs-33551/v3

Research article

https://doi.org/10.21203/rs.3.rs-33551/v3

This work is licensed under a CC BY 4.0 License

Journal Publication

published 27 May, 2021

Read the published version in BMC Medical Informatics and Decision Making →

You are reading this latest preprint version

Background: An ongoing outbreak of novel coronavirus (2019-nCoV) pneumonia continues to affect the whole world, including major countries such as China, the USA, Italy, France and the United Kingdom. We present outcome (‘recovered’, ‘isolated’ or ‘death’) risk estimates for 2019-nCoV based on ‘early’ datasets. A major consideration is the likelihood of death for patients with 2019-nCoV.

Method: Accounting for the impact of variations in the reporting rate of 2019-nCoV, we applied machine learning techniques (AdaBoost, bagging, extra-trees, decision trees and k-nearest neighbour classifiers) to two 2019-nCoV datasets obtained from Kaggle on March 30, 2020. We used ‘country’, ‘age’ and ‘gender’ as features to predict the outcome for both datasets. For the second dataset, we additionally included the patient’s ‘disease’ history, which is present only in that dataset.
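As an illustration of the feature-set comparison described in the Method, the following is a minimal sketch in pure Python, not the authors' implementation. It uses only a k-nearest neighbour classifier (one of the five classifier families named above) on a handful of invented records. The records, the `encode` and `knn_predict` helpers, the age scaling and the choice of k = 1 are all assumptions made for illustration and are not taken from the Kaggle datasets.

```python
from collections import Counter

# Hypothetical toy records, invented for illustration only (NOT Kaggle data):
# (country, age, gender, disease history flag, outcome)
RECORDS = [
    ("China", 72, "male", 1, "death"),
    ("China", 68, "female", 1, "death"),
    ("Italy", 80, "male", 1, "death"),
    ("Italy", 75, "male", 0, "recovered"),
    ("China", 70, "female", 0, "recovered"),
    ("USA", 35, "female", 0, "recovered"),
    ("USA", 41, "male", 0, "isolated"),
    ("France", 29, "female", 0, "isolated"),
]
COUNTRIES = sorted({rec[0] for rec in RECORDS})


def encode(record, use_disease):
    """Turn a record into a numeric vector: one-hot country, scaled age, gender."""
    country, age, gender, disease, _ = record
    vec = [1.0 if country == c else 0.0 for c in COUNTRIES]
    vec.append(age / 100.0)  # assumed scaling so age does not dominate
    vec.append(1.0 if gender == "male" else 0.0)
    if use_disease:
        vec.append(float(disease))  # the extra 'disease' feature
    return vec


def knn_predict(train, query, k=1, use_disease=True):
    """Predict the outcome of `query` by majority vote among its k nearest records."""
    q = encode(query, use_disease)
    scored = []
    for rec in train:
        v = encode(rec, use_disease)
        dist = sum((a - b) ** 2 for a, b in zip(v, q))  # squared Euclidean distance
        scored.append((dist, rec[4]))
    scored.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# A hypothetical new patient with a disease history; the outcome is unknown.
patient = ("China", 71, "female", 1, None)
with_history = knn_predict(RECORDS, patient, k=1, use_disease=True)
without_history = knn_predict(RECORDS, patient, k=1, use_disease=False)
print(with_history, without_history)
```

On this toy data the nearest neighbour changes once the ‘disease’ flag is part of the feature vector, so the two calls return different outcomes. This only demonstrates the mechanism of adding a feature; it says nothing about the size of the effect on real data.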

Results: The use of a patient’s ‘disease’ history improves the prediction of ‘death’ by more than 7-fold. The models ignoring a patient’s ‘disease’ history performed poorly in test predictions.

Conclusion: Our findings indicate the potential of using a patient’s ‘disease’ history as part of the feature set in machine learning techniques to improve 2019-nCoV predictions. This development can have a positive effect on predictive patient treatment and can result in easing currently overburdened healthcare systems worldwide, especially with the increasing prevalence of second and third wave re-infections in some countries.

Medical Informatics

2019-nCoV

pneumonia

machine learning

AdaBoost

Bagging

classifiers

disease

death

prediction

Review #1 received at journal
08 Mar, 2021
Reviewer #1 agreed at journal
14 Feb, 2021
Reviewers invited by journal
07 Feb, 2021
Editor assigned by journal
06 Jan, 2021
Submission checks completed at journal
06 Jan, 2021
Editor invited by journal
06 Jan, 2021
