Setting & data
Our cohort of unplanned admissions is drawn from two acute hospitals approximately 65 kilometres apart in the Yorkshire & Humberside region of England – Scarborough Hospital (n ~ 300 beds) and York Hospital (YH) (n ~ 700 beds), both managed by York Teaching Hospitals NHS Foundation Trust. For this study, the two acute hospitals were combined into a single dataset and analysed collectively. Both hospitals record NEWS scores and vital signs electronically as part of the patient’s routine process of care (see Table S1).
We considered all adult (age ≥ 18 years) emergency medical admissions (excluding ambulatory care area patients) discharged (alive or deceased) during a 3-month period (11 March 2020 to 13 June 2020), with an electronic NEWS recorded within ± 24 hours of admission. This on-admission NEWS score is referred to as the index NEWS.
For each emergency admission, we obtained a pseudonymised patient identifier, patient’s age (years), gender (male/female), discharge status (alive/dead), admission and discharge date and time, diagnosis codes based on the 10th revision of the International Statistical Classification of Diseases (ICD-10) [11, 12], NEWS (including its subcomponents: respiratory rate [breaths per minute], temperature [°C], systolic pressure [mmHg], pulse rate [beats per minute], oxygen saturation [%], oxygen supplementation [yes/no], and alertness level [alert, voice, pain, unconscious]) [6, 13], blood test results (albumin [g/L], creatinine [µmol/L], haemoglobin [g/L], potassium [mmol/L], sodium [mmol/L], urea [mmol/L], and white cell count [10⁹ cells/L]), and the Acute Kidney Injury (AKI) score.
Table 1
Four risk scores for predicting the risk of mortality and sepsis, known as computer-aided risk scoring systems (CARSS)
| Computer-Aided Risk (CAR) score | NEWS data only (N) | NEWS and blood test results data (NB) |
| --- | --- | --- |
| Mortality (M) | CARM_N | CARM_NB |
| Sepsis (S) | CARS_N | CARS_NB |
We previously developed and externally validated four risk scores: 1) CARM_N, for predicting in-hospital mortality based on NEWS alone [10]; 2) CARM_NB, for predicting in-hospital mortality incorporating routine blood test results [7]; 3) CARS_N, for predicting sepsis based on NEWS alone [9]; and 4) CARS_NB, for predicting sepsis incorporating routine blood test results [8] (see Table 1). These four equations are collectively known as computer-aided risk scoring systems (CARSS) and are calculated using the index NEWS and blood test results. We excluded records where the index NEWS was not recorded within ± 24 hours of admission, where blood test results were not recorded within ± 96 hours, or where either was missing entirely (see Table S2).
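The record-exclusion rule above can be sketched as a simple timestamp check. This is a minimal illustration, not the study pipeline; the variable names and timestamps are hypothetical.

```python
from datetime import datetime, timedelta

def within_window(admission_time, recorded_time, hours):
    """True if the measurement was recorded within ±hours of admission."""
    return abs(recorded_time - admission_time) <= timedelta(hours=hours)

# Hypothetical admission with NEWS and blood-result timestamps
admission   = datetime(2020, 3, 15, 10, 0)
news_time   = datetime(2020, 3, 15, 20, 30)  # 10.5 h after admission
bloods_time = datetime(2020, 3, 18, 9, 0)    # 71 h after admission

# Retain the record only if the index NEWS falls within ±24 h
# and the blood test results within ±96 h of admission
include = (within_window(admission, news_time, 24)
           and within_window(admission, bloods_time, 96))
print(include)  # True
```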
Records with COVID-19 were identified by searching both primary and secondary ICD-10 codes for ‘U071’. We also linked positive laboratory results for COVID-19 swabs to an automated diagnostic coding entry in the patient electronic health record.
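The code-based identification amounts to a membership check across all diagnosis positions for an admission. A minimal sketch, assuming the codes are held as a list per admission (the example codes are hypothetical):

```python
def has_covid_code(diagnosis_codes):
    """True if ICD-10 code 'U071' appears in any primary or secondary position."""
    return "U071" in diagnosis_codes

print(has_covid_code(["J189", "U071"]))  # True
print(has_covid_code(["J189", "N390"]))  # False
```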
Statistical Analyses
We report discrimination and calibration statistics as performance measures for CARSS [14].
We determined the discrimination of CARSS using the concordance statistic (c-statistic), which gives the probability that a randomly selected patient who experienced the adverse outcome had a higher risk score than a randomly selected patient who did not. For a binary outcome (COVID-19/non-COVID-19), the c-statistic is the area under the Receiver Operating Characteristic (ROC) curve [15]. The ROC curve is a plot of sensitivity (true positive rate) versus 1 − specificity (false positive rate) across consecutive predicted risk thresholds. A c-statistic of 0.5 is no better than tossing a coin, whilst a perfect model has a c-statistic of 1. In general, values less than 0.7 are considered to show poor discrimination, values of 0.7 to 0.8 can be described as reasonable, and values above 0.8 suggest good discrimination [16].
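The probabilistic definition of the c-statistic can be computed directly by comparing risk scores across all outcome/non-outcome pairs. This is an illustrative sketch with made-up scores, not the study's implementation (which used the pROC library):

```python
from itertools import product

def c_statistic(scores, outcomes):
    """Concordance statistic: the probability that a randomly chosen patient
    with the outcome (1) has a higher risk score than a randomly chosen
    patient without it (0); tied scores count as 0.5."""
    pos = [s for s, y in zip(scores, outcomes) if y == 1]
    neg = [s for s, y in zip(scores, outcomes) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one patient in each outcome class")
    concordant = sum((p > n) + 0.5 * (p == n) for p, n in product(pos, neg))
    return concordant / (len(pos) * len(neg))

# Hypothetical predicted risks and observed outcomes
scores   = [0.9, 0.8, 0.35, 0.3, 0.1]
outcomes = [1,   1,   0,    1,   0]
print(round(c_statistic(scores, outcomes), 3))  # 0.833
```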
Calibration measures a model's ability to generate predictions that are, on average, close to the observed outcomes, and can be readily seen on a scatter plot (y-axis: observed risk; x-axis: predicted risk), where perfect predictions lie on the 45° line. We internally validated and assessed the calibration of all the models using a bootstrapping approach [17, 18]. Overall statistical performance was assessed using the scaled Brier score, which incorporates both discrimination and calibration [14]. The Brier score is the mean squared difference between the observed outcome and the predicted risk of COVID-19; it is scaled by the maximum possible Brier score so that the scaled Brier score ranges from 0 to 100%, with higher values indicating superior models. The 95% confidence interval for the scaled Brier score was calculated using a bootstrap approach [19].
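The scaled Brier score and its percentile bootstrap interval can be sketched as follows. This assumes (as is conventional) that the maximum Brier score is that of a non-informative model predicting the event prevalence for everyone; the score is expressed here as a fraction of 1 rather than a percentage, and the data are hypothetical:

```python
import random

def scaled_brier(probs, outcomes):
    """Scaled Brier score: 1 - Brier / Brier_max, where Brier_max is the
    Brier score of a model predicting the prevalence for every patient."""
    n = len(outcomes)
    brier = sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / n
    prevalence = sum(outcomes) / n
    brier_max = prevalence * (1 - prevalence)
    return 1 - brier / brier_max

def bootstrap_ci(probs, outcomes, n_boot=2000, alpha=0.05, seed=1):
    """Percentile bootstrap confidence interval for the scaled Brier score."""
    rng = random.Random(seed)
    n = len(outcomes)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [outcomes[i] for i in idx]
        if 0 < sum(ys) < n:  # skip degenerate resamples with one class only
            stats.append(scaled_brier([probs[i] for i in idx], ys))
    stats.sort()
    return stats[int(alpha / 2 * n_boot)], stats[int((1 - alpha / 2) * n_boot) - 1]

# Hypothetical predicted risks and observed outcomes
probs = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2]
outcomes = [1, 1, 0, 1, 0, 0]
print(scaled_brier(probs, outcomes))
print(bootstrap_ci(probs, outcomes))
```

A perfect model scores 1, and a model that merely predicts the prevalence scores 0; poorly calibrated models can score below 0.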
We followed the STROBE guidelines to report the findings [20]. All analyses were undertaken using R [21] and Stata [22]. The 95% confidence interval for the c-statistic was computed using DeLong’s method as implemented in the pROC library [23].