Data source and study subjects
A total of 2,756 adult patients (≥ 18 years old) who started CRRT due to acute kidney injury at Seoul National University Hospital between June 2010 and February 2020 were retrospectively reviewed. Patients who had underlying end-stage renal disease (n = 344), stopped CRRT within 1 hour of initiation (n = 49), or had no information on comorbidities or laboratory data (n = 14) were excluded. Accordingly, 2,349 patients were analyzed in the present study. The patients were randomly divided into a training set (70%) to develop the models and a testing set (30%) to test and calibrate their performance. The study was approved by the institutional review board of Seoul National University Hospital (no. H-2003-024-1106) and complied with the Declaration of Helsinki. The requirement for informed consent was waived by the board.
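As an illustration only (the study's analyses were performed in R, and the data below are synthetic), a random 70%/30% split of the final cohort of this kind can be sketched as follows; the array shapes mirror the cohort size and feature count described above:

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
n = 2349                        # patients remaining after exclusions
X = rng.normal(size=(n, 92))    # 92 features at CRRT start (synthetic values)
y = rng.integers(0, 2, size=n)  # ICU mortality label (synthetic values)

# Random 70% training / 30% testing split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42
)
```

The paper does not state whether the split was stratified on the outcome, so no stratification is applied here.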
Study variables and outcomes
Using an electronic medical record system, a total of 92 features at the time of starting CRRT were collected to develop the machine learning models. Clinical features included age, sex, weight, use of mechanical ventilation, and comorbidities such as diabetes mellitus, hypertension, ischemic heart disease, chronic heart failure, stroke, peripheral vascular disease, dementia, chronic kidney disease including diabetic nephropathy, chronic obstructive pulmonary disease, connective tissue disease, peptic ulcer disease, cancer, and arrhythmia including atrial fibrillation, atrioventricular block, ventricular tachycardia, tachycardia-bradycardia syndrome, and complete left bundle branch block. Vital signs such as systolic blood pressure (SBP), diastolic blood pressure (DBP), mean arterial pressure (MAP), heart rate, respiratory rate, and body temperature were measured. The blood pressure values were collected continuously, at intervals of 1 hour or less, after starting CRRT. The laboratory data included white blood cell count, hemoglobin, hematocrit, platelet count, total bilirubin, blood urea nitrogen, creatinine, total protein, albumin, pH, sodium, potassium, calcium, phosphate, uric acid, prothrombin time-international normalized ratio, activated partial thromboplastin time, partial pressures of arterial carbon dioxide and oxygen, ratio of partial pressure of arterial oxygen to fractional inspired oxygen, alveolar-arterial oxygen gradient, and the presence of bacteremia. As CRRT setting values, the target dose, blood flow rate, amounts of dialysate and replacement fluids (pre- and post-dilution), target amounts of input and output, number of bicarbonate ampules mixed in the dialysate and replacement fluids, and catheter type were collected. Information on the infused medications or fluids and their infusion rates was obtained, as shown in Table S1. The number of bicarbonate ampules mixed in these fluids was calculated. The Glasgow Coma Scale score was also calculated.
The SOFA, APACHE II, and MOSAIC scores were calculated based on the methods presented in the original studies [14–16]. Hypotension was defined as a reduction in MAP of ≥ 20 mmHg from the initial value within 6 hours. Alternative definitions were also applied, such as a reduction in MAP of ≥ 30 mmHg from the initial value or a timeframe of within 1 hour. Intensive care unit (ICU) mortality, defined as all-cause death during the ICU admission, was estimated.
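The hypotension definition above (and its alternative thresholds and timeframes) can be expressed as a small labeling function. This is an illustrative sketch, not the authors' code; the function name and the `(hours, MAP)` input format are hypothetical:

```python
def is_hypotensive(map_series, window_hr=6.0, drop_mmhg=20.0):
    """Hypotension per the study definition: a reduction in MAP of
    >= `drop_mmhg` from the initial value within `window_hr` hours
    after CRRT start. `map_series` is a list of (hours_since_start, MAP)
    tuples, with the initial value at time 0. (Hypothetical helper.)"""
    baseline = map_series[0][1]
    return any(
        baseline - value >= drop_mmhg
        for t, value in map_series
        if 0 < t <= window_hr
    )

# The alternative definitions correspond to drop_mmhg=30.0 or window_hr=1.0.
```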
Statistical analysis and development of machine learning models
Development of the machine learning models and statistical analyses were performed using R software (version 4.0.2; The Comprehensive R Archive Network: http://cran.r-project.org). Categorical and continuous features are expressed as proportions and means ± standard deviations, respectively. The chi-square test was used to compare categorical features (or Fisher's exact test, if not applicable), and Student's t test was used to compare continuous features between the training and testing sets. A restricted cubic spline was used to display the odds ratio of ICU mortality according to the change in MAP values during CRRT.
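The baseline comparisons between the training and testing sets can be sketched as below. This is an illustration in Python with synthetic values (the study used R); the example features and counts are hypothetical:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)

# Continuous feature (e.g., age): Student's t test between the two sets
age_train = rng.normal(62, 14, size=1644)
age_test = rng.normal(62, 14, size=705)
t_stat, p_cont = stats.ttest_ind(age_train, age_test)

# Categorical feature (e.g., a comorbidity): chi-square test on a 2x2 table
table = np.array([[600, 1044],   # training set: with / without (synthetic counts)
                  [250, 455]])   # testing set: with / without (synthetic counts)
chi2, p_cat, dof, expected = stats.chi2_contingency(table)
if (expected < 5).any():         # chi-square not applicable: Fisher's exact test
    _, p_cat = stats.fisher_exact(table)
```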
Three machine learning algorithms were used: the support vector machine (SVM), deep neural network (DNN), and light gradient boosting machine (LGBM). The SVM models used four kernels: linear, polynomial, sigmoid, and radial basis function. For each kernel, 10-fold cross-validation with a grid search over hyperparameters (cost, gamma, degree, and coefficient) was performed to identify the best values. The kernel yielding the highest area under the receiver operating characteristic curve (AUROC) was selected for the final model. In the DNN model, the optimal hyperparameters, consisting of the size (number of hidden nodes) and decay (parameter for weight decay), were determined with 10-fold cross-validation and grid search. When developing the SVM and DNN models, the continuous features were normalized, and the categorical features were one-hot encoded. In the LGBM model, the hyperparameters (max_bin, learning rate, boosting method, and nrounds) were adjusted, and the model with the highest AUROC was selected for comparison.
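The SVM kernel and hyperparameter selection described above (grid search with 10-fold cross-validation, scored by AUROC, with normalized inputs) can be sketched as follows. This is a minimal Python illustration on synthetic data, not the authors' R code, and the reduced grid is an assumption for brevity:

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 10))                               # synthetic features
y = (X[:, 0] + rng.normal(scale=0.5, size=300) > 0).astype(int)

# Normalize continuous features, then search kernels and hyperparameters
# with 10-fold cross-validation, selecting by AUROC. "C" is the cost
# parameter; degree applies to the polynomial kernel (coef0, the
# coefficient term, is omitted here for brevity).
param_grid = {
    "svc__kernel": ["linear", "poly", "sigmoid", "rbf"],
    "svc__C": [0.1, 1],
    "svc__gamma": ["scale"],
    "svc__degree": [3],
}
pipe = make_pipeline(StandardScaler(), SVC())
search = GridSearchCV(pipe, param_grid, cv=10, scoring="roc_auc")
search.fit(X, y)
best_kernel = search.best_params_["svc__kernel"]
```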
For performance indices, the AUROC, accuracy, F1 score, Matthews correlation coefficient (MCC), and area under the precision-recall curve (AUPRC) were measured in the testing set. The AUROCs were compared between models using the DeLong test. The MCC is a more informative and reliable score for evaluating binary classification than accuracy or the F1 score [17]. MCC values of + 1, 0, and − 1 represent perfect prediction, average random prediction, and inverse prediction, respectively. The classification threshold was set at the point where the F1 score was highest. For calibration, Brier scores were calculated, with values closer to 0 indicating better calibration. We ranked the importance of features in developing the DNN and LGBM models. The performance of the machine learning models with varying numbers of features, in order of ranking, was also evaluated. P values less than 0.05 were considered significant.
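The performance and calibration indices above, including the choice of threshold at the maximum F1 score, can be sketched as follows (an illustration on synthetic predictions; the DeLong test is omitted, as it has no standard scipy/sklearn implementation):

```python
import numpy as np
from sklearn.metrics import (roc_auc_score, accuracy_score, f1_score,
                             matthews_corrcoef, average_precision_score,
                             brier_score_loss, precision_recall_curve)

rng = np.random.default_rng(0)
y_true = rng.integers(0, 2, size=500)                       # synthetic labels
y_prob = np.clip(y_true * 0.6 + rng.normal(0.2, 0.2, 500), 0, 1)

# Threshold-free indices and calibration
auroc = roc_auc_score(y_true, y_prob)
auprc = average_precision_score(y_true, y_prob)
brier = brier_score_loss(y_true, y_prob)     # closer to 0 = better calibrated

# Choose the probability threshold that maximizes the F1 score
prec, rec, thr = precision_recall_curve(y_true, y_prob)
f1_curve = 2 * prec * rec / np.maximum(prec + rec, 1e-12)
threshold = thr[np.argmax(f1_curve[:-1])]    # last (prec, rec) point has no threshold

# Threshold-dependent indices at that operating point
y_pred = (y_prob >= threshold).astype(int)
acc = accuracy_score(y_true, y_pred)
f1 = f1_score(y_true, y_pred)
mcc = matthews_corrcoef(y_true, y_pred)      # +1 perfect, 0 random, -1 inverse
```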