Intelligent prediction of RBC demand in trauma patients using decision tree methods

DOI: https://doi.org/10.21203/rs.3.rs-41246/v2

Abstract

Background: The vital signs of trauma patients are complex and change rapidly, and prediction of blood transfusion demand depends mainly on doctors' experience and trauma scoring systems, so it is often inaccurate. In this study, machine learning decision tree algorithms (classification and regression tree, CRT, and eXtreme gradient boosting, XGBoost) were used to predict blood transfusion demand after trauma and to provide technical support for doctors.

Methods: Data on a total of 1,371 trauma patients who were admitted to the emergency department from January 2014 to January 2018 were collected from an emergency trauma database. The vital signs, laboratory examination parameters and blood transfusion volume were used as variables, and the non-invasive parameters and all (non-invasive + invasive) parameters were used to construct an intelligent prediction model for RBC demand by logistic regression (LR), CRT and XGBoost. The prediction performance of the models was compared using the area under the curve (AUC).

Results: When only non-invasive parameters were used to predict blood transfusion, the LR method was the best, with an AUC of 0.72 (95% confidence interval [CI] 0.657-0.775), which was higher than the CRT AUC of 0.69 (95% CI 0.633-0.751) and the XGBoost AUC of 0.71 (95% CI 0.654-0.756) (P<0.05). The trauma site and shock index were important prediction parameters. When all the prediction parameters were used, XGBoost was the best, with an AUC of 0.94 (95% CI 0.893-0.981), which was higher than the LR AUC of 0.80 (95% CI 0.744-0.850) and the CRT AUC of 0.82 (95% CI 0.779-0.853) (P<0.05). Haematocrit/haemoglobin was an important prediction parameter.

Conclusions: The classification performance of the intelligent prediction model constructed by the decision tree algorithm is not inferior to that of the traditional LR method. With the increase in the data quantity, the accuracy of the model improved in the iteration process, and the prediction performance continuously improved, which is conducive to clinical application and wide promotion.

Background

Trauma accounts for approximately 9% of global deaths[1], and deaths mainly occur within the first 12 hours after trauma[2]. The first step in trauma treatment is to control the bleeding as soon as possible, identify the mechanism of trauma, and directly transfer the patient to a nearby trauma treatment institution[3]. Post-traumatic blood loss is a leading and potentially preventable cause of death[4]. The core principle of treatment is to identify the risk of haemorrhagic shock as early as possible; meanwhile, fluid resuscitation and blood transfusion are needed to maintain the stability of basic vital signs and haemodynamics[5]. One study found that transfusion of blood products pre-hospital or within 15 minutes after injury was associated with lower 24-hour mortality (5.6% vs. 20.2%) and 30-day mortality (11.8% vs. 22.9%) than delayed or no transfusion[6]. Delayed blood transfusion can lead to pulmonary complications and death[7]. Several studies have found that RBC transfusion in trauma patients is associated with increased morbidity and mortality[8, 9]. Kotwal RS et al. found that the death rate of the massive blood transfusion group was significantly lower than that of the non-massive blood transfusion group, especially in severe and extremely severe trauma (ISS>15). However, regardless of the trauma severity, the mortality decreased gradually in the massive blood transfusion group, non-massive blood transfusion group and non-transfusion group, and there was a significant difference. With the increase in blood transfusion, the mortality rate gradually increased during hospitalization[10]. Therefore, blood products should be given early in the pre-hospital transfer to improve the patients' survival rate after trauma, and then other interventions should occur as soon as possible to strictly control the amount of blood transfused.

At present, there are many studies on traumatic massive blood transfusion, including various trauma scoring systems for battlefield and civilian settings[11-13], which are used to predict when to initiate massive blood transfusion programmes. However, in recent years, with the improvement of early pre-hospital and in-hospital trauma management measures, the proportion of patients with massive blood transfusion has gradually decreased[14]. For trauma patients who do not meet the massive blood transfusion standard, there are few studies on the need for blood transfusion. The fifth edition of the European Trauma Guide recommends that the target Hb should be maintained at 70-90 g/L[5], which can be used as a reference for blood transfusion needs, but the guidelines also suggest that normal initial Hb test results may mask bleeding, and it is recommended to use the results of repeated Hb tests as laboratory indicators of bleeding. Therefore, Hb results alone have limited reference value for deciding whether to perform a blood transfusion. How to judge the best demand for blood transfusion according to the changing vital signs of trauma patients is a difficult problem for emergency doctors. At present, most of the blood transfusion decisions made by doctors are based on their personal experience, and there is no feasible and recognized reference standard for different individuals. Transfusion too early will not only waste blood components but also affect the prognosis of patients with excessive blood transfusion[10, 15]. A delayed blood transfusion will lead to haemorrhagic shock, aggravate complications such as hypothermia, acidosis and coagulation dysfunction, and seriously affect the survival rate of patients[16].

We believe that the application of machine learning methods, compared with traditional statistical methods, can help us to identify whether patients need a blood transfusion and reduce unnecessary complications caused by delayed transfusion, insufficient blood transfusion or excessive transfusion. Therefore, this study proposes a new method to establish an artificial intelligence mathematical model by retrospective analysis of patients' vital signs, laboratory tests and other data to assist doctors in quickly making decisions on whether a blood transfusion is needed after trauma and to improve the success rate of patient treatment.

Methods

Clinical data

The Emergency Trauma Database of the Chinese People's Liberation Army (PLA) General Hospital is a comprehensive, de-identified dataset containing medical information on 22,491 critically ill patients from January 2014 to January 2018[17, 18]. All patients were admitted to the emergency department. The medical information of 1,371 trauma patients who were triaged to a critical rescue room was extracted. The data related to blood transfusion were provided by the clinical blood transfusion intelligent management and evaluation system established by the Transfusion Department of the Chinese PLA General Hospital[19]. Patient records in the two databases were linked by the outpatient number, which uniquely identified each patient. During data extraction, quality control confirmed that the extracted data were completely consistent with the original database records. The Medical Ethics Committee of the Chinese PLA General Hospital approved the study and waived the requirement for written informed consent.

Variables included

Basic information (age, sex, height, weight), diagnosis, admission time, discharge time, after-department track, blood transfusion time, blood transfusion components, RBC infusion volume.

Non-invasive detection parameters: vital signs [heart rate (HR), respiration (R), shock index (SI), systolic blood pressure (SBP), diastolic blood pressure (DBP), blood oxygen saturation (SpO2), temperature (T)] and test time, trauma site.

Invasive detection parameters: routine blood test parameters [haemoglobin (Hb), haematocrit (Hct), platelet count (PLT), C-reactive protein (CRP), interleukin-6 (IL-6)] and test time; coagulation indicators [prothrombin time (PT), activated partial thromboplastin time (APTT), international normalized ratio (INR), prothrombin activity (PTA), fibrinogen (Fib)] and test time; blood gas test parameters [potential of hydrogen (PH), partial pressure of oxygen (PO2), partial pressure of carbon dioxide (PCO2), total carbon dioxide (TCO2), lactate (Lac), actual bicarbonate (AB), standard bicarbonate (SB), potassium (K)] and test time; trauma severity classification (first level, second level and third level); endotracheal intubation; vasoactive drugs.

Construct new variables: For the trauma diagnostic classification, we divided the variables into the fields of trauma type (open trauma, blunt injury) and trauma site (head and neck, upper extremity, lower extremity, chest and abdomen, spine, trunk and pelvis). (An additional table of examples for 10 patients with their features shows this in more detail [see Additional Table 1]).

Variable dimensionality reduction: To reduce the run time and complexity of the model, only one variable from each group of highly correlated variables was retained; for example, of Hb and Hct, only Hct was retained.
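The correlation-based reduction described above can be sketched as follows. This is a minimal illustration, not the authors' procedure: the threshold of 0.9, the priority order and the sample values are assumptions, since the paper does not report them.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def drop_correlated(columns, priority, threshold=0.9):
    """Return the variable names to retain.

    columns: dict mapping name -> list of values.
    priority: names in order of preference; an earlier name is kept over a
    later one that is highly correlated with it.
    threshold: assumed cut-off (not stated in the paper).
    """
    kept = []
    for name in priority:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Made-up values for illustration only (not study data).
data = {
    "Hct": [0.42, 0.35, 0.28, 0.39, 0.31],
    "Hb": [140, 117, 93, 130, 104],
    "HR": [80, 110, 95, 130, 88],
}
retained = drop_correlated(data, priority=["Hct", "Hb", "HR"])
print(retained)
```

Listing Hct before Hb in the priority order reproduces the choice stated in the text: Hb is dropped because it is almost perfectly correlated with Hct in this toy data.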

Inclusion and exclusion criteria: The inclusion criteria were as follows: (1) patients' diagnoses were matched or fuzzy matched with "injury", and (2) patients were triaged from the emergency department to a critical rescue room. The exclusion criteria were as follows: (1) patients with non-external trauma and (2) age < 18 years old.

Acquisition of variables

The process of obtaining variables included extracting and aggregating variables, cleaning variables, and processing variables.

Variable extraction: The numerical variables were extracted directly, including vital sign parameters, laboratory test results, and information related to blood transfusion in the database. The results of the first examination on arrival at the emergency department were used as variables to predict the demand for blood transfusion. If multiple tests were performed before or after blood transfusion, the results closest to the time of blood transfusion were included in the analysis. We used natural language processing to extract effective information in advance from unstructured text variables in the database, such as diagnoses and medical orders. We extracted the variable information from the emergency trauma database, linked it through the patient's unique identifier to the blood transfusion information in the clinical blood transfusion database system, and aggregated it into a single record per patient.
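The two selection rules in this paragraph (first examination on arrival, and the result closest to the transfusion time) can be illustrated with a short sketch. The record layout, field names and values below are hypothetical and only serve to show the logic.

```python
from datetime import datetime

def first_result(tests):
    """Return the earliest test record (the first examination on arrival)."""
    return min(tests, key=lambda t: t["time"])

def closest_to_transfusion(tests, transfusion_time):
    """Return the test record whose time is nearest to the transfusion start."""
    return min(
        tests,
        key=lambda t: abs((t["time"] - transfusion_time).total_seconds()),
    )

# Made-up Hb results for one hypothetical patient.
hb_tests = [
    {"time": datetime(2016, 5, 1, 10, 5), "Hb": 112},
    {"time": datetime(2016, 5, 1, 11, 40), "Hb": 96},
    {"time": datetime(2016, 5, 1, 14, 0), "Hb": 101},
]
transfusion_start = datetime(2016, 5, 1, 12, 0)

baseline = first_result(hb_tests)
nearest = closest_to_transfusion(hb_tests, transfusion_start)
print(baseline["Hb"], nearest["Hb"])
```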

Variable cleaning: (1) Duplicate data were cleaned according to retention principles; for example, when vital signs were measured many times after arrival at the emergency department, the results of the first measurement were taken as the key variables for judging whether a blood transfusion occurred. (2) Invalid values were checked against defined criteria, such as height and weight recorded as -1 or 0. (3) The logical relationships among the data were checked, such as the order of the time of admission, the time of laboratory examination, and the blood transfusion start time.
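A minimal sketch of these cleaning steps is given below. The field names, the use of numeric timestamps and the decision to flag rather than delete illogical records are all assumptions made for illustration.

```python
def clean(records):
    """Apply the three cleaning rules to a list of measurement records.

    Each record is a dict with hypothetical keys: patient_id, admitted_at,
    measured_at (numeric timestamps), height and weight.
    """
    seen = set()
    cleaned = []
    # Sorting by measurement time means the first record met for each
    # patient is the first measurement after arrival.
    for rec in sorted(records, key=lambda r: r["measured_at"]):
        if rec["patient_id"] in seen:
            continue  # rule 1: drop later duplicates, keep the first measurement
        seen.add(rec["patient_id"])
        row = dict(rec)
        for field in ("height", "weight"):
            # rule 2: placeholder values such as -1 or 0 are treated as missing
            if row[field] is None or row[field] <= 0:
                row[field] = None
        # rule 3: a measurement cannot precede admission
        row["time_ok"] = row["admitted_at"] <= row["measured_at"]
        cleaned.append(row)
    return cleaned

# Made-up records for illustration only.
records = [
    {"patient_id": "A", "admitted_at": 0, "measured_at": 2, "height": 172, "weight": -1},
    {"patient_id": "A", "admitted_at": 0, "measured_at": 1, "height": 172, "weight": 70},
    {"patient_id": "B", "admitted_at": 5, "measured_at": 4, "height": 0, "weight": 65},
]
cleaned = clean(records)
print(cleaned)
```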

Variable processing: (1) Categorical variable processing: categorical variables, such as gender, were converted into numerical vectors and then used to build the models; (2) Unstructured text variable processing: the automatic counting word segmentation algorithm in natural language processing was used to transform words into numerical variables; (3) Construction of new variables: the diagnostic information of patients, such as diagnostic details and trauma sites, was divided into phrases and fields, the different categories were counted and scored against the target variable, and the model was trained using the learned rules to construct new variables.
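The first two processing steps can be illustrated with a simplified sketch. The keyword map below is hypothetical and in English; the authors' word segmentation algorithm and dictionary are not described in enough detail to reproduce, so plain whitespace tokenization stands in for it here.

```python
from collections import Counter

# Hypothetical keyword map, not the authors' dictionary.
SITE_KEYWORDS = {
    "head and neck": ("head", "skull", "neck"),
    "chest and abdomen": ("rib", "chest", "abdomen", "spleen"),
    "lower extremity": ("femur", "tibia", "leg"),
}

def one_hot(value, categories):
    """Encode a categorical value as a 0/1 vector over the given categories."""
    return [1 if value == c else 0 for c in categories]

def site_counts(diagnosis):
    """Count keyword hits per trauma site in a free-text diagnosis."""
    tokens = diagnosis.lower().replace(",", " ").split()
    counts = Counter()
    for site, words in SITE_KEYWORDS.items():
        counts[site] = sum(tokens.count(w) for w in words)
    return counts

sex_vector = one_hot("male", ["female", "male"])
counts = site_counts("Open femur fracture, rib fracture, chest contusion")
print(sex_vector, dict(counts))
```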

Establishing the model

SPSS 22.0 software was used to establish the LR model and CRT model.

CRT is a supervised analysis technique that uses binary splitting to divide the data into two parts at each step, which enter the left and right subtrees. The root node of the tree represents the dependent variable, and each child node is produced by splitting its parent node on a classification variable. For the CRT built with non-invasive parameters, the minimum sample size was 20 for a parent node and 10 for a child node. For the CRT built with invasive parameters, the minimum sample size was 50 for a parent node and 20 for a child node. If the sample size of a node does not meet this requirement, the node is a terminal node and is not split further[20].
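The effect of the minimum node sizes can be seen in a small sketch of a single binary split. Gini impurity is assumed as the splitting criterion because the paper does not name one, and the data are made up (shock index multiplied by 100 so that the values are integers).

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def best_split(values, labels, min_parent=20, min_child=10):
    """Return (threshold, weighted_gini) for the best split on one variable.

    Returns None when the node is terminal: either it has fewer than
    min_parent cases, or no split leaves at least min_child cases per side.
    """
    n = len(values)
    if n < min_parent:
        return None
    best = None
    for threshold in sorted(set(values))[:-1]:
        left = [y for x, y in zip(values, labels) if x <= threshold]
        right = [y for x, y in zip(values, labels) if x > threshold]
        if len(left) < min_child or len(right) < min_child:
            continue
        score = (len(left) * gini(left) + len(right) * gini(right)) / n
        if best is None or score < best[1]:
            best = (threshold, score)
    return best

# Made-up data: 12 non-transfused cases with low SI, 12 transfused with high SI.
si_x100 = list(range(50, 62)) + list(range(100, 112))
transfused = [0] * 12 + [1] * 12

split = best_split(si_x100, transfused)
too_small = best_split(si_x100[:15], transfused[:15])
print(split, too_small)
```

With the non-invasive settings (parent 20, child 10), the 24-case node is split cleanly, while a 15-case node is treated as terminal.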

XGBoost is a gradient boosting decision tree algorithm and was run in Python, with the train_test_split() function used to divide the data. XGBoost is a supervised ensemble learning method that can be used for classification (discrete outcomes) and regression (continuous outcomes). Its objective function is composed of a loss function and a regularization term. The loss function measures the error between the prediction and the real result and is minimized during training. The regularization term penalizes the complexity of the model to avoid overfitting. The loss function and the objective function are chosen according to the actual situation.
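For reference, the objective described above is usually written in the following form in the general XGBoost formulation. The symbols are not defined in this paper and are introduced here only for illustration: $l$ is the loss between the observed outcome $y_i$ and the prediction $\hat{y}_i$, $f_k$ is the $k$-th tree, $T$ is the number of leaves in a tree, $w$ is the vector of leaf weights, and $\gamma$ and $\lambda$ are regularization parameters.

```latex
\mathcal{L} = \sum_{i=1}^{n} l\left(y_i, \hat{y}_i\right) + \sum_{k=1}^{K} \Omega\left(f_k\right),
\qquad
\Omega(f) = \gamma T + \frac{1}{2}\,\lambda\,\lVert w \rVert^{2}
```

The first sum is the loss term and the second is the regularization term that penalizes model complexity.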

Statistical analysis

Categorical data are described by frequency and percentage [n (%)], and continuous data are expressed as mean and standard deviation [mean (SD)] or median and interquartile range [median [Range]]. Continuous data of the two groups were compared by analysis of variance or the Kruskal-Wallis nonparametric test, and categorical data of the two groups were compared by the chi-square test. A P value <0.05 was considered statistically significant.

For the LR model, variables that were significant at P<0.05 were selected as independent variables, and whether a blood transfusion was given was used as the dependent variable. After the regression coefficients were standardized, odds ratios (OR) and 95% CIs were used to express the relationship between the variables and the occurrence of blood transfusion.
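The relationship between a logistic regression coefficient and its odds ratio can be made concrete with a short sketch. It assumes a standard Wald-type interval, which the paper does not state explicitly, and uses the shock index interval reported in the Results (95% CI 1.763-6.801) as input.

```python
from math import exp, log

def odds_ratio(beta, se, z=1.96):
    """Return (OR, lower, upper) for a logistic regression coefficient."""
    return exp(beta), exp(beta - z * se), exp(beta + z * se)

def coefficient_from_ci(lower, upper, z=1.96):
    """Recover (beta, se) from a reported 95% CI of an odds ratio,
    assuming the interval is symmetric on the log scale."""
    beta = (log(lower) + log(upper)) / 2
    se = (log(upper) - log(lower)) / (2 * z)
    return beta, se

beta, se = coefficient_from_ci(1.763, 6.801)
or_value, lo, hi = odds_ratio(beta, se)
print(round(or_value, 3), round(lo, 3), round(hi, 3))
```

The recovered point estimate is close to the reported OR of 3.463, which is what one would expect if the interval was computed on the log-odds scale.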

The CRT and XGBoost models used the original variables, combined variables or newly constructed variables of the historical dataset for model training. The historical dataset was randomly divided into an 80% training set and a 20% test set. The model was trained on the training set, and the performance of the model was evaluated on the test set.
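A random 80/20 split of the 1,371 patients can be sketched with the standard library alone. The seed and the function name are illustrative, and the paper does not say whether the split was stratified by transfusion status, so this sketch does not stratify.

```python
import random

def split_ids(ids, test_fraction=0.2, seed=0):
    """Shuffle ids reproducibly and return (train_ids, test_ids)."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]

train_ids, test_ids = split_ids(range(1371))
print(len(train_ids), len(test_ids))
```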

The LR, CRT and XGBoost models were compared with whether a blood transfusion was given as the target variable. Models were established in two ways: method 1 used basic information + non-invasive parameters as analysis variables, and method 2 used basic information + non-invasive parameters + invasive parameters as analysis variables. Receiver operating characteristic curves were drawn, and the AUCs were analysed. The AUC results of the two methods and three models were compared in Python; if P<0.05, the difference was considered statistically significant (Figure 1).
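As a reminder of what the AUC measures, the sketch below computes it through the rank-sum identity: the AUC equals the probability that a randomly chosen transfused patient receives a higher predicted score than a randomly chosen non-transfused patient. The scores are made up, and this is not necessarily how the authors computed their values.

```python
def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of scores.

    Every (positive, negative) pair counts 1 if the positive case scores
    higher, 0.5 if the scores are tied, and 0 otherwise.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

# Made-up predicted probabilities and outcomes (1 = transfused).
example = auc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1])
print(example)
```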

The CRT model reflects the importance of each variable according to its node level (root node, child node) in the decision tree. In the XGBoost model, importance is represented by the weight of each factor in the trees of the gradient boosting decision tree algorithm.
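One common reading of "weight" in gradient boosted trees is the number of times a variable is used to split a node across all trees. The sketch below illustrates that reading on two hypothetical trees stored as nested dictionaries; it is an assumption about the importance measure, not a reproduction of the authors' model.

```python
from collections import Counter

def split_counts(tree, counts=None):
    """Count how many times each feature is used to split within one tree."""
    if counts is None:
        counts = Counter()
    if "feature" in tree:  # internal node; leaves only carry a "value"
        counts[tree["feature"]] += 1
        split_counts(tree["left"], counts)
        split_counts(tree["right"], counts)
    return counts

def weight_importance(trees):
    """Sum split counts over all trees and rank features by that total."""
    total = Counter()
    for tree in trees:
        total.update(split_counts(tree))
    return total.most_common()

# Two made-up trees for illustration only.
trees = [
    {"feature": "Hct",
     "left": {"feature": "TCO2", "left": {"value": -0.2}, "right": {"value": 0.1}},
     "right": {"value": 0.3}},
    {"feature": "Hct",
     "left": {"value": -0.1},
     "right": {"feature": "PH", "left": {"value": 0.0}, "right": {"value": 0.2}}},
]
ranking = weight_importance(trees)
print(ranking)
```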

Results

Patient characteristics

The emergency trauma database of the General Hospital of the Chinese PLA contains the medical information of 22,491 critically ill patients. We included 1,371 patients who met the study criteria for analysis. Among them, there were 324 females (23.6%) and 1,047 males (76.4%). A total of 1,183 patients (86.3%) did not receive a blood transfusion, and 188 patients (13.7%) received a blood transfusion. There was a significant difference between the transfusion group and the non-transfusion group in age, HR, SBP, DBP, SI, Hb, Hct, PLT, PT, APTT, PTA, Fib, PH, PO2, TCO2, Lac, AB, SB, K, endotracheal intubation, vasoactive drugs, trauma site, RBC volume, 24-hour RBC and emergency department time (P<0.05). There was no significant difference between the transfusion group and the non-transfusion group in sex, height, weight, R, SpO2, T, CRP, IL-6, INR, PCO2, trauma severity classification or trauma type (P>0.05) (Table 1).

Model prediction

Method 1 - The model established with non-invasive parameters was used to predict the need for blood transfusion after trauma. The AUC of LR (0.72, 95% CI 0.657-0.775) was higher than the AUC of the XGBoost model (0.71, 95% CI 0.654-0.756) and the AUC of the CRT model (0.69, 95% CI 0.633-0.751) (Figure 2A). There was a significant difference in the AUC among the three models (P<0.05). The accuracy of the XGBoost model was 0.75, which was higher than that of LR (0.55) and CRT (0.48).

Method 2 - The model established with all parameters was used to predict the need for blood transfusion after trauma. The AUC of the XGBoost model was 0.94 (95% CI 0.893-0.981), which was higher than that of the CRT model (0.82, 95% CI 0.779-0.853) and the LR model (0.80, 95% CI 0.744-0.850) (Figure 2B). There was a significant difference in AUC among the three models (P<0.05). The accuracy of the CRT model was 0.89, which was higher than that of XGBoost (0.83) and LR (0.72) (Table 2, Figure 3).
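Because accuracy, sensitivity and specificity are reported alongside AUC, it may help to recall how they relate to the class balance. The sketch below uses the group sizes reported above (188 transfused, 1,183 not transfused) with a hypothetical classifier that never predicts transfusion; it is an illustration of the metrics, not a result from the study.

```python
def confusion_metrics(predicted, actual):
    """Return sensitivity, specificity and accuracy for 0/1 predictions."""
    tp = sum(p == 1 and a == 1 for p, a in zip(predicted, actual))
    tn = sum(p == 0 and a == 0 for p, a in zip(predicted, actual))
    fp = sum(p == 1 and a == 0 for p, a in zip(predicted, actual))
    fn = sum(p == 0 and a == 1 for p, a in zip(predicted, actual))
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / len(actual),
    }

actual = [1] * 188 + [0] * 1183
always_negative = [0] * len(actual)  # hypothetical trivial classifier
metrics = confusion_metrics(always_negative, actual)
print(metrics)
```

With this class balance, accuracy depends strongly on the chosen decision threshold, which is one reason the models are also compared by AUC, sensitivity and specificity.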

Variable importance analysis

Method 1 - Non-invasive detection parameters were used to predict blood transfusion. LR analysis showed that trauma site (OR=18.371, 95% CI 4.019-83.931, P<0.05) and SI (OR=3.463, 95% CI 1.763-6.801, P<0.05) were risk factors for predicting blood transfusion (an additional table shows this in more detail [see Additional Table 2]). The results of the CRT model analysis show that the order of importance of the variables was SI, trauma site, age and SpO2 (Figure 4A). The top five variables in the XGBoost model were trauma site, SBP, SI, DBP and HR (Figure 5A).

Method 2 - All test parameters were used to predict blood transfusion. LR analysis showed that trauma site (OR=7.961, 95% CI 1.422-44.567), vasoactive drugs (OR=2.039, 95% CI 1.092-3.808), PLT (OR=0.995, 95% CI 0.992-0.998), PTA (OR=0.975, 95% CI 0.964-0.988), Hct/Hb (OR=0.923, 95% CI 0.899-0.948), SB and Fib were risk factors for blood transfusion (P<0.05) (Table 3). The results of the CRT model analysis show that the order of importance of the variables was Hct/Hb, Fib and CRP (Figure 4B). The top five variables in the XGBoost model were Hct/Hb, TCO2, PH, PCO2 and CRP (Figure 5B).

Discussion

In our study, non-invasive detection parameters and all parameters were established to predict blood transfusion in trauma patients, and the decision tree algorithm (CRT and XGBoost) was compared with the traditional statistical method (LR). The results showed that the LR model with basic information and non-invasive parameters was the best, but the sensitivity of the CRT model was the highest, and the specificity and accuracy of the XGBoost model were the highest. The AUC of the basic information + non-invasive parameter + invasive parameter model was higher than that of the non-invasive parameter model. The XGBoost model was the best, and the sensitivity was the highest, but the CRT model had the highest specificity.

AUC reflects the classification ability of the model. LR had the best classification ability when predicting with non-invasive parameters, but it was suitable for data analysis and could not be used in clinical applications. The decision tree algorithms had their own advantages: the CRT model had the highest sensitivity and the best ability to identify patients who needed blood transfusion, while the XGBoost model had the highest specificity and accuracy and the best ability to distinguish transfusion from non-transfusion. When predicting with all the parameters, the XGBoost model was the best and had the best ability to identify blood transfusion, while the CRT model had the best ability to distinguish transfusion from non-transfusion. The results showed that the more parameters there are, the more prominent the advantages of the decision tree model. The non-invasive parameters can be obtained quickly once trauma patients reach medical resources, and the decision tree model can use these input data to quickly return a result on whether the patient needs blood transfusion. Although the prediction performance is slightly lower than with all parameters, the time advantage is considerable. Moreover, trauma is accompanied by changes in blood loss and fluid volume, and vital signs are complex and changeable. The detection time of invasive parameters is approximately 1 hour, so when the results are obtained, they no longer reflect the current physiological state of the patient. By contrast, non-invasive parameters can be obtained at any time and reflect the vital signs of the patient at that moment, so the model can be used to predict at any time, which is convenient for clinical application. When all the parameters are considered, the blood transfusion decisions made by clinicians based on experience are often not accurate. When as many data and variables as possible are covered and part of the data is used as a training set, the machine learning method can, on the basis of learning from the experience of clinicians, give doctors more accurate and quantitative decision support for blood transfusion in trauma patients.

Trauma treatment should account for the mechanism of the trauma (open trauma or blunt injury), the location of the trauma (head, chest, etc.), pre-hospital resources, hospital emergency room settings (I, II, etc.) and trauma centre facilities (immediate detection equipment and resources)[21]. Similarly, this study found that when predicting with non-invasive parameters, the trauma location and SI had the greatest impact on blood transfusion. The model established by combining age, sex, pre-hospital SI, admission HR, Hb and SpO2 can better predict blood transfusion 3 hours before admission[22]. The post-traumatic SI is important in assessing the need for blood transfusion and can predict the demand for massive blood transfusion, laparotomy and mortality[23]. The SI is more sensitive than the ABC score in predicting traumatic massive blood transfusion[24].

Among the predictive variables of all parameters, Hct/Hb had a great influence on blood transfusion in the three models. Consistent with our study, many models or scoring systems use Hct/Hb as the main parameter for the prediction of traumatic massive blood transfusion[12, 13, 25], which is also consistent with the recommendation that repeated Hb test results should be used as a laboratory indicator of bleeding[5]. Different models have different parameters that affect whether a blood transfusion is carried out. The LR model judges the influence of variables on blood transfusion by the odds ratio (OR), and the results are generally recognized clinically. In addition to trauma site and Hct/Hb, vasoactive drugs, PLT, PTA, and Fib were risk factors for blood transfusion demand. One study found that the use of vasoactive drugs can improve vital signs[26], and early routine medication can improve the effective rate of treatment of patients with severe trauma. Coagulation dysfunction easily occurs in the early stage of trauma, and the coagulation indices (PLT, PTA, Fib) affect the demand for blood transfusion[16, 27]. In the process of building the CRT model, the variables corresponding to the root nodes are the most important, followed by the leaf nodes, which split in turn[20]. In addition to Hct/Hb and Fib, CRP is an important variable for predicting blood transfusion. Because CRP is an indicator of body stress, CRP increases after trauma, which can reflect the trauma severity[28]. In the process of establishing the XGBoost model, the more times the nodes are traversed, the more important the variables corresponding to the nodes are. The importance of variables is mathematically derived, and whether they have clinical guiding value needs to be comprehensively analysed in combination with clinical experience.

With the progress of science and technology, artificial intelligence methods have been widely used in the field of medicine[29-32]. There is considerable research on machine learning methods in trauma[33-35]. There has been considerable research on the prediction of massive blood transfusion, and the prediction accuracy of the decision tree algorithm is 0.695-0.814[36, 37]. Machine learning (mostly neural networks) has been used in a large number of studies to predict the prognosis of trauma. Most studies have proven the benefits of machine learning methods, and the sensitivity-specificity difference ranges from 0.035 to 0.927[38]. The neural network algorithm accuracy (98.7%) and specificity (51.5%) were the highest in predicting the survival rate of trauma patients[39].

Our research compares the traditional statistical methods with the machine learning decision tree algorithm, and the decision tree algorithm has outstanding advantages. (1) Most of the data in the real world are incomplete (missing key indicators) and noisy (numerical errors/anomalies). Artificial intelligence can allow cases with missing data or outliers to be retained by interpolation and other methods. The larger the number of cases, the more meaningful the statistical results. (2) The XGBoost algorithm is widely used in medicine, and the prediction performance is good[40, 41]. (3) The model can reconstruct more effective features from the training process of blood transfusion big data, which can be used to predict the blood transfusion volume of patients, giving the model stronger generalization ability and reducing overfitting. (4) The model is trained on the difference between the prediction results and the training data, so with the gradual increase in the data quantity the accuracy improves in the iterative process, which ensures the incremental learning characteristics of the model. (5) Currently, doctors commonly make blood transfusion decisions by combining various physiological parameters, symptoms and clinical experience. Our research uses a large quantity of historical data as a reference on the basis of doctors' rich clinical experience, establishes a mathematical model, and adjusts the output over multiple experiments to obtain the best results. It has more practical value for primary hospitals or inexperienced doctors. In the future, with the increase in the data quantity, the model can be optimized by self-learning, and the prediction performance will continuously improve.

The artificial intelligence mathematical model we constructed can be transformed into intelligent prediction software, which can be connected with ambulances and doctors' working computers and can be widely used in clinics as an auxiliary tool to provide blood transfusion decision support for clinicians. The prediction model we constructed has wide applicability, and it can be retrained on data from other medical institutions and applied to clinical practice. In the future, we can work with multiple medical centres to verify the predictive performance and universal applicability of the model.

Limitations of the study

The study data are available from the authors upon reasonable request and with permission from the Chinese National Engineering Laboratory for Medical Big Data Application Technology. Therefore, the database is not completely open, and the dataset cannot be disclosed. The mathematical model constructed with the artificial intelligence method is limited by the fact that the data quantity is not large enough, and the accuracy of the model needs to be improved, but with the increase in the data quantity and the continuous optimization of the model, the prediction accuracy of the model will gradually improve. The variables extracted from unstructured text information are limited and did not improve the performance of the model, so how to use the effective information to improve the prediction performance of the model is the direction of our future research. Some of the patients in our trauma database were transferred to our hospital from primary hospitals after emergency treatment (including blood transfusion), so the number of patients requiring emergency massive blood transfusion was relatively small, but this does not affect the establishment and application of the model. Our model can make decisions on whether a transfusion is needed based on changing, real-time vital signs and laboratory data in the process of trauma development. With large blood loss after trauma, complications such as hypothermia, acidosis and coagulation dysfunction easily occur, and the amount of plasma and platelet transfusion has an effect on the red blood cell demand. However, our model includes indicators that reflect these conditions, so the effects of these complications and blood components on erythrocyte demand have been considered.

Conclusions

The traditional LR has the best classification ability when using non-invasive parameter prediction in the intelligent evaluation of post-traumatic blood transfusion demand, but it is only suitable for data analysis and cannot be used in clinical applications. The classification performance of the intelligent prediction model constructed by the decision tree algorithm is not inferior to that of the traditional LR method. With the increase in data quantity, the accuracy of the model improves in the iteration process, and the prediction performance continuously improves, which is conducive to clinical application and wide promotion.

Declarations

Acknowledgements

The authors thank Tanshi Li's team in the emergency department and Wanguo Xue's team in the big data centre for their help and support. The authors thank Chunya Ma for helpful discussions or comments.

Authors’ contributions

Yannan Feng performed the data analysis, and wrote and submitted the manuscript. Zhenhua Xu analysed the data and visualized the results. Junting Liu and Xiaolin Sun collected the data. Deqing Wang designed the study and contributed to the preparation of the manuscript. Yang Yu contributed to the design of the study, data analysis and revision of the manuscript. All authors read and approved the final manuscript.

Funding

The authors wish to thank the Key Project-subtopic of the "13th Five-year Plan" Military Logistics Service Research of China (Grant No. BWS16J006), China, for financially supporting this study.

Availability of data and materials

All authors had full access to all the data in the study.

Ethics approval and consent to participate

The Medical Ethics Committee of the Chinese PLA General Hospital approved the study and waived the requirement for written informed consent.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no conflicts of interest.

Author details

1Department of Blood Transfusion, Chinese PLA General Hospital, Beijing, 100853, China;
2Beijing hexing chuanglian Health Technology Co., Ltd, Beijing, 100000, China.

Abbreviations

CRT: classification and regression tree, XGBoost: eXtreme gradient boosting, LR: logistic regression, AUC: area under the curve, PLA: People's Liberation Army, HR: heart rate, R: respiration, SBP: systolic blood pressure, DBP: diastolic blood pressure, SpO2: blood oxygen saturation, T: temperature, SI: shock index, Hb: haemoglobin, Hct: haematocrit, PLT: platelet count, CRP: C-reactive protein, IL-6: interleukin-6, PT: prothrombin time, APTT: activated partial thromboplastin time, INR: international normalized ratio, PTA: prothrombin activity, Fib: fibrinogen, PH: potential of hydrogen, PO2: partial pressure of oxygen, PCO2: partial pressure of carbon dioxide, TCO2: total carbon dioxide, Lac: lactate, AB: actual bicarbonate, SB: standard bicarbonate, K: potassium, RBC: volume of red blood cell transfusion, 24h RBC: the volume of 24-hour red blood cell transfusion, VD: vasoactive drugs, Trauma_loc: trauma site.

Tables

Table 1 Univariate analysis of predicting factors for transfusion

| Variable | N (%) | No-transfusion (N=1183), Mean (SD) / Median [Range] | Transfusion (N=188), Mean (SD) / Median [Range] | P-value |
|---|---|---|---|---|
| Age* | 1371 (100%) | 44.00 [29.00, 56.50] | 42.00 [28.00, 54.25] | 0.049 |
| Sex | | | | 0.468 |
| Female | 324 (23.6%) | 284 (24.01) | 40 (21.28) | |
| Male | 1047 (76.4%) | 899 (75.99) | 148 (78.72) | |
| Height (cm) | 488 (35.6%) | 170.00 [164.00, 175.00] | 170.00 [163.50, 173.25] | 0.242 |
| Weight (kg) | 477 (34.8%) | 68.00 [60.00, 75.00] | 67.75 [60.00, 76.00] | 0.541 |
| Non-invasive parameters | | | | |
| HR** | 785 (57.3%) | 96.95 (24.31) | 103.55 (25.87) | <0.001 |
| R | 786 (57.3%) | 21.00 [19.00, 23.00] | 21.00 [19.00, 26.00] | 0.071 |
| SBP (mmHg)** | 787 (57.4%) | 124.32 (25.28) | 117.65 (27.46) | <0.001 |
| DBP (mmHg)** | 787 (57.4%) | 77.58 (15.96) | 74.62 (17.83) | <0.001 |
| SpO2 (%) | 783 (57.1%) | 98.00 [96.00, 99.00] | 98.00 [96.00, 99.00] | 0.113 |
| SI** | 785 (57.3%) | 0.82 (0.29) | 0.95 (0.43) | <0.001 |
| T (°C) | 637 (46.5%) | 37.00 [36.80, 37.30] | 37.00 [36.70, 37.30] | 0.389 |
| Invasive detection parameters | | | | |
| Hb (g/L)** | 1287 (93.9%) | 126.00 [107.00, 143.00] | 107.00 [82.00, 135.00] | <0.001 |
| Hct (L/L)** | 1300 (94.8%) | 3.80 [0.50, 22.00] | 0.46 [0.25, 4.00] | <0.001 |
| PLT (10^9/L)** | 1225 (98.4%) | 216.81 (94.70) | 201.76 (99.53) | <0.001 |
| CRP (mg/L) | 1075 (78.4%) | 0.95 [0.10, 4.87] | 0.41 [0.10, 3.41] | 0.806 |
| IL-6 (pg/ml) | 581 (42.4%) | 182.37 (380.08) | 219.26 (388.41) | 0.412 |
| PT (s)** | 1230 (89.7%) | 14.70 [14.00, 16.00] | 15.40 [14.20, 17.08] | <0.001 |
| APTT (s)** | 1227 (89.5%) | 37.03 (10.39) | 38.50 (12.86) | <0.001 |
| INR | 1224 (89.3%) | 15.40 [14.60, 16.40] | 15.60 [14.60, 16.60] | 0.698 |
| PTA (%)** | 1230 (89.7%) | 80.00 [68.00, 89.00] | 73.50 [61.25, 85.00] | <0.001 |
| Fib (g/L)** | 1216 (88.7%) | 3.15 (1.76) | 2.72 (1.51) | <0.001 |
| PH** | 1230 (89.7%) | 1.16 [1.08, 1.28] | 1.22 [1.11, 1.39] | <0.001 |
| PO2 (mmHg)** | 1123 (81.9%) | 120.73 (62.16) | 134.12 (73.88) | 0.001 |
| PCO2 (mmHg) | 1124 (82.0%) | 37.00 [33.00, 41.00] | 37.00 [32.00, 41.00] | 0.116 |
| TCO2 (mmol/L)** | 1121 (81.8%) | 24.01 (4.38) | 22.66 (4.85) | <0.001 |
| Lac (mmol/L)** | 1122 (81.8%) | 7.41 [7.37, 7.45] | 7.39 [7.35, 7.43] | <0.001 |
| AB (mmol/L)** | 1124 (82.0%) | 22.88 (4.24) | 21.54 (4.73) | <0.001 |
| SB (mmol/L)** | 1122 (81.8%) | 23.69 (3.56) | 22.39 (4.31) | <0.001 |
| K (mmol/L)** | 1278 (93.2%) | 3.88 [3.56, 4.10] | 3.90 [3.60, 4.24] | 0.008 |
| Endotracheal intubation* | | | | 0.011 |
| No | 1148 (83.7%) | 1003 (84.78) | 145 (77.13) | |
| Yes | 223 (16.3%) | 180 (15.22) | 43 (22.87) | |
| Vasoactive drugs** | | | | <0.001 |
| No | 1264 (92.2%) | 1113 (94.08) | 151 (80.32) | |
| Yes | 107 (7.8%) | 70 (5.92) | 37 (19.68) | |
| Trauma site** | | | | <0.001 |
| Upper extremity | 24 (1.8%) | 21 (1.87) | 3 (1.66) | |
| Lower extremity | 49 (3.6%) | 42 (3.73) | 7 (3.87) | |
| Head and neck | 414 (30.2%) | 386 (34.28) | 28 (15.47) | |
| Chest and abdomen | 639 (46.7%) | 538 (47.78) | 101 (55.8) | |
| Spine | 75 (5.5%) | 62 (5.51) | 13 (7.18) | |
| Trunk | 27 (2.0%) | 20 (1.78) | 7 (3.87) | |
| Pelvis | 79 (5.8%) | 57 (5.06) | 22 (12.15) | |
| Trauma severity classification | | | | 0.529 |
| First level | 1160 (84.6%) | 997 (84.42) | 163 (86.7) | |
| Second level | 204 (14.9%) | 179 (15.16) | 25 (13.3) | |
| Third level | 5 (0.4%) | 5 (0.42) | 0 | |
| Trauma type | | | | 0.921 |
| Open trauma | 867 (63.2%) | 747 (63.14) | 120 (63.83) | |
| Blunt injury | 504 (36.8%) | 436 (36.86) | 68 (36.17) | |
| RBC (U)** | 1371 (100%) | 0.00 | 2.00 [0.00, 4.00] | <0.001 |
| 24h RBC (U)** | 1371 (100%) | 0.00 | 2.00 [0.00, 4.00] | <0.001 |
| Emergency department time (h)** | 1371 (100%) | 23.58 (33.69) | 25.61 (37.80) | 0.001 |

Remarks: HR: heart rate, R: respiration, SBP: systolic blood pressure, DBP: diastolic blood pressure, SpO2: blood oxygen saturation, T: temperature, SI: shock index, Hb: haemoglobin, Hct: haematocrit, PLT: platelet count, CRP: C-reactive protein, IL-6: interleukin-6, PT: prothrombin time, APTT: activated partial thromboplastin time, INR: international normalized ratio, PTA: prothrombin activity, Fib: fibrinogen, PH: potential of hydrogen (pH), PO2: partial pressure of oxygen, PCO2: partial pressure of carbon dioxide, TCO2: total carbon dioxide, SPO2: oxygen saturation, Lac: lactate, AB: actual bicarbonate, SB: standard bicarbonate, K: potassium, RBC: volume of red blood cell transfusion, 24h RBC: volume of 24-hour red blood cell transfusion, N: number.

Differences were considered significant at P<0.05. *: P<0.05, **: P<0.01.
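
The shock index (SI) listed among the non-invasive parameters is conventionally defined as heart rate divided by systolic blood pressure. The minimal Python sketch below assumes that this definition was used here, since the table itself does not state it; the function name `shock_index` is illustrative only. It applies the formula to the group means of HR and SBP from Table 1. The result need not equal the reported mean SI, because a mean of per-patient ratios generally differs from a ratio of means.

```python
def shock_index(heart_rate_bpm: float, systolic_bp_mmhg: float) -> float:
    """Shock index = heart rate / systolic blood pressure (assumed definition)."""
    if systolic_bp_mmhg <= 0:
        raise ValueError("systolic blood pressure must be positive")
    return heart_rate_bpm / systolic_bp_mmhg


# Group means of HR and SBP transcribed from Table 1.
si_no_transfusion = shock_index(96.95, 124.32)
si_transfusion = shock_index(103.55, 117.65)

print(f"no transfusion: {si_no_transfusion:.2f} (reported mean SI 0.82)")
print(f"transfusion:    {si_transfusion:.2f} (reported mean SI 0.95)")
```

In both groups the ratio of means is lower than the reported mean SI, but the direction of the difference between groups is the same.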

Table 2 Comparison between LR, CRT and the XGBoost model in predicting blood transfusion

| Parameters type | Methods | AUC | Sensitivity | Specificity | Accuracy | Youden Index | P-value |
|---|---|---|---|---|---|---|---|
| Non-invasive parameters | XGBoost | 0.705 | 0.66 | 0.77 | 0.75 | 0.19 | <0.001 |
| | Logistic | 0.716 | 0.86 | 0.50 | 0.55 | 0.12 | |
| | CRT | 0.692 | 0.89 | 0.42 | 0.48 | 0.16 | |
| All parameters | XGBoost | 0.937 | 0.94 | 0.82 | 0.83 | 0.10 | <0.001 |
| | Logistic | 0.797 | 0.80 | 0.70 | 0.72 | 0.12 | |
| | CRT | 0.816 | 0.69 | 0.92 | 0.89 | 0.09 | |

Remarks: AUC: area under the curve; CRT: classification and regression tree.
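
For reference, the Youden index is conventionally defined as J = sensitivity + specificity - 1, and overall accuracy can be written as prevalence × sensitivity + (1 - prevalence) × specificity. The Python sketch below applies both formulas to the sensitivity and specificity values in Table 2. It assumes, without confirmation from the source, that the evaluation set had the same transfusion prevalence as the whole cohort (188 of 1,371 patients, from Table 1); the function names are illustrative. Under the conventional formula, J does not reproduce the "Youden Index" column, which may instead report the probability cut-off chosen by maximising J; that reading is also an assumption.

```python
# Group sizes from Table 1; using them as the evaluation-set prevalence is an assumption.
N_TRANSFUSED, N_TOTAL = 188, 1371
PREVALENCE = N_TRANSFUSED / N_TOTAL

# (sensitivity, specificity, reported accuracy) transcribed from Table 2.
TABLE2 = {
    ("non-invasive", "XGBoost"): (0.66, 0.77, 0.75),
    ("non-invasive", "Logistic"): (0.86, 0.50, 0.55),
    ("non-invasive", "CRT"): (0.89, 0.42, 0.48),
    ("all", "XGBoost"): (0.94, 0.82, 0.83),
    ("all", "Logistic"): (0.80, 0.70, 0.72),
    ("all", "CRT"): (0.69, 0.92, 0.89),
}


def youden_j(sensitivity: float, specificity: float) -> float:
    """Conventional Youden index J = sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def implied_accuracy(sensitivity: float, specificity: float,
                     prevalence: float = PREVALENCE) -> float:
    """Accuracy implied by sensitivity, specificity and class prevalence."""
    return prevalence * sensitivity + (1.0 - prevalence) * specificity


for (params, method), (se, sp, acc) in TABLE2.items():
    print(f"{params:13s} {method:9s} J={youden_j(se, sp):.2f} "
          f"implied accuracy={implied_accuracy(se, sp):.3f} (reported {acc:.2f})")
```

Under the stated prevalence assumption, the implied accuracies agree with the reported Accuracy column to within about 0.01, which suggests that column is internally consistent with the sensitivity and specificity values.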

Table 3 Binary logistic regression analysis for predicting transfusion with all (non-invasive + invasive) parameters

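| Variable | OR | 2.5% | 97.5% | P-value |
|---|---|---|---|---|
| SB | 0.898 | 0.844 | 0.957 | 0.001 |
| Hct | 0.923 | 0.899 | 0.948 | <0.001 |
| VD | 2.039 | 1.092 | 3.808 | 0.025 |
| Trauma site | 7.961 | 1.422 | 44.567 | 0.018 |
| PTA | 0.975 | 0.964 | 0.988 | <0.001 |
| SpO2 | 1.023 | 0.977 | 1.071 | 0.323 |
| PLT | 0.995 | 0.992 | 0.998 | 0.001 |
| Fib | 0.789 | 0.674 | 0.924 | 0.003 |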

Remarks: SB: standard bicarbonate, Hct: haematocrit, VD: vasoactive drugs, PTA: prothrombin activity, SpO2: blood oxygen saturation, PLT: platelet count, Fib: fibrinogen, OR: odds ratio.
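
The odds ratios in Table 3 can be read back on the log-odds scale, where a logistic regression coefficient is the natural logarithm of the OR. The Python sketch below recovers an approximate standard error from the 2.5% and 97.5% bounds and a two-sided Wald P-value. It assumes the interval is a symmetric Wald interval on the log scale, which the source does not confirm (the software may have used profile-likelihood intervals), so the recovered values are approximate. All function names are illustrative.

```python
import math

Z_975 = 1.959964  # standard normal quantile for a 95% interval

# (OR, 2.5% bound, 97.5% bound) transcribed from Table 3.
TABLE3 = {
    "SB": (0.898, 0.844, 0.957),
    "Hct": (0.923, 0.899, 0.948),
    "VD": (2.039, 1.092, 3.808),
    "Trauma site": (7.961, 1.422, 44.567),
    "PTA": (0.975, 0.964, 0.988),
    "SpO2": (1.023, 0.977, 1.071),
    "PLT": (0.995, 0.992, 0.998),
    "Fib": (0.789, 0.674, 0.924),
}


def approx_se(lower: float, upper: float) -> float:
    """Standard error of log(OR), assuming a symmetric Wald interval."""
    return (math.log(upper) - math.log(lower)) / (2.0 * Z_975)


def wald_p(odds_ratio: float, lower: float, upper: float) -> float:
    """Two-sided Wald P-value for H0: OR = 1."""
    z = math.log(odds_ratio) / approx_se(lower, upper)
    return math.erfc(abs(z) / math.sqrt(2.0))


def excludes_one(lower: float, upper: float) -> bool:
    """True when the interval does not contain OR = 1."""
    return lower > 1.0 or upper < 1.0


def odds_multiplier(odds_ratio: float, delta: float) -> float:
    """Multiplicative change in odds for a change of `delta` units."""
    return odds_ratio ** delta


for name, (or_, lo, hi) in TABLE3.items():
    print(f"{name:12s} beta={math.log(or_):+.3f} "
          f"p~{wald_p(or_, lo, hi):.3f} CI excludes 1: {excludes_one(lo, hi)}")
```

For example, `odds_multiplier(0.923, -10)` gives the factor by which the odds of transfusion change for a 10-unit decrease in Hct, holding the other variables fixed. Of the eight variables, only the SpO2 interval contains 1, which matches its non-significant P-value in the table.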