Study design
We performed a retrospective cross-sectional observational study based on clinical data collected by the data warehouse of a single tertiary center.
Setting and study population
We included all patients older than 16 years admitted at the Emergency Department of Lausanne University Hospital (CHUV) for UGIB. Lausanne University Hospital is a tertiary hospital in Western Switzerland with around 65,000 ED visit per year.
UGIB was identified by symptoms at ED admission: hematemesis, melena, hematochezia, or other symptoms associated with ED final diagnosis of gastrointestinal bleeding (syncope, hypotension, anemia, hemorrhagic shock, or asthenia) between January 1, 2015 and December 31, 2019. Exclusion criteria were pregnancy, patients aged less than 16 years old and patients refusing authorization to analyze their data.
Data collection
We extracted all variables from the hospital data warehouse collecting data from medical recorded files, administrative files, diagnosis and surgery coding databases. We collected demographic data (age, sex), date and time of admission, duration of hospitalization, past medical history, physiological data at admission (blood pressure, heart rate, respiration rate, level of consciousness according to the AVPU scale), surgical interventions (type), endoscopic interventions, need for blood transfusions, level of triage priority, and final diagnoses. We also collected laboratory data (hemoglobin, lactate, excess base, urea, albumin, blood transaminases, prothrombin time and INR (International Normalized Ratio), platelet count, plasma fibrinogen, aPTT (activated prothromboplastin time) and therapeutics applied (norepinephrine, blood products, tranexamic acid, octreotide, esomeprazole).
Clinical scores compared
The Rockall score (RS) predicts mortality and was developed in 1996 from a study of 4,185 patients with UGIB in the UK during the period from 1993-1996 [6]. Since the full score requires endoscopic findings, initial application for risk stratification is limited. Adaptation of the Rockall score, based only on "pre-endoscopic" clinical data (PERS) also predicted in-hospital death and allowed early risk stratification. Considering the aim of this study, only PERS will be analyzed. PERS is obtained from 3 clinical variables: age at presentation, signs of shock, and comorbidities (as congestive heart failure, ischemic heart disease, any major comorbidity, renal failure, liver failure and disseminated malignancy). The minimum value of the score is 0, the maximum is 7.
The AIMS-65 score predicts mortality and was developed in 2011 in the USA, based on a retrospective research conducted on 29,222 patients admitted for UIGB between 2004 and 2005 in 187 hospitals. The same authors externally validated the AIMS-65 one year later on 32,504 patients, from the same national database used for development. AIMS-65 includes 5 clinical or laboratory variables: age at presentation, albumin, INR, alteration in mental status and systolic blood pressure. This score has a narrow score spectrum: minimum score is 0, maximum is 5 [7].
The Glasgow-Blatchford Score (GBS) was developed in 2000, based on 1,748 patients with the objective of identify a patient’s need for intervention in the UK (defined as blood transfusion, endoscopic treatment, or surgery).
GBS includes 8 clinical or laboratory variables: blood urea nitrogen, hemoglobin (adapted to sex), systolic blood pressure, heart rate, presentation with melena, presentation with syncope, presence of a hepatic disease (known history or clinical/laboratory evidence) and presence of cardiac disease (known history or clinical/laboratory evidence). The minimum value of the score is 0, the maximum is 23 [8]. According to the European Society of Gastrointestinal Endoscopy (ESGE), patients with a GBS score of 0-1 are considered to be at a very low risk and do not require early endoscopy nor hospital admission; they can be managed as outpatients, informed of the risk of recurrent bleeding and be advised to maintain contact with the discharging hospital. [9]
The modified Glasgow-Blatchford Score (mGBS) predicts need for intervention and represents a simple version of the GBS by omitting anamnestic variables potentially requiring interpretation or subjective judgment which can increase the risk of bias, i.e.: syncope, hepatic disease and cardiac disease. As in mGBS, the minimum value is 0, the maximum is 16 [10].
Outcomes
The primary outcome was the need for intervention or death. It includes blood transfusion or endoscopic treatment or surgery or in-hospital death.
Our secondary outcome was in-hospital death.
Statistical analysis
All variables are presented as either mean with Standard Deviation (SD) or median with interquartile range, as appropriate. Qualitative variables are expressed as numbers and proportion (percentage).
We assessed overall performance, discrimination and calibration of the scores.
Overall performance of the models was assessed by the Brier score, quantifying the distance between the predicted outcome and the actual outcome. We scaled the Brier score by its maximum to standardize for low incidence. The scaled Brier score ranges from 0–100% and indicates the degree of error in prediction; a scaled Brier score of 0% shows a perfect performance.
Brier score was not estimated for the GBS and mGBS scores because authors did not report the predicted outcome.
The discrimination of a model relates to how well a prediction model can discriminate those with the outcome from those without the outcome. We estimated sensitivity, specificity, positive likelihood ratio (PLR) and negative likelihood ratio (NLR) for each threshold of the 4 scores.
The PLR is the ratio of sensitivity to 1-specificity. A PLR of 10 or above will result in a large increase in the probability of the outcome. The NLR is the ratio of 1-sensitivity to specificity. A NLR of 0.1 or less will result in a large decrease in the probability of the outcome.
The discrimination of the risk scores was compared by plotting the Receiver-Operating Characteristic (ROC) curve - which is true positives (sensitivity) on false positive (1-specificity) - and estimating the area under the receiver-operating characteristic (AUROC). The AUROC can be interpreted as the probability that a patient with the outcome is given a higher probability of the outcome by the model than a randomly chose patient without the outcome. For a binary outcome the concordance (C-statistic) is identical to the AUROC. A perfect model has a C-Statistic of 1.0.
Calibration relates to the agreement between observed outcomes and predicted outcome. Calibration in the large is the difference between the mean predicted and observed probabilities and the ratio of the predicted and observed number of events (P/O). We plotted a calibration graph for AIMS65 and Pre-Rockall score with observed probability of death on predicted probability of death by decile of score combining with a local polynomial regression (Loess algorithm).
We assessed the calibration intercept and calibration slope as a measure of the spread between predicted and observed probability of death. A perfect model has an intercept of zero and a slope of 1; indicating that predictions are neither too low nor too high [11]. We managed missing value using multiple imputation by chained equation (MICE). We fulfill 20 datasets with MICE. We included all variables with missing values needed to estimate the different scores (urea, hemoglobin, blood pressure, heart rate, albumin, INR, AVPU scale). Explanatory variables (death, surgical or endoscopic intervention and transfusion) were not missing. Missing values are reported for each variable in Table 1.
Table 1
– DEMOGRAPHIC / CLINICAL FEATURES
| N=1521 N (%) | Missing values, N (%) |
Male, N (%) | 940 (62%) | - |
Female, N (%) | 581 (38.2%) | - |
Median age, N (SD) | 68 (20.7) | - |
16 – 24 25 – 44 45 – 64 65 – 84 > 85 | 91 (6%) 356 (23%) 356 (23%) 610 (40%) 229 (15%) | - |
TAS, M (SD) ¬ < 90 | 126 (21.45) 27 (1.8%) | 208 (13.6%) |
FC, M (SD) ¬ > 100 | 84.5 (17.12) 209 (13.7%) | 206 (13.5%) |
Hemoglobin (g/dl), M (SD) | 107 (32.18) | 25 (1.6%) |
INR, M (SD) | 1 (0.5) | 215 (14.1%) |
Creatinine (g/dl), M (SD) | 107 (106.9) | 39 (2.5%) |
Level of triage priority (1-4), N (%) 1 2 3 4 | 66 (4.72%) 569 (40.7%) 760 (54.3%) 4 (0.3%) | 122 (8%) |
PRESENTING SYMPTOMS | | |
Melena, N (%) | 693 (49%) | |
Hematemesis and melena, N (%) | 1020 (73%) | |
Syncope, N (%) | 173 (12.2%) | |
Hematochezia, N (%) | 89 (6.4%) | |
COMORBIDITIES | | |
Hepatic failure, N (%) | 308 (22%) | |
Cardiac Failure, N (%) ¬ Coronary ins. ¬ Valvular ins. ¬ Rhythm Ins. | 250 (18%) 167 (12%) 236 (17%) | |
Diabetes, N (%) | 244 (17%) | |
Hypertension, N (%) | 594 (42%) | |
Renal Ins, N (%) | 280 (20%) | |
Cancer, N (%) | 302 (21%) | |
All analyses were performed using STATA software (version 16.0; Stata Corp, College Station, TX, USA).
Ethics
The study was approved by the Ethics committee of the Canton Vaud (CER-VD) (project-ID: 2020-00515).