Repeated scoring with Adult Appendicitis Score improves the sensitivity and the specificity of appendicitis diagnosis in patients with early equivocal signs of appendicitis: A secondary analysis

doi:10.21203/rs.3.rs-4445338/v1

Research Article

https://doi.org/10.21203/rs.3.rs-4445338/v1

This work is licensed under a CC BY 4.0 License

Background

The use of computed tomography at the early stage of acute appendicitis can lead to overdiagnosis and expose patients unnecessarily to ionising radiation. The Adult Appendicitis Score (AAS) can be used to select patients for imaging. Observation and re-scoring in the DIAMOND trial reduced the need for imaging. In this study, we aimed to determine whether the change in AAS (ΔAAS) can be used as a diagnostic tool to select patients for imaging even more precisely.

Methods

Eighty-eight patients with early equivocal appendicitis entered the observation arm in the DIAMOND trial. The data of these patients were reanalysed, and ΔAAS during the observation was calculated. The baseline AAS, final AAS, and the CRP change (ΔCRP) were selected as reference standards.

Results

Eighty-three patients with complete data were analysed. The AUROC values were as follows: ΔAAS 0.932 (95%CI 0.868–0.996), baseline AAS 0.629 (95%CI 0.498–0.760), final AAS 0.936 (95%CI 0.886–0.987), and ΔCRP 0.796 (95%CI 0.696–0.897). From the receiver operating characteristic curves, we identified the limits for low (ΔAAS ≤ -2), intermediate (ΔAAS -1 to 0), and high (ΔAAS ≥ 1) probability of appendicitis. The negative predictive value of the low probability group and the positive predictive value of the high probability group for acute appendicitis were 97% and 94%, respectively.

Conclusions

Patients with equivocal signs of appendicitis could benefit from short observation and calculation of ΔAAS to reduce overdiagnosis and exposure to excessive imaging.

Trial registration

The DIAMOND trial was originally registered in ClinicalTrials.gov (NCT02742402) on April 7th, 2016 and approved by the institutional review board and the ethical committee of Helsinki University Hospital (reference number 27/13/03/02/2016).

Keywords: Appendicitis, Early appendicitis, General surgery, Acute care surgery, Diagnostic scoring, Adult Appendicitis Score

Background

Acute appendicitis is one of the most common abdominal emergencies worldwide, and patients with suspected acute appendicitis are even more numerous. Historically, the diagnosis of appendicitis relied solely on patient history and clinical examination, which remain the basis of the diagnosis. While clear signs of appendicitis are usually easily recognised, equivocal symptoms often prove the most problematic. In addition to laboratory tests, such as inflammatory markers, abdominal imaging is commonly used. In 2010, Raja et al.¹ concluded that the routine use of computed tomography (CT) in patients suspected of having appendicitis reduced the rate of negative appendectomy from 23.0% to 1.7%. However, because of the known adverse effects² of ionising radiation, the often limited availability of imaging services, and the risk of overdiagnosing appendicitis³,⁴, the extensive use of CT cannot be recommended. Observation should remain an alternative to imaging when managing these patients. Observation and repeated evaluation of inflammatory markers and clinical signs have been little studied. An observational study⁵ of 420 patients, in which patients were re-evaluated after a median of 6 hours, showed that the inflammatory response remained high or increased during the observation in patients with acute appendicitis and decreased among patients without appendicitis.

Several scoring systems⁶ have been introduced to improve the diagnostics of acute appendicitis, especially to avoid over-treatment. These models use patient history, symptoms, clinical signs, and laboratory results to form a score to divide patients into different risk groups. This risk assessment guides the clinicians in deciding whether a patient would benefit from abdominal imaging or if the treatment plan can be made without it. Adult Appendicitis Score (AAS) divides patients into groups of low, intermediate, and high risk for acute appendicitis.⁷ Patients in the intermediate risk group usually require imaging studies to confirm the diagnosis, whereas high-score patients can be scheduled for surgery without further investigations.

We have shown in the DIAMOND trial⁸ that observing patients with intermediate-risk AAS for six to eight hours is safe and effective. At the beginning of the trial, patients were randomly assigned to two groups: imaging and observation. The observation protocol reduced the need for both imaging and surgery for acute appendicitis. Observation revealed that 15 per cent of patients had spontaneously resolving appendicitis requiring no treatment. In addition, with the observation protocol, 55 per cent of patients could be managed without abdominal imaging. The observation protocol did not increase the number of complicated appendicitis cases or negative appendectomies compared with the imaging group.

To our knowledge, the diagnostic performance of AAS change over time has yet to be studied. In this study, our objective was to explore the change in AAS during observation and to determine whether it could be a valuable tool for managing patients with early equivocal appendicitis and for reducing the need for diagnostic imaging even further.

Methods

The patients were recruited to the DIAMOND trial from May 3, 2016, to March 9, 2020. The prospective trial was initially registered in ClinicalTrials.gov (NCT02742402) and approved by the institutional review board and the ethical committee of Helsinki University Hospital (reference number 27/13/03/02/2016). No additional approval was required for this secondary analysis. The updated STARD statement⁹ was adhered to in the present study.

Before inclusion in the study, AAS was calculated for the patients. AAS ranges from 0 to 23, where 0 to 10 denotes low risk, 11 to 15 intermediate risk, and 16 to 23 high risk. The inclusion criteria were adults (18 years or older) suspected of acute appendicitis in the intermediate-risk AAS group with symptom duration of less than 24 hours and a C-reactive protein (CRP) level below 100 mg/L. The exclusion criteria were pregnancy, antibiotic medication during the previous 24 hours, suspicion of some other illness requiring immediate intervention, prior participation in this study, and absence of written consent. The details concerning the recruitment are described in the original article.⁸
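The score bands and the numeric inclusion criteria above can be written down as a short sketch. The function names `aas_risk_group` and `meets_inclusion_criteria` are hypothetical and chosen only for illustration; the exclusion criteria (pregnancy, recent antibiotics, and so on) are deliberately not modelled.

```python
def aas_risk_group(score: int) -> str:
    """Map an Adult Appendicitis Score (0-23) to the risk band given in the text."""
    if not 0 <= score <= 23:
        raise ValueError("AAS must be between 0 and 23")
    if score <= 10:
        return "low"
    if score <= 15:
        return "intermediate"
    return "high"


def meets_inclusion_criteria(age_years: int, aas: int,
                             symptom_hours: float, crp_mg_per_l: float) -> bool:
    """Check only the numeric inclusion criteria; exclusion criteria are not modelled."""
    return (
        age_years >= 18
        and aas_risk_group(aas) == "intermediate"
        and symptom_hours < 24
        and crp_mg_per_l < 100
    )


# Illustrative patient built from the median values in Table 1.
print(aas_risk_group(13))                           # intermediate
print(meets_inclusion_criteria(29, 13, 12.7, 6.0))  # True
```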

The non-consecutive participants were randomly allocated in a 1:1 ratio into two arms: imaging or observation. The present study is a secondary analysis of patients in the observation arm. The clinical evaluation and laboratory testing were repeated in the observation arm after six to eight hours, and a new AAS was calculated based on these variables. Patients with declining scores were discharged from the emergency department if no other illness was suspected. A score that did not decrease and remained below 16 led to abdominal imaging. An AAS of 16 or higher indicated a high risk of acute appendicitis, and an emergency appendectomy was scheduled without imaging. The diagnosis was confirmed with a histopathological analysis of all removed appendices. Complicated appendicitis was defined as perforated appendicitis or abscess.
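As a reading aid, the re-scoring rule of the observation arm can be sketched as a small decision function. The function name and the returned strings are hypothetical; the order of the checks is an assumption that does not change the outcome here, because every enrolled patient started with a baseline AAS of 11 to 15.

```python
def action_after_observation(baseline_aas: int, final_aas: int) -> str:
    """Return the protocol action after the six-to-eight-hour re-scoring."""
    if final_aas >= 16:
        # High-risk score: surgery is scheduled without imaging.
        return "emergency appendectomy without imaging"
    if final_aas < baseline_aas:
        # Declining score: discharge, provided no other illness is suspected.
        return "discharge"
    # Score unchanged or higher, but still below 16.
    return "abdominal imaging"


print(action_after_observation(13, 11))  # discharge
print(action_after_observation(13, 13))  # abdominal imaging
print(action_after_observation(13, 17))  # emergency appendectomy without imaging
```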

Our data recorded the AAS at the beginning (baseline AAS) and after the observation (final AAS). ΔAAS (AAS change, i.e., final AAS minus baseline AAS) is the index test. The baseline AAS, final AAS, and ΔCRP (CRP change, i.e., final CRP minus baseline CRP) were selected as reference standards. AAS is routinely used when managing patients with suspected appendicitis in our hospital. ΔCRP expresses the change in the inflammatory response during the observation period.

Statistical analysis

Sensitivity, specificity, positive and negative likelihood ratios (+LR and -LR), and diagnostic odds ratio (DOR) at different cut-off values of ΔAAS were calculated. A receiver operating characteristic (ROC) curve was formed using the variables of interest, and the area under the ROC curve (AUROC) was calculated. The optimal cut-off points for ΔAAS were defined according to the -LR, DOR, sensitivity and specificity values, and the ROC curve. These cut-off points categorised patients into three risk groups after observation. The cut-off points for the reference standards were also determined according to their respective ROC curves. McNemar's test was used to test whether the need for imaging could be reduced using the new risk assessment compared with the number of patients imaged during the trial. The statistical analysis was performed using SPSS® version 29 (IBM, Armonk, NY, USA).
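The measures listed above follow directly from a 2×2 table. The sketch below is not the SPSS procedure used in the study; it only restates the standard definitions. The example counts for the cut-off ΔAAS ≥ -1 are our reconstruction from the figures reported in the Results (48 of 83 patients with appendicitis, and 1 appendicitis among the 29 patients with ΔAAS ≤ -2), not counts stated explicitly by the authors.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard diagnostic accuracy measures from the four cells of a 2x2 table."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    # Likelihood ratios and DOR are undefined when a cell is zero; report infinity.
    pos_lr = sensitivity / (1 - specificity) if fp else float("inf")
    neg_lr = (1 - sensitivity) / specificity if tn else float("inf")
    dor = (tp * tn) / (fp * fn) if fp and fn else float("inf")
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "pos_lr": pos_lr,
        "neg_lr": neg_lr,
        "dor": dor,
    }


# Reconstructed counts for the cut-off delta-AAS >= -1 (an assumption, see lead-in):
# 47 of 48 appendicitis cases test positive, 28 of 35 non-cases test negative.
m = diagnostic_metrics(tp=47, fp=7, fn=1, tn=28)
for name, value in m.items():
    print(f"{name}: {value:.3f}")
```

With these counts the sketch gives a sensitivity of 97.9%, a specificity of 80.0%, +LR 4.90, -LR 0.03, and DOR 188, which agrees with the first column of Table 3.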

Results

Eighty-eight patients in the observation group received the allocated intervention in the DIAMOND trial. For the present study, four patients were excluded due to protocol violations, as we did not have their final AAS recorded. Also, one patient with radiologically diagnosed appendicitis treated nonoperatively was excluded as we did not have the histopathological confirmation of acute appendicitis. In the end, 83 patients were available for analysis in this study.

The median age of the patients was 29 (i.q.r. 25–38), and 43 (52%) were female. Table 1 describes the demographics and clinical characteristics of the patients.

Table 1

Demographics and clinical characteristics
Age, years*	29 (25–38)
Female	43 (52)
Duration of symptoms, hours†	12.7 (5.6)
Baseline CRP, mg/L*	6.0 (3.0–14.0)
Baseline WBC count, ×10⁹/L*	12.3 (10.0–15.0)
Baseline AAS*	13 (12–14)

Values in parentheses are percentages unless indicated otherwise; values are *median (i.q.r.) and †mean (s.d.). CRP = C-reactive protein, WBC = white blood cell, AAS = Adult Appendicitis Score.

Twenty-five (30%) patients had a final AAS of 16 or higher, and the decision to operate was made without imaging. All these patients had appendicitis; two of them had complicated appendicitis. Thirty-one (37%) patients underwent abdominal imaging after the re-scoring, resulting in 24 surgeries for acute appendicitis, one of which was a negative appendectomy. Twenty-seven patients (33%) were discharged without imaging. None of these patients returned due to appendicitis within the first 30 days. In the end, forty-eight (58%) patients were diagnosed with histologically confirmed appendicitis.

The receiver operating characteristic curves (Fig. 1) and the AUROC values (Table 2) show that both the ΔAAS and final AAS perform better in diagnosing acute appendicitis (ROC area values 0.932 (95%CI 0.868–0.996) and 0.936 (95%CI 0.886–0.987)) compared to baseline AAS and ΔCRP (ROC area values 0.629 (95%CI 0.498–0.760) and 0.796 (95%CI 0.696–0.897)).

Table 2

The values of AUROC for ΔAAS and the reference standards
	AUROC (95%CI)	SE	p
Baseline AAS	0.629 (0.498–0.760)	0.067	0.046
Final AAS	0.936 (0.886–0.987)	0.026	< 0.001
ΔAAS	0.932 (0.868–0.996)	0.033	< 0.001
ΔCRP	0.796 (0.696–0.897)	0.051	< 0.001

95% confidence intervals in parentheses. AUROC = area under the receiver operating characteristic curve, SE = standard error, AAS = Adult Appendicitis Score, ΔAAS = change in Adult Appendicitis Score, ΔCRP = change in C-reactive protein. P-values were determined with the Mann-Whitney U-test.
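The AUROC and the Mann-Whitney U statistic are closely linked: the AUROC equals U divided by the number of case/non-case pairs, that is, the probability that a randomly chosen patient with appendicitis has a higher value than a randomly chosen patient without, with ties counted as one half. A minimal sketch of that relationship follows. The ΔAAS values in the example are invented purely for illustration and are not study data.

```python
def auroc(values_cases: list, values_noncases: list) -> float:
    """AUROC as the normalised Mann-Whitney U statistic (ties count one half)."""
    u = 0.0
    for case_value in values_cases:
        for noncase_value in values_noncases:
            if case_value > noncase_value:
                u += 1.0
            elif case_value == noncase_value:
                u += 0.5
    return u / (len(values_cases) * len(values_noncases))


# Hypothetical delta-AAS values, NOT taken from the trial.
delta_aas_appendicitis = [2, 1, 0, 3]
delta_aas_no_appendicitis = [-3, -2, 0, -1]
print(auroc(delta_aas_appendicitis, delta_aas_no_appendicitis))  # 0.96875
```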

To identify patients with a low, intermediate, and high probability of appendicitis after observation, two cut-off values for ΔAAS were determined using the ROC curves: ΔAAS ≥ -1 and ΔAAS ≥ 1. For comparison, cut-off values were also determined for the reference standards: baseline AAS ≥ 14, final AAS ≥ 16, and ΔCRP ≥ 9. These cut-off values are indicated in Fig. 1. The sensitivity, specificity, likelihood ratios, and diagnostic odds ratios for acute appendicitis were calculated at each of these cut-off values of baseline AAS, final AAS, ΔAAS, and ΔCRP, as shown in Table 3.

Table 3

Defining the cut-off values for ΔAAS and the reference standards
	ΔAAS ≥ -1	ΔAAS ≥ 1	Baseline AAS ≥ 14	Final AAS ≥ 16	ΔCRP ≥ 9
Sensitivity	97.9% (89.1–99.6%)	70.8% (56.8–81.8%)	52.1% (38.3–65.5%)	52.1% (38.3–65.5%)	75.0% (61.2–85.1%)
Specificity	80.0% (64.1–90.0%)	94.3% (81.4–98.4%)	68.6% (52.0–81.4%)	100.0% (90.1–100.0%)	80.0% (64.1–90.0%)
+LR	4.90 (2.52–9.51)	12.40 (3.19–48.20)	1.66 (0.95–2.90)	infinity	3.75 (1.90–7.42)
-LR	0.03 (0.004–0.18)	0.31 (0.20–0.48)	0.70 (0.51–0.96)	0.48 (0.36–0.64)	0.31 (0.19–0.52)
DOR	188.00 (21.97–1608.95)	40.07 (8.45–190.14)	2.37 (0.95–5.90)	infinity	12.00 (4.18–34.46)

95% confidence intervals in parentheses. ΔAAS = change in Adult Appendicitis Score, AAS = Adult Appendicitis Score, ΔCRP = change in C-reactive protein, +LR = positive likelihood ratio, -LR = negative likelihood ratio, DOR = diagnostic odds ratio.
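The paper does not state how the confidence intervals for sensitivity and specificity were obtained. The sketch below uses the Wilson score interval, which is an assumption on our part; it happens to reproduce the intervals in the first column of Table 3 when applied to counts reconstructed from the reported results (47 of 48 appendicitis cases and 28 of 35 non-cases correctly classified at ΔAAS ≥ -1).

```python
import math


def wilson_interval(successes: int, total: int, z: float = 1.96) -> tuple:
    """Wilson score interval for a binomial proportion (z = 1.96 for about 95%)."""
    p = successes / total
    denom = 1 + z**2 / total
    centre = (p + z**2 / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return centre - half_width, centre + half_width


# Reconstructed counts for delta-AAS >= -1 (an assumption, see lead-in).
sens_low, sens_high = wilson_interval(47, 48)
spec_low, spec_high = wilson_interval(28, 35)
print(f"sensitivity 95% CI: {sens_low:.1%} to {sens_high:.1%}")
print(f"specificity 95% CI: {spec_low:.1%} to {spec_high:.1%}")
```

Under these assumptions the output is approximately 89.1% to 99.6% for sensitivity and 64.1% to 90.0% for specificity.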

Based on the chosen cut-off values of ΔAAS, patients can be classified into three groups corresponding to the probability of appendicitis: low (ΔAAS ≤ -2), intermediate (ΔAAS -1 to 0), and high (ΔAAS ≥ 1) probability. The proportions of acute appendicitis were then calculated in these three groups (Table 4). There were 29 (35%) patients in the low probability group, and only 1 (3%) of them had acute appendicitis. Eighteen (22%) patients fell into the intermediate probability group, and 13 (72%) of them had acute appendicitis. There were 36 (43%) patients in the high probability group, and 34 (94%) of them had acute appendicitis. The negative predictive value of the low probability group (ΔAAS ≤ -2) for acute appendicitis was 97 per cent, and the positive predictive value of the high probability group (ΔAAS ≥ 1) was 94 per cent. The flow of patients into these groups is presented in Fig. 2.

Table 4

The three probability groups for acute appendicitis based on ΔAAS
	Low: ΔAAS ≤ -2	Intermediate: ΔAAS -1 to 0	High: ΔAAS ≥ 1
Patients, n (%)	29 (35)	18 (22)	36 (43)
Acute appendicitis, n (%)	1 (3)	13 (72)	34 (94)

Values in parentheses are percentages; for acute appendicitis, percentages are within each group. ΔAAS = change in Adult Appendicitis Score.
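The grouping rule and the two predictive values can be checked with a few lines of arithmetic. In the sketch below, the function name `probability_group` is hypothetical, while the counts are those of Table 4.

```python
def probability_group(delta_aas: int) -> str:
    """Classify a patient by the change in AAS (final minus baseline)."""
    if delta_aas <= -2:
        return "low"
    if delta_aas <= 0:
        return "intermediate"
    return "high"


# Table 4: group -> (number of patients, number with acute appendicitis)
table4 = {"low": (29, 1), "intermediate": (18, 13), "high": (36, 34)}

n_low, cases_low = table4["low"]
n_high, cases_high = table4["high"]

# NPV of the low group: share of low-group patients without appendicitis.
npv_low = (n_low - cases_low) / n_low
# PPV of the high group: share of high-group patients with appendicitis.
ppv_high = cases_high / n_high

print(f"NPV (low group):  {npv_low:.0%}")   # 97%
print(f"PPV (high group): {ppv_high:.0%}")  # 94%
```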

Thus, utilising these groups based on ΔAAS, only the 18 (22%) patients in the intermediate probability group would have benefited from imaging studies. Compared with the 31 (37%) patients actually imaged during the trial according to the trial protocol, this is a significantly lower number of patients, p < 0.001.
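McNemar's test compares two paired classifications using only the discordant pairs: here, patients imaged in the trial but not under the ΔAAS rule (b), and patients imaged under the ΔAAS rule but not in the trial (c). The paper reports neither these counts nor which form of the test was used, so the sketch below shows the exact binomial form, and the example rests on a purely hypothetical assumption that all 18 intermediate-group patients were among the 31 imaged in the trial (b = 13, c = 0).

```python
from math import comb


def mcnemar_exact(b: int, c: int) -> float:
    """Two-sided exact McNemar p-value from the two discordant-pair counts."""
    n = b + c
    if n == 0:
        return 1.0
    k = min(b, c)
    # Binomial(n, 0.5) probability of a split at least as extreme, doubled for two sides.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2**n
    return min(1.0, 2 * tail)


# Hypothetical discordant counts (see lead-in); not reported in the paper.
p_value = mcnemar_exact(b=13, c=0)
print(p_value)
```

Under that hypothetical split the p-value is well below 0.001, in line with the reported result. The true value depends on the actual overlap between the two sets of patients.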

Discussion

During observation of patients with equivocal signs of appendicitis, the measurable inflammatory response and clinical findings evolve, causing changes in AAS. Repeated scoring allows the change in AAS to be used as a diagnostic tool. This study showed that the AAS change during the observation has significantly better diagnostic accuracy and overall sensitivity than baseline AAS. Patients with appendicitis showed an increasing CRP value, but the diagnostic accuracy of the change in CRP was significantly poorer than that of the change in AAS. In accordance with these results, an earlier study⁵ demonstrated that CRP change had independent diagnostic value in addition to the final CRP value after observation of patients with equivocal signs of appendicitis.

In the DIAMOND trial⁸, we showed that the need for diagnostic imaging was considerably lower when the AAS-based observation protocol was utilised compared with routine imaging. Observation and re-scoring with AAS can further reduce the need for imaging if the change in AAS is used instead of a predefined cut-off score of 16 or more. The change in AAS can then be used to categorise patients into three probability groups. The risks of missing appendicitis in the low-probability group or ending up with a negative appendectomy in the high-probability group are small. Thus, as the patients in the high- and low-probability groups would not benefit from imaging, the imaging rate could be reduced considerably. This classification can speed up the decision process after observation, release the imaging resources of the emergency department for other patients, and reduce the cancer risk² caused by the ionising radiation of CT.

The increased use of CT has led to increased detection of uncomplicated appendicitis, as CT cannot differentiate self-resolving appendicitis from the perforating type. This has resulted in overdiagnosis of appendicitis³,⁴. As concluded earlier in the DIAMOND trial⁸, observation and avoidance of early imaging can help distinguish patients with spontaneously resolving appendicitis. Most patients could be managed without imaging, and far fewer patients required treatment for appendicitis. This new classification can reduce overtreatment even further by reducing excess imaging. As shown in the DIAMOND trial and in the recent PERFECT trial¹⁰, observation or a longer delay to surgery does not increase the perforation rate or postoperative complications, which makes a short observation of up to 8 hours a safe alternative to early imaging.

According to the prospective validation study¹¹, the AAS intermediate risk group comprises 39 per cent of all patients suspected of acute appendicitis. Only this group of patients is generally recommended to be imaged: ultrasound first, and CT if appendicitis is not detected. If observation is combined with the new classification based on ΔAAS, the overall proportion of patients suspected of appendicitis who need imaging could be reduced substantially.

As this study shows, there are many ways to improve observation protocols even further. Utilisation of change in AAS may be one way. Selecting suitable patients for observation is another critical area that requires further studies. Using scoring systems that differentiate patients with uncomplicated appendicitis from complicated appendicitis may also be helpful. Unfortunately, such scoring that does not use imaging studies for differentiation has not been published to date.

This study has limitations. Only patients with less than 24 hours of symptom duration and CRP levels below 100 mg/L were included in the study. Also, the number of patients in the observation arm is small. These results should be confirmed in a larger, preferably prospective, study with less strict inclusion criteria.

Conclusions

Re-scoring with AAS after six to eight hours of observation is a worthwhile and feasible alternative to imaging in patients with an intermediate risk of appendicitis according to AAS. The change in AAS can be utilised as a diagnostic tool, effectively reducing the need for imaging and easing the decision-making process in these patients.

Abbreviations

AAS: Adult Appendicitis Score
ΔAAS: change in AAS
AUROC: area under the receiver operating characteristic curve
CRP: C-reactive protein
ΔCRP: change in CRP
CT: computed tomography
ED: emergency department
i.q.r.: interquartile range
DOR: diagnostic odds ratio
ROC: receiver operating characteristic curve
95%CI: 95 per cent confidence interval
+LR: positive likelihood ratio
-LR: negative likelihood ratio

Declarations

Ethics approval and consent to participate

The ethics approval (the ethical committee of Helsinki University Hospital, reference number 27/13/03/02/2016) and patients’ individual written consents were secured during the original DIAMOND trial. There was no need to renew these as no new interventions or data gathering was conducted.

Consent for publication

Not applicable

Availability of data and materials

The datasets generated and/or analysed during the current study are not publicly available due to Finnish legislation.

Competing interests

The authors declare that they have no competing interests.

Funding

This study was supported financially by HUS Research Funds, Finland (Government Research Funds).

Authors' contributions

Conceptualization and methodology: KL, PM, and AL; Data collection: KL; Validation: PM; Formal analysis: KL and PM; Writing – original draft preparation: KL; Writing – review and editing: KL, PM, and AL; Visualization: KL.

Acknowledgements

The authors thank all participating clinicians for recruiting patients at the ED of Meilahti Hospital.

References

1. Raja AS, Wright C, Sodickson AD, Zane RD, Schiff GD, Hanson R, et al. Negative Appendectomy Rate in the Era of CT: An 18-year Perspective. Radiology. 2010 Aug;256(2):460–5.
2. Lee KH, Lee S, Park JH, Lee SS, Kim HY, Lee WJ, et al. Risk of Hematologic Malignant Neoplasms From Abdominopelvic Computed Tomographic Radiation in Patients Who Underwent Appendectomy. JAMA Surgery. 2021 Jan 20;1–9.
3. Livingston EH, Woodward WA, Sarosi GA, Haley RW. Disconnect Between Incidence of Nonperforated and Perforated Appendicitis. Annals of Surgery. 2007 Jun;245(6):886–92. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1876946/pdf/20070600s00009p886.pdf
4. Andersson RE, Anning HL. Resolving appendicitis is common: Further evidence - Reply. Annals of Surgery. 2008;247(3):553–553.
5. Andersson RE, Hugander A, Ravn H, Offenbartl K, Ghazi SH, Nyström PO, et al. Repeated Clinical and Laboratory Examinations in Patients with an Equivocal Diagnosis of Appendicitis. World J Surg. 2000;24(4):479–85.
6. RIFT Study Group on behalf of the West Midlands Research Collaborative. Evaluation of appendicitis risk prediction models in adults with suspected appendicitis. British Journal of Surgery. 2019 Dec 3;103(1):971–86.
7. Sammalkorpi HE, Mentula P, Leppäniemi A. A new adult appendicitis score improves diagnostic accuracy of acute appendicitis - a prospective study. BMC Gastroenterology. 2014 Jun 26;14(1):910.
8. Lastunen KS, Leppäniemi AK, Mentula PJ. DIAgnostic iMaging or Observation in early equivocal appeNDicitis (DIAMOND): open-label, randomized clinical trial. Brit J Surg. 2022;109(7):588–94.
9. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig L, et al. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ. 2015;351:h5527.
10. Jalava K, Sallinen V, Lampela H, Malmi H, Steinholt I, Augestad KM, et al. Role of preoperative in-hospital delay on appendiceal perforation while awaiting appendicectomy (PERFECT): a Nordic, pragmatic, open-label, multicentre, non-inferiority, randomised controlled trial. Lancet. 2023;
11. Sammalkorpi HE, Mentula P, Savolainen H, Leppäniemi A. The Introduction of Adult Appendicitis Score Reduced Negative Appendectomy Rate. Scandinavian Journal of Surgery. 2017 Sep;106(3):196–201.

Editorial decision: Revision requested
28 May, 2024
Submission checks completed at journal
21 May, 2024
Editor assigned by journal
21 May, 2024
First submitted to journal
19 May, 2024

Status:

Version 1
