Characteristics and quality of the included studies
We identified 75 publications in MEDLINE, 71 publications in Embase, 5 publications in the Cochrane Library, and 2 publications manually searched from reference lists. Of these 153 studies, 12 studies4,9,18,19,10–17 (n=2,204) met the inclusion criteria and were included (Appendix Figure 1). One study used clinical diagnostic criteria of COVID-19, but the details provide subgroup data in which the patients were diagnosed with RT-PCR test and received chest CT test14. The data from this subgroup were used in the present meta-analysis. Another study excluded patients with negative CT scan results inappropriately, the original data with all patients that met our selection criteria were used in this meta-analysis.16
The characteristics of the nine included studies are summarized in Table 1. The mean age of the participants ranged from 40.0 to 58.2 years old, with 45.5 to 59.8% males. Nine of the studies were performed in China, and the other two studies were carried out in Italy and Japan, respectively. One case control study recruited COVID-19 patients in China and control group in America. Sample sizes ranged from 21 to 1014 participants.
Quality assessment
The methodological quality of studies was assessed with QUADAS-2. (Appendix Figure 2). Some study characteristics that might increase the risk of bias were identified. The main concerns of high risk studies focused on patient selection and flow and timing domains. Bai’s study was a case-control study that was not a random sample of patients clinically suspected COVID-1911, while Li’s study and Miao’s study18 only focused on patients with abnormal chest CT among the COVID-19 suspected population12. In Himoto’s study, some of the control group patients were confirmed negative for COVID-19 by careful observation more than 2 weeks without RT-PCR test10. Due to the different diagnostic criteria used in Xiong’s study16, the patient flow could introduce bias with high risk and only part of the valid data were analyzed in our meta-analysis. Most studies did not clarify whether RT-PCR results were interpreted without knowledge of the results of chest CT.
Diagnostic performance of chest CT compared with initial RT-PCR for COVID-19
Forest plots of the sensitivity and specificity of chest CT and initial RT-PCR assays for diagnosing COVID-19 were shown in Figures 1. The following pooled parameters of chest CT were calculated over eight studies with both sensitivity and specificity data: sensitivity, 94.5% (95% CI: 89.5-97.2%); specificity, 41.8% (95% CI 24.2 to 61.6%); PLR, 1.6 (95% CI: 1.6-2.3); NLR, 0.13 (95% CI: 0.06-0.31); and DOR, 12.4 (95% CI: 4.0-38.5), I2 =93%. For initial RT-PCR test, pooled sensitivity, 95.1% (95% CI: 87.7-100.0%); specificity, 100.0% (95% CI: 99.3-100.0%); PLR, 119.6 (95% CI: 22.6-633.5); NLR, 0.13 (95% CI: 0.03-0.52); and DOR, 921 (95% CI: 74-11496), I2 =67%. (Table 2) The corresponding symmetric SROC curves were plotted and shown in Figure 2. Initial RT-PCR revealed a better diagnostic performance compared to chest CT.
Estimated posterior probabilities according to COVID-19 prevalence
The prior probabilities of COVID-19 were estimated from the prevalence data. In a previous meta-analysis, the pooled prevalence of COVID-19 was 38%7. As for close contacts of confirmed COVID-19 patients, secondary attack rate ranged from 0.45 to 5% among general close contacts and 3 to 10% among household members. We used prior probabilities of 1%, 10%, and 38% to estimate the Bayesian posterior probabilities. The estimated PPV of chest CT were 2%, 15%, and 50% at a disease prevalence of 1%, 10% and 38%, respectively. While the estimated NPV of chest CT were 0.1%, 1%, and 7% at a disease prevalence of 1%, 10% and 38%, respectively. (Figure 3)
Comparison of different chest CT manifestations in diagnosing COVID-19
To further observe the diagnostic performance of different chest CT manifestations, the diagnostic values were summarized in Table 2. Forest plots of CT manifestations were shown in Figure 4 while the corresponding SROC curves were shown in Figure 5. Peripheral lesions, bilateral involvement, multiple lesions, and ground-glass opacities (GGO), revealed to be with better diagnostic value than other CT manifestations. Meanwhile, pleural effusion, consolidation, and lymphadenopathy were with worst diagnostic performance.
Publication bias in the literature evaluation
Publication bias was assessed and presented in Appendix Figure 3. The Deeks’ test showed a statistically significant value (p < 0.001) in initial RT-PCT test, indicating that there was potential publication bias. While no significant publication bias was found in other tests evaluated in this meta-analysis. (p>0.05)