Study inclusion
The results of the search and selection process are presented in Fig. 1. A total of 202 articles were identified. Among these, 40 were MedRxiv preprints and 162 were fully published articles from MEDLINE Complete (EBSCO) and PubMed. 2 articles were identified from other sources for example manual search. After abstract/title exclusion and removing duplicates, 74 articles were submitted to full text screening and 31 of these were included for the systematic review. Most articles were excluded because they did not present sufficient data hence it was not possible to extract data to construct 2 × 2 table and 1 article was excluded because it was only available in Chinese. A total of 29 articles describing the results of 99 independent studies (99 data sets19, 23 and 57 investigating IgG or IgM or IgG-IgM based LFIA, CLIA and ELISA serological test respectively) were eligible for the meta-analysis.
Characteristics of the studies
The general characteristics of the included articles are presented in table 1. All the published articles (n=14) included in the review were published in 2020 because COVID-19 is an emerging disease. The 17 unpublished articles were MedRxiv preprints which have been submitted to different journals for publication. Twenty five articles included in the review had a case-control design, comparing a group of well-defined cases with a group of healthy controls or controls with diseases or COVID-19 rRT-PCR negative patients and only six studies were cross sectional studies. One study had no control group and was excluded in the meta-analysis 26. Most of the studies (n=22) were conducted in China where the COVID-19 pandemic began and 3 studies were conducted in Italy whilst, USA whilst UK, Denmark, Germany, Spain and Japan each conducted one study.
Study ID
|
Country
|
Antibody type
|
Antigen type
|
Commercial
|
Reference standard
|
Index test
|
Control group/comparison group
|
Kai-Wang
To 26 P, CS
|
China
|
IgG and IgM
|
S and N
|
Inhouse
|
rRT-PCR
|
ELISA
|
No control
|
Cassaniti 30 P, CS
|
Italy
|
IgG, IgM and IgG-IgM
|
S
|
Commercial
|
rRT-PCR
|
POC LFIA
|
Patients with fever and respiratory
syndrome/RT-PCR negative
|
Duchuan Lin 31
|
China
|
IgG, IgM and IgG-IgM
|
N
|
Inhouse
|
Epidemiological risk/clinical features/rRT-PCR
|
CLIA
|
Healthy individuals and tuberculosis patients
|
Jie Xiang 32
|
China
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
rRT-PCR
|
ELISA and POC LFIA
|
Healthy individuals
|
Li Guo 11 P
|
China
|
IgM
|
N
|
Inhouse
|
Deep sequencing and rRT-PCR
|
ELISA
|
Adult patients with acute lower respiratory
tract infections (ALRTIs)
|
Rui Liu 28 CS
|
China
|
IgM
|
N
|
Inhouse
|
rRT-PCR
|
|
COVID-19 rRT-PCR negative patients
|
Wanbing Liu 33 P
|
China
|
IgG, IgM and IgG-IgM
|
S and N
|
Commercial
|
rRT-PCR
|
ELISA
|
Healthy individuals
|
Xuefei Cai 34
|
China
|
IgG, IgM and IgG-IgM
|
S
|
Inhouse
|
rRT-PCR
|
Peptide-based Magnetic CLIA
|
Mixed dieases and Healthy controls
|
Yu bao Pan 35 CS
|
China
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
rRT-PCR
|
POC LFIA
|
COVID-19 rRT-PCR negative patients
|
Yujiao Jin 36 P
|
China
|
IgG, IgM and IgG-IgM
|
S-N
|
Commercial
|
rRT-PCR
|
CLIA
|
Patients with suspected SARS-CoV-2
infection but with negative rRT-PCR results
|
Zhao 37 P
|
China
|
IgG and IgM
|
S
|
Commercial
|
Chest CT images/Epidermiological history/Clinical diagnosis/rRT-PCR
|
ELISA
|
Healthy individuals
|
Zhengtu Li38 P
|
China
|
IgG, IgM and IgG-IgM
|
S
|
Commercial
|
rRT-PCR
|
POC LFIA
|
Healthy individuals
|
Rongqing Zhao 27
|
China
|
IgG-IgM
|
S
|
Inhouse
|
Not clear but all cases were confirmed COVID-19 patients
|
ELISA
|
Healthy individuals (samples collected
before and during the COVID-19 pandemic)
|
Pingping Zhang 39
|
China
|
IgG-IgM
|
S
|
Inhouse
|
rRT-PCR
|
POC LFIA
|
COVID-19 rRT-PCR negative patients
|
Paradiso 40
|
Italy
|
IgG-IgM
|
S
|
Commercial
|
rRT-PCR
|
POC LFIA
|
Patients with Covid-19 disease
orienting-symptoms but rRT-PCR negative
|
Huan Ma 41
|
China
|
IgA, IgG, IgM, IgG-IgM and Ab
|
S and N
|
Inhouse
|
rRT-PCR
|
CLIA
|
Healthy individuals, COVID-19 suspected
individuals and Mixed disease group
|
Qian 14
|
China
|
IgG and IgM
|
S-N
|
Commercial
|
rRT-PCR
|
CLIA
|
Healthy individuals and Hospitalised individuals
|
Ling Zhong 42 P
|
China
|
IgG and IgM
|
|
Inhouse
|
rRT-PCR
|
CLIA and ELISA
|
Healthy individuals
|
Jiajia Xie 43 P, CS
|
China
|
IgG and IgM
|
E-N
|
Commercial
|
Chest CT images/Epidermiological history/Clinical diagnosis/rRT-PCR
|
CLIA
|
Clinically confirmed COVID-19 rRT-PCR negative patients
|
Infantino 44 P
|
Italy
|
IgG and IgM
|
S-N
|
Commercial
|
rRT-PCR
|
CLIA
|
Mixed dieases patients and blood donars Pre-COVID-19
|
Adams 45
|
UK
|
IgG, IgM and IgG-IgM
|
S
|
Inhouse (ELISA) and Commercial (LFIA)
|
rRT-PCR
|
ELISA and POC LFIA
|
Healthy blood and ICU cerebral organ donars before
the COVID-19 pandemic
|
Lassaunière 46
|
Denmark
|
IgA, IgG and Ab
|
S
|
Commercial
|
rRT-PCR
|
ELISA and POC LFIA
|
Healthy individuals and mixed dieases patients (Including
Acute respiratory tract infections caused by other
corona viruses and
non-corona viruses
|
Qiang Wang 47 P
|
China
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
Chest CT images/Epidermiological history/Clinical diagnosis/rRT-PCR
|
ELISA and POC LFIA
|
COVID-19 clinical negative mixed diseases patients
|
Fei Xiang 32 P
|
China
|
IgG and IgM
|
N
|
Commercial
|
rRT-PCR
|
ELISA
|
Healthy blood donors or from patients with other
disease hospitalized
|
Bin Lou 48
|
China
|
IgG, IgM and Ab
|
S and N
|
Commercial
|
rRT-PCR
|
ELISA, CLIA and POC LFIA
|
Healthy Individuals
|
Lei Liu 49
|
China
|
IgG-IgM
|
N
|
Commercial
|
rRT-PCR
|
ELISA
|
Randomly-selected ordinary patients and
healthy blood donors
|
Imai 50
|
Japan
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
rRT-PCR
|
POC LFIA
|
Non-COVID-19 patients (from April to October 2019
|
Pérez-García 51
|
Spain
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
rRT-PCR
|
POC LFIA
|
Healthy individuals (samples collected
before the COVID-19 pandemic)
|
Zhenhua Chen 13 P
|
China
|
IgG
|
N
|
Inhouse
|
rRT-PCR
|
POC LFIA
|
Clinically suspicious for the presence of anti-SARS-CoV-2
|
Dohla 52 P, CS
|
German
|
IgG, IgM and IgG-IgM
|
|
Commercial
|
RT-qPCR
|
POC LFIA
|
COVID-19 RT-qPCR negative patients
|
Burbelo 29
|
USA
|
Ab
|
S and N
|
Inhouse
|
RT-PCR
|
LIPS
|
Subjects with COVID-19-like symptoms or household
contacts of persons with COVID-19 (not tested by PCR),
and blood donors who donated samples before 2018.
|
Key
- Studies with P superscripts were published articles and without P superscripts were MedRxiv preprints.
- Studies with CS superscripts are cross sectional studies and without CS superscripts are case control studies.
- IgG-IgM means that either one of them or both were detected in serum.
- Ab means total antibodies.
|
Table 1
The general characteristics of the studies included in the review.
Most articles (n=26) included in the review clearly stated that the gold standard nucleic acid tests (rRT-PCR or deep sequencing) were used as the reference standard. However, five articles used a combination of epidemiological risk, clinical features, chest CT images and rRT-PCR. In one article the reference standard used was not stated but all the patients in the study were COVID-19 patients 27.
Point-of-care (POC) lateral flow immunoassays (LFIA) were used in 14 articles, CLIA were used in 9 articles and ELISA were used in 13 articles. We did not identify articles using FIA that met our inclusion criteria. One study did not specify the serological assay used and it was excluded from the review 28. One study used a LIPS which is performed in solution, thus maintaining the native antigen conformation 29. Most of the serological assay test kits were commercial (n=21) and 12 were in-house. Three SARS-CoV-2 antigens, Spike protein (S), nucleocapsid protein (N) and envelope protein (E) were used together or separately in studies included in the review. The spike protein and nucleocapsid were used as the antigen in 9 articles and 6 articles respectively. Five articles used both S and N as the antigens separately. In three articles S and N antigens (S-N) were used together as the antigen. In one article N and E antigens (N-E) were used together as the antigen. In seven articles the name of antigen used was not given.
Methodological quality of included studies
The methodological quality of the included studies for the IgG or IgM or IgG-IgM based LFIA, CLIA and ELISA summarised across all studies are shown in Figures 2b, 3b and 4b. Figures 2a, 3a and 4a show for the risk of bias and applicability concerns summary results for the LFIA, CLIA and ELISA individual studies respectively. None of the studies included in this review had low risk of bias in all four QUADAS-2 domains. Generally case control studies were of high risk of bias and high concern in the patients and timing and flow domains and cross sectional studies were of low risk of bias and low concern in all domains.
Patient selection domain
Generally most studies included were at risk of bias and had high concerns regarding applicability. Studies were mostly case control studies and they did not include a consecutive or random series of participants implying that the patients that were included are not representative for clinical use. All thirteen ELISA studies were at high risk of bias and had high concerns regarding applicability. For CLIA all the nine studies included had high risk of bias and only one cross sectional study had low applicability concerns. Generally LFIA had more studies (n=4) with low risk of bias and applicability concerns in the patient selection domain because there were 4 LFIA cross sectional studies.
Index test domain
In the index test domain, studies generally had a low risk of bias (13/14, 5/9 and 9/13 for LFIA, CLIA and ELISA respectively). This was because most studies had a pre-specified threshold (cut-off value to decide whether a test is positive or negative). The studies that had high risk of bias did not have a pre-specified threshold and in two studies the risk of bias could not be determined as it was not clear whether the threshold was pre-specified or not. Likewise, studies generally had low applicability concerns in the index test domain (12/14, 5/9 and 8/13) for LFIA, CLIA and ELISA respectively because they used commercial index tests.
Reference standard domain
Like the index test domain, studies generally had a low risk of bias (10/14, 8/9 and 10/13 for LFIA, CLIA and ELISA respectively) in the reference standard domain. Generally the studies were of low applicability concern, 10/14, 8/9 and 11/13 for LFIA, CLIA and ELISA respectively.
Flow and timing domain
All the CLIA studies (n=9) and ELISA studies (n=13) were at high risk of bias in the flow and timing domain. These studies were all case control studies. Most of the LFIA studies were also at a high risk of bias however four cross sectional studies LFIA studies were at low risk of bias.
Quantitative synthesis and meta-analysis
Firstly, we considered performance of the LFIA devices using RT-PCR-confirmed cases as the reference standard. The forest plots in Figure 5 show the sensitivity, specificity range, and heterogeneity for the three IgG or IgM or IgG-IgM based LFIA detecting COVID-19 across the included studies. Overall, the sensitivity varied widely across studies in contrast to the specificity which did not vary much except for 2 studies, Yunbao Pan, 2020 and Qiang Wang, 2020, which had the lowest and second lowest specificities respectively. Amongst the IgG based LFIA tests (n=17) the sensitivity estimates ranged from 0.14 (95% CI 0.09-0.21) (Imai, 2020) to 1.00 (95% CI 0.77-1.00) (Qiang Wang, 2020) and specificity estimates ranged from 0.41 (95% CI 0.21-0.64) (Yunbao Pan, 2020) to 1.00 (95% CI 0.97-1.00) (Bin Lou, 2020) (Figure 5a). For the IgM based LFIA tests (n=16) the sensitivity estimates ranged from 0.05 (95% CI 0.01-0.18) Adams assay 4 to 1.00 (95% CI 0.77-1.00) (Qiang Wang, 2020) and specificity estimates ranged from 0.64 (95% CI 0.41-0.83) (Yunbao Pan, 2020) to 1.00 (95% CI 0.94-1.00) (Adams assays 4 and 5) (Figure 5b). For the IgG-IgM based LFIA tests (n=24) the sensitivity estimates ranged from 0.18 (95% CI 0.08-0.34) (Cassaniti, 2020) to 1.00 (95% CI 0.77-1.00) (Qiang Wang, 2020), with most of the studies having sensitivities over 0.55 and specificity estimates ranged from 0.36 (95% CI 0.17-0.59) (Yunbao Pan, 2020) to 1.00 (95% CI 0.94-1.00) (Adams assays 2 and 3) (Figure 5c)
We then considered performance of the different IgG or IgM or IgG-IgM based CLIA test using RT-PCR-confirmed cases as the reference standard (Figures 6a, 6b and 6c). Considering any positive result (IgM positive, IgG positive or both), CLIA serological tests achieved sensitivity ranging from 0.48 (95% CI 0.29-0.68%) (Yujiao Jin, 2020) to 1.00 (95 % CI 0.79-1.00) with most studies being between 0.80 and 1. The specificity was over 0.80 in most tests except for two tests, one IgG based test and one IgM based test which had the lowest 0.00 (95% CI 0.00-0.009) and second lowest 0.15 (95% CI 0.06-0.30) specificities respectively.
Lastly, we evaluated the performance of the different IgG or IgM or IgG-IgM based ELISA tests using RT-PCR-confirmed cases as the reference standard (Figures 7a, 7b and 7c). The sensitivities and specificities were generally high, ranging from 0.80 to 1.00 and 0.95 to 1.00 in most studies. For all the IgG based ELISA tests (n=10) the sensitivity estimates ranged from 0.65 (95% CI 0.57-0.72) (Zhao, 2020) to 1.00 (95% CI 0.79-1.00) (Kai-Wang To, 2020) and specificity estimates from 0.86 (95% CI 0.51-0.89) to 1.00 (95% CI 0.98-1.00) (Ling Zhong, 2020) (Figure 7a). In the IgM based tests (n=11), the sensitivity and specificity in the individual studies ranged from 0.44 (95% CI 0.32-0.58) (Jie Xiang, 2020) to 1.00 (95% CI 0.77–1.00) (Qiang Wang, 2020) and 0.69 (95% CI 0.57-0.80) (Qiang Wang, 2020) to 1.00 (95% CI 0.99–1.00) (Ling Zhong, 2020), respectively (Figure 7b). The sensitivity across the 5 studies included in the IgG-IgM based ELISA tests ranged from 0.80 (95% CI 0.74-0.85) (Wanbing Liu, 2020) to 0.87 (95% CI 0.77-0.94) (Rongqing Zhao, 2020). On the other hand, specificity across the 5 studies ranged from 0.97 (95% CI 0.92-0.99) (Lei Liu, 2020) to 1.00 (95% CI 0.98-1.00) (Rongqing Zhao, 2020) (Figure 7c).
We also constructed the SROC curves for all the three antibody based serological tests, figure 8, however we did not calculate the area under the ROC (AUROC). From the SROC we visually assessed heterogeneity between the different tests. Diagonal line indicated useless tests and the best tests were clustered further up to the top left hand corner.
The bivariate model and the hierarchical summary receiver operating characteristic curve (HSROC) model were performed to evaluate the diagnostic accuracy of the serological tests. The outputs of the meta-analysis (bivariate and HSROC parameter estimates, as well as the summary values of sensitivity and specificity) are presented in Table 2 and Figure 9. The pooled sensitivity for the IgG, IgM and IgG-IgM based LFIA tests were 0.5856, 0.4637 and 0.6886 respectively compared to RT-PCR. The pooled sensitivity for the IgG and IgM based CLIA tests were 0.9311 and 0.8516 respectively compared to RT-PCR. The pooled sensitivity for the IgG, IgM and IgG-IgM based ELISA tests were 0.8292, 0.0.8388 and 0.8531 respectively compared to RT-PCR. All the tests had high specificities ranging from 0.9693 to 0.9991 compared to RT-PCR. The estimated SROC curves for bivariate models are not presented.
HSROCs were also to visually access the overall performance of the diagnostic tests, access the overall diagnostic accuracy of the tests and compare the diagnostic accuracy of the different tests used for diagnosing COVID-19 in the review (Figure 9). The overall diagnostic test accuracy was measured by the closeness of the curve to the top left corner which represents high sensitivity and specificity. The closer the curve was to the upper left hand corner, the better the diagnostic accuracy 53. From figure 9 it can be observed that ELISA and CLIA have better diagnostic accuracy compared to LFIA and IgG-IgM based ELISA tests have the best overall diagnostic test accuracy.
Test type
|
Antibody type
|
Number of studies/tests
|
Sensitivity (95 %-CI)
|
Specificity (95 %-CI)
|
Correlation
|
LFIA
|
IgG
|
17
|
0.5856 (0.4397-0.7179)
|
0.9896 (0.9561-0.9976)
|
-0.4454
|
CLIA
|
IgG
|
9
|
0.9311 (0.9309-0.9312)
|
0.9757 (0.9757-0.9758
|
-0.511
|
ELISA
|
IgG
|
10
|
0.8292 (0.7416-0.8915)
|
0.9948 (0.9675-0.9992)
|
-0.1709
|
LFIA
|
IgM
|
16
|
0.4637 (0.3016-0.6339)
|
0.9734 (0.9275-0.9905)
|
-0.7925
|
CLIA
|
IgM
|
10
|
0.8516 (0.7356-0.9221)
|
0.9693 (0.855-0.9941)
|
-0.7074
|
ELISA
|
IgM
|
11
|
0.8388 (0.7307-0.909)
|
0.9991 (0.9778-1)
|
-0.7247
|
LFIA
|
IgG-IgM
|
24
|
0.6886 (0.5878-0.7742)
|
0.9757 (0.9466-0.9892)
|
0.1011
|
CLIA
|
IgG-IgM
|
3
|
-
|
-
|
-
|
ELISA
|
IgG-IgM
|
5
|
0.8531 (0.7851-0.9023)
|
0.9901 (0.9287-0.9987)
|
-0.6771
|
Table 2
Summary estimates of test accuracy.
We identified one study (Burbelo, 2020) reporting total antibody (Ab) based luciferase immunoprecipitation assay system (LIPS) using N and S antigens with sensitivities and specificities of 0.91 (95 % CI 0.77-0.99) and 1.00 (0.80-1.00) and 1.00 (0.92-1.00) and 1.00 (0.92-1.00) respectively. We also identified studies reporting other Ab based serological assays and IgA based serological assays but results are not reported in this review.
Heterogeneity investigations
Generally high overall I^2 values above 85 %, which indicate high heterogeneities, were observed for both the sensitivities and specificities when we performed antigen subgroup meta-analysis with the exception of IgG-IgM based ELISA. IgG-IgM based ELISA had an overall sensitivity I^2 value of 52. 12 % which is considered moderate heterogeneity and overall specificity I^2 value of 0 % which is considered to be low heterogeneity. However it should be noted that only 5 studies were included for this subgroup meta-analysis. Overall I^2 values for sensitivities and specificities heterogeneities for the antigen type subgroup meta-analysis are shown in table 3. We did not investigate heterogeneity for LFIA because most studies included in the review did not specify the type of antigen they used in their serological tests.
Detailed results of heterogeneity for the different antigen type sensitivities and specificities for each test type and antibody type combination are presented in Additional file 2.
Test type
|
Antibody type
|
Heterogeneity (I^2 )
|
Sensitivity
|
Specificity
|
LFIA
|
IgG
|
-
|
-
|
IgM
|
-
|
-
|
IgG-IgM
|
-
|
-
|
CLIA
|
IgG
|
93.56 %
|
86.5 %
|
IgM
|
93. 42 %
|
95.17 %
|
IgG-IgM
|
-
|
-
|
ELISA
|
IgG
|
78.07 %
|
84.97 %
|
IgM
|
85. 47 %
|
90.08 %
|
IgG-IgM
|
52.12 %
|
0 %
|
Table 3
Overall antigen type subgroup meta-analysis heterogeneity