Performance of SARS-CoV-2 antigen-detection rapid diagnostic tests for COVID-19 self-testing and self-sampling in comparison to molecular and professional-use antigen tests: A systematic review and meta-analysis

doi:10.21203/rs.3.rs-3263909/v1

Research Article

https://doi.org/10.21203/rs.3.rs-3263909/v1

This work is licensed under a CC BY 4.0 License

Version 1

Purpose:

Self-testing is an effective tool to bridge the testing gap for several infectious diseases; however, its performance in detecting SARS-CoV-2 using antigen-detection rapid diagnostic tests (Ag-RDTs) has not been systematically reviewed. To inform WHO guideline development, we evaluated the accuracy of COVID-19 self-testing and/or self-sampling using Ag-RDTs.

Methods:

We searched multiple databases for articles evaluating the accuracy of COVID-19 self-testing or self-sampling through November 7th, 2022. Cohen’s kappa was estimated to assess concordance between self-testing/self-sampling and fully professional-use Ag-RDT results. Bivariate meta-analysis was performed to obtain pooled performance estimates compared to molecular testing. The QUADAS-2 and GRADE tools were used to evaluate quality and certainty of evidence.

Results:

Among 43 studies included in the review, 12 reported on self-testing, while 31 studies assessed self-sampling only. The risk of bias was low in 49.6% of the studies. Overall concordance with professional-use Ag-RDTs (n = 7 datasets) was high (kappa 0.92 [95% confidence interval (CI) 0.89 to 0.95]). Overall pooled sensitivity and specificity of Ag-RDT testing using self-testing/self-sampling (n = 54 datasets) were 70.5% (95% CI 64.3 to 76.0) and 99.4% (95% CI 99.1 to 99.6), respectively.

Conclusion:

Despite high heterogeneity among studies, COVID-19 self-testing/self-sampling exhibits high concordance with professional-use Ag-RDTs. This suggests that self-testing/self-sampling can be offered as part of COVID-19 testing strategies.

Trial registration

PROSPERO: CRD42021250706

COVID-19

self-testing

rapid antigen test

systematic review

meta-analysis

SARS-CoV-2 self-sampled antigen testing with and without self-readout achieves high concordance with professional antigen testing and acceptable accuracy against RT-PCR performed with self- or professional-collected samples.

Self-testing allows individuals to collect their own sample, conduct the diagnostic test, and interpret the result. A growing body of evidence has shown self-testing with simple antigen-detection rapid diagnostic tests (Ag-RDTs) to be feasible, acceptable and accurate [1]. Over the last decade, particularly for HIV and Hepatitis C, self-testing methods using lateral flow assays have shown high agreement with professional testing, increased testing uptake, and a low failure rate [2–5]. As a result, the World Health Organization (WHO) recommended self-testing for HIV in 2016 and for Hepatitis C in 2021 [6, 7].

With the emergence of the COVID-19 pandemic, Ag-RDTs for SARS-CoV-2 became widely available. While less accurate compared to the gold standard nucleic acid amplification tests (NAATs), Ag-RDTs enabled easy-to-use and rapid point-of-care (POC) testing [8]. This resulted in the WHO recommendation of SARS-CoV-2 Ag-RDTs for various use cases, including primary case detection and contact tracing [9]. Further, a sensitivity target ≥ 80% is currently recommended for Ag-RDTs [10]. However, the limited number of professional test operators hampered scale-up of and timely access to testing.

Building on the self-testing experiences for HIV and Hepatitis C, self-sampling coupled with professional Ag-RDT test conduct and interpretation (henceforth named self-sampling) as well as self-testing for COVID-19 was explored [11–13]. However, to date, no systematic review focusing solely on the performance of Ag-RDT self-testing and/or self-sampling has been performed. To address this knowledge gap and inform WHO guideline development, we conducted a systematic review and meta-analysis to (1) assess the concordance between self-testing and/or self-sampling and professional testing using commercially available Ag-RDTs for SARS-CoV-2 and (2) assess the accuracy of self-testing and/or self-sampling for COVID-19 using commercially available Ag-RDTs against RT-PCR performed on self-collected or professionally collected samples.

The methods were adapted from a living systematic review our group had previously published [8, 14]. The systematic review protocol (Supplement, S1 Text Study Protocol) is registered on PROSPERO (CRD42021250706). We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline to report our findings (Supplement, PRISMA Checklist) [15].

Search strategy

We searched the databases MEDLINE (via PubMed), Web of Science, medRxiv and bioRxiv (via Europe PMC), using search terms developed with an experienced medical librarian (MGr) from combinations of subject headings (when applicable) and text-words for the concepts of the search question. The main search terms were “Severe Acute Respiratory Syndrome Coronavirus 2,” “COVID-19,” “Betacoronavirus,” “Coronavirus,” and “Point of Care Testing”; the search was checked against an expert-assembled list of relevant papers. The full list of search terms is available in the supplementary material (Supplement Text 2 Search Strategy). Furthermore, we looked for relevant studies on the FIND website (https://www.finddx.org/sarscov2-eval-antigen/). We conducted the search without applying any language, age, or geographic restrictions from inception up until November 7th, 2022.

Eligibility criteria

We included studies evaluating the accuracy of self-testing and/or self-sampling using commercially available Ag-RDTs to establish a diagnosis of SARS-CoV-2 infection against RT-PCR as the reference standard. In studies assessing self-sampling, the Ag-RDT performance (including readout and interpretation) was conducted by a professional. Sampling conducted or assisted by caregivers was included as self-sampling. RT-PCR samples were eligible if they were either self-collected or professionally collected without a restriction on sample type (henceforth referred to as ‘RT-PCR’).

We included all studies reporting on any population, irrespective of age, symptom presence, or study location. We considered cohort studies, nested cohort studies, case–control, cross-sectional studies, and randomized controlled trials (RCTs). We included both peer-reviewed publications and preprints. We excluded studies in which persons underwent testing for the purposes of monitoring or ending quarantine. In addition, publications with a sample size under ten were excluded to minimize bias in clinical performance estimates.

Assessment of methodological quality

The quality of clinical accuracy studies was assessed by applying the quality assessment of studies of diagnostic accuracy (QUADAS-2) tool, which was adjusted to the needs of this review [16]. Details can be found in the supplementary material (Supplement Text 3 QUADAS).

Assessment of certainty of evidence (CoE)

We defined three individual outcomes for this review: (1) concordance between self-testing/self-sampling coupled with professionally performed Ag-RDT and entirely professionally conducted Ag-RDTs, expressed as Positive Percentage Agreement (PPA), Negative Percentage Agreement (NPA), and Overall Percentage Agreement (OPA); (2) sensitivity; and (3) specificity, the latter two against RT-PCR performed on a self-collected or professionally collected sample as reference.
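To illustrate the first outcome, the three agreement measures can be computed from a 2×2 cross-tabulation of self-collected versus fully professional Ag-RDT results. The following is a minimal Python sketch, not the code used in this review; the function name, argument names, and counts are hypothetical, and the professional result is assumed to serve as the comparator.

```python
def percentage_agreement(both_pos, self_pos_only, prof_pos_only, both_neg):
    """Return PPA, NPA and OPA (in %) with the professional Ag-RDT as comparator.

    both_pos       -- positive in both the self and the professional test
    self_pos_only  -- positive in the self test only
    prof_pos_only  -- positive in the professional test only
    both_neg       -- negative in both tests
    """
    prof_pos = both_pos + prof_pos_only      # all professional positives
    prof_neg = both_neg + self_pos_only      # all professional negatives
    total = prof_pos + prof_neg
    if prof_pos == 0 or prof_neg == 0:
        raise ValueError("need at least one professional positive and negative")
    return {
        "PPA": 100.0 * both_pos / prof_pos,
        "NPA": 100.0 * both_neg / prof_neg,
        "OPA": 100.0 * (both_pos + both_neg) / total,
    }


# Hypothetical counts, for illustration only (not taken from any included study).
result = percentage_agreement(both_pos=36, self_pos_only=1,
                              prof_pos_only=4, both_neg=159)
print(result)
```

With these hypothetical counts, PPA is 36/40, NPA is 159/160, and OPA is 195/200.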

Certainty of Evidence (CoE) was assessed following the GRADE guidelines for each individual outcome [17]. After rating the respective study type (e.g., RCT or observational trial), each outcome was independently evaluated according to five categories: study design, risk of bias (RoB), inconsistency, indirectness, and imprecision.

Assessment of independence from manufacturers

We examined whether a study received financial support from a test manufacturer (including free provision of Ag-RDTs), whether any study authors were affiliated with the manufacturer, and whether a respective conflict of interest was declared. If at least one of these conditions was met, the study was deemed as not independent from the test manufacturer; otherwise, it was considered as independent.
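The independence rule above is a simple "any of three conditions" check. A minimal Python sketch follows; the class, field names, and example studies are hypothetical and serve only to make the rule explicit.

```python
from dataclasses import dataclass


@dataclass
class StudyFunding:
    name: str
    manufacturer_support: bool   # financial support, incl. free provision of Ag-RDTs
    author_affiliated: bool      # any author affiliated with the manufacturer
    conflict_declared: bool      # a respective conflict of interest was declared


def is_independent(study: StudyFunding) -> bool:
    """A study is independent only if none of the three conditions is met."""
    return not (study.manufacturer_support
                or study.author_affiliated
                or study.conflict_declared)


# Hypothetical examples, not actual included studies.
studies = [
    StudyFunding("Study A", False, False, False),
    StudyFunding("Study B", True, False, False),   # tests provided free of charge
]
independent = [s.name for s in studies if is_independent(s)]
print(independent)
```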

Statistical analysis and data synthesis

We extracted data from eligible studies using a standardized data extraction form. Wherever possible we recalculated performance estimates based on the extracted data or contacted authors to provide additional information on concordance between self-tested and professionally tested Ag-RDTs. The final data set used is accessible under https://doi.org/10.11588/data/P9JEPG.
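Recalculating a performance estimate from extracted counts amounts to forming a proportion and an interval from a 2×2 table. The Python sketch below shows one way to do this. The interval method is not stated in this section, so the exact (Clopper-Pearson) interval used here is an assumption chosen for illustration; all function names and counts are hypothetical.

```python
import math


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p), with 0 < p < 1, summed in log space."""
    total = 0.0
    for i in range(k + 1):
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                   + i * math.log(p) + (n - i) * math.log1p(-p))
        total += math.exp(log_pmf)
    return total


def _bisect(func, target, increasing):
    """Solve func(p) = target on (0, 1) for a monotone func by bisection."""
    lo, hi = 0.0, 1.0
    for _ in range(60):
        mid = (lo + hi) / 2.0
        if (func(mid) < target) == increasing:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided (1 - alpha) interval for x successes out of n."""
    lower = 0.0 if x == 0 else _bisect(
        lambda p: 1.0 - binom_cdf(x - 1, n, p), alpha / 2.0, increasing=True)
    upper = 1.0 if x == n else _bisect(
        lambda p: binom_cdf(x, n, p), alpha / 2.0, increasing=False)
    return lower, upper


def accuracy(tp, fp, fn, tn):
    """Sensitivity and specificity as (estimate, lower, upper) tuples."""
    sens = (tp / (tp + fn),) + clopper_pearson(tp, tp + fn)
    spec = (tn / (tn + fp),) + clopper_pearson(tn, tn + fp)
    return {"sensitivity": sens, "specificity": spec}


# Hypothetical counts, for illustration only.
res = accuracy(tp=30, fp=2, fn=10, tn=158)
print(res)
```

Bisection is used only to keep the sketch free of third-party dependencies; a statistics library would normally supply the interval directly.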

We calculated Cohen’s kappa as a measure of concordance, its variance, and 95% confidence intervals (CIs) for comparison of results with fully professional-use Ag-RDTs. If four or more studies with at least 20 positive samples were available, we conducted a meta-analysis of Cohen’s kappa using the “metafor” package version 3.4-0 in R [18].
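For a single study, Cohen’s kappa and an approximate interval can be derived from the same 2×2 table of self versus professional results. The Python sketch below is illustrative only and is not the code used in this review. It uses the simple large-sample standard error, sqrt(po(1 − po) / (n(1 − pe)²)), which is an assumption on our part; the variance estimator actually used is not specified here. Function name and counts are hypothetical.

```python
import math


def cohens_kappa(both_pos, self_pos_only, prof_pos_only, both_neg, z=1.96):
    """Cohen's kappa with an approximate 95% CI for a 2x2 agreement table."""
    n = both_pos + self_pos_only + prof_pos_only + both_neg
    p_observed = (both_pos + both_neg) / n
    # Chance agreement from the marginal positive and negative rates.
    self_pos = both_pos + self_pos_only
    prof_pos = both_pos + prof_pos_only
    p_expected = (self_pos * prof_pos + (n - self_pos) * (n - prof_pos)) / n ** 2
    kappa = (p_observed - p_expected) / (1.0 - p_expected)
    # Simple large-sample approximation of the standard error (assumption).
    se = math.sqrt(p_observed * (1.0 - p_observed) / (n * (1.0 - p_expected) ** 2))
    return kappa, se, max(-1.0, kappa - z * se), min(1.0, kappa + z * se)


# Hypothetical counts, for illustration only.
kappa, se, ci_low, ci_high = cohens_kappa(36, 1, 4, 159)
print(f"kappa = {kappa:.3f} (95% CI {ci_low:.3f} to {ci_high:.3f})")
```

Per-study kappa values and variances of this kind are the inputs that a generic inverse-variance meta-analysis routine would then pool.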

We derived the estimates for sensitivity and specificity against RT-PCR and performed meta-analysis using a bivariate model when at least four data sets, each with at least 20 positive samples, were available (meta-analysis was implemented with the “reitsma” command from the R package “mada,” version 0.5.11). If fewer than four studies were available for an outcome, only a descriptive analysis was performed, and accuracy ranges were reported. Univariate random-effects inverse variance meta-analysis was performed (using the “metaprop” and “metagen” commands from the R package “meta,” version 5.5-0) for the pooled sensitivity analysis by Ct value. We predefined subgroups for meta-analysis based on the following characteristics: Ct value range (< 20, < 25, < 30, ≥ 20, ≥ 25, ≥ 30), sampling and testing procedure in accordance with manufacturer and/or study team instructions (‘IFU-conforming’ versus ‘not IFU-conforming’), patient age (‘<18 years’ versus ‘≥18 years’), presence of symptoms (‘symptomatic’ versus ‘asymptomatic’), and duration of symptoms (‘DoS ≤ 7 days’ versus ‘DoS > 7 days’).
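The univariate random-effects inverse variance approach mentioned above can be sketched as follows. This is a simplified Python illustration, not a reimplementation of the R routines named in the text: it assumes a logit transformation, a DerSimonian-Laird estimate of between-study variance, and a 0.5 continuity correction, none of which are confirmed settings of the original analysis. The eligibility helper encodes one reading of the "at least four data sets, each with at least 20 positive samples" rule. The bivariate model is not reproduced here. All names and counts are hypothetical.

```python
import math


def poolable(datasets, min_sets=4, min_positives=20):
    """Keep data sets with enough positives; pool only if enough remain."""
    usable = [d for d in datasets if d["positives"] >= min_positives]
    return usable if len(usable) >= min_sets else []


def pool_logit_dl(events, totals, z=1.96):
    """Random-effects pooled proportion (logit scale, DerSimonian-Laird)."""
    y, v = [], []
    for x, n in zip(events, totals):
        if x == 0 or x == n:              # continuity correction (assumption)
            x, n = x + 0.5, n + 1.0
        y.append(math.log(x / (n - x)))   # logit of the proportion
        v.append(1.0 / x + 1.0 / (n - x)) # its approximate variance
    w = [1.0 / vi for vi in v]
    fixed = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, y))
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - (len(y) - 1)) / c)   # between-study variance
    w_re = [1.0 / (vi + tau2) for vi in v]
    mu = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)
    se = math.sqrt(1.0 / sum(w_re))

    def expit(t):
        return 1.0 / (1.0 + math.exp(-t))

    return expit(mu), expit(mu - z * se), expit(mu + z * se), tau2


# Hypothetical data sets, for illustration only.
hypothetical = [
    {"tp": 30, "positives": 40},
    {"tp": 45, "positives": 60},
    {"tp": 20, "positives": 35},
    {"tp": 60, "positives": 70},
    {"tp": 7, "positives": 9},     # excluded: fewer than 20 positives
]
usable = poolable(hypothetical)
pooled, lower, upper, tau2 = pool_logit_dl(
    [d["tp"] for d in usable], [d["positives"] for d in usable])
print(f"pooled sensitivity {pooled:.3f} (95% CI {lower:.3f} to {upper:.3f})")
```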

To make the most of the heterogeneous data available, the cutoffs for the Ct value groups were relaxed by up to three points within each range (e.g., Ct value range group < 20 can include studies with Ct values ≤ 17 to ≤ 23). For the same reason, when categorizing by age, the age group < 18 years (children) included samples from persons whose age was reported as < 16 or < 18 years, whereas the age group ≥ 18 years included samples from persons whose age was reported as ≥ 16 years or ≥ 18 years. Additionally, samples from the anterior nares (AN) and nasal mid-turbinate (NMT) were summarized as AN. IFU-conformity was judged based on the study team’s information. As self-testing was an off-label use at that time for some Ag-RDTs, following the study team’s instructions was defined as IFU-conforming. Observed sampling and testing were defined as a professional watching the testing procedure without intervening. Predominant variants of concern (VoC) for each study were analyzed using the online tool CoVariants [19] with respect to the stated study period. The respective VoCs were extracted according to the current WHO listing [20].
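The relaxed grouping rules can be made explicit with a small lookup. In the Python sketch below, a reported Ct cutoff is matched to every predefined group whose nominal cutoff lies within three points. Because neighbouring windows overlap (a cutoff of 23 is within three points of both 20 and 25), the function returns all admissible groups rather than choosing one; how such ties were resolved is not described in the text. Function names are hypothetical.

```python
NOMINAL_CT_CUTOFFS = (20, 25, 30)


def admissible_ct_groups(reported_cutoff, direction, tolerance=3):
    """List the predefined Ct groups a reported cutoff may be assigned to.

    direction is "<" for low-Ct groups or ">=" for high-Ct groups.
    """
    if direction not in ("<", ">="):
        raise ValueError("direction must be '<' or '>='")
    symbol = "<" if direction == "<" else "≥"
    return [f"{symbol} {cutoff}" for cutoff in NOMINAL_CT_CUTOFFS
            if abs(reported_cutoff - cutoff) <= tolerance]


def age_group(reported_age_limit, direction):
    """Map a reported age limit of 16 or 18 years onto the two review groups."""
    if direction not in ("<", ">="):
        raise ValueError("direction must be '<' or '>='")
    if reported_age_limit not in (16, 18):
        return None                      # outside the rule stated in the text
    return "<18 years" if direction == "<" else "≥18 years"


print(admissible_ct_groups(17, "<"))
print(admissible_ct_groups(23, "<"))
print(age_group(16, ">="))
```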

Heterogeneity was interpreted visually in forest plots. Further, we performed the Deeks test for funnel-plot asymmetry as recommended to investigate publication bias for diagnostic test accuracy meta-analyses [21] (using the “midas” command in Stata, version 15); a p-value < 0.10 for the slope coefficient indicates significant asymmetry. Remaining analyses were performed using R 4.2.1 (R Foundation for Statistical Computing, Vienna, Austria).
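For readers unfamiliar with the Deeks test, its core is a weighted regression of the log diagnostic odds ratio on the inverse square root of the effective sample size (ESS), with ESS as weights. The Python sketch below is an illustration under stated assumptions, not the Stata routine used here: it adds 0.5 to all cells of a study with a zero cell, and it returns the slope, its standard error, the t statistic, and the degrees of freedom without computing a p-value. All names and counts are hypothetical.

```python
import math


def effective_sample_size(n_pos, n_neg):
    """ESS = 4 * n_pos * n_neg / (n_pos + n_neg)."""
    return 4.0 * n_pos * n_neg / (n_pos + n_neg)


def wls(x, y, w):
    """Weighted least squares line y = a + b*x; returns (b, a, se_b)."""
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    rss = sum(wi * (yi - intercept - slope * xi) ** 2
              for wi, xi, yi in zip(w, x, y))
    se_slope = math.sqrt(rss / (len(x) - 2) / sxx)
    return slope, intercept, se_slope


def deeks(studies):
    """studies: iterable of (tp, fp, fn, tn). Returns (slope, se, t, df)."""
    x, y, w = [], [], []
    for tp, fp, fn, tn in studies:
        if 0 in (tp, fp, fn, tn):                     # zero-cell correction
            tp, fp, fn, tn = (v + 0.5 for v in (tp, fp, fn, tn))
        ess = effective_sample_size(tp + fn, fp + tn)
        x.append(1.0 / math.sqrt(ess))
        y.append(math.log((tp * tn) / (fp * fn)))     # log diagnostic odds ratio
        w.append(ess)
    slope, _, se = wls(x, y, w)
    return slope, se, slope / se, len(x) - 2


# Hypothetical 2x2 counts, for illustration only.
hypothetical_studies = [
    (30, 2, 10, 158), (45, 1, 15, 239), (20, 3, 15, 162),
    (60, 0, 10, 330), (12, 1, 8, 79),
]
slope, se, t_stat, df = deeks(hypothetical_studies)
print(f"slope = {slope:.2f}, SE = {se:.2f}, t = {t_stat:.2f}, df = {df}")
```

The p-value for the slope would be obtained from a t distribution with the returned degrees of freedom, using a statistics library.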

Sensitivity analysis

Three types of sensitivity analyses were planned: concordance and estimation of performance (sensitivity, specificity) of self-testing and/or self-sampling compared to RT-PCR excluding case–control studies, preprints, and manufacturer-dependent studies. We compared the results of the respective sensitivity analysis against the overall results to assess the potential bias.

Our search strategy yielded a total of 20,431 titles after removal of duplicates. Twelve studies [11, 22–32] incorporating 28 data sets on self-testing (27,506 samples) and 31 studies [12, 13, 33–61] incorporating 37 data sets on self-sampling (31,792 samples) were found to be eligible for inclusion in the review (Fig. 1). One study was analyzed as self-sampling because it was unclear whether or not self-testing was performed [55].

Methodological quality of all included studies

The included studies were assessed to be of high applicability overall and variable bias (Fig. 2A).

Low risk of bias was observed in 41 out of 65 datasets (63.1%) when assessing the timing of the index test, the inclusion of participants, and whether the same reference standard was used throughout the study. However, in only 40.0% of the studies were the results of the reference standard (PCR) interpreted without knowledge of the index test results; this was unclear for the remaining 60.0%. For 67.7% of the studies, the conduct and interpretation of the index test was of low concern because the Ag-RDT results were interpreted without knowledge of the results of the reference standard. Only 33.8% of the studies had a representative study population, avoiding inappropriate exclusions or a case–control design, and thereby a low risk of bias. Of the remaining studies, the risk of bias for patient selection was unclear for 16.9%, high for 6.2%, and intermediate for 43.1%. Applicability was deemed to be of low concern in 86.2% of the studies across all domains since the methods (i.e., patient selection, index test conduct, reference standard choice) in the respective studies matched our research question (Fig. 2B; with further details in Supplementary Fig. 1). Potential conflict of interest due to financial support from or employment by the test manufacturer was present in 17 studies (34.7%) [26, 28, 32, 37, 41, 46, 47, 49, 50, 52, 53, 55, 57, 58]. In studies focusing on self-sampling, 30 out of 36 datasets reported IFU-conforming conduct of the test, even though sampling was explicitly observed in only 22 datasets (61.1%). For studies evaluating self-testing, 26 datasets stated IFU-conformity, while for the remaining two datasets it was unclear.

The result of the Deeks test for all datasets with complete results (p = 0.31) showed no significant funnel-plot asymmetry and thus no evidence of publication bias (Supplement, S2 Figure Funnel Plot).

Study description

Most of the studies included in the review were conducted in high-income countries (HIC): the USA (n = 10), Germany (n = 7), the Netherlands (n = 6), UK, and Canada (n = 2, each), as well as Greece, Denmark, Japan, France, Belgium, Austria, and Hong Kong (n = 1, each). In contrast, eight studies were conducted in middle-income countries (MIC): India (n = 3), Brazil, Morocco, Malaysia, South Korea, and China (n = 1, each) [62]. No studies were performed in low-income countries. Considering the study participants’ level of education, in two studies reporting on self-testing, the majority of participants (59.6% and 98.1%) had at least a high school degree [11, 22]. Out of the 17 studies reporting on self-sampling, one study stated that 52.5% of participants had a higher education degree [43]. Another study included only high school students (78.6%) or teachers (21.4%) [36], while two other studies included only college students [33, 54]. The remaining studies provided no information on the participants’ educational backgrounds. Participants had prior medical training (i.e., health care worker) in three self-sampling datasets (2,506 samples, 9.1%) [12, 43]. Participants were lay people without any medical training for six datasets totaling 5,023 samples, but for the other datasets, it remained unclear. Information on the participants' professional backgrounds and prior testing experiences was only reported in one self-testing study [10]. Out of the 144 participants in this study, 12 (8.3%) had prior medical training, 66 (45.8%) had undergone SARS-CoV-2 testing in the past, and four (2.8%) had performed at-home COVID-19 testing.

Most of the self-sampling data (32 datasets; 88.9%) were collected at testing or clinical sites, while for others no information was available. The sampling process was observed in 17 of the self-sampling studies (22 datasets), totaling 19,280 samples (60.6%) [12, 13, 33, 36, 38, 39, 41, 42, 45, 49, 52, 56–60], whereas sampling was not observed in four studies (4 datasets; 10.8%) [37, 43, 50, 54]. For the remaining ten studies (10 datasets; 27.0%), it was unclear whether the sampling was observed or not [34, 35, 40, 44, 46–48, 51, 53, 61]. Overall, 78.6% of the self-testing studies were carried out at a testing site, and the testing procedure was observed (without providing instructions) by the study team in three studies (1,083 samples; 2.9%) [11, 28, 32].

A total of 27,506 samples were evaluated in the self-testing studies. Of these, 13,166 came from individuals presenting with symptoms suggestive of a SARS-CoV-2 infection, while 10,103 came from persons who did not show any symptoms at the time of testing. For the rest, the authors did not specify the participants’ symptom status. A total of 31,069 individuals participated in the self-sampling studies, of whom 6,325 had symptoms, 20,569 were asymptomatic, and 4,175 had unclear symptom status.

The most used Ag-RDTs across all studies were the BinaxNow nasal test by Abbott (USA, henceforth called BinaxNow) and the Standard Q nasal test by SD Biosensor (South Korea; distributed in Europe by Roche, Germany; henceforth called Standard Q nasal), with six datasets each. The BD Veritor lateral flow test for Rapid Detection of SARS-CoV-2 (Becton, Dickinson and Company, MD, US; henceforth called BD Veritor), the CLINITEST Rapid COVID-19 Antigen Test (Siemens Healthineers, Germany; henceforth called CLINITEST), and the Rapid SARS-CoV-2 Antigen Test (MP Biomedicals, CA, US; henceforth called MP Bio) were used in three datasets each.

Most self-samples for antigen testing were taken from the anterior nares (‘AN’; 28 datasets, 77.8%). The remaining datasets made use of either combined oropharyngeal/anterior nasal (OP/AN) (2 datasets, 5.6%), saliva (2 datasets, 5.6%), a combination of the above (AN/saliva, 1 dataset), or OP (3 datasets, 8.4%) samples. Similarly, most self-testing datasets used AN samples (20 datasets, 71.4%), whereas OP/AN and saliva accounted for 4 datasets (14.3%) each. The following samples were used for RT-PCR testing: AN (13 datasets, 20.0%), nasopharyngeal (NP) (21 datasets, 32.3%), NP/OP (13 datasets, 20.0%), OP (9 datasets, 13.8%), OP/AN (5 datasets, 7.7%), or saliva (3 datasets, 4.6%).

The RT-PCR and Ag-RDT analyses were conducted on the same sample type across 20 self-sampling datasets [31, 33–38, 41, 45, 46, 50, 54, 58–61]. Self-collected samples were used for RT-PCR in 14 of those datasets [33, 36–38, 41, 45, 46, 54, 59, 60]. In all self-testing studies, RT-PCR samples were collected by a professional.

Two self-testing studies and one self-sampling study provided additional instructional videos [22, 29, 35]. Among the self-testing studies, four provided study-specific test instructions since no manufacturer instructions for self-testing were available at the time [11, 22, 25, 29].

Tables 1a and 1b provide further information on each of the studies included in the review.

Table 1a. Clinical accuracy data for self-sampled Ag-RDTs.
Study	Test assessed	Country	Type of location	Study population	Screening criteria	Sample type	Sensitivity, % (95% CI)	Specificity, % (95% CI)
Harris, 2021[12]	Sofia	USA	testing site	adults	sympt., HRC	AN	82.3 (77.5^# to 86.4^#)	98.8^# (97.5^# to 99.5^#)
Harris, 2021[12]	Sofia	USA	testing site	adults	asympt.	AN	31.6 (0.0^# to 24.7^#)	100 (99.8^# to 100)
Lindner, 2021[13]	Standard Q	Germany	testing site	adults	sympt.	AN*	74.4 (57.9^# to 87.0^#)	99.2 (97.1 to 99.9^#)
Tinker, 2021[33]	BinaxNow	USA	testing site	adults	asympt.	AN*	20.0 (9.1^# to 35.6^#)	100 (99.8^# to 100^#)
Tanimoto, 2021[34]	Lumipulse	Japan	unclear	unclear	unclear	saliva	61.8 (47.7^# to 74.6^#)	100 (94.1 to 100)
Mak, 2022[35]	Standard Q	Hong Kong	testing site	unclear	HRC	OP/AN*	100 (15.8^# to 100)	100 (90.7^# to 100)
Blanchard, 2021[36]	Panbio nasal	Canada	testing site	adults, children	sympt.	AN*	78.6 (49.2^# to 95.3^#)	100 (98.7^# to 100)
Harmon, 2021[37]	E25Bio	USA	testing site	adults	sympt., asympt.	AN	92.3 (64.0^# to 99.8^#)	99.6 (97.7^# to 100)
Ford, 2021[38]	BinaxNow	USA	testing site	children	sympt., HRC, asympt.	AN*	71.4 (53.7 to 85.4)	100 (98.0 to 100)
Ford, 2021[38]	BinaxNow	USA	testing site	adults	sympt., HRC, asympt.	AN*	80.9 (75.9 to 85.3)	99.9 (99.5 to 100)
Klein, 2021[39]	Panbio nasal	Germany	testing site	adults	sympt., HRC	AN	86.4 (72.6^# to 94.8^#)	99.2 (97.0 to 99.9^#)
Nikolai, 2021[43]	Standard Q	Germany	clinical	adults	sympt.	AN	91.2 (76.3^# to 98.1^#)	98.4 (91.3^# to 100^#)
Okoye, 2021[54]	BinaxNow	USA	testing site	adults	asympt.	AN*	53.3 (37.9^# to 68.3^#)	100 (99.9 to 100)
Krüger, 2021[56]	LumiraDx	Germany	testing site	adults	sympt., HRC	AN	82.2 (75.0^# to 88.0^#)	99.3 (98.3 to 99.7)
Osmanodja, 2021[57]	Dräger	Germany	both	adults	sympt., asympt.	AN	88.6 (78.7 to 94.9)	99.7 (98.2 to 100)
Chiu, 2021[58]	Indicaid	USA	clinical	adults, children	sympt.	AN	82.7 (72.2^# to 90.4^#)	96.4 (93.4 to 98.2^#)
García-Fiñana, 2021[59]	Innova	UK	testing site	adults	asympt.	OP/AN	40.0 (28.5 to 52.4)	99.9 (99.8 to 99.9)
Shah, 2021[60]	BinaxNow	USA	testing site	adults, children	sympt, HRC, asympt.	AN	81.4 (76.8 to 85.5)	99.6 (99.2 to 99.8)
Frediani, 2021[61]	BinaxNow	USA	unclear	adults, children	unclear	AN	56.2^# (29.9^# to 80.2^#)	100 (87.7^# to 100)
Ahmed, 2022[40]	ProDetect	Malaysia	unclear	adult, children	sympt, HRC,	AN	96.1^# (86.5^# to 99.5^#)	98.0 (89.1^# to 99.9^#)
Cardoso, 2022[41]	Wondfo	Brazil	testing site	unclear	sympt	AN*	73.0 (64.7^# to 80.2^#)	98.6 (95.2 to 99.8^#)
Chen, 2022[42]	Labnovation	China	clinical	adult	unclear	AN	70.4^# (49.8^# to 86.2^#)	100^# (29.2^# to 100^#)
Chen, 2022[42]	Labnovation	China	clinical	adult	unclear	AN	81.4^# (66.6^# to 91.6^#)	64.0^# (42.5^# to 82.0^#)
Gagnaire, 2022[44]	Biospeedia	France	testing site	adult, children	sympt, HRC, asympt.	AN/saliva	59.4 (51.5 to 67.0)	99.8 (99.7^# to 99.9)
Goodall, 2022[45]	Panbio	Canada	testing site	unclear	asympt.	AN*	64.5 (51.3^# to 76.3^#)	100 (99.5^# to 100^#)
Goodall, 2022[45]	Panbio	Canada	testing site	unclear	asympt.	TN*	64.5 (51.3^# to 76.3^#)	100 (99.5^# to 100^#)
Goodall, 2022[45]	Panbio	Canada	testing site	unclear	asympt.	AN*	68.4 (51.3^# to 82.5^#)	100 (99.2^# to 100^#)
Goodall, 2022[45]	Panbio	Canada	testing site	unclear	asympt.	TN*	81.6 (65.7^# to 92.3^#)	100 (99.2^# to 100^#)
Igloi, 2021[46]	Standard Q	Netherlands	testing site	adult	sympt., HRC	saliva*	66.1 (52.9 to 77.6)	99.6 (98.8 to 99.9)
Mane, 2022[47]	Coviself	India	testing site	adult	sympt., HRC	OP	54.2^# (39.2^# to 68.6^#)	96.9^# (92.9^# to 99.0^#)
Rangaiah, 2022[48]	Coviself	India	unclear	unclear	unclear	AN	61.5 (50.7 to 71.5)	100 (97.4 to 100)
Robinson, 2022[49]	BD Veritor nasal	USA	testing site	unclear	sympt., HRC,	AN	-	-
Savage, 2022[50]	Covios	UK	testing site	adult	sympt.	AN	90.5 (83.9 to 97.2)	99.4 (98.3 to 100)
Shin, 2022[51]	Standard Q	Korea	clinical	unclear	sympt., asympt.	AN	94.9 (87.5 to 98.6)	100 (98.3 to 100)
Sukumaran, 2022[52]	AG-Q	India	clinical	unclear	unclear	AN	77.9 (67.7 to 86.1)	100 (94.4 to 100)
Tsao, 2022[55]	BinaxNow	USA	testing site	adult	sympt., asympt.	AN	63.0 (50.9^# to 74.0^#)	99.8 (99.1^# to 100)
Wölfl-Duchek, 2022[53]	Medomics	Austria	clinical	adult	sympt., asympt.	AN	63.0 (47.5 to 76.8)	100 (91.0^# to 100)
Abbreviations: sympt. = symptomatic; asympt. = asymptomatic without known contact; HRC = high risk contact; AN = anterior nasal; OP = oropharyngeal; TN = throat. * RT-PCR sample was self-sampled. # Values have been recalculated due to missing or contradictory data.

Table 1b. Clinical accuracy data for self-testing Ag-RDTs.
Study	Test assessed	Country	Type of location	Study population	Screening criteria	Sample type	Sensitivity, % (95% CI)	Specificity, % (95% CI)
Lindner, 2021[11]	Standard Q	Germany	clinical	adults	sympt.	AN	82.5 (67.2^# to 92.7^#)	100 (96.5 to 100)
Stohr, 2022[22]	BD Veritor	Netherlands	testing site	adults	sympt., asympt.	AN	48.9 (41.3^# to 56.5^#)	99.9 (99.5 to 100)
Stohr, 2022[22]	Standard Q	Netherlands	testing site	adults	sympt., asympt.	AN	61.5 (54.2^# to 68.4^#)	99.7 (99.3 to 99.9)
De Meyer, 2022[25]	V-Chek	Belgium	testing site	adult, children	unclear	saliva	7.7 (0.2^# to 36.0^#)	100 (90.5^# to 100^#)
De Meyer, 2022[25]	Whistling	Belgium	testing site	adult, children	unclear	saliva	9.1 (3.0^# to 20.0^#)	100 (92.5^# to 100^#)
Diawara, 2022[26]	PCL	Morocco	unclear	adult, children	unclear	saliva	90.1 (80.7 to 95.9)	99.6 (97.9 to 99.9)
Diawara, 2022[26]	PCL	Morocco	unclear	adult, children	unclear	AN	91.4^# (82.3^# to 96.8^#)	100 (98.5 to 100)
Iftner, 2022[27]	Anbio	Germany	testing site	adult	asympt.	AN	-	99.8^# (98.8^# to 100^#)
Iftner, 2022[27]	Clungene	Germany	testing site	adult	asympt.	AN	-	97.9^# (96.2^# to 99.0^#)
Iftner, 2022[27]	Hotgen	Germany	testing site	adult	asympt.	AN	-	99.8^# (98.8^# to 100^#)
Iftner, 2022[27]	Mexacare	Germany	testing site	adult	asympt.	AN	-	99.8^# (98.8^# to 100^#)
Leventopoulos, 2022[28]	Boson	Greece	testing site	adult, children	sympt., asympt.	AN	98.2 (96.7 to 99.6)	100 (99.9 to 100)
Møller, 2022[29]	DNA Diagnostics	Denmark	testing site	adult	sympt, HRC, asympt.	AN	65.7 (49.2 to 79.2)	100 (99.0 to 100)
Møller, 2022[29]	Hangzhou	Denmark	testing site	adult	sympt, HRC, asympt.	AN	62.1 (50.1 to 72.9)	100 (98.9 to 100)
Schuit, 2022[31]	Flowflex	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	79.0 (74.7 to 82.8)	97.2 (93.9 to 98.9)
Schuit, 2022[31]	MPBio	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	69.9 (65.1 to 74.4)	98.8 (97.3 to 99.6)
Schuit, 2022[31]	Clinitest	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	70.2 (65.6 to 74.5)	99.3 (97.6 to 99.9)
Schuit, 2022[31]	MPBio	Netherlands	testing site	adult	sympt, HRC, asympt.	OP/nasal	83.0 (78.8 to 86.7)	97.8 (94.3 to 99.4)
Schuit, 2022[31]	Clinitest	Netherlands	testing site	adult	sympt, HRC, asympt.	OP/nasal	77.3 (82.9 to 81.2)	97.0 (93.9 to 98.8)
Schuit, 2022[30]	SD Biosensor	Netherlands	testing site	adult	sympt, HRC, asympt.	NP/OP	68.9 (61.6 to 75.6)	99.5 (99.2 to 99.8)
Schuit, 2022[30]	Hangzhou	Netherlands	testing site	adult	sympt, HRC, asympt.	NP/OP	46.7 (39.3 to 54.2)	99.0 (98.5 to 99.4)
Tonen-Wolyec, 2022[32]	Biosynex	France	testing site	adult	sympt, HRC, asympt.	AN	90.9 (70.8^# to 98.9^#)	100 (95.7^# to 100)
Venekamp, 2023[23]	FlowFlex	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	27.5 (21.3 to 34.3)	99.8 (99.3 to 100)
Venekamp, 2023[23]	MPBio	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	20.9 (13.9 to 29.4)	99.8 (99.2 to 100)
Venekamp, 2023[23]	Clinitest	Netherlands	testing site	adult	sympt, HRC, asympt.	AN	25.6 (19.1 to 33.1)	99.9 (99.5 to 100)
Zwart, 2022[24]	BD Veritor	Netherlands	clinical	adult	sympt., asympt.	OP/nasal	61.5 (56.6 to 66.3)	100 (99.8 to 100)
Zwart, 2022[24]	BD Veritor	Netherlands	clinical	adult	sympt., asympt.	AN	50.3 (43.0^# to 57.6^#)	99.7 (99.3 to 99.8)
Zwart, 2022[24]	Roche	Netherlands	clinical	adult	sympt., asympt.	OP/nasal	74.3^# (66.6^# to 81.1^#)	99.7 (99.4^# to 99.9)
Abbreviations: sympt. = symptomatic; asympt. = asymptomatic without known contact; HRC = high risk contact; AN = anterior nasal; OP = oropharyngeal; TN = throat. * RT-PCR sample was self-sampled. # Values have been recalculated due to missing or contradictory data.

Concordance with professional-use Ag-RDTs

The concordance between self-testing and professional testing was only reported in one study, which found high concordance with a kappa of 0.94 [11]. The concordance between self-sampling and professional testing was reported in six studies and ranged from 0.86 to 0.93 [13, 39, 42, 43, 58]. The pooled Cohen’s kappa for self-sampling studies was 0.91 (95% CI 0.88 to 0.94) (Fig. 3).

We also performed an exploratory analysis of concordance combining datasets from self-sampling and self-testing studies, assuming that sampling is a major driver of differences between self-testing and professional testing. We observed a pooled Cohen’s kappa of 0.92 (95% CI 0.89 to 0.95) (Supplemental Fig. 3).

Performance of self-testing and self-sampling in comparison to RT-PCR

When comparing the performance of self-testing using Ag-RDTs to the reference standard, sensitivity ranged widely from 7.7% [25] to 98.2% [28]. Specificity was high, above 99.5% in all datasets.

Across 36 datasets from 31 self-sampling studies, sensitivity again ranged widely from 20.0% [33] to 100% [35] with wide CIs. Specificity for self-sampling studies ranged from 96.4% [58] to 100% [12] with narrow CIs. Sensitivity of ≥ 80% was achieved in 15 self-sampling [12, 35, 37–40, 42, 43, 45, 50, 51, 56–58, 60] and five self-testing studies [11, 26, 28, 31, 32].

A total of 54 datasets assessing 55,115 self-tested or self-sampled samples were eligible for meta-analysis. The meta-analysed summary estimates of sensitivity and specificity across both self-sampling and self-testing datasets were 70.5% (95% CI 64.3 to 76.0) and 99.4% (95% CI 99.1 to 99.6), respectively. The pooled sensitivities for self-tested (23 datasets) and self-sampled (31 datasets) samples were 66.1% (95% CI 53.5 to 76.7) and 73.5% (95% CI 67.4 to 78.7), respectively.

When only AN samples (40 datasets, 74.1%) were considered, the pooled sensitivity marginally increased to 72.9% (95% CI 65.8 to 79.0). Test-specific summary estimates of sensitivity were possible for BinaxNow (6 datasets), Standard Q nasal (6 datasets) and Panbio (Abbott, Germany; henceforth called Panbio) (6 datasets), resulting in sensitivities of 63.5% (95% CI 43.4 to 79.8), 79.8% (95% CI 66.0 to 88.9), and 67.7% (95% CI 60.8 to 73.8), respectively. Data were insufficient for a meta-analysis of other Ag-RDTs or sample types. Supplementary Table S1 provides the full ranges for the clinical performance of each Ag-RDT.

IFU-Conformity

Across all self-sampling and self-testing datasets, the overall summary estimate of sensitivity for all IFU-conforming studies was 71.3% (95% CI 64.5 to 77.3) (Fig. 4A), with marginal differences between self-testing and self-sampling studies (Supplement Fig. 4, 5). In total, three datasets had unclear IFU-conformity, with sensitivity ranging from 48.9% [22] to 78.6% [36].

In the one study in which participants were observed as they self-tested, most deviations from the instructions occurred during the sampling procedure, with 41.8% of participants failing to rub the swab against the nasal walls [11]. Another common mistake made during sampling was too little rotation time in the nose (24.1%) [11]. During the testing procedure, the steps with the most frequent deviations were squeezing the tube while the swab was still inside (34.9%) and squeezing the tube while the swab was being removed (33.1%). These deviations, however, did not appear to impact test performance in this study, as performance against RT-PCR (sensitivity 82.5%) was acceptable and concordance with professional testing was high (kappa 0.91).

Presence of Symptoms

The summary estimates of sensitivity across all studies were lower in the asymptomatic group compared to the symptomatic group, with 38.1% (95% CI 23.4 to 55.3) compared to 77.4% (95% CI 71.1 to 82.6), respectively (Fig. 4B). Specificity was above 99.0% in both subgroups. Self-testing studies, which are included in the pooled analysis, reported a range of sensitivity from 51.0% [30] to 82.5% [11] in symptomatic persons.

Duration of Symptoms (DoS)

We were unable to perform a bivariate subgroup meta-analysis for a DoS of more than seven days (DoS > 7) due to an insufficient number of available datasets (n = 1). The reported sensitivity and specificity in this study were 53.8% and 100%, respectively [56]. The pooled estimates of sensitivity and specificity in studies reporting DoS ≤ 7 were 79.4% (95% CI 72.7 to 84.8) and 99.4% (95% CI 98.9 to 99.7), respectively.

Ct Values

For the subgroup analysis based on Ct value range, 22 datasets from nine self-sampling studies were available for univariate meta-analysis. For the Ct value groups < 25 and < 30, the pooled sensitivities were 93.6% (95% CI 90.4 to 96.8) and 76.6% (95% CI 57.6 to 95.6), respectively (Fig. 4C).

Testing using self-sampling in patients who had samples with Ct values ≥ 25 and ≥ 30 showed a broader range, with pooled sensitivities of 35.9% (95% CI 9.8 to 62.0) and 10.2% (95% CI 0.0 to 28.1), respectively.

One self-testing study reported a sensitivity of 85.0% and a specificity of 99.1% when only samples with high viral load (≥ 7.0 log₁₀ SARS-CoV-2 RNA copies/mL) were analyzed [11].

Age

Across all the studies included in the review, we had 32 datasets with samples from people aged 18 years and older (‘≥18 years’), achieving a pooled sensitivity of 65.5% (95% CI 57.8 to 72.4) (Fig. 4D). For the ‘<18 years’ group, a meta-analysis was not possible, as only three datasets were available for this age group. However, the reported sensitivity in these three datasets had a comparable range to that in the ‘≥18 years’ group (71.4% [38] to 92.3% [37]). The pooled specificity was 99.6% (95% CI 99.2 to 99.8) in the ‘≥18 years’ group and was above 99.6% in all datasets in the ‘<18 years’ group.

Virus variant

VoC could be determined for 53 of the 54 datasets; wild type was observed in 21 of these (39.6%). The pooled sensitivity across these 21 datasets was 69.8% (95% CI 62.5 to 76.3) and the pooled specificity was 99.7% (95% CI 99.5 to 99.8). The highest sensitivity was found across studies conducted when the Alpha VoC (8 datasets, 15.1%) was predominant, at 78.5% (95% CI 60.8 to 89.6). Across studies conducted during an Omicron wave (4 datasets, 7.5%), the pooled sensitivity was significantly lower, at 32.8% (95% CI 17.8 to 52.3). When Delta (6 datasets, 11.3%) was predominant, the pooled sensitivity was higher, at 57.8% (95% CI 28.0 to 82.8). However, studies conducted when both Delta and Omicron were predominant had a pooled sensitivity of 76.1% (95% CI 70.7 to 80.7) (Fig. 5).

Self-testing studies showed similar pooled estimates for sensitivity for wild type, combined Delta/Omicron, and Alpha VoC, with 62.6% (95% CI 52.2 to 72.0), 76.1% (95% CI 70.7 to 80.7), and 85.3% (95% CI 54.0 to 96.6), respectively.
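The dataset shares reported for the variant subgroups appear to use the 53 datasets with a determined VoC as the denominator, rather than all 54 datasets. A short arithmetic check under that assumption, using only the counts stated in this section:

```python
# Number of datasets for which the predominant variant could be determined.
datasets_with_voc = 53

# Dataset counts per variant subgroup, as stated in the text.
counts = {"wild type": 21, "Alpha": 8, "Omicron": 4, "Delta": 6}

# Share of each subgroup, in percent, rounded to one decimal place.
shares = {name: round(100 * n / datasets_with_voc, 1) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name}: {share}%")
```

Under this assumption the computed shares match the reported 39.6%, 15.1%, 7.5% and 11.3%.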

Middle-income countries (MIC) vs. High-income countries (HIC)

Studies conducted in high-income countries (HIC) accounted for 44 datasets (53,090 samples), resulting in a pooled sensitivity and specificity of 67.6% (95% CI 60.5 to 74.0) and 99.5% (95% CI 99.3 to 99.7), respectively. In contrast, studies from MIC (10 datasets; 2,025 samples) had higher sensitivity and comparable specificity, at 81.0% (95% CI 70.4 to 88.4) and 98.1% (95% CI 93.9 to 99.4), respectively (Supplement Figs. 6 and 7).

Sensitivity Analysis

When excluding case-control studies (5 datasets), the sensitivity remained comparable to the overall pooled sensitivity estimate with 69.5% (95% CI 62.8 to 75.5) (Supplement Fig. 8).

Datasets from manufacturer-independent studies (40 datasets; 20 self-testing studies) achieved an accuracy comparable to the overall summary estimates with a pooled sensitivity of 66.5% (95% CI 59.2 to 73.1) and a pooled specificity of 99.5% (95% CI 99.1 to 99.7) (Supplement Fig. 9). Excluding preprints (5 datasets) resulted in no substantial change in sensitivity (69.9% [95% CI 63.2 to 75.8]) and specificity (99.4% [95% CI 99.0 to 99.6]) (Supplement Fig. 10).

Certainty of Evidence (CoE)

We found CoE to be high for specificity and sensitivity, and low for concordance and user errors. As for ‘imprecision’, we downgraded the CoE for concordance by one point due to the low number of studies and small sample size. For studies assessing concordance and user errors, ‘inconsistency’ was rated ‘serious’ and consequently also downgraded by one point, since there was only one study available (Table 2).

Table 2

GRADE table: Should COVID-19 self-testing, defined as self-sampling, processing of the sample and self-readout using Ag-RDTs, be offered as an additional approach to professionally administered testing services? The following table summarizes the certainty of evidence according to the GRADE approach.
Certainty assessment								Impact	Certainty	Importance
№ of studies	Study design	Risk of bias	Inconsistency	Indirectness	Imprecision		Other considerations	Impact	Certainty	Importance
Accuracy – sensitivity (Ag-RDT self-testing vs. rRT-PCR)
23 [11, 22, 31, 32, 23–30]	observational studies	not serious^a	not serious^b	not serious^c	not serious^d	none		Normalized to a study population with 1,000 participants and 10% prevalence, 66 true positive and 34 false negative self-testing results were reported. Pooled sensitivity was 66.1% (95% CI 53.5 to 76.7)	⨁⨁⨁⨁ High	CRITICAL
Accuracy – specificity (Ag-RDT self-testing vs. rRT-PCR)
23 [11, 22, 31, 32, 23–30]	observational studies	not serious^a	not serious^b	not serious^c	not serious^d	none		Normalized to a study population with 1,000 participants and 10% prevalence, 874 true negative and 2 false positive self-testing results were reported. Pooled specificity was high with 99.5% (95% CI 99.1 to 99.7)	⨁⨁⨁⨁ High	CRITICAL
Accuracy – concordance (Ag-RDT self-testing vs. Ag-RDT performed by professionals)
1 [11]	observational studies	not serious^a	serious^b	not serious^c	serious^d	none		Kappa: 0.92 (out of 1.00); (95% CI 0.85 to 1.00)	⨁⨁◯◯ Low	CRITICAL
Accuracy – Proportion of user errors
1 [11]	observational studies	not serious^a	serious^b	not serious^c	not serious^e	none		Study participants deviated from instructions in 15.5% of the sampling steps and 15.0% of the testing steps. However, these deviations did not impede the self-test's performance.	⨁⨁◯◯ Low	IMPORTANT
Explanation: a. We used QUADAS-2 to assess risk of bias. The studies enrolled patients consecutively and assessed the self-testing, defined as self-sampling and self-performing the Ag-RDT, results blinded to the reference standard result (rRT-PCR or prof. Ag-RDT testing). While for one study it was not clear whether all self-tests were performed as per manufacturer’s instructions, this was ensured in the other. Furthermore, we could not detect any potential bias resulting from the study flow and timing. Therefore, we did not downgrade the quality of evidence for this criterion.
b. The heterogeneity/inconsistency in findings, as shown by the wide-ranging point estimates with only marginally overlapping confidence intervals, is likely to originate from differences in the study population. This is strengthened by the fact that the head-to-head comparison between self-testing and professional testing on the same study population shows similar performance of Ag-RDTs. However, as there are only a few studies available for concordance and one study for user errors, we downgrade for these two outcomes by one.
c. Following current guidance from the GRADE guideline, we do not downgrade by one point for all studies but acknowledge that the study populations are not fully representative of the populations of interest. Furthermore, the intervention did not differ from the one of interest and outcomes were reported directly, therefore indirectness was judged 'not serious'.
d. The number of studies and sample size were small, and only one study reported on concordance between self-testing and professional testing using Ag-RDTs.
e. For this outcome only qualitative data, or quantitative data in isolated studies in well-described but not comparable settings were available, therefore the criterion 'imprecision' is negligible and rated as 'not serious'.
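The normalization used in the sensitivity row of Table 2 can be sketched as follows: a pooled sensitivity is applied to the expected number of infected persons in a population of 1,000 at 10% prevalence. This is an illustrative calculation only; the function name `expected_positive_split` is an assumption of this sketch and not part of the GRADE tooling.

```python
def expected_positive_split(population, prevalence, sensitivity):
    """Expected true positives and false negatives among infected persons."""
    # Number of infected persons in the normalized population.
    infected = round(population * prevalence)
    # Infected persons correctly identified by the test.
    true_pos = round(infected * sensitivity)
    # The remaining infected persons are missed by the test.
    false_neg = infected - true_pos
    return true_pos, false_neg


# Values from Table 2: 1,000 participants, 10% prevalence, sensitivity 66.1%.
tp, fn = expected_positive_split(population=1000, prevalence=0.10, sensitivity=0.661)
print(tp, fn)
```

With these inputs the sketch reproduces the 66 true positive and 34 false negative results stated in the table.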

Our systematic review and meta-analysis found that concordance between self-testing/self-sampling and professional testing using Ag-RDTs is very high, with a pooled Cohen’s kappa of 0.92 (95% CI 0.89 to 0.95). Compared to RT-PCR, the sensitivity of self-testing/self-sampling across all studies included in our review (70.5% [95% CI 64.3 to 76.0]) was estimated to be almost the same as that of Ag-RDTs when performed by professionals (72.0% [8]). The summary point estimate of sensitivity for self-testing studies (66.1% [95% CI 53.5 to 76.7]) was also comparable to that of professionally conducted Ag-RDTs, with overlapping CIs.

Pooled sensitivity across self-testing and self-sampling studies increased to 77.4% (95% CI 71.1 to 82.6) in symptomatic persons, which is in line with the results of earlier reports that showed that presence of symptoms was a key variable affecting sensitivity of Ag-RDTs and correlated with viral load [8, 63]. Nevertheless, neither the overall nor the symptomatic pooled sensitivity achieved the WHO sensitivity target of ≥ 80% [10]. Notably, a recent meta-analysis found a pooled sensitivity of 91.1% for self-taken nasal Ag-RDTs [64].

The results of subgroup analysis based on Ct values are consistent with those of earlier studies, suggesting that viral load is the main determinant of test sensitivity, irrespective of the sampling procedure or the person administering the test [8]. In addition, it is worth noting that in most cases (60.0% of datasets), the sampling process was unsupervised, which implies the general applicability of our findings to unobserved home-testing. Moreover, even though deviations from the IFU did occur in some cases, this did not appear to have an impact on test performance [11].

Although limited, the data on deviations from sampling and testing procedures demonstrated that most instruction deviations occurred during sampling, supporting our approach to conduct a pooled exploratory analysis of self-sampling and self-testing. This was additionally bolstered by a positive self-judgement of test execution and interpretation, showing confidence of lay-users to perform Ag-RDTs reliably [22]. Moreover, one study reported that healthcare professionals and laypersons had a high level of readout agreement when clear instructions with illustrations were available [11]. It is, however, crucial to note that the observed sampling deviations are more likely to affect test sensitivity than specificity, because poor sampling is likely to result in decreased sample quality, and thus lower viral load, leading to false negative results. Nevertheless, the results of the sensitivity analysis showed that the pooled sensitivity estimate for self-testing studies is still lower than that for self-sampling studies, which suggests that self-sampling is not the only variable influencing the differences between self-testing and professional testing. To fully understand all the variables and how they affect test performance, more research is necessary.

Our subgroup analysis on VoC showed higher sensitivity when Delta and Omicron were predominant (76.1% [95% CI 70.7 to 80.7]) compared to Omicron alone (32.8% [95% CI 17.8 to 52.3]). However, the four datasets for the Omicron analysis came from two studies [55, 65]. Both studies included primarily asymptomatic persons and had a > 92% vaccination rate, likely resulting in a lower viral load and thus affecting test sensitivity [55, 65].

Our study has several strengths. We thoroughly assessed the included studies with the QUADAS-2 tool using an a-priori developed interpretation guide. In addition, our review was supported by an independent methodologist and followed rigorous methods, aligning with other WHO-commissioned reviews for self-testing. Furthermore, we report on both peer-reviewed articles and preprints from a period that nearly covers the whole pandemic. Another strength of this study lies within our subgroup analyses that provide a clearer picture of the accuracy of self-sampling and self-testing across different populations and testing approaches.

Our systematic review is, however, limited by the small number of studies that were deemed eligible (particularly those evaluating self-testing) as well as the shortcomings of these studies as revealed by the quality assessment. Another drawback is the degree to which study participants, with a relatively high rate of symptomatic individuals with prior training or testing experience, are representative of the general population. Furthermore, the majority of studies were conducted in HIC; at the same time, populations in MIC, particularly those with a high burden of HIV, were likely to have more experience with self-testing compared to HIC at the beginning of the pandemic [3]. Recent reports find good concordance between COVID-19 self-testing and professionally conducted Ag-RDTs in a middle-income country [66]. This is corroborated by our subgroup analysis, which found a higher pooled estimate of sensitivity in MIC compared to HIC.

Self-testing and/or self-sampled testing using Ag-RDTs likely achieves similar accuracy as professional-use Ag-RDTs. In the light of the evidence presented in this review and other supporting studies, the WHO recommends COVID-19 self-testing to scale-up testing capacity [67, 68]. Further evidence is required to assess the impact of testing strategies including self-testing on the population-level control of SARS-CoV-2 transmission.

Ag-RDT	Antigen detection rapid diagnostic test
AN	Anterior nasal
CI	Confidence interval
CoE	Certainty of Evidence
Ct	Cycle threshold
DoS	Duration of symptoms
FN	False negative
FP	False positive
HIC	High-income countries
IFU	Instructions for use
MIC	Middle-income countries
NAATs	Nucleic acid amplification tests
NMT	Nasal mid-turbinate
NP	Nasopharyngeal
NPA	Negative percentage agreement
OP	Oropharyngeal
OPA	Overall percentage agreement
POC	Point of care
PPA	Positive percentage agreement
PRISMA	Preferred Reporting Items for Systematic Reviews and Meta-Analyses
RCT	Randomized controlled trial
RT-PCR	Reverse transcription polymerase chain reaction
TN	True negative
TP	True positive
VoC	Variant of Concern
WHO	World Health Organization

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and materials

The raw data is available under https://doi.org/10.11588/data/P9JEPG

Competing interests

All authors declare that they have no conflict of interest.

Funding

This work was supported by the Ministry of Science, Research and Arts of the State of Baden-Wuerttemberg, Germany (no grant number; https://mwk.badenwuerttemberg.de/de/startseite/) and internal funds from the Heidelberg University Hospital (no grant number; https://www.heidelberg-university-hospital.com/de/) to CMD. Further, this project was funded by United Kingdom (UK) aid from the British people (grant number: 300341-102; Foreign, Commonwealth & Development Office (FCDO), formerly the UK Department for International Development (DFID); www.gov.uk/fcdo), and supported by a grant from the World Health Organization (WHO; no grant number; https://www.who.int) and a grant from Unitaid (grant number: 2019-32-FIND MDR; https://unitaid.org) to the Foundation for Innovative New Diagnostics (FIND; JAS, SC, SO, AM, BE). For the publication fee we acknowledge financial support by Deutsche Forschungsgemeinschaft within the funding programme “Open Access Publikationskosten” (no grant number; https://www.dfg.de/en/index.jsp), as well as by Heidelberg University (no grant number; https://www.uni-heidelberg.de/en). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Authors’ contributions

SK, LEB, CMD and SY made substantial contributions to the conception of this work. SK, LEB, CMD and SY designed the work. MGr performed the literature search. SK, LEB, KM, HT, CE and SS performed the data acquisition. MGa, FT, AM, BE, and SY performed the data analysis. SK, LEB, JAS, CCJ, NRP, CMD and SY contributed to the interpretation of data. SK drafted the manuscript and all authors have substantively revised it. The final version of this manuscript was read and approved by all authors.

Acknowledgements

Not applicable

Tahlil KM, Ong JJ, Rosenberg NE, Tang W, Conserve DF, Nkengasong S, et al. Verification of HIV Self-Testing Use and Results: A Global Systematic Review. AIDS Patient Care STDS 2020;34:147–56. https://doi.org/10.1089/apc.2019.0283.
Devillé W, Tempelman H. Feasibility and robustness of an oral HIV self-test in a rural community in South-Africa: An observational diagnostic study. PLoS One 2019;14:1–13. https://doi.org/10.1371/journal.pone.0215353.
Figueroa C, Johnson C, Ford N, Sands A, Dalal S, Meurant R, et al. Reliability of HIV rapid diagnostic tests for self-testing compared with testing by health-care workers: a systematic review and meta-analysis. Lancet HIV 2018;5:e277–90. https://doi.org/10.1016/S2352-3018(18)30044-4.
Eshun-Wilson I, Jamil MS, Witzel TC, Glidden DV, Johnson C, Le Tourneau N, et al. A Systematic Review and Network Meta-analyses to Assess the Effectiveness of Human Immunodeficiency Virus (HIV) Self-testing Distribution Strategies. Clin Infect Dis 2021;73:E1018–28. https://doi.org/10.1093/cid/ciab029.
World Health Organization. Recommendations and guidance on hepatitis C virus self-testing. Web Annex D, Values and preferences on hepatitis C virus self-testing. 2021.
World Health Organization. Guidelines on HIV Self-Testing and Partner Notification: Supplement to Consolidated Guidelines on HIV Testing Services. 2016.
World Health Organization. Recommendations and guidance on hepatitis C virus self-testing 2021:32.
Brümmer LE, Katzenschlager S, Mcgrath S, Schmitz S, Gaeddert M, Erdmann C, et al. Accuracy of rapid point-of-care antigen-based diagnostics for SARS-CoV-2: an updated systematic review and meta-analysis with meta regression analyzing influencing factors. PLoS Med 2022;19:1–36. https://doi.org/10.1371/journal.pmed.1004011.
World Health Organization. Antigen-detection in the diagnosis of SARS-CoV-2 infection - Interim guidance 2021.
World Health Organization. Antigen-detection in the diagnosis of SARS-CoV-2 infection using rapid immunoassays Interim guidance, 11 September 2020. 2020:1–9.
Lindner AK, Nikolai O, Rohardt C, Kausch F, Wintel M, Gertler M, et al. Diagnostic accuracy and feasibility of patient self-testing with a SARS-CoV-2 antigen-detecting rapid test. J Clin Virol 2021;141. https://doi.org/10.1016/j.jcv.2021.104874.
Harris DT, Badowski M, Jernigan B, Sprissler R, Edwards T, Cohen R, et al. SARS-CoV-2 rapid antigen testing of symptomatic and asymptomatic individuals on the University of Arizona campus. Biomedicines 2021;9. https://doi.org/10.3390/biomedicines9050539.
Lindner AK, Nikolai O, Rohardt C, Burock S, Hülso C, Bölke A, et al. Head-to-head comparison of SARS-CoV-2 antigen-detecting rapid test with self-collected anterior nasal swab versus professional-collected nasopharyngeal swab. Eur Respir J 2020:2–9. https://doi.org/10.1101/2020.12.03.20243725.
Brümmer LE, Katzenschlager S, Gaeddert M, Erdmann C, Schmitz S, Bota M, et al. Accuracy of novel antigen rapid diagnostics for SARS-CoV-2: A living systematic review and meta-analysis. PLoS Med 2021;18:1–41. https://doi.org/10.1371/journal.pmed.1003735.
Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021;372. https://doi.org/10.1136/bmj.n71.
Whiting PF, Rutjes AWS, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: A Revised Tool for the Quality Assessment of Diagnostic Accuracy Studies. Ann Intern Med 2011;155:529–36. https://doi.org/10.7326/0003-4819-155-8-201110180-00009.
Alonso-Coello P, Oxman AD, Moberg J, Brignardello-Petersen R, Akl EA, Davoli M, et al. GRADE Evidence to Decision (EtD) frameworks: a systematic and transparent approach to making well informed healthcare choices. 2: Clinical practice guidelines. BMJ 2016;353:i2089. https://doi.org/10.1136/bmj.i2089.
Sun S. Meta-analysis of Cohen’s kappa. Heal Serv Outcomes Res Method 2011;11:145–63. https://doi.org/10.1007/s10742-011-0077-3.
Hodcroft E. CoVariants: SARS-CoV-2 Mutations and Variants of Interest 2021. https://covariants.org (accessed March 21, 2023).
World Health Organization. Tracking SARS-CoV-2 variants. 2023.
Van Enst WA, Ochodo E, Scholten RJ, Hooft L, Leeflang MM. Investigation of publication bias in meta-analyses of diagnostic test accuracy: A meta-epidemiological study. BMC Med Res Methodol 2014;14:1–11. https://doi.org/10.1186/1471-2288-14-70.
Stohr JJJM, Zwart VF, Goderski G, Meijer A, Nagel-Imming CRS, Kluytmans-van den Bergh MFQ, et al. Self-testing for the detection of SARS-CoV-2 infection with rapid antigen tests for people with suspected COVID-19 in the community. Clin Microbiol Infect 2021. https://doi.org/10.1016/j.cmi.2021.07.039.
Venekamp RP, Schuit E, Hooft L, Veldhuijzen IK, van den Bijllaardt W, Pas SD, et al. Diagnostic accuracy of SARS-CoV-2 rapid antigen self-tests in asymptomatic individuals in the omicron period: a cross-sectional study. Clin Microbiol Infect 2023;29:391.e1-391.e7. https://doi.org/10.1016/j.cmi.2022.11.004.
Zwart VF, Moeren N van der, Stohr JJJM, Feltkamp MCW, Bentvelsen RG, Diederen BMW, et al. Performance of Various Lateral Flow SARS-CoV-2 Antigen Self Testing Methods in Healthcare Workers: a Multicenter Study. MedRxiv 2022:2022.01.28.22269783. https://doi.org/10.1101/2022.01.28.22269783.
De Meyer J, Goris H, Mortelé O, Spiessens A, Hans G, Jansens H, et al. Evaluation of Saliva as a Matrix for RT-PCR Analysis and Two Rapid Antigen Tests for the Detection of SARS-CoV-2. Viruses 2022;14:1931. https://doi.org/10.3390/v14091931.
Diawara I, Ahid S, Jeddane L, Kim S, Nejjari C. Saliva-based COVID-19 Rapid Antigen Test: a practical and accurate alternative mass screening method. MedRxiv 2022:2022.10.24.22278691. https://doi.org/10.1101/2022.10.24.22278691.
Iftner T, Iftner A, Pohle D, Martus P. Evaluation of the specificity and accuracy of SARS-CoV-2 rapid antigen self-tests compared to RT-PCR from 1015 asymptomatic volunteers. MedRxiv 2022. https://doi.org/10.1101/2022.02.11.22270873.
Leventopoulos M, Michou V, Papadimitropoulos M, Vourva E, Manias NG, Kavvadas HP, et al. Evaluation of the Boson rapid Ag test vs RT–PCR for use as a self–testing platform. Diagn Microbiol Infect Dis 2022;104:115786. https://doi.org/10.1016/j.diagmicrobio.2022.115786.
Møller IJB, Utke AR, Rysgaard UK, Østergaard LJ, Jespersen S. Diagnostic performance, user acceptability, and safety of unsupervised SARS-CoV-2 rapid antigen-detecting tests performed at home. Int J Infect Dis 2022;116:358–64. https://doi.org/10.1016/j.ijid.2022.01.019.
Schuit E, Venekamp RP, Veldhuijzen IK, van den Bijllaardt W, Pas SD, Stohr JJJM, et al. Head-to-head comparison of the accuracy of saliva and nasal rapid antigen SARS-CoV-2 self-testing: cross-sectional study. BMC Med 2022;20:406. https://doi.org/10.1186/s12916-022-02603-x.
Schuit E, Venekamp RP, Hooft L, Veldhuijzen IK, van den Bijllaardt W, Pas SD, et al. Diagnostic accuracy of covid-19 rapid antigen tests with unsupervised self-sampling in people with symptoms in the omicron period: cross sectional study. BMJ 2022:e071215. https://doi.org/10.1136/bmj-2022-071215.
Tonen-Wolyec S, Dupont R, Awaida N, Batina-Agasa S, Hayette M-P, Bélec L. Evaluation of the Practicability of Biosynex Antigen Self-Test COVID-19 AG+ for the Detection of SARS-CoV-2 Nucleocapsid Protein from Self-Collected Nasal Mid-Turbinate Secretions in the General Public in France. Diagnostics 2021;11:2217. https://doi.org/10.3390/diagnostics11122217.
Tinker SC, Szablewski CM, Litvintseva AP, Drenzek C, Voccio GE, Hunter MA, et al. Point-of-care antigen test for sars-cov-2 in asymptomatic college students. Emerg Infect Dis 2021;27:2662–5. https://doi.org/10.3201/eid2710.210080.
Tanimoto Y, Mori A, Miyamoto S, Ito E, Arikawa K, Iwamoto T. Comparison of RT-PCR, RT-LAMP, and antigen quantification assays for the detection of SARS-CoV-2. Jpn J Infect Dis 2021. https://doi.org/10.7883/yoken.JJID.2021.476.
Mak GCK, Au SSM, Yeung MCW, Lau DMW, Lau KKS, Ng KHL, et al. Evaluation of rapid antigen detection test for individuals at risk of SARS-CoV-2 under quarantine. J Med Virol 2022;94:819–20. https://doi.org/10.1002/jmv.27369.
Blanchard A, Desforges M, Labbé A-C, Nguyen CT, Petit Y, Besner D, et al. Evaluation of real-life use of Point-Of-Care Rapid Antigen Testing for SARS-CoV-2 in schools for outbreak control (EPOCRATES). MedRxiv Prepr Serv Heal Sci 2021. https://doi.org/10.1101/2021.10.13.21264960.
Harmon A, Chang C, Salcedo N, Sena B, Herrera BB, Bosch I, et al. Validation of an At-Home Direct Antigen Rapid Test for COVID-19. JAMA Netw Open 2021;4:10–3. https://doi.org/10.1001/jamanetworkopen.2021.26931.
Ford L, Whaley MJ, Shah MM, Salvatore PP, Segaloff HE, Delaney A, et al. Antigen Test Performance Among Children and Adults at a SARS-CoV-2 Community Testing Site. J Pediatric Infect Dis Soc 2021;10:1052–61. https://doi.org/10.1093/jpids/piab081.
Klein JAF, Krüger LJ, Tobian F, Gaeddert M, Lainati F, Schnitzler P, et al. Head-to-head performance comparison of self-collected nasal versus professional-collected nasopharyngeal swab for a WHO-listed SARS-CoV-2 antigen-detecting rapid diagnostic test. Med Microbiol Immunol 2021;210:181–6. https://doi.org/10.1007/s00430-021-00710-9.
Ahmed N, Kalil MNA, Yusof W, Bakar MAA, Sjahid AS, Hassan R, et al. A Performance Assessment Study of Different Clinical Samples for Rapid COVID-19 Antigen Diagnosis Tests. Diagnostics 2022;12:847. https://doi.org/10.3390/diagnostics12040847.
Cardoso JM de O, Roatt BM, Vieira PM de A, de Paiva NCN, Bernardes-Souza B, Lisboa OC, et al. Performance of the Wondfo 2019-nCoV antigen test using self-collected nasal versus professional-collected nasopharyngeal swabs in symptomatic SARS-CoV-2 infection. Diagnosis 2022;9:398–402. https://doi.org/10.1515/dx-2022-0003.
Chen M, Xu J, Ying L, Cai M, Tung TH, Zhou K, et al. Clinical practice of rapid antigen tests for SARS-CoV-2 Omicron variant: A single-center study in China. Virol Sin 2022;37:842–9. https://doi.org/10.1016/j.virs.2022.08.008.
Nikolai O, Rohardt C, Tobian F, Junge A, Corman VM, Jones TC, et al. Anterior nasal versus nasal mid-turbinate sampling for a SARS-CoV-2 antigen-detecting rapid test: does localisation or professional collection matter? Infect Dis (Auckl) 2021;53:947–52. https://doi.org/10.1080/23744235.2021.1969426.
Gagnaire J, Bonjean P, Verot E, Boulamail B, Labetoulle R, Gonzalo S, et al. SARS-CoV-2 rapid test versus RT-qPCR on noninvasive respiratory self-samples during a city mass testing campaign. J Infect 2022;85:90–122. https://doi.org/10.1016/j.jinf.2022.04.001.
Goodall BL, LeBlanc JJ, Hatchette TF, Barrett L, Patriquin G. Investigating the Sensitivity of Nasal or Throat Swabs: Combination of Both Swabs Increases the Sensitivity of SARS-CoV-2 Rapid Antigen Tests. Microbiol Spectr 2022;10. https://doi.org/10.1128/spectrum.00217-22.
Igloi Z, Velzing J, Huisman R, Geurtsvankessel C, Comvalius A, IJpelaar J, et al. Clinical evaluation of the SD Biosensor SARS-CoV-2 saliva antigen rapid test with symptomatic and asymptomatic, non-hospitalized patients. PLoS One 2021;16:e0260894. https://doi.org/10.1371/journal.pone.0260894.
Mane A, Jain S, Jain A, Pereira M, Sirsat A, Pathak G, et al. Diagnostic performance of oral swab specimen for SARS-CoV-2 detection with rapid point-of-care lateral flow antigen test. Sci Rep 2022;12:7355. https://doi.org/10.1038/s41598-022-11284-8.
Rangaiah A, Shankar SM, Padukone S, Shah PA, Rangappa KG, Vijay N, et al. New phase of diagnostics with India’s first home-based COVID-19 Rapid Antigen Detection kit: Brief evaluation and validation of CoviSelf^TM through a pilot study. Indian J Med Microbiol 2022;40:320–1. https://doi.org/10.1016/j.ijmmb.2022.01.008.
Robinson ML, Mirza A, Gallagher N, Boudreau A, Garcia Jacinto L, Yu T, et al. Limitations of Molecular and Antigen Test Performance for SARS-CoV-2 in Symptomatic and Asymptomatic COVID-19 Contacts. J Clin Microbiol 2022;60. https://doi.org/10.1128/jcm.00187-22.
Savage HR, Finch L, Body R, Watkins RL, Hayward G, Cook E, et al. A prospective diagnostic evaluation of accuracy of self-taken and healthcare worker-taken swabs for rapid COVID-19 testing. PLoS One 2022;17:e0270715. https://doi.org/10.1371/journal.pone.0270715.
Shin H, Lee S, Widyasari K, Yi J, Bae E, Kim S. Performance evaluation of STANDARD Q COVID‐19 Ag home test for the diagnosis of COVID‐19 during early symptom onset. J Clin Lab Anal 2022;36:1–6. https://doi.org/10.1002/jcla.24410.
Sukumaran A, Suvekbala V, R AK, Thomas RE, Raj A, Thomas T, et al. Diagnostic Accuracy of SARS-CoV-2 Nucleocapsid Antigen Self-Test in Comparison to Reverse Transcriptase–Polymerase Chain Reaction. J Appl Lab Med 2022;7:871–80. https://doi.org/10.1093/jalm/jfac023.
Wölfl-Duchek M, Bergmann F, Jorda A, Weber M, Müller M, Seitz T, et al. Sensitivity and Specificity of SARS-CoV-2 Rapid Antigen Detection Tests Using Oral, Anterior Nasal, and Nasopharyngeal Swabs: a Diagnostic Accuracy Study. Microbiol Spectr 2022;10. https://doi.org/10.1128/spectrum.02029-21.
Okoye NC, Barker AP, Curtis K, Orlandi RR, Snavely EA, Wright C, et al. Performance characteristics of BinaxNOW COVID-19 antigen card for screening asymptomatic individuals in a university setting. J Clin Microbiol 2021;59:1–20. https://doi.org/10.1128/JCM.03282-20.
Tsao J, Kussman AL, Costales C, Pinsky BA, Abrams GD, Hwang CE. Accuracy of Rapid Antigen vs Reverse Transcriptase–Polymerase Chain Reaction Testing for SARS-CoV-2 Infection in College Athletes During Prevalence of the Omicron Variant. JAMA Netw Open 2022;5:e2217234. https://doi.org/10.1001/jamanetworkopen.2022.17234.
Krüger LJ, Klein JAF, Tobian F, Gaeddert M, Lainati F, Klemm S, et al. Evaluation of accuracy, exclusivity, limit-of-detection and ease-of-use of LumiraDx^TM - Antigen-detecting point-of-care device for SARS-CoV-2. MedRxiv Prepr Serv Heal Sci 2021:1–37. https://doi.org/10.1101/2021.03.02.21252430.
Osmanodja B, Budde K, Zickler D, Naik MG, Hofmann J, Gertler M, et al. Accuracy of a novel sars-cov-2 antigen-detecting rapid diagnostic test from standardized self-collected anterior nasal swabs. J Clin Med 2021;10:4–11. https://doi.org/10.3390/jcm10102099.
Chiu RYT, Kojima N, Mosley GL, Cheng KK, Pereira DY, Brobeck M, et al. Evaluation of the INDICAID COVID-19 Rapid Antigen Test in Symptomatic Populations and Asymptomatic Community Testing. Microbiol Spectr 2021;9. https://doi.org/10.1128/spectrum.00342-21.
García-Fiñana M, Hughes DM, Cheyne CP, Burnside G, Stockbridge M, Fowler TA, et al. Performance of the Innova SARS-CoV-2 antigen rapid lateral flow test in the Liverpool asymptomatic testing pilot: Population based cohort study. BMJ 2021;374:1–8. https://doi.org/10.1136/bmj.n1637.
Shah MM, Salvatore PP, Ford L, Kamitani E, Whaley MJ, Mitchell K, et al. Performance of Repeat BinaxNOW Severe Acute Respiratory Syndrome Coronavirus 2 Antigen Testing in a Community Setting, Wisconsin, November 2020-December 2020. Clin Infect Dis 2021;73:S54–7. https://doi.org/10.1093/cid/ciab309.
Frediani JK, Levy JM, Rao A, Bassit L, Figueroa J, Vos MB, et al. Multidisciplinary assessment of the Abbott BinaxNOW SARS-CoV-2 point-of-care antigen test in the context of emerging viral variants and self-administration. Sci Rep 2021;11:1–9. https://doi.org/10.1038/s41598-021-94055-1.
World Bank, high income countries n.d. https://data.worldbank.org/country/XD (accessed December 15, 2022).
Dinnes J, Deeks J, Berhane S, Taylor M, Adriano A, Davenport C, et al. Rapid, point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection (Review). Cochrane Database Syst Rev 2021. https://doi.org/10.1002/14651858.CD013705.pub2.
Karlafti E, Tsavdaris D, Kotzakioulafi E, Kaiafa G, Savopoulos C, Netta S, et al. The Diagnostic Accuracy of SARS-CoV-2 Nasal Rapid Antigen Self-Test: A Systematic Review and Meta-Analysis. Life 2023;13:281. https://doi.org/10.3390/life13020281.
Venekamp RP, Schuit E, Hooft L, Veldhuijzen IK, van den Bijllaardt W, Pas SD, et al. Diagnostic accuracy of SARS-CoV-2 rapid antigen self-tests in asymptomatic individuals in the omicron period: a cross-sectional study. Clin Microbiol Infect 2023;29:391.e1-391.e7. https://doi.org/10.1016/j.cmi.2022.11.004.
Kalil MNA, Yusof W, Ahmed N, Fauzi MH, Bakar MAA, Sjahid AS, et al. Performance validation of COVID-19 self-conduct buccal and nasal swabs RTK-antigen diagnostic kit. Diagnostics 2021;11. https://doi.org/10.3390/diagnostics11122245.
Brümmer L, Erdmann C, Tolle H, McGrath S, Olaru ID, Katzenschlager S, et al. The clinical utility and epidemiological impact of self-testing for SARS-CoV-2 using antigen detecting diagnostics: a systematic review and meta-analysis. MedRxiv Prepr Serv Heal Sci 2022:1–28. https://doi.org/10.1101/2022.07.03.22277183.
World Health Organization. Use of SARS-CoV-2 antigen-detection rapid diagnostic tests for COVID-19 self-testing - Interim Guidance 2022:1–16.
Blanchard AC, Desforges M, Labbé A-C, Nguyen CT, Petit Y, Besner D, et al. Evaluation of Real-life Use of Point-of-care Rapid Antigen Testing for SARS-CoV-2 in Schools (EPOCRATES): a cohort study. CMAJ Open 2022;10:E1027–33. https://doi.org/10.9778/cmajo.20210327.

No competing interests reported.
