Selection and characteristics of studies
As a result of the literature search, a total of 298 studies were identified, including 97 from PubMed, 91 from Embase, 62 from CNKI, and 48 from Wanfang debase. Figure 1 showed the study selection process. 111 duplicate publications were excluded. According to the inclusion and exclusion criteria, we excluded 145 studies by evaluating the titles and abstracts. The remaining 42 studies were further scrutinized by reading the full text. Finally, only 19 studies were included in this meta-analysis, of which 9 reported the predictive value on disease severity [26-34], 6 reported the predictive value on mortality [35-40], and 4 reported the predictive value on both disease severity and mortality [41-44].
The characteristics of the included studies and the predictive value of NLR on disease severity or mortality in each study are presented in Table 1. Most studies were conducted in China. Twelve studies were published in English, six in Chinese and one in Spanish. Except one prospective study [27] , all others were retrospective studies. The number of participants across studies ranged from 45 to 1004. Notably, the SEN, SPE, AUC and cut-off value of NLR predicting mortality and disease severity ranged greatly among the included studies. Except two studies [41, 42], all other studies defined severe patients as meeting at least one of the following criterions: Shortness of breath, respiratory rate (RR)≥30 times/min, or oxygen saturation (resting state) ≤ 93%, or PaO2/FiO2 ≤300mmHg.
Study quality and publication bias
The methodological quality of the included studies is presented in Additional File 2. One study only included patients classified as moderate [36], one included only severe patients [35], and another included only elderly patients [40]. Therefore, these three studies were considered to have a high risk of patient selection bias. One study included 32 moderate cases, and another 31 severe cases were included as a control group [32]. One study included 48 moderate cases, and another 37 severe cases were included as a control group [34]. One study included 50 moderate cases, and another 43 severe cases were included as a control group [43]. One study included 42 dead patients, and another 42 discharged patients were included as a control group [37]. These four studies were also assessed to show high risk of patient selection bias, because they did not avoid a case-control design. One study did not provide sufficient information about patients enrolled and leaded to a high risk of patient selection in our opinion [33].Most studies were considered to have unclear risk of bias regarding index tests, because they did not report the blindness between index and reference tests. Deek funnel plot is shown in Additional File 3, publication bias may exist among studies reporting the predictive value of NLR on disease severity (P=0.04).
Predictive value of NLR on disease severity
Thirteen studies involving 1579 patients reported the predictive value of NLR on disease severity. The pooled SEN and SPE were 0.78 (95% CI 0.70-0.84, I²=71.83) and 0.78 (95% CI 0.73-0.83, I²=77.80), respectively (Figure 2a). The positive likelihood ratio was 3.6 (95% CI 2.9-4.4), and the negative likelihood ratio was 0.28(95% CI 0.21-0.38). The DOR was 13(95% CI 9-18). The SROC curve is shown in Figure 3a. The AUC of NLR for predicting disease severity was 0.85 (95% CI 0.81-0.88), indicating high diagnostic value. We can learn from Fagan nomogram (Figure 4a) that if the pre-test probability was set to 50%, the post-test probability of NLR for the detection of severe cases was 78% when the NLR was above the cut-off value. On the contrary, when the NLR was below the cut-off value, the post-test probability was 26%.
Predictive value of NLR on mortality
Ten studies involving 2967 patients reported the predictive value of NLR on mortality. The pooled SEN and SPE were 0.83 (95% CI 0.75-0.89, I²=66.13) and 0.83 (95% CI 0.74-0.89, I²=90.34), respectively (Figure 2b). The positive likelihood ratio was 4.8 (95% CI 3.3-7.0), and the negative likelihood ratio was 0.21(95% CI 0.15-0.30). The DOR was 23(95% CI 15-36). The SROC with pooled diagnostic accuracy was 0.90 (95% CI 0.87-0.92), presented in Figure 3b. The Fagan nomogram showed that the post-test probability of NLR for the detection of mortality was 83% when the NLR was above the cut-off value and the post-test probability was 17% when the NLR was below the cut-off value. (Figure 4b).
Subgroup analyses and sensitivity analyses
We conducted the subgroup analyses based on the cut-off value. In terms of predicting disease severity, the cut-off value in six studies was higher than 4.5, and were termed the “high cut-off value” subgroup. Seven others used a lower cut-off value, which were included in the “low cut-off value” subgroup. The AUC were 0.86 (95% CI 0.83-0.89) and 0.82 (95% CI 0.78-0.85), respectively. Similarly, ten studies reporting the predictive value of NLR on mortality were divided into “high cut-off value” (cut-off≥6.5) and “low cut-off value”(<6.5) subgroups, and the AUC were 0.92 (95% CI 0.89-0.94) and 0.84 (95% CI 0.80-0.87), respectively. In the sensitivity analyses, we only included studies published in English. The pooled AUC for predicting disease severity and mortality were 0.83 (95% CI 0.80-0.86) and 0.90 (95% CI 0.87-0.92), respectively. Detailed results about subgroup analyses and sensitivity analyses are presented in Table 2.