In the present study, the prevalence of certainty of the evidence assessed through the GRADE approach was 23.6%. These findings highlight the information present in the literature that the GRADE approach has not been widespread among SRs in dentistry. Previous methodological studies showed that the approach utilization rate was less than 50% [8, 10, 26]. For example, in a study performed by Page et al. [8], from 300 SRs published in the biomedical field in one month, only 32 (11%) assessed the certainty of the evidence using the GRADE approach. In another study with SRs in dentistry, it was observed that only 19% of the SRs published over one year (SRs indexed in PubMed in 2017) used the approach to assess the certainty of the evidence [10]. However, these studies [8, 10] evaluated other types of SRs besides intervention and, for this reason, the use of the GRADE approach was lower than found in our study (23.6%) that evaluated only SRs of interventions, considering that the GRADE approach was initially developed for use in SRs of interventions [27].
Furthermore, comparing the present study's findings, another previous methodological survey performed only with Cochrane SRs of medical and health-related interventions found a prevalence of 43.6% on using the GRADE approach to appraise the certainty of the evidence [26]. However, despite the higher prevalence found in the study of Fleming et al. [26] for the use of the GRADE approach in health sciences SRs, it is worth mentioning that it is in common agreement in the literature that Cochrane SRs have more rigorous methods when comparing to non-Cochrane SRs, explaining the greatest adherence of the SRs included in these results to the GRADE approach for the assessment of the certainty of the evidence [28–31].
Considering the information about the main application of the GRADE approach, it is important to highlight that it was not created to be adapted to other types of studies. The GRADE's developers emphasize that the approach comprises validated modifications only for diagnostic questions and accepts the assessment of observational SRs only when the SRs also include intervention studies (and starting with significant initial low certainty of the evidence) [27]. Since there are not many approaches to assess the quality of the evidence in other types of SRs, SRs authors are wrongly induced to adapt and/or modify the GRADE approach corroborating with this methodological gap.
Also, it is essential to consider that using the GRADE approach is not easy or intuitive, and SR’s authors may face difficulties and doubts. Indeed, in the present study, six SR erroneously used it individually for each included study and were considered inappropriate by the authors. The GRADE guidelines state that it should be used to assess the quality of the body of evidence to support specific outcomes gathered in SRs of intervention research and not to assess individual studies [20, 32, 33]. Besides, with the incorporation of the certainty of the evidence in the 2020 update of the PRISMA statement [34], an increase in the number of SRs assessing the certainty evidence evaluation is expected. However, for this increase to occur in association with the correct application of the approach, SR authors in dentistry must be trained to use the GRADE approach and its methodology, limitations, and judgments. Therefore, it is expected that the new PRISMA 2020 and future GRADE guidelines may contribute to making the reporting process of the assessment of the certainty of evidence clearer and more objective [15, 34].
It is essential to highlight that after assessing the five domains of the GRADE approach, authors should clearly state the reasons for their judgments. For example, in the SRs included in the present study, assessing the risk of bias domain of the GRADE approach was evaluated through general observation of the individual risk of bias judgements of each included study. If these were mainly studies with a high risk of bias, in agreement with the recommendations of the GRADE guidelines for this domain, the domain could be downgraded by one or two levels [35]. As well, inconsistency, as indicated by the guidelines for the SRs, must consider the heterogeneity of the results across the included studies of the SR, which can be performed through heterogeneity tests such as I2, Cochran Q test, Tau2 test, and chi-square test; the significance for the heterogeneity tests; similar effect estimates among studies and overlap of 95%CI [36].
In addition, the imprecision domain, described in some SRs included in this study, was evaluated by the sample size, the number of events, confidence interval and inaccurate. Since the guidelines also have recommendations regarding the confidence interval, after the number of events, as a criterion for judging precision, the percentage found about this criterion in the present study was considered low among the literature [37]. The indirectness domain also agreed with the guidelines, since the recommendation is to assess this domain by judging when there are differences between the key points of the research question related to the studies where the evidence comes from, making it difficult to link the evidence to the research question. All SRs included in the present study evaluated if the evidence where the studies come from are related or applicable to the research question (PICO) [38]. Finally, the SRs that assessed publication bias evaluated the Funnel plot and/or Egger's test, as recommended [39]. Additionally, few SRs also reported that it was impossible to estimate publication bias through the Funnel plot and/ or Egger’s test as the number of studies included in the SR was less than ten included studies.
It is important to discuss the prevalence found in our study for each level of the certainty of the evidence for the studies that presented just one clearly stated the main outcome assessed by the GRADE approach (n = 44/41.5%) (29.5% "very low"; 25% "low"; 38.6% "moderate"; and 6.8% "high"), that also agrees with other evidence found in the literature. For example, Pandis et al. [41] evaluated 91 Cochrane and non-Cochrane SRs in dentistry, published from 2008 to 2013. In the Cochrane SRs (n = 41), 27% of the studied outcomes were judged as " moderate" certainty of the evidence, 47% were judged as "low," and 27% as "very low" certainty of the evidence. On the other hand, in the group of the non-Cochrane SRs, 30% of the outcomes studied were judged as "very low" certainty of the evidence, 48% were judge as "low" certainty, 18% were judged as "moderate" certainty, and 4% were judged as "high" certainty of the evidence. Our results confirm Pandis et al. results that currently, over 50% of clinical decisions are still based on SR's findings with low and very low certainty of the evidence, highlighting the need to improve the studies in the dentistry area [40].
Furthermore, although the GRADE Working Group states that SRs should only assess the certainty of the evidence [20, 27], some SRs included in the present study (n = 21/19.8%) used the overall certainty of the evidence to recommend some results. However, according to the GRADE Working Group, recommendations regarding a health care decision should be restricted solely to guidelines and public policy guidance [12, 27]. Therefore, although the approach might differ among reviewers, it is essential to correctly and carefully follow GRADE approach instructions published by its developers since clinical practices and health care decisions might be influenced by the results of SRs assessed by the GRADE approach [20].
Finally, there are some similarities between our results and the study of Bassani et al. [10], which evaluated SRs in the dentistry area. In agreement, both studies found SRs published predominantly from Brazil, being 23.6% in the 2019 study [10] and 35.8% in our study. However, about the specialty’s areas of publication of the SRs included in both studies, there was a high prevalence of SRs of Oral and Maxillofacial Surgery, Oral and Maxillofacial Pathology/stomatology, and Implantology and Periodontology in the study of Bassani et al. [10], while the most studied specialties areas in our study were Orthodontics and Dentofacial Orthopedics, Periodontics, Oral and Maxillofacial Surgery and Prosthodontics, respectively.
Strengths and Limitations
The present study was conducted with a limited search of one database over one year of SRs of interventions in dentistry. Although MEDLINE/PubMed is one of the most relevant and large databases for health sciences [41], it is possible that some studies that would be eligible for inclusion were not identified for this reason. However, we believe that the SR sample included in this methodological survey study represents the SRs of dentistry interventions. Nevertheless, it is important to emphasize that the objective of the present study was not to analyze all the SR of dentistry published in one year but to have a glimpse of what is happening on the certainty of evidence analysis in SR of interventions in dentistry.
Lastly, it is essential to highlight that the goal of this study was not to judge the conclusions of the included studies directly but to map out how the evaluation of the certainty of the evidence in the included SRs of interventions in dentistry using the five domains of the GRADE approach. Our findings highlight how SR authors in the dental field can improve its application and reporting rigorousness. In addition, this study is the first to be carried out with a specific focus on dentistry.