We screened a total of 1893 unique records (after 661 duplicates were removed) identified from the five electronic databases. No additional records were identified through other sources. Details of the process of screening and selecting studies for inclusion in the review are presented in Fig. 1. Overall, there were a total of 7 included trials (corresponding to 13 publications), 34 excluded articles, and 5 registered protocols awaiting classification. Details on trials excluded at full-text and on studies awaiting classification are reported respectively in Appendix 2 and Appendix 4.
Of the seven included studies, three studies tested fluoxetine(21, 22, 43), one citalopram(44), two sertraline(23, 45) and one fluvoxamine(46), and the comparator was placebo in each case. Five studies used a parallel design(22, 23, 44, 45, 47), and two a crossover design(43, 46). Both studies with crossover designs observed a washout period compatible with the recommended length for the specific SSRI. However, because of a possible period effect in the Hollander(43) study, we pulled data in the meta-analysis only from the first period of treatment. No data was extracted from the Sugie(48) study because results were only presented for genotypes rather than as treatment-placebo comparison. Authors from three(21, 32, 45) studies provided us with additional unpublished data. The Greiss Hess(45) study included participants with Fragile X, however, we considered only the subgroup of patients with ASD. Unpublished data were collected on global functioning (dichotomus(21, 32, 45) and continuous results(45)), disruptive behaviours(32) and adverse events(45).
We did not perform the planned subgroup and sensitivity analyses because of the limited data available for meta-analysis. To assess publication bias we searched databases for protocols of unpublished studies(49) but did not construct a funnel plot due to the small number of studies included in the analyses.
Characteristics of included studies are presented in Appendix 3.
Risk of bias of included studies: The allocation sequence was randomly generated and it was concealed in four studies with no apparent baseline imbalances(22, 44, 45, 48). In one study there was no information about the randomization process but no baseline imbalances were reported(50). In another study, although random sequence generation was adequate and allocation was concealed, some baseline imbalances were likely due to chance(51). Finally, in one study no details on allocation sequence concealment were reported and the two groups appear unbalanced for various baseline characteristics(23). Risk of bias due to deviation from the intended interventions was low in five of the seven included studies(22, 45, 47, 48, 50), while in one study there were some concerns about the possibility of patients and study personnel being aware of the intervention(44) and in another study(23) there were some concerns because Intention-to-Treat (ITT) analysis was conducted for the primary outcomes but not specified for the outcomes of our interest. Risk of bias due to missing outcome data was detected in three studies: two studies had differential and high dropout rates that led to some concerns(22) and high risk(47); for one study(50) there were some concerns because of a 11% of drop-outs that were not included in the ITT analysis and four studies were at low risk(23, 44, 45, 48). Risk of bias in measurement of the outcome was judged low in all included studies. Finally, risk of bias for selective reporting was low in five studies(22, 23, 44, 45, 47) for which trial registrations were available as well as details on prespecified outcomes. Whereas, for one study(50) the protocol didn’t provide any detail on outcomes, and for another study the protocol was not available and the analysis reported effectiveness only for genetic subgroups(48). Details on risk of bias assessment of the included studies are reported in Appendix 3 and associated plots in Fig. 2 and Fig. 3. When risk of bias has been found to be different for different outcomes in the same study, the judgment has been reported in the specific domain along with the outcome (underscored in Appendix 3).
1-Restricted Repetitive Behaviors
Two studies(21, 44) reported restricted repetitive behaviours measured with the Repetitive Behavior Scale-Revised (RBS-R). However, while King 2009(44) reported the mean differences for the 5 subscales, Reddihough 2019(21) reported only the total score. Pooling the total scores was possible by imputing the SD from Reddihough 2019(21).
Pooling data from two RCTs(44, 51) including 230 participants, treated for 12–16 weeks, showed a MD -3.24 in restricted repetitive behaviours on the RBS-R score with the use of SSRIs (95%CI -14.91, 8.43; I2 = 74%; MID − 12.9 points; Analysis 1). Owing to concerns of serious risk of bias, inconsistency and imprecision, the evidence is very uncertain about the effect SSRIs in children with ASD on Restricted Repetitive Behaviors (Summary of Findings, Table 1).
2-Obsessive-Compulsive Symptoms
Four studies(21, 22, 44, 50) reported measures of obsessive-compulsive behaviours. However, Hollander 2005(50) used a version of the CYBOCS modified by the investigators recording only compulsive symptoms, and therefore was not included in the meta-analysis.
Pooling data from the 3 RCTs(44, 51, 52) including 413 participants, treated for 12–16 weeks, showed a MD -0.10 in obsessive-compulsive symptoms on the CYBOCS-PDD score with the use of SSRIs (95%CI -1.57, 1.37; I2 = 74%; MID − 2 points; Analysis 2). Due to serious concerns of risk of bias and inconsistency, there is low certainty evidence that suggests that use of SSRIs in children with ASD results in little to no difference in obsessive-compulsive symptoms (Summary of Findings, Table 1).
3-Anxiety Symptoms
Two studies(21, 23) reported anxiety symptoms using two different scales. Potter 2019(23) reported results from the Preschool Anxiety Scale-Revised (PAS-R) in 47 young children (2–6 years) treated for 26 weeks with a low dose of sertraline (MD 3.34; 95%CI -3.78, 10.46; MID − 13.6 point). Reddihough 2019(21) instead used the Spence Children Anxiety Scale (SCAS) and reported the results on 106 older children (7.5–18 years) treated for 16 weeks with a moderate dose of fluoxetine (MD -4.12; 95%CI -10.84, 2.60; MID − 11.4 point). Considering the serious concern of risk of bias, inconsistency and imprecision, the evidence is very uncertain about the effect of SSRIs in children with ASD on anxiety symptoms (Summary of Findings, Table 1).
4-Depressive Symptoms
None of the included studies reported any measure of depressive symptoms in children and adolescents with ASD.
5-Disruptive Behaviors
Three studies(21, 23, 44) reported maladaptive behaviours using the Aberrant Behavior Checklist - Community Version (ABC-CV) checklist and we reported the irritability subscale that mainly measures disruptive behaviours.
Pooling data from three RCTs(21, 32, 44) (published and unpublished) including 276 participants treated for 12–26 weeks, showed a MD 0.20 in disruptive behaviours measured with the Irritability Subscale of ABC-CV with the use of SSRIs (95%CI -2.00, 2.40; I2 = 0%; MID − 4.5 points; Analysis 3). Because of serious concerns of risk of bias and imprecision, there is low certainty evidence that suggests that use of SSRIs in children with ASD results in little to no difference in disruptive behaviours (Summary of Findings, Table 1).
6-Global Functioning
All of the included studies used some version of the Clinical Global Impression – Improvement scale (CGI-I) to measure change in global functioning. However, Sugie 2005(46), in a cross-over trial including 18 patients, presented the CGI scale results for different genotypes only, therefore it was not possible to determine the post-treatment score for the two groups.
Collecting additional unpublished data allowed us to pool both dichotomous results (responders defined as “much improved” or “very much improved”), as well as continuous results.
Pooling data from five RCTs(23, 31, 44, 51, 52) including 491 participants, treated for 12–26 weeks, showed a RR of 0.99 (95%CI 0.73, 1.36; I2 = 32%) in the number of responders at the CGI-I score with the use of SSRIs, corresponding to an absolute effect of 0 more participants improving per 100 using SSRIs (from 9 less to 12 more, Analysis 4). Because of the serious concern of risk of bias, there is moderate certainty evidence that suggests that SSRIs in children with ASD probably result in little to no difference in the number of patients showing an improvement in global functioning (Summary of Findings, Table 1).
An additional analysis pooling continuous data on the CGI-I scale from four RCTs(32, 43, 45, 51) including 226 participants showed comparable results (Analysis 5).
7-Quality of Life
Only one(52) RCT reported quality of life measured with the Caregiver Strain Questionnaire (CSQ) for a group of 121 participants treated for 14 weeks (MD 0.00; 95%CI -0.27, 0.27; MID − 0.5 point). Because of serious concerns of indirectness and imprecision, there is low certainty evidence that suggests that SSRIs may not increase nor reduce parent's quality of life (Summary of Findings, Table 1).
8-Adverse Events
Withdrawal due to adverse events
Three studies(33, 44, 52) reported the number of withdrawals due to adverse events and one study(23) provided us with unpublished data (Analysis 6a). One additional study reported no withdrawal due to adverse events in both arms (Greiss Hess 2016(31) unpublished data for the subgroup of 32 participants with ASD). Similarly, there were no discontinuation due to adverse events reported in the publication of the two cross-over studies: Hollander 2005(50) (44 participants) and Sugie 2005(46) (19 participants). Those results, excluded from the meta-analysis of RRs, were included in an additional analysis of Risk Difference (Analysis 6b) obtaining comparable results.
Pooling data from 4 RCTs(21, 32, 44, 52) including 511 participants, treated for 12–26 weeks, showed a RR of 1.14 (95%CI 0.64 to 2.04, I2 = 0%) in the number of withdrawals due to AEs with the use of SSRIs, corresponding to an absolute effect of 1 more event per 100 people using SSRIs (from 3 less to 8 more). Because of serious concern of risk of bias, there is moderate certainty evidence that suggests that SSRIs probably result in a slight increase of discontinuation due to adverse events (Summary of Findings, Table 1).
Any adverse events
Adverse events were inconsistently described among studies. The most commonly reported adverse events were gastrointestinal disorders, activation/hyperactivity, sleep disorders, and agitation/irritability, mainly of mild or moderate level of severity. Among the few serious adverse events reported there were four cases of suicidal ideation, three with placebo(33, 50) and one with SSRI(52), and one case of seizure with SSRI(53) requiring emergency hospitalization.
Pooling data from five RCTs(21, 32, 44, 45, 52) (both published and unpublished data) including 543 participants, treated for 12–26 weeks, showed a RR of 1.05 (95%CI 0.95, 1.15; I2 = 0%; Analysis 7) in the number of participants experiencing at least one adverse event with the use of SSRIs, corresponding to an absolute effect of 3 more participants experiencing adverse events per 100 people using SSRIs (from 4 less to 11 more). Because of serious concern of risk of bias, there is moderate certainty evidence that suggests that SSRIs in children with ASD probably result in a slight increase of adverse events (Summary of Findings, Table 1).