From our data it appeared that oocyte retrieval does not benefit from the use of DLN with follicle flushing. Specifically, there were no differences in terms of proficiency, efficacy and safety, while the use of the DLN appeared to be associated with a significant increase in the duration of the retrieval procedure. More in detail, our study showed no statistically significant difference in the efficacy of OR with one needle or the other in terms of number of retrieved oocytes per procedure (10.2 ± 6.5 with the DLN vs 10.7 ± 7.0 with the SLN, p = 0.810). No statistically significant differences resulted also for the secondary study outcomes, i.e. number of retrieved oocytes/number of punctured follicles, number of retrieved oocytes/number of follicles at trigger, number of mature oocytes retrieved, as well as no statistically significant differences in pregnancy outcomes were detected, even though the study was not firstly intended or powered to study these results. From a subsequent analysis on the study population, based on the number of follicle at trigger-day, we can conclude that it was representative of the population of women generally presenting at ART centers to undergo controlled ovarian stimulation cycles 19 and it permits to exclude bias in the patient selection.
Comparison with previous literature
Despite the limited evidence supporting the routine use of DLN with follicular flushing 6,20, it continues to be common in many ART clinics and some questions remain regarding the potential benefits of flushing on the oocyte yield and treatment cycle outcomes 21. The topic of flushing was firstly reviewed in a meta-analysis in 2012 by Levy 22. No difference was observed in the oocyte yield (number of retrieved oocytes/numbers of punctured follicles) between the non-flushing and flushing groups (OR: 1.06, 95% CI: 0.95–1.18). There was also no difference in the number of oocytes retrieved in the flushing group in a trial evaluating poor responders (non-flushing group: 6.5 oocytes versus flushing group: 7.2 oocytes, p = 0.38) 11 mentioned in this meta-analysis. In all studies taken under consideration by Levy, the procedural time for OR was significantly longer in the follicle flushing group than in the non-flushing group. In trials performed using older ART techniques, follicle flushing was associated with a longer procedure time as compared with non-flushing and with an increased amount of anesthetic needed for the procedure in the follicle flushing group when compared with the non-flushing group (100 versus 50 mg of pethidine, p = 0.0001). Procedural times for OR shortened from the older to the newer studies; however, follicle flushing was associated with a longer procedure time for OR during all time periods in these studies. No differences in the fertilization of mature oocytes were reported between the flushing and non-flushing groups 23 and similar clinical pregnancy rates between the non-flushing and flushing. Martini et al 14 in January 2021 published the most recent systematic review and meta-analysis on this topic confirming that no significant differences in live birth rates and other OR outcomes could be observed, except for a prolonged procedure time in the double-lumen procedures. The total number of retrieved oocytes is one of the most important prognostic factors in IVF. Live birth rates following IVF increase with increasing number of retrieved oocytes, especially if they exceed the number of 15 15. Especially in patients with diminished ovarian reserve (DOR) or premature ovarian aging (POA), even a small increase in the number of oocytes has the potential to significantly improve pregnancy rates 16.
Speculations for clinical practice
Our center used only DLNs until March 2004 and after that date several types of needles were introduced in the routine clinical practice. No internal guidelines were established and physicians preferred a SLN when many follicles were present on the day of trigger. On the contrary, if patients had few follicles or low values in 17-beta estradiol or had no retrieved oocytes at a previous OR, DLNs were more likely chosen. No cut-off was set, but each surgeon was driven by his/her personal expertise. A retrospective analysis on OR conducted in 2013 at the same Fertility Center found no differences among SLNs and DLNs. A bias, however, of that analysis was a not randomized sample, with most of retrievals performed with double lumen in patients with fewer follicles. Our latest results therefore confirm latest literature data, giving further strong evidence, in view of the randomized controlled design of the present study, that it may be time to set the DLN apart. Indeed, it is worth noting that our study showed a significant increase of time taken for retrieval with the DLN in terms of the whole procedure and of the single retrieved oocyte. Although the evaluation of this outcome was not the primary objective of the study, this finding opens interesting perspectives for the future. Indeed, the procedure with the DLN required an average of 2 minutes and 24 seconds longer per OR procedure than with the SLN (11 minutes and 5 seconds vs 8 minutes and 41 seconds, respectively), with an overall difference of 240 minutes every 100 patients. Similarly, mean time for a single retrieved oocyte was 1.5 ± 1.3 vs 1.1 ± 0.9 minutes, p = 0.002. Therefore, on a large scale, using only SLNs would provide substantial timesaving with eventual favorable repercussions in terms of less adverse events and complications related to the exposure to a more prolonged procedure and consequently protracted anesthesia. The decision whether to give up the use of DLNs should consider the fact that they may favor the less experienced operator while increasing the procedure duration. Undoubtedly the time saving resulting from the present study translates into a financial saving for the hospital in terms of less staff involvement, less costs for the operating room and for anesthetic drugs.
Implications for research
Our study showed only 3 minor adverse events but, due its size and the limited number of patients in the two arms, it is not possible to extrapolate this result to a general safety evaluation of the procedure in the clinical setting since the protocol was designed with a different objective. Concerning post-retrieval complications, our group 24 already published in 2018 a retrospective study analyzing 23,833 OR procedures between 1996 and 2016 and the associated complications. This study showed a hospitalization rate of 0.29% within the first 24 hours after retrieval with a mean length of hospital stay of 2.77 ± 2.5 days and only 0.1% of patients requiring an urgent surgery procedure. Significant risk factors for the occurrence of complications were reported to be: a high number of follicles at ovulation trigger, a prolonged duration of the retrieval procedure, lack of experience of the surgeon (i.e. less than 300 procedures performed), patients with a low BMI, history of prior abdominal or pelvic surgery or pelvic inflammatory disease. Although our study was not primarily aimed at comparing the rate of complications following the two OR procedures, it can however be considered as an interesting preliminary observation and exploratory study for future relevant prospective studies.
Strengths and limitations
Considering the external validity of our study, it is interesting to note that the characteristics of the sample broken down by randomization group were statistically homogeneous across all the main variables considered and different only for smoking habit and the stimulation protocol used. A strength of the present study lies in the important pragmatic scope of its results that, if confirmed by future prospective studies specifically aimed at evaluating the reduction of complications and of financial costs, may influence the daily clinical practice of large ART centers. Indeed, the present study has a solid base on account of its total protocol adherence (no cross overs) and its randomized controlled design, blinded for the biologist who evaluated outcomes and for the patient (but not for the gynecologist). Moreover, the number of retrieved oocytes (primary outcome) per member of the Fertility Center staff who operated in the present study was not analyzed since they were all equally experienced having previously carried out at least 250 OR procedures. It was therefore not considered necessary to evaluate a possible impact of the “human factor” on the efficacy of the technique. On the other hand, a weakness of the study is that it was not double-blind randomized, a design that anyhow would have not been feasible since the gynecologist could not be kept blinded to the type of needle used being in fact the operator who had to perform the OR procedure. A second limitation of the study is the choice of the primary outcome, a surrogate one of apparently limited clinical interest. Indeed, from a clinical perspective, it would have probably been more interesting to assess the effect of the two needles on a strong outcome such as birth rate. However, we believe that ART comprises a series of procedures involving different steps and operators, each with a potentially key role in determining the success of the treatment. Our objective was therefore to evaluate a given portion of the procedure, namely OR, in which connection we believe that the number of retrieved oocytes is the most relevant outcome. There are in fact a number of studies in the literature showing that the number of retrieved oocytes correlates better than other steps of the procedure with the success rate of an ART cycle, especially in patients with an a priori expected lower probability of a successful treatment 25. Indeed, as concerns women considered as “low prognosed” we wanted to assess if they were adequately represented in the study population to confirm that our selection of patients was representative of the general population of infertile women undergoing ART procedures. Since the 32% percentage of low prognosis patients (previously defined as poor or sub-optimal responders) found in our study is within the range identified by Esteves et al. 26 in their systematic review and actually close to its upper limit of 35%, a selection bias can be confidently excluded. Considering the overall number of potentially eligible patients, a suspicion may arise that a significantly lower number of poor responders could have been enrolled, possibly because of a potential negative bias of the operators involved in the study towards the use of the single-lumen technique in patients with a lower number of follicles. It is possible that operators were worried to perform sub-optimal retrieval procedures in these patients and this could have induced them to “select” patients, maybe unintentionally, at the time of enrollment. Lastly, after being informed, it could be that the patients with less favorable prognosis decided not to give their assent to participate to the study, being worried that their already limited chances of success could be jeopardized. However, analyzing the population of our study, it appears that the prevalence of the three categories of responders are fully in line with literature data concerning poor responders and also with data published in 2019 by the same research group of our center 27 in a retrospective study evaluating the prognosis and gonadotropin response of low prognosis patients defined according to the POSEIDON criteria 28. Based on these data, we could have expected beforehand a prevalence of poor responders (1–3 follicles at trigger) of 20.8% and of sub-optimal responders (4–9 follicles at trigger) of 79.2% in the sub-group of patients with low prognosis, and indeed our study confirmed these data (9.4% and 80.6%, respectively in a population of 200 patients). It is also worth noting that there was no statistically significant difference in the distribution of low prognosis patients between the two study groups. On the other hand, their limited number did not allow to verify the assumption of a possibly greater efficacy of the DLN in these patients which would require a specifically designed randomized trial.