Many areas of ophthalmology require improved visibility: numerous types of imaging tests are used to diagnose disease, and most surgical treatments are performed with the aid of a microscope. Vitreoretinal surgery, which uses an additional magnifying lens on top of the standard operating microscope, is a representative ophthalmic surgery in which visibility is critical.
Ensuring successful vitreoretinal surgery requires skilled surgeons with a clear and comprehensive view of the ocular structures. In particular, procedures such as macular membrane peeling demand detailed manipulation. As mentioned above, the 3D heads-up visualization system enables surgery to be performed on images converted by adjusting the image parameters.6 We believe that this can be facilitated by obtaining image parameters optimized for the color of each patient’s retina and vitreous body, and by developing a deep learning algorithm to predict these optimal parameters. In this study, we attempted to predict and apply parameters for optimal imaging during 3D heads-up vitreoretinal surgery using both deep learning algorithms and digital image enhancement methods.
This study aims to derive optimal parameter values applicable to digital image enhancement in 3D heads-up visualization systems. To achieve this goal, we employed a two-stage deep learning algorithm designed to predict the parameter values that optimize surgical images. Briefly, training pairs were produced by repeatedly deriving an optimal surgical image from each original surgical image through manual parameter adjustment in the software, and these pairs were learned by the deep learning algorithm. In the two-stage algorithm, a synthetic target image was first created via the Pix2Pix approach; the original and generated images were then learned via the ResNet architecture to predict the parameter values more efficiently. Consequently, satisfactory PSNR and SSIM values were measured (34.59 ± 5.34 and 0.88 ± 0.08, respectively), which were similar to or slightly better than those yielded by deep learning algorithms developed by other authors (Table 1).3, 22–24 The SSIM is a measurement indicator for the human visual system that considers factors such as luminance, contrast, and structure rather than simple objective differences between images.18, 19 In other words, an SSIM value of approximately 1 implies that humans, particularly vitreoretinal surgeons, perceive no difference between the manually adjusted and optimized surgical images. Therefore, the performance of the proposed algorithm can be regarded as excellent.
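The two quality metrics quoted above can be sketched in a few lines. The following is a minimal illustration, not the code used in the study (which presumably relies on a standard imaging library): PSNR is derived from the mean squared error, and the SSIM shown here is a simplified single-window variant that combines the luminance, contrast, and structure terms over whole flattened grayscale images rather than local windows.

```python
import math

def psnr(ref, test, max_val=255.0):
    """Peak signal-to-noise ratio in dB; higher means closer to the reference."""
    mse = sum((a - b) ** 2 for a, b in zip(ref, test)) / len(ref)
    if mse == 0:
        return float("inf")
    return 10.0 * math.log10(max_val ** 2 / mse)

def ssim_global(ref, test, max_val=255.0):
    """Single-window SSIM combining luminance, contrast, and structure terms."""
    n = len(ref)
    mu_x = sum(ref) / n
    mu_y = sum(test) / n
    var_x = sum((a - mu_x) ** 2 for a in ref) / n
    var_y = sum((b - mu_y) ** 2 for b in test) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(ref, test)) / n
    c1 = (0.01 * max_val) ** 2  # stabilizing constants from the SSIM paper
    c2 = (0.03 * max_val) ** 2
    return ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / (
        (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    )

# Two nearly identical flattened 8-bit grayscale "images" (toy data).
a = [52, 60, 75, 90, 120, 180, 200, 220]
b = [50, 62, 74, 92, 118, 182, 198, 223]
print(psnr(a, b))         # high PSNR: images are very close
print(ssim_global(a, b))  # SSIM near 1: perceptually similar
```

In practice, SSIM is computed over local sliding windows and averaged; the global version above only illustrates the luminance/contrast/structure decomposition the text refers to.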
The sharpness, brightness, and contrast values of the RGB channels increased significantly in the optimized surgical images, which is attributable to an increase in the objective clarity of the images. We speculate that this occurred because the vitreoretinal surgeons adjusted the parameter values upward to observe the vitreoretinal surface clearly when creating the manually adjusted images, and the deep learning algorithm accurately predicted parameter values that produce similar images.
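For illustration, a hedged sketch of the kind of per-channel adjustment these parameters imply (the visualization software’s actual transfer functions are not specified here, so the formula below is an assumption): contrast is scaled about mid-gray, brightness is shifted additively, and the result is clamped to the 8-bit range.

```python
def adjust_channel(values, contrast=1.0, brightness=0):
    """Apply out = clamp((in - 128) * contrast + 128 + brightness) per pixel."""
    out = []
    for v in values:
        x = (v - 128) * contrast + 128 + brightness
        out.append(max(0, min(255, round(x))))  # clamp to [0, 255]
    return out

# Raising contrast and brightness stretches dark/bright values apart.
print(adjust_channel([40, 128, 200], contrast=1.3, brightness=10))  # → [24, 138, 232]
```

Applying such a transform independently to the R, G, and B channels, with channel-specific parameters, is one plausible reading of the per-channel adjustments described above.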
In addition, we surveyed seven vitreoretinal surgeons to assess the utility of adjusting the image parameters. Most reported that peeling the macular membrane without staining would be difficult in the original images but could be attempted in the optimized images. Most also reported that the optimized images offered better visibility and expressed a preference for operating with the optimized images.
In visual perception, contrast is the difference between two or more regions of a field.25 If the ERM is prominent in the macula, several folds are created on the retinal surface, and clearer folds aid surgeons in peeling the membrane. Therefore, the color contrast of the folds can significantly affect the visualization of the ERM. Notably, this contrast is generally determined by the relationship between the foreground and background luminances.21 In previous studies, visibility during vitreoretinal surgery was compared based on the CCR, where a high CCR corresponded to high visibility.10, 21 In our study, the optimized fundus images showed significantly improved CCRs; in other words, adjusting the image parameters improved the visibility of the ERM folds.
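As an illustration of a luminance-based contrast ratio between a fold and its background (the study’s exact CCR formula may differ; the sketch below uses the sRGB relative-luminance weighting and a WCAG-style ratio, which are assumptions rather than the authors’ definition):

```python
def relative_luminance(r, g, b):
    """Rec. 709 luma weights applied to linearized sRGB channels (0-255)."""
    def lin(c):
        c = c / 255.0
        return c / 12.92 if c <= 0.04045 else ((c + 0.055) / 1.055) ** 2.4
    return 0.2126 * lin(r) + 0.7152 * lin(g) + 0.0722 * lin(b)

def contrast_ratio(fore, back):
    """WCAG-style ratio: (lighter + 0.05) / (darker + 0.05), range 1-21."""
    lf = relative_luminance(*fore)
    lb = relative_luminance(*back)
    hi, lo = max(lf, lb), min(lf, lb)
    return (hi + 0.05) / (lo + 0.05)

# A brighter fold against a darker retinal background yields a higher ratio,
# i.e. a more visible fold edge (hypothetical RGB values).
print(contrast_ratio((200, 160, 150), (120, 70, 60)))
```

Under this formulation, increasing the luminance separation between the fold and the surrounding retina, as the optimized parameters do, directly raises the ratio.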
The ultimate goal of enhancing surgical images is to enable surgeons to operate effectively and safely. To evaluate this, we asked two vitreoretinal surgeons to measure and compare the ERM size in each fundus image. If the visibility of the ERM improves, the mean measured ERM area is expected to increase, and the difference in measurements between investigators is expected to decrease.26 In our study, the mean ERM area in the optimized fundus images was significantly larger than that in the original fundus images, indicating that the ERM can be identified more easily after optimization. Additionally, the mean difference between the two investigators’ measurements was closer to zero in the optimized fundus images (-2563.48 pixels) than in the original fundus images (-5878.76 pixels), which can be interpreted as a lower risk of over- or underestimating the ERM in the optimized images. In summary, with the optimized fundus images, a larger extent of the ERM was verified and the measurement difference between investigators decreased; therefore, adjusting the parameters for optimization can render surgery more efficient and safer.
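The inter-observer agreement statistic quoted above is the mean signed difference between the two investigators’ paired area measurements; a value near zero indicates little systematic over- or underestimation between observers. A sketch with hypothetical numbers (not the study data):

```python
def mean_signed_difference(obs1, obs2):
    """Average of (obs1 - obs2) over paired measurements, in pixels."""
    return sum(a - b for a, b in zip(obs1, obs2)) / len(obs1)

# Hypothetical paired ERM area measurements (pixels) for five fundus images.
investigator_a = [41000, 52500, 38800, 60200, 47300]
investigator_b = [43500, 54100, 40000, 61900, 48900]
print(mean_signed_difference(investigator_a, investigator_b))  # → -1720.0
```

A fuller agreement analysis would also report the spread of these differences (e.g. Bland-Altman limits of agreement), but the mean alone is what the comparison in the text relies on.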
Finally, after the image parameters were adjusted, the number of letters the same investigators could read on the Pelli–Robson contrast sensitivity chart increased. This can be interpreted as the same person achieving higher contrast sensitivity through image parameter adjustment.
Despite these advantages, this study has several limitations. First, the deep learning algorithm cannot yet be readily applied in actual surgical scenarios. The intended application of our study is to convert images in real time during surgery so that the operation proceeds under the most optimal view; however, licensing issues such as technology transfer, together with technical limitations, currently prevent the optimal parameters from being applied automatically in real time. Nevertheless, given that our deep learning algorithm can predict the optimal parameters, which an assistant can then enter and apply manually, we believe this study sufficiently demonstrates the potential for future utilization. Second, the CCR and ERM size measurements were conducted in vitro using high-resolution fundus images with prominent ERMs rather than actual surgical images. In the actual surgical images, the ERM was not clearly visible before staining with dye or triamcinolone acetonide, and the quality of the captured images was low; hence, actual surgical images were unsuitable because the values required for analysis could not be obtained from them. Further studies of these variables based on actual surgical images are necessary to evaluate the quantitative improvement more accurately. Finally, because this study has a retrospective design, these limitations should be addressed in prospective studies.
Despite these limitations, to our knowledge, no previous study has combined deep learning algorithms and digital image enhancement to improve the visibility and suitability of ophthalmic surgical images. Although the algorithm does not optimize the video at every moment during surgery, it provides a good view of the vitreoretinal surface during macular membrane peeling, the most important step of the entire procedure. Overall, the proposed deep learning algorithm demonstrated excellent performance, and the visibility of the predicted optimal surgical images improved objectively, allowing vitreoretinal surgeons to use it when performing vitrectomies. In conclusion, applying digital image enhancement using deep learning algorithms to actual surgeries appears promising in the near future.