Resolution enhancement in confocal microscopy images of nano-beads and nuclei. We begin by evaluating the performance of the proposed TCAN model on 23 nm fluorescent beads. The nano-bead samples are imaged on a Leica TCS SP8 STED confocal microscope, and 1000 pairs of confocal-STED image patches of 256×256 pixels are used as training data. The network takes the confocal image in Fig. 1a, which is unseen during the training stage, as input and outputs the super-resolved image in Fig. 1b. The network result is compared with the image acquired by STED microscopy (Fig. 1c). Some of the nano-beads in our samples are too close to be discerned in the raw confocal image and even in the STED image, whereas our method reduces artifacts and blur and resolves these closely spaced nano-beads, as presented in Fig. 1d-f. This is consistent with the intensity profiles (Fig. 1m) along the white dashed lines in Fig. 1d-f.
We further assess the impact of the proposed TCAN by two image-based criteria: image resolution, measured by the full width at half maximum (FWHM) of the point spread function (PSF), and image quality, estimated by the signal-to-noise ratio (SNR). Twenty isolated nano-beads are selected randomly for the PSF measurement in the confocal, STED and network output images. The FWHM of the confocal PSF is 130 nm, and the PSF distribution of the network output is even slightly better than that of the STED system, with a mean FWHM of 50 nm versus 60 nm, respectively. Since our method establishes a data-driven image transformation, similar to that discussed in Ref. [9], the learned PSF does not require any prior information on the modeling of the image formation process or its parameters.
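The FWHM measurement above can be sketched in a few lines. For brevity we locate the half-maximum crossings of a 1-D intensity profile by linear interpolation, a simplification of the Gaussian fit used in the text; the profile below is synthetic, generated with a 55.2 nm standard deviation so that the true FWHM is about 130 nm, and is an illustration rather than measured data.

```python
import math

def fwhm(positions, intensities):
    """Full width at half maximum of a single-peaked 1-D profile."""
    half = max(intensities) / 2.0
    # left crossing: first sample at or above half-maximum
    i = next(k for k, v in enumerate(intensities) if v >= half)
    x_left = positions[i - 1] + (half - intensities[i - 1]) \
        * (positions[i] - positions[i - 1]) / (intensities[i] - intensities[i - 1])
    # right crossing: last sample at or above half-maximum
    j = max(k for k, v in enumerate(intensities) if v >= half)
    x_right = positions[j] + (intensities[j] - half) \
        * (positions[j + 1] - positions[j]) / (intensities[j] - intensities[j + 1])
    return x_right - x_left

# Synthetic confocal-like bead profile: Gaussian, 10 nm sampling,
# sigma = 55.2 nm -> FWHM = 2*sqrt(2*ln 2)*sigma ~ 130 nm.
xs = [10.0 * k for k in range(41)]          # 0 ... 400 nm
sigma, x0 = 55.2, 200.0
ys = [math.exp(-(x - x0) ** 2 / (2 * sigma ** 2)) for x in xs]
width = fwhm(xs, ys)
```

Averaging such widths over the 20 selected beads gives the mean FWHM values quoted above.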
Next, we verify the practicality of the proposed TCAN by applying it to fixed HeLa cell nuclei. Figure 1g-i displays the input confocal image, the network output and the STED image of the same field of view, respectively. Our method succeeds in transforming a low-resolution confocal image into a super-resolution image. As exemplified by the magnified views of the green boxes in Fig. 1j-l, TCAN resolves the densely labeled nuclear pore complexes (NPCs) [19] better than the STED image and reduces the background noise, striking a compromise between retaining useful information and denoising. The rationale behind this result is that the generator in our model benefits from both U-Net and DFCAN, simultaneously learning precise representations of the spatial structures and the high-frequency information.
To verify the improvement of our network on image quality, we compare the SNR of the network output with those of the network input (confocal image), the STED images and the deconvolved STED images. The SNR is calculated according to the following formula from Ref. [9]:
$$\text{SNR}=\left|\frac{s-\bar{b}}{\sigma_b}\right|, \tag{3}$$
where \(s\) is the mean peak value of the signal calculated from a Gaussian fit to the particles, \(\bar{b}\) is the mean value of the background (e.g., randomly selected regions that do not contain any objects), and \(\sigma_b\) is the standard deviation of the background. The results listed in Table 1 demonstrate that the proposed method suppresses noise and improves the image quality for different types of samples.
Table 1 Quantification of SNR improvement

| Types | Network input (Confocal) | Network output | STED |
| --- | --- | --- | --- |
| Nano-beads | 6.0 | 13.2 | 14.1 |
| Nucleus | 3.6 | 12.0 | 8.2 |
| Microtubules | 9.6 | 10.9 | 10.4 |
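Equation (3) can be evaluated directly once the Gaussian-fit peak values and a set of object-free background pixels have been extracted. The sketch below assumes those inputs are already available; all numbers are hypothetical, chosen only to exercise the formula.

```python
import statistics

def snr(peak_values, background_pixels):
    """SNR per Eq. (3): |s - mean(b)| / std(b)."""
    s = statistics.mean(peak_values)              # mean Gaussian-fit peak of the signal
    b_mean = statistics.mean(background_pixels)   # background mean
    b_std = statistics.pstdev(background_pixels)  # background standard deviation
    return abs((s - b_mean) / b_std)

peaks = [210.0, 195.0, 204.0]                # hypothetical fitted peak values
background = [10.0, 12.0, 9.0, 11.0, 8.0]    # hypothetical background pixels
value = snr(peaks, background)
```

Applying this to each image type yields per-sample SNR values like those summarized in Table 1.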
Resolution enhancement in confocal microscopy images of microtubules. In case confocal-STED training image pairs are not available, our network model trained with images captured by different imaging modalities is still able to infer super-resolution images. We employ 3000 pairs of wide-field and structured illumination microscopy (SIM) patches of \(256 \times 256\) pixels as training data and apply the framework to microtubules, a more complex structure. The results are compared against the STED images and their deconvolutions, performed with Huygens software. As expected, our TCAN model reveals noticeably improved resolution compared with the input confocal images (Fig. 2a). Notably, the resolution of the network output images (Fig. 2b) is indeed improved; in particular, regions of dense and complex microtubule structures are better resolved and appear sharper than in the STED images in Fig. 2c, as exhibited by the magnified views of the green boxes. Artifacts and noise are present between adjacent microtubules in the STED images. The deconvolved STED images in Fig. 2d, by contrast, show obvious broken structures, and the discontinuity is more severe for sparsely distributed microtubules. Here we also employ transfer learning, which uses a network trained on nano-beads as the initial model, to speed up the training for nuclei, microtubules and actin.
To quantitatively evaluate the overall performance of our method, we use three metrics, i.e., SNR, mean square error (MSE), and resolution, to measure the quality of the output super-resolved images. MSE computes the pixel-level data fidelity as the difference between the resulting image and the ground truth. Image resolution is measured by decorrelation analysis, which estimates the highest resolvable frequency from the local maxima of the decorrelation functions rather than relying on the theoretical resolution [20]. These results are illustrated in Fig. 2e-g: the generally larger SNR and smaller resolution value and MSE of the network output indicate that the conventional STED images and the deconvolved STED images are inferior to our inference images.
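Of the three metrics, MSE is the simplest to state precisely; a minimal sketch (toy 2×2 arrays standing in for the output and ground-truth images) is:

```python
# Pixel-level mean square error between a network output image and its
# ground-truth counterpart; images are nested lists here for brevity.
def mse(output, ground_truth):
    flat_o = [p for row in output for p in row]
    flat_g = [p for row in ground_truth for p in row]
    return sum((o - g) ** 2 for o, g in zip(flat_o, flat_g)) / len(flat_o)

out = [[1.0, 2.0], [3.0, 4.0]]
gt  = [[1.0, 2.5], [2.0, 4.0]]
err = mse(out, gt)   # (0 + 0.25 + 1 + 0) / 4 = 0.3125
```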
Figure 2e-g are plotted in Tukey box-and-whisker format. The box extends from the 25th to the 75th percentile, and the line in the middle of the box indicates the median. To define whiskers and outliers, the inter-quartile range (IQR) is first calculated as the difference between the 75th and 25th percentiles. The upper whisker extends to the largest data point no greater than the 75th percentile plus 1.5 times the IQR; the lower whisker extends to the smallest data point no smaller than the 25th percentile minus 1.5 times the IQR. Data points beyond the whiskers are identified as outliers and displayed as black diamonds.
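The whisker and outlier rule just described can be sketched as follows. We assume linear-interpolation percentiles (the exact percentile convention may differ from the plotting software used for Fig. 2e-g), and the data values are purely illustrative.

```python
def percentile(sorted_vals, p):
    """Linear-interpolation percentile of an already-sorted list."""
    k = (len(sorted_vals) - 1) * p / 100.0
    f = int(k)
    c = min(f + 1, len(sorted_vals) - 1)
    return sorted_vals[f] + (sorted_vals[c] - sorted_vals[f]) * (k - f)

def tukey(data):
    """Return (lower whisker, upper whisker, outliers) per Tukey's rule."""
    s = sorted(data)
    q1, q3 = percentile(s, 25), percentile(s, 75)
    iqr = q3 - q1
    lo_fence, hi_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    lower = min(v for v in s if v >= lo_fence)   # smallest point inside the fence
    upper = max(v for v in s if v <= hi_fence)   # largest point inside the fence
    outliers = [v for v in s if v < lo_fence or v > hi_fence]
    return lower, upper, outliers

result = tukey([3, 4, 5, 6, 7, 8, 30])   # 30 falls outside the upper fence
```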
For deep learning methods, the training data determines what the neural network learns. To achieve the best results, the imaging modality of the training data should in principle be precisely matched to that of the input images. However, we find that the image quality, rather than the imaging modality, of the training data is the critical factor affecting the inference performance. This can be observed in Figure S4 in the Supporting Information. Even though the input images and STED images are captured with the same imaging platform, the output of the network trained on deconvolved STED images is worse than that of the network trained on high-quality SIM images. This relates to the fact that the input and output of the framework share a high degree of mutual information, and the quality of the information in the training examples affects the pixel-to-pixel transformation and the resolution enhancement learned by the network. The task of translating one possible representation of a scene into another is broadly referred to as an image-to-image translation problem [21]. Such tasks share the common process of predicting pixels from pixels, and the network architecture used for our training, i.e., a conditional GAN [22], has been proven effective in learning such mappings. Here the input and output are renderings of the same underlying structures, and the training process can be viewed as exploiting this mutual information between the input and label images to constrain the network output. Accordingly, the network attends to the quality of the structures in the training examples more than to the imaging platform of the training data.
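For reference, the conditional-GAN training objective of the pix2pix framework [21], which our conditional-GAN setup follows in spirit, combines an adversarial term with a pixel-wise L1 term; this is the generic form, with weight \(\lambda\) a hyperparameter, not the exact loss of our model:
$$G^{*}=\arg\min_{G}\max_{D}\,\mathcal{L}_{\text{cGAN}}(G,D)+\lambda\,\mathcal{L}_{L1}(G).$$
The L1 term anchors the output to the label image pixel by pixel, while the adversarial term pushes the generator toward outputs that are indistinguishable from real high-resolution images.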
Additionally, if the pixel size is large, one microtubule spans fewer pixels; conversely, more pixels are required to show the same structure when the pixel size is small. Hence the pixel size is another important parameter affecting the feature representation learned by the network and its ability to distinguish adjacent microtubules as separate objects. For instance, direct application of a network trained on images with a pixel size of 50 nm produces acceptable biological structures only when the input images have pixel sizes of 35-70 nm. Therefore, if the pixel sizes of the input images and training images differ, we upsample/downsample the input images to match the training image pairs. After this resampling, the neural network successfully suppresses the artifacts and further improves the resolution of the confocal microscopy images. In Fig. 3, comparing the network output images in the third column with those in the fourth column shows that the effect of the pixel size can be compensated by upsampling/downsampling the input images to match the pixel size of the training data, thereby improving the quality of the inference images. Since the pixel size of our training data is 50 nm, we upsample the input confocal images with a pixel size of 75 nm in the first row and downsample the input images with other pixel sizes in the third to seventh rows. In addition, comparing the network output images in the second column with those in the third column, we notice that the model trained with the L1 loss is more robust against variations in pixel size than the model trained with the L2 loss, although the latter obtains better inference images when the pixel sizes of the input images and the training data are the same (50 nm in our experiments).
This result is related to the fact that the L2 loss is more sensitive to outliers and more easily gets stuck in a local minimum [23, 24].
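The pixel-size matching step can be sketched as a simple resampling. Here we use nearest-neighbour interpolation for brevity (a real pipeline would likely use a higher-order interpolant), and the function and toy array below are our own illustration, assuming the 50 nm training pixel size stated above.

```python
def resample(image, pixel_size_in, pixel_size_train=50.0):
    """Resample so each output pixel spans pixel_size_train nanometres."""
    zoom = pixel_size_in / pixel_size_train   # >1 upsamples, <1 downsamples
    h, w = len(image), len(image[0])
    new_h, new_w = round(h * zoom), round(w * zoom)
    # nearest-neighbour lookup back into the source grid
    return [[image[min(int(r / zoom), h - 1)][min(int(c / zoom), w - 1)]
             for c in range(new_w)]
            for r in range(new_h)]

# A 4x4 image at 75 nm per pixel becomes 6x6 at the 50 nm training pitch.
img = [[r * 4 + c for c in range(4)] for r in range(4)]
up = resample(img, 75.0)
```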
This also facilitates the application of the TCAN model to confocal images with a large field of view. Figure 4 displays the results of applying our method to super-resolve such confocal images, revealing finer features of the microtubules. These results demonstrate that the proposed framework achieves favorable performance for various fields of view of the input images.
When the input images are captured with a new experimental setup, our TCAN model does not need to be retrained. We apply the network model trained with wide-field and SIM image pairs to directly super-resolve images of microtubules captured with the Nikon A1R MP+ microscope. The confocal images are transformed into resolution-enhanced images, as shown in Fig. 5, exhibiting sharper details of the microtubules. To further demonstrate the network's generalization, two large confocal image patches, also acquired with the Nikon A1R MP+ microscope, are used as input, and Figure S5 in the Supporting Information illustrates the advantage of the GAN-based super-resolution approach with upsampling/downsampling. It is thus possible to extend our TCAN model to super-resolve low-resolution images captured with different imaging systems.
The generalization of our TCAN model includes improving the resolution of images acquired with new imaging systems and of new types of samples absent from the training phase. As shown in Fig. 5 and Figure S5, resolution enhancement of confocal images captured with the Nikon A1R MP+ microscope is achieved by our network model trained with wide-field and SIM image pairs. Another example of the generalization of our approach is given in Fig. 8, where our TCAN model trained only with images of microtubules is applied to super-resolve actin. Even though this type of sample is unseen in the training dataset, our network correctly infers its fine structures.
Resolution enhancement in confocal images of live-cell microtubules. To test whether TCAN is competent for live-cell imaging, we study the dynamic changes of microtubules by time-lapse imaging. The dynamic instability of microtubules is important because of their involvement in delivering information, and it is a fast process demanding high spatiotemporal resolution [25].
In this work, we employ the TCAN model trained with static microtubule images to transform low-resolution confocal images of live-cell microtubules into high-resolution ones. The raw images in both the confocal and STED modes are acquired for 10 frames at 45 s intervals (Fig. 6a). Figure 6a shows the resolution enhancement and superior image quality compared with the STED images, and the resolution of the network output images remains almost constant for at least 7 minutes (see Visualization 1). The dynamic instability of microtubules is then visualized, for example, as marked by the arrows in Fig. 6b-e. The dynamic changes fall into two kinds: changes in the shape of microtubules (Fig. 6b-c) and changes in their length (Fig. 6d-e). For the first kind, we capture a microtubule varying distinctly, becoming curved from originally straight. This is consistent with the current model of microtubule assembly and dynamics, which postulates that microtubules grow by attachment of curved guanosine triphosphate (GTP)-tubulins to the ends of curved protofilaments [26]. For the second kind, the plus end of the microtubule grows by assembly, and quick transitions between microtubule growth and temporary pauses can even be observed at high temporal resolution in our experiments. The high spatial resolution of our TCAN model ensures precise characterization of microtubule dynamics and detection of densely packed microtubules undetectable with other methods.
Similar improvement is obtained when applying our method to super-resolve confocal images of live-cell microtubules acquired with the Nikon A1R MP+ microscope (see Visualization 2). We capture raw images for 31 frames at 20 s intervals. This result discerns the dynamic changes at microtubule intersections: the intersection indicated by the blue arrows (Fig. 6h-i) gradually separates because of microtubule shrinkage. The microtubule at the magenta arrow in Fig. 6j-k shrinks while the other microtubule grows over time until they intersect.
The changes in the separation distance of intersecting microtubules and the microtubule shrinkage can also be viewed in Fig. 7, Visualization 3 and Visualization 4. We capture raw images for 61 frames at 20 s intervals. As demonstrated in Ref. [27], lysosome transport is strongly correlated with the distance between intersecting microtubules, so it is crucial to visualize the motion of complex microtubule networks at high resolution. Moreover, the unchanged microtubules in the white boxes in Fig. 7 indicate that our region of interest stays in the focal plane during the observation period, excluding the possibility that the dynamic changes of the microtubules arise from defocusing. Note that the imaging time of live-cell microtubules in Visualization 3 and Visualization 4 is about 20 minutes. Since the confocal microscope does not suffer from photobleaching and phototoxicity as severely as the STED microscope, our method is well suited for long-term super-resolution imaging of live cells.
The above results highlight the feasibility and advantage of improving image resolution with deep learning. In other words, the proposed TCAN model counters the photobleaching inherent in the traditional STED technique by extending the maximum number of usable consecutive frames of time-lapse images [28].
Resolution enhancement in dual-color confocal images of actin-microtubules. Actin and microtubules are components of the cytoskeleton, and their crosstalk is important for core biological processes [29]. Thus, we simultaneously image actin filaments (cyan) and microtubules (magenta) with the Nikon A1R MP+ microscope and then improve the image resolution with our TCAN model trained only on the microtubule data. The raw confocal images in Fig. 8a, 8c and 8e exhibit spurious small structures outside the filaments and large fluctuations in fluorescence along the actin filaments. In contrast, TCAN suppresses the artifacts and successfully resolves the densely packed microtubule structures and the fine branches of the actin filaments (Fig. 8b, 8d and 8f). The relative positions of the microtubules and actin filaments can also be observed in the super-resolved dual-color images. Typical modes of crosstalk between microtubules and actin can be found in our network output, for instance actin-microtubule crosslinking (white box), actin barrier (green box) and mechanical cooperation (Fig. 8f) [29], whereas these are unclear in the confocal images due to the poor resolution.