Dataset
The ACDC-LungHP dataset(9, 15) contains 200 H&E stained biopsy samples with cancer. All samples have been digitalized by a digital slide scanner (3DHISTECH Pannoramic 250) with objective magnifications of 20x. The cancer regions on tissue level for each WSI have been manually annotated by experienced pathologists. Among them 150 samples with reference standards are released as training data. The remaining 50 samples are test data. Whole-Slide images are in TIFF format, manual annotations are in XML format. Detailed information about the dataset can be found at the published paper and web site(9, 15). In the clinical practice, more than one sample from the same biopsy will be scanned. The dataset did not annotate all samples. Specifically, if samples have a similar shape, the pathologist only annotated one sample for the WSI. Because most of these slides contain multiple samples, these unannotated samples must be ignored during model training. In order to exclude these unused tissue samples, we use the ASAP(Automated Slide Analysis Platform) software to annotate the region of interest(ROI) areas. For every slide, a ROI annotation XML file was created.
Data preprocessing
The ACDC-LungHP dataset contains a training dataset and a test dataset; however, the competition has finished and the ground-truth annotations of the test dataset are not publicly available. Therefore, the training dataset was randomly divided into training, validation, and test sets at the slide level, with 100, 25, and 25 slides, respectively. Besides data splitting, preprocessing included the creation of ROI and tumor masks and the generation of image patches. For every slide, a binary ROI mask file and a tumor mask file were created from the corresponding XML annotation files using the ASAP library.
The patch size was 512 × 512 pixels, a common choice in image segmentation tasks and WSI analysis (16); in preliminary experiments, there was no obvious difference between patch sizes of 448 × 448, 512 × 512, and 576 × 576. Whole-slide images are commonly stored at multiple resolutions to allow efficient image loading. Patches were generated at slide levels 0, 1, and 2, which correspond to magnifications of 20X, 10X, and 5X, respectively. Magnifications of 2.5X or lower were not considered, because in preliminary experiments model performance degraded due to the small sample size. For each slide in the training dataset, a bounding-box list was created first, containing both regular-grid, non-overlapping bounding boxes and random bounding boxes; the ratio of non-overlapping to random patches was 2:1. A bounding box was removed if less than 1% of its area lay inside the ROI. For every remaining bounding box, an image patch and a tumor mask patch were created simultaneously. For images in the validation and test datasets, only non-overlapping image patches were created. The file-naming rules for regular and random patches differ: the filename of a regular patch contains its row and column position on the WSI, so that during inference the predicted tumor mask of the whole slide can be reconstructed by combining the predicted mask patches. Details of data preprocessing can be found in the source code.
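The bounding-box generation described above can be sketched as follows. This is a minimal illustration, not the original implementation: the function name, the 2:1 grid-to-random ratio, and the 1% ROI filter follow the text, while the variable names are our own.

```python
import numpy as np

PATCH = 512  # patch side length in pixels

def grid_and_random_boxes(height, width, roi_mask, min_roi_frac=0.01, rng=None):
    """Build the bounding-box list for one slide: regular non-overlapping
    grid boxes plus half as many random boxes (2:1 ratio), keeping only
    boxes whose ROI coverage is at least `min_roi_frac`. `roi_mask` is a
    binary array at the same resolution as the slide level in use."""
    rng = rng or np.random.default_rng(0)
    boxes = []
    # regular grid boxes; row/col are kept so filenames can encode position
    for r in range(height // PATCH):
        for c in range(width // PATCH):
            boxes.append((r * PATCH, c * PATCH, "grid", r, c))
    # random boxes: half the number of grid boxes (grid:random = 2:1)
    for _ in range(len(boxes) // 2):
        y = int(rng.integers(0, height - PATCH))
        x = int(rng.integers(0, width - PATCH))
        boxes.append((y, x, "random", None, None))
    # drop boxes containing less than 1% ROI area
    return [b for b in boxes
            if roi_mask[b[0]:b[0] + PATCH, b[1]:b[1] + PATCH].mean() >= min_roi_frac]
```

For a fully annotated 1024 × 1024 region this yields four grid boxes and two random boxes; with an empty ROI mask every box is filtered out.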
Neural networks
From the point of view of computer vision, lung cancer detection is usually treated as a semantic segmentation task. For semantic segmentation in medical image analysis, U-Net (17) and its variants are the most popular models. Ensemble learning works best with base models that are both highly accurate and diverse (18). To boost the performance of the ensemble model, two U-Net variants, U-Net and U-Net++ (19), with two different encoders, ResNet34 (20) and DenseNet121 (21), were selected as base models.
In preliminary experiments, U-Net and U-Net++ achieved comparable performance. However, the more complicated attention U-Net (22) and R2U-Net (23) not only consumed more GPU memory but also performed worse, so they were not adopted. With the same U-Net variant, choosing ResNet34 and DenseNet121 as encoders gave better, or at least equal, performance compared with other encoders such as MobileNetV3, EfficientNet, and Res2Net, so these two were chosen. During training, the encoders were initialized with ImageNet pre-trained weights.
Loss function
In the ACDC-LungHP challenge, the Dice similarity coefficient (DICE) was used as the evaluation metric, so it is natural to use the Dice loss function. However, with the Dice loss, sensitivity was markedly lower than specificity, which is undesirable for lung cancer screening. The weighted binary cross-entropy (BCE) loss can flexibly balance false positives and false negatives by setting different weights; moreover, the cross-entropy loss has smooth gradients. Every model was therefore trained independently with each of the two loss functions, i.e., the Dice loss (24, 25) and the weighted BCE loss. The mathematical formulae of the loss functions are:
$${\mathcal{L}}_{W\text{-}BCE}=-\left(\beta\, y\log\left(\hat{y}\right)+\left(1-y\right)\log\left(1-\hat{y}\right)\right)$$
$${\mathcal{L}}_{Dice}=1-\frac{2y\hat{y}+\epsilon }{y+\hat{y}+\epsilon }$$
Here \(y\) is the ground-truth label and \(\hat{y}\) is the predicted probability; because of label smoothing (26), \(y\) is not always exactly 0 or 1. The parameter β is the penalty weight for false negatives, and based on experimental results it was set to 4. In the Dice loss, \(|y|\) and \(|\hat{y}|\) are computed with a soft sum over pixels. ε is a small constant (1e-7) that avoids division by zero.
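The two losses can be sketched in PyTorch as follows. The function names are illustrative; the β = 4 default and the ε = 1e-7 smoothing term follow the text, and `y_hat` is assumed to already be a probability (e.g. after a sigmoid).

```python
import torch

def weighted_bce_loss(y_hat, y, beta=4.0, eps=1e-7):
    """Weighted BCE: beta penalizes false negatives (y = 1, y_hat small)
    more heavily. Probabilities are clamped away from 0/1 for stability."""
    y_hat = y_hat.clamp(eps, 1 - eps)
    return -(beta * y * torch.log(y_hat) + (1 - y) * torch.log(1 - y_hat)).mean()

def dice_loss(y_hat, y, eps=1e-7):
    """Soft Dice loss: |y| and |y_hat| are approximated by sums over pixels."""
    inter = (y * y_hat).sum()
    return 1 - (2 * inter + eps) / (y.sum() + y_hat.sum() + eps)
```

Note the asymmetry of the weighted BCE: a confident false negative costs roughly β times as much as an equally confident false positive, which is what pushes sensitivity up relative to the plain Dice loss.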
Training strategies
To enlarge the sample size and improve the generalizability of the model, image augmentation was used during training (27). Compared with augmenting images before training, the on-the-fly implementation not only saves time but is also more flexible. The augmentation operations included random horizontal and vertical flipping and random brightness and contrast modifications. The Albumentations (28) library and the PyTorch dataset class were used to implement real-time image augmentation. After augmentation, all pixel values were normalized to (0–1). Technical details about image augmentation can be found in the source code.
Adam (29) with Lookahead (30) (k = 5, alpha = 0.5) was used as the optimizer. Automatic mixed-precision training (31) was used to speed up training and inference and to save GPU memory. Label smoothing (ε = 0.1) was used to calibrate probabilities and improve generalization (32). The batch size was set to 64, and the number of epochs was set to 4, 7, and 10 for slide levels 0 (20X magnification), 1 (10X magnification), and 2 (5X magnification), respectively. The initial learning rate was set to 1e-4 and multiplied by a factor of gamma = 0.1 after every epoch until it reached 1e-6. Every model was trained 3 times under the same settings, and the model with the minimum validation loss was chosen as the best model. In our experiments, performance was insensitive to these hyper-parameters.
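The learning-rate schedule (decay by 0.1 each epoch, floored at 1e-6) can be expressed with a `LambdaLR`. This is a sketch: the Lookahead wrapper is not part of core PyTorch and comes from third-party packages, so it is omitted here, and the `Conv2d` stands in for a real U-Net variant.

```python
import torch

model = torch.nn.Conv2d(3, 1, 3, padding=1)  # stand-in for a U-Net variant

# Adam as the inner optimizer; in the paper it is additionally wrapped in
# Lookahead (k=5, alpha=0.5) from a third-party implementation.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# Multiply the LR by 0.1 after every epoch, but never drop below 1e-6.
scheduler = torch.optim.lr_scheduler.LambdaLR(
    optimizer, lambda epoch: max(0.1 ** epoch, 1e-6 / 1e-4))
```

Calling `scheduler.step()` once per epoch walks the rate 1e-4 → 1e-5 → 1e-6, after which the floor keeps it constant.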
Inference
An unweighted average was used to combine the results of multiple models. Lung cancer detection can be regarded as a pixel classification task, in which every pixel is classified as cancer or non-cancer. The prediction for each pixel is:
probj = \(\frac{\sum _{i=1}^{N}{W}_{i}\,{p}_{ij}}{\sum _{i=1}^{N}{W}_{i}}\)

resultj = \({prob}_{j}\ge threshold\)
The number of base models is denoted by N, Wi is the weight of model i, and pij is the predicted probability of model i for pixel j. For simplicity, instead of being learned by a meta-learner (33), Wi is set to 1 for all models (unweighted average). probj is the predicted cancer probability, which ranges from 0 to 1. The final predicted class of pixel j is denoted by resultj, where 1 stands for cancer and 0 for non-cancer. The default value of 0.5 was used as the threshold.
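The per-pixel formulae above vectorize naturally over whole probability maps. The following sketch (function name ours) implements the general weighted form; `weights=None` reproduces the unweighted average used in the paper.

```python
import numpy as np

def ensemble_predict(prob_maps, weights=None, threshold=0.5):
    """Weighted average of per-model probability maps followed by
    thresholding. prob_maps has shape (N, H, W); with weights=None all
    W_i = 1, i.e. the unweighted average."""
    prob_maps = np.asarray(prob_maps, dtype=np.float64)
    if weights is None:
        weights = np.ones(len(prob_maps))
    weights = np.asarray(weights, dtype=np.float64)
    # prob_j = sum_i W_i * p_ij / sum_i W_i, computed for every pixel j
    prob = np.tensordot(weights, prob_maps, axes=1) / weights.sum()
    return (prob >= threshold).astype(np.uint8)
```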
To improve training and inference speed, patches outside the ROI areas and background patches were removed during patch generation. During inference, the missing patches must therefore be added back so that the patches can be combined into a whole image: for every missing image patch, a predicted mask patch with all black pixels was created. The patch combination algorithm was implemented with the NumPy concatenate operation along the height and width dimensions. Please refer to the source code for more details.
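The reconstruction step can be sketched as below. The function name and the dictionary keyed by the (row, column) position encoded in the patch filenames are our own framing of the description above.

```python
import numpy as np

def assemble_mask(patches, n_rows, n_cols, patch=512):
    """Rebuild the whole-slide mask from predicted patches keyed by
    (row, col). Positions missing from `patches` (skipped background or
    non-ROI patches) are filled with all-black patches before stitching
    with np.concatenate along width, then height."""
    black = np.zeros((patch, patch), dtype=np.uint8)
    rows = []
    for r in range(n_rows):
        row = [patches.get((r, c), black) for c in range(n_cols)]
        rows.append(np.concatenate(row, axis=1))   # stitch along width
    return np.concatenate(rows, axis=0)            # stitch along height
```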
Performance metrics
Accuracy, sensitivity, specificity, and the DICE coefficient were used for performance evaluation (34, 35). Taking positive to mean cancer and negative to mean non-cancer, we use the standard notations TP, TN, FP, and FN for true/false positives and negatives.
Accuracy is the number of correct predictions divided by the total number of predictions. Sensitivity (true positive rate) is the probability of a positive test, conditioned on truly being positive. Specificity (true negative rate) is the probability of a negative test, conditioned on truly being negative. The Dice coefficient is equivalent to F1, the harmonic mean of precision and recall. All these metrics are bounded between 0 and 1 (perfect). The mathematical formulas are as follows:
Accuracy=\(\frac{TP+TN}{TP+TN+FP+FN}\)
Sensitivity =\(\frac{TP}{TP+FN}\)
Specificity =\(\frac{TN}{TN+FP}\)
Fβ=\(\frac{(1+{\beta }^{2})TP}{(1+{\beta }^{2})TP+{\beta }^{2}FN+FP}\)
Dice = F1=\(\frac{2TP}{2TP+FN+FP}\)
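The four metrics follow directly from pixel-level confusion counts; a minimal sketch (function name ours) for binary masks:

```python
import numpy as np

def segmentation_metrics(pred, target):
    """Pixel-level accuracy, sensitivity, specificity, and Dice computed
    from binary masks (1 = cancer, 0 = non-cancer)."""
    pred = np.asarray(pred).astype(bool)
    target = np.asarray(target).astype(bool)
    tp = np.sum(pred & target)
    tn = np.sum(~pred & ~target)
    fp = np.sum(pred & ~target)
    fn = np.sum(~pred & target)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "dice": 2 * tp / (2 * tp + fn + fp),
    }
```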
Experimental settings
Hardware: 2 × Intel Xeon E5-2620 v4 CPUs, 256 GB memory, 2 × Nvidia RTX 3090 GPUs
Software: Ubuntu 20.04, CUDA 11.3, Anaconda 4.10.
Programming language and libraries: Python 3.8, PyTorch 1.10, Torchvision, OpenCV, NumPy, SciPy, scikit-learn, Matplotlib, Pandas, Albumentations, segmentation_models_pytorch, CuPy, OpenSlide-Python, tqdm. Detailed information about these software libraries can be found in the requirements.txt file of the source code.