Deep learning for digital pathology is hindered by the extremely high spatial resolution of whole slide images (WSIs). Most studies adopt patch-based methods, which require well-annotated data for training; such annotations are typically produced by laborious free-hand contouring of the WSI by experts. To alleviate the annotation burden on experts while still benefiting from scaling up the amount of training data, we develop a whole-slide training method that classifies lung cancer types from entire WSIs using only slide-level diagnoses. Our method leverages unified memory to offload excess memory consumption to host memory, enabling a classifier to be trained on entire slides of hundreds of millions of pixels. Experiments were conducted on a lung cancer dataset containing 9,662 digital slides covering the main cancer types. The proposed method achieved AUCs of 0.950 and 0.924 for adenocarcinoma and squamous cell carcinoma, respectively, on a separate testing set. Furthermore, critical regions highlighted by applying the class activation map (CAM) technique to our model show a high correspondence with cancerous areas annotated by pathologists.
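The abstract does not name a framework, but the unified-memory offloading it describes corresponds to CUDA managed memory, in which an allocation may exceed device memory and its pages migrate between host RAM and the GPU on demand. The following is a minimal sketch of that mechanism in plain CUDA, not the authors' implementation; the buffer size, the `scale` kernel, and the memory-advice hint are illustrative assumptions.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Toy kernel standing in for one pass of a network over a gigapixel slide buffer.
__global__ void scale(float *data, size_t n, float factor) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    // Hypothetical tensor: a 50,000 x 50,000 x 3 float32 slide (~30 GB),
    // larger than the memory of a typical GPU.
    size_t n = 50000ULL * 50000ULL * 3ULL;
    float *slide = nullptr;

    // cudaMallocManaged creates a unified (managed) allocation: pages migrate
    // on demand between host and device, so the working set can exceed GPU RAM.
    cudaError_t err = cudaMallocManaged(&slide, n * sizeof(float));
    if (err != cudaSuccess) {
        fprintf(stderr, "allocation failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Optional hint: keep pages resident in host memory until the GPU touches them.
    cudaMemAdvise(slide, n * sizeof(float),
                  cudaMemAdviseSetPreferredLocation, cudaCpuDeviceId);

    // The kernel touches the whole buffer; the driver pages data in and out of
    // device memory as needed, trading bandwidth for capacity.
    size_t threads = 256;
    size_t blocks = (n + threads - 1) / threads;
    scale<<<(unsigned)blocks, (unsigned)threads>>>(slide, n, 0.5f);
    cudaDeviceSynchronize();

    cudaFree(slide);
    return 0;
}
```

In a training setting, the same idea lets activations and gradients of a whole-slide forward/backward pass spill to host memory instead of exhausting device memory, at the cost of page-migration overhead.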