3.1. Overview
This paper proposes a multi-class semantic segmentation method for breast tissues to reconstruct the breast shape in a standing position using a U-Net based on Haar wavelet pooling. Labeled breast tissue data are necessary to train deep learning networks. MRI images, which are essential for breast cancer screening, have higher resolution and lower noise than CT or ultrasound images. Moreover, MRI images are effective for representing and segmenting fine soft tissues in the breast, such as fat and fibroglandular tissue.
Figure 1 shows the breast tissue segmentation process based on a deep learning network for breast shape reconstruction. In the first step, MRI images in Digital Imaging and Communications in Medicine (DICOM) format were collected. In the second step, breast tissues in the collected MRI images were labeled to build a dataset for training the deep learning network. Labeling consisted of removing noise from the MRI images with a median filter, segmenting tissues with Otsu's threshold algorithm, extending the segmentation to all MRI slices with a template-based method, and verifying the labels with a radiologist. In the third step, a U-Net based on Haar wavelet pooling was designed to segment breast tissues for breast shape reconstruction. To improve multi-class semantic segmentation performance on MRI breast images, this study combined U-Net with Haar wavelet pooling. The U-Net based on Haar wavelet pooling uses the LL sub-band, which holds the approximate values of the input image, and three high-frequency sub-bands (LH, HL, and HH), which hold detailed edge features. Breast tissue features could therefore be extracted effectively while reducing the loss of image information during subsampling. The U-Net based on Haar wavelet pooling was trained with the constructed dataset and its performance was then tested.
3.2. Building an MRI dataset for breast tissue segmentation
A dataset including a label for every pixel of the MRI data is required for breast tissue segmentation with deep learning. In this study, an MRI dataset was collected from the breast-diagnosis database [22] of The Cancer Imaging Archive (TCIA), an open-access database of medical images for cancer research. The breast-diagnosis database contains medical images of breast cancer patients as well as breast-diagnosis cases such as high-risk normal, DCIS, fibroids, and lobular carcinoma. Each image was captured with three pulse sequences (T2, STIR, BLISS) using a Philips 1.5 T MRI system. Breast MRI images of 89 breast cancer patients were obtained at 2 mm slice intervals at resolutions of 500 to 600 DPI, with 80 to 90 MRI slices per person. This study used the MRI slice images of the T2-weighted pulse sequence data.
Figure 2 shows the step-by-step process of labeling breast tissues in MRI slice images. The breast tissue should be segmented into skin, fat, fibroglandular tissue, and background because the upper part of the pectoral muscle is incised in a mastectomy. As shown in Fig. 2(a), the noise of the slice image was removed using a median filter. Otsu's threshold algorithm was then used to segment breast tissues from the denoised MRI images. This algorithm separates foreground from background with a threshold based on the distribution of pixel values in the image. Multiple thresholds were used to separate breast tissues with similar pixel distributions.
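Otsu's method chooses the threshold that maximizes the between-class variance of the intensity histogram. A minimal single-threshold sketch in NumPy (the labeling pipeline above applies multiple thresholds in succession; the function name is illustrative, not from the paper):

```python
import numpy as np

def otsu_threshold(pixels, nbins=256):
    """Return the intensity threshold maximizing between-class variance."""
    hist, edges = np.histogram(pixels, bins=nbins)
    hist = hist.astype(float) / hist.sum()          # normalized histogram
    centers = (edges[:-1] + edges[1:]) / 2          # bin-center intensities
    best_t, best_var = centers[0], -1.0
    for i in range(1, nbins):
        w0, w1 = hist[:i].sum(), hist[i:].sum()     # class probabilities
        if w0 == 0 or w1 == 0:
            continue
        mu0 = (hist[:i] * centers[:i]).sum() / w0   # class means
        mu1 = (hist[i:] * centers[i:]).sum() / w1
        var_between = w0 * w1 * (mu0 - mu1) ** 2    # between-class variance
        if var_between > best_var:
            best_var, best_t = var_between, centers[i]
    return best_t
```

For a bimodal intensity distribution such as fibroglandular tissue against background, the returned threshold falls between the two modes.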
Figure 2(b) shows the result of segmenting fibroglandular tissue (yellow region) from the background (light blue region) using Otsu's threshold algorithm, with fibroglandular tissue set as the foreground. Figure 2(c) shows the result of segmenting fat (green region) from the rest of the breast tissues (red region) through Otsu's threshold algorithm, with fat, muscle, and chest wall set as the foreground. Figure 2(d) shows the background (brown region) and the inside of the human body (green region) separated through Otsu's threshold algorithm. Skin data are lost in T2-weighted pulse sequence images. Hence, the boundary line between the green and brown regions was offset inward by pixels corresponding to a thickness of 2 mm, matching the thickness of human skin, and the result was used as skin data. These segmented skin data were validated against BLISS MRI images of the breast cancer patients. In this process, breast-diagnosis cases such as fibroma and breast cancer were recognized and segmented as fibroglandular tissue because they are conditions in which hormonal action expands the mammary glands. As shown in Fig. 2(f), the breast tissues obtained in the previous steps were integrated. Light blue, blue, green, and yellow regions represent the background, skin, fat, and fibroglandular tissue, respectively.
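One way to realize the skin offset is to peel a fixed-width band of pixels just inside the body/background boundary. A rough NumPy sketch using repeated 4-neighbour binary erosion (`thickness` is the band width in pixels derived from the 2 mm skin thickness; this is an illustration, not the authors' exact implementation):

```python
import numpy as np

def boundary_band(body_mask, thickness):
    """Return the `thickness`-pixel band just inside the body boundary."""
    m = body_mask.copy()
    for _ in range(thickness):
        # 4-neighbour binary erosion via shifted copies; the mask is
        # assumed not to touch the array border.
        eroded = m.copy()
        eroded[1:, :] &= m[:-1, :]
        eroded[:-1, :] &= m[1:, :]
        eroded[:, 1:] &= m[:, :-1]
        eroded[:, :-1] &= m[:, 1:]
        m = eroded
    # Skin band = body pixels removed by the erosion.
    return body_mask & ~m
```

Eroding a square body mask by one pixel, for example, leaves exactly its one-pixel perimeter as the band.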
To reduce manual labeling of breast tissues, we used template-based segmentation. Template-based segmentation uses a single segmented MRI slice as a reference template and extends its labels to the remaining MRI slices. With a segmented cross-sectional slice set as the reference template, it extracts individual breast tissue features after analyzing each tissue's location, size, shape, and pixel distribution values. The fully segmented MRI image set is then obtained by extracting breast tissues with similar characteristics from consecutive MRI slices. Figure 3 shows the process of segmenting the entire MRI slice set through template-based segmentation. T2-weighted pulse sequence data were input for breast tissue segmentation, and the segmented breast tissues were output at 800 × 800 DPI resolution. The data labeled with template-based segmentation were validated by a radiologist.
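The paper does not spell out the exact matching rule. As one much-simplified illustration of propagating template labels by pixel distribution alone, each pixel of an adjacent slice could be assigned the label of the tissue class whose mean intensity in the template is closest (the location, size, and shape cues the method also uses are omitted here; all names are hypothetical):

```python
import numpy as np

def propagate_labels(template_img, template_labels, next_img):
    """Assign each pixel of the next slice the label of the template
    tissue class with the nearest mean intensity."""
    classes = np.unique(template_labels)
    # Mean intensity of each tissue class in the reference template.
    means = np.array([template_img[template_labels == c].mean()
                      for c in classes])
    # Distance from every pixel to every class mean; pick the nearest.
    dist = np.abs(next_img[..., None] - means)    # (H, W, n_classes)
    return classes[np.argmin(dist, axis=-1)]
```

Because consecutive 2 mm slices change little, such nearest-distribution matching carries most labels across; ambiguous pixels are exactly what the radiologist's verification step catches.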
3.3. Designing U-Net based on Haar wavelet pooling
This study proposed a U-Net that uses Haar wavelet pooling in the subsampling stage. The wavelet transform carries information in both the spatial domain and the frequency domain; a wavelet is a zero-mean oscillation of finite duration. The wavelet transform can effectively detect sudden signal changes because it describes regional features and provides signal analysis at different scales and levels. The wavelet transform of a signal \(x\left(t\right)\) is defined by Eq. (1) [23]:
$${W}_{a}x\left(b\right)= \frac{1}{\sqrt{a}}\int_{-{\infty }}^{+{\infty }}x\left(t\right)\,{\varPsi }^{*}\!\left(\frac{t-b}{a}\right)dt,\qquad a>0 \tag{1}$$
where \(a\) is the scale parameter, \(b\) is the displacement, and \({\varPsi }^{\text{*}}\left(t\right)\) is the complex conjugate of a continuous basis function called the mother wavelet. In this study, a two-dimensional (2D) discrete Haar wavelet transform, which minimizes the amount of computation when transforming MRI breast images inside the deep learning network, was used.
The 2D wavelet transform represents the input image as a matrix of two-dimensional signals based on pixel brightness. Data passing through the 2D wavelet transform are divided into four bands according to the applied filters. Figure 4 shows the structure of wavelet pooling. Wavelet pooling proceeds in two steps; in each step, the input data are decomposed by a high-pass filter and a low-pass filter and then down-sampled, reducing the data size. In the first step, the horizontal filter separates the input image into a low-frequency component (L) and a high-frequency component (H): the low-frequency component carries the approximate values of the input image and the high-frequency component carries the detail values. In the second step, the vertical filter separates the low- and high-frequency components again, decomposing the data into four bands: LL, LH, HL, and HH. The resolution of every band is half that of the input data. The LL band holds the low-frequency component, representing the overall trend of the input data, while the LH, HL, and HH bands hold edge features for the vertical, horizontal, and diagonal components, respectively. Segmentation performance can be improved because the decomposed data represent fine features such as micro breast tissue structures.
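The two-step decomposition above can be sketched in NumPy with the orthonormal Haar filters, where each filtering step is a pairwise sum or difference followed by down-sampling (sub-band naming conventions vary between libraries; here the first letter denotes the horizontal pass):

```python
import numpy as np

def haar_pool(x):
    """Single-level 2D Haar decomposition of an (H, W) array.

    Returns LL, LH, HL, HH sub-bands, each of shape (H//2, W//2).
    """
    # Step 1: horizontal low/high-pass filtering with down-sampling.
    lo = (x[:, 0::2] + x[:, 1::2]) / np.sqrt(2)   # low-frequency (L)
    hi = (x[:, 0::2] - x[:, 1::2]) / np.sqrt(2)   # high-frequency (H)
    # Step 2: vertical filtering of each component, again down-sampled.
    ll = (lo[0::2, :] + lo[1::2, :]) / np.sqrt(2)
    lh = (lo[0::2, :] - lo[1::2, :]) / np.sqrt(2)
    hl = (hi[0::2, :] + hi[1::2, :]) / np.sqrt(2)
    hh = (hi[0::2, :] - hi[1::2, :]) / np.sqrt(2)
    return ll, lh, hl, hh
```

For a constant image all three detail bands are zero and LL carries the (scaled) average, matching the description of LL as the overall trend of the input.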
U-Net consists of a contracting path that extracts features from the training data and an expansive path that restores the original resolution. The contracting path performs down-sampling by setting the convolution stride to two in each step, whereas the expansive path performs up-sampling using transposed convolutions. The max pooling used in the subsampling stage of previous studies is difficult to generalize because it is sensitive to overfitting of the dataset [24]. Although some studies have attempted to solve the vanishing gradient problem by passing information from the contracting path to the expansive path through skip connections, overfitting still occurs [25]. Breast tissues are fine structures spanning only a few pixels, so max pooling may lose information about them. This study therefore designed a deep learning network with a U-Net architecture that uses Haar wavelet pooling for subsampling to segment breast tissues.
Figure 5 shows the deep learning network architecture that combines Haar wavelet pooling with U-Net. The network was composed of 12 convolution layers, 5 Haar wavelet pooling layers, and 5 inverse-wavelet up-sampling layers. Input breast image data were converted into LL, LH, HL, and HH band data by Haar wavelet pooling, and the converted data were then passed to the convolution layers. The resolution was restored using the inverse wavelet transform, which reconstructs data from the wavelet pooling outputs. In the proposed architecture, batch normalization and a ReLU activation function were used with each convolution layer. Applying Haar wavelet pooling reduced the amount of computation compared with previous studies. Pooling itself did not change the number of channels in the network; however, because the U-Net based on Haar wavelet pooling used the LL, LH, HL, and HH bands simultaneously, the number of input channels to the subsequent convolution layer increased by a factor of four.
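The channel bookkeeping and the inverse-wavelet up-sampling can be checked with a NumPy sketch: stacking the four sub-bands of every feature map halves the spatial size and quadruples the channel count, and the inverse transform restores the input exactly (function names are illustrative; the actual network applies this per feature map inside the U-Net):

```python
import numpy as np

def haar_decompose(x):
    """Single-level 2D Haar transform of an (H, W) array into 4 sub-bands."""
    lo = (x[:, 0::2] + x[:, 1::2]) / np.sqrt(2)
    hi = (x[:, 0::2] - x[:, 1::2]) / np.sqrt(2)
    ll = (lo[0::2, :] + lo[1::2, :]) / np.sqrt(2)
    lh = (lo[0::2, :] - lo[1::2, :]) / np.sqrt(2)
    hl = (hi[0::2, :] + hi[1::2, :]) / np.sqrt(2)
    hh = (hi[0::2, :] - hi[1::2, :]) / np.sqrt(2)
    return ll, lh, hl, hh

def wavelet_pool(x):
    """(C, H, W) -> (4C, H/2, W/2): spatial size halves, channels quadruple."""
    bands = [b for c in x for b in haar_decompose(c)]
    return np.stack(bands)

def inverse_wavelet_unpool(y):
    """(4C, H/2, W/2) -> (C, H, W): exact inverse of wavelet_pool."""
    out = []
    for i in range(0, y.shape[0], 4):
        ll, lh, hl, hh = y[i], y[i + 1], y[i + 2], y[i + 3]
        # Undo the vertical filtering step.
        lo = np.empty((ll.shape[0] * 2, ll.shape[1]))
        hi = np.empty_like(lo)
        lo[0::2, :] = (ll + lh) / np.sqrt(2)
        lo[1::2, :] = (ll - lh) / np.sqrt(2)
        hi[0::2, :] = (hl + hh) / np.sqrt(2)
        hi[1::2, :] = (hl - hh) / np.sqrt(2)
        # Undo the horizontal filtering step.
        x = np.empty((lo.shape[0], lo.shape[1] * 2))
        x[:, 0::2] = (lo + hi) / np.sqrt(2)
        x[:, 1::2] = (lo - hi) / np.sqrt(2)
        out.append(x)
    return np.stack(out)
```

The perfect reconstruction property is what lets the up-sampling path restore resolution without the information loss incurred by max pooling.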