Patients and image acquisition
Image data from 283 patients with prostate tumours treated at our centre were randomly selected. The study was conducted with the approval of our Institutional Review Board (N21-002) and performed in accordance with the Declaration of Helsinki. All patients were immobilized on the treatment couch with a urethane resin cushion (Moldcare®, Alcare, Tokyo, Japan) and low-temperature thermoplastic shells (Shell Fitter®, Kuraray Co., Ltd., Osaka, Japan). No water restriction was imposed, and the rectum was emptied by the patient's own effort or with a laxative or enema. Prior to treatment, two or three fiducial markers were implanted into the prostate.
Treatment planning CT image
Planning CT image data were acquired under breath-hold in exhalation using a 320-detector CT (Aquilion One Vision, Canon Medical Systems, Otawara, Japan) in the simulation room. CT imaging conditions were based on our clinical protocols, with a tube voltage of 120 kV. X-ray tube current was adjusted by automatic exposure control [10]. The reconstructed CT image pixel size and the slice thickness were 0.976 mm × 0.976 mm and 2.0 mm, respectively.
X-ray image
X-rays were acquired with a pair of X-ray tubes installed in the treatment room and two indirect flat panel detectors (FPDs) (PaxScan 3030+®, Varian Medical Systems, Palo Alto, CA, USA) with amorphous silicon receptors, installed on the right and left sides of the vertical irradiation port at 35° and − 35°, respectively. Image size was 768 × 768 pixels, and pixel size was 388 µm (2 × 2 binning of the original 194 µm pixels). The distance between the isocentre of the room and the X-ray tube (ISO) was 1690 mm, and the source-image receptor distance (SID) was 2390 mm. This allowed acquisition of an image measuring approximately 210 × 210 mm.
X-ray images used in this study were acquired after the patient setup verification process using 2D-3D image registration software [11]. This software coregistered patient anatomical structures on X-ray images to those on the reference DRR images which were generated from the planning CT.
Training data
Three types of DRR images were generated for the training input data (upper panel in Fig. 1). All DRR images were generated using our in-house software [12], which was programmed in CUDA (Compute Unified Device Architecture, ver. 10.1, NVIDIA Corporation, Santa Clara, CA, USA) with Microsoft Visual Studio 2013 (Microsoft Corp., Redmond, WA, USA) in a Windows 10 environment, using a graphics processing unit (GPU) on an NVIDIA board (GeForce GTX 1080, NVIDIA Corporation, Santa Clara, CA, USA).
Pre-processing
The planning CT was shifted so that the centre of the tumour coincided with the isocentre of the radiation field. Because the treatment couch positions on the planning CT and X-ray images were not always identical for the same point in the patient, image processing was applied to remove the treatment couch from the planning CT data before generating the DRR images. In addition, some CT data could contain missing areas when converted to DRR images because the number of slices was insufficient. The first and last slices were therefore extended, based on the CT central slice, to ensure a sufficient number of CT slices.
All-density DRR image
An all-density DRR image was generated by projecting the planning CT data (converting HU numbers to X-ray attenuation coefficients) along the virtual X-ray source-to-FPD beam path.
$$q = \sum_{k=1}^{n} \Delta L \cdot \mu_{k} \qquad (1)$$
where q is the ray-sum value at a given pixel position on the DRR image, μ<sub>k</sub> is the X-ray attenuation coefficient at the k-th sample point along the ray, n is the number of sample points, and ΔL is the calculation grid size (= 1 mm in this study).
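The ray-sum projection in Eq. (1) can be sketched as a simple accumulation of attenuation coefficients sampled along one virtual source-to-FPD ray. This is a minimal illustration, not the in-house GPU implementation: the function name, nearest-neighbour sampling, and voxel-coordinate conventions are assumptions.

```python
import numpy as np

def ray_sum(mu_volume, start, direction, n_steps, grid_size=1.0):
    """Accumulate attenuation along one virtual ray (Eq. 1): q = sum(dL * mu_k).

    mu_volume : 3D array of attenuation coefficients (converted from HU).
    start     : ray entry point in voxel coordinates (z, y, x).
    direction : unit vector of the ray in voxel coordinates.
    n_steps   : number of sample points n along the ray.
    grid_size : calculation grid size dL in mm (1 mm in this study).
    """
    q = 0.0
    pos = np.asarray(start, dtype=float)
    step = np.asarray(direction, dtype=float) * grid_size
    for _ in range(n_steps):
        # Nearest-neighbour sampling of the volume (an assumption; real DRR
        # generators typically use trilinear interpolation).
        idx = np.round(pos).astype(int)
        if all(0 <= idx[d] < mu_volume.shape[d] for d in range(3)):
            q += grid_size * mu_volume[tuple(idx)]
        pos += step
    return q
```

For a uniform volume with μ = 0.02 mm⁻¹ traversed over 100 steps of 1 mm, the ray sum is 100 × 1 mm × 0.02 mm⁻¹ = 2.0.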
The projection angles, pixel size, and image size were ± 35°, 388 × 388 µm, and 768 × 768 pixels, respectively. To reflect the pixel-value variation in X-ray images caused by variations in the X-ray imaging dose, we generated DRR images at three different image qualities by changing the weighting of the HU numbers. For each quality, three additional DRR images were generated by randomly shifting the position ± 5 mm and ± 2° along the respective axes from the reference position. A total of 24 DRR images (= 2 directions × 3 image qualities × 4 positions) were therefore generated from one CT volume dataset.
Bone DRR image
Patient position was registered as closely as possible to the reference position before treatment beam irradiation. Higher-contrast structures such as bones affect the extracted feature map more than lower-density structures. Because misalignment of the bony structures could decrease DNN segmentation accuracy, bone DRR images were generated from regions of ≥ 100 HU in the CT data and positioned randomly within ± 5 mm along the respective axes from the position of the all-density DRR image, which had already been randomly shifted ± 5 mm and ± 2°.
Gas DRR image
Bowel gas collections on the planning CT were segmented using a threshold of − 300 HU. Gas DRR images were then generated to match the locations of the all-density DRR images that had already been randomly shifted.
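The two HU thresholds above, ≥ 100 HU for bone and < − 300 HU for bowel gas (gas lies near − 1000 HU, soft tissue near 0 HU), amount to simple binary masks over the CT volume. A minimal sketch, with an illustrative helper name:

```python
import numpy as np

def threshold_masks(ct_hu):
    """Binary masks used before DRR projection: bone as >= 100 HU,
    bowel gas as < -300 HU. ct_hu is a CT volume in Hounsfield units."""
    bone_mask = ct_hu >= 100
    gas_mask = ct_hu < -300
    return bone_mask, gas_mask
```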
Post-processing
All datasets were resized to 256 × 256 pixels, and the pixel values were normalised to the range 0–1. We randomly selected one all-density DRR image from the three image-quality groups and applied gamma correction with a correction factor drawn randomly from 0.3–1.7. Finally, we added uniform random noise in the range 0–0.01, and the pixel values were normalised to the range 0–1 again.
Network architecture
Our DNN was a modified 2D convolutional autoencoder with skip connections (U-Net). It output gas segmentation data from the input all-density DRR and bone DRR images (Fig. 2a). The DNN consisted of two ‘encoder blocks’ and one ‘decoder block’, which sequentially shared the features of the two images (FuseUNet) [13].
The two encoder sections performed feature extraction on the input all-density DRR image and the input bone DRR image; this approach has often been used in image segmentation models [14]. Each encoder comprised three repetitions of a convolutional layer with a kernel size of 3 × 3 pixels (Conv), a batch normalization layer (BN) [15], a rectified linear unit layer (ReLU) [16] and a max-pooling layer (Pooling), followed by two further Convs. The feature map of the bone DRR was merged with the feature map of the all-density DRR in a ‘Fuse Block’, which doubles the number of channels by concatenating the feature maps (Fig. 2b). Each stream was then reduced to half its spatial dimensionality by the Pooling layer. Most methods double the number of channels after the Pooling layer, but we kept the number of channels unchanged because the Fuse Block already doubled it. The encoder channel transitions were (64, 128, 256, 512, 1024) for the all-density DRR stream and (64, 128, 256, 512) for the bone DRR stream.
In the decoder section, a Conv was used to match the number of channels of the encoder feature map combined in each skip connection, and an Upsampling layer doubled the spatial dimensionality. The outermost skip connection was, however, removed to prevent features that had not been fully extracted from directly affecting the output. The Fuse Block features from the encoder were then combined to share the spatial location information of the bowel gas, and the Conv, BN, and ReLU were repeated as in the encoder section. The final layer was a Conv with a kernel size of 1 × 1 followed by a sigmoid activation function.
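The Fuse Block's channel doubling by concatenation and the subsequent 2 × 2 pooling can be illustrated at the level of feature-map shapes. This is a shape-only NumPy sketch under our reading of Fig. 2b, not the trained TensorFlow network; the function names are assumptions.

```python
import numpy as np

def fuse_block(feat_all, feat_bone):
    """Fuse Block: concatenate the two streams' (H, W, C) feature maps
    along the channel axis, doubling the number of channels."""
    return np.concatenate([feat_all, feat_bone], axis=-1)

def max_pool_2x2(feat):
    """2 x 2 max pooling: halve each spatial dimension of an (H, W, C) map."""
    h, w, c = feat.shape
    return feat.reshape(h // 2, 2, w // 2, 2, c).max(axis=(1, 3))
```

For example, fusing two 64-channel maps yields a 128-channel map, and pooling then halves each spatial dimension while leaving the channel count unchanged, consistent with the stated channel transitions.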
Network training
We used 6688 image sets (all-density DRR, bone DRR, and gas DRR images) for 209 cases. The DNN parameters were optimized to predict gas DRR images from the all-density and bone DRR images. Adam was used as the optimisation method with a learning rate of 0.0001. The batch size was set to 16 and the number of epochs to 2000.
We used the Focal Tversky loss, which addresses the class-imbalance problem [17]. The Tversky Index allows flexible adjustment of the balance between false positives (FPs) and false negatives (FNs) in the Dice score.
$$TI_{c}=\frac{\sum_{i=1}^{N} p_{ic}\, g_{ic}+\epsilon}{\sum_{i=1}^{N} p_{ic}\, g_{ic}+\alpha \sum_{i=1}^{N} p_{i\bar{c}}\, g_{ic}+\beta \sum_{i=1}^{N} p_{ic}\, g_{i\bar{c}}+\epsilon} \qquad (2)$$
$$FTL=\sum_{c}\left(1-TI_{c}\right)^{\gamma} \qquad (3)$$
where \(p_{ic}\) is the predicted probability that the i th pixel belongs to the bowel gas class c and \(p_{i\overline{c}}\) the probability that it does not, and g is defined in the same way for the ground truth. N is the total number of pixels in the image, and ε is a small constant that prevents division by zero. The Focal Tversky loss is the Tversky Index with the focusing parameter γ applied.
The parameters α and β adjust the balance between FPs and FNs, and γ adjusts the balance with the background area. In this study, α = 0.7, β = 0.3, and γ = 0.75.
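Equations (2) and (3) for a single foreground class, with the parameter values above, can be written directly in NumPy. This is an illustrative re-implementation, not the authors' TensorFlow training code; the function name is an assumption.

```python
import numpy as np

def focal_tversky_loss(pred, truth, alpha=0.7, beta=0.3, gamma=0.75, eps=1e-7):
    """Focal Tversky loss for one foreground class (Eqs. 2-3).

    pred  : predicted bowel-gas probabilities in [0, 1].
    truth : binary ground-truth mask (1 = gas).
    Following Eq. (2), alpha weights the false-negative term
    (p_not-c * g_c) and beta the false-positive term (p_c * g_not-c);
    gamma focuses training on hard, low-TI examples.
    """
    p, g = pred.ravel(), truth.ravel()
    tp = np.sum(p * g)                # soft true positives
    fn = np.sum((1.0 - p) * g)        # missed gas pixels
    fp = np.sum(p * (1.0 - g))        # gas predicted on background
    ti = (tp + eps) / (tp + alpha * fn + beta * fp + eps)
    return (1.0 - ti) ** gamma
```

A perfect prediction gives TI = 1 and a loss of 0; a completely wrong prediction gives TI ≈ 0 and a loss close to 1.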
We used the deep learning framework TensorFlow 2.6 with an NVIDIA RTX A5000 GPU (24 GB VRAM) on Ubuntu 18.04 LTS.
Evaluations
In clinical treatment, our DNN predicts bowel gas collections from X-ray images acquired in the treatment room. However, the low quality of real X-ray images can cause manual delineation errors, making it difficult to evaluate DNN detection accuracy quantitatively. In a similar situation, other researchers [18] used DRR images rather than real X-ray images for evaluation, but the qualities of their training and test data were identical and did not reflect an actual clinical situation.
We used synthetic X-ray images converted from DRR images using a pre-trained DNN previously developed by our group (unpublished data) (lower panel in Fig. 1). The synthetic X-ray images provided accurate ground-truth gas segmentation data without any delineation error, allowing our DNN segmentation accuracy to be evaluated quantitatively. A total of 102 image sets (synthetic X-ray, bone DRR, and gas DRR images) were obtained for 51 cases, all of which differed from the training data.
Moreover, we evaluated DNN segmentation accuracy on real X-ray images, even though the ground-truth gas collections might include delineation error. To minimize this error, one person delineated the bowel gas collections on the X-ray images, and a certified medical physicist with over 20 years of clinical experience checked them carefully and modified them manually where required. A total of 102 X-ray images were obtained for the 51 cases.
Bowel gas segmentation accuracy was evaluated using recall, precision, Intersection over Union (IoU) and Dice coefficient.
$$IoU=\frac{TP}{TP+FN+FP} \qquad (4)$$
$$Recall=\frac{TP}{TP+FN} \qquad (5)$$
$$Precision=\frac{TP}{TP+FP} \qquad (6)$$
$$Dice=\frac{2TP}{2TP+FP+FN} \qquad (7)$$
where TP, TN, FP and FN are true positive, true negative, false positive, and false negative, respectively.
Since the DNN outputs a bowel gas probability map with a range of 0–1, a probability threshold must be determined to select bowel gas collections. We calculated Dice coefficients as a function of the probability threshold over the range 0–1 in steps of 0.005 using the training data. The Dice coefficient was used because it is an evaluation index that considers the balance between FPs and FNs [19, 20].
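The threshold selection described above, sweeping probability thresholds in 0.005 steps and keeping the one that maximises the Dice coefficient (Eq. 7), can be sketched as follows. The function names are illustrative; the sweep would be run over the training data.

```python
import numpy as np

def dice(pred_mask, truth_mask):
    """Dice coefficient 2TP / (2TP + FP + FN) for binary masks (Eq. 7)."""
    tp = np.sum(pred_mask & truth_mask)
    fp = np.sum(pred_mask & ~truth_mask)
    fn = np.sum(~pred_mask & truth_mask)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 1.0

def best_threshold(prob_map, truth_mask, step=0.005):
    """Sweep probability thresholds over 0-1 in `step` increments and return
    the threshold that maximises the Dice coefficient."""
    thresholds = np.arange(0.0, 1.0 + step, step)
    scores = [dice(prob_map >= t, truth_mask) for t in thresholds]
    return thresholds[int(np.argmax(scores))]
```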