Experiments
The evaluation of this work was divided into 2 sections. Section 1, the accuracy of DL based auto-segmentation was assessed using objective indicators, including geometric and dosimetric metrics. Section 2, subjective assessment of auto-segmented contours was evaluated by two experienced radiation oncologists. This study was approved by the Ethical Committee of Women’s Hospital, School of Medicine, Zhejiang University, and all methods were performed in accordance with the relevant guidelines and regulation.
Clinical datasets
60 patients of postoperative cervical cancer collected between August 2021 and June 2022 were included in this study, and informed consent was obtained from all participants. The enrolled patients were diagnosed with 2018 International Federation of Gynecology and Obstetrics (FIGO) stage IB1- IIIC1, treated with EBRT (45Gy-50.4Gy, 1.8Gy/fraction) and IGBT (12Gy-30Gy, 6Gy/fraction) as a boost treatment. The average age ± standard deviation of these patients was 48.20±14.52 years old. For each patient, the oral contrast (diatrizoate meglumine) was required for small intestine preparations before each BT implantation session. Meanwhile, the bladder was filled with 150cc of normal saline in the CT scanning room, and the CT images were reconstructed with 512×512 matrix size and 3 mm slice thickness using a Philips Brilliance Big Bore CT scanner system (Philips Healthcare,Best, the Netherlands).
Definitions for volume-based targets (HRCTVs) were established by the Groupe Européen de Curiethérapie-European Society for Radiotherapy and Oncology (GEC-ESTRO)[18], the boost dose was prescribed to the vaginal surface with multi-needles to the upper vagina. Relevant OARs included for IGBT plans were bladder, rectum, sigmoid and small intestine. All of the manual contours were reviewed and approved by senior radiation oncologists specialized in cervical cancer to generate the standard delineation.
Deep Learning for Segmentation
We introduced a robust deep learning model based on CNN to delineate the CTVs and OARs for cervical cancer patients with EBRT, which has been proven to provide high performance in automatic segmentation of planning structures[19]. This model is an end-to-end segmentation architecture that can predict pixel class labels in CT images. The inputs to the DL based model were the 2D CT images, and the outputs were the corresponding labels of HRCTV and OARs. The training and testing were performed using a graphics card equipped with IntelCore i7 processor. The final performance of DL based model was assessed using ten-fold cross-validation, in which each fold consisted of randomly selected 40 subjects for training, 10 for validation, and 10 for testing.
Objective indicators
The geometric and dosimetric metrics were used for quantitative analysis. Segmentation was assessed by Dice Similarity Coefficient (DSC), 95% Hausdorff Distance (HD) and Jaccard Coefficient (JC) for geometric calculations. The definitions are as follows:
DSC and JC describe the relative overlap between segmentation A and B. HD is used to quantify the 3D distance between two segmentation surfaces. The 95%HD is the distance that indicates the largest surface-to-surface separation among the closest 95% of surface points. For the complete overlap, the value of HD is 0, and the values of DSC and JC are 1. For the incomplete overlap, the value of HD is large, and the values of DSC and JC are close to 0.
Dose-volume index (DVI) were used to compare the dosimetric metrics. Original BT plans were designed and optimized based on the standard manual contours by using Oncentra Treatment Planning System (Nucletron, Elekta AB, Stockholm, Sweden, V.4.3), and the auto-segmentation structures were transmitted to original BT plans for dosimetric evaluation. For HRCTV, we mainly focused on Dmean and D90%, where Dmean and D90% are defined as the average dose and minimal dose to 90% of the HRCTV, respectively. For OARs, we mainly focused on D0.1cc, D2cc and D5cc, where Dxcc represents the minimum dose received by xcm3 of an OAR. A prescription dose of ≥60GyEQD2 (45-50.4Gy as EBRT) was considered for the target, equivalent dose of 2Gy with an α-to-β ratio of 10, and maximum dose (D2cc) of 80GyEQD2, 65GyEQD2, 70GyEQD2 and 70GyEQD2 to the bladder, rectum, sigmoid and small intestine, respectively, assuming α-to-β ratio of 3. Table 1 is presented the constraints and dosimetric metrics.
Table 1. The constraints and dosimetric metrics for BT planning structures.
Structures
|
Constraints
|
Dosimetric metrics
|
HRCTV
|
D90%≥60Gy(EQD210)[20]
|
Dmean, D90%
|
Bladder
|
D2cc<80Gy(EQD23)[21]
|
D0.1cc, D2cc and D5cc
|
Rectum
|
D2cc<65Gy(EQD23)[21]
|
D0.1cc, D2cc and D5cc
|
Sigmoid
|
D2cc<70Gy(EQD23)[22]
|
D0.1cc, D2cc
|
Small intestine
|
D2cc<70Gy(EQD23)[22]
|
D0.1cc, D2cc
|
Subjective assessment
For qualitative analysis, two experienced radiation oncologists would blindly evaluate the results of manual and automatic delineations on 10 tested patients, and were required to score as needing no edits, minor edits, or major edits. If the auto-segmentation scored as no edits, the contour was considered as suitable for employ in the clinic.
Statistical analysis
Bland-Altman test was calculated for the test of agreement between manual and DL based methods, P>0.05 means agreement of two segmented methods. Wilcoxon’s paired nonparametric signed-rank test was performed to compare the dosimetric differences, P<0.05 indicates the difference is statistically significant. The correlations between geometric metrics and dosimetric difference were evaluated with Spearman’s correlation analysis. All the statistical analyses were performed using IBM SPSS Statistics software (version 19.0, IBM Inc., Armonk, NY, USA) and Python software (version 3.6.5,Anaconda Inc.).