Data of patients
This study collected a total of 372 patients who underwent surgical treatment in our hospital's urology department from January 2015 to October 2020 and were pathologically diagnosed as urothelial carcinoma after surgery. Among them, there were 182 cases and 190 cases of low-grade urothelial carcinoma and high-grade urothelial carcinoma, respectively.
Selection criteria
Inclusion criteria
(1) Pathological diagnosis of urothelial carcinoma after transurethral resection of bladder tumor or total cystectomy during hospitalization, complete clinical data and complete pathological report; (2) Thin-layer enhanced CT of the bladder within one month, and the bladder is well filled, with complete imaging data; (3) Bladder tumor diameter>=1cm;
Exclusion criteria
(1) Those undergoing neoadjuvant treatment with bladder perfusion or intravenous chemotherapy before surgery; (2) Insufficient bladder filling and tumor minimum diameter <1cm (including squamous carcinoma); (3) Patients with insufficiency of important organs; Patients with inflammatory bladder diseases and mental disorders; (5) Patients with other malignant tumors; A total of 372 patients were enrolled and were randomly divided into training and verification groups according to the proportion of 7:3. The patient recruitment pathway is shown in Fig.1.
Methods
Clinical data and image data collection
The clinical data includes the patient's height, age, gender, urinary PH, weight, metabolic syndrome (MS), high blood pressure (HBP), urinary tract infection, body mass index (BMI), hematuria, albuminuria, high-density lipoprotein (HDL-C), tumor location, tumor number, tumor size (maximum diameter), diabetes mellitus (DM), Triglyceride (TG), pathological grade, past history and family history, etc. The patient’s imaging data includes: thin-layer CT images of the venous phase and arterial phase within one month before the operation, and the image quality is good, and the bladder tumor is clearly imaged without artifacts.
CT scanning method
All patients underwent Siemens 64-slice spiral CT bladder-enhanced examination in our hospital. After the image is reconstructed, it is uploaded to the picture archiving and communication system (PACS) imaging system in uncompressed digital imaging and communications in medicine (DICOM) format, and all the patient's arterial phase and venous thin layer image data are downloaded from the PACS system.
Delineation of image ROI
First, import the arterial and venous thin-layer images of the lesion into the 3D-Slicer image editing software. In order to reduce the influence of changes in contrast and brightness on the texture analysis results, the arterial and venous images are uniformly calibrated and merged into an image of arteriovenous phase fusion. Two residents with more than three years of experience in urinary system imaging diagnosis applied the hidden density processing (Threshold) function and the Sphere brush function in the texture analysis 3D-Slicer software when the pathological results were unknown. The continuous-level semi-automatic ROI is delineated for bladder tumor lesions. In order to unify the shape of the ROI cut from each tumor, smoothing is performed. The specific process is shown in Fig.2. In addition, in order to delineate the entire bladder tumor as completely as possible, try to keep the delineation line about 1-2mm away from the edge of the tumor. For multiple foci, considering the heterogeneity among different individuals, only the largest foci were included in the study.
Image processing and data acquisition
The complete tumor ROI is obtained by semi-automatic delineation. In order to transform the tumor ROI into visual data for analysis, a series of processing of the tumor ROI is required, parameters are set, and the original data required for the research is obtained. Select the Radiomics function of the 3D-Slicer software to perform a series of data analysis and processing on the tumor ROI outlined in the previous stage, and select the radiomics texture features we need. The feature parameters include: First-order, Shape features (shape2D, shape3D), gray-level run-length matrix (GLRLM), gray-level dependence matrix (GLDM), gray-level size zone matrix (GLSZM), neighboring gray-tone difference matrix (NGDTM) and gray-level co-occurrence matrix (GLCM) and filtering characteristics. Set resampling and filtering criteria were as follows: resampled voxel size (1, 1, 1), LoG kernel sizes (1, 1.5, 2, 2.5), and wavelet-based features. For multiple lesions, considering the heterogeneity among different individuals, only the most significant lesions were included.
Texture feature data preprocessing
The process is mainly operated by 3D-slicer software (www.slicer.org/) to extract effective features from tumor image data. Finally, the Radiomics package is used to extract three-dimensional (3D) radiomics features, including first-order features, second-order features, high-order features and filtering features (41, 43, 46-47). Among them, the first-order features are mainly used to describe the global features related to the frequency distribution of image gray levels, such as Energy, etc; The second-order features are mainly used to describe the local distribution characteristics of image gray levels, such as Autocorrelation, Contrast, Correlation, Difference Average and Difference Entropy; High-level features describe the gray-scale regional distribution characteristics of the image, such as Coarseness and Complexity; This study extracts a total of 80 types of features from tumor ROI, including First-order features (18 types), Shape features (3D) (16 types), Shape features (2D) (14 types), GLCM (24 types), GLRLM (16 types), GLSZM (16 types), NGDTM (5 types), and GLDM (14 types), total extracted 1223 radiomics features. In order to facilitate the subsequent calculation and analysis, the characteristics are numbered and assigned, which are sequentially numbered X1, X2...X1223. The preprocessing of the characteristic data is essential. The first is to process all the null values and replace them with the median. Then the extracted texture parameters are screened by the correlation coefficient between groups (ICC>0.75). In order to facilitate subsequent calculation and display, the feature is reduced by Z-score, so that the feature value is between [-1,1] after being reduced in dimension.
Texture feature extraction and screening
Selection operator (LASSO) regression method selects effective radiomics features from the training group. Logistic regression, decision tree, support vector machine (SVM) and adaptive boosting (Adaboost) are used for model construction, and accuracy, sensitivity, specificity and area under the curve (AUC) are used as model evaluation indicators.
Selection and assignment of clinical related factors
In order to construct a predictive model of the pathological grade of bladder cancer, and make the model more perfect and efficient. We add gender, age, height, weight, HBP, urine pH, urinary tract infection, hematuria, albuminuria, HDL-C, tumor location, tumor number, tumor size (maximum diameter), DM, TG, pathological grade, MS, past history and family history and other clinical characteristics. Considering that this research requires a lot of data and complicated parameters, in order to improve the calculation efficiency and intuitive display, all the parameters participating in the research are assigned. For example, low-grade urothelial carcinoma (LGUC) is assigned a value of "0" during classification, otherwise, it is marked as "1". Refer to Table 1 for the assignment of other clinical related factors.
Table 1. Clinical factor assignment.
Clinical factor
|
Assignment“0”
|
Assignment“1”
|
Gender
|
Female
|
Male
|
Urinary tract infection
|
No
|
Yes
|
Hematuria (%)
|
No
|
Yes
|
Proteinuria
|
No
|
Yes
|
Number
|
Single
|
Multiple (>=2)
|
Metabolic syndrome (MS)
|
No
|
Yes
|
BMI
|
BMI<25
|
BMI≥25
|
Triglyceride
|
<1.7
|
≥1.7
|
High blood pressure (HBP)
|
No
|
Yes
|
Diabetes mellitus (DM)
|
No
|
Yes
|
Definition: yes=1,no=0. BMI normal value=18.5-23.9. Triglyceride normal value=0.45~1.69mmol/L. High-Density Lipoprotein(HDL) normal value=0.7~2.0mmol/L. Urine PH normal value=6.0-6.5
Predictive model construction (comparison of four algorithm models)
First, build four prediction models for bladder cancer pathology classification on the already divided training group. By comparing accuracy, sensitivity, specificity and AUC select the radiomics feature prediction model with the highest prediction efficiency. In order to verify the predictive capabilities of the four predictive models, this study uses a 10-fold cross-validation method and performs 100 iterations; at the same time, it compares with the clinical feature prediction model based on clinical feature indicators and selects a model with better predictive capability.
Statistical software and methods
The software used in this study included 3D-Slicer (4.10.2-Win-amd 64), R-Studio (1.2.1335), and related software packages. The clinical-related factors were analyzed using SPSS22.0 software (IBM), and the measured data was expressed as x ± s. T-test was used to compare the two groups of measurement data, and the counting data was compared by chi-square test. Independent risk factors were found by logistic regression. We compared the values of radiological features in the differential diagnosis of LGUC and HGUC using a single-factor analysis of variance. The LASSO regression model was analyzed using the "glmnet" software package. We used the "proc" software package to draw ROC curves. The differences in AUC values among models were tested using the Delong test. P<0.05 (two-sided) was considered to indicate significance. The effectiveness of the model is expressed by the C-index, and the model is verified by the decision curve (DCA).