2.1 Clinical data collection and group classification
Panoramic radiographs acquired between Jan 2020 and Mar 2022 were randomly selected from the picture and communication system (PASC) of the Stomatology Hospital, School of Stomatology, Zhejiang University School of Medicine. All panoramic radiographs were captured with a Dentsply Sirona (Bensheim, Germany) dental X-ray instrument. The inclusion criteria for panoramic radiographs were as follows: 1) patients age >18 years old; 2) at least one MFP with fully developed dental roots must be present; 3) no prior root canal treatment of the MFP; 4) panoramic radiographs and CBCT scans were both available for the MFP. Exclusion criteria were as follows: 1) poor quality panoramic radiographs; 2) detectable external resorption, internal resorption, or apical foramen destruction in the MFP; 3) significant buccolingual inclination of the MFP. The study protocol was approved by the Ethics Committee of the Stomatology Hospital, School of Medicine, Zhejiang University, China, and all procedures were performed in accordance with the Helsinki Declaration.
The panoramic radiographs were screened by three independent dentists, and MFP root canal morphology was then examined in CBCT images. The MFP images were divided into single or non-single groups, the former group including MFPs with only one root canal(Figure1.A) and the latter group containing MFP images with multiple root canals and more complex root canal systems(Figure1.B-F). The single group contained 459 MFPs, while 366 MFPs were classified to the non-single group. The results of root canal on CBCT were used as the gold standard.
Prior to the observation, the intraclass correlation for the evaluation of root canal was examined. The three dentists evaluated this for 50 teeth. The resulting intraclass correlation coefficient was 1.000.
2.2 Automatic Root Canal Classification
The workflow of automatic root canal classification could be divided into following steps. In the first step, all panoramic radiographs were resized to 1440 ×2976 pixels. When the MFP had only one root canal on CBCT images, MFPs were divided into contact group, the MFPs were labeled “one” with the open-source software Labellmg. In total, 1630 MFPs were included, with 1304 for training, 163 for validation and 163 for testing.
In the second step, the U-NET deep learning model is applied for segmentation of teeth[12], which builds a U-shaped configuration from convolutional network layers with skip connection, which is widely used in biomedical image segmentation to prevent loss of information from layers. The symmetrical encoder-decoder architecture includes down-sampling blocks for extracting features and up-sampling blocks to infer the segmentation at the same resolution. The network hierarchically extracts low-level features, then recombines them into higher-level features in the encoder. The skip connection between the falling and the rising part of the U-shape, it expands the contextual information and performs element-wise classification from multiple features in the decoder. Details of the model architecture are shown in Figure 2. After U-Net segmentation of individual teeth, the MFPs can be located and the ROI is cropped for classification of canal architecture.
In the third step, two convolutional neural network (CNN) model, the ResNet-101 model and ResNet-152 model were applied in root canal classification. we also introduced the convolutional block attention module (CBAM; Figure 3) [13] to force the network to select class-specific features through a combination of channel and spatial attention strategies which we named ResNet-101(CBAM). A channel attention module was included to perform average and max pooling operations in order to extract more high-level features. To account for the influence of different MFP locations, the spatial attention module was employed to examine different places within the same channel, enhancing the representation of key features.
The three models were trained and tested with PyTorch on a GeForce RTX 2080Ti running on the Ubuntu (version 18.04) operating system with the CUDA10.1 toolkit and CUDNN8.0.5 library. A stochastic gradient descent (SGD) optimization strategy was used during model training. The initial learning rate was 0.001, epochs were 100, batch size was 10, and the learning rate optimization strategy was ReduceLROnPlateau.
2.3 Diagnostic performance analysis
To assess the performance of the proposed framework for classification, we used Precision (PR), Recall (RE), Accuracy (ACC), F1-score, and Area Under Curve (AUC) of Receiver Operating Characteristic curves, defined below:
Results:
in which TP represents true positives, TN represents true negatives, FP represents false positives, and FN represents false negatives.
Panorama radiograph results for MFPs were interpreted by a panel of three dentists with <3 years experiences and two dentists with >10 years. Metrics based on the results of MFP detection were used to estimate the performance of the deep learning network for comparison with the assessments of the dentists.
2.4 Statistical analysis
Diagnostic performance was compared using the Mann-Whitney U Test. AUC values were compared using McNemar’s chi-square analysis. Statistical analyses were performed with IBM SPSS Statistics 24.0, The level of significance was set at p < 0.05.