A. THE DATASET
This retrospective study included 41 CBCT scans with BL and 41 without bone lesions (w/o BL) performed between January 2019 and December 2020, at our oral maxillofacial imaging unit. This study was approved by the Institutional Review Board (HMO-0297-21). This study was retrospective and involved only medical record analysis, therefore, the IRB approved an exemption from informed consent.
The study group included CBCT scans with well-defined hypodense benign BL, with or without bone expansion, which were performed prior to surgical intervention. All BL were interpretated by a maxillofacial imaging expert (CN) and confirmed by histopathological results determined by microscopic examination of suspected tissue. The control group, w/o BL, consisted of CBCT scans performed prior to dental implant placement, of either the maxilla or the mandible, or both jaws.
CBCT scans with movement artifacts, were excluded from both groups since they are usually non diagnostic. However, scans with metal artefact were not excluded, since metal artefact are very common in CBCT scans and it is important to include them for training the deep learning algorithm. In addition, these artefacts do not usually appear in the same height as the lesions, and when both bone lesions and metal artifacts are visualized in the same slice, the artifact does not significantly modify the appearance of the lesions.
All CBCT scans were assigned into training (50 cases), validation (10 cases) and testing (22 cases) datasets, including 20214, 4530 and 6795 axial slices, respectively, as shown in Table 1. Due to the inclusion of axial slices which did not contain BL in the study group, the total number of axial slices w/o BL was markedly higher than the total number of slices with BL.
Table 1
The details of the CBCT datasets used for training, validation and testing
| | Training (N = 50) | Validation (N = 10) | Testing (N = 22) | Total (N = 82) |
| | BL | w/o BL | BL | w/o BL | BL | w/o BL | BL | w/o BL |
Size of dataset | Number of cases | 25 | 25 | 5 | 5 | 11 | 11 | 41 | 41 |
Number of axial slices with BL | 2754 | 0 | 396 | 0 | 1405 | 0 | 4555 | 0 |
Number of axial slices without BL | 7282 | 10178 | 1711 | 2423 | 1945 | 3445 | 10938 | 16046 |
Demographics | Female/Male | 12:13 | 14:11 | 5:0 | 4:1 | 5:6 | 8:3 | 22:19 | 26:15 |
Age (Mean ± SD, Range) | 34 ± 19 (4–67) | 61 ± 11.8 (41–79) | 46 ± 21 (12–68) | 41 ± 23.7 (19–74) | 27 ± 15 (3–51) | 50 ± 16 (26–70) | 33 ± 18.9 (3–68) | 56 ± 16 (19–79) |
Technical details | Type of machine ICAT/MORITA/CRANEX | 2:17:6 | 6:18:1 | 1:2:2 | 2:3:0 | 1:9:1 | 4:7:0 | 4:28:9 | 12:28:1 |
Field of View Small/Medium/Large | 12:9:4 | 5:10:10 | 2:2:1 | 1:2:2 | 6:3:2 | 2:5:4 | 20:14:7 | 8:17:16 |
Range of voxel size (mm) | 0.08–0.25 | 0.08–0.25 | 0.125-0.2 | 0.125-0.2 | 0.08–0.25 | 0.125–0.25 | 0.08–0.25 | 0.08–0.25 |
Rotation 3600/1800 | 19:6 | 24:1 | 4:1 | 4:1 | 9:2 | 10:1 | 32:9 | 38:3 |
Current (mA) | 6.7 ± 2.5 | 5.8 ± 1.1 | 7 ± 3.4 | 5.2 ± 0.5 | 5.5 ± 1.1 | 5.5 ± 0.6 | 6.4 ± 2.4 | 5.6 ± 0.9 |
Radiographic features of Bone Lesion | Ant/PM/Molar/Ascend | 10:6:6:3 | NR | 1:3:1:0 | NR | 1:3:6:1 | NR | 12:12:13:4 | NR |
Follicular/Non-follicular | 7:18 | NR | 1:4 | NR | 4:7 | NR | 12:29 | NR |
Adjacent to IAN(%, No.) | 52% (N = 13) | NR | 40% (N = 2) | NR | 82% (N = 9) | NR | 58.5% (N = 24) | NR |
Minimal diameter (Mean ± SD, Range) (mm) | 8.0 ± 3.6 (3–18) | NR | 7.4 ± 1.7 (6–10) | NR | 14 ± 7.4 (4–13) | NR | 7.9 ± 3.4 (3–18) | NR |
Maximal diameter (Mean ± SD, Range) (mm) | 24.0 ± 13.0 (7–63) | NR | 13.4 ± 4.7 (8–20) | NR | 25.4 ± 9.9 (12–41) | NR | 23.1 ± 12.0 (7–63) | NR |
Histopathology of the lesion | RC(8), KOT(8), DC(5), AF(1), OF(1), CAm(1), NC(1) | NR | RC(3), KOT(1), LPC(1) | NR | KOT(4), RC(3), DC(1), AF(1), AOT(1), SBC(1) | NR | RC(14), KOT(13), DC(6), AF(2), OF(1), CAm(1), NC(1), LPC(1), AOT(1), SBC(1) | NR |
Abbreviations: N = Number, BL = Bone lesion, w/o BL = without bone lesion ,IAN = Inferior Alveolar Nerve, RC = Radicular Cyst, KOT = Keratocystic Odontogenic Tumor, DC = Dentigerous Cyst, AF = Ameloblastic Fibroma, OF = Ossifying Fibroma, CAm = Cystic Ameloblastoma, NC = Nasopalatine Cyst, LPC = Lateral Periodontal Cyst, AOT = Adenomatoid Odontogenic Tumor, SBC = Simple Bone Cyst |
All CBCT slices were resized to 800x800 pixels and converted from DICOM to JPEG format with 256 gray level values. The gray level values were automatically modified according to the window attributes in the DICOM header. The training and the validation datasets were augmented by horizontal flipping of the axial slices.
Table 1 summarizes the details of the CBCT datasets regarding demographics, scan parameters and BL characteristics. Data retrieved from the patient’s file included gender, age, and histopathological diagnosis. The CBCT scans were preformed using three different CBCT devices. Most scans were obtained with the MORITA™ 3D Accuitomo 170 followed by I-CAT™ Next Generation and CRANEX®3D Dental Imaging System. The tube voltage ranged between 85–120 kV, tube current ranged between 3–13 mA, field of view (FOV) diameter ranged between 4–16 cm, height ranged between 4–14 cm and the voxel size ranged between 0.08–0.25 mm. Most of the scans used full rotation, though some cases used half rotation. The number of the axial slices per case ranged between 161–705 (Appendices A and B).
BL cases included the following ten, histologically confirmed, pathologies: Keratocystic Odontogenic Tumor (KOT), Radicular Cyst (RC), Simple Bone Cyst (SBC), Ameloblastic Fibroma (AF), Ossifying Fibroma (OF), Adenomatoid Odontogenic Cyst (AOC), Dentigerous Cyst (DC), Cystic Ameloblastoma (CAm), Nasopalatine Cyst (NC) and Lateral Periodontal Cyst (LPC). Most of the lesions were mandibular (83%), adjacent to the Inferior Alveolar Nerve (IAN) (58.5%) and equally distributed between the anterior region (Ant), the premolar region (PM), and the molar region while lesser were in the ascending ramus (Ascend). In each case, the lesion location, contour and diameter were different in every axial slice. The maximal lesion diameter, defined as the lesion diameter in the slice in which it appears the largest, ranged between 7–63 mm (mean 23.07 ± 11.99 mm). The minimal lesion diameter, defined as the lesion diameter in the slice in which it appears the smallest, ranged between 3–18 mm (mean 7.85 ± 3.36 mm).
B. BONE LESION DETECTION
a. The deep learning Mask-RCNN algorithm
In the first step of the 3D detection process, a Mask RCNN deep learning algorithm (26) was implemented to detect the BL separately in each CBCT axial slice. This algorithm extends the Faster RCNN process (27) which detects only a bounding box containing the bone lesion, by additionally segmenting the bone lesion mask within the bounding box.
In order to train the Mask-RCNN algorithm, the exact contours of the BL in the training and the validation datasets were manually drawn in a middle axial slice and both the most inferior slice and the most superior slice by an Oral Maxillofacial radiologist (RAA). Based on these marks, the contours of the BL in all other slices in between, were manually drawn by students (TB, AIP, AA) using the VGG Image Annotator (VIA) program (28). Finally, these manual contours were confirmed, and modified when necessary, by an Oral Maxillofacial radiologist (CN( having more than 10 years' experience.
Mask RCNN algorithm uses a Feature Pyramid Network (FPN) backbone for feature extraction from each slice and then applies a Region Proposal Network (RPN) to propose Regions of Interest (ROIs) which may contain BL. The proposed ROIs are then resized to yield an aligned-similar size for all ROIs in order to classify them as a normal region or a region containing a bone lesion. Then, the algorithm generates a pre-segmentation bounding box (yellow rectangle in Fig. 1) for each detected bone lesion and adds an instant segmentation process (29) for producing a mask (red marking in Fig. 1) for the lesion within each bounding box (Fig. 1).
The ResNet 101 FPN backbone was selected for feature extraction with the initial weights obtained from pre-training on the Common Objects in Context (COCO) dataset (30). The Mask RCNN algorithm was trained with a single NVIDIA Quadro P2000 GPU, using the Stochastic Gradient Descent (SGD) optimizer algorithm for minimizing the loss of the model (31) with a momentum of 0.9, starting with a learning rate of 0.001 and ending with a learning rate of 0.0001. The training process ran for 96 hours, with 60 epochs and 6,000 steps per epoch. We found that fewer epochs led to underfitting and more epochs did not lead to a noticeable improvement. The minimum confidence level for categorizing an ROI as containing a bone lesion in a CBCT axial slice was set to 85%. A higher confidence level resulted in missing many relevant ROIs and a lower confidence level caused many false marks.
b. Improving the lesions detection by gray-level filtering
For improving the performance of the Mask-RCNN algorithm, an additional algorithm was used for filtering the detected ROIs by the mean gray level value of the segmented lesion within the ROI. ROIs containing a very bright segmented objects, with a mean gray level value higher than 155, were removed assuming that they represent teeth or restorations as shown in Fig. 2A. Similarly, ROIs containing a dark segmented object, with mean gray level value less than 50, were also removed assuming they represent air as shown in Fig. 2B.
c. Improving the detection of lesions by analyzing sequential axial slices
In order to confirm that the segmented object in each detected ROI in a specific slice truly represented a bone lesion, there was a need to analyze all sequential slices of each CBCT examination. For this purpose, an additional algorithm was developed to analyze all the slices of the CBCT examination and to evaluate in sequential slices the spatial location of the suspected objects (masks). For each slice containing a suspected object, the algorithm calculated its overlapping with suspected objects in previous slices by:
$$Overlapping=\frac{A\cap B}{A}$$
(1)
Where A is the number of pixels in the suspected object and \(A\cap B\) is the number of pixels that are located identically in the suspected objects in this slice and in a previous slice, as illustrated in Fig. 3. We used several previous slices and not only a single slice, since the Mask RCNN algorithm may have missed the ROI containing the bone lesion in the neighboring slice, as shown in Fig. 4. Then, the algorithm defined subgroups of different ROIs in which the calculated overlapping between the objects was more than 50 percent. Any subgroup including at least 14 ROIs in any 20 successive slices was considered as a subgroup which contain a true bone lesion.
When the algorithm classified a CBCT case as containing a true bone lesion, it was important to identify the initial slice and the final slice containing the bone lesion. In the extreme slices the Mask RCNN may miss more ROIs since in these slices the lesion appears smaller and its borders are less defined. Therefore, the initial slice was defined as the first of at least 6 slices out of 20 successive slices, included in the same subgroup of ROIs. The final slice was defined as the last slice in the subgroup that fulfilled the same condition (6 out of 20 successive slices).
Then, the algorithm inserted a bone lesion mask in all the slices between the initial and final slices in which the bone lesion had been missed. This was performed by interpolating the shape of the detected objects in the two nearest neighboring slices since the mask of the bone lesion should be very similar in subsequent slices, as shown in Fig. 4. Finally, the algorithm removed all the suspected objects not included in the range of the initial and final slices.