Detection and classification of unilateral cleft alveolus with and without cleft palate on panoramic radiographs using a deep learning system

doi:10.21203/rs.3.rs-173173/v1

Download PDF

Research Article

Detection and classification of unilateral cleft alveolus with and without cleft palate on panoramic radiographs using a deep learning system

https://doi.org/10.21203/rs.3.rs-173173/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 06 Aug, 2021

Read the published version in Scientific Reports →

You are reading this latest preprint version

Although panoramic radiography has a role in the examination of patients with cleft alveolus (CA), its appearances is sometimes difficult to interpret. The aims of the present study were to develop a computer-aided diagnosis system for diagnosing the CA status on panoramic radiographs using a deep learning object detection technique with and without normal data in the learning process, to verify its performance, and to clarify some characteristic appearances probably related to the performance. The panoramic radiographs of 383 CA patients with cleft palate (CA with CP group) or without cleft palate (CA only group) and 210 patients without CA (normal group) were used to create 2 learning models on the DetectNet. The models 1 and 2 were developed based on the data with and without normal subjects, respectively, to detect the CAs and classify them into the CA only and CA with CP groups. The model 2 reduced the false positive rate (1/30) compared to the model 1 (12/30). The model 2 performances were higher in almost values than those in the model 1, but no difference in the recall of CA with CP groups. The model created in the present study appeared to have the potential to detect and classify CAs on panoramic radiographs.

Health Economics & Outcomes Research

Dentistry

Physiology

cleft alveolus (CA)

diagnosing the CA status

panoramic radiographs

Cleft lip and palate (CLP) is one of the most common types of congenital maxillofacial lesions^1.2. The frequency of babies born with CLP is approximately 1 in 500 in Japan³. Although various treatment protocols have been applied, soft tissue parts such as the lip and mucous membrane covering the hard plate are surgically treated within a relatively short period from birth⁴. Hard palatoplasty with bony transplant is frequently applied in patients with cleft alveolus (CA) to stabilize the maxillary segments, provide bony support for the teeth adjacent to the cleft, provide additional support to the lip and nose, and induce the canine eruption^5.6. This surgery is usually performed at age 8–10 years, when the maxilla has grown sufficiently for the surgery. Therefore, patients need regular follow-up consisting of physical and imaging examinations for a long period. In this regard, panoramic radiography plays an important role in the repeated evaluation of cleft status because of its lower cost and radiation exposure levels than other modalities, such as computed tomography (CT) and cone-beam CT for dental use^7.8. Focusing on the bony structures, patients with CA are divided into two types: CA with and without cleft palate (CP) ⁹.

Cleft status including the classification can be easily identified by physical examination. However, for radiologists who interpret many radiographs and make many reports of their findings as routine work in a hospital radiology department, it is difficult to physically examine the patients. Therefore, radiologists have to estimate the patients’ status only based on radiographs. In this regard, panoramic radiography plays an essential role. However, it is sometimes difficult to interpret the images, especially for inexperienced radiologists, because of the overlap of the cervical spine and the narrow panoramic image layer, resulting in misdiagnoses as other radiolucent diseases and conditions, such as cysts, tumors, and fossa between the alveolar yokes of the central and lateral incisors. Moreover, even for experienced radiologists, it may be difficult to distinguish patients with CA with CP from those without CP. Therefore, if a computer-aided detection/diagnosis (CAD) system for the diagnosis of patients with CLP on panoramic radiographs can be established, it could greatly support such diagnosis, especially for inexperienced radiologists.

Deep learning (DL) algorithms using convolutional neural networks (CNNs) can detect and classify the features of certain objects, and they have been applied to CAD systems for diagnosis based on panoramic radiographs^10.11. However, there have been no previous reports on such systems for patients with CLP. Some characteristic appearance features other than radiolucent areas directly indicating the CAs, on panoramic radiographs, such as the difference in the inferior line level of the piriform aperture and some findings of the maxillary lateral incisors of the affected side have been reported for patients with CA^{12.13.14.15.16}, and they may be related to DL performance. Although a new technique to clarify the areas on which DL models focus has been introduced^17.18, the findings contributing to DL performance are generally as unclear as a ‘black box’. Therefore, the above appearance features were analyzed in relation to the 3 groups (CA only, CA with CP and normal groups) in the present study.

The aims of the present study were to develop a CAD system for diagnosis of CLP on panoramic radiographs using a deep learning object detection technique with and without normal data in the learning process, to verify the DL model’s performance, and to clarify some characteristic appearance features probably related to the performance.

Informed consent was obtained from all patients for being included in the study. This study was approved by our university’s ethics review board (Nos. 496) and was performed in accordance with the tenets of the Declaration of Helsinki.

Patients

The panoramic radiographs of 383 patients (169 female and 214 male) with unilateral CA that were acquired between August 2004 and July 2020 were selected from our hospital image database retrospectively. All patients were verified as having unilateral CA by medical records and CT or cone-beam CT examinations. The mean age of both male and female patients was 9.3 years. Of the 383 patients, 174 had solely CA and were assigned as the CA only group, whereas 209 had CA with CP and were designated as the CA with CP group. The CA only group was differentiated from the CA with CP group by referring to the patients’ medical records and CT images. Cases in which the cleft was limited to the anterior area of the incisive foramen on the most inferior axial CT slice, where the foramen was visible, were assigned to the CA only group. Cases in which the cleft extended posteriorly beyond the incisive foramen were assigned to the CA with CP group. Patients who had received surgical interventions for bony structures around the cleft before the first panoramic examination were excluded. In most patients, panoramic examinations were performed several times before bony transplant surgery. The panoramic images taken just before the transplant were selected for the present study. As controls, 210 panoramic radiographs matching the mean age and sex distributions of patients were selected from the same database during the same period. These patients, who were assigned as the normal group in the present study, were examined for other purposes, such as the evaluation of unerupted permanent teeth and pre-examination for orthodontic treatment.

The panoramic radiographs were exposed using an AUTO III NTR unit (Asahi Roentgen Industry, Kyoto, Japan), with a tube voltage of 75 kV, tube current of 12mA and exposure time of 12s or a Veraview Epocs unit (J. Morita Mfg. Corp., Kyoto, Japan), with the tube voltage of 75 kV, tube current of 8 mA and exposure time of 16.2s.

DL architecture

The DL system was created on Ubuntu Linux operating system version 16.04.2. The workstation had a GeForce 1080Ti GPU with 11GB of memory (NVIDIA, Santa Clara, CA). The deep learning process was performed using a customized DetectNet built in the Digits version 5.0 (NVIDIA, Santa Clara, CA; https://developer.ndivia.com/digits) training system. The Adam (adaptive moment estimation) solver was used for the training process with 0.0001 as the base learning rate. DetectNet has five main parts: data ingestion and augmentation, a fully convolutional network, loss function measurement, bounding box clustering, and mean average precision calculation¹⁹.

Development of learning models

Two models (models 1 and 2) were created. The panoramic radiographs were downloaded from the database in JPEG format, and all images were cropped to 900×900 pixels. In each group (including the CA only, CA with CP, and normal groups), 30 images were randomly assigned to test dataset, and the remaining images, which included the training (approximately 80% of the remaining data) and validation datasets, were used to create the learning models (Table 1).

In model 1, only two groups—CA only and CA with CP—comprised the training and validation sets (i.e., the normal group was not included). Rectangular regions of interest (ROIs) were set on the training and validation images to encompass the area of the CA according to the following methods. The superior margin was set at the level of the inferior line of the piriform aperture on the contralateral healthy side, and the inferior margin was set at the alveolar ridge. The medial end was set at the alveolar ridge between the central incisors, and the distal end was set at the most distal portion of the piriform aperture. The coordinates of the upper left (x1, y1) and lower right (x2, y2) corners of the ROIs were labeled using ImageJ (National Institute of Health, Bethesda, MD, USA) (Figure 1a) and converted to text form (Figure 1b). The CA only group and the CA with CP group were assigned as class 1 and class 2, respectively. Model 2 included the normal group’s data in addition to those of the patient groups. Only the labels were created for the classifications as class 0 (not the coordinates). In both models, 1000 epochs of the training process was performed. Inference was then applied to the test data, including all three groups, using the created learning models. When the model detected a CA, the detected area was shown as a bounding box. Red and blue boxes were shown for the CA only group and the CA with CP group, respectively.

Analysis of image appearance features

To determine the characteristic image appearance features that could influence the DL models’ performance, the image datasets used for the training process (i.e., the training and validation datasets) were analyzed regarding two structures: the inferior line of the piriform aperture and the lateral incisor on the affected side. The former was evaluated based on its visibility and relative level to the contralateral or unaffected side. The latter was evaluated according to whether the tooth was present or absent, and regarding findings of microdontia, un-eruption and medial inclination. These evaluations were performed by two radiologists (YA and EA) with more than 20-years experiences of interpreting panoramic appearances. The final determinations were reached by consensus after discussion when the evaluations differed between the radiologists.

Statistical analysis

The differences in ratios between two groups were tested by chi-square test, with p < 0.05 established as the threshold of significant difference.

The results showed that no images had two or more bounding boxes on a panoramic image, indicating that only one box was detected per image when the model estimated a certain area as a cleft. All detected boxes sufficiently included the areas where a cleft was observed or would be in the normal group. Moreover, the boxes’ medial limit was not beyond the median line, and the distal limit was located medially to the canine. The superior and inferior limits were almost same as those of annotation area (namely, the inferior line of the piriform aperture and alveolar ridge). Therefore, we concluded that all test images could be predictively classified by both models into one of the three groups (CA only, CA with CP, or normal). Confusion matrix analyses were performed for both models (Figure 2), and the recall, precision, and F-measure values were calculated. The F-measure denotes the harmonic mean of recall and precision. Regardless of cleft status (CA only vs. CA with CP group), 53 (88.3%) and 51 (85.0%) of the 60 subjects who truly had a cleft could be detected by models 1 and 2, respectively. No difference was found between the models (p = 0.7883). Model 1 incorrectly assigned 12 (40%) of the 30 normal subjects as having a cleft, whereas only 1 normal subject was incorrectly assigned by model 2. No difference was found between models 1 and 2 in terms of the recall in the CA only group (p = 0.7041) or the CA with CP group (p = 0.7866), but the recall of the normal group was significantly higher in model 2 (p = 0.0017). Model 2 showed higher values of all three indices. The overall accuracy was higher in model 2 (82.2%) than model 1 (71.1%), but no significant difference (p = 0.0780) was found. The recall of the CA with CP group was poor (0.667) even in model 2.

The findings of the inferior line of the piriform aperture were divided into three types (Table 2, Figure 3 a-c).First, in 184 images, the lines could be observed clearly on the both right and left sides at an equivalent level and were assigned as “clear and equivalent level”. Second, 149 lines on the affected side were clearly visible but located at an inferior level relative to those on the unaffected side. This finding was defined as “clear and inferior level”. Third, 170 images were assigned as “obscure or invisible”, as they showed an obscure or invisible line on the affected side or on at least one side. The distributions of these three findings were significantly different among the three groups (p < 0.001). In the CA only group, most of the lines showed the finding of “clear and inferior level,” whereas the findings of “obscure or invisible” and “clear and equivalent level” were predominantly observed in the CA with CP and normal groups, respectively.

The findings of the lateral incisor on the affected side are summarized in Table 3. The distributions of all four findings were significantly different among the three groups (p < 0.001). The observation rates of all findings were low in the normal group relative to those in the CA only and CA with CP groups.

Examples of model-predicted results are shown in Figures 4–7. Figure 4 shows a result from the CA only group with a correctly detected and correctly classified bounding box. The result in Figure 5 was correctly detected as having CA but falsely classified into the CA with CP group. Figure 6 shows a falsely detected area in a normal group subject. In Figure 7, the cleft could not be detected.

The detection function of DL systems on panoramic radiographs has been studied for various lesions and conditions^20-27. Regarding vertical root fracture²⁰and dental implants²¹, detection models have been created with annotations of only fractured teeth or implant sites. Normal teeth without fractures or replacement by implants were not included in the training, validation, or test datasets, probably because many normal teeth were included in each panoramic image. Several authors reported relatively high detection sensitivity (recall in the present study) for maxillary and mandibular cyst-like lesions^22.23.24.25. Ariji et al.²² and Watanabe et al.²³ created and tested their DetectNet models without annotation of normal subjects’ data. Hyunwoo et al.²⁶ and Odeuk et al.²⁷ built models using the You Only Look Once system without normal subjects’ data and tested them with datasets including normal subjects without cyst-like lesions. In all of those studies, the training and validation processes were performed without normal subjects. Therefore, we created a model without normal subjects in addition to the model developed with normal subjects’ data, whereas both models’ test datasets included normal subjects. Model 1’s results showed that 12 (40%) of the 30 normal subjects in the test set were falsely assigned as having a cleft, whereas only one of those subjects was assigned to the CA only group by model 2 (which included the normal subjects in the learning process). Although the cause of this discrepancy between cyst-like lesions and CAs could not be completely elucidated, the characteristics of the lesions’ appearances might contribute to the difference. Cyst-like lesions showed definite borders on radiographs, but cleft areas were frequently represented as ill-defined radiolucency because of the superimposition of surrounding structures, such as unerupted permanent teeth.

The present results showed relatively high detection sensitivity (recall in the present study) compared with the corresponding values of other studies on panoramic radiographs^22.23, regardless of the cleft status for which the values were determined (CA only or CA with CP groups). However, even model 2 had a low recall value of 0.667 for the CA with CP group. This problem should be solved in future investigation and could be accomplished by increasing the size of the training dataset.

Although the most important feature for CA detection might be radiolucent area at the maxillary incisor region, the frequently reported characteristic appearance other than radiolucent area, such as findings of the lateral incisor on the affected side and of the inferior line of the piriform aperture, were analyzed to clarify potential features related to the created models’ performance. Regarding the inferior line of the piriform aperture, Hansen et al.¹² reported that the line was positioned 2.9 mm lower on the cleft side than on the noncleft side, whereas no difference was noted between the right and left sides in the normal group (mean difference: <1 mm). Therefore, based on the visibility and the line level, we analyzed the appearance with >2 mm difference defined as abnormal. Consequently, the appearance was divided into three types (clear and inferior level, obscure or invisible groups, and clear and equivalent level), and the ratios were 75%, 76%, and 90% for the CA only, CA with CP, and normal groups, respectively. This result suggests that the findings of the inferior line of the piriform aperture might be related to both classification performance and detection performance. Regarding the lateral incisor, microdontia were observed in 48% of patients with CA, but not in any patients in the normal group. Therefore, the presence of microdontia might be related to the model’s detection performance of CA. Similarly, unerupted lateral incisors were observed in 44% of patients with CA but only 2.7% of patients in the normal group, suggesting that this factor is also related to the models’ performance.

The present study had several limitations. First, patients with bilateral CA were excluded because it was difficult to clearly visualize the inferior line of the piriform aperture in such patients’ panoramic images. This would result in failure of the determination of the superior limit of the ROIs for the learning process. Future study should be conducted to address these patients. In this regard, it might be effective to test the model created in the present study by using the panoramic radiographs of patients with bilateral CAs. Second, the numbers of training and test data were so small that the results cannot be generalized although it was difficult to estimate an appropriate sample size²⁸. Future research should be planned with larger datasets obtained from multiple hospitals through the use of different panoramic machines. Third, other CNNs and functions, such as semantic segmentation, should be used to improve the model’s performance. Fourth, there were no comparisons to the performance of human observers. This is fundamentally important because a DL model should aim to assist the performance of human observers as a CAD system.

The model developed in the present study appears to have the potential to detect CAs and classify them into CA only and CA with CP groups on panoramic radiographs. Additionally, some performance-related differences between three experimental groups (CA only, Ca with CP and normal groups) were clarified.

Acknowledgments

We thank Richard Lipkin, PhD, from Edanz Group (https://en-author-services.edanz.com/ac) for editing a draft of this manuscript.

Additional Information

Competing interests: The authors declare no competing interests.

Sato Y, et al. Population Attributable Fractions of Modifiable Risk Factors for Nonsyndromic Orofacial Clefts: A Prospective Cohort Study From the Japan Environment and Children’s Study (e-pub ahead of print). J Epidemiol. doi: 10.2188/jea.JE20190347. accessed Dec 22, 2020.

Ono S, Ishimaru M, Matsui H, Fushimi K, Yasunaga H. Effect of Hospital Volume on Outcomes of Surgery for Cleft Lip and Palate. J Oral Maxillofac Surg. 2015;73:2219-24

Omiya T, Ito M, Yamazaki Y. Disclosure of congenital cleft lip and palate to Japanese patients: reported patient experiences and relationship to self-esteem. BMC Res Notes. 2014;7:924.

Raghavan U, Vijayadev V, Rao D, Ullas G. Postoperative Management of Cleft Lip and Palate Surgery. Facial Plast Surg. 2018;34:605-611

Murthy A, Lehman J. Secondary alveolar bone grafting: An outcome analysis. Can J Plast Surg. 2006;14:172-4

Kim K, Kim S, Baek S. Change in Grafted Secondary Alveolar Bone in Patients with UCLP and UCLA. Angle Orthod. 2008;78:631-40.

Pinheiro F, Drummond R, Frota C, Bartzela T, Santos P. Comparison of early and conventional autogenous secondary alveolar bone graft in children with cleft lip and palate: A systematic review. Orthod Craniofac Res. 2020 Nov;23:385-397.

Jacobs R, et al. Pediatric cleft palate patients show a 3- to 5-fold increase in cumulative radiation exposure from dental radiology compared with an age- and gender-matched population: a retrospective cohort study. Clin Oral Investig. 2018;22:1783-1793.

Allori A, Mulliken J, Meara J, Shusterman S, Marcus J. Classification of Cleft Lip/Palate: Then and Now. Cleft Palate Craniofac J. 2017 Mar;54:175-188.

Nakamoto T, Taguchi A, Ohtsuka M, Suei Y, et al. A computer-aided diagnosis system to screen for osteoporosis using dental panoramic radiographs. Dentomaxillofac Radiol. 2008 ;37:274-81.

Nakamoto T, Taguchi A, Verdonschot R, Kakimoto N. Improvement of region of interest extraction and scanning method of computer-aided diagnosis system for osteoporosis using panoramic radiographs. Oral Radiol. 2019 ;35:143-151

Hansen K, Mehdinia M. Isolated soft tissue cleft lip: the influence on the nasal cavity and supernumerary laterals. Cleft Palate Craniofac J. 2002 May;39(3):322-6.

Menezes C, et al. Nonsyndromic cleft lip and/or palate: A multicenter study of the dental anomalies involved. J Clin Exp Dent. 2018;10:e746-e750.

Pegelow M, Alqadi N, Karsten A. The prevalence of various dental characteristics in the primary and mixed dentition in patients born with non-syndromic unilateral cleft lip with or without cleft palate. Eur J Orthod. 2012;34:561-70.

Ribeiro L, Neves L, Costa B, Gomide M. Dental anomalies of the permanent lateral incisors and prevalence of hypodontia outside the cleft area in complete unilateral cleft lip and palate. Cleft Palate Craniofac J. 2003;40:172-5.

Tan E, Kuek M, Wong H, Ong S, Yow M. Secondary Dentition Characteristics in Children With Nonsyndromic Unilateral Cleft Lip and Palate: A Retrospective Study. Cleft Palate Craniofac J. 2018 ;55:582-589.

Selvaraju R, et al. Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization. Int J Comput Vis. 2020;128, 336–359.

Muramatsu C, et al. Tooth detection and classification on panoramic radiographs for automatic dental chart filing: improved classification by multi-sized input data. Oral Radiol. 2021;37(1):13-19.

Tao A, Barker J, Sarathy S. Detect Net: deep neural network for object detection in DIGITS. 2016. Available at: https://devblogs. nvidia.com/parallelforall/detectnet-deep-neural-network-objectdetection- digits/. Accessed Decenber 22, 2020.

Fukuda M, et al. Evaluation of an artificial intelligence system for detecting vertical root fracture on panoramic radiography. Oral Radiol. 2020;36:337-343.

Takahashi T, Nozaki K, Gonda T, Mameno T, Wada M, Ikebe K. Identification of dental implants using deep learning-pilot study. Int J Implant Dent. 2020 Sep 22;6:53.

Ariji Y, et al. Automatic detection and classification of radiolucent lesions in the mandible on panoramic radiographs using a deep learning object detection technique. Oral Surg Oral Med Oral Pathol Oral Radiol. 2019;128:424-430.

Watanabe H, et al. Deep learning object detection of maxillary cyst-like lesions on panoramic radiographs: preliminary study (e-pub ahead of print). Oral Radiol . doi: 10.1007/s11282-020-00485-4. accessed Dec 22, 2020.

Poedjiastoeti W, Suebnukarn S. Application of Convolutional Neural Network in the Diagnosis of Jaw Tumors. Healthc Inform Res. 2018;24:236-241.

Lee J, Kim D, Jeong S. Diagnosis of cystic lesions using panoramic and cone beam computed tomographic images based on deep learning neural network. Oral Dis. 2020;26:152-158.

Yang H, et al. Deep Learning for Automated Detection of Cyst and Tumors of the Jaw in Panoramic Radiographs. J Clin Med. 2020;9:1839.

Kwon O, et al. Automatic diagnosis for cysts and tumors of both jaws on panoramic radiographs using a deep convolution neural network. Dentomaxillofac Radiol. 2020;49:20200185

Balki I, et al. Sample-Size Determination Methodologies for Machine Learning in Medical Imaging Research: A Systematic Review. Can Assoc Rdiol J. 2019;70(4):344-353

Due to technical limitations, the tables are only available as a download in the supplemental files section.

No competing interests reported.

tables.xlsx

Download PDF

Journal Publication

published 06 Aug, 2021

Read the published version in Scientific Reports →

Editorial decision: Major revision
21 Jun, 2021
Reviews received at journal
12 May, 2021
Reviews received at journal
10 May, 2021
Reviewers agreed at journal
10 May, 2021
Reviewers agreed at journal
06 May, 2021
Reviewers invited by journal
06 May, 2021
Submission checks completed at journal
30 Apr, 2021
First submitted to journal
23 Apr, 2021

You are reading this latest preprint version

Detection and classification of unilateral cleft alveolus with and without cleft palate on panoramic radiographs using a deep learning system

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Material And Method

Results

Discussion

Conclusions

Declarations

References

Tables

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1