Convolutional neural network for detecting rib fractures on chest radiographs: A feasibility study

doi:10.21203/rs.3.rs-1995864/v1

Background: The application of artificial intelligence for the detection of rib fractures on chest radiographs is limited by image quality control and multi-lesion screening. We aimed to create a model for multiple rib fracture detection using a convolutional neural network (CNN) based on quality-normalised chest radiographs.

Methods: A total of 1,080 radiographs with rib fractures were obtained and randomly divided into training (918 graphs, 85%) and testing (162 graphs, 15%) sets. An object detection CNN, you only look once (YOLO) v3, was adopted to build the detection model. Receiver operating characteristic (ROC) and free-response ROC (FROC) were used to evaluate model performance. A joint testing group of 162 radiographs with rib fractures and 233 radiographs without rib fractures was used as the internal testing set. Furthermore, additional 201 radiographs, 121 with rib fractures and 80 without rib fractures, were independently validated to compare the CNN model performance with the diagnostic efficiency of radiologists.

Results: The sensitivity of the model in the training and testing sets was 92.0% and 91.1%, respectively, and the precision was 68.0% and 81.6%, respectively. FROC in the testing set showed that the sensitivity for whole-lesion detection reached 91.3% when the false-positive of each case was 0.56. In the joint testing group, the case-level accuracy, sensitivity, specificity, and area under the curve were 85.1%, 93.2%, 79.4%, and 0.92, respectively. In the independent validation set, at the fracture level, the sensitivity of the CNN model (87.3%) was higher than that of the senior (80.3%) and junior radiologists (73.4%), while the precision (80.3%) was slightly lower than that of the latter two (82.4% and 81.7%, respectively). At the case level, the accuracy and sensitivity of the CNN model (91.5% and 96.7%, respectively) were both higher than those of the junior radiologist (85.1% and 77.7%, respectively) and close to those of the senior radiologist (94.0% and 96.7%, respectively).

Conclusions: The CNN model based on YOLOv3 is sensitive for detecting rib fractures on chest radiographs and shows great potential in the preliminary screening of rib fractures.

rib fracture

convolutional neural network

YOLO

detection model

radiograph

Thoracic trauma is a common injury in the emergency department, accounting for approximately 10–15% of all trauma cases [1]. Globally, the mortality rate ranges from 20–25% [2]. Traumatic rib fracture, caused by a tremendous impact force on the chest wall, is the most common form of blunt thoracic injury, accounting for approximately 35% of all cases of thoracic traumas [3]. Rib fractures are associated with significant morbidity and mortality, both of which increase as the number of fractured ribs increases [4, 5]. Hence, rib fracture is an important indicator of trauma severity. Compared with other injuries, rib fractures when accurately detected can result in a higher treatment rate, avoid complications, and help solve medical-legal disputes, such as traffic accidents and physical fighting [6].

Chest radiography is a critical tool for detecting thoracic trauma and is usually the initial imaging modality for rib fracture screening [7–9]. The American College of Radiology criteria for the evaluation of rib fractures recommend chest radiography with the use of a posteroanterior view at four variant evaluations for suspected rib fractures in non-high-energy blunt trauma [8]. Chest radiography is also a complementary examination for high-energy blunt trauma [10]. However, it has been shown that the overall incidence of rib fractures is probably higher than that previously recognised [11]. A previous investigation reported that up to 50% of rib fractures may be missed on plain radiographs, which may lead to potential risks to patients [12]. The detection of rib fractures on chest radiographs depends mostly on the reader’s experience, quality of the displayed images, and/or clinical scenario of chest radiograph scanning. Rib fracture detection is a time consuming and demanding task for radiologists. Thus, a fast, easily available, and highly accurate method for rib fracture screening, which could be adopted to relieve radiologists and develop a cost-effective tool for clinical application, is urgently needed.

Artificial intelligence (AI) is widely used in the medical field, particularly in radiology. The deep learning algorithm of AI demonstrates good diagnostic accuracy and can be used to improve the quality and speed of image interpretation and increase the efficiency of physicians [13–15]. Convolutional neural network (CNN) is an essential branch of deep learning. The multiple processing layers of CNN are more sensitive to image features and can enhance recognition accuracy [16], which are commonly used AI techniques in medical imaging among radiology researchers [17, 18]. Yamashita et al. divided the application of CNN into classification; segmentation; detection; and others [16], such as lung nodule classification [19], liver segmentation [20], and breast cancer detection [21]. CNN also demonstrates high feasibility and potential for fracture detection. Studies on lateral wrist fractures, proximal humerus fractures, and orthopaedic trauma have shown promising results [22–24]. However, no study has verified the performance of the CNN model for the detection of rib fractures using radiography.

Thus, this study aimed to create a model for multiple rib fracture detection using a CNN based on quality-normalised chest radiographs. Towards this goal, we developed a CNN model for rib fracture detection using chest radiographs. First, radiographs from four hospitals were collected. Second, image quality normalisation using the multi-scale image contrast amplification (MUSICA) algorithm, which has been proven in image enhancement but seldom applied in radiographs [25], was performed. A CNN model was then constructed using the algorithm of you only look once (YOLO) v3. Finally, the detection ability of the CNN model was compared with that of junior and senior radiologists in an image reading experiment using an independent radiograph set.

Study design and patients

This retrospective study used only anonymised data. All chest radiographs were obtained from four local hospitals between 9 July 2017 and 25 June 2019. The scan manufacturer included SIEMENS Ysio AXIOM Aristos FX, GE Definium 6000 digital radiography (DR), Philips Digital Diagnost VS, Carestream CARESTR DR, Neusoft N600 DR, and TOSHIBA D50S DR. All images were stored in the Digital Imaging and Communications in Medicine format and reviewed by radiologists using InferScholar (https://research.infervision.com/, Beijing, CHINA). The graph size varied from 1,576×1,960 to 3,072×3,072. A total of 3,890 chest radiographs (one patient, one image) from patients aged 18–70 years were analysed for preliminary screening. Three radiologists with more than 15 years of radiological experience independently interpreted the images with relevant clinical information (e.g. palpation results and clinical history). Radiographs with the indication of no rib fracture, postoperative internal fixation of the rib, poor quality breathing, and surface foreign bodies that affected the diagnosis were excluded. Inconsistencies were resolved through discussion. In total, 1080 radiographs with rib fractures were obtained. The work diagram is illustrated in Fig. 1.

To collect data for constructing the CNN-based rib fracture detection model, the radiologists marked the fractures on the graphs. One radiologist marked the fracture sites on 1,080 radiographs with the following signs: (1) complete rip disruption with a lucent line, (2) disruption of the inner or outer cortex, (3) fracture rib end displacement, and (4) rib deformity with callus formation. As shown in Fig. 2, all rib fractures are marked by blue boxes. To reduce the mark error, another radiologist confirmed all markers. To evaluate the detection ability of the CNN model, additional 201 independent chest radiographs, including 121 radiographs with 402 rib fractures and 80 radiographs without rib fractures confirmed by the same three radiologists with more than 15 years of radiological experience, were collected as a validation set. One junior radiologist with 5 years’ experience and one senior radiologist with 10 years’ experience were also recruited for the rib fracture reading experiment.

Data processing

Image quality improvement using MUSICA

Owing to the diversity of data sources, the use of MUSICA is inevitable before CNN model training to reduce data heterogeneity from the four different hospitals with different imaging qualities. MUSICA [26] involves the following three steps (Fig. 3): (1) Gaussian pyramid decomposition of the image, (2) enhancement of the high-frequency (detailed) part of the image, and (3) image reconstruction.

Architecture of CNN

A CNN is formed by stacking the input, convolution, pooling, fully connected, and output layers. The input layer is the first layer of a CNN, and the input to a CNN consists of raw images, which are vectors in two or three dimensions. The convolutional layer, which is the core of a CNN, generally consists of a set of learnable filters or kernels with small perceptual fields. Each convolutional kernel has parameters such as the kernel size, padding, and stride. The inner product operation is performed sequentially from the top-left corner of the image to extract the high-level features of the image. The pooling layers do not change the depth of the network; however, they can downsize the matrices and reduce the number of nodes in the last fully connected layer to reduce the risk of overfitting. After several rounds of convolution and pooling layer processing, one or two fully connected layers are at the end of the neural network to generate the results. For classification tasks, a higher number of layers represents amplified input aspects, which are essential for discrimination and suppressing irrelevant variations (Fig. 4).

Establishment of training and testing sets

The 1,080 images after MUSICA were randomised into the training and testing sets with 918 and 162 graphs, respectively. An additional 233 radiographs without rib fractures were also added to the testing set as the joint testing group to evaluate model generalisability. In previous studies, radiographs were manually cropped into a square or to centre the objective [23]. The current study incorporated two additional steps for segmenting the radiographs and amplifying the data. First, the input image was downscaled from 2,458×2,881 to 1,229×1,440. Second, a sliding window was used to generate sub-graphs with a window size of 512×512 in steps of 256. Each image was divided into approximately 20 subgraphs (Fig. 5a). Once the marked area was cut, the smaller area was filtered out, and training data were generated after filtering (Fig. 5a). Regarding the training set, 19,974 sub-graphs were generated by the sliding window and sent to a deep learning network for training and testing.

Network training

YOLO v3 (https://pjreddie.com/darknet/yolo/) is a classic CNN algorithm with excellent network structure (Fig. 5b). This model has several inherent advantages: fast evaluation, multiscale predictions, and a better backbone classifier. First, Darknet-53 is trained as the backbone for object detection. Darknet-53 (Fig. 5c) consists of 53 convolutional layers and one fully connected layer. A number of consecutive 1×1 and 3×3 convolutions were added, and the first 52 layers were used to extract image features. The k-means algorithm was used to count the size of the fracture marker in the labelled sample. To better detect the location of the fracture, each cell is responsible for predicting four anchors. One of these cells was selected as the prediction result, which used a total of 12 anchors: (54, 58), (61, 76), (65, 59), (69, 99), (74, 71), (80,60), (85, 84), (94, 116), (104, 69), (111, 91), (122, 209), and (139, 123). Each box was classified using a logistic regression analysis to determine whether the fracture area was included. After 50 iterations of network training, the losses of the training and testing sets were no longer reduced, indicating that the network converged to a stable state, as shown by the loss curve (Fig. 5d).

Statistical analysis

The chi-square test was used to compare the performance of the CNN model with that of the senior radiologist and junior radiologist. To assess model performance, a conventional receiver operating characteristic (ROC) analysis was performed to examine model sensitivity and false-positive results. Conventional ROC analysis was also used to examine the model’s ability in the joint testing set (162 radiographs with rib fractures and 233 radiographs without rib fractures). The multi-lesion detection rate was assigned to the model using the free-response ROC (FROC) in the testing set. FROC defines the lateral axis as the overall average of false detections and the vertical axis as the true positive. Accuracy, area under the curve (AUC), sensitivity/specificity, and 95% confidence intervals (CI) were determined. All statistical analyses were performed using Python script (https://www.python.org/).

MUSICA pre-processing performance

After MUSICA, the contrast uniformity between the bone and lung tissues was significantly improved (Fig. 3). Although the raw images behaved differently with considerable differentiation of contrast and detail, the two processed images appeared to be similar in image quality and contrast.

Deep learning YOLOv3 network performance

The training set included 918 patients with 2647 fractures, and the CNN model detected 3,580 fractures, of which 2,435 were detected correctly, 212 were missed, and 1,145 were false. The test set included 162 patients with 437 fractures. The model detected 488 fractures, of which 398 were detected correctly, 39 were missed, and 90 were mistakenly detected. In the training set, the sensitivity (fractures detected correctly/marked fractures) was 92.0%, and the precision (fracture detected correctly/fracture detected) was 68.0%. In the test set, the sensitivity was 91.1% (Table 1). In the testing set, the multi-lesion detection rate was also verified with FROC; when the false-positive rate was set as 0.56, the sensitivity of the whole lesion detection reached 91.3% (Fig. 6b).

Table 1

Sensitivity and precision of the CNN model in the training and testing sets
Data	Marked fractures	Detected fractures	Correct detected fractures	Sensitivity	Precision
Training set	2647	3580	2435	92.0%	68.0%
Testing set	437	488	398	91.1%	81.6%
Note: Sensitivity = fractures detected correctly / fractures marked; Precision = fractures detected correctly / fractures detected

Radiographs without rib fractures were added to the test set to evaluate the ability of the model to detect rib fractures. Finally, 395 radiographs with 162 fractures and 233 without fractures were included in the study. The CNN model detected 199 radiographs with fractures and 196 radiographs without fractures. The accuracy was up to 85.1%, and the sensitivity and specificity were 93.2% and 79.4%, respectively (Table 2). ROC analysis showed that the AUC reached 0.92 (95% CI: 0.86–0.96) (Fig. 6a).

Table 2

Detection rate of the CNN model in the testing set based on case level
CNN model	Chest radiographs		Total
CNN model	With rib fractures	Without rib fractures	Total
Detected fractures	151	48	199
Undetected fractures	11	185	196
Total	162	233	395
Note: Sensitivity = TP / (TP + FN) ×100%=151/162×100%=93.2%, Specificity = TN/(TN + FP) ×100%=185/233×100%=79.4%, Positive predictive value (PPV) = TP/(TP + FP) ×100%=151/199×100%=75.9%, Negative predictive value (NPV) = TN/(TN + FN) ×100%=185/196×100%=94.4%, Accuracy = (TP + FN)/(TP + FN + TN + FN) ×100% =(151 + 185)/395 ×100%=85.1%

Reading experiment

Regarding the experimental results at the fracture level, the CNN model detected 97 radiographs with 437 fractures, of which 351 were detected correctly, 51 were missed, and 86 were false. The senior radiologist recognised 125 radiographs with 392 fractures, of which 323 were correctly detected, 79 were missed, and 69 were false. The junior radiologist identified 130 radiographs with 361 fractures, of which 295 were correct, 107 were missed, and 66 were false. The sensitivity and precision of the detection by the CNN model, senior radiologist, and junior radiologist were 87.3% and 80.3%, 80.3% and 82.4%, and 73.4% and 81.7%, respectively. The sensitivity of detection was significantly higher in the CNN model than among the junior radiologist (P = 0.01), indicating that the CNN model had better detection ability. Meanwhile, there was no significant difference between the senior and junior radiologists or between the CNN and senior radiologist (P > 0.05) (Table 3).

Table 3

Comparison of sensitivity and precision in the independent testing group based on fracture level
Data	Marked fractures	Detected fractures	Correct detected fractures	Sensitivity	Precision
CNN model	402	437	351	87.3%	80.32%
Senior radiologist	402	392	323	80.3%	82.40%
Junior radiologist	402	361	295	73.4%	81.72%
P₁	NA	NA	NA	0.15	0.57
P₂	NA	NA	NA	0.13	0.43
P₃	NA	NA	NA	0.01	0.43
Note: Sensitivity = fractures detected correctly/fractures marked; Precision = fractures detected correctly/fractures detected. P₁ = P value for senior vs. junior radiologists. P₂ = P-value for CNN vs. senior radiologist. P₃ = P-value for CNN vs. junior radiologist. Comparisons are performed using the chi-squared test.
NA = not available

For the model’s detection ability at the case level, the CNN model detected 130 radiographs with fractures and 71 without fractures. The senior radiologist identified 125 fractures and 76 without fractures. The junior radiologist identified 97 fractures and 104 without fractures. The accuracy and sensitivity of the identification by the CNN model, senior radiologist, and junior radiologist were 91.5% and 96.7%, 94.0% and 96.7%, and 85.1% and 77.7%, respectively (Table 4).

Table 4. Detection rate of marked fractures in the independent testing set at the case level

a. CNN model

CNN model	Chest radiographs		Total
CNN model	With rib fractures	Without rib fractures	Total
Detected fractures	117	13	130
Undetected fractures	4	67	71
Total	121	80	201

b. Senior radiologist

Senior radiologist	Chest radiographs		Total
Senior radiologist	With rib fractures	Without rib fractures	Total
Detected fractures	117	8	125
Undetected fractures	4	72	76
Total	121	80	201

c. Junior radiologist

Junior radiologist	Chest radiographs		Total
Junior radiologist	With rib fractures	Without rib fractures	Total
Detected fractures	94	3	97
Undetected fractures	27	77	104
Total	121	80	201

d. Sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy in the independent testing set based at the case level

Model	Sensitivity	Specificity	PPV	NPV	Accuracy
CNN model	96.7%	83.8%	90.0%	94.4%	91.5%
Senior radiologist	96.7%	90.0%	93.6%	94.7%	94.0%
Junior radiologist	77.7%	96.3%	96.9%	74.0%	85.1%

Note: Sensitivity= TP / (TP +FN) ×100%, Specificity=TN/(TN + FP) ×100%, Positive predictive value (PPV)=TP/(TP + FP) ×100%, Negative predictive value (NPV)=TN/(TN + FN) ×100%, Accuracy = (TP + FN)/(TP + FN + TN + FN) ×100%

This study created a powerful CNN model for the detection of rib fractures using chest radiographs. First, the quality of the input image was standardised. The CNN model was then trained to detect rib fractures with all lesions found, and it showed promising results with high sensitivity and accuracy. Finally, a standardised model for rib fracture detection was developed, and it outperformed the detection ability of both senior and junior radiologists.

Deep learning has advanced significantly with new algorithms and optimised network structures, and these greatly contributed to the current study. Kim et al. used X-ray-based AI to detect carpal fractures and the model showed sensitivity, specificity, and AUC values of 90%, 88%, and 0.954, respectively [22, 23]. Chung et al. used a deep learning model to detect proximal humeral fractures, and the sensitivity, specificity, and AUC were 99%, 97%, and 0.97, respectively [23]. AI showed promising results in fracture detection in the above two studies as well as in this study. There could be three reasons for these results. The first is quality normalisation, as discussed in the preceding paragraph. The second is the application of the innovative network YOLOv3, which combines YOLOv2, Darknet-19, and other new residual networks. Compared with ResNet-152 and ResNet-101, YOLOv3 has better training speed and accuracy [27], further expanding its use. The third reason is that K-means was used to count the fracture marker box in the labelled sample.

A series of subgraphs was trained to locate multiple foci and were free of the hand-engineered region, which is rarely used in rib fracture detection. By examining three different scale feature maps, the number and specific locations of rib fractures could be better detected. Signal features detected by handcrafted analysis were challenged by the CNN model with a sliding window [28, 29]. Comparative testing showed that the sensitivity for detecting rib fractures was significantly higher in the CNN model than that by the junior radiologist and close to that of the senior radiologist at the fracture level. In addition, although the precision is slightly lower than that of radiologists, the model can still provide radiologists with specific locations for suspicious fractures, reducing the rate of lateral missed diagnosis.

The CNN is the most commonly used AI technique for medical imaging [17, 18]. The CNN model is also becoming a popular constituent of medical diagnosis, not only with respect to efficiency, but also to precision medicine. Studies have shown that specific organ injuries are often correlated with a specific fractured rib [3, 30]. The number of displaced rib fractures could also be a strong predictor for developing pulmonary complications [31], which makes the detection of rib fractures important to prevent complications and help mitigate patient pain. This model used FROC to test the multi-lesion detection ability, and the sensitivity was 91.3% when the false-positive rate of each case was set as 0.56. In comparison, only 49% of rib fractures are traditionally detected on the physical evaluation of radiographs [32]. This result may expand the clinical value of chest radiographs and reduce the rate of recommendations for additional imaging (RAIs). Harvey et al. reported that the rate of RAIs have increased by as much as 200% since 1995 [7]. Especially in radiographic imaging of the chest, the increase is due to the low diagnostic accuracy of radiography. Some critics have implicated RAIs as a cause of the increased use of additional imaging and associated costs.

The process of image standardisation also includes some image enhancement techniques so that images from different devices can have the same image quality. Importantly, image standardisation forms the basis for the performance of the CNN model. Almost all radiology applications are highly dependent on radiographic image quality, especially when combined with AI. However, image quality standardisation has long presented a challenge and has affected the intelligent diagnostic development of radiography, ultrasonography, computed tomography (CT), and magnetic resonance imaging. Several methods have been proposed to solve this problem. As in the research by Li et al. [33], several steps were performed for standardisation, including rescaling, downsizing, and transformation. Smoothing, normalisation, and resampling were also performed in diabetic retinopathy research [34]. However, most studies have focused on noise elimination or uniform size rather than feature enhancement. In the current study, proper image enhancement to reduce the variability of different machine images was necessary, particularly because images were reviewed by different display systems.

Among various imaging modalities, chest radiography is the appropriate initial imaging modality for patients with rib fractures. Although CT may provide a more accurate diagnosis, it is usually only performed after diagnostic chest radiography [8]. Missed diagnoses of rib fractures on chest radiographs may cause legal disputes, especially in traffic accidents and physical fights. Importantly, it may lead to delayed treatment. Therefore, this study focused on chest radiography for early detection of rib fractures.

Despite the promising results, this study has few limitations. First, the fractures on the radiographs were labelled according to the physician’s comprehensive diagnosis without gold standard modalities, such as pathology. Second, only posteroanterior radiographs were obtained, and the lateral position of the rib was not considered. Third, radiography cannot consistently demonstrate fractures in the costal cartilage, which is an inherent problem that decreases the detection rate. Finally, because only 19,974 sub-graphs (918 radiographs) were included to train the model, more radiographs should be enrolled in the training data to improve model efficacy. This study is only a preliminary attempt of using a CNN model to examine rib fractures based on radiographs. The efficiency of CNN data models is expected to continue to improve with the advent of computer technology and big data.

The CNN model in the current study showed high diagnostic efficiency, indicating that CNN can improve the detection rate of rib fractures on chest radiographs, helping reduce missed diagnoses, avoiding medical accidents, and relieving radiologists’ workload. Although the detection ability requires further validation, CNN is promising for medical diagnosis.

AI artificial intelligence

AUC area under the curve

CI confidence intervals

CT computed tomography

DR digital radiography

FROC free-response ROC

MUSICA multi-scale image contrast amplification

RAIs recommendations for additional imaging

ROC receiver operating characteristic

YOLO you only look once

Ethics approval and consent to participate: This study was conducted in accordance with the principles of the Declaration of Helsinki and approved by the institutional review board of The First Affiliated Hospital of Xi’an Jiaotong University. Informed consent was obtained from all subjects and/or their legal guardians.

Consent for publication: Not applicable.

Availability of data and materials: The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.

Competing interests: The authors declare that they have no competing interests.

Funding: This work was supported by the Key Research and Development Program of Shaanxi (Grant No.2021SF-092) and Innovation Team Project of Natural Science Fund of Shaanxi Province (Grant No. 2019TD-018).

Authors’ contributions: Study concept and design: QS, JY, resources: NL, JS, YS, PC, methodology: JW, BC, JQ, analysis and interpretation of data: NL, JS, FW, YS , ML, drafting of the manuscript: QF, ZL, critical revision of the manuscript: QS, JW, statistical analysis: XL, ZL, study supervision: QS, JW. All authors read and approved the final manuscript.

Acknowledgements: We appreciate the help and support of all the participants involved in the study.

Authors’ information:¹Department of Radiology, The First Affiliated Hospital of Xi’an Jiaotong University, Xi’an, 710061, China. ²The Key Laboratory of Biomedical Information Engineering, Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an 710054, China. ³Department of Medical Imaging, No.215 Hospital of Shaanxi Nuclear Industry, Xianyang, 712000, China. ⁴School of Information Science and Technology, Northwest University, Xi’an, 710127, China. ⁵Department of Radiology, Shaanxi Provincial Tuberculosis Control Hospital, Xi’an, 710105, China. ⁶Academy for advanced interdisciplinary studies, Peking University, Beijing, 100191, China. ⁷InferVision Institute of Research, Beijing, 100025, China. ⁸GE Healthcare, Xi’an, 710076, China.

Battle C, Lovett S, Hutchings H, Evans PA. Predicting outcomes after blunt chest wall trauma: development and external validation of a new prognostic model. Crit Care. 2014;18:1–182.
Dogrul BN, Kiliccalan I, Asci ES, Peker SC. Blunt trauma related chest wall and pulmonary injuries: An overview. Chin J Traumatol. 2020;23:125–38.
Liman ST, Kuzucu A, Tastepe AI, Ulasan GN, Topcu S. Chest injury due to blunt trauma. Eur J Cardiothorac Surg. 2003;23:374–8.
Peek J, Ochen Y, Saillant N, Groenwold RHH, Leenen LPH, Uribe-Leitz T, et al. Traumatic rib fractures: a marker of severe injury. A nationwide study using the National Trauma Data Bank. Trauma Surg Acute Care Open. 2020;5:e000441.
Ziegler DW, Agarwal NN. The morbidity and mortality of rib fractures. J Trauma. 1994;37:975–79.
Chien CY, Chen YH, Han ST, Blaney GN, Huang TS, Chen KF. The number of displaced rib fractures is more predictive for complications in chest trauma patients. Scand J Trauma Resusc Emerg Med. 2017;25:1–10.
Harvey HB, Gilman MD, Wu CC, Cushing MS, Halpern EF, Zhao J, et al. Diagnostic yield of recommendations for chest CT examination prompted by outpatient chest radiographic findings. Radiology. 2015;275:262.
Henry TS, Kirsch J, Kanne JP, Chung JH, Donnelly EF, Ginsburg ME, et al. ACR Appropriateness Criteria® rib fractures. J Thorac Imaging. 2014;29:364–6.
Siela D. Chest radiograph evaluation and interpretation. AACN Adv Crit Care. 2008;19:444–73.
Chung JH, Cox CW, Mohammed T-LH, Kirsch J, Brown K, Dyer DS, et al. ACR appropriateness criteria blunt chest trauma. J Am Coll Radiol. 2014;11:345–51.
Davis S, Affatato A. Blunt chest trauma: utility of radiological evaluation and effect on treatment patterns. Am J Emerg Med. 2006;24:482–6.
Dubinsky I, Low A. Non-life-threatening blunt chest trauma: appropriate investigation and treatment. Am J Emerg Med. 1997;15:240–3.
Kahn Jr CE. From images to actions: opportunities for artificial intelligence in radiology. Radiology. 2017;285:719–20.
Kermany DS, Goldbaum M, Cai W, Valentim CCS, Liang H, Baxter SL, et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell. 2018;172:1122–31.
Haenssle HA, Fink C, Schneiderbauer R, Toberer F, Buhl T, Blum A, et al. Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists. Ann Oncol. 2018;29:1836–42.
Yamashita R, Nishio M, Do RKG, Togashi K. Convolutional neural networks: an overview and application in radiology. Insights Imaging. 2018;9:611–29.
Wernick MN, Yang Y, Brankov JG, Yourganov G, Strother SC. Machine learning in medical imaging. IEEE Signal Process Mag. 2010;27:25–38.
Kohli M, Prevedello LM, Filice RW, Geis JR. Implementing machine learning in radiology practice and research. Am J Roentgenol. 2017;208:754–60.
Liang M, Tang W, Xu DM, Jirapatnakul AC, Reeves AP, Henschke CI, et al. Low-dose CT screening for lung cancer: computer-aided detection of missed lung cancers. Radiology. 2016;281:279–88.
Lu F, Wu F, Hu P, Peng Z, Kong D. Automatic 3D liver location and segmentation via convolutional neural network and graph cut. Int J Comput Assist Radiol Surg. 2017;12:171–82.
Kooi T, Litjens G, van Ginneken B, Gubern-Mérida A, Sánchez CI, Mann R, et al. Large scale deep learning for computer aided detection of mammographic lesions. Med Image Anal. 2017;35:303–12.
Kim DH, MacKinnon T. Artificial intelligence in fracture detection: transfer learning from deep convolutional neural networks. Clin Radiol. 2018;73:439–45.
Chung SW, Han SS, Lee JW, Oh KS, Kim NR, Yoon JP, et al. Automated detection and classification of the proximal humerus fracture by using deep learning algorithm. Acta Orthop. 2018;89:468–73.
Olczak J, Fahlberg N, Maki A, Razavian AS, Jilert A, Stark A, et al. Artificial intelligence for analyzing orthopedic trauma radiographs: deep learning algorithms—are they on par with humans for diagnosing fractures? Acta Orthop. 2017;88:581–6.
Staege MS. Gene expression music algorithm-based characterization of the Ewing sarcoma stem cell signature. Stem Cells Int. 2016;2016:7674824.
Sun M, Wang Y, le Bastard C, Pan J, Ding Y. Signal subspace smoothing technique for time delay estimation using MUSIC algorithm. Sensors. 2017;17:2868.
Kim K-J, Kim P-K, Chung Y-S, Choi D-H. Performance enhancement of yolov3 by adding prediction layers with spatial pyramid pooling for vehicle detection. In: 2018 15th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). 2018. p. 1–6.
Liao C, Bilgic B, Manhard MK, Zhao B, Cao X, Zhong J, et al. 3D MR fingerprinting with accelerated stack-of-spirals and hybrid sliding-window and GRAPPA reconstruction. Neuroimage. 2017;162:13–22.
Tsui P-H, Chen CK Kuo WH, Chang KJ, Fang J, Ma HY, Chou D. Small-window parametric imaging based on information entropy for ultrasound tissue characterization. Sci Rep. 2017;7:1–17.
Ivey KM, White CE, Wallum TE, Aden JK, Cannon JW, Chung KK. Thoracic injuries in US combat casualties: a 10-year review of Operation Enduring Freedom and Iraqi Freedom. J Trauma Acute Care Surg. 2012;73:S514–9.
Talbot BS, Gange Jr CP, Chaturvedi A, Klionsky N, Hobbs SK, Chaturvedi A. Traumatic rib injury: patterns, imaging pitfalls, complications, and treatment. Radiographics. 2017;37:628–51.
Crandall J, Kent R, Patrie J, Fertile J, Martin P. Rib fracture patterns and radiologic detection–a restraint-based comparison. In: Annual proceedings/association for the advancement of automotive medicine. Association for the Advancement of Automotive Medicine. 2000. p. 235.
Li Z, Keel S, Liu C, He Y, Meng W, Scheetz J, et al. An automated grading system for detection of vision-threatening referable diabetic retinopathy on the basis of color fundus photographs. Diabetes Care. 2018;41:2509–16.
Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316:2402–10.

No competing interests reported.

Convolutional neural network for detecting rib fractures on chest radiographs: A feasibility study

Status:

Journal Publication

Version 1

Abstract

Figures

Background

Patients And Methods

Image quality improvement using MUSICA

Architecture of CNN

Establishment of training and testing sets

Network training

Statistical analysis

Results

Discussion

Limitations

Conclusions

Abbreviations

Declarations

References

Additional Declarations

Status:

Journal Publication

Version 1