This section details the dataset used and the proposed methodology, including three CNN architectures, three classifiers (SVM, RF, and KNN), and the automatically extracted features.
In this research, several publicly available datasets of COVID-19 and non-COVID-19 chest X-ray images have been used [40]. The COVID-19 images were collected from different sources, such as the Cohen dataset on GitHub (https://github.com/ieee8023/covid-chestxray-dataset), the Italian Society of Medical and Interventional Radiology (SIRM) website, Radiopaedia, and the Radiological Society of North America (RSNA); in total, 310 COVID-19 chest X-ray images were collected [38].
A collection of normal and pneumonia images was then taken from a publicly available Kaggle dataset (https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia): 310 normal chest X-ray images and 310 pneumonia images, covering both viral and bacterial pneumonia [39, 40]. All these images were added to the dataset and augmented during training; the augmentation was done to prevent the CNN from overfitting and memorizing the exact details of the training images. The augmentation includes flipping, rotating, translating, and scaling the images, which increases the total number of images and makes the dataset suitable for deep learning [40]. The augmented dataset was published online by Alqudah and Qazan and is publicly available at (https://data.mendeley.com/datasets/2fxz4px6d8). Fig. 1 A, B, and C show X-ray images of Normal, COVID-19, and Pneumonia subjects, respectively.
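The augmentation operations listed above (flipping, rotating, and translating) can be sketched as follows; this is an illustrative NumPy example, not the authors' implementation, and the shift range is a hypothetical choice.

```python
import numpy as np

def augment(image, rng):
    """Generate simple augmented variants of a 2-D X-ray image:
    horizontal flip, 90-degree rotation, and a small random translation."""
    variants = [image]
    variants.append(np.fliplr(image))                # horizontal flip
    variants.append(np.rot90(image))                 # rotation
    shift = int(rng.integers(-10, 11))               # hypothetical translation range
    variants.append(np.roll(image, shift, axis=1))   # horizontal translation
    return variants

rng = np.random.default_rng(0)
img = rng.random((128, 128))                         # stand-in for an X-ray image
augmented = augment(img, rng)
print(len(augmented))  # original + 3 variants -> 4
```

In practice a scaling step and random parameter ranges would also be applied, multiplying the effective dataset size.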
- Convolutional Neural Network (CNN)
In general, the Convolutional Neural Network (CNN) is a recent advancement of the traditional artificial neural network (ANN); it is composed of self-learning neurons with their own weights and biases. Each neuron receives pixels from the input image, computes a dot product, and optionally applies a non-linearity [41]. The input layer is a layer that enhances the image through operations such as mean subtraction and feature scaling. The feature that distinguishes CNNs from ANNs is the large number of hidden layers, which is why the approach is called deep learning. CNNs consist of convolutional layers that convolve the image with learnable filters, each producing a feature map at the output [42].
The spatial dimensions of the generated feature map are then reduced using max-pooling layers. This is performed by sliding a window over the image with a predefined stride and keeping the maximum pixel value within each window in the new image. Alternatively, the pooling layer can use the average value rather than the maximum [43]. The ReLU layer introduces non-linearity into the network; its function is f(x) = max(0, x) [10]. The last layer is a fully connected layer, which represents the output layer holding the classification scores; these scores form a single vector that is passed to the softmax classifier, which selects the class with the highest probability [44].
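The ReLU and max-pooling operations described above can be sketched minimally in NumPy (illustrative only; real frameworks implement these as layers):

```python
import numpy as np

def relu(x):
    """Element-wise ReLU: f(x) = max(0, x)."""
    return np.maximum(0, x)

def max_pool_2x2(feature_map):
    """2x2 max pooling with stride 2: keep the largest value in each window."""
    h, w = feature_map.shape
    trimmed = feature_map[:h - h % 2, :w - w % 2]
    return trimmed.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))

fm = np.array([[1., -2., 3., 0.],
               [4., 5., -6., 7.],
               [-1., 2., 0., 1.],
               [3., -4., 8., -2.]])
pooled = max_pool_2x2(relu(fm))
print(pooled)  # [[5. 7.]
               #  [3. 8.]]
```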
AOCTNet [34], proposed by Alqudah, was initially used to classify optical coherence tomography (OCT) images; in this research paper it is used to classify chest X-ray images. It starts with a first convolutional layer that uses 32 filters of size 3×3 with zero padding of one, while the remaining convolutional layers use 16 filters of the same size, except for the third, which uses only 8 filters. A batch normalization layer is introduced between each convolutional layer and the non-linearity (ReLU) layer to reduce training time, accelerate the training stage, and decrease the network's sensitivity to initialization. A max-pooling layer with a 2×2 window and a stride of 2 is used in AOCTNet [35]. The output stage consists of a fully connected layer with an output size of 2, a softmax layer, and a classification layer. Figure 2 illustrates the AOCTNet architecture graphically.
ShuffleNet [45] is a computationally efficient, lightweight CNN architecture commonly used in mobile devices and embedded systems with limited computing capacity. ShuffleNet offers better performance on ImageNet classification tasks than MobileNet. The architecture of ShuffleNet is composed essentially of stacked ShuffleNet units grouped into three stages. In each stage, the first building block is applied with a stride of 2; the number of output channels stays the same within a stage but is multiplied for the next one. ShuffleNet uses two operations to reduce computational cost while preserving accuracy [45]: group convolution and pointwise channel shuffle. The channel shuffle operation divides the channels within each group into several subgroups and then feeds each subgroup to a different group in the next layer [45]. Figure 3 illustrates the ShuffleNet units, which are designed specifically for small networks.
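The channel shuffle operation can be illustrated in NumPy: reshape the channel axis into (groups, channels per group), transpose, and flatten, so each group's channels are interleaved across the groups. This is a generic sketch of the operation from [45], not the authors' code.

```python
import numpy as np

def channel_shuffle(x, groups):
    """ShuffleNet channel shuffle for a tensor of shape (N, C, H, W)."""
    n, c, h, w = x.shape
    x = x.reshape(n, groups, c // groups, h, w)  # split channels into groups
    x = x.transpose(0, 2, 1, 3, 4)               # swap group and subgroup axes
    return x.reshape(n, c, h, w)                 # flatten back: channels interleaved

x = np.arange(6).reshape(1, 6, 1, 1)             # channels 0..5, two groups of 3
shuffled = channel_shuffle(x, groups=2)
print(shuffled.ravel())  # [0 3 1 4 2 5]
```

After the shuffle, the next grouped convolution sees channels from every group, which is what lets grouped convolutions stay cheap without isolating information inside each group.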
Generally, MobileNet is the most commonly used CNN for embedded-device applications, such as smartphone and robot vision, thanks to its lightweight architecture [46]. The 25-layer MobileNet architecture built for the current study employs four Conv2D layers, seven batch normalization layers, seven ReLU layers, three ZeroPadding2D layers, and single layers of DepthwiseConv2D, Global Average Pooling, Dropout, and Dense [47]. Like the other CNNs, MobileNet is pretrained on ImageNet [48]. Figure 4 displays the MobileNet architecture.
Due to the limited availability of large medical image datasets, training an entire convolutional neural network (CNN) from scratch on medical images to achieve a given classification accuracy is often infeasible. Therefore, random initialization of the weights is replaced by an existing CNN that has already been trained on a large dataset, usually ImageNet; this process is called transfer learning. There are two transfer-learning approaches. The first is fixed feature extraction, whereby the output of an intermediate layer is used to train a separate classifier. The second is fine-tuning, which replaces the last fully connected layer with one sized for the new classes and retrains the entire CNN; this retraining updates the weights in the deeper layers [23, 33].
In this paper, the pretrained AOCTNet, MobileNet, and ShuffleNet CNNs are modified to fit the goal of this research. For AOCTNet, the first layer, the image input layer (Image Input Layer), is modified to a size of 128 × 128 × 3, and the fully connected (FC) layer (FC Layer) is modified to 3 outputs. For MobileNet, the image input layer (input_1) is modified to a size of 128 × 128 × 3, the FC layer (Logits) is modified to 3 outputs, and the classification layer (ClassificationLayer_Logits) is modified to match the new FC layer. Finally, for ShuffleNet, the image input layer (Input_gpu_0|data_0) is modified to a size of 128 × 128 × 3, the FC layer (node_202) is modified to 3 outputs, and the classification layer (ClassificationLayer_node_203) is modified to match the new FC layer.
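The modifications above were made inside a deep-learning framework. As a framework-agnostic illustration of the idea only (keep the learned feature extractor, replace the final FC layer with one sized for 3 classes), consider this toy NumPy sketch; all layer names and shapes here are hypothetical, not the actual networks.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy stand-in for a pretrained network: a feature extractor followed by
# an FC layer with 1000 ImageNet outputs (shapes are hypothetical).
pretrained = {
    "conv_features": rng.random((64, 128)),
    "fc": rng.random((128, 1000)),
}

def adapt_for_new_task(net, n_classes, rng):
    """Transfer-learning sketch: keep the learned feature extractor and
    replace the final FC layer with a freshly initialised one sized for
    the new classes (here 3: Normal, COVID-19, Pneumonia)."""
    adapted = dict(net)
    in_features = net["fc"].shape[0]
    adapted["fc"] = rng.standard_normal((in_features, n_classes)) * 0.01
    return adapted

model = adapt_for_new_task(pretrained, n_classes=3, rng=rng)
print(model["fc"].shape)  # (128, 3)
```

Fine-tuning would then continue training all weights on the new chest X-ray data, not just the replaced layer.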
- X-ray Images Preprocessing
Since this research focuses on lightweight CNNs, any reduction in the size of the input images reduces the time required for both training and classifying new images. Accordingly, the input image layer of all architectures was modified to a size of 128 × 128 × 3. Because this size is for RGB images while chest X-rays are only 2D images, the images were concatenated along a third dimension to make them RGB.
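The grayscale-to-RGB concatenation can be done by stacking the 2D image three times along a new third axis, as in this minimal NumPy sketch (resizing to 128 × 128 is assumed to have been done beforehand):

```python
import numpy as np

def to_rgb(gray):
    """Stack a 2-D grayscale X-ray along a third axis so it matches
    the 128 x 128 x 3 input layer of the modified networks."""
    return np.stack([gray, gray, gray], axis=-1)

gray = np.zeros((128, 128), dtype=np.uint8)  # stand-in for a resized X-ray
rgb = to_rgb(gray)
print(rgb.shape)  # (128, 128, 3)
```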
Automatic feature extraction with a CNN is usually performed at the fully connected (FC) layer of the architecture. The FC layer extracts only representative features able to distinguish between the input classes, which in this research are the Normal, COVID-19, and Pneumonia cases. In this paper, the features are extracted from the FC layer of the three architectures (AOCT-Net, MobileNet, and ShuffleNet), which is a common method since the FC layer precedes the softmax classifier. Based on the selected layer, only three features are extracted per image (one per class), and these features are finely selected and representative [49, 50]. The scatter distribution of the features extracted by the three models is shown in Figure 5.
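Extracting the FC-layer activations amounts to running the forward pass and stopping just before the softmax. A toy NumPy sketch of the idea (the two-matrix "network" and all shapes are hypothetical stand-ins for the real architectures):

```python
import numpy as np

def extract_fc_features(x, weights):
    """Run a toy forward pass and return the FC-layer activations,
    which serve as the extracted feature vector (3 values per image)."""
    hidden = np.maximum(0, x @ weights["conv_features"])  # ReLU feature stage
    return hidden @ weights["fc"]                         # FC output: 3 features

rng = np.random.default_rng(2)
weights = {"conv_features": rng.random((100, 64)),
           "fc": rng.random((64, 3))}
images = rng.random((5, 100))             # 5 flattened images (hypothetical size)
features = extract_fc_features(images, weights)
print(features.shape)  # (5, 3): three features per image, as in the paper
```

These per-image feature vectors are what the SVM, RF, and KNN classifiers below are trained on in place of the softmax.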
Furthermore, these features are used to build hybrid systems that replace the softmax classifier with the Support Vector Machine (SVM), Random Forest (RF), and K-Nearest Neighbor (KNN) classifiers. All classifiers are discussed in detail in sub-sections 2.4 to 2.7.
The default classifier of a CNN is the softmax classifier, a very powerful and commonly used discriminant classifier. The softmax discriminant function (SDF) assigns a new test sample to an output class using a non-linear transformation of the distance between the test sample and the training samples. The learning rule for a softmax classifier with binary units is similar to the regular binary-unit rule; the only difference is that the softmax function is a generalization of the logistic sigmoid function that can handle classification problems with more than two possible classes [49, 50].
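A minimal sketch of the softmax function itself, which turns the FC-layer scores into the probability vector from which the highest-probability class is selected (the example scores are arbitrary):

```python
import numpy as np

def softmax(z):
    """Numerically stable softmax: exponentiate shifted scores, then normalise."""
    e = np.exp(z - np.max(z))
    return e / e.sum()

scores = np.array([2.0, 1.0, 0.1])  # hypothetical FC outputs for 3 classes
probs = softmax(scores)
print(probs.argmax())  # class with the highest probability -> 0
```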
- Support Vector Machine (SVM) Classifier
The Support Vector Machine (SVM) is one of the leading and most commonly used supervised machine-learning algorithms for classifying data into two main groups. The SVM uses the training data to create a model that distinguishes the input data and that can be used to predict the class of new data. The main objective of the SVM is to find the best hyperplane separating the entire dataset while maximizing the distance between the separating hyperplane and the nearest data points [51, 52, 53]. In this research, we used a radial basis function (RBF) kernel.
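An RBF-kernel SVM of this kind can be sketched with scikit-learn; the synthetic three-feature data here merely stands in for the extracted CNN features, and hyperparameters beyond the RBF kernel are library defaults, not the paper's settings.

```python
from sklearn.svm import SVC
from sklearn.datasets import make_classification

# Synthetic 3-class data with 3 features, mimicking the FC-layer features.
X, y = make_classification(n_samples=200, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=3, n_clusters_per_class=1,
                           random_state=0)

clf = SVC(kernel="rbf")  # RBF kernel, as used in the paper
clf.fit(X, y)
print(clf.score(X, y))   # training accuracy on the synthetic data
```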
- K-Nearest Neighbor (KNN) Classifier
The K-nearest neighbor (KNN) algorithm is a fast, lazy, widely used, non-parametric machine-learning method. In general, KNN's input vector comprises the feature space, and the target class membership is determined by majority voting over the classes of its neighbors. The majority vote is weighted by the distance between each feature point and the center of mass of the vector [54, 55]. In this research, we used a Hamming distance function with three neighbors and the exhaustive search method.
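The stated configuration (three neighbors, Hamming distance, exhaustive search) maps naturally onto scikit-learn, where brute-force search is the exhaustive method; the toy binary features below are illustrative, not the paper's data.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(100, 3)).astype(float)  # toy binary feature vectors
y = X[:, 0].astype(int)                              # label tied to first feature

# 3 neighbours, Hamming distance, brute-force (exhaustive) search.
knn = KNeighborsClassifier(n_neighbors=3, metric="hamming", algorithm="brute")
knn.fit(X, y)
print(knn.score(X, y))  # training accuracy on the toy data
```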
- Random Forest (RF) Classifier
In 2001, Breiman invented the Random Forest (RF) algorithm. It is composed of a large number of decision trees that operate as an ensemble. Each individual tree in this classifier outputs a class prediction, and the class with the most votes becomes the model's prediction. Simplicity and power are the key ideas behind random forests: in data-science terms, a collection of fairly uncorrelated models is the reason the random forest model works so well [55, 56]. In this research, we used an RF with 100 bags for bootstrapping.
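A random forest with 100 bootstrapped trees can be sketched in scikit-learn as follows; the synthetic data again stands in for the extracted features, and other hyperparameters are library defaults rather than the paper's settings.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, n_features=3, n_informative=3,
                           n_redundant=0, n_classes=3, n_clusters_per_class=1,
                           random_state=0)

# 100 trees with bootstrap sampling (bagging), as in the paper.
rf = RandomForestClassifier(n_estimators=100, bootstrap=True, random_state=0)
rf.fit(X, y)
print(rf.score(X, y))  # training accuracy on the synthetic data
```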
The confusion matrix is the most commonly used tool for evaluating the performance of an artificial intelligence algorithm; it compares the system output to reference data. From the confusion matrix, the most common metrics are derived, such as accuracy, specificity, sensitivity, and precision. To compute each of these, four statistical indices are determined: true positive (TP), false positive (FP), false negative (FN), and true negative (TN) [57]. Accuracy, sensitivity, precision, and specificity were therefore determined as follows:
$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \quad (1)$$
$$\text{Sensitivity} = \frac{TP}{TP + FN} \quad (2)$$
$$\text{Precision} = \frac{TP}{TP + FP} \quad (3)$$
$$\text{Specificity} = \frac{TN}{TN + FP} \quad (4)$$
Accuracy indicates the classifier's ability to properly distinguish between classes; sensitivity refers to its ability to correctly detect true positives; specificity measures the actual negatives that the classifier correctly identifies; and precision indicates how many of the predicted positives are actually positive [57]. In addition, the classifier's F1 score, which measures the accuracy of a test, and the Matthews Correlation Coefficient (MCC), which represents the correlation between the observed and predicted classes, were computed [58, 59]:
$$F1 = \frac{2 \times \text{Precision} \times \text{Sensitivity}}{\text{Precision} + \text{Sensitivity}} \quad (5)$$
$$MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}} \quad (6)$$
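All of the metrics above follow directly from the four confusion-matrix counts; a minimal sketch with arbitrary example counts:

```python
import math

def metrics(tp, fp, fn, tn):
    """Compute accuracy, sensitivity, precision, specificity, F1, and MCC
    from the confusion-matrix counts TP, FP, FN, TN."""
    accuracy    = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)
    precision   = tp / (tp + fp)
    specificity = tn / (tn + fp)
    f1  = 2 * precision * sensitivity / (precision + sensitivity)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return accuracy, sensitivity, precision, specificity, f1, mcc

# Example counts (illustrative only).
print(metrics(tp=90, fp=10, fn=5, tn=95))
```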